Team Overview:
- This position will be part of the Group Platform Service and Engineering - Toolchain SRE team.
- The role sits within the existing Engineering and Site Reliability Engineering team.
- The team comprises Software Engineers (Developers), Site Reliability Engineers, and a thin layer of Operations, forming a true DevOps team.
- The Operations team member will work with peers in India and other global regions to maintain high service availability across applications and DevOps tools.
Key Responsibilities of the Associate Role:
- Work on incident and problem tickets, and help close incidents within SLA.
- Enhance and automate operations and support tasks using programming languages such as Java, Python, and JavaScript, among others.
- Possess good knowledge of DevOps practices and tooling.
- Have scripting experience with Linux shell or Python 3.8.
- Collaborate with cross-functional teams to ensure Jira tickets and incidents are resolved effectively and meet quality standards.
- Document SOPs and maintenance processes to facilitate knowledge transfer and ongoing maintenance.
- Provide support in rotating shifts: an early India/Japan shift (6:30 AM to 3:30 PM IST) and a UK shift (1:00 PM to 10:00 PM IST), including weekend on-call support when required.
- Troubleshoot issues that arise with the Toolchain or Unity Core Services platform and resolve them promptly.
- Possess strong communication skills and work collaboratively with others.
Skills and Qualifications:
- A bachelor's degree in computer science, information systems, or a related field, or equivalent work experience.
- 7+ years of experience working in DevOps or Operations Support.
- Solid working experience with Unix/Linux shell scripting.
- Good experience with Jira and Confluence, and familiarity with CI/CD tools such as Jenkins and GitLab and infrastructure as code (IaC) tools such as Ansible or Terraform.
- Must be familiar with, and have worked with, the YAML, JSON, and XML data formats.
- Exposure to agile development practices.
- Enthusiasm and ability to learn and apply support technologies in the DevOps space.
- Strong analytical and problem-solving skills.
- Ability to influence business and IT stakeholders for continuous improvement.
- Strong communication and interpersonal skills for cross-team collaboration, with the ability to communicate effectively with technical and non-technical audiences and development stakeholders.
Good-to-Have Skills:
- Experience working in the Banking and Financial IT domain.
- Python programming knowledge and skills.
- Experience with OpenTelemetry and monitoring tools such as Prometheus and Grafana.
- Experience supporting applications on public, private, or hybrid clouds.
- Knowledge of, or experience with, any of the following technologies:
  - AWS Cloud
  - Kubernetes