
Search by job, company or skills
This job is no longer accepting applications
. Build, deploy and manage DevOps pipeline in AWS/ Azure /GCP using IAC (Terraform and Cloud native tooling)
. Provide day-to-day direction and innovation by enabling core platform capabilities, building cloud connectivity, infrastructure, and shared services at scale on Azure
. Manage and give guidance to the team on the development of highly scalable, flexible, and resilient cloud capabilities and patterns
. Develop long- and short-term work plans for the Cloud Engineering team to meet the goals sets in the enterprise Cloud Solution Design and Cloud Target architecture
. DevOps Pipeline integration with security scanning, security posture management and reporting tools
. Instrumenting resiliency parameters and monitoring metrics around stability, reliability and security Collaborate within the technical team to provide monitoring improvements, platform stability recommendations.
. Provides oversight for production operations to maximize reliability and automation. Create, maintaining and evolve SRE procedures and tooling
. Design and implement scalable and highly reliable solutions, build tools and services, full-stack observability, monitoring and event management integrations to monitor and advance the reliability and quality of services
. Implement AIOPS and Data driven operational tooling and dashboarding solution to improve operations decision making
. Develop and rollout CI/CD framework, Release management and Infra as a Code (IaC) solutions using Terraform/Jenkins/Bamboo/U-deploy across hybrid/multi-cloud Infrastructure.
. Facilitate the release management process and deployment management on Cloud/On-Prem Infrastructure for the application across multiple non-prod and prod environments
. Responsible for planning and executing code releases, deploying changes and container images, provision capacity and strategize rollback if needed
. Improve automation of deployment for production and pre-release environments, define monitoring requirements and implement automated incident resolution solutions
. Working closely with the Cyber security, engineering and operations team to bring policy as a code in to DevSecOps pipeline and improve cyber resiliency
. Develop re-usable BOTs and Workflows using PowerShell, Python, Ruby, Ansible, Chef, Puppet, Terraform, Git, django framework & REST APIs.
. Identify areas for process and efficiency improvement within Platform Services Operations recommend solutions and assist in overseeing implementation. Actively facilitate continuous improvement.
. Ensure all necessary operational processes and procedures are carried out with a high level of attention to detail, expediency and on-time delivery.
. Define and document standard run books and operating procedures. Create and maintain system information and architecture diagrams.
. Monitor various systems capacity and health indicators and trends provide analytics & forecasts for added or reduced capacity as required
Cloud Site Reliability Engineer(SRE)
DevOps, CI/CD
Maximise Cloud reliability and automation
Powershell, Python, Ruby
Monitor various systems capacity and health indicators
Provide Analytics and forecasts for added or reduced Capacity
Capgemini was founded by Serge Kampf in 1967 as an enterprise management and data processing company. The company was founded as the Société pour la Gestion de l'Entreprise et le Traitement de l'Information (Sogeti).In 1974 Sogeti acquired Gemini Computers Systems, a US company based in New York.In 1975, having made two major acquisitions of CAP (Centre d'Analyse et de Programmation) and Gemini Computer Systems, and following resolution of a dispute with the similarly named CAP UK over the international use of the name 'CAP', Sogeti renamed itself as CAP Gemini Sogeti.
Job ID: 71118935