We're seeking a Lead DevOps & Platform Engineer to own and evolve the CI/CD infrastructure, cloud platform operations, and automation strategy for the Merchandising & Pricing platforms. This role requires deep Azure cloud expertise, CI/CD pipeline engineering (Jenkins, Azure DevOps), Infrastructure as Code (Terraform, Ansible), and container orchestration (Docker, Kubernetes). Cloud certification is required, and GCP experience is beneficial.
Duties & Responsibilities
- Own and evolve CI/CD pipelines (Jenkins scripted pipelines with shared libraries, Azure DevOps Pipelines) for automated build, test, and deployment workflows across the Merchandising & Pricing platform ecosystem.
- Design, implement, and maintain Infrastructure as Code (IaC) using Terraform, Ansible, and ARM/Bicep templates for Azure environments.
- Manage and optimize containerized workloads using Docker and Kubernetes (AKS), ensuring high availability and scalability.
- Administer and optimize cloud infrastructure on Azure (primary), with GCP as secondary - including compute, networking, storage, identity, and security.
- Implement and maintain monitoring, alerting, and observability solutions (Prometheus, Grafana, New Relic, Azure Monitor) to ensure system health and rapid incident response.
- Automate operational tasks through scripting (Shell, Python, Groovy) to reduce manual intervention, eliminate toil, and improve reliability.
- Ensure platform security, compliance, and governance through automated policies, guardrails, and security scanning in CI/CD pipelines.
- Collaborate with application engineering teams to streamline deployment processes, reduce release cycle time, and improve developer experience.
- Drive adoption of GitOps practices (FluxCD, ArgoCD) for declarative infrastructure and application deployment.
- Lead incident response and root cause analysis for production infrastructure issues; drive post-incident improvements.
- Manage environment provisioning (dev, staging, production) and ensure consistency and parity across environments.
- Coordinate with US-based engineering leadership on platform priorities, sprint planning, and cross-team dependencies.
- Drive adoption of AI-assisted tools for infrastructure automation, anomaly detection, predictive monitoring, and operational intelligence across the platform engineering function.
Requirements
Years of Experience :
- 8+ years of DevOps, SRE, or platform engineering experience in enterprise environments.
- 4+ years working with Azure cloud services in production environments.
- 2+ years hands-on with Kubernetes orchestration (AKS preferred) in enterprise settings.
- Experience managing CI/CD pipelines at scale for distributed application teams.
Basic Qualifications
- Proficiency with CI/CD tools - Jenkins scripted pipelines with shared libraries, Azure DevOps Pipelines, and pipeline-as-code practices.
- Strong experience with Docker, Kubernetes (AKS), and container orchestration in production environments.
- Hands-on experience with Infrastructure as Code (Terraform, Ansible, ARM/Bicep templates).
- Deep Azure cloud expertise - compute, networking, storage, identity, and security.
- Cloud certification required - Azure Administrator (AZ-104), Azure DevOps Engineer (AZ-400), or equivalent.
- Strong scripting skills (Shell, Python, Groovy) for automation and tooling.
- Experience with monitoring and observability (Prometheus, Grafana, New Relic, Azure Monitor).
- Knowledge of security best practices, network architecture, and compliance in cloud environments.
- Strong knowledge of Git, branching strategies, and version control best practices.
- Demonstrated ability to work in Agile teams and deliver high-quality infrastructure solutions on time.
- AI-native SDLC mindset - proven ability to leverage AI-assisted tools for infrastructure automation, code generation, and operational efficiency.
(ref:hirist.tech)