Search by job, company or skills

Aptean India

Site Reliability Engineering Manager

10-12 Years
new job description bg glownew job description bg glownew job description bg svg
  • Posted 14 days ago
  • Be among the first 10 applicants
Early Applicant

Job Description

About the Role

We are looking for an experienced and hands-on Cloud Infrastructure & Operations Manager to lead a team of engineers responsible for managing the infrastructure layer of multi-tenant, cloud-hosted ERP products.

This role will oversee platform reliability, cloud operations, security, incident management, preventive maintenance, disaster recovery, and compliance across environments. The position also acts as a stage-gate for all production deployments, ensuring release readiness, rollback capability, and platform stability.

You will lead a team of 15 cloud and DevOps engineers and collaborate closely with Product, Security, and Engineering teams to maintain a reliable and scalable SaaS platform.

Key Responsibilities

Cloud Infrastructure Management

  • Oversee provisioning, monitoring, and scaling of cloud environments (primarily Microsoft Azure).
  • Ensure optimal platform performance, cost management, and operational stability.

SaaS Product Operations

  • Own the availability of Dev, UAT, and Production environments.
  • Plan platform upgrades, apply security patches, and manage certificates and access controls.

Incident Management

  • Lead incident response for outages and performance degradation.
  • Conduct Root Cause Analysis (RCA) and implement preventive improvements.

Preventive Operations

  • Define and execute regular health checks, patching schedules, and environment maintenance.
  • Optimize alerts and monitoring for proactive issue detection.

Disaster Recovery & Business Continuity

  • Design, maintain, and test DR and BCP strategies.
  • Ensure business continuity across all cloud-hosted environments.

Security & Compliance

  • Lead infrastructure compliance initiatives aligned with SOC 2 and ISO 27001 standards.
  • Work closely with Information Security and Audit teams to maintain compliance.

Production Deployment Governance

  • Act as the stage-gate authority for production releases.
  • Review deployment tickets and validate readiness, rollback strategies, and impact assessments.

Team Leadership

  • Lead, mentor, and develop a team of Cloud and DevOps engineers.
  • Build a culture focused on reliability, operational excellence, and continuous improvement.

Required Qualifications

  • B.E / B.Tech / MCA in Computer Science, Information Technology, or a related field.
  • 10+ years of experience in Cloud Infrastructure or SaaS Operations.
  • 3+ years of experience managing engineering teams in a cloud product environment.
  • Strong hands-on experience with Microsoft Azure:
  • Virtual Machines
  • PaaS services
  • Networking
  • Monitoring
  • Identity and Access Management
  • Experience supporting multi-tenant SaaS platforms.
  • Experience working with ERP platforms (SAP Cloud, Infor, Oracle Cloud, or custom ERP systems).
  • Understanding of DevOps practices, CI/CD pipelines, and Infrastructure as Code (IaC).
  • Familiarity with SOC 2, ISO 27001, and data privacy compliance frameworks.

Preferred Certifications

  • ITIL Certification
  • Site Reliability Engineering (SRE) certification

Technical Skills

Cloud Platforms

  • Microsoft Azure (App Services, VMs, Networking, Storage, Defender)

DevOps & Automation

  • Azure DevOps
  • GitHub Actions
  • CI/CD pipelines

Infrastructure as Code

  • Terraform
  • Bicep
  • ARM Templates

Monitoring & Observability

  • Azure Monitor
  • Application Insights
  • Log Analytics

Security & Access

  • Azure AD
  • Role-Based Access Control (RBAC)
  • Secret rotation and access governance

Disaster Recovery

  • Geo-redundancy strategies
  • DR drills
  • RTO/RPO planning

Tools

  • Azure DevOps
  • Jira
  • ServiceNow
  • Salesforce (case management)

Leadership Expectations

  • Build and mentor a high-performing cloud operations team
  • Manage on-call rotations and operational readiness
  • Drive continuous improvement in reliability, automation, and operational efficiency
  • Collaborate cross-functionally with Product, Engineering, and Security teams

Why Join Us

  • Lead cloud infrastructure for enterprise SaaS ERP platforms
  • Work with modern Azure-based cloud architectures
  • Build and scale high-reliability SaaS operations
  • Opportunity to lead and grow a strong engineering team

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 144668089