Lead Site Reliability Engineer

centre for computational technologies (cctech)

Pune, India

8-12 Years

Save

Posted 2 months ago
Be among the first 10 applicants

Early Applicant

Job Description

load_list_page(event)> Job listing
Job details

Job Information

Date Opened 04/13/2026
Industry IT Services
Job Type Full time
Work Experience 8-12 Years
City Pune City
State/Province Maharashtra
Country India
Zip/Postal Code 411057

About Us

CCTechs mission is to transform human life by the democratization of technology. We are a well established digital transformation company building the applications in the areas of CAD, CFD, Artificial Intelligence, Machine Learning, 3D Webapps, Augmented Reality, Digital Twin, and other enterprise applications.

We have two business divisions: product and consulting.

simulationHub is our flagship product and the manifestation of our vision. Currently, thousands of users use our CFD app in their upfront design process.

Our consulting division, with its partners such as Autodesk Forge, AWS and Azure, is helping the world's leading engineering organizations, many of which are Fortune 500 list of companies, in achieving digital supremacy.

Job Description

We are looking for Lead Site Reliability Engineers who combine deep reliability engineering expertise with strong ownership, communication, and systems thinking.

This is not a traditional operations-only role.

We Are Looking For Techno-business Leaders Who Can

operate and improve mission-critical systems,
drive architectural and reliability decisions,
collaborate directly with clients and stakeholders,
identify platform and business improvement opportunities,
act as trusted technical anchors for both CCTech and customer teams.

Someone who understands production deeply, thinks in systems, values automation over toil, and continuously improves reliability, scalability, and operational maturity.

This role is highly hands-on while also requiring leadership, initiative, and the ability to influence technical direction across teams.

Key Responsibilities

Own reliability, uptime, and operational health of mission-critical cloud systems
Design systems for scalability, resilience, operability, and cost efficiency

Drive SRE Best Practices Including

Observability,incident prevention,postmortems,reliability engineering and automation-first operations
Lead architecture and operational decisions balancing reliability, scalability, maintainability, and cost.
Build and evolve Infrastructure-as-Code, CI/CD pipelines, deployment workflows, and recovery automation
Design and improve monitoring, logging, alerting, and observability frameworks
Lead critical incident investigations, root cause analysis, and long-term corrective actions
Reduce operational toil through automation, reusable tooling, and engineering discipline
Collaborate directly with product teams, platform teams, and client stakeholders to align technical direction with business needs
Act as a trusted technical partner during architecture discussions, solution reviews, brainstorming sessions, and platform evolution initiatives
Proactively identify opportunities for platform improvements, operational maturity, automation, reliability optimization, and cost reduction.
Mentor engineers, raise technical standards, and contribute to building high-performing teams
Help shape SRE culture, operational maturity, and engineering practices across projects

Requirements

8–12 years of experience in SRE / DevOps / Cloud Engineering roles

Strong Hands-on Experience With

AWS production systems
Infrastructure-as-Code (Terraform / CloudFormation)
CI/CD pipelines and deployment automation
Containerized environments (Docker / Kubernetes / ECS)

Proven Experience In

designing and operating reliable distributed systems,
handling production incidents at scale,
debugging complex system failures,
improving system reliability and operational maturity,
driving automation-first engineering practices
Strong programming/scripting ability in Python (preferred), Go.
Strong understanding of distributed systems, observability, scalability, performance optimization, and cost-aware architecture.
Experience working closely with stakeholders, customers, or cross-functional teams in technical discussions and solution alignment
Ability to independently drive initiatives, influence decisions, and take ownership beyond assigned tasks
Excellent communication skills with the ability to explain technical concepts clearly to both engineering and non-engineering stakeholders

Good to Have

Experience with SLOs, SLIs, error budgets, and reliability governance
Exposure to API platforms, Identity systems (OAuth2/OIDC), or platform engineering initiatives
Experience with chaos engineering, failure testing, or resilience validation
Exposure to regulated or enterprise-scale environments
Background in backend engineering before transitioning into SRE/DevOps
Experience contributing to technical proposals, architecture reviews, or client-facing solution discussions

Benefits

High ownership role with direct impact on mission-critical systems
Opportunity to shape platform reliability, operational maturity, and engineering direction
Exposure to advanced areas such as Digital Twin, AI/ML systems, cloud-native platforms, and large-scale distributed architectures
Work closely with global engineering organizations and strategic technology partners
Opportunity to grow into a technical leadership and client-facing advisory role within CCTech.

check(event) ; career-website-detail-template-2 => apply(record.id,meta) mousedown=lyte-button => check(event) final-style=background-color:#2185D0;border-color:#2185D0;color:white; final-class=lyte-button lyteBackgroundColorBtn lyteSuccess lyte-rendered=>

More Info

Job Type:

Industry:

Function:

Employment Type:

About Company

centre for computational technologies (cctech)Job Source: www.linkedin.com

Job ID: 145951311

Jobs by Skill - IT

Jobs by Skill - Non IT

International Jobs

Last Updated: 07-07-2026 11:01:46 AM

Homejobs in PuneLead Site Reliability Engineer

Similar Jobs