Datavail

Senior Specialist - Cloud SRE

  • Posted 4 days ago

Job Description

Job Title: Senior Specialist - Cloud SRE

Education: Bachelor's Degree

Experience: 8+ years

Location: Mumbai

As a Senior SRE Engineer (Cloud SRE Specialist), you will be responsible for ensuring the reliability, scalability, performance, and cost optimization of cloud services across AWS, Azure, and multi-cloud environments. You will act as the primary technical lead for assigned customers, manage incident escalations, drive automation-first practices, and mentor junior engineers. You will also collaborate closely with development teams to embed resilience and observability into applications.

Key Responsibilities:

Customer Leadership & Collaboration:

  • Serve as the primary technical point of contact for assigned customer accounts.
  • Provide regular updates and lead initiatives to improve customer environments.
  • Know assigned accounts well enough to make tactical decisions without escalation.
  • Collaborate with customer development teams to align infrastructure with application requirements.

Incident & Problem Management:

  • Lead incident response and postmortems, ensuring corrective and preventive measures are implemented.
  • Serve as the Tier 3 escalation point for offshore/onshore SRE teams.
  • Perform Root Cause Analysis (RCA) and validate the work quality of Tier 2 engineers.
  • Develop and maintain incident response plans for security breaches and operational incidents.

Reliability Engineering:

  • Define and maintain SLIs/SLOs, track error budgets, and monitor adherence to them.
  • Participate in architecture discussions for high availability, disaster recovery, and scalability.
  • Integrate resilience patterns such as circuit breakers, retries, and bulkheading.
  • Use chaos engineering / fault injection practices where applicable.

Automation & Infrastructure as Code:

  • Automate infrastructure and operations tasks using Terraform, CloudFormation, and AWS CDK.
  • Build and maintain CI/CD pipelines with canary deployments and blue/green strategies.
  • Implement automation workflows with AWS Lambda, Step Functions, and Azure Functions.

Monitoring & Observability:

  • Implement observability systems using Prometheus, Grafana, OpenTelemetry, ELK, and Jaeger.
  • Configure proactive monitoring and alerts using AWS CloudWatch / Azure Monitor.
  • Ensure visibility into metrics, traces, and logs for troubleshooting.

Cloud Infrastructure Management:

  • Provision and manage VMs, storage, networking, VPNs, and ExpressRoute/Peering.
  • Manage patching, backups, encryption, decryption, and image management.
  • Optimize performance and cost via rightsizing, autoscaling, and reserved instances.
  • Manage identity and access controls (AWS IAM, Azure AD, RBAC).

Security & Compliance:

  • Implement and enforce security best practices across multi-cloud environments.
  • Ensure compliance with GDPR, HIPAA, and industry regulations.
  • Conduct regular audits and compliance reporting.

Mentoring & Knowledge Sharing:

  • Coach and mentor Tier 2 and junior SREs.
  • Conduct reliability-focused design reviews.
  • Maintain up-to-date documentation, runbooks, and SOPs.

About Us

Datavail is a leading provider of data management, application development, analytics, and cloud services, with more than 1,000 professionals helping clients build and manage applications and data via a world-class tech-enabled delivery platform and software solutions across all leading technologies. For more than 17 years, Datavail has worked with thousands of companies spanning different industries and sizes, and is an AWS Advanced Tier Consulting Partner, a Microsoft Solutions Partner for Data & AI and Digital & App Innovation (Azure), an Oracle Partner, and a MySQL Partner.

About The Team

Datavail's Team of Cloud Experts Can Save You Time and Money

Our Cloud experts can overcome every obstacle in helping clients manage everything from databases, analytics, reporting, migrations, and upgrades to monitoring and overall data management.

You can free up your IT resources to focus on growing your business rather than fighting fires. Our Cloud experts can guide you through strategic initiatives or support routine database management.

Cloud Managed Services

Datavail's business focuses on helping you use your data to drive business results through cost-saving services. The success of your business depends on how well you understand and manage your data. Our managed cloud services give you the power to unleash your organization's potential. We provide comprehensive and technically advanced support for Cloud operations to ensure that your infrastructure is safe, secure, and managed with the utmost level of care.

Our delivery performance in data management leads the industry. We offer highly trained Cloud administrators via a 24/7, always-on, always-available global delivery model.

The combination of a proven delivery model and top-notch experience ensures that Datavail will remain the on-demand Cloud experts you desire. Datavail's flexible, client-focused services always add value to your organization.

Job ID: 135070181