Senior Reliability Engineer | Remote

Uplers

India

5-8 Years

Save

Posted 20 hours ago
Be among the first 10 applicants

Early Applicant

Job Description

Experience: 5.00 + years

Salary: USD 38000.00 / year (based on experience)

Expected Notice Period: 7 Days

Shift: (GMT+05:30) Asia/Kolkata (IST)

Opportunity Type: Remote

Placement Type: Full Time Contract for 6 Months(40 hrs a week/160 hrs a month)

(*Note: This is a requirement for one of Uplers client - A Renowned Hiring Product Company from USA)

What do you need for this opportunity

Must have skills required:

Cloud platforms (AWS, or Azure) and infrastructure-as-code, .NET Ruby on Rails Go OpenTelemetry LaunchDarkly Azure / GCP Kubernetes Terraform, Datadog, Grafana, opentelemetry, or equivalents, Prometheus, GCP

A Renowned Hiring Product Company from USA is Looking for:

India (Remote)

Contracting - 6 months

India (Remote)

5–8 years experience

This role is for engineers who find deep satisfaction in making systems genuinely

trustworthy — not building features, but raising the floor for everyone who does.

The opportunity

Own the reliability of systems that matter

Our products are only as good as the systems running them. As a Senior Reliability

Engineer, you'll have direct ownership over the health, observability, and long-term stability

of our platform — work that has a multiplying effect on every engineer and every customer

we have.

You'll cut through years of accumulated technical debt, modernize aging infrastructure, and

put in place the telemetry, guardrails, and performance improvements that make our

systems a source of confidence rather than concern. This isn't maintenance work — it's

foundational engineering with high leverage and high visibility.

What You'll Own

Drive the observability strategy — ensure application telemetry is meaningful,

actionable, and trusted by engineering teams across the org

Lead incident response improvements: build better runbooks, detection, and

post-mortems that shorten time-to-resolution and prevent recurrence

Own performance investigations end-to-end — identify bottlenecks, propose

solutions, and ship the fix

Eliminate error noise polluting telemetry, so signal is signal again and on-call isn't a

guessing game

Contribute to migrating legacy Rails and Go services to a modern .net, maintainable

stack

Partner closely with Cloud Infrastructure, Developer Experience, and Product

Engineering to make reliability a shared practice, not a silo

Apply AI-assisted tooling to accelerate reliability initiatives — from anomaly detection

to automated triage

Bring clear thinking to ambiguous situations — when something breaks in an

unexpected way, you're the person who figures out why

What We're Looking For

5–8 years of software engineering experience, with meaningful time spent on

infrastructure, platform, or reliability work

Strong hands-on experience with observability tooling — OpenTelemetry, Datadog,

Prometheus, Grafana, or equivalents

Comfort working across languages Good at .net and familiarity with Ruby on Rails

or Go is a plus

Experience diagnosing and resolving production performance issues at scale

A bias for clarity — you write clean runbooks, clear postmortems, and make your

work legible to others

Ability to work independently and make good judgment calls without a lot of

hand-holding

Strong communication skills — you can work across time zones and teams

effectively

Nice to have

Experience with cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code

Background in SRE practices — SLOs, error budgets, capacity planning

Experience with service migrations or modernization projects in production systems

Familiarity with AI/ML-assisted observability or automated remediation workflows

Tech you'll work with

.NET
Ruby on Rails
Go
OpenTelemetry
LaunchDarkly
Azure / GCP
Kubernetes
Terraform

Who Thrives Here

Engineers who get quiet satisfaction from a clean dashboard, a postmortem that actually

prevents the next incident, or a migration that finally retires a component that's caused pain

for years. You don't need someone to define the problem for you — you find it, frame it, and

drive it to resolution.

If you're energized by ownership, frustrated by flaky systems, and want your work to raise

the standard for an entire engineering organization — this role was built for you.

How to apply for this opportunity

Step 1: Click On Apply! And Register or Login on our portal.
Step 2: Complete the Screening Form & Upload updated Resume
Step 3: Increase your chances to get shortlisted & meet the client for the Interview!

About Uplers:

Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant contractual onsite opportunities and progress in their career. We will support any grievances or challenges you may face during the engagement.

(Note: There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well).

So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

More Info

Job Type:

Industry:

Function:

Employment Type:

About Company

UplersJob Source: www.linkedin.com

Job ID: 148912349

Jobs by Skill - IT

Jobs by Skill - Non IT

International Jobs

Last Updated: 10-06-2026 10:27:54 AM

Homejobs in IndiaSenior Reliability Engineer | Remote

Similar Jobs

Senior Site Reliability Engineer

Jobgether

6-8 yrs

India

Skills:

Cassandra, Prometheus, Mariadb, Bash, Grafana, Redis, Memcached, Ansible, Ceph, Puppet, Ruby, Kubernetes, Python, OpenStack Swift, Linux systems administration, Go, monitoring and observability tools, LAMP stack technologies

Senior Site Reliability Engineer

josys

5-7 yrs

Bengaluru, India

Skills:

Elk, Prometheus, Slas, Networking, Dns, Grafana, Cdn, Graylog, Python, AWS, Performance Tuning, Bash, Devops, High Availability, Gcp, Load Balancing, Azure, Kubernetes, SLIs, Go, Disaster Recovery, observability tools, Security, OpenTelemetry, Infrastructure Engineering, Site Reliability Engineering, log management tools, reliability metrics, SLOs, container orchestration, incident management frameworks

Senior Site Reliability Engineer ID61984

AgileEngine

4-6 yrs

India, Cochin / Kochi / Ernakulam

Skills:

Scripting, Java, Prometheus, Node.js, Grafana, Datadog, Kubernetes, Python, AWS

Senior Site Reliability Support Engineer

Equisoft

3-5 yrs

Hyderabad, India

Skills:

.NET, Java, Azure cloud services, Cosmos DB, Python, Azure Container Instances, Application logs, Azure SQL Database

Senior Site Reliability Engineer (EST)

Teikametrics

5-7 yrs

India

Skills:

Python, Databricks, Java, Postgres, Sentry, Kafka, Aws Rds, Datadog, AWS, Bash, Kubernetes, Docker, Terraform, Javascript, CircleCI, Argo Workflows, Opensearch

Do you want to see more relevant and perfect job for you?

Beware of Scammers

We don’t charge any money for job offers

What it feels like to have

48% more interview calls?

To get 5X more recruiter views on your profile

Real-time notifications

Discover new jobs, get recruiter notifications, track applications & more with the foundit App.

Scan to download foundit App