Search by job, company or skills

Uplers

Senior Reliability Engineer | Remote

5-8 Years
Save
  • Posted 20 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Experience: 5.00 + years

Salary: USD 38000.00 / year (based on experience)

Expected Notice Period: 7 Days

Shift: (GMT+05:30) Asia/Kolkata (IST)

Opportunity Type: Remote

Placement Type: Full Time Contract for 6 Months(40 hrs a week/160 hrs a month)

(*Note: This is a requirement for one of Uplers client - A Renowned Hiring Product Company from USA)

What do you need for this opportunity

Must have skills required:

Cloud platforms (AWS, or Azure) and infrastructure-as-code, .NET Ruby on Rails Go OpenTelemetry LaunchDarkly Azure / GCP Kubernetes Terraform, Datadog, Grafana, opentelemetry, or equivalents, Prometheus, GCP

A Renowned Hiring Product Company from USA is Looking for:

India (Remote)

Contracting - 6 months

India (Remote)

  • 5–8 years experience

This role is for engineers who find deep satisfaction in making systems genuinely

trustworthy — not building features, but raising the floor for everyone who does.

The opportunity

Own the reliability of systems that matter

Our products are only as good as the systems running them. As a Senior Reliability

Engineer, you'll have direct ownership over the health, observability, and long-term stability

of our platform — work that has a multiplying effect on every engineer and every customer

we have.

You'll cut through years of accumulated technical debt, modernize aging infrastructure, and

put in place the telemetry, guardrails, and performance improvements that make our

systems a source of confidence rather than concern. This isn't maintenance work — it's

foundational engineering with high leverage and high visibility.

What You'll Own

Drive the observability strategy — ensure application telemetry is meaningful,

actionable, and trusted by engineering teams across the org

Lead incident response improvements: build better runbooks, detection, and

post-mortems that shorten time-to-resolution and prevent recurrence

Own performance investigations end-to-end — identify bottlenecks, propose

solutions, and ship the fix

Eliminate error noise polluting telemetry, so signal is signal again and on-call isn't a

guessing game

Contribute to migrating legacy Rails and Go services to a modern .net, maintainable

stack

Partner closely with Cloud Infrastructure, Developer Experience, and Product

Engineering to make reliability a shared practice, not a silo

Apply AI-assisted tooling to accelerate reliability initiatives — from anomaly detection

to automated triage

Bring clear thinking to ambiguous situations — when something breaks in an

unexpected way, you're the person who figures out why

What We're Looking For

5–8 years of software engineering experience, with meaningful time spent on

infrastructure, platform, or reliability work

Strong hands-on experience with observability tooling — OpenTelemetry, Datadog,

Prometheus, Grafana, or equivalents

Comfort working across languages Good at .net and familiarity with Ruby on Rails

or Go is a plus

Experience diagnosing and resolving production performance issues at scale

A bias for clarity — you write clean runbooks, clear postmortems, and make your

work legible to others

Ability to work independently and make good judgment calls without a lot of

hand-holding

Strong communication skills — you can work across time zones and teams

effectively

Nice to have

Experience with cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code

Background in SRE practices — SLOs, error budgets, capacity planning

Experience with service migrations or modernization projects in production systems

Familiarity with AI/ML-assisted observability or automated remediation workflows

Tech you'll work with

  • .NET
  • Ruby on Rails
  • Go
  • OpenTelemetry
  • LaunchDarkly
  • Azure / GCP
  • Kubernetes
  • Terraform

Who Thrives Here


Engineers who get quiet satisfaction from a clean dashboard, a postmortem that actually

prevents the next incident, or a migration that finally retires a component that's caused pain

for years. You don't need someone to define the problem for you — you find it, frame it, and

drive it to resolution.

If you're energized by ownership, frustrated by flaky systems, and want your work to raise

the standard for an entire engineering organization — this role was built for you.

How to apply for this opportunity

  • Step 1: Click On Apply! And Register or Login on our portal.
  • Step 2: Complete the Screening Form & Upload updated Resume
  • Step 3: Increase your chances to get shortlisted & meet the client for the Interview!

About Uplers:


Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant contractual onsite opportunities and progress in their career. We will support any grievances or challenges you may face during the engagement.

(Note: There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well).

So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 148912349

Similar Jobs

India

Skills:

CassandraPrometheusMariadbBashGrafanaRedisMemcachedAnsibleCephPuppetRubyKubernetesPythonOpenStack SwiftLinux systems administrationGomonitoring and observability toolsLAMP stack technologies

Bengaluru, India

Skills:

ElkPrometheusSlasNetworkingDnsGrafanaCdnGraylogPythonAWSPerformance TuningBashDevopsHigh AvailabilityGcpLoad BalancingAzureKubernetesSLIsGoDisaster Recoveryobservability toolsSecurityOpenTelemetryInfrastructure EngineeringSite Reliability Engineeringlog management toolsreliability metricsSLOscontainer orchestrationincident management frameworks

India, Cochin / Kochi / Ernakulam

Skills:

ScriptingJavaPrometheusNode.jsGrafanaDatadogKubernetesPythonAWS

Hyderabad, India

Skills:

.NETJavaAzure cloud servicesCosmos DBPythonAzure Container InstancesApplication logsAzure SQL Database

India

Skills:

PythonDatabricksJavaPostgresSentryKafkaAws RdsDatadogAWSBashKubernetesDockerTerraformJavascriptCircleCIArgo WorkflowsOpensearch