Site Reliability Engineer

  • Posted a day ago

Job Description

Required Information: Details

1 Role: Developer
2 Required Technical Skill Set: SRE with Ansible, Python and Kubernetes
3 Desired Experience Range: 6 to 10 Yrs
5 Location of Requirement: Chennai


Desired Competencies (Technical/Behavioral Competency)

Must-Have (Ideally should not be more than 3-5)

  • Strong hands-on experience with Kubernetes (deployment, scaling, RBAC, networking, Helm/Kustomize)
  • Ansible for configuration management, automation, and infra as code
  • Python for tooling, automation, and platform integrations
  • Expertise in Linux, networking (DNS, TCP/IP, HTTP), and shell scripting
  • Observability: logging/metrics/tracing (e.g., Prometheus, Grafana, ELK/OpenSearch, OpenTelemetry)
  • CI/CD pipelines (GitLab/GitHub Actions/Azure DevOps/Jenkins) and GitOps workflows (e.g., Argo CD)
  • Experience running production systems with on‑call support and incident response

Good-to-Have (Ideally should not be more than 3-5)

  • Cloud platforms: AWS/Azure/GCP (EKS/AKS/GKE), Terraform/Pulumi
  • Service meshes (Istio/Linkerd), Ingress/Nginx/Envoy, API gateways
  • Security basics: secrets management, image scanning, Kubernetes security (OPA/Gatekeeper/Kyverno)
  • Message brokers/streaming: Kafka/RabbitMQ; caches: Redis
  • Cost optimization, capacity planning, performance tuning
  • Familiarity with SRE practices: SLOs/SLIs, error budgets, runbooks, postmortems

Responsibility of / Expectations from the Role

  • Design, build, and automate reliable Kubernetes platforms using Ansible & Python
  • Implement observability, alerting, and runbooks; drive SLO/SLI adoption and incident response
  • Develop reusable automation/tooling for provisioning, deployments, and day‑2 ops
  • Maintain secure, scalable, and cost‑efficient infrastructure across environments
  • Collaborate with Dev/Platform teams to improve resilience, performance, and release velocity
  • Lead root-cause analysis and continuous improvement via postmortems and chaos testing

More Info

Job ID: 149768273
