
Search by job, company or skills
Job Description
Required InformationDetails
1 Role** Developer
2 Required Technical Skill Set** SRE with Ansible, Python and Kubernetes
3 Desired Experience Range** 6 to 10 Yrs
5 Location of Requirement -Chennai
Desired Competencies (Technical/Behavioral Competency
)Must-Have** (Ideally should not be more than 3-5
)Strong hands-on with Kubernetes (deployment, scaling, RBAC, networking, Helm/Kustomize
)
Ansible for configuration management, automation, and infra as co
de
Python for tooling, automation, and platform integrati
ons
Expertise in Linux, networking (DNS, TCP/IP, HTTP), and shell scrip
ting
Observability: logging/metrics/tracing (e.g., Prometheus, Grafana, ELK/OpenSearch, OpenTelem
etry)
CI/CD pipelines (GitLab/GitHub Actions/Azure DevOps/Jenkins) and GitOps workflows (e.g., Ar
go CD)
Experience running production systems with on‑call support and incident r
esponse
Good-to-Have(Ideally should not be more t
han 3-5)Cloud platforms: AWS (EKS/AKS/GKE), Terrafor
m/Pulumi
Service meshes (Istio/Linkerd), Ingress/Nginx/Envoy, API
gateways
Security basics: secrets management, image scanning, Kubernetes security (OPA/Gatekeepe
r/Kyverno)
Message brokers/streaming: Kafka/RabbitMQ; ca
ches: Redis
Cost optimization, capacity planning, perfor
mance tuning
Familiarity with SRE practices: SLOs/SLIs, error budgets, runbooks
, postmortems
Responsibility of / Expectations
from the Role Design, build, and automate reliable Kubernetes platforms using Ansib
le & PythonImplement observability, alerting, and runbooks; drive SLO/SLI adoption and in
cident responseDevelop reusable automation/tooling for provisioning, deployments
, and day‑2 opsMaintain secure, scalable, and cost‑efficient infrastructure acro
ss environmentsCollaborate with Dev/Platform teams to improve resilience, performance, and r
elease velocityLead root-cause analysis and continuous improvement via postmortems an
d chaos testingJob ID: 149768273
Skills:
Prometheus, Elk Stack, Bash, Grafana, Redis, Rabbitmq, Gcp, Terraform, Linux, MySQL, Ansible, Apache Kafka, MongoDB, Azure, Oracle, Kubernetes, Python, AWS
Skills:
Agile, Software Development Life Cycle, Javascript, Splunk, Automation, JIRA, Python, Product management, Operations, Monitoring
Skills:
Google Cloud Platform, Linux Administration, Automation, Kubernetes, Incident Management, Golang, Docker, Dynatrace, Production Support
Skills:
Unix, C++, Perl, Data Structures, Ruby, Python, Performance Tuning
Skills:
S3, Cloudformation, PostgreSQL, Prometheus, Grafana, Jenkins, Lambda, CloudFront, Cloudwatch, Terraform, MySQL, Iam, Sqs, Kubernetes, AWS, GitHub Actions, GuardDuty, FluxCD, ArgoCD, CloudTrail
We don’t charge any money for job offers