Job Summary:
We are seeking an experienced and dynamic Lead Site Reliability Engineering (SRE) to
oversee the reliability, scalability, and performance of our critical Applications.
NOTE:
- This is an Application heavy SRE Role and NOT into Infra and DevOps SRE.
- Backend (Java) developers interested in transitioning to SRE roles are also encouraged to apply.
- The candidate should be proficient in hands on Java coding(mandatory).
(Not just into montitoring and application support of Java based applications).
Skills:
- Proficiency in Java (hands on coding) and Microservices
- Proficient with tools like Prometheus, Grafana,, Elastic APM, or New relic.
- Hands-on experience with CI/CD pipelines (e.g., Jenkins, Azure Pipelines etc).
- Skilled in automation frameworks and tools for infrastructure and application deployments
- Proven track record in handling incidents, post-mortems, and implementing solutions to prevent recurrence (L3)
- Experience with database optimization, Kafka, or other messaging systems
- Proficient in writing advance SQL Queries.