Skills: SRE, Python/Java, Agentic AI, Automation
Work location: Bangalore
Key Responsibilities
- Design, develop, and maintain automation solutions to improve system reliability, operational efficiency, and incident response.
- Write clean, scalable, and maintainable code using Java or Python.
- Build bots, accelerators, or intelligent automations leveraging Agentic AI and modern AI frameworks.
- Collaborate closely with development, product, and support teams to ensure high availability and performance of critical systems.
- Apply SRE best practices for monitoring, alerting, incident management, and post-incident analysis.
- Support and maintain retail domain applications with a strong understanding of business workflows.
- Provide production support for critical systems in a support project environment, including troubleshooting and root cause analysis.
- Continuously improve reliability through tooling, automation, and process optimization.
Required Skills & Qualifications
- Strong programming skills in Java and/or Python.
- Proven experience in building automations and scripts for operational use cases.
- Experience developing bots or accelerators using Agentic AI concepts.
- Solid understanding of SRE principles (availability, reliability, scalability, observability).
- Good knowledge of the retail domain (e.g., order management, inventory, supply chain, POS systems).
- Hands-on experience working in support projects with exposure to production environments.
- Strong problem-solving, debugging, and analytical skills.
- Ability to work collaboratively in a fast-paced, agile environment.