Worldwide Technology (WWT),a 36-year-old global technology solutions provider specializing in systems integration, Infra-Cloud security, application development, AI Services, and supply chain solutions. With a workforce of 10,000+ employees and strategic partnerships with leading OEMs such as Cisco, Dell EMC, Microsoft, and NVIDIA, WWT delivers cutting-edge infrastructure, cloud, security, and custom application services to clients across 35 countries. Our Advanced Technology Centers (ATCs)—lab setup environments spanning over one million square feet of world-class integration and distribution space enable us to deliver unmatched value and innovation at scale. Recognized as one of the Best Places to Work by Glassdoor and Fortune for 14 consecutive years, WWT is also ranked #6 on India's Great Place to Work list for 2025.
Worldwide Technology Holding Co, LLC. (WWT) currently has an exciting opportunity available for the role of : Site Reliability Engineer role at Bengaluru (Hybrid Opportunity). If you are interested in this opportunity, please respond with an updated resume and the required details at the bottom of this email.
Position: Site Reliability Engineer
Location: Bangalore (Hybrid)
Contract : Long-term
Shift timings:5:30 AM-1:00PM and 1:30 - 10 PM - these might change and go to night hours so they need to be flexible and okay if it changes in the future
Additional Notes:
- He emphasized the importance of previous SRE experience, Kubernetes, and AWS.
Responsibilities:
- Own the deployment and operation of critical collaboration services across cloud and hybrid environments, driving reliability and scalability.
- Design, evolve, and optimize CI/CD pipelines and automation, including AIfirst tooling for deployment, monitoring, and incident response.
- Lead incident response for complex production issues, perform root cause analysis, and drive systemic reliability and performance improvements.
- Use observability data to guide capacity planning, scaling strategies, and resource optimization across services.
- Define and champion operational best practices, documentation standards, and a culture of reliability and operational excellence.
Minimum Qualifications
- Bachelor's degree in computer science, Engineering, or related field (or equivalent experience) with 7–13 years in Site Reliability Engineering, Cloud Operations, or Systems Engineering.
- Strong hands-on experience operating production services using Docker and Kubernetes in cloud or hybrid environments.
- Proficiency in one or more programming or scripting languages (e.g., Python, Go, Bash) to build automation and operational tooling.
- Experience with monitoring, observability, and incident response in production environments, including on-call participation and post-incident reviews.
- Working knowledge of Linux systems, networking, distributed systems, CI/CD pipelines, infrastructure-as-code, and Git-based workflows.
Preferred Qualifications
- Experience operating large-scale, globally distributed SaaS platforms.
- Familiarity with hybrid cloud environments and multi-region deployments.
- Experience applying AI-assisted or automation-first approaches to SRE tooling and workflows.
- Strong written communication skills for creating clear operational documentation and runbooks.