Site Reliability Engineer (SRE) - Contract - Hyderabad (Hybrid role)

World Wide Technology

Hyderabad, India

Fresher

Save

Posted 8 hours ago
Be among the first 10 applicants

Early Applicant

Job Description

Worldwide Technology (WWT), a 36-year-old global technology solutions provider specializing in systems integration, Infra-Cloud security, application development, AI Services, and supply chain solutions. With a workforce of 10,000+ employees and strategic partnerships with leading OEMs such as Cisco, Dell EMC, Microsoft, and NVIDIA, WWT delivers cutting-edge infrastructure, cloud, security, and custom application services to clients across 35 countries. Our Advanced Technology Centers (ATCs)—lab setup environments spanning over one million square feet of world-class integration and distribution space enable us to deliver unmatched value and innovation at scale. Recognized as one of the Best Places to Work by Glassdoor and Fortune for 14 consecutive years, WWT is also ranked #6 on India's Great Place to Work list for 2025.

WorldWide Technology Holding Co, LLC. (WWT) We currently have an exciting opportunity for a Site Reliability Engineer (SRE) role in Hyderabad (Hybrid). If you are interested in this opportunity, please respond with an updated resume and the required details at the bottom of this email.

Position: Site Reliability Engineer (SRE)

Location: Hyderabad

Contract Opportunity

Responsibilities

Build, operate, and harden the Internal MCP Gateway and related platform services.
Define and implement observability, monitoring, and audit capabilities for agent tool calling and MCP traffic.
Ensure MCP platform components meet availability, performance, and reliability targets.
Support secure hosting and runtime environments for MCP servers and related services.
Partner with ML and Quality Engineers to support testing, evaluation, and safe rollout of platform changes.
Contribute to service management, incident response, and operational readiness for production AI platforms.
Help shape the long-term platform architecture supporting federated MCP scale.

Skills Required:

Strong Python experience for platform services and operational tooling.
Experience with observability and monitoring stacks (metrics, tracing, logging, dashboards). (Prometheus, Grafana)
SQL skills for operational analysis, reporting, and audit queries.
Hands-on experience operating services on AWS.
Knowledge of cloudnative reliability patterns and service lifecycle management.
Experience supporting platforms subject to audit, compliance, and regulated delivery processes.