Senior DevOps Engineer (AWS + Azure) – Noida
Location: Noida (On-site)
Experience: 4–8 years
Type: Full-time
About the Role
We're looking for a hands-on DevOps engineer who operates at the intersection of security, reliability, performance, and cost, treating them not as trade-offs but as simultaneous constraints.
This is not a pipeline-only role. You'll own infrastructure decisions end-to-end and work closely with engineering to build and scale multi-tenant, high-throughput systems.
What You'll Do
- Design, build, and operate scalable infrastructure across AWS and Azure
- Architect and manage multi-tenant environments with strong isolation and efficiency
- Own Kubernetes clusters (deployment, scaling, upgrades, cost optimization)
- Build and maintain CI/CD pipelines with a focus on speed and reliability
- Optimize cloud costs proactively (FinOps mindset, not reactive)
- Implement security best practices across infrastructure (IAM, networking, secrets, compliance)
- Improve system observability (logs, metrics, tracing) and incident response
- Work closely with backend/frontend teams to ensure production readiness
Must-Have Experience
- Deep hands-on experience with:
  - AWS (EC2, ECS/EKS, RDS, S3, IAM, VPC)
  - Azure (AKS, networking, identity, compute)
- Strong experience with:
  - Kubernetes (production-grade clusters)
  - Docker / containerization
  - Infrastructure as Code (Terraform preferred)
- Experience with caching layers like Redis / ElastiCache
- Proven experience managing multi-tenant SaaS systems at scale
- Strong understanding of:
  - System design trade-offs (latency, throughput, cost)
  - Networking (VPCs, subnets, load balancers)
  - Security (least privilege, secrets management, threat surface reduction)
What Good Looks Like in This Role
- You've reduced cloud costs significantly without hurting performance
- You've designed systems that scale cleanly under load spikes
- You've handled real production incidents and improved systems afterward
- You think in terms of failure modes, not just happy paths
- You don't just deploy infra—you own its behavior in production
Nice to Have
- Experience with service meshes (Istio, Linkerd)
- Experience with event-driven systems (Kafka, SQS, Pub/Sub)
- Exposure to security/compliance frameworks (SOC2, ISO, etc.)
- Scripting skills (Python, Bash, or Go)
Why Join Us
- Work on real scaling problems, not toy systems
- High ownership and autonomy
- Opportunity to shape infrastructure from early stages to scale
To Be Considered for This Role
- Email your resume and a 2-minute self-recorded video to [Confidential Information] and [HIDDEN TEXT]. LinkedIn applications will be ignored.
- Practical discussion on past systems you've built
- Deep dive into one production incident you handled
- Live problem-solving (architecture + trade-offs)
- Culture and ownership fit, including comfort working at a startup