Senior Devops Engineer

Fenix Commerce

Noida, India

4-8 Years

Save

Posted 11 hours ago
Be among the first 10 applicants

Early Applicant

Job Description

Senior DevOps Engineer (AWS + Azure) – Noida

Location: Noida (On-site)

Experience: 4–8 years

Type: Full-time

About the Role

We're looking for a hands-on DevOps engineer who operates at the intersection of security, reliability, performance, and cost—not as trade-offs, but as simultaneous constraints.

This is not a pipeline-only role. You'll own infrastructure decisions end-to-end and work closely with engineering to build and scale multi-tenant, high-throughput systems.

What You'll Do

Design, build, and operate scalable infrastructure across AWS and Azure
Architect and manage multi-tenant environments with strong isolation and efficiency
Own Kubernetes clusters (deployment, scaling, upgrades, cost optimization)
Build and maintain CI/CD pipelines with a focus on speed and reliability
Optimize cloud costs proactively (FinOps mindset, not reactive)
Implement security best practices across infrastructure (IAM, networking, secrets, compliance)
Improve system observability (logs, metrics, tracing) and incident response
Work closely with backend/frontend teams to ensure production readiness

Must-Have Experience

Deep hands-on experience with:
AWS (EC2, ECS/EKS, RDS, S3, IAM, VPC)
Azure (AKS, networking, identity, compute)
Strong experience with:
Kubernetes (production-grade clusters)
Docker / containerization
Infrastructure as Code (Terraform preferred)
Experience with caching layers like Redis / ElastiCache
Proven experience managing multi-tenant SaaS systems at scale
Strong understanding of:
System design trade-offs (latency, throughput, cost)
Networking (VPCs, subnets, load balancers)
Security (least privilege, secrets management, threat surface reduction)

What Good Looks Like in This Role

You've reduced cloud costs significantly without hurting performance
You've designed systems that scale cleanly under load spikes
You've handled real production incidents and improved systems afterward
You think in terms of failure modes, not just happy paths
You don't just deploy infra—you own its behavior in production

Nice to Have

Experience with service meshes (Istio, Linkerd)
Experience with event-driven systems (Kafka, SQS, Pub/Sub)
Exposure to security/compliance frameworks (SOC2, ISO, etc.)
Scripting skills (Python, Bash, or Go)

Why Join Us

Work on real scaling problems, not toy systems
High ownership and autonomy
Opportunity to shape infrastructure from early stages to scale

To Be Considered for This Role

You must send email with your resume and a 2 mins self recorded video to [Confidential Information] and [HIDDEN TEXT]. LI applications will be ignored.
Practical discussion on past systems you've built
Deep dive into one production incident you handled
Live problem-solving (architecture + trade-offs)
Culture & ownership fit and working for a startup