About the Role
We're seeking a highly skilled and motivated Junior DevOps Engineer to join our growing team. You'll take the lead in designing and managing robust infrastructure, CI/CD pipelines, observability platforms, and automation frameworks that power our core services with a strong emphasis on Azure Cloud, AKS, GitOps, and AI/ML workloads.
This is a high-impact role, ideal for someone who's comfortable owning systems end-to-end and collaborating across development, AI/ML, and product teams to deliver secure, scalable cloud-native solutions.
What You'll Do
- Design, deploy, and manage infrastructure on Azure, including AKS (Azure Kubernetes Service)
- Build and maintain CI/CD pipelines using GitHub Actions with full lifecycle automation
- Use Terraform and Helm to provision and manage infrastructure as code
- Implement GitOps workflows using ArgoCD, Flux, or Spinnaker
- Set up and operate end-to-end monitoring, logging, and alerting systems (Prometheus, Grafana, ELK, Azure Monitor)
- Support and scale AI/ML model deployments, including GPU infrastructure management
- Administer and optimize SQL and NoSQL databases (e.g., PostgreSQL, MongoDB, Cosmos DB)
- Manage secrets and credentials using Azure Key Vault or HashiCorp Vault
- Drive security, compliance, and cost optimization across environments
- Apply ITIL principles for incident, change, and release management
- Lead root cause analysis, postmortems, and continuous improvement initiatives
- Coach and mentor junior engineers and promote DevOps best practices
What We're Looking For
- 1+ years of hands-on experience in DevOps, SRE, or Cloud Infrastructure Engineering
- Strong expertise in Azure Cloud and AKS
- Proven experience with Docker, Kubernetes, and Helm
- Deep understanding of GitOps workflows using ArgoCD, Flux, or Spinnaker
- Advanced skills in Terraform and managing Infrastructure as Code
- Solid scripting experience with Python, Bash, or PowerShell
- Hands-on with full observability stack: Prometheus, Grafana, ELK, Azure Monitor, Alertmanager, etc.
- Experience deploying and managing AI/ML workloads, especially with GPU integration
- Expertise in both SQL and NoSQL databases (e.g., PostgreSQL, Cosmos DB, MongoDB)
- Working knowledge of secret management (Azure Key Vault, Vault)
- Familiarity with policy-as-code frameworks like OPA or Kyverno
- Proven ability to work in ITIL-aligned environments with structured processes
- Cloud cost monitoring and FinOps awareness
- Excellent verbal and written communication skills