We are looking for a DevOps Engineer to support and enhance our cloud-based production infrastructure. The ideal candidate will have hands-on experience with AWS, Kubernetes, Docker, and Azure DevOps (ADO), along with a strong foundation in DevSecOps principles. This role requires operational ownership, troubleshooting skills, and the ability to work closely with development, security, and operations teams to ensure the reliability, scalability, and security of our systems.
Key Responsibilities
- Support and maintain production cloud infrastructure hosted on AWS
- Manage and troubleshoot containerized workloads using Docker and Kubernetes
- Deploy, monitor, and maintain CI/CD pipelines using Azure DevOps (ADO)
- Perform incident management, root cause analysis, and production issue resolution
- Monitor system performance, availability, and capacity using standard observability tools
- Apply DevSecOps best practices, including security scanning, access control, and compliance checks
- Assist in infrastructure automation using Infrastructure-as-Code (IaC) tools
- Collaborate with development teams to ensure smooth application deployments
- Participate in change management, patching, upgrades, and routine maintenance activities
- Document operational procedures, runbooks, and troubleshooting guides
Mandatory Skills & Qualifications
Certifications (Required)
- AWS Certified Solutions Architect - Associate
- HashiCorp Certified: Terraform Associate
Technical Skills (Required)
- Amazon Web Services (AWS)
  - EC2, VPC, IAM, S3, CloudWatch (hands-on production exposure)
- Containerization & Orchestration
  - Docker (image creation, container lifecycle management)
  - Kubernetes (pods, deployments, services, basic troubleshooting)
- CI/CD
  - Azure DevOps (ADO): pipelines, builds, releases, repositories
- Operating Systems
  - Strong working knowledge of Windows and Linux (RHEL/Ubuntu preferred)
- DevSecOps
  - High-level understanding of secure CI/CD pipelines
  - Familiarity with vulnerability scanning, secrets management, and security controls
Experience (Required)
- 2-3 years of overall experience supporting production infrastructure
- Experience working in production support environments is a plus
Good to Have (Preferred)
- Infrastructure as Code (Terraform / CloudFormation)
- Basic scripting skills (Bash, Python)
- Experience with monitoring tools (CloudWatch, Prometheus, Grafana, ELK)
- Exposure to SRE concepts and reliability engineering
- Knowledge of ITIL processes (Incident, Change, Problem Management)
Soft Skills
- Strong troubleshooting and analytical skills
- Ability to work under pressure in production environments
- Good communication and documentation skills
- Team player with a proactive and ownership-driven mindset
Our Interview Practices
To maintain a fair and genuine hiring process, we kindly ask that all candidates participate in interviews without the assistance of AI tools or external prompts. Our interview process is designed to assess your individual skills, experiences, and communication style. We value authenticity and want to ensure we're getting to know you, not a digital assistant. To help maintain this integrity, we ask that you remove virtual backgrounds during interviews, and we include in-person interviews in our hiring process. Please note that the use of AI-generated responses or third-party support during interviews will be grounds for disqualification from the recruitment process.