DevOps Engineer (1+ Years Experience)
Full-time | Gurugram | Immediate or 24 Week Joiners Preferred
We are looking for an outstanding DevOps Engineer who can work on automation, CI/CD, container orchestration, and cloud infrastructure. You will help build and maintain scalable, secure, and highly available systems used across multiple production environments.
Key Responsibilities
- Manage and automate deployments using CI/CD pipelines (Jenkins, GitHub/Bitbucket Pipelines).
- Build reproducible and reliable infrastructure using Terraform / Ansible
- Deploy and maintain applications on Kubernetes (Helm, manifests, controllers, namespaces).
- Manage containerization workflows using Docker and follow best practices for image builds.
- Implement secure and scalable networking using Nginx, Cloudflare, and DNS management.
- Set up monitoring, logging, and alerting for production systems.
- Improve system reliability, performance, and availability through automation.
- Collaborate closely with backend, frontend, and QA teams for smooth releases.
- Follow best practices including security hardening, OWASP guidelines, and infrastructure audits.
- Participate in incident handling, root cause analysis, and preventive improvements.
- Deployment experience ononprem system/ private clouds.
Required Skills
- 1+ years of hands-on DevOps or Platform Engineering experience
- Strong understanding of:
- Docker
- Kubernetes
- Linux administration
- CI/CD pipelines (Jenkins preferred)
- Experience deploying microservices or distributed systems in cloud or on-premise setups.
- Good understanding of Git workflows (Bitbucket/GitHub).
- Ability to write automation scripts (Python/Bash).
- Experience managing Nginx, reverse proxies, SSL/TLS, and Cloudflare settings.
- Familiarity with monitoring/logging (Loki, Prometheus, Grafana, ELK, etc.).
- Understanding of security best practices including OWASP and basic hardening.
- Good Knowledge of networking concepts (VPC, subnets, routing, ingress/egress).
- Hands-on deployment experience on AWS, DigitalOcean, or similar cloud providers.
- Implement and maintain robust database monitoring, backup, restore, and rollback strategies, ensuring high availability, performance optimization, and disaster-recovery readiness across all environments
- Apache Airflow operations optimizations.
Good to Have
- Experience with ArgoCD, CircleCI
- Exposure to SRE practices: SLOs, SLIs, SLAs, on-call readiness.
- Ability to design scalable infrastructure for multi-environment deployments.
- Experience with Playwright/Selenium for automated testing (optional).
Good understanding of Terraform / Ansible
Optimize infrastructure and LLM-related operational costs by monitoring token usage, improving
caching strategies, minimizing redundant calls, and implementing efficient request-handling
workflows.