Search by job, company or skills

Griphic

DevOps Engineer - L3 (Software Engineer II)

new job description bg glownew job description bg glownew job description bg svg
  • Posted a day ago
  • Be among the first 30 applicants
Early Applicant

Job Description

DevOps Engineer - L3 (Software Engineer II)

Location & Type: Delhi, Full-time

CTC Range (LPA): 29.00 - 36.25

Role Overview

Experienced DevOps engineer who can own and scale production infrastructure end-to-end - from CI/CD and IaC to observability and incident response. You'll lead design docs, harden reliability and security, drive cost/perf efficiency.

What You'll Do

  • Architect and maintain CI/CD pipelines (build, test, security scans, deploy, rollback) with quality gates and environment promotions.
  • Design and operate container platforms (ECS/EKS or equivalent), service discovery, blue/green & canary strategies, and autoscaling.
  • Implement Infrastructure as Code (Terraform/CDK/CloudFormation), enforce modular, reviewable, and drift-free infra.
  • Build observability: metrics/logs/traces, SLOs/SLIs, dashboards, and actionable alerts; reduce MTTR through runbooks and automation.
  • Champion platform reliability: capacity planning, HA/DR (multi-AZ), backup/restore testing, change management.
  • Own secrets management, IAM least-privilege, network policies, and baseline hardening (CIS where relevant).
  • Drive cost optimization (rightsizing, autoscaling policies, savings plans/spot, storage lifecycle) with monthly reporting.
  • Establish release/incident processes (postmortems, RCAs) and lead remediation to cut change failure rate.
  • Partner with Backend/AI teams to productize models/services (GPU pools, batching, caching layers) and streamline developer workflows.
  • lead design reviews, tech spikes, Monitoring and documentation.

Technical Qualifications

  • 3 - 4+ years in DevOps/SRE/Platform roles supporting production systems at scale.
  • Strong with AWS : VPC, IAM, ECS/EKS, ALB/NLB, RDS/Elasticache/Object storage, CloudWatch.
  • Proficient in Terraform (or CDK/CloudFormation), CI/CD (GitHub/GitLab/Jenkins/Argo) including artifacts and environment promotion.
  • Containers & orchestration: Docker, task definitions/helm charts, autoscaling, health checks, readiness/liveness.
  • Observability: Prometheus/Grafana, OpenTelemetry, log pipelines (ELK/CloudWatch/Datadog), alert routing.
  • Networking & security: VPC/Subnets, SGs/NACLs, TLS, DNS, WAF, IAM design, secrets (KMS/Parameter Store/Vault).
  • Scripting/automation in Python/Bash, configuration management (Ansible or equivalent).
  • Proven incident management: on-call practice, runbooks, RCAs, tuning alerts to reduce noise.

Nice to Have

  • Kubernetes (EKS) production experience, service mesh (Istio/Linkerd), GitOps (ArgoCD/Flux).
  • Image and dependency security (Trivy/Grype/Snyk), SBOMs, policy-as-code (OPA/Conftest).
  • Data platform ops (Postgres/Mongo backups, PITR, replicas), streaming (Kafka/Kinesis).
  • Edge/GPU workloads (Triton/TorchServe) and autoscaling for AI inference.

About the Company

Griphic is founded by IIT Delhi engineers with a vision to enrich lives through technological innovation. We combine cutting-edge AI with hyper-realistic virtual experiences to solve problems and disrupt industries. Our team includes IIT Delhi engineers, AI/ML experts, VR developers, and 3D specialists. Backed by SKETS Studio (700+ professionals in BIM, architecture, VR, and 3D visualization), we are building the future of immersive web applications.

More Info

Job Type:
Industry:
Function:
Employment Type:

About Company

Job ID: 143286519