Mumba Technologies, Inc.

Site Reliability Engineer

  • Posted 10 hours ago

Job Description

Job Title: DevOps Infrastructure Engineer

Job Type: Full Time

Location: Noida 62

Shift Timing: Night Shift

Role Overview:

We are seeking an experienced DevOps Infrastructure Engineer with deep technical expertise in multi-cloud infrastructure automation, distributed system design, and secure, high-availability environments. This role will own the delivery of resilient platform services across Azure, AWS, GCP, and OCI, with a strong emphasis on IaC, Kubernetes, pipelines, and observability engineering.

Core Technical Responsibilities

Multi-Cloud Architecture & Engineering

Architect and automate provisioning of compute, networking, storage, and identity systems across Azure, AWS, GCP, and OCI

Implement advanced cloud networking (VNet/Transit Gateway, VPC Peering, ExpressRoute/VPN, DNS, WAF, Load Balancing)

Configure workload scaling policies, auto-healing, and disaster recovery strategies using multi-region / multi-zone deployment models

Enable governance controls including RBAC, Azure Policy, Cloud Custodian, and secrets management (Key Vault, KMS, HashiCorp Vault)

Containers, Kubernetes & Platform Ops

Deploy and operate containerized workloads via AKS, EKS, GKE, and OCI Container Engine for Kubernetes (OKE)

Implement GitOps workflows using FluxCD or ArgoCD

Build secure container supply chains (image scanning, policy enforcement, artifact registries)

Tune and monitor cluster internals including etcd, CNI plugins, ingress controllers, and service mesh technologies

CI/CD Engineering & Automation

Engineer fully automated CI/CD pipelines using Azure DevOps, Jenkins, GitLab, or GitHub Actions

Integrate automated test suites, security scanning (SAST/DAST), compliance gates, artifact promotion, and blue-green / canary deployments

Develop reusable Terraform modules and pipelines for scalable IaC delivery

Automate operational runbooks through scripting in PowerShell, Python, or Bash

Data Platform Infrastructure

Administer SQL and NoSQL systems (SQL Server, PostgreSQL, MySQL, Cosmos DB, Redis, MongoDB)

Design HA/replication strategies and automated failover, and tune query performance

Implement backup orchestration, PITR, policy-based archiving, encryption, and audit enforcement

Observability, SRE & Incident Response

Deploy telemetry pipelines ingesting metrics, logs, and traces into tools such as Prometheus, Grafana, Datadog, and Azure Monitor

Configure synthetic tests, distributed tracing, and SLO/SLI tracking aligned to reliability objectives

Lead root cause analysis and implement corrective automation to reduce MTTR

Required Technical Qualifications

5+ years of hands-on DevOps / Cloud SRE engineering experience in production systems

Expert-level proficiency with Terraform, cloud-native networking, and Kubernetes

Multi-cloud platform experience with Azure, plus experience with GCP, AWS, and OCI

Strong understanding of identity federation, MFA/SSO, OAuth2, SAML, and workload identity

Demonstrated experience designing secure architectures aligned to NIST, CIS Benchmarks, and Zero Trust models

Fluent in scripting for automation and infrastructure operations (PowerShell, Python, Bash)

Proven reliability engineering experience delivering resilient distributed systems

Preferred Technical Skills

Azure AZ-104, AZ-305, DevOps, or equivalent cloud certifications

FinOps methodologies for cost control and workload efficiency

Service mesh experience (Istio, Linkerd, Consul)

Event-driven architecture support (Kafka, Event Hubs, Pub/Sub)

Automation for patching, compliance, and vulnerability remediation

More Info

Job ID: 136400173
