
Bahwan Cybertek

DevOps / SRE Lead

  • Posted 13 hours ago

Job Description


Role description

We are looking for a technically strong, hands-on DevOps / SRE Lead to drive cloud-native infrastructure, reliability engineering, and developer experience for one of the world's leading travel technology platforms. The role demands deep expertise across the modern DevOps toolchain, cloud infrastructure, and platform engineering practices, with the ability to own end-to-end delivery pipelines and champion SRE culture.


 


Key Responsibilities


      Design, build, and operate cloud-native infrastructure on Azure and in on-premises data centers using infrastructure-as-code principles (Ansible, Terraform).


      Architect and manage Kubernetes (AKS / self-managed) clusters at scale; enforce GitOps workflows.


      Drive adoption of Platform Engineering practices: build internal developer platforms (IDPs) using Backstage or an equivalent to reduce cognitive load on dev teams.


      Manage and optimise container image lifecycle, registries (ACR, ECR), and multi-environment deployment strategies (blue-green, canary, rolling).


      Implement full-stack observability using the OpenTelemetry standard: metrics (Prometheus / Thanos), logs (Loki / EFK / OpenSearch), and traces (Jaeger, Tempo).


      Build and maintain Grafana dashboards, runbooks, and SLI/SLO frameworks; drive error-budget culture with tech/product teams.


      Lead incident response and continuous reliability improvement.


      Embed security into every layer: network policies, RBAC, OPA/Gatekeeper policies in Kubernetes, and image signing (Cosign/Notary).


      Manage secrets hygiene, certificate lifecycle (cert-manager), and cloud IAM with least-privilege principles.


      Ensure compliance alignment (SOC 2, PCI-DSS awareness) for production workloads.


      Operate and optimise event-streaming infrastructure: Apache Kafka, NATS, or RabbitMQ.


      Support database reliability for PostgreSQL, MSSQL, MongoDB, and Redis; coordinate DBA activities for backups, failover, and performance.


      Collaborate closely with application development, QA, product, and security teams to align DevOps strategy with business goals.


      Manage vendor and tool evaluations, and present infrastructure roadmaps to technical leadership.



Job ID: 146836145
