Deutsche Börse

Lead Site Reliability Engineer

  • Posted 9 hours ago

Job Description

About Deutsche Börse Group:

Headquartered in Frankfurt, Germany, we are a leading international exchange organization and market infrastructure provider. We empower investors, financial institutions, and companies by facilitating access to global capital markets. Our business areas cover the entire financial market transaction process chain, including trading, clearing, settlement and custody, digital assets and crypto, market analytics, and advanced electronic systems. As a technology-driven company, we develop and operate cutting-edge IT solutions globally.

About Deutsche Börse Group in India:

Our presence in Hyderabad serves as a key strategic hub, comprising India's top-tier tech talent. We focus on crafting advanced IT solutions that elevate market infrastructure and services. Together with our colleagues from across the globe, we are a team of highly skilled capital market engineers forming the backbone of financial markets worldwide. We harness the power of innovation in leading technology to create trust in the markets of today and tomorrow.

Site Reliability Engineer – Cloud Native Platforms (GCP / Azure)

Corporate IT – Deutsche Börse Group

We're building the next generation of cloud-native operations at Deutsche Börse Group. As part of our Corporate IT Cloud Infrastructure Operations team, we're looking for a Site Reliability Engineer (SRE) who's passionate about automation, reliability, and modern cloud technologies.

This is not just another ops role; it's an opportunity to shape how we run and scale our cloud-native platforms (GCP, Azure) in a highly regulated financial environment. You'll work at the intersection of development and infrastructure operations, helping us evolve our platform capabilities while ensuring performance, resilience, and security.

What You Will Do:

  • Design, deploy, and manage scalable, reliable systems in GCP and Azure using serverless technologies, containerization, AI/ML, and PaaS-based infrastructure as code, with a strong focus on automation and observability.
  • Integrate AI/ML technologies into internal tools and workflows to drive automation and efficiency.
  • Operate and maintain geo-redundant, business-critical services, leveraging automation and observability tools to ensure transparency and fast issue resolution.
  • Collaborate with development teams to implement best practices for cloud infrastructure, ensuring high availability and scalability of applications.

What You Bring:

  • Proven experience as a DevOps / SRE Engineer or a similar role
  • Expertise in managing and optimizing GCP or Azure cloud-native services and AI/ML integration.
  • Experience with or knowledge of container technologies such as Docker, Buildah, and Kubernetes (GKE, AKS)
  • Must have 2+ years of scripting and programming experience (Python, Bash)
  • Proficiency in infrastructure-as-code and GitOps tools, particularly Terraform and Argo CD
  • Familiarity with observability tools such as Prometheus, Grafana, and OpenTelemetry
  • Solid understanding of CI/CD concepts

Why Join Us

  • Be part of a growing SRE team that's building new expertise in cloud-native operations.
  • Work with modern technologies in a mission-critical environment.
  • Help shape the future of IT operations at one of Europe's leading financial institutions.

Job ID: 148983735