VP, Software Engineering - Observability

  • Posted 19 hours ago

Job Description

Role: VP, Software Engineering - Observability

Location: Pune

Work Mode: Hybrid

Overview:

We are currently hiring a Lead Software Engineer to drive the design and evolution of large-scale observability platforms for enterprise-grade systems.

This role sits at the intersection of engineering excellence, platform reliability, and strategic technology leadership, with a strong focus on building robust observability frameworks across distributed environments.

Key Responsibilities:

  • Design, develop, and scale high-quality software solutions aligned with modern engineering practices
  • Architect and implement observability platforms across logs, metrics, and traces
  • Drive adoption of OpenTelemetry, Prometheus, and Grafana across teams and systems
  • Define and operationalise SLOs and SLIs to improve system reliability and performance
  • Collaborate with cross-functional teams to align technology solutions with business objectives
  • Mentor engineers and promote best practices in instrumentation, monitoring, and alerting
  • Contribute to a culture of code quality, innovation, and continuous improvement

What We're Looking For:

  • Strong experience in large-scale distributed systems and observability
  • Hands-on expertise with OpenTelemetry, Prometheus, Grafana, and tracing platforms
  • Proficiency in several languages or runtimes, such as Java, Python, Go, or Node.js
  • Deep understanding of structured logging, metrics, and tracing frameworks
  • Proven ability to define and drive reliability engineering practices (SLOs/SLIs)
  • Experience in influencing engineering strategy and mentoring teams

You will play a critical role in shaping how modern systems are observed, measured, and improved, enabling engineering teams to build resilient, high-performing platforms at scale.

In a world of distributed systems, it is not the failure that defines engineering maturity; it is how quickly and intelligently you can observe, understand, and respond to it.

Job ID: 147193791
