JobSummary
Wearelookingfor8+yearsexperiencedOpenTelemetry(OTel)OperationsEngineertomanageandmaintainobservabilitysolutionsacrossapplicationsandinfrastructure.Theroleinvolvesimplementing,monitoring,andoptimizingtelemetrydata(metrics,logs,andtraces)usingOpenTelemetrytoensuresystemreliability,performance,andoperationalvisibility.
KeyResponsibilities
- ImplementandmanageOpenTelemetry(OTel)instrumentationforapplicationsandservices.
- Configureandmaintainmetrics,logs,anddistributedtracingpipelines.
- Monitorsystemperformanceandtroubleshootissuesusingobservabilitytools.
- IntegrateOpenTelemetrywithmonitoringplatformssuchasGrafana,Prometheus,orsimilartools.
- Ensuretelemetrydatacollectionisoptimizedforperformanceandcost.
- CollaboratewithDevOps,SRE,anddevelopmentteamstoimprovesystemobservability.
- Managealerts,dashboards,andincidentanalysisbasedontelemetryinsights.
- Supportproductionenvironmentsandensurehighavailabilityofmonitoringsystems.
RequiredSkills
- Strong8+yearsexperiencewithOpenTelemetry(OTel)frameworksandcollectors.
- Knowledgeofobservabilityconcepts:metrics,logs,traces.
- ExperiencewithmonitoringtoolssuchasPrometheus,Grafana,Datadog,orELKstack.
- FamiliaritywithKubernetes,Docker,andcloudplatforms(AWS/Azure/GCP).
- ExperiencewithCI/CDpipelinesandDevOpspractices.
- Basicscriptingknowledge(Python,Bash,orsimilar).
PreferredQualifications
- ExperienceinSREorDevOpsoperationsroles.
- Understandingofmicroservicesarchitecture.
- Exposuretodistributedsystemsmonitoring.