Production Support Engineer
Mandatory Skills
Skillset: Incident management, Splunk Observability, Synthetics, OpenTelemetry, GCP, and knowledge of AI-based applications
- 3+ years of production support experience (3rd level, technical).
- Previous experience in one of the following roles: Software Engineer, DevOps Engineer, Cloud Infra Engineer.
- Adept at navigating monitoring tools, logs and events to diagnose application issues.
- Experience with Cloud-Native concepts and technologies, particularly GCP.
- Experience with Incident Management, preferably via ServiceNow.
- Experience with Monitoring/Observability tools such as Splunk Observability, Synthetics and OpenTelemetry.
- Able to diagnose and troubleshoot technical issues related to AI software and services.
- Prior experience supporting production cloud infrastructure at scale (GCP, AWS, etc.).
- Strong analytical and problem-solving skills; able to solve complex problems quickly.
- Able to communicate complex technical issues effectively.
- Motivated and proactive. Able to deal with uncertainty and ambiguous requirements.
- Able to provide on-call support across multiple time zones.
Non-mandatory but Advantageous Requirements
- GCP and Kubernetes experience is a definite benefit.
- Experience with Unix/Linux networking and operating systems.
- Programming experience with Python, or DevOps experience.
- Experience writing Infrastructure as Code (e.g. Terraform).