Job Role: Site Reliability Engineer- Splunk
Location: Delhi,Noida
Experience: 4- 8 Years
Responsibilities;
- Familiarity with monitoring and logging tool's such as (Splunk, datadog etc application insights, and Prometheus/Grafana.
- Should have ability to manage incidents effectively, troubleshoot issues swiftly, and perform root cause analysis to prevent future incidents.
- Deep understanding of systems engineering, including operating systems, networking, and cloud infrastructure. Proficiency in automation tools is crucial for maintaining system reliability at scale.
- Should be able to communicate effectively with team members and stakeholders, ensuring alignment, inspiring and motivating them to embrace new mindsets, cultures, and SRE working practices. This skill is crucial for driving meaningful change and fostering a collaborative environment where innovative ideas can thrive.
- Should be able to familiar with cloud platforms and services.
- Fluency in using Git and modern version control workflows.
- Excellent communication skills-written and verbal-are essential for effective collaboration across global teams
- Nice to have proficiency in scripting/programming with (Python, Ruby, Shell Scripting, Golang etc).