Search by job, company or skills

FIS

Site Reliability Engineer Senior (AzureCloud, Splunk)

6-10 Years
Save
new job description bg glownew job description bg glownew job description bg svg
  • Posted 6 hours ago
  • Be among the first 40 applicants
Early Applicant
Quick Apply

Job Description

What you will be doing:

Build software solutions and systems to manage platform infrastructure and applications.

Partner with development teams to improve services through rigorous testing and release procedures.

Participate in system design consulting, platform management, and capacity planning.

Improve reliability, quality, and time-to-market of our suite of software solutions.

Build monitoring that alerts on symptoms rather than on outages.

Run the production environment by monitoring availability and taking a holistic view of system health.

Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and

innovating to continually improve.

Provide primary operational support and engineering for multiple large, distributed software applications.

Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding.

Create sustainable systems and services through automation and uplifts.

Balance feature development speed and reliability with well-defined service level objectives.

Partner with stakeholders to design and deliver a reliable, scalable, secure, and performant platform.

Stay current on technical trends to suggest innovative tools and approaches to problems.

A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.

Identify and resolve problems promptly to meet and improve service levels and standards.

What you will need:

  • Bachelor s degree or the equivalent combination of education, training, or work experience.
  • 6+ yrs experience in building and contributing to scalability and monitoring of applications
  • Proficiency in Python
  • Good knowledge of SRE concepts especially towards application performance monitoring and building alerts and dashboards catering to SLIs and SLOs
  • Experience in Scripting in PowerShell, Bash
  • Proficiency with monitoring tools like Dynatrace, Prometheus, Grafana, Splunk, Nobl9
  • Knowledge of CI/CD tools such as Jenkins, GitLab CI, or Azure DevOps
  • Ability to work in an agile development environment where developers and testing personnel work closely together to ensure requirements are met or exceeded.
  • Ability to demonstrate interpersonal and teambuilding skills working with technical and non-technical individuals .

Added bonus if you have:

  • Experience of Agile Scrum / SAFe will be an added advantage
  • Fluent in English
  • Excellent communicator - ability to discuss technical and commercial solutions to internal and external parties and adapt depending on the technical or business focus of the discussion.
  • Organized approach - manage and adapt priorities according to client and internal requirements.
  • Self-starter but team mindset - work autonomously and as part of a global team

More Info

Job Type:
Industry:
Function:
Employment Type:
Open to candidates from:
Indian

About Company

Job ID: 108705965

Similar Jobs