Be responsible for production support & release management for application assigned - SRE C1 - Application Performance Management : APM(Dynatrace).
Should ensure application reliability, proactive monitoring, and incident resolution, while supporting enterprise observability and performance goals.
Should possess excellent troubleshooting and analytical skills.
Key Responsibilities
APM Tool Management
Configure, maintain, and optimize APM Tools (Dynatrace, Appdyanmics, Datadog, New Relic, Splunk, IBM Instana, Zoho Manage Engine etc.).
Ensure end-to-end visibility of application health and performance.
Managing Agent deployment, configure monitoring policy on Tenant settings.
Perform platform upgrade, configure full stack/real user/log monitoring.
Managing inventory of servers.
Monitoring & Incident Response
Establish KPIs and SLAs for application performance.
Monitor dashboards, detect anomalies, and lead incident triage.
Collaborate with DevOps and engineering teams for root cause analysis.
Performance Optimization
Identify recurring performance bottlenecks and propose improvements.
Support scalability and resilience initiatives across applications.
SRE Practices
Apply SRE principles such as error budgets, SLIs, and SLOs.
Contribute to automation of monitoring, alerting, and remediation workflows.
Other Key Responsibilities
Provide regular updates on application health and incidents.
Support reporting to senior managers and business stakeholders.
Coordinate with DevOps, infrastructure, security, DBA & application teams
Participate in release management and deployment activities
Prepare RCA, SOP, KT, and operational documentation
Support audits, compliance, and governance requirements
Track resolution of all open issues, be part of the solutioning team for war room / discussions
Manage audit queries related to Production environment
Mandatory Skills Required
Must have 2+ years experience in define and implement APM tool for critical applications
Lead deployment and integration of any of the industry lead APM tools like Dynatrace, Appdyanmics, Datadog, New Relic, Splunk, IBM Instana, Zoho ManageEngine etc.
Establish KPIs, SLAs, and proactive monitoring frameworks to ensure application reliability
Knowledge of distributed systems, APIs, and cloud-native architectures will be added advantage.
Ability to work in rotational shifts/on-call support
Worked in NBFC/BFSI Sector is mandatory.
Qualifications
Bachelor's or Master's degree in Computer Science, Information Technology, or related field.
Certifications in APM tools, ITIL, or cloud monitoring preferred.