
CheckMK Monitoring Engineer
Experience: 5-8 years
Location: PAN India
Notice Period: Immediate to 15 days
Job Summary:
We are looking for a skilled CheckMK Monitoring Engineer to design, implement, operate, and optimize enterprise-wide monitoring solutions using CheckMK. The role focuses on proactive monitoring of infrastructure, cloud platforms, and business-critical applications, ensuring high availability, performance, and compliance with SLA/KPI targets in a global delivery environment.
The candidate will work closely with Infrastructure, Application, Cloud, and Service Management teams to improve observability, reduce incidents, and support continuous service improvement.
Key Responsibilities:
Monitoring Design and Implementation:
Design, deploy, and maintain CheckMK monitoring solutions for:
Servers (Windows, Linux, Unix)
Network devices (routers, switches, firewalls)
Virtualization platforms (VMware, Hyper-V)
Cloud platforms (Azure)
Applications, middleware, and databases
Configure hosts, services, checks, rulesets, thresholds, and alerts in CheckMK
Integrate agent-based and agentless monitoring approaches
Operations Support:
Provide L2/L3 support for the CheckMK monitoring platform
Troubleshoot monitoring gaps, false alerts, and performance issues
Ensure high availability and scalability of the monitoring platform
Perform regular health checks, upgrades, and patching of CheckMK
Application and Service Monitoring:
Work with application teams to define business-relevant metrics and SLIs
Implement monitoring for application components such as:
Web servers, app servers, and databases
APIs, batch jobs, queues, and services
Enable end-to-end service visibility and dependency mapping
Automation and Integration:
Automate monitoring onboarding using:
CheckMK APIs
Scripts (Python, Bash, PowerShell)
Configuration management tools (Ansible, Terraform), preferred
Integrate CheckMK with:
ITSM tools (ServiceNow)
Notification tools (Email, Teams, Slack, PagerDuty, etc.)
Dashboards and reporting tools (Grafana, optional)
Reporting and Governance:
Create dashboards, views, and reports for:
SLA/KPI compliance
Availability and performance trends
Support audits, operational reviews, and service governance requirements
Maintain monitoring documentation, SOPs, and runbooks
Technical Skills Mandatory:
Strong hands-on experience with CheckMK (Enterprise or Raw Edition)
Solid understanding of:
Infrastructure monitoring concepts
Network protocols (SNMP, TCP/IP, HTTP)
Linux and Windows OS administration
Experience monitoring:
Servers, networks, storage, and virtualization
Alerting, event correlation, and root cause analysis skills
Technical Skills Good to Have:
Cloud monitoring (AWS, Azure, GCP)
Scripting (Python, Shell, PowerShell)
Experience with ITIL-based operations
Knowledge of other monitoring tools (Nagios, Zabbix, SCOM, Dynatrace, AppDynamics) is a plus
Basic understanding of observability concepts (metrics, logs, traces)
Job ID: 147245781