
Position: Monitoring Tools Developer
Location: OFFSHORE / INDIA
Working Hours: M-F; must closely align with US hours
Start Date: ASAP – early to mid-May
Required Skills & Experience:
Plusses:
Job Description:
Insight Global is looking for a skilled Monitoring Tools Developer to design, build, and maintain enterprise‑grade monitoring and observability solutions. This role serves as the primary tools owner responsible for dashboards, alerts, data integrations, and automation across the monitoring stack. The ideal candidate brings strong hands‑on experience with Grafana, Zabbix, HyperDX, JavaScript, and API development, and is comfortable adapting to new tools as the monitoring and observability ecosystem evolves.

Design, build, and maintain Grafana dashboards for infrastructure, application, and network monitoring, including advanced visualizations, variables, and alerting.
Develop and manage Zabbix monitoring configurations, including templates, items, triggers, discovery rules, preprocessing logic, and notification workflows.
Deploy, configure, and enhance HyperDX for application performance monitoring, tracing, log analysis, and alerting.
Write JavaScript for Zabbix preprocessing, Grafana custom panels, data transformations, and automation logic.
Build and maintain RESTful APIs, webhooks, and integrations to ingest, expose, and automate monitoring data across internal and third‑party systems.
Integrate monitoring tools with platforms such as ServiceNow, ITSM tools, CMDBs, and Dynamics 365 for alert‑driven incident management.
Automate monitoring operations using Node.js, Python, PowerShell, or Bash, and maintain configuration as code (JSON/YAML).
Tune performance and reliability of monitoring platforms, including database optimization, proxy architecture, scaling, and high availability.
Create runbooks, documentation, and operational procedures, and provide mentorship and knowledge transfer to operations teams.
Participate in on‑call rotation to support monitoring platform availability and incident response.
Job ID: 146078985