Design & Implement Monitoring Solutions : Architect, configure, and deploy Zabbix standalone and failover solutions for enterprise infrastructure monitoring.
OS & Application Monitoring : Develop monitoring strategies for diverse operating systems (Windows, Linux, Solaris, AIX) and enterprise applications.
Event Management & ITSM Integration : Integrate Zabbix with event management systems (e.g., Netcool) and ITSM platforms (e.g., ServiceNow) for automated incident handling.
Automation & Scripting : Leverage automation frameworks and scripting (Shell, Python, APIs) to enhance monitoring capabilities and reduce manual intervention.
Scalability & Performance Optimization : Optimize monitoring infrastructure to scale across large environments, ensuring high availability and performance.
Customization & Dashboards : Develop custom Zabbix templates, triggers, and dashboards to provide meaningful insights into IT infrastructure health.
What you bring:
Expertise in Zabbix : At least 8 to 12 years of proven experience in designing, deploying, and managing Zabbix monitoring solutions, including failover configurations.
Monitoring & Integration Experience : Hands-on experience integrating Zabbix with ITSM tools, event management systems, and automation frameworks.
Scripting & Programming Skills : Proficiency in Shell scripting, Python, and REST API for automation and integrations.
Infrastructure Knowledge : Strong understanding of IT infrastructure, including servers, networks, virtualization, cloud platforms, and databases.
Performance Tuning & Troubleshooting : Ability to optimize Zabbix performance, troubleshoot monitoring issues, and ensure efficient alerting mechanisms.
Analytical & Problem-Solving Skills : Ability to analyze complex environments, propose monitoring solutions, and resolve issues proactively.
Added bonus if you have:
Knowledge of databases (MySQL, PostgreSQL) for Zabbix backend optimization.
Exposure to financial services technologies and domain knowledge is a plus.