Job Summary
Senior Consultant role focusing on LogicMonitor- and Datadog-based monitoring for hybrid environments, delivering robust observability, incident reduction, and performance optimization for enterprise clients. The role collaborates closely with cross-functional teams to design, implement, and maintain scalable monitoring solutions that align with organizational goals and compliance standards.
Responsibilities
- Design and implement comprehensive LogicMonitor and Datadog monitoring strategies that align with enterprise observability objectives and service-level targets
- Configure advanced monitoring dashboards in LogicMonitor to provide meaningful visibility into infrastructure health, application performance, and capacity trends
- Develop tailored Datadog metric collections, monitors, and visualization assets that highlight critical service indicators and enable proactive incident detection
- Optimize alert rules and thresholds in LogicMonitor and Datadog to reduce noise, prioritize critical events, and improve response efficiency for support teams
- Collaborate with application, infrastructure, and security teams to identify key telemetry requirements and translate them into actionable monitoring configurations
- Conduct detailed root cause analyses using LogicMonitor and Datadog data to identify recurring issues and recommend sustainable remediation actions
- Create and maintain reusable monitoring templates, runbooks, and standard operating procedures that support consistent deployment across hybrid environments
- Integrate LogicMonitor and Datadog with ticketing and collaboration tools to streamline incident workflows and enhance transparency for stakeholders
- Perform regular health checks, tuning exercises, and platform upgrades for LogicMonitor and Datadog to ensure resilience, availability, and optimal performance
- Provide expert guidance to internal teams on observability best practices and help shape monitoring standards that support reliability and scalability
- Document monitoring architectures, configurations, and operational guidelines clearly and comprehensively to support onboarding and knowledge sharing
- Coordinate with client partners to gather requirements, present monitoring insights, and translate feedback into iterative improvements for observability solutions
- Measure and report on the impact of monitoring enhancements by tracking incident reduction, mean time to resolution, and performance improvements across services
Qualifications
- Possess eight to twelve years of professional experience in enterprise monitoring and observability, with a strong focus on LogicMonitor- and Datadog-based solutions
- Demonstrate advanced proficiency in configuring LogicMonitor collectors, data sources, dashboards, and alerting components for complex hybrid environments
- Demonstrate strong expertise in Datadog, including custom metrics creation, monitor configuration, log analytics, and visualization of service performance
- Exhibit a solid understanding of infrastructure components such as servers, networks, cloud platforms, and containers to design effective monitoring coverage
- Demonstrate the ability to interpret performance data and logs to identify trends, capacity risks, and optimization opportunities that support business continuity
- Display strong skills in documenting technical solutions, creating runbooks, and communicating complex topics clearly and concisely to diverse audiences
- Demonstrate experience working in hybrid work models and collaborating with distributed teams using modern communication and project management tools
- Exhibit familiarity with incident management and problem management practices to align monitoring configurations with operational processes and governance
- Demonstrate a strong analytical mindset with a focus on reducing incidents, improving service uptime, and enhancing customer experience through observability improvements
- Display knowledge of scripting or automation tools that support the integration and configuration of LogicMonitor and Datadog within enterprise environments
- Demonstrate a commitment to continuous learning and staying current with emerging monitoring practices, tools, and industry trends relevant to observability