Job Description
Project Role : Custom Software Engineer
Project Role Description : Develop custom software solutions to design, code, and enhance components across systems or applications. Use modern frameworks and agile practices to deliver scalable, high-performing solutions tailored to specific business needs.
Must have skills : IBM Tivoli Netcool
Good to have skills : NA
Minimum 3 Year(s) Of Experience Is Required
Educational Qualification : 15 years full time education
Summary: As a Custom Software Engineer, you will engage in the development of custom software solutions that are designed to meet specific business needs. Your typical day will involve coding, enhancing components, and collaborating with team members to ensure the delivery of scalable and high-performing solutions using modern frameworks and agile practices. You will also participate in discussions to address challenges and contribute to the overall success of the projects you are involved in. Key Responsibilities 1. CloudPak for AIOps Configuration & Integration Install, configure, and operationalize IBM CloudPak for AIOps components including: o Event Manager o Topology Manager o Runbook Automation o AI Manager / Anomaly Detection Connect CloudPak with enterprise data sources: o Metrics (Prometheus, Instana, Datadog, New Relic) o Logs (Elastic, Splunk, Loki) o Application / middleware components o Kubernetes / OpenShift cluster events Configure event normalization, enrichment, deduplication, and correlation. 2. Migration from Netcool/NOI Lead the migration from IBM Netcool Operations Insight (NOI) / OMNIbus / Impact rules to CloudPak AIOps. Extract, review, and modernize existing: o Impact policies o Event enrichment logic o Probes, gateways, and integrations Re-engineer legacy NOI logic into CloudPak-native event and correlation policies. Plan and execute a multi-phase migration with minimal disruption. 3. Topology Data Ingestion & Refinement Design and implement pipelines for ingesting topology data from: o Kubernetes/OpenShift o Cloud providers (AWS/Azure) o Databases o Network devices o Monitoring tools Refine and enrich topology using: o Metadata tagging o Relationship mapping o Normalization and deduplication Build automation to maintain reliable and up-to-date service topology. 4. AIOps Automation & Self Service Enablement Develop runbook automations, workflows, and playbooks to accelerate: o Incident triage o Root cause analysis o Automated remediation Build templates and frameworks that allow business & application teams to: o Create their own runbooks o Configure event rules o Define custom insights o Onboard new services to AIOps Conduct workshops and training to drive adoption of AIOps capabilities. 5. Operational Excellence Build dashboards, alerts, KPIs, and monitoring around AIOps pipelines. Ensure reliability, high availability, and performance of the AIOps platform. Produce architecture diagrams, runbooks, and technical documentation. Required Skills & Experience 5+ years in Observability, Monitoring, AIOps, or SRE engineering roles. Proven hands-on experience with IBM CloudPak for AIOps 3.x or 4.x, including deployment, upgrades, and configuration. Strong background in: o Netcool/NOI, OMNIbus, Impact policies, event rules o Event processing, normalization, enrichment, and gateways o Topology discovery and CMDB integrations Solid understanding of: o Kubernetes / OpenShift o Prometheus/Grafana o Splunk, Elastic, or similar logging tools o ITSM systems (ServiceNow, Remedy) Experience creating automated runbooks and workflow automations. Familiarity with AI/ML-driven anomaly detection and event correlation concepts. Deploy, configure, and maintain IBM Cloud Pak for Watson AIOps on Red Hat OpenShift. Integrate logs, metrics, events, and ticketing systems (e.g., Splunk, Instana, ServiceNow, PagerDuty). Train and optimize AI models for anomaly detection, event grouping, change risk, and ticket similarity. Correlate events into actionable incidents and perform root cause analysis using the AIOps console and ChatOps. Develop automation policies and runbooks (Ansible) to streamline incident remediation. Build and manage real-time topology maps for service dependency visualization. Provide L2/L3 support, resolving installation issues, pod failures, and access problems. Strong understanding of AIOps, machine learning, and NLP concepts. Hands-on experience with Kubernetes/OpenShift and containerized platforms. Experience with APIs, gRPC integrations, and observability/ITSM tools (Instana, ServiceNow). Solid troubleshooting, automation, and communication skills. Language/Technology YAML, Ansible, Python, and REST APIs Good to have Hands on experience on Netcool suit Linux AWS Nice-to-Have Skills IBM certifications in AIOps or Netcool. Python or Bash scripting for integration and automation tasks. Experience with ServiceNow integrations, CMDB alignment, and discovery. Exposure to observability platforms such as Instana, Dynatrace, or Datadog. Understanding of cloud architecture (AWS preferred). Experience with Kafka, webhooks, and event streaming., 15 years full time education