Cognite operates at the forefront of
industrial digitalization, building AI and data solutions that solve some of the world's hardest, highest-impact problems. With unmatched industrial heritage and a comprehensive suite of AI capabilities, including low-code AI agents, Cognite accelerates the digital transformation to drive operational improvements.
Our moonshot is bold: unlock $100B in customer value by 2035 and redefine how global industry works.
What Cognite is Relentless to achieve
We thrive in challenges. We challenge assumptions. We execute with speed and ownership. If you view obstacles as signals to step forward - not step back - you'll feel at home here. Join us in this venture where AI and data meet ingenuity, and together, we forge the path to a smarter, more connected industrial future.
Role Overview
We are seeking an experienced L3 Support Engineer with strong data engineering and Python expertise to own complex production issues, perform deep root cause analysis, and work closely with engineering and DevOps teams to improve system stability, data reliability, and operational excellence.
Key Responsibilities
- Provide Level 3 support for complex production issues related to data pipelines and applications.
- Troubleshoot and resolve failures in data ingestion, transformation, and processing pipelines.
- Debug and optimize Python-based data services and jobs.
- Perform root cause analysis (RCA) for recurring incidents and implement permanent fixes.
- Support and improve CI/CD pipelines for data and application deployments.
- Collaborate with DevOps teams on deployments, rollbacks, and environment issues.
- Analyze application, data, and infrastructure logs to identify failure points.
- Monitor batch and streaming jobs; handle data quality and reconciliation issues.
- Provide guidance to L2 teams and contribute to operational runbooks.
- Participate in incident response, postmortems, and on-call rotations.
Required Skills & Qualifications
- 5+ years of experience in Production Support / Application Support (L3) or Data Engineering with a proven track record of debugging complex data platforms.
- Strong hands-on experience with Python for data processing and debugging.
- Solid understanding of data engineering concepts (ETL/ELT, batch vs streaming, data validation).
- Strong knowledge of CI/CD concepts and deployment workflows.
- Basic DevOps skills (Linux, containers, environment configuration).
- Experience with version control systems (Git).
- Strong understanding of REST APIs and service integrations.
- Excellent troubleshooting, analytical, and communication skills.
Nice to Have
- Experience with cloud platforms (AWS, Azure, or GCP).
- Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK).
- Exposure to tools such as Airflow, Spark, Kafka, or similar.
- Exposure to Kubernetes or container orchestration.
- Experience with data warehouses (Snowflake, BigQuery, Redshift).
- ITIL or incident management experience.
What We Offer
- Ownership of critical production systems.
- Opportunity to influence system reliability and data quality.
Equal Opportunity
Cognite is committed to creating a diverse and inclusive environment at work and is proud to be an equal opportunity employer. All qualified applicants will receive the same level of consideration for employment.