Data Processing Principal Engineer

DataPelago

Mumbai, India

10-15 Years

This job is no longer accepting applications

Posted a month ago

Job Description

Lead architecture, design, and implementation of next-generation data processing engine built to exploit accelerated computing.

The developer will have the following specific responsibilities in achieving these objectives.

Responsibilities

Work with cross-functional teams to advance the architecture at the core of Datapelago's parallel and distributed execution engine based on accelerated computing
Lead the execution engine team in the design, implementation, and rollout of enterprise grade, highly reliable acceleration engine for data processing
Design, implement, test, and maintain major components of the execution engine
Analyze and identify areas for differentiation and improvements in the execution engine
Collaborate with other teams and team members to drive code reviews, design reviews,

performance and reliability reviews, and development process and to drive continuous improvement in all of these

Qualifications

B.S. EE/CS or equivalent with 15+ years experience or MS with 10+ years experience
10+ years of experience developing core components of an enterprise-grade database or analytics execution engine serving large-scale data processing workloads. Experience developing for platforms such as Apache Spark, Gluten, Velox, DataFusion preferred.
Experienced developing high-performance parallel implementations of data processing operators and functions, such as joins, aggregations, sorts
Experience leading 10+ teams in designing, developing, and releasing high-performance data processing engines for large production deployments
Strong programming ability in C, C++, and Rust
Strong development experience on Linux platforms