Key Skills: Real-Time Customer Data Platform (RT-CDP), Hadoop, Spark, Hive, EMR, Azure Databricks, Snowflake, GCP, Python, PySpark, Data Pipeline Architecture, Feature Engineering, Data Modeling, SQL, Azure SQL, Synapse Analytics
Roles & Responsibilities:
- Design, develop, and maintain scalable data architecture for data capture, storage, processing, serving, and querying.
- Build and manage data products, including personalization engines, recommendation platforms, customer segmentation, and analytics solutions.
- Collaborate with cross-functional teams across Flights, Hotels, Holidays, and Ground business units to support real-time streaming datasets.
- Develop, optimize, and maintain high-performance data pipelines and workflows for machine learning applications.
- Lead architectural decisions, providing technical direction and guidance for the data engineering team.
- Monitor, tune, and optimize Spark workloads and distributed data processing platforms to ensure responsiveness and efficiency.
- Implement best practices in data modeling, ETL, feature engineering, and pipeline automation.
- Ensure data quality, reliability, and governance across data platforms.
- Stay current with emerging technologies, frameworks, and industry best practices in data engineering.
- Mentor junior engineers and foster a culture of technical excellence within the team.
Experience Required:
- 6 to 12 years of experience in data engineering, with strong expertise in real-time customer data platforms, big data frameworks, and cloud-based analytics.
- Hands-on experience with Hadoop, Spark, Hive, EMR, Azure Databricks, Snowflake, and GCP.
- Experience building machine learning data pipelines and supporting ML/AI applications.
- Strong SQL, Python, PySpark, and data modeling skills.
- Proven ability to lead technical architecture and provide strategic technology direction.
Education: B.Tech