Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : PySpark
Good to have skills : NA
Minimum 3 Year(s) Of Experience Is Required
Educational Qualification : 15 years full time education
Summary:
As a Data Engineer, a typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. This role requires building and managing data pipelines that facilitate smooth data flow across various systems. The professional ensures the integrity and quality of data while implementing processes to extract, transform, and load data efficiently. Collaboration with different teams to deploy data solutions and troubleshoot any issues is a key part of daily activities, contributing to the overall data infrastructure and operational excellence.
Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Collaborate with cross-functional teams to understand data requirements and deliver scalable solutions.
- Monitor and optimize data pipelines to ensure high performance and reliability.
- Document data processes and workflows to maintain transparency and facilitate knowledge sharing.
- Assist junior team members by providing guidance and support to enhance their technical skills.
Professional & Technical Skills:
- Must To Have Skills: Proficiency in PySpark.
- Strong experience in building and managing ETL pipelines for large-scale data processing.
- Knowledge of data storage solutions and data modeling techniques.
- Ability to troubleshoot and optimize data workflows for efficiency and scalability.
- Familiarity with distributed computing frameworks and cloud-based data platforms.
Additional Information:
- The candidate should have minimum 3 years of experience in PySpark.
- This position is based at our Bengaluru office.
- A 15 years full time education is required.
, 15 years full time education