Search by job, company or skills

A

Data Engineer

Save
new job description bg glownew job description bg glow
  • Posted 22 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Project Role : Data Engineer

Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.

Must have skills : PySpark

Good to have skills : NA

Minimum 5 Year(s) Of Experience Is Required

Educational Qualification : 15 years full time education

Summary

As a Data Engineer, your typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. You will be responsible for creating efficient data pipelines that facilitate smooth data flow and ensure the integrity and quality of data throughout its lifecycle. Your role includes implementing processes to extract, transform, and load data across various systems, enabling seamless data migration and deployment. You will work closely with different teams to support data-driven decision-making and optimize data infrastructure to meet evolving business needs.

Roles & Responsibilities

  • Expected to be an SME, collaborate and manage the team to perform.
  • Responsible for team decisions.
  • Engage with multiple teams and contribute on key decisions.
  • Provide solutions to problems for their immediate team and across multiple teams.
  • Lead efforts to optimize data workflows and improve system performance.
  • Mentor junior team members to enhance their technical skills and project contributions.
  • Coordinate cross-functional collaboration to align data engineering efforts with organizational goals.

Professional & Technical Skills

  • Must To Have Skills: Proficiency in PySpark.
  • Experience in building scalable data pipelines and ETL frameworks.
  • Strong knowledge of distributed computing and big data processing concepts.
  • Familiarity with cloud-based data platforms and storage solutions.
  • Ability to troubleshoot and optimize complex data workflows.
  • Understanding of data quality assurance and validation techniques.

Additional Information

  • The candidate should have minimum 5 years of experience in PySpark.
  • This position is based at our Gurugram office.
  • A 15 years full time education is required.

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 148569725

Similar Jobs

Gurugram, Gurugram, India

Skills:

SparksqlData Warehousing ConceptsSqlHiveBig Data TechnologiesBashPythonHadoopPysparkDataFrameAirflow

Gurugram, India

Skills:

SnaplogicBoomiWorkatoSqlAzure Data FactoryData GovernanceData PrivacyPythonData ProfilingData AnalysisFabric Azure cloud platformMDM implementationsData MappingETL processes

Gurugram

Skills:

data engineering AWSSparkPysparkPythonSqlAdvanced SqlEtlData WarehousingData ModelingData Lakes

Gurugram, Gurugram, India

Skills:

Data ModelingPysparkData ExtractionSqlAzure SynapseData QualityAzure MLAzure Data FactoryDatabricksData SecurityAzure DevOpsEtlFiveTranGit-based workflowGCP Cloud ComposerVertex AIGCP BigQueryGCP Cloud RunGCP DLP

Gurugram, Gurugram, India

Skills:

Data SecurityData GovernanceMicrosoft PurviewETL processesData quality managementCompliance standardsData pipelinesCloud-based data platforms