c5i

Data Engineer

  • Posted 3 hours ago

Job Description

We are looking for a skilled Data Engineer with strong expertise in Python, PySpark and Scala to build and manage scalable data pipelines and support data processing across large datasets.

Key Responsibilities:

  • Design, develop, and maintain scalable data pipelines using PySpark
  • Work with the Hadoop ecosystem for distributed data processing and storage
  • Develop and optimize Python-based data workflows
  • Schedule, monitor, and manage workflows using Airflow
  • Collaborate with cross-functional teams to ensure data availability and reliability

Must-have Skills:

  • Strong hands-on experience with PySpark
  • Good knowledge of the Hadoop ecosystem (HDFS, Hive, etc.)
  • Proficiency in Python programming
  • Experience with Apache Airflow for workflow orchestration
  • Understanding of data processing, ETL concepts, and large-scale data systems

Job ID: 147487491

Similar Jobs

Bengaluru, India

Skills:

Machine Learning, Python, Data Analysis, R, Statistical Modeling

Bengaluru, India

Skills:

Power BI, Power Automate, PySpark, Data Warehousing, SQL, Pandas, DAX, Python, ETL ELT processes, Power Apps, Delta tables, Lakehouse architecture, Microsoft Fabric, Medallion architecture

Bengaluru, India

Skills:

Pytest, Apache Spark, Python, Apache Iceberg, data engineering basics, ETL pipelines

Bengaluru, India

Skills:

Elasticsearch, Django, API Development, RESTful Services, SQL, Python, FAISS, vector databases, Milvus, search stacks agents

Bengaluru, India

Skills:

PySpark, Scala, Azure Databricks, API Integration, Dimensional Modeling, SQL, ELT, Git, Query Tuning, Azure Synapse Analytics, Spark, Star Schema, Python, Azure DevOps, ETL, Snowflake schema, Databricks jobs, Azure SQL Database, GitHub Actions, Caching strategies, Partitioning, Delta Lake, ADF pipelines