Accolite

Data Engineer

  • Posted an hour ago

Job Description

Role Summary

The Data Engineer is the foundation of the AI CoE. You build and maintain the robust, scalable, and efficient data pipelines that feed clean, reliable data to the data scientists and ML models. You ensure that data is accessible, organized, and of high quality.

Key Responsibilities

  • Design, construct, install, and maintain large-scale data processing systems and data lakes/warehouses.
  • Build scalable and reliable ETL/ELT pipelines to ingest data from diverse sources.
  • Ensure data quality and implement data governance and security standards.
  • Collaborate with Data Scientists to understand data requirements for modeling.
  • Optimize data systems for performance and cost.
  • Work with data architecture on data modeling and schema design.

Qualifications

  • Bachelor's or Master's in Computer Science or a related field.
  • 3+ years of experience in a Data Engineer role.
  • Expert-level proficiency in SQL and experience with big data technologies (e.g., Spark, Hadoop, Kafka).
  • Strong programming skills in Python, Scala, or Java.
  • Experience with cloud data services (e.g., AWS Redshift/S3, Azure Synapse/Data Lake, GCP BigQuery).
  • Knowledge of data modeling and data warehousing concepts.

Job ID: 148089545
