Search by job, company or skills

WNS Assistance

Data Engineer

Save
new job description bg glownew job description bg glow
  • Posted 4 days ago
  • Be among the first 30 applicants
Early Applicant

Job Description

About the Role

We are seeking a highly skilled Data Engineer with expertise in Google Cloud Platform (GCP), Generative AI, Python, SQL, ETL, Dataproc, and Airflow. The ideal candidate will play a key role in designing and implementing scalable cloud-based data platforms, modern ETL pipelines, and AI-enabled data solutions.

Key Responsibilities

  • Design, build, and maintain scalable ETL/ELT pipelines using Python and SQL.
  • Develop cloud-native data engineering solutions on GCP.
  • Implement batch and real-time data processing pipelines using Dataproc and Dataflow.
  • Build and manage workflow orchestration using Apache Airflow / Cloud Composer.
  • Work on large-scale structured and unstructured datasets for analytics and AI use cases.
  • Integrate Generative AI and LLM-based capabilities into enterprise data platforms.
  • Optimize BigQuery workloads and improve data processing efficiency.
  • Collaborate with data scientists, analysts, architects, and business teams to deliver data-driven solutions.
  • Ensure data quality, governance, scalability, reliability, and security.
  • Participate in architecture reviews and provide technical leadership and mentorship.

Required Skills

  • Strong experience in Google Cloud Platform (GCP) services:
  • BigQuery
  • Cloud Storage
  • Dataflow
  • Pub/Sub
  • Dataproc
  • Cloud Composer / Apache Airflow
  • Hands-on experience in Generative AI / LLM solutions.
  • Strong programming expertise in Python.
  • Advanced SQL development and query optimization skills.
  • Extensive experience in ETL/ELT pipeline development.
  • Hands-on experience with workflow orchestration using Apache Airflow.
  • Experience in distributed data processing using Dataproc / Spark / PySpark.
  • Good understanding of data warehousing concepts and cloud architecture.
  • Experience with Git, CI/CD pipelines, and Agile methodologies.

Preferred Skills

  • Experience with Vertex AI, LangChain, RAG architecture, vector databases, or AI orchestration frameworks.
  • Exposure to streaming technologies such as Kafka or Pub/Sub.
  • Knowledge of Infrastructure as Code tools such as Terraform.
  • Familiarity with MLOps concepts and ML pipeline integration.
  • GCP certifications are a plus.

Educational Qualification

  • Bachelor's or Master's degree in Computer Science, Engineering, Information Technology, or related discipline.

Desired Candidate Profile

  • Excellent analytical and problem-solving skills.
  • Strong communication and stakeholder management abilities.
  • Ability to work in a fast-paced, collaborative environment.
  • Passion for cloud technologies, modern data platforms, and AI innovation.

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 148451675

Similar Jobs

Noida, India

Skills:

HiveSqlAWSHDFS

Noida, India

Skills:

stream processing snowflake PysparkApache SparkApache AirflowApache KafkaDatabricksAdvanced SqlPythonBig Data ArchitectureETL Pipeline DevelopmentData Quality Validation

Gurugram, India

Skills:

BigQueryHadoopPysparkSpring BootDataprocSqlMapreduceJava 8Cloud StorageHiveCore JavaSparkDataFlowPub SubGCS

Noida, India

Skills:

Apache AirflowPostgreSQLData WarehousingApache SparkData ModelingDatabase Management SystemsSqlETL processespipeline optimization

Noida, India

Skills:

HadoopScalaPl SqlEmrSparksqlData ModelingDdlInformaticaBodiSSISSqlHiveOdiDatastageHiveqlSparkPythonMDXWarehousingKornShellETL pipelines