Digital Impetus

GCP Data Engineer

  • Posted 8 hours ago

Job Description

Role: GCP Data Engineer

We are seeking a skilled and motivated GCP Data Engineer with strong expertise in Python, PySpark, Spark, SQL, and Google Cloud Platform (GCP) to design, build, and optimize scalable data pipelines and analytics solutions. The ideal candidate will work closely with data analysts, data scientists, and business stakeholders to deliver reliable and high-performance data platforms.

Key Responsibilities

  • Design, develop, and maintain scalable ETL/ELT pipelines using Python, PySpark, and Apache Spark
  • Build and manage data processing workflows on Google Cloud Platform (GCP)
  • Develop optimized SQL queries, stored procedures, and data models for analytics and reporting
  • Work with large-scale structured and unstructured datasets
  • Implement batch and real-time data processing solutions
  • Integrate data from multiple sources including APIs, databases, and cloud storage
  • Optimize Spark jobs for performance and cost efficiency
  • Collaborate with cross-functional teams to understand data requirements and deliver solutions
  • Ensure pipelines meet data quality, governance, security, and compliance standards
  • Monitor and troubleshoot production data pipelines and workflows
  • Participate in code reviews, testing, deployment, and documentation

Required Skills

  • Strong programming experience in Python
  • Hands-on experience with PySpark and Apache Spark
  • Strong knowledge of SQL and database concepts
  • Experience with Google Cloud Platform (GCP) services such as:
      • BigQuery
      • Cloud Storage
      • Dataproc
      • Dataflow
      • Pub/Sub
      • Composer / Airflow
  • Experience building ETL/data pipelines
  • Understanding of distributed computing concepts
  • Familiarity with data warehousing and data lake architectures
  • Knowledge of version control tools such as Git
  • Strong problem-solving and debugging skills

Preferred Qualifications

  • Experience with CI/CD pipelines and DevOps practices
  • Knowledge of Airflow orchestration
  • Familiarity with Kafka or streaming technologies
  • Exposure to Terraform or Infrastructure as Code
  • Certification in GCP is a plus
  • Experience in Agile/Scrum environments

Education & Experience

  • Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field
  • 4+ years of experience in Data Engineering or Big Data technologies

Job ID: 148888815
