
Search by job, company or skills

********************************
Note
Immediate Joiner/Serving Notice Candidate only
4 days office/1 day remote @ Chennai
Must 7+ years of exp in Data Engineering
********************************
Position Description
GCP Data Engineer to build cloud analytics platforms using lean Agile practices. You will design, modernize, and land data on the Google Cloud Platform (GCP) to support Enabling Platforms and Analytics. Experience with large-scale data warehouses, data lakes, and decentralized data architectures (like Data Mesh) on GCP is required. You will also design foundational data infrastructure to power enterprise AI, Machine Learning (ML), and Generative AI initiatives. We need candidates with a broad set of technology skills who can design optimal solutions using GCP and third-party technologies. You will: Implement Data Mesh principles (data as a product, data contracts, self-serve platforms). Design, build, and maintain high-performance dimensional data models (Star/Snowflake Schema) supporting BI, analytics, and AI/ML. Develop analytics data products using streaming and batch ingestion in GCP. Collaborate with Data Science/AI teams to architect MLOps pipelines and integrate GenAI (LLMs, Vector Search). Act as SME in Data Engineering, AI integrations, and GCP services, advocating for technical excellence. Work in collaborative, agile environments (including pairing/mobbing) with cross-functional engineers, product owners, and data stewards. Primary Skills Required: Experience building domain-specific, secure data products within a Data Mesh framework. Expertise in dimensional modeling (Star/Snowflake Schema) and optimizing schemas for BigQuery. Experience deploying automated, end-to-end data pipelines (concept to operations) with automated data lineage, quality frameworks, and observability. Ability to evaluate, prototype (PoC), and productionalize GCP tools for ingestion, integration, and reporting. Skill in translating business problems into technical data requirements alongside product management and architects. Experience with data governance, cataloging, and access control (e.g., GCP Dataplex, GCP Data Catalog) in decentralized environments. Experience Required: In-depth understanding of GCP architecture. Hands of experience in data engineering and analytics application development. Hands-on experience with GCP services: BigQuery, Dataflow, Dataform, Astronomer, Dataproc, Cloud Composer/Airflow, Cloud SQL, Compute Engine, Cloud Functions, Cloud Run, Artifact Registry, Cloud Build, Pub/Sub, and Dataplex, plus Apache Kafka. 5+ years of advanced SQL development and query optimization. 2+ years of development experience in Java or Python, and Apache Beam. 2+ years of building Tekton or similar CI/CD pipelines. Strong expertise in ETL/ELT pipelines, data cleaning, validation, and processing architecture.
Skills Required:
Big Query,, GCP, Cloud Composer, Google Cloud Platform - Biq Query, Data Flow, Dataproc, Data Fusion, TERRAFORM, Tekton,Cloud SQL, AIRFLOW, POSTGRES, Airflow PySpark, Python, API
Skills Preferred:
Vertex, AI/ML
Experience Required:
Engineer 3 Exp: 7+ years Data Engineering work experience
Additional Information :
• Experience building and productionalizing Machine Learning and Generative AI solutions using Vertex AI (including Gemini, Model Garden, Vector Search, and Vertex AI Pipelines), TensorFlow, BigQueryML, and AutoML.
• Proficient in Machine Learning model architecture, data pipeline interaction, and metrics interpretation. This includes designing, deploying, and monitoring end-to-end MLOps pipelines, managing feature stores, and orchestrating pipelines for Generative AI (including Retrieval-Augmented Generation (RAG) and LLM orchestration).
• Experience in building solution architecture, provisioning infrastructure (IaC via Terraform), and securing reliable, compliant, data-centric services and applications in GCP.
• Experience implementing data quality, lineage, and governance policies using Dataplex or other in a self-serve platform environment.
• Experience with development ecosystems such as Git, Jenkins, and CI/CD.
• Advanced experience with Analytics Engineering tools like DBT (Data Build Tool) or Dataform for modeling and transforming data within BigQuery.
• Experience working with Agile and Lean methodologies.
• Team player and attention to detail.
• Performance tuning experience (query optimization, partitioning, clustering, and cost management in BigQuery).
Education Required:
• Bachelor's degree in computer science or related scientific field.
• IT or related associated topics: data architect, data center, data integrity, data manager, data management, data scientist, data warehousing, SQL, AI and ML.
Job ID: 149092751
Skills:
Google Cloud Platform, Pyspark, Dataproc, Terraform, Postgres, Api, Python, Big Query, Airflow, Tekton, AIRFLOW, data fusion, Cloud SQL, Data Flow
Skills:
Spark SQL, BigQuery, Data Warehousing, Redshift, Sql, Gcp, Terraform, Azure Data Lake, Python, Aws S3, Airflow, Synapse, dbt, GCS
Skills:
BigQuery, Hadoop, Pyspark, Data Warehousing, Bash, Sparksql, Dataproc, Sql, Cloud Storage, Hive, Gcp, Iam, DataFlow, Python, Airflow, DataFrame, Pub Sub
Skills:
Cloud Storage, Pyspark, Dataproc, Python, Terraform, Google Cloud Platform, Tekton, Airflow, Cloud SQL, DataFusion, Data Flow, DataPlex, Confluent Kafka
Skills:
Apache Airflow, BigQuery, Terraform, Pyspark, PostgreSQL, Dataproc, DataFlow, Rest Apis, Python, data fusion, Cloud SQL
We don’t charge any money for job offers