Job Description
- Strong experience with Google BigQuery
- Advanced proficiency in SQL
- Hands-on experience with GCP services: BigQuery, Cloud Storage, Dataflow / Dataproc, Cloud Composer (Airflow)
- Experience in ETL/ELT pipeline development
- Good understanding of data warehousing concepts
- Familiarity with Python or PySpark for data processing
- Experience in performance tuning and query optimization
- Knowledge of data partitioning and clustering in BigQuery

- Design, develop, and optimize BigQuery-based data warehouses and data marts
- Build and maintain ETL/ELT pipelines using GCP services
- Write efficient, complex SQL queries for data transformation and analysis
- Work with Cloud Composer (Airflow), Dataflow, or Dataproc for pipeline orchestration
- Perform data modelling (star/snowflake schema) for analytics use cases
- Optimize query performance and manage BigQuery cost efficiency
- Integrate data from various sources (APIs, Cloud Storage, databases, streaming sources)
- Implement data validation, quality checks, and monitoring mechanisms
- Collaborate with data analysts, data scientists, and business stakeholders
- Ensure security and governance best practices within GCP

- Experience with CI/CD tools (Cloud Build, Jenkins, GitHub Actions)
- Knowledge of data visualization tools (Looker, Tableau, Power BI)
- Familiarity with streaming data pipelines (Pub/Sub)
- Experience with Infrastructure as Code (Terraform)
- Understanding of data governance and access control in GCP
- Bachelor's degree in Computer Science, IT, or related field
- GCP certifications (e.g., Professional Data Engineer) are a plus