Role: GCP Data Engineer
Work location: Anywhere in India (Remote)
Notice period: 15 days joiners
Shift timing: 3:00 PM to 12:00 AM (IST)
Required Skills & Qualifications:
- Google Cloud Platform (GCP) services/tools:
- BigQuery: Data warehousing, analytical queries
- Cloud Dataflow (Apache Beam): Batch and streaming ETL
- Cloud Run
- Cloud Function
- Cloud Storage
- Pub/Sub
- Cloud Composer (Apache Airflow): Workflow orchestration
- Programming Languages:
- Python
- SQL
- Data Sources & Extraction Techniques:
- Familiarity with Structured ERP Solutions (e.g., SAP, Oracle EBS, Salesforce)
- File Formats: Understanding different file types and how to parse them (CSV, JSON, XML, Parquet, ORC, Avro)
- Data Lake Concepts: How to store and manage raw, unstructured data efficiently (e.g., HDFS, S3, Cloud Storage)
- API Development & Consumption:
- Understanding how to consume data from REST APIs, SOAP services
- Ability to build internal APIs for data access if needed
- Knowledge of SAP SLT and configuring replications
- Security & Governance:
- IAM (Identity and Access Management): Best practices for least privilege, service accounts, and role-based access control for data pipelines
- Data Masking/Anonymization: Techniques for handling sensitive data