Job Summary:
We are seeking a Senior Full-Stack Data Engineer to design, develop, and manage scalable data pipelines, storage, and transformation solutions. The role involves working on cloud-based data platforms, data warehouse / lakehouse design, workflow automation, and data integration to support business intelligence and analytics. The ideal candidate will have strong expertise in Snowflake, DBT, and modern data technologies with a focus on performance, security (especially data segregation policies), governance, and automation.
Must-Have Skills (Mandatory):
- Proven expertise in Snowflake and DBT (hands-on experience required).
- Experience with Snaplogic, ETL/ELT pipelines, APIs, and data integration.
- Strong knowledge of data modeling, architecture, and pipeline optimization.
- Expertise in cloud platforms (AWS/Azure/GCP) and automation (Terraform, CloudFormation).
- Advanced SQL and Python programming skills.
- Hands-on experience with CI/CD tools (Git, GitHub Actions, Jenkins) and containerization (Docker, Kubernetes).
- Knowledge of real-time & batch processing (Kafka, Kinesis, Apache Airflow).
- Proven ability to implement data segregation policies, security (IAM, VPN, Encryption), and compliance (GDPR, CCPA).
Good-to-Have Skills (Optional):
- Experience with workflow orchestration tools like Dagster or Prefect.
- Background in data governance frameworks and metadata management.
- Exposure to Agile delivery methodologies (Scrum, Kanban).
- Strong stakeholder collaboration skills for both technical and business discussions.
Qualifications & Experience:
- Education: Bachelor's degree in Computer Science, Data Science, Information Systems, or related field (Master's preferred).
- Experience: 5–7 years in Data Engineering, Analytics Engineering, or related fields.
- Proven track record in designing, developing, and maintaining large-scale, cloud-based data platforms and pipelines.
- Strong problem-solving, analytical, and proactive mindset with an aptitude for continuous learning.
- Ability to manage projects efficiently while collaborating across multiple teams and stakeholders.