AuxoAI is seeking aSenior Data Engineerto lead the design, development, and optimization of modern data pipelines and cloud-native platforms usingGoogle Cloud Platform (GCP). This role is ideal for someone with deep experience building scalable batch and streaming data workflows, strong hands-on engineering skills, and a drive to mentor junior engineers.
You'll work closely with cross-functional teams to build production-grade pipelines using tools likeBigQuery, Dataflow, Pub/Sub, Cloud Composer, andDataform, enabling high-quality data delivery and analytics at scale.
Responsibilities:
- Build and optimizebatch and streaming data pipelinesusingApache Beam (Dataflow)
- Design and maintainBigQuery datasetsusing best practices in partitioning, clustering, and materialized views
- Develop and manageAirflow DAGsinCloud Composerfor workflow orchestration
- Implement SQL-based transformations usingDataform(or dbt)
- LeveragePub/Subfor event-driven ingestion andCloud Storagefor raw/lake layer data architecture
- Drive engineering best practices across CI/CD, testing, monitoring, and pipeline observability
- Partner with solution architects and product teams to translate data requirements into technical designs
- Mentor junior data engineers and support knowledge-sharing across the team
- Contribute to documentation, code reviews, sprint planning, and agile ceremonies
Requirements:
- 5+ years of hands-on experience indata engineering, with at least 2 years onGCP
- Proven expertise inBigQuery,Dataflow (Apache Beam),Cloud Composer (Airflow)
- Strong programming skills inPythonand/orJava
- Experience withSQL optimization,data modeling, andpipeline orchestration
- Familiarity withGit,CI/CD pipelines, and data quality monitoring frameworks
- Exposure toDataform,dbt, or similar tools for ELT workflows
- Solid understanding ofdata architecture,schema design, and performance tuning
- Excellent problem-solving and collaboration skills
Bonus Skills:
- GCP Professional Data Engineer certification
- Experience withVertex AI,Cloud Functions,Dataproc, orreal-time streaming architectures
- Familiarity withdata governance tools(e.g., Atlan, Collibra, Dataplex)
- Exposure toDocker/Kubernetes,API integration, andinfrastructure-as-code (Terraform)