About The Opportunity
A fast-growing technology services firm operating in the cloud data engineering and enterprise digital transformation space, we empower organizations across India to modernize data infrastructure, automate analytics pipelines, and scale AI/ML workloads using AWS-native tooling. Our clients span BFSI, healthcare, logistics, and retail—demanding high-availability, cost-optimized, and compliant data solutions built for the cloud-first era.
Role & Responsibilities
- Design, build, and maintain scalable ETL/ELT pipelines on AWS using Glue, Lambda, Step Functions, and SageMaker Pipelines.
- Architect data lakes and warehouses using S3, Redshift, and Athena; implement partitioning, compression, and lifecycle policies for cost-efficiency.
- Automate data ingestion from on-premises and cloud sources (APIs, RDBMS, SaaS) using AWS Data Pipeline, Kinesis, and SQS.
- Implement data quality checks, monitoring, and alerting using CloudWatch, AWS Glue DataBrew, and custom Python scripts.
- Collaborate with data analysts and ML engineers to provision secure, governed datasets for BI dashboards and model training.
- Optimize pipeline performance and cost via spot instances, auto-scaling, and serverless architectures; document architecture and operational runbooks.
Skills & Qualifications
Must-Have
- AWS Glue
- AWS Lambda
- AWS S3
- AWS Redshift
- Python
- SQL
- ETL/ELT pipelines
- CloudWatch
Preferred
- Athena
- Step Functions
- Kinesis
Benefits & Culture Highlights
- On-site collaborative workspace in tech hubs across India with hybrid flexibility in select roles.
- Access to AWS certification reimbursements and upskilling programs tailored to cloud engineering tracks.
- Project exposure to Fortune 500 clients across multiple verticals—accelerating your cloud data engineering portfolio.
Skills: glue,athena,aws,cloud