Data Engineer
Location: Chennai, Hyderabad, or Bangalore
Years of Experience: 7-10 Years
Detailed JD
We are seeking a Data Engineer with 7-10 years of experience to join our offshore teams. In this role, you will build and manage data pipelines that enable advanced analytics and machine learning initiatives. You will collaborate with business and technical teams to ensure data is accessible, reliable, and optimized for AI/ML workloads, helping the organization unlock actionable insights and drive innovation through data-driven decision-making.
Responsibilities
- Design, develop, and maintain scalable data pipelines to support analytics and machine learning projects.
- Collaborate with stakeholders to understand data requirements and deliver solutions that meet business needs.
- Ensure data quality, consistency, and availability across multiple environments.
- Monitor and optimize workflows to maintain high performance and reliability.
- Support governance and compliance initiatives by implementing standards for data management and security.
- Participate in planning and execution of data integration, migration, and transformation projects.
Mandatory Skills
- AWS SageMaker (model training, deployment, monitoring) - Advanced
- AWS Cloud Services (S3, Lambda, Glue, Redshift, IAM) - Advanced
- Data Engineering Fundamentals (ETL, data pipelines, batch/stream processing) - Expert
- SQL (query optimization, joins, transformations) - Advanced
- Python (data manipulation, scripting, automation) - Advanced
Preferred Skills
- Data Governance & Security: Ensures compliance and protects sensitive data across pipelines and ML workflows.
- Communication & Collaboration: Needed to work effectively with distributed teams and business stakeholders.
- Performance Optimization: Important for scaling pipelines and reducing costs in cloud environments.
- Documentation & Knowledge Sharing: Critical for reproducibility and onboarding new team members.
- Problem Solving & Troubleshooting: Required to quickly resolve workflow issues and maintain reliability.
Qualifications: Bachelor's degree in Computer Science, Engineering, or a related field.