Job Title: Senior Data Engineer
Location: Chennai
Experience: 5+ Years
Employment Type: Full-Time
About the Opportunity
We are hiring on behalf of a client, a fast-growing, data-driven organization focused on building scalable data platforms and advanced analytics solutions. The organization builds modern data ecosystems that enable real-time decision-making and business intelligence across multiple domains.
They are looking for a Senior Data Engineer to join their Data Platform team and contribute to building robust, scalable, and high-performance data infrastructure.
Role Overview
As a Senior Data Engineer, you will be responsible for designing, building, and maintaining large-scale data pipelines and data platforms. You will work closely with cross-functional teams, including software engineers, data analysts, and business stakeholders, to enable seamless data flow and analytics capabilities.
Key Responsibilities
- Design, deploy, and manage multi-node big data clusters across development, test, and production environments
- Build and optimize data pipelines for ingestion, transformation, and loading (ETL/ELT)
- Develop automation scripts for infrastructure provisioning and operations (Python/Shell)
- Work closely with Data Lake and BI teams to design scalable analytics and reporting solutions
- Design and maintain data models for reporting and analytics use cases
- Monitor system performance, troubleshoot issues, and optimize data platform efficiency
- Enable data accessibility and reliability across business applications
- Develop dashboards and KPIs to support business decision-making
- Collaborate with cross-functional teams in an Agile environment
- Implement automated solutions for recurring reporting and analytics workflows
- Ensure high availability, scalability, and reliability of the data ecosystem
Required Skills & Experience
- Strong experience in Big Data technologies: Apache Spark, the Hadoop ecosystem (including YARN), and distributed systems
- Hands-on experience with Cloud Platforms: AWS or GCP (Amazon EMR experience preferred)
- Strong proficiency in SQL and Big Data query engines (such as Vertica or Dremio)
- Experience with ETL/ELT pipelines and data warehousing concepts (OLAP)
- Proficiency in Python and Shell scripting
- Experience with data modeling and data wrangling
- Strong debugging, monitoring, and troubleshooting skills
- Familiarity with Linux-based environments
- Experience with version control systems (Git)
- Exposure to data visualization tools (Tableau, Apache Superset, etc.) is a plus
- Understanding of data structures and distributed computing concepts
Preferred Qualifications
- Bachelor's or Master's degree in Computer Science or a related field
- Experience working in product-based or high-scale environments
- Exposure to Agile/Scrum methodologies
- Strong communication and problem-solving skills
- Experience handling on-call production support
Why Consider This Opportunity
- Work on large-scale data platforms and modern data architectures
- Opportunity to collaborate with high-performing engineering teams
- Exposure to cutting-edge technologies in cloud and big data ecosystems
- Fast-paced environment with a strong focus on innovation and impact