Search by job, company or skills

IRIS software

Data Engineer - Lead

Save
new job description bg glownew job description bg glow
  • Posted a day ago
  • Be among the first 20 applicants
Early Applicant

Job Description

Why Join Iris
Are you ready to do the best work of your career at one ofIndia's Top 25 Best Workplaces in IT industry Do you want to grow in an award-winning culture thattruly values your talent and ambitions
Join Iris Software - one of thefastest-growing IT services companies- whereyou own and shape your success story.
About Us
At Iris Software, our vision is to be our client's most trusted technology partner, and the first choice for the industry's top professionals to realize their full potential.
With over 4,300 associates across India, U.S.A, and Canada, we help our enterprise clients thrive with technology-enabled transformation across financial services, healthcare, transportation & logistics, and professional services.
Our work covers complex, mission-critical applications with the latest technologies, such as high-value complex Application & Product Engineering, Data & Analytics, Cloud, DevOps, Data & MLOps, Quality Engineering, and Business Automation.

Working with Us
At Iris, every role is more than a job - it's a launchpad for growth.
Our Employee Value Proposition, Build Your Future. Own Your Journey.reflects our belief that people thrive when they have ownership of their career and the right opportunities to shape it.
We foster a culture where your potential is valued, your voice matters, and your work creates real impact. With cutting-edge projects, personalized career development, continuous learning and mentorship, we support you to grow and become your best - both personally and professionally.
Curious what it's like to work at Iris Head to this video for an inside look at the people, the passion, and the possibilities. Watch it .

Job Description

PRIMARY RESPONSIBILITIES

  • Design and implement robust, scalable data pipelines to support AI/ML model development and deployment
  • Clean, transform, and curate structured and unstructured data from diverse sources to ensure model-ready quality
  • Collaborate with data scientists, ML engineers, and business teams to ensure data readiness, usability, and alignment with AI objectives
  • Develop andmaintainmetadata management, data lineage, and data quality frameworks to support AI governance and compliance
  • Enable advanced feature engineering capabilities and implement real-time data streaming solutions for AI applications
  • Design and deploy data collection instruments and oversee data aggregation processes
  • Ensure data compliance, privacy, and ethical use standards across all AI workflows
  • Support enterprise-wide data initiatives including business glossary development, taxonomy creation, and the DARE program's automation goals

KNOWLEDGE/SKILLS

  • Programming & Technical Skills:Advancedproficiencyin Python, SQL, and distributed computing frameworks (Spark, Hadoop)
  • Cloud Platforms:Hands-on experience with cloud data platforms (AWS, Azure, GCP) and their data services
  • Data Engineering Tools:Expertisewith data orchestration tools (Airflow, Prefect), ETL/ELT frameworks, and data pipeline optimization
  • AI/ML Data Specialization:Experience with ML feature stores, data versioning, model-ready data design, and real-time streaming technologies
  • Data Management:Strong understanding of data governance, metadata management, data quality frameworks, and lineage tracking
  • Compliance & Ethics:Knowledge of data privacy regulations, compliance standards, and ethical AI data practices
  • Collaboration:Excellent communication skills and ability to work effectively with cross-functional teams including data scientists, ML engineers, and business stakeholders

EDUCATION AND EXPERIENCE

Required:

  • Bachelor's degree in Computer Science, Data Engineering, Computer Engineering, or related technical field
  • 3+ years of experience in data engineering withdemonstratedexperience in AI/ML environments
  • Proficiencyin Python and SQL for data manipulation and pipeline development
  • Experience with cloud data platforms (AWS, Azure, or GCP)
  • Hands-on experience with data orchestration tools (e.g., Airflow, Prefect) and ETL frameworks

Preferred:

  • Master's degree in Computer Science, Data Engineering, or related field
  • 5+ years of experience in data engineering, preferably in AI/ML environments
  • Experience with real-time data processing (Kafka, Kinesis).
  • Exposure to LLMs and generative AI data preparation.
  • Knowledge ofMLOpsand integration with ML lifecycle tools.
  • Familiarity with BI tools and semantic layer design.

Must Have -
. Python & advanced SQL
. Spark / distributed processing
. Cloud platforms (AWS)
. Airflow / orchestration tools
. Data pipeline automation
. Data governance & quality

Nice to have -
. Kafka/Kinesis streaming
. Feature stores
. LLM‑ready data pipelines
. MLOps exposure

Mandatory Competencies

Cloud - AWS - AWS S3, S3 glacier, AWS EBS
Programming Language - Python - Python Scripting
DevOps/Configuration Mgmt - DevOps/Configuration Mgmt - Basic Bash/Shell script writing
Data Governance - Data Governance - Solidatus
Database - Database Programming - SQL
Big Data - Big Data - Pyspark
Cloud - AWS - Tensorflow on AWS, AWS Glue, AWS EMR, Amazon Data Pipeline, AWS Redshift
Beh - Communication and collaboration

Perks and Benefits for Irisians
Iris provides world-class benefits for a personalized employee experience. These benefits are designed to support financial, health and well-being needs of Irisians for a holistic professional and personal growth. Click to view the benefits.

More Info

Job Type:
Function:
Employment Type:

About Company

A strategic partner that transformational leaders can trust to realize the full potential of technology-enabled transformation.As a trusted technology partner, we focus our highly-experienced talent and rightsized teams to develop complex, mission-critical applications and solutions for leading enterprise across financial services, life sciences, including pharmaceutical, CROs and medical devices, manufacturing & logistics and educational services.

Job ID: 147775465

Similar Jobs

Noida, India

Skills:

HadoopPysparkAWS GlueKafkaRedshiftSqlData QualityDynamo DbIamSparkData ArchitectureAWS SNSData WarehousingPythonAws S3Data LakesData PipelinesDmsCloud Optimization

Noida

Skills:

SqlAWSAgile DevelopmentCassandraSpark

Noida, India

Skills:

GitSqlAzure DatabricksKubernetesPythonTerraformJenkinsScalaSparkAirflow

Noida, India

Skills:

IamSqlS3Data ArchitectureDynamo DbAWS GlueData WarehousingHadoopSesPysparkKafkaData QualitySnsPythonSparkRedshiftDmsCloud OptimizationData LakesData Pipelines

Noida, India

Skills:

Shell scriptingPython