
Search by job, company or skills
. The Lead Data Engineer is a strategic and technical leadership role responsible for architecting, scaling, and evolving enterprise-grade data platforms that enable advanced analytics, AI/ML, and data-driven decision-making. Reporting to the Senior Director of Data Platforms, this role will lead the design and governance of modern data architectures, drive innovation in AI orchestration, and ensure the delivery of secure, compliant, and high-performing data solutions.
. This position combines hands-on engineering expertise with architectural vision and cross-functional leadership. The Lead Data Engineer will guide engineering teams, influence platform strategy, and establish best practices across the organization's data ecosystem.
Basic Qualifications :
. Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or related field.
. 8+ years of experience in data engineering and architecture, with a proven track record of leading large-scale data initiatives.
. Deep expertise in Python, PySpark.
. Strong hands-on experience with Databricks (Spark, Delta Lake, Workflows)
. Strong experience with AWS (S3, IAM, Textract, Bedrock or equivalent)
. Experience with design and implement scalable document ingestion pipelines using Databricks Auto Loader and AWS S3.
. Understanding of vector embeddings and semantic search
. Strong understanding of data governance, privacy, and compliance in regulated industries (healthcare, life sciences).
Good to Have :
. Advanced knowledge of data modeling, lakehouse/lake/warehouse design, and performance optimization.
. Familiarity with generative AI platforms and use cases.
. Contributions to open-source projects or thought leadership in data engineering/architecture.
. Experience with Agile methodologies, CI/CD, and DevOps practices.
. Exposure to FastAPI, or API-based ML services
. Experience evaluating LLM output quality
Key Responsibilities :
. Lead Engineering Teams: Provide technical leadership and mentorship to data engineers, fostering a culture of excellence, innovation, and continuous improvement.
. AI/ML Enablement: Collaborate with Data Science and ML Engineering teams to operationalize models, implement AI orchestration frameworks (e.g., MLflow, Airflow), and ensure scalable deployment pipelines.
. Platform Strategy & Governance: Define and enforce architectural standards, data governance policies, and compliance frameworks (HIPAA, SOC 2, GDPR, etc.) across the data platform.
. Performance & Reliability Optimization: Drive observability, automation, and performance tuning across data pipelines and infrastructure to ensure reliability at scale.
. Cross-Functional Collaboration: Partner with product, analytics, compliance, and infrastructure teams to align data architecture with business goals and regulatory requirements.
. Innovation & Thought Leadership: Stay ahead of industry trends, evaluate emerging technologies, and contribute to strategic decisions on platform evolution, including generative AI integration and event-driven systems.
Perks and Benefits for Irisians
Iris provides world-class benefits for a personalized employee experience. These benefits are designed to support financial, health and well-being needs of Irisians for a holistic professional and personal growth. Click to view the benefits.
A strategic partner that transformational leaders can trust to realize the full potential of technology-enabled transformation.As a trusted technology partner, we focus our highly-experienced talent and rightsized teams to develop complex, mission-critical applications and solutions for leading enterprise across financial services, life sciences, including pharmaceutical, CROs and medical devices, manufacturing & logistics and educational services.
Job ID: 142581555