Role: Life Sciences Data Architect
Experience: 12-18 years
Location: Greater Noida, Pune, Hyderabad, 5 days WFO
We are seeking a seasoned Life Sciences Data Architect with deep expertise in designing and implementing modern data platforms and solutions for the life sciences domain. The ideal candidate will have strong knowledge of pharma, biotech, and medical device data models, interoperability standards, and regulatory frameworks. Exposure to Industry 4.0 and broader healthcare domains is a plus. You will lead the architecture, optimization, and governance of data platforms supporting analytics, reporting, and AI/ML workloads for global life sciences organizations.
Key Responsibilities
- Architect and implement life sciences data platforms (data mesh, data lakes, warehousing, streaming/incremental loads).
- Define data modeling strategies (star/snowflake/dimensional, data vault, hybrid) for healthcare/payer analytics and AI/ML use cases.
- Lead migrations from legacy EDWs (Teradata, Oracle, SQL Server) to modern platforms such as Snowflake and Databricks, including performance tuning and validation.
- Implement ETL/ELT patterns using tools such as Fivetran, Matillion, AWS Glue, ADF, Databricks, and DBT; establish CI/CD pipelines for SQL/DBT code.
- Enforce data governance, lineage, access control, and PHI security (HIPAA, GDPR, regional compliance).
- Optimize platform performance (warehouse sizing, clustering keys, caching, materialized views).
- Define observability and alerting frameworks for pipeline health, data quality, and SLA monitoring.
- Mentor and lead data engineering teams, set standards, and deliver stakeholder demos.
Mandatory Skills
- 12-18 years in data engineering/architecture, with at least 3 years architecting modern data platforms (Snowflake, Databricks) in production.
- Proven experience across the life sciences value chain: R&D, manufacturing, supply chain, commercial, patient safety, sustainability.
- Strong ETL, SQL, performance tuning, and data modeling skills; experience with DBT, Snowpipe, streams & tasks preferred.
- Hands-on with cloud platforms (AWS/Azure/GCP) and integration tools (Fivetran, Matillion, Informatica, ADF, Glue, Kafka).
- Familiarity with orchestration (Airflow, ADF), monitoring, data catalog tools (Collibra, Alation), and data quality frameworks.
- Expertise in security, RBAC, encryption, and compliance for healthcare data.
- Excellent stakeholder management and ability to present technical proposals to CxO and clinical leaders.
Desirable Skills
- Bachelor's degree in Computer Science, Information Systems, or Bioinformatics; Master's preferred.
- Certifications in life sciences data standards or bioinformatics.
- Experience with BI tools (Power BI, Tableau, Looker).
- Familiarity with ML operationalization and MLOps best practices for healthcare analytics.