To succeed in this role, the candidate must have strong Data Engineering experience along with MDM knowledge. Candidates with only MDM experience are not eligible. Candidates must have data engineering experience on technologies like SQL, Python, PySpark, Databricks, AWS, API Integrations, along with knowledge of MDM (Master Data Management).
Roles & Responsibilities
- Develop the MDM backend solutions and implement ETL and Data Engineering pipelines using Databricks, AWS, Python/PySpark, SQL, etc.
- Lead the implementation and optimization of MDM solutions using Informatica or Reltio platforms.
- Perform data profiling and identify necessary Data Quality (DQ) rules.
- Define and drive enterprise-wide MDM architecture, including IDQ, data stewardship, and metadata workflows.
- Manage cloud-based infrastructure using AWS and Databricks to ensure scalability and performance.
- Ensure data integrity, lineage, and traceability across MDM pipelines and solutions.
- Provide mentorship and technical leadership to junior team members and ensure project delivery timelines.
- Support custom UI team integration with backend data using APIs or other methods to improve the data stewardship user experience.
Basic Qualifications and Experience
- Master's degree with 4 6 years of experience in Business, Engineering, IT or related field
- Bachelor's degree with 6 9 years of experience in Business, Engineering, IT or related field
- Diploma with 10 12 years of experience in Business, Engineering, IT or related field
Functional Skills
Must-Have Skills
- Strong understanding and hands-on experience with Databricks and AWS cloud services
- Proficiency in Python, PySpark, SQL, and Unix for data processing and orchestration
- Deep knowledge of MDM tools (e.g., Informatica, Reltio) and data quality frameworks (e.g., IDQ)
- Knowledge of customer master data (e.g., HCP, HCO)
- Experience with data modeling, governance, and DCR lifecycle management
- Ability to implement end-to-end integrations, including API-based, batch, and flat file-based integrations
- Strong experience with external data enrichments (e.g., Dun & Bradstreet - D&B)
- Expertise in match/merge logic and survivorship rule implementations
- Strong understanding of reference data and its integration with MDM
- Experience with custom workflows, data pipelines, or orchestration tools
Good-to-Have Skills
- Experience with Tableau or Power BI for reporting MDM insights
- Exposure to Data Science and GenAI capabilities
- Experience with Agile practices and tools (e.g., JIRA, Confluence)
- Prior experience in Pharma/Life Sciences domain
- Understanding of compliance and regulatory considerations in master data
Professional Certifications
- MDM Certification (e.g., Informatica, Reltio)
- Databricks Certification (Data Engineer or Architect)
- Any cloud certification (AWS or Azure)