Search by job, company or skills

Amgen Technology Private Limited

MDM Senior Data Scientist

5-9 Years
new job description bg glownew job description bg glownew job description bg svg
  • Posted 6 days ago
  • Be among the first 50 applicants
Early Applicant
Quick Apply

Job Description

  • Develop enterprise-level GenAI applications using LLM frameworks such as Langchain, Autogen, and Hugging Face.
  • Design and develop intelligent pipelines using PySpark, TensorFlow, and PyTorch within Databricks and AWS environments.
  • Implement embedding models and manage VectorStores for retrieval-augmented generation (RAG) solutions.
  • Integrate and leverage MDM platforms like Informatica and Reltio to supply high-quality structured data to ML systems.
  • Utilize SQL and Python for data engineering, data wrangling, and pipeline automation.
  • Build scalable APIs and services to serve GenAI models in production.
  • Lead cross-functional collaboration with data scientists, engineers, and product teams to scope, design, and deploy AI-powered systems.
  • Ensure model governance, version control, and auditability aligned with regulatory and compliance expectations.
  • Guide the team on development activities and lead solution discussions.

Basic Qualifications and Experience:

  • Master's degree with 4 - 6 years of experience in Business, Engineering, IT, or related field OR
  • Bachelor's degree with 6 - 9 years of experience in Business, Engineering, IT, or related field OR
  • Diploma with 10 - 12 years of experience in Business, Engineering, IT, or related field

Functional Skills:

Must-Have Skills:

  • 6+ years of experience working in AI/ML or Data Science roles, including designing and implementing GenAI solutions.
  • Hands-on experience with LLM frameworks and tools like Langchain, Autogen, Hugging Face, OpenAI APIs, and embedding models.
  • Strong programming background in Python, PySpark, TensorFlow, PyTorch, and SK-Learn.
  • Proven experience building and deploying AI/ML applications in cloud environments such as AWS.
  • Expertise in developing APIs, automation pipelines, and serving GenAI models using Django, FastAPI, and DataBricks.
  • Solid experience integrating and managing MDM tools (Informatica, Reltio) and applying data governance best practices.
  • Core technical capabilities in GenAI and Data Science.

Good-to-Have Skills:

  • Experience in Data Modeling, ETL development, and data profiling to support AI/ML workflows.
  • Knowledge of Life Sciences or Pharma industry standards and regulatory considerations.
  • Proficiency in Agile tools like JIRA and Confluence.
  • Familiarity with MongoDB, VectorStores, and scalable GenAI architecture principles.

Professional Certifications:

  • ETL certification (e.g., Informatica)
  • Data Analysis certification (SQL)
  • Cloud certification (AWS or Azure)
  • Data Science and ML certifications

Soft Skills:

  • Strong analytical skills for assessing and improving master data processes.
  • Excellent verbal and written communication skills for conveying complex data concepts.
  • Effective problem-solving skills to implement scalable solutions.
  • Ability to work effectively with global and virtual teams.

More Info

Job Type:
Industry:
Employment Type:
Open to candidates from:
Indian

About Company

Amgen harnesses the best of biology and technology to fight the world’s toughest diseases, and make people’s lives easier, fuller and longer. We discover, develop, manufacture and deliver innovative medicines to help millions of patients. Amgen helped establish the biotechnology industry more than 40 years ago and remains on the cutting-edge of innovation, using technology and human genetic data to push beyond what’s known today. Amgen is one of the 30 companies that comprise the Dow Jones Industrial Average, and we are also part of the Nasdaq-100 Index, which includes the largest and most innovative non-financial companies listed on the Nasdaq Stock Market based on market capitalization.

Job ID: 117037871