Search by job, company or skills

Persistent Systems

Senior Data Engineer

8-12 Years
Save
new job description bg glownew job description bg glow
  • Posted 22 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

About Position:

We are seeking a highly skilled Senior Data Engineer to design and build AI-ready data platforms with a strong focus on Master Data Management (MDM), scalable data pipelines, and Retrieval-Augmented Generation (RAG) architectures. The role involves enabling enterprise-wide harmonized data models, semantic layers, and governed data products to support advanced AI/ML use cases in regulated healthcare environments.

  • Role: Senior Data Engineer
  • Location: All Persistent Location
  • Experience: 8 to 12 years
  • Job Type: Full Time Employment

What You'll Do:

  • Build and manage context/semantic data layer (MDM, harmonised entities) to make enterprise data AI‑ready
  • Design scalable ingestion & transformation pipelines for claims, denials and operational datasets
  • Implement vector DB + RAG pipelines (embedding, indexing, retrieval) for context-driven AI use cases
  • Ensure data quality, governance, lineage and auditability for regulated healthcare workflows
  • Expose data products/APIs for seamless integration with AI agents and downstream applications

Expertise You'll Bring:

  • Strong experience in Python, SQL, PySpark
  • Hands-on experience with data platforms: Databricks / Snowflake / Azure / AWS / GCP
  • Expertise in ETL/ELT pipeline design and orchestration (Airflow, DBT, etc.)
  • Experience with MDM tools/platforms (Informatica, Reltio, Profisee – preferred)
  • Knowledge of vector databases & RAG frameworks (LangChain, LlamaIndex)
  • Strong understanding of: Data modeling & schema design , Distributed data systems (Spark, Kafka, etc.)

Benefits:

  • Competitive salary and benefits package
  • Culture focused on talent development with quarterly growth opportunities and company-sponsored higher education and certifications
  • Opportunity to work with cutting-edge technologies
  • Employee engagement initiatives such as project parties, flexible work hours, and Long Service awards
  • Annual health check-ups
  • Insurance coverage: group term life, personal accident, and Mediclaim hospitalization for self, spouse, two children, and parents

Values-Driven, People-Centric & Inclusive Work Environment:

Persistent is dedicated to fostering diversity and inclusion in the workplace. We invite applications from all qualified individuals, including those with disabilities, and regardless of gender or gender preference. We welcome diverse candidates from all backgrounds.

  • We support hybrid work and flexible hours to fit diverse lifestyles.
  • Our office is accessibility-friendly, with ergonomic setups and assistive technologies to support employees with physical disabilities.
  • If you are a person with disabilities and have specific requirements, please inform us during the application process or at any time during your employment

Let's unleash your full potential at Persistent - persistent.com/careers

Persistent is an Equal Opportunity Employer and prohibits discrimination and harassment of any kind.

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 148570161

Similar Jobs

Pune, India

Skills:

Apache AirflowDockerApache KafkaKubernetesPythonSqlApache Spark StreamingFlink

Pune, India

Skills:

Database ManagementData ModelingApache AirflowAzure SynapseAzure Data FactoryDatabricksPythonStored proceduresAzure data lake storageLakehouse architecturesAzure data ecosystemData Lake TechnologiesMS FabricETL processesPySpark SQL

Pune, India

Skills:

DevopsAzure Data FactoryPysparkKafkaAzure DatabricksCosmos DBSqlEvent HubAzure Data Explorer

Pune, India

Skills:

Power BiSparkPythonquery developmentData Vault architectureMicrosoft Azure servicesMicrosoft FabricSQL databasesstored procedures

Pune, India

Skills:

T-sqlHadoopPower BiPysparkScalaSQL ServerApache SparkKafkaAzure DatabricksAzure MLAzure Data FactoryAzure Synapse AnalyticsAzure Data LakePythonEventHubStream AnalyticsAzure SQL DBAzure Analysis Services