Search by job, company or skills

OmniMD

Sr. Data Scientist

new job description bg glownew job description bg glownew job description bg svg
  • Posted 14 days ago
  • Be among the first 10 applicants
Early Applicant

Job Description

We are seeking a highly hands-on Data Scientistwith 5+ years of experience who is deeply proficient in Large Language Models (LLMs)both open-source and commercialand has strong expertise in prompt engineering, applied machine learning, and local LLM deployments.

This role is not purely academic. The ideal candidate will work on real-world AI systemsincluding AI Frontdesk, AI Clinician, AI RCM, multimodal agents, and healthcare-specific automation, with a focus on production-grade AI, domain-aligned reasoning, and privacy-aware architectures.

Key Responsibilities

1. LLM Research, Evaluation & Selection

  • Evaluate, benchmark, and compare open-source LLMs(LLaMA-2/3, Mistral, Mixtral, Falcon, Qwen, Phi, etc.) and commercial LLMs(OpenAI, Anthropic, Google, Azure).
  • Select appropriate models based on latency, accuracy, cost, explainability, and data-privacy requirements.
  • Maintain an internal LLM capability matrixmapped to specific business use cases.

2. Prompt Engineering & Reasoning Design

  • Design, test, and optimize prompt strategies:
  • Zero-shot, few-shot, chain-of-thought (where applicable)
  • Tool-calling and function-calling prompts
  • Multi-agent and planner-executor patterns
  • Build domain-aware promptsfor healthcare workflows (clinical notes, scheduling, RCM, patient communication).
  • Implement prompt versioning, prompt A/B testing, and regression checks.

3. Applied ML & Model Development

  • Build and fine-tune ML/DL models(classification, NER, summarization, clustering, recommendation).
  • Apply traditional ML + LLM hybridswhere LLMs alone are not optimal.
  • Perform feature engineering, model evaluation, and error analysis.
  • Work with structured (SQL/FHIR)and unstructured (text, audio)data.

4. Local LLM & On-Prem Deployment

  • Deploy and optimize local LLMsusing frameworks such as:
  • Ollama, vLLM, llama.cpp, HuggingFace Transformers
  • Implement quantization (4-bit/8-bit)and performance tuning.
  • Support air-gapped / HIPAA-compliantinference environments.
  • Integrate local models with microservices and APIs.

5. RAG & Knowledge Systems

  • Design and implement Retrieval-Augmented Generation (RAG)pipelines.
  • Work with vector databases (FAISS, Chroma, Weaviate, Pinecone).
  • Optimize chunking, embedding strategies, and relevance scoring.
  • Ensure traceability and citation of retrieved sources.

6. AI System Integration & Productionization

  • Collaborate with backend and frontend teams to integrate AI models into:
  • Spring Boot / FastAPI services
  • React-based applications
  • Implement monitoring for accuracy drift, latency, hallucinations, and cost.
  • Document AI behaviors clearly for BA, QA, and compliance teams.

7. Responsible AI & Compliance Awareness

  • Apply PHI-safe design principles(prompt redaction, data minimization).
  • Understand healthcare AI constraints (HIPAA, auditability, explainability).
  • Support human-in-the-loop and fallback mechanisms.

Required Skills & Qualifications

Core Technical Skills

  • Strong proficiency in Python(NumPy, Pandas, Scikit-learn).
  • Solid understanding of ML fundamentals(supervised/unsupervised learning).
  • Hands-on experience with LLMs (open-source + commercial).
  • Strong command of prompt engineering techniques.
  • Experience deploying models locally or in controlled environments.

LLM & AI Tooling

  • HuggingFace ecosystem
  • OpenAI / Anthropic APIs
  • Vector databases
  • LangChain / LlamaIndex (or equivalent orchestration frameworks)

Data & Systems

  • SQL and data modeling
  • REST APIs
  • Git, Docker (basic)
  • Linux environments

Preferred / Good-to-Have Skills

  • Experience in healthcare data(EHR, clinical text, FHIR concepts).
  • Exposure to multimodal AI(speech-to-text, text-to-speech).
  • Knowledge of model evaluation frameworksfor LLMs.
  • Familiarity with agentic AI architectures.
  • Experience working in startup or fast-moving product teams.

Research & Mindset Expectations (Important)

  • Strong inclination toward applied research, not just model usage.
  • Ability to read and translate research papers into working prototypes.
  • Curious, experimental, and iterative mindset.
  • Clear understanding that accuracy, safety, and explainabilitymatter more than flashy demos.

What We Offer

  • Opportunity to work on real production AI systemsused in US healthcare.
  • Exposure to end-to-end AI lifecycle: research prototype production.
  • Work with local LLMs, agentic systems, and multimodal AI.
  • High ownership, visibility, and learning curve.

More Info

Job Type:
Industry:
Function:
Employment Type:

About Company

Job ID: 142729649

Similar Jobs