Senior Data Scientist - NLP

Meril

Vapi, India

5-7 Years

Save

Posted 22 hours ago
Be among the first 10 applicants

Early Applicant

Job Description

Position Overview

Senior-level data scientist role focused on building and deploying production NLP systems on bare metal infrastructure. This position requires a research-oriented mindset with the ability to build first-in-class products by translating cutting-edge research into innovative production solutions.

Required Qualifications

Experience

Minimum 5 years in data science/ML engineering roles
Minimum 3 years tenure in most recent organization in a relevant data science/ML role
Proven track record of deploying ML models to production
Experience managing bare metal server infrastructure

Technical Skills

SQL

Advanced query optimization and performance tuning
Complex joins, window functions, CTEs
Experience with Snowflake, BigQuery, or Redshift
Database performance analysis and indexing strategies

NLP Technology Stack

Transformer architectures
RAG pipeline implementation
LangChain, LlamaIndex, or similar frameworks
Vector databases: Pinecone, Weaviate, Chroma, FAISS
Model fine-tuning: LoRA, QLoRA
Embedding models and semantic search
Prompt engineering techniques

Programming & ML Frameworks

Python (advanced level, production-grade code)
PyTorch or TensorFlow
HuggingFace Transformers
scikit-learn, XGBoost, LightGBM

Key Responsibilities

Technical Execution

Design and implement production NLP solutions using state-of-the-art language models
Build and optimize complex SQL data pipelines processing millions of records
Deploy ML models on bare metal GPU infrastructure
Configure and maintain GPU clusters for training and inference
Implement MLOps practices: versioning, monitoring, automated retraining
Optimize model inference for latency and throughput
Troubleshoot CUDA, driver, and hardware-level issues
Set up distributed training across physical servers
Research and prototype emerging ML techniques

Leadership & Strategy

Lead end-to-end ML projects from problem definition to production deployment
Drive innovation by researching and implementing first-in-class product features
Coordinate cross-functional teams including data engineers, domain experts, and full-stack developers to deliver integrated solutions
Define technical architecture and design decisions for ML systems
Drive adoption of ML best practices and engineering standards across teams
Collaborate with product and engineering leadership on ML roadmap and priorities
Present technical findings and recommendations to executive stakeholders
Own critical ML infrastructure decisions and vendor evaluations
Champion innovation by evaluating and integrating cutting-edge ML research
Lead cross-functional initiatives between data science, engineering, and product teams
Facilitate effective collaboration between technical and non-technical stakeholders
Translate latest research papers into production-ready solutions.

Required Competencies

Research-oriented mindset with ability to innovate and build first-in-class products
Ability to work independently with minimal supervision and drive projects autonomously
Strong analytical and quantitative aptitude
Excellent problem-solving and logical reasoning skills
Proven ability to collaborate with cross-functional teams (data engineers, domain experts, full-stack developers)
Strong communication skills to translate technical concepts for non-technical stakeholders
Willingness to explore uncharted territory and experiment with novel approaches
Self-motivated with strong ownership mentality
Strong understanding of hardware constraints and optimization
Ability to work independently with bare metal infrastructure
Experience with both cloud and on-premise deployments
Proven ability to take projects from research to production
Track record of staying current with ML research and innovations
Strong debugging and troubleshooting skills

Preferred Qualifications

Experience with pre-training multi-modal models (vision-language, audio-text, etc.)
Hands-on experience with large-scale distributed training frameworks (DeepSpeed, FSDP, Megatron-LM)
Contributions to open source ML projects
Technical blog or active GitHub portfolio
Experience with model quantization and efficient inference
Publications or conference presentations
Knowledge of multi-modal architectures (CLIP, Flamingo, GPT-4V style models)