Search by job, company or skills

Meril

Senior Data Scientist - NLP

5-7 Years
new job description bg glownew job description bg glownew job description bg svg
  • Posted 22 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Position Overview

Senior-level data scientist role focused on building and deploying production NLP systems on bare metal infrastructure. This position requires a research-oriented mindset with the ability to build first-in-class products by translating cutting-edge research into innovative production solutions.

Required Qualifications

Experience

  • Minimum 5 years in data science/ML engineering roles
  • Minimum 3 years tenure in most recent organization in a relevant data science/ML role
  • Proven track record of deploying ML models to production
  • Experience managing bare metal server infrastructure

Technical Skills

SQL

  • Advanced query optimization and performance tuning
  • Complex joins, window functions, CTEs
  • Experience with Snowflake, BigQuery, or Redshift
  • Database performance analysis and indexing strategies

NLP Technology Stack

  • Transformer architectures
  • RAG pipeline implementation
  • LangChain, LlamaIndex, or similar frameworks
  • Vector databases: Pinecone, Weaviate, Chroma, FAISS
  • Model fine-tuning: LoRA, QLoRA
  • Embedding models and semantic search
  • Prompt engineering techniques

Programming & ML Frameworks

  • Python (advanced level, production-grade code)
  • PyTorch or TensorFlow
  • HuggingFace Transformers
  • scikit-learn, XGBoost, LightGBM

Key Responsibilities

Technical Execution

  • Design and implement production NLP solutions using state-of-the-art language models
  • Build and optimize complex SQL data pipelines processing millions of records
  • Deploy ML models on bare metal GPU infrastructure
  • Configure and maintain GPU clusters for training and inference
  • Implement MLOps practices: versioning, monitoring, automated retraining
  • Optimize model inference for latency and throughput
  • Troubleshoot CUDA, driver, and hardware-level issues
  • Set up distributed training across physical servers
  • Research and prototype emerging ML techniques

Leadership & Strategy

  • Lead end-to-end ML projects from problem definition to production deployment
  • Drive innovation by researching and implementing first-in-class product features
  • Coordinate cross-functional teams including data engineers, domain experts, and full-stack developers to deliver integrated solutions
  • Define technical architecture and design decisions for ML systems
  • Drive adoption of ML best practices and engineering standards across teams
  • Collaborate with product and engineering leadership on ML roadmap and priorities
  • Present technical findings and recommendations to executive stakeholders
  • Own critical ML infrastructure decisions and vendor evaluations
  • Champion innovation by evaluating and integrating cutting-edge ML research
  • Lead cross-functional initiatives between data science, engineering, and product teams
  • Facilitate effective collaboration between technical and non-technical stakeholders
  • Translate latest research papers into production-ready solutions.

Required Competencies

  • Research-oriented mindset with ability to innovate and build first-in-class products
  • Ability to work independently with minimal supervision and drive projects autonomously
  • Strong analytical and quantitative aptitude
  • Excellent problem-solving and logical reasoning skills
  • Proven ability to collaborate with cross-functional teams (data engineers, domain experts, full-stack developers)
  • Strong communication skills to translate technical concepts for non-technical stakeholders
  • Willingness to explore uncharted territory and experiment with novel approaches
  • Self-motivated with strong ownership mentality
  • Strong understanding of hardware constraints and optimization
  • Ability to work independently with bare metal infrastructure
  • Experience with both cloud and on-premise deployments
  • Proven ability to take projects from research to production
  • Track record of staying current with ML research and innovations
  • Strong debugging and troubleshooting skills

Preferred Qualifications

  • Experience with pre-training multi-modal models (vision-language, audio-text, etc.)
  • Hands-on experience with large-scale distributed training frameworks (DeepSpeed, FSDP, Megatron-LM)
  • Contributions to open source ML projects
  • Technical blog or active GitHub portfolio
  • Experience with model quantization and efficient inference
  • Publications or conference presentations
  • Knowledge of multi-modal architectures (CLIP, Flamingo, GPT-4V style models)

More Info

Job Type:
Industry:
Function:
Employment Type:

About Company

Job ID: 136094939

Similar Jobs