Data Scientist

swits digital private limited

Gurugram, Gurugram, India

3-5 Years

Save

Posted 4 days ago
Be among the first 10 applicants

Early Applicant

Job Description

Job Title : Data Scientist

Location - Gurugram

Job Description

Do you believe in the missions Intelligence agencies Are you interested in solving complex

programmatic and technical issues

If you are interested in working on some of the most challenging technical and programmatic issues ,

we are interested in talking to you about Netomi work and career opportunities.

We are seeking a Senior Data Scientist with deep expertise in Large Language Models (LLMs) and

modern AI systems to join our team. This role combines cutting-edge research, rapid prototyping,

and production-grade implementation to deliver innovative AI-powered solutions. You will drive NLP

and machine learning projects from conception through deployment, working as both a direct

contributor and key technical advisor.

Job Responsibilities

Research & Innovation - Stay current with the latest LLM research, architectures, and

advancements in the field including real-time models and multimodal systems. Evaluate

emerging techniques and methodologies for potential application to business problems.

Monitor developments in transformer architectures, fine-tuning approaches, model

optimization, and real-time inference. Research and assess new LLM capabilities,

frameworks, and API features as they emerge

Solution Design & Prototyping - Identify and define approaches for complex AI challenges

leveraging state-of-the-art LLMs. Design and build proof-of-concept solutions to validate

technical feasibility. Rapidly prototype LLM-based applications using modern frameworks

and orchestration tools. Conduct rigorous experiments to evaluate different approaches

and methodologies. Work collaboratively in multi-disciplinary team environments and

establish professional networks with subject matter experts

Production Development & Software Engineering - Write clean, maintainable, production-

quality code following software engineering best practices and design patterns. Develop

robust, scalable agentic workflows using orchestration frameworks (such as LangGraph,

CrewAI, or similar). Implement advanced LLM features, including tool calling, function

calling, structured outputs, and multi-turn conversations. Build production-grade systems

utilizing Model Context Protocol (MCP) and other emerging standards. Design and

implement scalable, fault-tolerant architectures for real-time LLM-powered applications.

Conduct thorough code reviews and maintain high code quality standards. Optimize code

for performance, memory efficiency, and cost-effectiveness in production environments

Experimentation & Optimization - Design rigorous experiments to test hypotheses and

validate model performance. Develop evaluation frameworks for LLM outputs, system

performance, and user experience. Optimize prompt engineering strategies, fine-tuning

approaches, and inference efficiency. Conduct A/B tests, performance benchmarking, and

statistical analysis

Requirements

3-5 years of experience in data science, machine learning, and AI development with strong

focus on NLP and LLM applications

Bachelor's/Master's or higher degree in Computer Science, Machine Learning, Statistics,

or related technical field

Proven track record of building and deploying production ML/AI systems from research to

deployment

Mastery of Python with strong software engineering fundamentals (OOP, design patterns,

testing)

Deep hands-on experience with LLM frameworks and APIs (OpenAI, Anthropic, or similar)
Strong experience with at least one deep learning framework (PyTorch or TensorFlow)
Proficiency with modern ML orchestration and agentic frameworks (LangGraph, CrewAI,

LangChain, or similar)

Solid understanding of NLP techniques: embeddings, information extraction, semantic

search, classification

Experience with diverse ML models: neural networks, transformers, SVM, Random Forest,

clustering, Bayesian models

Hands-on experience with advanced LLM features: tool calling, function calling, multi-turn

conversations, structured outputs

Strong knowledge of software development practices: version control (Git), testing (pytest)
Experience with REST APIs, async programming, and building scalable backend services
Familiarity with vector databases and embedding systems (Pinecone, Weaviate, FAISS, or

similar)

Knowledge of distributed computing, cloud platforms (AWS, GCP, or Azure), and

containerization (Docker)

Strong experimental design skills with ability to formulate hypotheses and conduct rigorous

analysis

Excellent problem-solving abilities and intellectual curiosity to stay current with AI research
Self-motivated with proven ability to work collaboratively in multi-disciplinary teams

Bonus

Experience with voice/speech models and real-time audio processing (OpenAI Realtime

API or similar)

Knowledge of Model Context Protocol (MCP) and emerging LLM standards
Experience with MLOps tools and practices (model monitoring, versioning, A/B testing)
Contributions to open-source ML/AI projects or published research papers
Familiarity with streaming architectures and event-driven systems (Kafka, RabbitMQ)