Search by job, company or skills

Globiva

Globiva - AI Engineer - RAG Pipelines

new job description bg glownew job description bg glownew job description bg svg
  • Posted 2 days ago
  • Be among the first 10 applicants
Early Applicant

Job Description

AI Engineer (RAG/NLP) Gurgaon (On-site)

About The Role

We are seeking an experienced AI Engineer to design, build, and deploy advanced Generative AI and NLP solutions, with a focus on Retrieval-Augmented Generation (RAG) pipelines, document automation (OCR/ASR), and knowledge-assist systems. The ideal candidate will have strong hands-on experience with Python, Transformers, vector databases, and API deployment, and will be comfortable managing the entire lifecycle of AI models, from data preparation to monitoring in production.

Key Responsibilities

  • Design, build, and optimize RAG pipelines, including prompting, text chunking, retrieval, reranking, and evaluation using vector databases such as Chroma DB or Qdrant, and Transformer-based LLMs like Llama, Mistral, or BERT family models.
  • Productionize ASR systems (e.g., Whisper-large-v3) for call centre and voice-based use cases, ensuring improvements in both accuracy and latency.
  • Develop OCR and document digitization workflows using tools such as OpenCV and Tesseract, along with CNN/LSTM-based post-processing for unstructured PDFs and images.
  • Build and deploy APIs using Fast API or Flask, integrating with existing services and data sources such as MongoDB.
  • Orchestrate data and model workflows using Airflow, automating ETL processes, evaluation pipelines, and periodic retraining.
  • Implement CI/CD pipelines for model and API releases, ensuring strong testing, logging, and observability practices.
  • Manage both offline and online evaluation for metrics such as latency, accuracy, F1 score, ASR WER, and retrieval precision/recall, and provide detailed analytical reports and recommendations.
  • Collaborate closely with product and operations teams to translate business challenges into measurable ML objectives and service-level agreements (SLAs).

Must-Have Skills

  • Proficiency in Python (production-grade), PyTorch, and Hugging Face Transformers.
  • Strong understanding of RAG fundamentals, including text chunking, embedding selection, retrieval (dense and sparse), reranking, and evaluation frameworks.
  • Experience with vector search and data stores such as Chroma DB or similar technologies, along with solid data modelling and indexing expertise.
  • Practical knowledge of API development using Fast API or Flask, including RESTful best practices, authentication, rate limiting, and pagination.
  • Experience in MLOps using Docker, CI/CD, Linux, Git, and tools for logging and monitoring model services.
  • Exposure to OCR and ASR systems, particularly OpenCV, Tesseract, and Whisper (or equivalent frameworks).
  • Strong grasp of classical NLP and ML techniques, including tokenization, LSTMs/CNNs, XGBoost, and metric-driven Skills :
  • Experience fine-tuning large language models or encoders for classification, summarization, and domain adaptation.
  • Understanding of prompt engineering, tool integration, and evaluation for LLM-based applications.
  • Familiarity with scaling retrieval systems for low-latency, high-availability production environments.
  • Experience with document question answering, email or call centre analytics, or enterprise knowledge management.
  • Knowledge of agentic AI frameworks such as Lang Chain, Lang Graph, or Crew AI.

Qualifications

  • 2 to 5 years of hands-on experience in AI, ML, or NLP engineering with proven production ownership.
  • Bachelors degree in computer science, or a related field, or equivalent practical experience.
  • Strong portfolio or GitHub profile demonstrating shipped APIs, model implementations, and well-documented repositories with test coverage.

(ref:hirist.tech)

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 143745631