Generative AI Engineer

Exl

Noida, India

Fresher

Posted 2 months ago
Over 50 applicants

Job Description

Data Scientist

Key Responsibilities:

Fine-tune and adapt open-source LLMs (e.g., LLaMA 4, Mistral, and BERT) using NVIDIA GPU tools.
Build AI agents using frameworks like LangChain, LangGraph, or AutoGen with structured workflows (memory, tools, retries, etc.).
Implement hybrid LLM solutions using OpenAI/Claude APIs and open-source models.
Develop APIs using FastAPI and containerize apps with Docker.
Deploy, monitor, and scale AI solutions on AWS, Azure, or similar cloud providers.
Collaborate with senior engineers to optimize performance and reliability of deployed systems.

Requirements:

Hands-on experience with LLM fine-tuning and NVIDIA GPU toolkits (CUDA).
Familiarity with LangChain or similar agent frameworks.
Experience developing APIs with FastAPI and deploying via Docker.
Proficiency in using OpenAI/Anthropic APIs and building basic RAG pipelines.
Solid foundation in Python, cloud deployment (AWS/Azure), and vector databases (e.g., FAISS, Pinecone).

Nice to have:

Exposure to tools like LangServe and Semantic Kernel.
Familiarity with CI/CD pipelines and monitoring tools (e.g., GitHub Actions, Prometheus).
Contribution to open-source AI/ML projects.

Soft Skills:

· Strong communication skills - both verbal and written

· Excellent problem-solving and debugging skills

· Self-motivated with the ability to work independently and in a team

· Comfortable working with stakeholders across different time zones

About Company

Exl

Job Source: www.linkedin.com

Job ID: 128718765

Last Updated: 11-05-2026 06:27:21 PM

Similar Jobs

Generative AI Engineer

Xceedance

Delhi, India

Skills:

JAX, TensorFlow, NLTK, PyTorch, Docker, FastAPI, Python, LangChain, Hugging Face Transformers, MLflow, Pinecone, Anthropic, spaCy, Mistral, BERT, Milvus, OpenAI, LlamaIndex

Generative AI Engineer

Yabx Technologies

2-5 yrs

Gurugram, India

Skills:

Flask, FastAPI, Python, vector search, embeddings, session handling, context management, conversational AI workflows, RAG pipelines

NLP, Data Structures, Deep Learning, Prompt Engineering techniques, Semantic search techniques, Embedding models, Infrastructure setup, AI model inferencing, Parallel Processing, CUDA architecture, Transformer architectures

Generative AI Engineer

TAC Security

Delhi, India

Skills:

Distributed Systems, Python, multimodal systems, vector databases, Pinecone, custom retrieval layers, Swarm, Chroma, transformer architectures, RLHF, LangChain, CrewAI, AI safety, agent frameworks, AutoGen, LLM fine-tuning, LoRA, MLOps tooling, generative AI models, Milvus, OpenAI, Weaviate, QLoRA, LlamaIndex, red-teaming