
Search by job, company or skills
About the Role:-
We are seeking a Generative AI Engineer with 2+ years of experience to build and scale production-ready Agentic Systems using Large Language Models (LLMs). You will work on RAG pipelines, agent workflows, evaluation, and deployment of reliable AI systems.
Key Responsibilities:-
-Build retrieval-augmented generation (RAG) pipelines with embeddings, hybrid search, and reranking
-Implement agent orchestration, tool/function calling, and prompt management
-Develop evaluation, monitoring, and observability for LLM systems
-Ensure AI safety, governance, and data privacy best practices
-Optimize performance using caching, batching, and streaming
-Package solutions as APIs/SDKs and deploy using cloud-native tools
Tech Stack:-
-Languages: Python (primary), TypeScript
-Frameworks: FastAPI/Flask, LangChain/LlamaIndex
-LLMs: OpenAI, Anthropic, Google, Azure OpenAI, AWS Bedrock
-Infra: Docker, CI/CD, Terraform/CDK
-Cloud: AWS / Azure / GCP (any one)
Requirements:-
-2+ years of software or AI engineering experience
-Strong Python and backend development skills
-Hands-on experience with LLMs and cloud deployments
-Understanding of scalable systems and data security
What We Offer:-
-Work on cutting-edge Generative AI products
-High ownership and learning opportunities
-Competitive compensation and growth
Job ID: 136456609
Skills:
Gcp, Flask, FastAPI, Azure, Python, AWS, LangChain, embeddings, LLMs, Hugging Face, vector databases, Anthropic, Pinecone, Claude, LangGraph, FAISS, RAG, Streamlit, ChromaDB, semantic search
Skills:
Pandas, Python, LangChain, Generative AI, AI ML concepts, LLMs
Skills:
Machine Learning, Python, Tuning, Sql, Deep Learning, MLops, GPUs, AI Frameworks, LangGraph, Evaluation, API integrations, Deployment, LangChain, Generative AI, LLMs, Model training, prompt engineering, AutoGen, planning tool?use, Agentic AI, RAG, Cloud experience, LlamaIndex, autonomous agents
Skills:
Sql, ELT, Tensorflow, REST, MLops, Pytorch, Gcp, Docker, Kubernetes, Python, Etl, GenAI, GKE, scikit-learn
Skills:
Flask, FastAPI, Python, vector search, embeddings, session handling, context management, conversational AI workflows, RAG pipelines
We don’t charge any money for job offers