
Search by job, company or skills
Project description
We are seeking highly skilled and motivated Generative AI Engineers to join our growing AI team. You will be responsible for designing, developing, and deploying cutting-edge Generative AI solutions using LLMs, Transformers, and Diffusion models. This role involves working on enterprise-grade applications such as intelligent chatbots, document summarization, code assistants, and more.
Responsibilities
Design, prototype, and deploy Generative AI models (LLMs, Transformers, Diffusion models) for real-world enterprise use cases.
Build and fine-tune LLM-based applications such as: Chatbots, Document Q&A systems, Report generators, Code assistants
and Summarization tools.
Apply prompt engineering, Retrieval-Augmented Generation (RAG), and context-aware pipelines to enhance model accuracy and relevance.
Integrate AI models with enterprise systems, APIs, and data stores using Python, Java, or Node.js.
Collaborate with architects to define scalable, secure, and cost-efficient AI service architectures.
Implement AI/ML pipelines for training, validation, and deployment using tools like MLflow, Vertex AI, or Azure ML.
Monitor model performance, detect drift, and drive continuous improvement.
Optimize inference performance and cost through model compression, quantization, and API optimization.
Ensure compliance with AI ethics, security, and governance standards.
Prepare and curate training datasets (structured/unstructured text, images, code).
Apply data preprocessing, tokenization, and embedding generation techniques.
Work with vector databases (e.g., Pinecone, Weaviate, FAISS, Chroma) for semantic search and retrieval.
Partner with business stakeholders to identify and shape impactful AI use cases.
Contribute to the development of a strategic AI adoption roadmap and reusable AI Workbench/platform components.
Support POCs, pilots, and full-scale implementations using agile methodologies.
Document and present solution designs, technical findings, and outcomes to leadership and clients.
Skills
Must have
Overall 7+ years of experience and relevant 4+ years of GenAI.
Strong programming skills in Python (preferred), with experience in Java or Node.js.
Hands-on experience with LLMs (e.g., GPT, LLaMA, Claude, Mistral), Transformers, and Diffusion models.
Experience with Hugging Face Transformers, LangChain, LLM orchestration frameworks, and prompt tuning.
Familiarity with RAG pipelines, embedding models, and vector databases.
Experience with cloud platforms (AWS, GCP, Azure) and AI/ML services.
Knowledge of MLOps tools and practices (e.g., MLflow, Kubeflow, Vertex AI, Azure ML).
Strong understanding of data engineering, data pipelines, and ETL workflows.
Excellent problem-solving, communication, and stakeholder engagement skills.
Bachelor's or Master's degree in Computer Science, AI/ML, Data Science, or related field.
Nice to have
N/A
Job ID: 148323323
Skills:
Gcp, Containers, Azure, Python, AWS, LangChain, embeddings, LLMs, Hugging Face, LLM GenAI delivery, cloud platforms, vector databases, Pinecone, Anthropic, Claude, CI CD, Monitoring, LangGraph, FAISS, RAG, ChromaDB, semantic search
Skills:
Python Programming, LangChain, Agentic AI frameworks, Multi-Agent Systems, Prompt design and memory management, AI Agent Development, OpenAI Agent SDK, LLM evaluation methods, Prompt Engineering
Skills:
Rest Apis, Understanding of agentic AI multi-modal agents parallel processing optimization-evaluation workflows and routers, JSON webhooks, LLM models Claude OpenAI Perplexity, Complex AI automation workflows and AI wrapper techniques, custom node development using JavaScript, Strong knowledge of relational and vector databases PostgreSQL Pinecone MongoDB Elasticsearch Redis, Nice to have RAG systems knowledge
Skills:
Pandas, Python, LangChain, Generative AI, AI ML concepts, LLMs
Skills:
Machine Learning, Python, Tuning, Sql, Deep Learning, MLops, GPUs, AI Frameworks, LangGraph, Evaluation, API integrations, Deployment, LangChain, Generative AI, LLMs, Model training, prompt engineering, AutoGen, planning tool?use, Agentic AI, RAG, Cloud experience, LlamaIndex, autonomous agents
We don’t charge any money for job offers