Search by job, company or skills

A

Gen AI Engineer

3-5 Years
Save
  • Posted a month ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Job Description

Hiring for Gen AI Engineer

About the Role

We are building next-generation AI-powered products that help businesses improve their digital presence and customer engagement in an AI-first world. As a fast-growing technology company, we are solving complex problems at the intersection of Generative AI, scalable systems, and intelligent user experiences.

We're looking for a Gen AI Engineer to build and scale AI-native products powered by LLMs, retrieval systems, autonomous workflows, and modern AI infrastructure. You'll work closely with product, backend, frontend, and data teams to design intelligent systems that deliver reliable, high-quality AI experiences for real-world users.

This is an opportunity to work on cutting-edge AI technologies, solve complex engineering problems, and help shape the future of AI-powered software products.


What You'll Do

  • Design, develop, and deploy AI-powered applications using Large Language Models (LLMs) and modern AI frameworks
  • Build intelligent workflows involving prompt engineering, retrieval-augmented generation (RAG), agents, orchestration systems, and evaluation pipelines
  • Develop scalable backend services and APIs that power AI-driven product experiences
  • Integrate and optimize LLM providers such as OpenAI, Anthropic, Gemini, or open-source models
  • Build AI pipelines for data ingestion, embeddings, retrieval, ranking, and context management
  • Improve latency, reliability, observability, and cost optimization across AI systems
  • Collaborate with frontend, backend, and product teams to deliver seamless AI experiences
  • Design and maintain evaluation systems for AI quality, hallucination reduction, and model performance
  • Experiment rapidly with new AI technologies, frameworks, and tooling
  • Write technical documentation, architecture notes, and operational runbooks

What We're Looking For

Gen AI & LLM Expertise

  • 3+ years of software engineering experience with exposure to AI/ML or Generative AI systems
  • Strong hands-on experience working with LLMs, prompt engineering, RAG pipelines, and AI orchestration frameworks
  • Familiarity with frameworks such as LangChain, LlamaIndex, DSPy, Semantic Kernel, or similar tools
  • Experience integrating commercial and/or open-source foundation models into production systems
  • Understanding of embeddings, vector databases, retrieval systems, and AI evaluation techniques

Engineering & System Design

  • Strong programming skills in Python, Node.js, or similar backend technologies
  • Experience building scalable APIs, distributed systems, and cloud-native applications
  • Familiarity with databases, asynchronous processing, caching, and event-driven architectures
  • Understanding of observability, monitoring, and debugging for AI-powered systems

AI Product Mindset

  • Ability to think beyond model outputs and focus on customer experience, reliability, and business outcomes
  • Strong understanding of AI failure modes, hallucinations, latency trade-offs, and human-in-the-loop workflows
  • Experience designing AI systems with quality, scalability, security, and cost in mind
  • Comfortable experimenting quickly and iterating based on product feedback

Infrastructure & Deployment

  • Experience with cloud platforms such as AWS, GCP, or Azure
  • Familiarity with Docker, Kubernetes, CI/CD pipelines, and deployment workflows
  • Experience with vector databases such as Pinecone, Weaviate, Chroma, or FAISS is preferred
  • Exposure to model serving, inference optimization, and GPU-based workloads is a plus

Engineering Excellence

  • Passion for writing clean, maintainable, and scalable code
  • Strong debugging and problem-solving skills across AI and backend systems
  • High ownership, bias for action, and pragmatic decision-making
  • Ability to balance experimentation with production reliability

Communication & Collaboration

  • Strong written and verbal communication skills
  • Ability to explain complex AI concepts clearly to technical and non-technical stakeholders
  • Collaborative mindset with the ability to thrive in fast-paced environments

Preferred Qualifications

  • Experience building AI copilots, AI agents, conversational systems, or autonomous workflows
  • Familiarity with fine-tuning, evaluation frameworks, and reinforcement learning concepts
  • Startup or high-growth product company experience is a plus
  • Contributions to AI open-source projects or research-driven engineering work are preferred

Why Join Us

  • Opportunity to work on cutting-edge AI-driven products
  • High ownership and direct impact on AI product direction
  • Collaborative and fast-moving engineering culture
  • Work with smart, ambitious, and supportive teammates
  • Exposure to the latest advancements in Generative AI and LLM ecosystems
  • Flexible work environment and growth opportunities
  • Competitive compensation and benefits package

#GenAIEngineer #GenerativeAI #LLM #AIEngineer #MachineLearning #ArtificialIntelligence #PromptEngineering #RAG #LangChain #LlamaIndex #OpenAI #Anthropic #AIProducts #PythonDeveloper #BackendEngineering #AIAgents #VectorDatabases #CloudEngineering #StartupHiring #TechJobs #EngineeringJobs #AIHiring #DeveloperJobs #TechCareers #HiringNow

Check Your Resume for Match

Upload your resume and our tool will compare it to the requirements for this job like recruiters do.

More Info

Job Type:
Function:
Employment Type:

About Company

Antal International is a global executive search organisation with over 130 offices in more than 30 countries. We have a network of over 800 people operating under the Antal brand, successfully placing talent for professional positions in over 75 countries around the world. We believe our value and uniqueness lie in our skill base and industry

Job ID: 148246683

Similar Jobs

Chennai, India

Skills:

TensorflowPytorchDockerKerasKubernetesPython

Hyderabad, India

Skills:

ContainersPytorchGcpAWSKubernetesPythonAzurescikit-learnES|QLElandHugging FaceembeddingsELSERanomaly detectionhybrid searchprompt designElasticsearch inference NLP capabilitiesLLM integrationvector searchML librariesGen AI RAGElastic MLdata frame analyticsretrieval pipelinesElastic StackElasticsearch DSLsemantic searchTransformers

Chennai, India

Skills:

BigQueryGoogle Cloud PlatformAPI designRESTMLopsDockerIamFastAPIKubernetesPythonGemini Enterprise Agent PlatformCloud FunctionsGRPCPub SubCloud RunGenAI applicationsVertex AIGCSRAG pipelines

Chennai, India

Skills:

JavaDevopsDockerTerraformFastAPIKubernetesPythonLangChainAzureMLGPU containersGovector databasesIaCAzure OpenAICUDA driversDevOps IaCGitHub ActionsasyncioAzure AKSPydanticBicepAzure AI SearchLlamaIndex

Bengaluru, India

Skills:

AgileJavaTensorflowAzure Machine LearningPytorchKubernetesPythonDockerApisGitRGPTHugging FaceAWS SageMakerData preprocessingGCP AI platformOpenAI APIAgentic AIGenerative AITransformers