
Search by job, company or skills
About the Role
Engineers who have built and shipped production-grade, AI-native platforms on
AWS. You'll work at the intersection of scalable backend systems and applied AI —
designing agentic workflows, integrating LLMs, and owning cloud infrastructure end-
to-end.
Key Responsibilities
Design and build scalable backend services and APIs (REST/gRPC) for AI-
powered products
Architect and deploy cloud-native solutions on AWS (ECS/EKS, Lambda,
SQS, S3, RDS/Aurora, Bedrock)
Build and maintain LLM integration layers — prompt pipelines, RAG systems,
vector stores
Develop and deploy agentic workflows using frameworks like LangGraph,
CrewAI, AutoGen, or similar
Own MLOps/LLMOps pipelines — model serving, evaluation, and
observability
Collaborate with product and ML teams to productionize AI features
Must-Have
6–8 years of backend engineering experience (Python and/or Node.js/Go)
Strong hands-on AWS experience — Serverless, container orchestration
Production experience with LLMs (OpenAI, Anthropic, Bedrock, or open-
source models)
Hands-on with at least one agentic framework (LangGraph, CrewAI,
AutoGen, Semantic Kernel, etc.)
Experience with vector databases (Pinecone, Weaviate, pgvector, or similar)
Solid understanding of RAG architecture, embeddings, and prompt
engineering
Good to Have
Model fine-tuning experience — SFT, LoRA/QLoRA on open-source models
(Llama, Mistral, etc.) using Hugging Face Transformers.
Fine-tuning pipelines on AWS (SageMaker Training Jobs, EC2 GPU, S3 data
management)
Familiarity with RLHF, or instruction tuning workflows
Multi-agent orchestration and tool use patterns
AI evaluation frameworks (RAGAS, DeepEval) and guardrails implementation
Prior experience building internal AI platforms or developer tooling
Job ID: 149071575
Skills:
Cloudformation, Tensorflow, Numpy, Jenkins, Git, Gcp, Pytorch, Pandas, Terraform, Spark, FastAPI, Azure, Python, Kubernetes, AWS, LangChain, scikit-learn, Hugging Face Transformers, MLflow, vLLM, TF Serving, GitHub Actions, TorchServe
Skills:
Java Programming Language, CSS, Debugging, Automation, Agile Methodology, HTML, Business Requirements, Application Development, Api, Web Services, Business Process, Python Programming Language, JavaScript Programming Language, Scrum Software Development, SQL Programming Language, Angular Web Framework
Skills:
SQL server, Deep Learning, Tensorflow, Postgres, Tableau, Presto, Machine Learning, Big Data Analytics, AWS, Pytorch, Powerbi, Python, Azure, Gcp, Scala, Spark, AI ML Modeling, Scikit-Learn, Data Stitching, Hugging Face Transformers, Sage Maker, Agentic AI Frameworks, LLM Expertise, Click house
Skills:
triton , Java, Rust, Distributed Systems, Kafka, Kotlin, Microservices, Kubernetes, High-throughput, LLMOps, Model Serving, Go, Inference, Pinecone, vLLM, GRPC, Vector Databases, Backend Scaling, RAG, Milvus, Low-latency, Weaviate
Skills:
Java, Gcp, Docker, Terraform, Ansible, Azure, Python, Kubernetes, AWS, CrewAI, LangChain, LLMs, Go, Pinecone, Semantic Kernel, AutoGen, FAISS, Weaviate
We don’t charge any money for job offers