
Search by job, company or skills
About the Company
We are looking for an experienced LLM Ops Engineer to own the end-to-end lifecycle of LLM applications in production - from model selection and pipeline design through fine-tuning, deployment, observability, and continuous improvement. This role sits at the intersection of ML Engineering, DevOps, and Data Engineering, and is critical to ensuring that GenAI systems are reliable, cost-efficient, and scalable in enterprise environments. You will partner closely with AI Research, Product, Platform, and Data Engineering teams.
About the Role
We are looking for an experienced LLM Ops Engineer to own the end-to-end lifecycle of LLM applications in production.
Responsibilities
Qualifications
Required Skills
Preferred Skills
Pay range and compensation package
6 – 10+ Years Overall in software / ML engineering
3+ Years Hands-on production LLM/ML lifecycle
Equal Opportunity Statement
We are committed to diversity and inclusivity.
Job ID: 148089179
Skills:
triton , Tensorflow, Machine Learning, Pytorch, MLops, Cuda, Generative AI, Transformer Models, GAN, AOT, TRT
Skills:
Algorithms, Hadoop, Node.js, Kafka, Tensorflow, Django, React, Pytorch, Gcp, Docker, Spark, data structures, Azure, Kubernetes, Python, AWS, Airflow, scikit-learn, transfer learning, generative AI technologies, prompt engineering, Vector Databases, RAG architectures
Skills:
Numpy, Pandas, Pytorch, Docker, Python, AWS, Airflow, agentic design patterns, scikit-learn, ML data libraries, MLflow, prompt design, LLM core concepts
Skills:
Python, cold-start problem-solving strategies, end-to-end ML pipelines, learning-to-rank techniques, feature engineering, deep retrieval models, offline and online evaluation, metric alignment for recommendation systems, model serving, collaborative filtering
Skills:
Computer Vision, Deep Learning, Tensorflow, Jax, AWS, Pytorch, Kubernetes, Python, Azure, Gcp, Docker, GANs, Inpainting methods, Image processing techniques, Image-to-image generation, VAEs, CNNs, Generative AI, Deploying Vision models on edge devices, Diffusion models
We don’t charge any money for job offers