
About the Role
We're looking for a mid-level Machine Learning Engineer to join our team and help build, deploy, and scale ML solutions that drive real impact. You'll work on the full ML lifecycle, from problem formulation and experimentation to production deployment and monitoring. This role is ideal for someone who has moved beyond the basics and is ready to own projects while continuing to grow their expertise.
What You'll Do
You'll design and implement machine learning models to solve business problems, working closely with data scientists, software engineers, and product teams. Your responsibilities will include building data pipelines, training and evaluating models, deploying them to production environments, and monitoring their performance over time. You'll be responsible for hosting and serving open-source models at scale, optimizing inference performance, and fine-tuning models for specific use cases. You'll contribute to our ML infrastructure, help establish best practices, and mentor junior team members. We expect you to balance moving quickly with building robust, maintainable systems.
What We're Looking For
Nice to Have
Experience with Kubernetes for orchestration and scaling, other inference frameworks (TensorRT-LLM, DeepSpeed, Ray Serve), knowledge of model quantization techniques (GPTQ, AWQ, bitsandbytes), familiarity with distributed training frameworks, experience with vector databases and RAG architectures, contributions to open-source ML projects, or experience with MLOps tools and practices would all be valuable additions.
Job ID: 144630641