
Search by job, company or skills
We are looking for a talented and highly motivated engineer to help advance our effort on creating the most efficient Multi-Model LLM models, with a specific focus on creating value for Enterprises. The candidate will be responsible for training/finetuning of MLLM models, developing prototype solutions to IBM z17 AIU hardware, working closely with IBM AIU teams, and IBM scientists in a flexible and fun environment.
. 5+ years programming and hands on experience in python
. Experience with Finetuning Multi-Model LLM models and internals of training stacks
. Experience with Pytorch and FSDP
. Exposure to lare scale distributed training tuning
. Exposure to working with both text and image datasets
Exposure to Hugging Face Ecosystem, Trasnformer Models, Vision Models, and Diffusion Models
Exposure to distributed foundation model training
BTech/Masters/PhD in Computer Science or allied fields
ABOUT IBM IBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world. Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we're also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business. At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it's time for you to join us on our journey to being a responsible technology innovator and a force for good in the world. IBM is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
Job ID: 116908101
Skills:
Deep Learning, Pytorch, Python, ASR, Transformers, LLMs, distributed training, retrieval-augmented generation, agentic systems, speech understanding, Optimization, efficient inference
Skills:
metaheuristics , snowflake , Data Structures, Performance Tuning, Concurrency, MongoDB, Rest Apis, Python, heuristics, CPLEX, Ai, OR-Tools, Microservices Architecture, Operations Research, Gurobi
Skills:
Machine Learning, Python, Computer Vision, Deep Learning, Data-Driven Solutions, Edge Deployments, Driver Assistance Algorithms, Machine Learning Infrastructure
Skills:
Python, Statistical Analysis, eval frameworks, Transformers, Data Processing, RL for agents, DPO, Go, RLHF
Skills:
C, Java, Python, Kubernetes, Docker, Javascript, MLops, LLMOps, Realtime APIs, Distributed System Design, AI Domain Expertise
We don’t charge any money for job offers