
Search by job, company or skills

Job ID: 141187001
Skills:
Ml, Jax, Pytorch, Python, DPO, large language models, distributed training, synthetic data generation, RLAIF, Ai, SFT, RLHF, reward modeling, preference data curation, ppo
Skills:
Deep Learning, python, Machine Learning, Natural Language Processing, Tensorflow, gen ai
We don’t charge any money for job offers