
Showing 2 jobs
Skills:
ML, JAX, PyTorch, Python, DPO, large language models, distributed training, synthetic data generation, RLAIF, AI, SFT, RLHF, reward modeling, preference data curation, PPO
Skills:
Deep Learning, Python, Machine Learning, Natural Language Processing, TensorFlow, Gen AI
