Description:
As an Applied Research Engineer, you will play a key role in developing cutting-edge systems and methodologies to create, analyze, and leverage high-quality human-in-the-loop data for frontier model development. Your work will focus on designing and implementing advanced techniques that integrate human feedback into AI training processes, such as Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO). Additionally, you will innovate methods to measure and impr
Mar 14, 2025; from: dice.com