Hybrid Research Scientist, RL Training

Posted 3 days ago

About the role

  • Research Scientist at Snorkel AI focusing on reinforcement learning for training large language models. Collaborates with research and engineering teams to advance RL data capabilities.

Responsibilities

  • Research and implement reinforcement learning techniques including GRPO, RLHF, RLAIF, DPO, and reward modeling
  • Design and build data pipelines that generate high-quality training signal for RL workflows
  • Prototype and iterate on end-to-end RL training recipes
  • Work closely with research scientists, ML engineers, and delivery teams
  • Stay current with the latest developments in large-scale multi-node LLM training

Requirements

  • Deep expertise in reinforcement learning from human or AI feedback
  • Experience training or fine-tuning large language models with 30B+ parameters at scale
  • Strong proficiency in Python and ML frameworks, especially PyTorch and HuggingFace
  • Solid software engineering fundamentals
  • Familiarity with ML infrastructure and cloud platforms
  • Comfort operating in a high-iteration environment
  • Ph.D. in machine learning, reinforcement learning, or a related field strongly preferred

Benefits

  • Health insurance
  • Professional development opportunities
  • Flexible work arrangements

Job title

Research Scientist, RL Training

Job type

Experience level

Mid level, Senior

Salary

$200,000 - $275,000 per year

Degree requirement

Postgraduate Degree

Location requirements
