Glossary ›
RLHF (Reinforcement Learning from Human Feedback).
training a model using human ratings of its outputs to make it more helpful and better-behaved.
training a model using human ratings of its outputs to make it more helpful and better-behaved. A big part of why modern assistants feel usable.
Updated 2026-06-03