RLHF (Reinforcement Learning from Human Feedback).

training a model using human ratings of its outputs to make it more helpful and better-behaved.

training a model using human ratings of its outputs to make it more helpful and better-behaved. A big part of why modern assistants feel usable.

Updated 2026-06-03