RLHF

Alias for Reinforcement Learning from Human Feedback.