technology hard Fill in the Blank

The technique that enables LLMs to learn from feedback without explicit labels is called ________ Learning.

Reinforcement / RLHF
Multimodal
Reinforcement
Building Energy / BEMS

Answer: Reinforcement / RLHF

Reinforcement Learning from Human Feedback (RLHF) trains reward models from human preferences, then optimizes LLM to maximize rewards. Critical for aligning AI with human values.

Topic Advanced AI/ML

Exam Relevance UPSC, Banking, SSC

More technology Questions Back to all questions

Quick Quiz Actions

Create a custom practice set

Weekly challenge coming soon

The technique that enables LLMs to learn from feedback without explicit labels is called ________ Learning.