Quick Quiz Actions

Generate Quiz

Create a custom practice set

Pick category, difficulty, number of questions, and time limit. Start instantly with your own quiz.

Generate Quiz

Weekly Quiz

Weekly challenge coming soon

No weekly quiz is published yet. Check the weekly page for the latest updates.

View Weekly Page

GK Question

technology hard fill_blank

The technique that enables LLMs to learn from feedback without explicit labels is called ________ Learning.

Answer: Reinforcement / RLHF

Reinforcement Learning from Human Feedback (RLHF) trains reward models from human preferences, then optimizes LLM to maximize rewards. Critical for aligning AI with human values.

Topic Advanced AI/ML

Exam Relevance UPSC, Banking, SSC

More in technology Back to all questions