GK Question

Category: technology | Difficulty: medium | Type: MCQ

Which technique enables LLMs to learn from human feedback so that their outputs align with human preferences?

Supervised Fine-Tuning
RLHF
Prompt Engineering
All of these

Answer: RLHF

Reinforcement Learning from Human Feedback (RLHF) trains a reward model on human rankings of model outputs, then fine-tunes the LLM with reinforcement learning to maximize that reward. It is a key technique for aligning AI systems with human values and safety goals.
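
As a rough illustration of the reward-model step, the sketch below computes a pairwise preference loss for one human ranking (a preferred response versus a rejected one). The Bradley-Terry style loss and the function name pairwise_preference_loss are assumptions chosen for illustration; the explanation above does not specify a particular loss, and real systems compute the rewards with a neural network rather than passing them in as numbers.

```python
import math


def pairwise_preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Loss for one human ranking: low when the chosen response scores higher.

    Hypothetical helper: computes -log(sigmoid(reward_chosen - reward_rejected)).
    """
    margin = reward_chosen - reward_rejected
    # Numerically stable form of log(1 + exp(-margin)).
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# The reward model agrees with the human ranking: small loss.
agrees = pairwise_preference_loss(reward_chosen=2.0, reward_rejected=0.0)
# The reward model disagrees with the human ranking: larger loss.
disagrees = pairwise_preference_loss(reward_chosen=0.0, reward_rejected=2.0)
print(agrees < disagrees)
```

Minimizing this loss over many ranked pairs pushes the reward model to score preferred responses higher; the LLM is then optimized against that learned reward in a separate reinforcement learning step, which is not shown here.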

Topic: Advanced AI/ML

Exam Relevance: UPSC, Banking, SSC
