GK Question

Category: technology | Difficulty: hard | Type: fill in the blank

The technique that enables LLMs to reason about visual inputs by combining vision and language models is called ________.

Answer: Multimodal Learning

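Multimodal models jointly process more than one kind of input, such as text, images, and audio. CLIP and LLaVA, for example, pair a vision encoder with a language model to handle images and text together. This enables visual question answering, image captioning, and cross-modal retrieval, which many newer AI applications depend on.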
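
To make cross-modal retrieval concrete, here is a minimal sketch assuming images and captions have already been mapped into one shared embedding space, as CLIP-style models do. The 3-dimensional vectors, file names, and the `retrieve_image` helper are hypothetical and hand-made for illustration; a real model would produce the embeddings with learned encoders.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hypothetical image embeddings (a vision encoder would produce these).
IMAGE_EMBEDDINGS = {
    "dog.jpg": [0.9, 0.1, 0.0],
    "car.jpg": [0.1, 0.9, 0.1],
    "beach.jpg": [0.0, 0.2, 0.9],
}

# Hypothetical text embeddings in the same space (a text encoder would produce these).
TEXT_EMBEDDINGS = {
    "a photo of a dog": [0.8, 0.2, 0.1],
    "a red sports car": [0.2, 0.8, 0.0],
}

def retrieve_image(query):
    """Return the image whose embedding is most similar to the text query."""
    q = TEXT_EMBEDDINGS[query]
    return max(IMAGE_EMBEDDINGS, key=lambda name: cosine(q, IMAGE_EMBEDDINGS[name]))

if __name__ == "__main__":
    for caption in TEXT_EMBEDDINGS:
        print(caption, "->", retrieve_image(caption))
```

Because both modalities live in the same vector space, a text query can be compared directly against images by similarity score, which is the core idea behind text-to-image search.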

Topic: Advanced AI/ML

Exam Relevance: UPSC, Banking, SSC
