GK Question

technology medium mcq

Which technology enables distributed processing of large datasets across clusters?

  1. MySQL
  2. Apache Spark
  3. MongoDB
  4. Redis

Answer: Apache Spark

Apache Spark provides in-memory distributed computing for big data: batch, streaming, ML, graph processing. Faster than Hadoop MapReduce due to in-memory processing. Supports Scala, Python, SQL. Critical for big data engineering and analytics questions.

Topic Data Engineering
Exam Relevance Banking, SSC JE, UPSC