Which of the following best describes a Resilient Distributed Dataset (RDD)?

a) RDD is a data storage format.
b) RDD is a programming language.
c) RDD is a type of database.
d) RDD is a fundamental data structure in Spark for distributed data processing.

User Jeremywat
by
8.7k points

1 Answer

7 votes

Final answer:

d) An RDD is a fundamental data structure in Spark for distributed data processing; it allows data to be processed in parallel.

Step-by-step explanation:

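An RDD (Resilient Distributed Dataset) is Spark's fundamental data structure for distributed data processing, so option d is correct. It is not a data storage format, a programming language, or a type of database. An RDD is an immutable, fault-tolerant collection of objects partitioned across the machines of a cluster, so each partition can be processed in parallel. If a partition is lost, Spark can recompute it from the recorded sequence of transformations (its lineage) that produced it.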
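
To make "fault-tolerant", "partitioned", and "parallel" more concrete, here is a minimal single-process sketch in plain Python. It is not Spark's API: the class ToyRDD and its methods (parallelize, map, filter, compute_partition, collect) are hypothetical names chosen for illustration, and threads stand in for the machines of a cluster. Each ToyRDD only remembers how to compute its partitions from its parent, which is roughly the idea behind rebuilding lost data from lineage.

```python
from concurrent.futures import ThreadPoolExecutor


class ToyRDD:
    """A toy, single-process stand-in for an RDD (hypothetical, not Spark's API)."""

    def __init__(self, num_partitions, compute):
        # compute(i) rebuilds partition i from scratch; this plays the role of lineage.
        self.num_partitions = num_partitions
        self._compute = compute

    @classmethod
    def parallelize(cls, data, num_partitions):
        data = tuple(data)  # immutable snapshot of the source data
        # Round-robin split: element k goes to partition k % num_partitions.
        return cls(num_partitions, lambda i: data[i::num_partitions])

    def map(self, fn):
        # Lazy transformation: returns a new ToyRDD that remembers its parent and fn.
        parent = self
        return ToyRDD(self.num_partitions,
                      lambda i: tuple(fn(x) for x in parent._compute(i)))

    def filter(self, pred):
        # Lazy transformation: keeps only the elements for which pred is true.
        parent = self
        return ToyRDD(self.num_partitions,
                      lambda i: tuple(x for x in parent._compute(i) if pred(x)))

    def compute_partition(self, i):
        # Recompute a single partition on demand by replaying its lineage.
        return self._compute(i)

    def collect(self):
        # Action: evaluate every partition in a thread pool, then concatenate
        # the results in partition order.
        with ThreadPoolExecutor() as pool:
            parts = list(pool.map(self._compute, range(self.num_partitions)))
        return [x for part in parts for x in part]


numbers = ToyRDD.parallelize(range(10), num_partitions=3)
even_squares = numbers.map(lambda x: x * x).filter(lambda x: x % 2 == 0)

# Nothing has been computed yet; collect() triggers the work.
result = sorted(even_squares.collect())   # [0, 4, 16, 36, 64]

# Simulate losing partition 1: it can be rebuilt from lineage alone.
rebuilt = even_squares.compute_partition(1)   # (16,)
```

In real Spark the partitions live on different executors and the scheduler decides where to recompute a lost one; the sketch only shows why keeping the recipe, rather than a copy of the data, is enough to recover it.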

User Danko
by
7.3k points