18.0k views
0 votes
Which statement best explains the concept of a Resilient Distributed Dataset (RDD)?

a) RDD is a data storage format.
b) RDD is a programming language.
c) RDD is a type of database.
d) RDD is a fundamental data structure in Spark for distributed data processing.

User Jeremywat
by
8.7k points

1 Answer

7 votes

Final answer:

The correct option is d: an RDD is a fundamental data structure in Spark for distributed data processing. Its data is split into partitions that can be processed in parallel.

Step-by-step explanation:

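An RDD, or Resilient Distributed Dataset, is a fundamental data structure in Spark for distributed data processing. It is not a data storage format (a), a programming language (b), or a type of database (c). An RDD is an immutable, fault-tolerant collection of objects partitioned across the machines of a cluster, so each partition can be processed in parallel. "Resilient" refers to how faults are handled: Spark records the chain of transformations (the lineage) that produced an RDD, so a lost partition can be recomputed from its source rather than restored from a replica.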
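To make the three ideas in the name concrete (partitioned data, transformations that produce new datasets, and recovery through lineage), here is a minimal single-process sketch in plain Python. It is a toy model, not Spark: the class `ToyRDD` and its `lose_partition` method are hypothetical names invented for illustration, and nothing here runs on a cluster. The method names `parallelize`, `map`, `filter`, and `collect` were chosen to mirror the corresponding Spark operations.

```python
from typing import Any, Callable, Dict, Iterable, List, Optional


class ToyRDD:
    """Toy stand-in for an RDD (hypothetical, not the Spark API)."""

    def __init__(
        self,
        num_partitions: int,
        source: Optional[List[List[Any]]] = None,
        parent: Optional["ToyRDD"] = None,
        fn: Optional[Callable[[List[Any]], List[Any]]] = None,
    ) -> None:
        self.num_partitions = num_partitions
        self._source = source  # partitions of a root dataset
        self._parent = parent  # lineage: the dataset this one derives from
        self._fn = fn          # per-partition function applied to the parent
        self._materialized: Dict[int, List[Any]] = {}

    @classmethod
    def parallelize(cls, data: Iterable[Any], num_partitions: int) -> "ToyRDD":
        """Split data into contiguous partitions."""
        items = list(data)
        size = -(-len(items) // num_partitions)  # ceiling division
        parts = [items[i * size:(i + 1) * size] for i in range(num_partitions)]
        return cls(num_partitions, source=parts)

    def map(self, f: Callable[[Any], Any]) -> "ToyRDD":
        """Return a new dataset; the original is left unchanged."""
        return ToyRDD(self.num_partitions, parent=self,
                      fn=lambda part: [f(x) for x in part])

    def filter(self, pred: Callable[[Any], bool]) -> "ToyRDD":
        return ToyRDD(self.num_partitions, parent=self,
                      fn=lambda part: [x for x in part if pred(x)])

    def compute_partition(self, i: int) -> List[Any]:
        """Compute one partition, walking the lineage back if needed."""
        if i in self._materialized:
            return self._materialized[i]
        if self._parent is None:
            result = list(self._source[i])
        else:
            result = self._fn(self._parent.compute_partition(i))
        self._materialized[i] = result
        return result

    def lose_partition(self, i: int) -> None:
        """Simulate losing the computed copy of one partition."""
        self._materialized.pop(i, None)

    def collect(self) -> List[Any]:
        """Gather all partitions into one list."""
        out: List[Any] = []
        for i in range(self.num_partitions):
            out.extend(self.compute_partition(i))
        return out


numbers = ToyRDD.parallelize(range(1, 9), num_partitions=3)
even_squares = numbers.filter(lambda x: x % 2 == 0).map(lambda x: x * x)

print(even_squares.collect())             # [4, 16, 36, 64]
even_squares.lose_partition(1)            # pretend a worker failed
print(even_squares.compute_partition(1))  # rebuilt from lineage: [16, 36]
```

Each partition is computed independently of the others, which is what lets a real cluster work on them in parallel. The lost partition is rebuilt by re-running `filter` and `map` on its parent partition only, which is the lineage idea in miniature.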

User Danko
by
7.3k points