Which statement is true about Spark's Resilient Distributed Dataset (RDD)?

A) RDDs are immutable and can't be modified once created.
B) RDDs are stored in a traditional relational database.
C) RDDs can be modified and updated dynamically.
D) RDDs are only available in Python, not in other programming languages.

asked by User Jeniffer (7.6k points)

1 Answer


Final answer:

The true statement is that RDDs are immutable and cannot be modified once created. They are distributed across a cluster, and the RDD API is available in several programming languages, not only Python.

Step-by-step explanation:

The correct statement about Spark's Resilient Distributed Dataset (RDD) is A) RDDs are immutable and can't be modified once created. RDDs are a fundamental data structure of Apache Spark, a cluster-computing framework. Once an RDD has been created, it cannot be changed in place. Instead, it is transformed into a new RDD through transformations such as map, filter, and flatMap (reduce, by contrast, is an action that returns a value rather than a new RDD). These transformations produce new datasets by applying functions to the existing data. For example, you might have an RDD containing numbers and use a map transformation to create a new RDD with each number squared.
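To make that squaring example concrete, here is a minimal PySpark sketch (the app name, local master, and variable names are just placeholders): map yields a new RDD while the original RDD is left untouched.

```python
from pyspark import SparkContext

# Illustrative local context; in a real job the master would point at a cluster.
sc = SparkContext("local[*]", "rdd-immutability-demo")

numbers = sc.parallelize([1, 2, 3, 4, 5])   # original RDD
squares = numbers.map(lambda x: x * x)      # transformation -> brand-new RDD

print(squares.collect())   # [1, 4, 9, 16, 25]
print(numbers.collect())   # [1, 2, 3, 4, 5] -- the original RDD is unchanged

sc.stop()
```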

RDDs are not stored in a traditional relational database, as option B suggests. They are held in memory or on disk across a cluster in a fault-tolerant manner. Option C is incorrect because RDDs are immutable, meaning they cannot be modified dynamically once defined. Option D is also incorrect: the RDD API is available in several languages supported by Spark, including Scala, Java, and Python, not just Python.
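The storage point can be illustrated with another short PySpark sketch (again with illustrative names): an RDD can be persisted with a storage level that keeps partitions in memory and spills them to disk when needed; no relational database is involved.

```python
from pyspark import SparkContext, StorageLevel

sc = SparkContext("local[*]", "rdd-persist-demo")

events = sc.parallelize(range(1000000), numSlices=8)
evens = events.filter(lambda x: x % 2 == 0)

# Keep partitions in memory, spilling to disk if memory runs short.
evens.persist(StorageLevel.MEMORY_AND_DISK)

print(evens.count())   # first action computes and caches the partitions
print(evens.count())   # later actions reuse the cached partitions

sc.stop()
```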

answered by User Shinya Koizumi (8.4k points)