199k views
1 vote
Which two are features of the Hadoop Distributed File System (HDFS)? (Select two)

A) High-speed data processing
B) Data replication for fault tolerance
C) Support for complex SQL queries
D) Real-time streaming data processing

User Hovercouch
by
8.1k points

1 Answer

5 votes

Final answer:

The features of the Hadoop Distributed File System (HDFS) are A) High-speed data processing and B) Data replication for fault tolerance.

Step-by-step explanation:

The Hadoop Distributed File System (HDFS) is a highly reliable storage system designed for large-scale data storage and processing. Among its key features, we find Data replication for fault tolerance and High-speed data processing. Data replication is a cornerstone of HDFS, ensuring that data is preserved in the event of a failure. By creating multiple replicas of data blocks and distributing them across different nodes, HDFS guarantees that the system can recover data even if some nodes fail.

Another important feature of HDFS is its ability to facilitate High-speed data processing. HDFS accomplishes this by maintaining a close proximity between the data and computational resources, which minimizes data transfer times and allows for efficient processing of large data sets. However, it is important to note that HDFS does not support complex SQL queries directly - this functionality is typically provided by other tools in the Hadoop ecosystem such as Apache Hive or Apache Drill. The answer is option a and d.

User Dale Alleshouse
by
8.3k points