You have A 20 node Hadoop cluster, with 18 slave nodes and 2 master nodes running HDFS High Availability (HA). You want to minimize the chance of data loss in your cluster. What should you do?
A.
Add another master node to increase the number of nodes running the JournalNode which increases the number of machines available to HA to create a quorum
B.
Set an HDFS replication factor that provides data redundancy, protecting against node failure
C.
Run a Secondary NameNode on a different master from the NameNode in order to provide automatic recovery from a NameNode failure.
D.
Run the ResourceManager on a different master from the NameNode in order to load-share HDFS metadata processing
E.
Configure the cluster’s disk drives with an appropriate fault tolerant RAID level
Chosen Answer:
This is a voting comment (?). You can switch to a simple comment. It is better to Upvote an existing comment if you don't have anything to add.
Submit