DATABASE FUNDAMENTALS
BASICS OF BIG DATA
Question
[CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
|
|
3
|
|
2
|
|
1
|
|
0
|
Detailed explanation-1: -Replication factor is 1 for HDFS.
Detailed explanation-2: -Pseudo Distributed Mode (Single Node Cluster): Hadoop configured to run on a single-node in a pseudo-distributed mode where each Hadoop daemon runs in a separate Java process. Custom configuration is required. Replication factor is set to one for HDFS. Used for real code to test in HDFS.
Detailed explanation-3: -Pseudo-distributed mode is also known as a single-node cluster where both NameNode and DataNode will reside on the same machine. Fully-distributed mode is the production mode of Hadoop where multiple nodes will be running.
Detailed explanation-4: -Here replication factor is changed to 5 using –setrep command.
Detailed explanation-5: -The Replication Factor (RF) is equivalent to the number of nodes where data (rows and partitions) are replicated. Data is replicated to multiple (RF=N) nodes. An RF of one means there is only one copy of a row in a cluster, and there is no way to recover the data if the node is compromised or goes down.