FUNDAMENTALS OF COMPUTER

DATABASE FUNDAMENTALS

BASICS OF BIG DATA

Question [CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
Select two key advantages of Hadoop 2.X over 1.X!
A
The Name Node Job Tracker can handle more concurrent jobs
B
The inclusion of YARN allows for greater scaling of jobs
C
Faster recovery from Name Node failure do to Standby Name Node
D
Faster recovery from Name failure do to Secondary Name Node
E
Task Trackers on Data Nodes can increase the job loads
Explanation: 

Detailed explanation-1: -Hadoop 1. x supports one and only one programming model: MapReduce. Hadoop 2. x supports multiple programming models with YARN Component like MapReduce, Interative, Streaming, Graph, Spark, Storm etc.

Detailed explanation-2: -Hadoop 2.0 feature HDFS Federation allows horizontal scaling for Hadoop distributed file system (HDFS). This is one of the many sought after features by enterprise class Hadoop users such as Amazon and eBay. HDFS Federation supports multiple NameNodes and namespaces.

Detailed explanation-3: -High Availability was a new feature added to Hadoop 2. x to solve the Single point of failure problem in the older versions of Hadoop. As the Hadoop HDFS follows the master-slave architecture where the NameNode is the master node and maintains the filesystem tree. So HDFS cannot be used without NameNode.

Detailed explanation-4: -HDFS (storage) and YARN (processing) are the two core components of Apache Hadoop.

There is 1 question to complete.