DATABASE FUNDAMENTALS
BASICS OF BIG DATA
Question
[CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
|
|
Hadoop Common
|
|
Hadoop MapReduce
|
|
Hadoop YARN
|
|
Hadoop Distributed File System (HDFS)
|
Detailed explanation-1: -MapReduce is a programming model for the parallel processing of large data sets on the distributed computing nodes in the cluster. MapReduce is the default processing framework for Hadoop, but Hadoop can also be used with other processing frameworks.
Detailed explanation-2: -Hadoop YARN It is the resource management unit of Hadoop and is available as a component of Hadoop version 2. Hadoop YARN acts like an OS to Hadoop. It is a file system that is built on top of HDFS. It is responsible for managing cluster resources to make sure you don’t overload one machine.
Detailed explanation-3: -The YARN or Yet Another Resource Negotiator is the update to Hadoop since its second version. It is responsible for Resource management and Job Scheduling.
Detailed explanation-4: -Oozie. It is a workflow scheduler system for managing apache Hadoop jobs. Oozie combines multiple jobs sequentially into one logical unit of work.
Detailed explanation-5: -HDFS: Hadoop Distributed File System. YARN: Yet Another Resource Negotiator. MapReduce: Programming based Data Processing. Spark: In-Memory data processing.