DATABASE FUNDAMENTALS
BASICS OF BIG DATA
Question
[CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]
|
|
Pig Latin
|
|
Oozie
|
|
Pig
|
|
Hive
|
Detailed explanation-1: -1. is a platform for constructing data flows for extract, transform, and load (ETL) processing and analysis of large datasets. Explanation: Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs.
Detailed explanation-2: -Pig: A data flow language and execution environment for exploring very large datasets. Pig runs on HDFS and MapReduce clusters. HBase: A distributed, column-oriented database.
Detailed explanation-3: -MapReduce can best be described as a programming model used to develop Hadoop-based applications that can process massive amounts of unstructured data.
Detailed explanation-4: -Avro is a remote procedure call and data serialization framework developed within Hadoop project. It uses JSON for defining data types and protocols and serializes data in a compact binary format.
Detailed explanation-5: -Which of the following function is used to read data in PIG? Explanation: PigStorage is the default load function.