Consider the pseudo-code for MapReduces WordCount example (not shown here). Lets now assume that you want to determine the average length of all the words in a text file. Which part of the pseudo-code do you need to adapt?

BASICS OF BIG DATA

Please wait while the activity loads.
If this activity does not load, try refreshing your browser. Also, this page requires javascript. Please visit using a browser with javascript enabled.

If loading fails, click here to try again

Question [CLICK ON ANY CHOICE TO KNOW THE RIGHT ANSWER]

Consider the pseudo-code for MapReduce’s WordCount example (not shown here). Let’s now assume that you want to determine the average length of all the words in a text file. Which part of the pseudo-code do you need to adapt?

A	Only map()
B	Only reduce()
C	map() and reduce()
D	The code does not have to be changed

Explanation:

Detailed explanation-1: -A map() function can emit up to a maximum number of key/value pairs (depending on the Hadoop environment). A map() function can emit anything between zero and an unlimited number of key/value pairs. A reduce() function can iterate over key/value pairs multiple times.

Detailed explanation-2: -MapReduce Word Count is a framework which splits the chunk of data, sorts the map outputs and input to reduce tasks. A File-system stores the output and input of jobs. Re-execution of failed tasks, scheduling them and monitoring them is the task of the framework.

Detailed explanation-3: -The text from the input text file is tokenized into words to form a key value pair with all the words present in the input text file. The key is the word from the input file and value is ‘1’. This is how the MapReduce word count program executes and outputs the number of occurrences of a word in any given input file.

There is 1 question to complete.

FUNDAMENTALS OF COMPUTER

DATABASE FUNDAMENTALS

BASICS OF BIG DATA