- The map function:
- Input of the
map function =
(DocumentID,
Content of the document)
- Output of the
map function =
(word in document,
DocumentID)
|
Graphically:
- The shuffle mechanism
of the MapReduce framework will
then perform the following
movement (grouping) of
the outputs:
- The reduce( ) function:
- Input of the
reduce function = a
list of item of the form
(word,
DocID of the document)
- Output of the
reduce function =
reformat the
input into
(word,
list of
DocID )
|
Graphically:
|