MapReduce - Data (Stream) - <key, value> pairs


About

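The MapReduce framework operates exclusively on <key, value> pairs: it views the input to a job as a set of <key, value> pairs and produces a set of <key, value> pairs as the job's output, conceivably of different types.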

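The key and value classes have to be serializable by the framework:

  • key and value classes implement the Writable interface (to be serializable)
  • key classes additionally implement the WritableComparable interface (to facilitate sorting by the framework)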
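As a rough illustration of what these two interfaces ask of a type, here is a minimal sketch of a custom key. It assumes nothing beyond the JDK: the `Writable` and `WritableComparable` interfaces declared in the block are local stand-ins that mirror the shape of Hadoop's `org.apache.hadoop.io` interfaces so that the example compiles without Hadoop on the classpath. `YearMonthKey`, `WritableSketch`, and `roundTrip` are hypothetical names invented for this example.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Local stand-ins mirroring the shape of Hadoop's Writable and
// WritableComparable (assumption: used here only to avoid a Hadoop dependency).
interface Writable {
    void write(DataOutput out) throws IOException;
    void readFields(DataInput in) throws IOException;
}

interface WritableComparable<T> extends Writable, Comparable<T> {}

// Hypothetical key type: a (year, month) pair that can be serialized and sorted.
final class YearMonthKey implements WritableComparable<YearMonthKey> {
    int year;
    int month;

    // No-arg constructor so an empty instance can be created before readFields.
    YearMonthKey() {}

    YearMonthKey(int year, int month) {
        this.year = year;
        this.month = month;
    }

    @Override
    public void write(DataOutput out) throws IOException {
        out.writeInt(year);
        out.writeInt(month);
    }

    @Override
    public void readFields(DataInput in) throws IOException {
        // Fields are read back in the same order they were written.
        year = in.readInt();
        month = in.readInt();
    }

    @Override
    public int compareTo(YearMonthKey other) {
        int byYear = Integer.compare(year, other.year);
        return byYear != 0 ? byYear : Integer.compare(month, other.month);
    }
}

final class WritableSketch {
    // Serializes a key to bytes and reads it back into a fresh instance.
    static YearMonthKey roundTrip(YearMonthKey key) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            key.write(new DataOutputStream(bytes));
            YearMonthKey copy = new YearMonthKey();
            copy.readFields(new DataInputStream(new ByteArrayInputStream(bytes.toByteArray())));
            return copy;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        YearMonthKey copy = roundTrip(new YearMonthKey(2024, 3));
        System.out.println(copy.year + "-" + copy.month);
    }
}
```

In a real job the class would implement Hadoop's own interfaces instead of the stand-ins; a key used with hash-based partitioning would typically also override `equals` and `hashCode`, which this sketch leaves out for brevity.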

See also MapReduce - InputFormat

Example of pipeline

The input and output types of a MapReduce job form a pipeline:

(input) <k1, v1> -> map -> <k2, v2> -> combine -> <k2, v2> -> reduce -> <k3, v3> (output)

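where:

  • <k1, v1> is the input pair read by the map function
  • <k2, v2> is the intermediate pair emitted by map; the optional combine step consumes and emits this same type
  • <k3, v3> is the output pair written by the reduce function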
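To make the type flow of the pipeline concrete, the following is a small in-memory word count written in plain Java. It is only a sketch of the idea and does not use the Hadoop API: `PairPipelineSketch` and its `map`, `combine`, `reduce`, and `run` methods are hypothetical names, each input line stands in for one map task's input, and the grouping loop in `run` is a simplified substitute for the framework's shuffle and sort. Here <k1, v1> is <Long, String> (line offset, line text), <k2, v2> is <String, Integer>, and <k3, v3> is <String, Long>.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

final class PairPipelineSketch {

    // map: <k1, v1> = <line offset, line text>  ->  list of <k2, v2> = <word, 1>
    static List<Map.Entry<String, Integer>> map(long offset, String line) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (String word : line.trim().split("\\s+")) {
            if (!word.isEmpty()) {
                out.add(new SimpleEntry<>(word, 1));
            }
        }
        return out;
    }

    // combine: local pre-aggregation; consumes <k2, v2> and emits <k2, v2>.
    static Map<String, Integer> combine(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> partial = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            partial.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return partial;
    }

    // reduce: <k2, list of v2>  ->  <k3, v3> = <word, total count as Long>
    static Map.Entry<String, Long> reduce(String word, List<Integer> counts) {
        long total = 0;
        for (int count : counts) {
            total += count;
        }
        return new SimpleEntry<>(word, total);
    }

    // Drives the whole pipeline over a list of input lines.
    static Map<String, Long> run(List<String> lines) {
        // Simplified stand-in for shuffle/sort: group values by key, keys sorted.
        Map<String, List<Integer>> grouped = new TreeMap<>();
        long offset = 0;
        for (String line : lines) {
            Map<String, Integer> combined = combine(map(offset, line));
            for (Map.Entry<String, Integer> pair : combined.entrySet()) {
                grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
            }
            offset += line.length() + 1; // +1 for the assumed newline separator
        }
        Map<String, Long> output = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> group : grouped.entrySet()) {
            Map.Entry<String, Long> result = reduce(group.getKey(), group.getValue());
            output.put(result.getKey(), result.getValue());
        }
        return output;
    }

    public static void main(String[] args) {
        // prints {a=2, b=2, c=1}
        System.out.println(run(Arrays.asList("a b a", "b c")));
    }
}
```

The combine step is kept type-preserving on purpose: because it may run zero or more times in a real framework, its output has to be valid input for reduce, which is why both sides are <k2, v2>.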






