MapReduce - InputFormat

About

The default behavior of file-based InputFormats, typically sub-classes of FileInputFormat, is to split the input into logical InputSplits based on the total size, in bytes, of the input files.

Articles Related

Example

InputFormat:

org.apache.hadoop.mapreduce.lib.input.FileInputFormat - one file at a time
TextInputFormat - one line at a time