MapReduce - InputFormat
About
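The default behavior of file-based InputFormats, typically sub-classes of FileInputFormat, is to split the input into logical InputSplits based on the total size, in bytes, of the input files. The FileSystem block size of the input files is treated as an upper bound for the splits, and a lower bound can be set with mapreduce.input.fileinputformat.split.minsize. Each InputSplit is then assigned to an individual Mapper.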
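As a rough illustration of size-based splitting, the sketch below clamps a block size between a minimum and a maximum split size and then cuts a file into (offset, length) ranges. It is plain Java with no Hadoop dependency. The class name SplitSketch and the method splits are hypothetical, and the sketch ignores refinements of the real implementation, such as the handling of a small trailing remainder and of non-splittable files.

```java
import java.util.ArrayList;
import java.util.List;

public class SplitSketch {
    // Split size: the block size, clamped between the minimum and maximum split sizes.
    public static long computeSplitSize(long blockSize, long minSize, long maxSize) {
        return Math.max(minSize, Math.min(maxSize, blockSize));
    }

    // Hypothetical helper: returns {offset, length} pairs covering one file.
    public static List<long[]> splits(long fileLength, long splitSize) {
        List<long[]> result = new ArrayList<>();
        long offset = 0;
        while (offset < fileLength) {
            // The last split may be shorter than splitSize.
            long length = Math.min(splitSize, fileLength - offset);
            result.add(new long[] {offset, length});
            offset += length;
        }
        return result;
    }

    public static void main(String[] args) {
        long mb = 1024L * 1024L;
        // Assumed values: 128 MB blocks, no effective minimum or maximum.
        long splitSize = computeSplitSize(128 * mb, 1, Long.MAX_VALUE);
        for (long[] s : splits(300 * mb, splitSize)) {
            System.out.println("offset=" + s[0] + " length=" + s[1]);
        }
    }
}
```

Under these assumed values, a 300 MB file yields three ranges: two of 128 MB and a final one of 44 MB.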
Example
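- org.apache.hadoop.mapreduce.lib.input.FileInputFormat - base class for file-based InputFormats; a file is processed as a whole (one file at a time) when a subclass overrides isSplitable to return false, otherwise it is split by size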
- TextInputFormat - one line at a time: each record is a line of text, keyed by its byte offset in the file
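To make the line-at-a-time record model concrete, here is a minimal sketch in plain Java that turns a byte buffer into (byte offset, line) pairs, the shape of the records a line-oriented input format hands to a Mapper. It does not use the Hadoop API. The class LineRecordSketch and the method records are hypothetical names, and the sketch assumes "\n" line endings and a single buffer that is not cut across split boundaries.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

public class LineRecordSketch {
    // Returns one record per line: key = byte offset where the line starts,
    // value = line content without the trailing newline.
    public static Map<Long, String> records(byte[] data) {
        Map<Long, String> out = new LinkedHashMap<>();
        int start = 0;
        for (int i = 0; i < data.length; i++) {
            if (data[i] == '\n') {
                out.put((long) start,
                        new String(data, start, i - start, StandardCharsets.UTF_8));
                start = i + 1;
            }
        }
        // A final line without a terminating newline is still a record.
        if (start < data.length) {
            out.put((long) start,
                    new String(data, start, data.length - start, StandardCharsets.UTF_8));
        }
        return out;
    }

    public static void main(String[] args) {
        byte[] data = "hello\nworld\n".getBytes(StandardCharsets.UTF_8);
        records(data).forEach((offset, line) ->
                System.out.println(offset + "\t" + line));
    }
}
```

For the input "hello\nworld\n", the records are (0, "hello") and (6, "world").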