Data Processing - Batch
The batch semantics means grouping data into batch in order to manipulate a lot of data at once as opposed to read and process each unit of data.
The data is stored in a container:
- IO - Block (Batch on device) for disk
- Data Processing - Buffer (Batch concept in code) for code (implemented by collection)
- a batch processing systems means starting a process, reading a lot of data and terminating it
- while a (online|continous in-memory) stream system means starting an endless process that will continuously process the data