Data Processing - (Batch|Bulk) Processing

1 - About

An batch processing systems (bulk,offline) means:

  • starting a process,
  • reading a lot of data in batch (in parallel if possible)
  • and terminating the process

2 - Article

3 - Implementation

Simple code iterates generally one tuple at a time (for example looping over rows in a table). This kind of algorithms are hard to optimize and parallelize compared to declarative set-oriented languages such as SQL.

4 - Batch vs Stream processing

Data Science
Data Analysis
Data Science
Linear Algebra Mathematics

Powered by ComboStrap