Table - Physical Data Structure
About
This page lists the different physical structures used to store tabular data, on disk or in memory.
Storage
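Columnar file formats are generally slower to write than non-columnar (row-oriented) file formats, because the values of each incoming row have to be split up and written to a separate location per column.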
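The write cost can be illustrated with a toy in-memory model. This is only a sketch: the table contents and the helper names (`append_row_oriented`, `append_column_oriented`) are made up for illustration, and real columnar formats also buffer, encode and compress each column, which this model ignores.

```python
# Row-oriented layout: one record per entry.
rows = [
    {"id": 1, "name": "a", "price": 10.0},
    {"id": 2, "name": "b", "price": 20.0},
]

# Column-oriented layout: one list of values per column.
columns = {
    "id": [1, 2],
    "name": ["a", "b"],
    "price": [10.0, 20.0],
}


def append_row_oriented(table, record):
    """Append a record; the whole record lands in one place."""
    table.append(record)
    return 1  # number of separate locations touched


def append_column_oriented(table, record):
    """Append a record; every column has to be touched separately."""
    for column_name, values in table.items():
        values.append(record[column_name])
    return len(table)  # one location per column


new_record = {"id": 3, "name": "c", "price": 30.0}
row_writes = append_row_oriented(rows, new_record)
column_writes = append_column_oriented(columns, new_record)
print(row_writes, column_writes)
```

In this model a row append touches one location while a columnar append touches as many locations as there are columns, which is one intuition for the slower writes.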
Columnar
- Table - Parquet Format (On Disk)
Row
On Disk and in Memory
The two settings have different trade-offs:
- On disk: Storage.
  - Accessed by multiple queries.
  - Priority to I/O reduction (but still needs good CPU throughput).
  - Mostly streaming access.
- In memory: Transient.
  - Specific to one query execution.
  - Priority to CPU throughput (but still needs good I/O).
  - Streaming and random access.
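The I/O side of these trade-offs can be sketched by packing the same records into a row layout and a column layout, then counting the bytes a single-column query has to read. Everything here is an assumption made for illustration: the three-field record `(id, price, qty)`, the fixed-width little-endian encoding, and the function names. It does not describe any particular file format.

```python
import struct

# Hypothetical records: (id, price, qty).
records = [(1, 10.0, 100), (2, 20.0, 200), (3, 30.0, 300)]

# Row layout: each record is packed contiguously (int64, float64, int64).
ROW = struct.Struct("<qdq")
row_bytes = b"".join(ROW.pack(*r) for r in records)

# Column layout: the values of each column are packed contiguously.
n = len(records)
price_bytes = struct.pack(f"<{n}d", *(r[1] for r in records))


def sum_price_row_layout(buf):
    """Streaming scan: every full record is read to get one field."""
    total, bytes_read = 0.0, 0
    for offset in range(0, len(buf), ROW.size):
        _id, price, _qty = ROW.unpack_from(buf, offset)
        total += price
        bytes_read += ROW.size
    return total, bytes_read


def sum_price_column_layout(buf):
    """Streaming scan: only the price column is read."""
    values = struct.unpack(f"<{len(buf) // 8}d", buf)
    return sum(values), len(buf)


def get_record_row_layout(buf, index):
    """Random access: jump straight to one record by its offset."""
    return ROW.unpack_from(buf, index * ROW.size)


row_total, row_read = sum_price_row_layout(row_bytes)
col_total, col_read = sum_price_column_layout(price_bytes)
second_record = get_record_row_layout(row_bytes, 1)
print(row_read, col_read, second_record)
```

Both scans return the same sum, but the column layout reads a third of the bytes for this query, which is the kind of I/O reduction an on-disk structure prioritizes. The offset-based lookup shows the random access that an in-memory structure is also expected to serve.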