The data is automatically distributed by horizontal partition (sharding).
- When data is inserted into the cluster, the first step is to apply a hash function to the partition key to get a numeric token
- The coordinator assigns the data to a given partition
In a 3 replica node, if a request comes in for data, even if one of our replicas has gone down, the other two are still available to fulfill the request.
Data Storage Hierarchy
- Table: Logical
- Partition: Physical (one or more file)