Cassandra - Storage Layout

Data Modeling Chebotko Logical


The data is automatically distributed by horizontal partition (sharding).

Request processing:

  • When data is inserted into the cluster, the first step is to apply a hash function to the partition key to get a numeric token
  • The coordinator does a token lookup and retrieve the node that owns the numeric token called a replica (Each node owns a particular ranges of tokens - numeric)
  • The coordinator assigns the data to a given partition

In a 3 replica node, if a request comes in for data, even if one of our replicas has gone down, the other two are still available to fulfill the request.

Data Storage Hierarchy

  • Table: Logical
  • Partition: Physical (one or more file)

Discover More
Data Modeling Chebotko Logical
Cassandra - Partition

A partition in Cassandra is a unit of storage that does not get divided across nodes. A partition is an ordered dictionary (ordered by clustering key). A partition is a file and should be kept small...

Share this page:
Follow us:
Task Runner