Data Compression

Sorting Quicksort Anim

About

Data compression or source coding is the process of encoding information using fewer bits (or other information-bearing units) than an unencoded representation would use through use of specific encoding schemes.

Method

  • run-length encoding,
  • cluster coding
  • and dictionary coding.

Dictionary encoding

Columns are stored as sequences of bit-coded integers.

A check for equality can then be executed on the integers; for example, during scans or join operations. This is much faster than comparing, for example, string values.

Example

  • If a column is sorted, often there are repeated adjacent values

Algorithm

Library





Discover More
Sorting Quicksort Anim
Algorithm

An is a (procedure|method) for solving a problem. If there exists an algorithm, the function that performs it is called computable. Study of algorithms dates at least to Euclid and were formalized by...
Feature Extraction
Data Mining - (Feature|Attribute) Extraction Function

Feature extraction is the second class of methods for dimension reduction. dimension reduction It creates new attributes (features) using linear combinations of the (original|existing) attributes. ...
Columnar Physical Table Representation
Database - Column Oriented (Storage|Store)

Storing data in columns is functionally similar to having a built-in index for each column. This data structure is used in analytics and NoSql database. Columnar storage is a popular data structure in...
Data System Architecture
Number - Compression

Columns of numerical values can often be efficiently compressed using two approaches: bit packing and run-length encoding (RLE) Bit packing uses the fact that small integers do not need a...
Card Puncher Data Processing
Oracle Database - Compression

By using a data compression algorithm specifically designed for relational data, Oracle is able to compress data much more effectively than standard compression techniques. More significantly, unlike other...
Sap Hana Architecture
SAP Hana - Compression (In Column Store)

Compression algorithms (dictionary, RLE, sparse, and so on) applied to a table store column. The compression are only on the main storage Default is the Dictionary compression applied to all...
Cpu Moore Law Transistor
What is a CPU, the Central processing unit?

A CPU is just a device name that indicate a device that controls a computer system. A CPU is also known as: a or The fundamental operation of most CPUs, regardless of the physical form they take,...
Undraw File Manager Re Ms29
What is an archive ? (Zip, Tar, Jar, ...)

An archive is a tree of file and directory that are grouped in one file.
Map Of Internet 1973
What is the Open Systems Interconnection model (OSI model)?

This page is the Open Systems Interconnection (OSI) model that describes layers for data transmission. The elements of each layer are: the level the data unit the protocol Level Name ...



Share this page:
Follow us:
Task Runner