Table of Contents

Data Mining - Global vs Local

About

Global refers to calculation that are made over the whole data set whereas local refers to calculations that are made local to a point or a partition.

High dimension vs Local

In high dimension, it's really difficult to stay local.

Curse Of Dimensionality Radius Volume

Neighborhood Curse Of Dimensionality 1d 2d

Example: the percentage of volume that contains 10% of the data in an hypercube is:

To resolved this problem, (structured|parametrized) model have been introduced. The simplest one is the linear model.

Documentation / Reference