Data Mining - (Dimension|Feature) (Reduction)


In machine learning and statistics, dimensionality reduction is the process of reducing the number of random variables (features) under consideration and can be divided into:

This methods are some called “Model selection methods”.

They are an essential tool for data analysis, especially for big datasets involving many predictors.

In dimensionality reduction, the goal is to select/retain a subset of features while still retaining as much of the variance in the dataset as possible.



  • random projections
  • feature hashing

Documentation / Reference

Powered by ComboStrap