Data Mining - (Prediction|Guess)


Something predictable is showing a pattern and is therefore not truly random.

A high-entropy source is completely chaotic, is unpredictable, and is called true randomness.

Many forms of data mining model are predictive. For example, a model might predict income based on education and other demographic factors.

An accurate prediction function does not imply that the function is an accurate model of the phenomenon being analysed, just that it makes an accurate prediction given the data.

Predictive functions:

It's easier to invent the future than try to predict it.

President Hennessy

The best way to predict the future is to invent it.

Alan Kay

I've always been more interested in the future than in the past.

Grace Hopper

Prediction vs Forecasting

Forecasting is the process of making predictions of the future based on past and present data (time serie) whereas prediction is a more more general term.


Predictions have an associated probability that gives the degree of uncertainty.

Prediction probabilities are also known as confidence (How confident can I be of this prediction?).

This Prediction probability is one way to indicate:

Example: Actionable Information

Data mining can derive actionable information from large volumes of data.

For example:

  • a town planner might use a model that predicts income based on demographics to develop a plan for low-income housing.
  • a car leasing agency might a use model that identifies customer segments to design a promotion targeting high-value customers.

Documentation / Reference

Powered by ComboStrap