Data Mining - Clustering (Function|Model)

Thomas Bayes



To identify natural groupings in the data.

Useful for exploring data and finding natural groupings within the data.

Members of a cluster are more like each other than they are like members of a different cluster.

The process of clustering is really a process of choosing a good partition of the data.




Finds natural groupings in the data


Clustering models use descriptive data mining techniques, but they can be applied to classify cases according to their cluster assignments.

The model defines segments, or “clusters” of a population, then decides the likely cluster membership of each new case.


  • A model might identify the segment of the population that has an income within a specified range, that has a good driving record, and that leases a new car on a yearly basis.
  • Segment demographic data into clusters and rank the probability that an individual will belong to a given cluster
  • Common examples include finding new customer segments, and life sciences discovery.

Discover More
Thomas Bayes
(Data|Text) Mining - Word-sense disambiguation (WSD)

Word-sense disambiguation (WSD) is an open problem of natural language processing, which governs the process of identifying which sense of a word (i.e. meaning) is used in a sentence, when the word has...
Anomalies Election Fraud
Data Mining - (Anomaly|outlier) Detection

The goal of anomaly detection is to identify unusual or suspicious cases based on deviation from the norm within data that is seemingly homogeneous. Anomaly detection is an important tool: in data...
Data Mining - (Descriptive|Discovery) (Analysis|Statistics)

Descriptive analysis is also known as Descriptive statistics They are procedures used to summarize, organize, and simplify data. Descriptive function are always unsupervised See also . Visual...
Model Funny
Data Mining - (Function|Model)

The model is the function, equation, algorithm that predicts an outcome value from one of several predictors. During the training process, the models are build. A model uses a logic and one of several...
Data Mining Algorithm
Data Mining - Algorithms

An is a mathematical procedure for solving a specific kind of problem. For some data mining functions, you can choose among several algorithms. Algorithm Function Type Description Decision...
Thomas Bayes
Data Mining - Data Mining - (Data|Knowledge) Discovery - Statistical Learning

Data Mining can be defined as the automatic or semiautomatic task of extracting previously unknown information from a large quantity of data. Data mining try to discover in data unknown: unexpected...
Thomas Bayes
Data Mining - Grouping (Classification)

Classification in data mining
Thomas Bayes
Data Mining - Orthogonal Partitioning Clustering (O-Cluster or OC) algorithm

O-Cluster creates a hierarchical, grid-based clustering model. This Unsupervised algorithm creates clusters that define dense areas in the attribute space. A sensitivity parameter defines the baseline...
Thomas Bayes
Data Mining - Scoring (Applying)

The process of applying a model to new data is known as scoring. Apply data, also called scoring data, is the actual population to which a model is applied. Scoring operation for: classification,...
Thomas Bayes
Data Mining - k-Means Clustering algorithm

k-Means is an Unsupervised distance-based clustering algorithm that partitions the data into a predetermined number of clusters. Each cluster has a centroid (center of gravity). Cases (individuals...

Share this page:
Follow us:
Task Runner