# Data Mining - Clustering (Function|Model)

To identify natural groupings in the data.

Useful for exploring data and finding natural groupings within the data.

Members of a cluster are more like each other than they are like members of a different cluster.

The process of clustering is really a process of choosing a good partition of the data.

## Type

### Function

Finds natural groupings in the data

### Model

Clustering models use descriptive data mining techniques, but they can be applied to classify cases according to their cluster assignments.

The model defines segments, or “clusters” of a population, then decides the likely cluster membership of each new case.

## Example

• A model might identify the segment of the population that has an income within a specified range, that has a good driving record, and that leases a new car on a yearly basis.
• Segment demographic data into clusters and rank the probability that an individual will belong to a given cluster
• Common examples include finding new customer segments, and life sciences discovery.

Discover More
(Data|Text) Mining - Word-sense disambiguation (WSD)

Word-sense disambiguation (WSD) is an open problem of natural language processing, which governs the process of identifying which sense of a word (i.e. meaning) is used in a sentence, when the word has...
Data Mining - (Anomaly|outlier) Detection

The goal of anomaly detection is to identify unusual or suspicious cases based on deviation from the norm within data that is seemingly homogeneous. Anomaly detection is an important tool: in data...
Data Mining - (Descriptive|Discovery) (Analysis|Statistics)

Descriptive analysis is also known as Descriptive statistics They are procedures used to summarize, organize, and simplify data. Descriptive function are always unsupervised See also . Visual...
Data Mining - (Function|Model)

The model is the function, equation, algorithm that predicts an outcome value from one of several predictors. During the training process, the models are build. A model uses a logic and one of several...
Data Mining - Algorithms

An is a mathematical procedure for solving a specific kind of problem. For some data mining functions, you can choose among several algorithms. Algorithm Function Type Description Decision...
Data Mining - Data Mining - (Data|Knowledge) Discovery - Statistical Learning

Data Mining can be defined as the automatic or semiautomatic task of extracting previously unknown information from a large quantity of data. Data mining try to discover in data unknown: unexpected...
Data Mining - Grouping (Classification)

Classification in data mining
Data Mining - Orthogonal Partitioning Clustering (O-Cluster or OC) algorithm

O-Cluster creates a hierarchical, grid-based clustering model. This Unsupervised algorithm creates clusters that define dense areas in the attribute space. A sensitivity parameter defines the baseline...
Data Mining - Scoring (Applying)

The process of applying a model to new data is known as scoring. Apply data, also called scoring data, is the actual population to which a model is applied. Scoring operation for: classification,...
Data Mining - k-Means Clustering algorithm

k-Means is an Unsupervised distance-based clustering algorithm that partitions the data into a predetermined number of clusters. Each cluster has a centroid (center of gravity). Cases (individuals...