A sample is a smaller, random and representative (group|subset|data set) of the population.
Any one sample will never be perfect if we're only getting a random sample from a population.
Importance of Sampling
While data mining can be used to uncover patterns in data samples, it is important to be aware that:
- the use of non-representative samples of data may produce results that are not indicative of the domain.
- data mining will not find patterns that may be present in the domain, if those patterns are not present in the sample being “mined”. Data mining will only functions with indicative and representative data