Distribution - Measures of (center|central tendency) (Mean, Median, Mode)

Data System Architecture

About

A Measure of central tendency is a measure that describes the middle or center point of a distribution. A good measure of central tendency is representative of the distribution.

The mean, the median and the mode are measures of center.

Mean Vs Median

  • The mean is the most used measure as calculation of the central tendency.

The Mean (average) is the best measure of central tendency when the distribution is normal

  • Median (middle score) is preferred when there are extreme scores in the distribution

In a negatively skewed distribution, the median will be a little bit greater than the mean and in a positive skewed distribution, the median will be a little bit lower than the mean.

Example: Outliers Resistant

With the following Scores: 2 7 8:

  • the median is 7
  • and the Mean is 5,6

The better measure of your performance is the median.

The mean and median are so different because there is one score that is extremely different from the rest. In statistics, such extreme values are called outliers

The mean is affected by the presence of an outlier; however, the median is not.

A statistic that is not affected by outliers is called resistant.

The median is a resistant measure of center, and the mean is not resistant.

As a result, when we have a data set that contains an outliers, it is better to use the median to describe the center, rather than the mean.





Discover More
Card Puncher Data Processing
Data Processing - Selection

selection is a data processing operation. finding: the min, the max the median or even any kth element in sub-linear time
Mean
Distribution - (Mean|Average) (M| | )

The average is a measure of center that statisticians call the mean. To calculate the mean, you add all numbers and divide the total by the number of numbers (N). The mean is not resistant. The...
Data System Architecture
Quantile - (Median|Middle)

The median is a measure of center. The middle number of a set of data is the median. This measure is resistant. The median is a 50th percentile (or “middle” quartile). Half of the data is below the...
Data System Architecture
Statistics - (Data|Data Set) (Summary|Description) - Descriptive Statistics

Summary are a single value summarizing a array of data. They are: selected or calculated through reduction operations. They are an important element of descriptive analysis One of the most important...
Normal Distribution Cdf
Statistics - (Normal|Gaussian) Distribution - Bell Curve

A normal distribution is one of underlying assumptions of a lot of statistical procedures. In nature, every outcome that depends on the sum of many independent events will approximate the Gaussian distribution...
Statistics Mode
Statistics - Mode (Majority, Peak) (flatness, pointiness or modality)

The mode is a measure of center. It's the score that occurs most often. Mode can be used for nominal variables (that's not true for Mean and Median). Graphically, the peak of a histogram is the mode....



Share this page:
Follow us:
Task Runner