Statistics - (Data|Data Set) (Summary|Description) - Descriptive Statistics

Data System Architecture


Summary are a single value summarizing a array of data.

They are:

They are an important element of descriptive analysis

One of the most important tools for summarizing data is to display it visually. See the Anscombe's quartet to understand why ?

From a summary, you can make an inference on a individual without looking at the distribution

Have a look at the distribution before jumping to the summary (in order to not make a stereotype)

Same as: Function - (Aggregate | Aggregation)



Measures of center

In order to summarize a set of data, we want to use one single number or value to describe where it is centred.

This single value can be calculated through one of the following (statistic|measure) of center:


Another important element of a data set is how it is spread.

Measures of spread:



Apache Math Package specifically DescriptiveStatistics

Discover More
Consistent Hashing
Cryptography - Hash

A hash function is an encryption crypto algorithm that takes as data as input (possibly large and of variable-sized) and produces a short fixed-length integer value (generally printed as an hexadecimal...
Data Mining - (Descriptive|Discovery) (Analysis|Statistics)

Descriptive analysis is also known as Descriptive statistics They are procedures used to summarize, organize, and simplify data. Descriptive function are always unsupervised See also . Visual...
Card Puncher Data Processing
Data Processing - Operations / Operator

A data processing function takes an input and creates an output in a pipeline. transition in Automata functional interface in Functional Programming Filter in Data Processing (Shell and Log Pipeline)...
Utah Teapot
Data Visualisation - Histogram (Frequency distribution)

A histogram is a type of graph generally used to visualize a distribution An histogram is also known as a frequency distribution. Histograms can reveal information not captured by summary statistics...
Model Funny
Function - (Aggregate | Aggregation)

Aggregate functions return a single value calculated or selected from values that are in a aggregation relationship (ie a set) This values are also known as summary because they try to summarize...
Imperative Vs Functional
Functional Programming - Reduce - Reduction Operation (fold)

A (reduction|reduce) operation (also called a fold) is a functional programming function. Reduction operations are terminal operations. They takes a sequence of input elements and returns: a single...
Card Puncher Data Processing
R - Interaction Analysis

interaction with R . An interaction term between a numeric x and z is just the product of x and z. lm processes the “” operator between variables andautomatically: add the interaction...
Card Puncher Data Processing
R - Multiple Linear Regression

Multiple linear regression with R functions such as lm Unstandardized Multiple Regression Regression analyses, standardized (in the z scale). The point is a short-cut to select all variables....
Card Puncher Data Processing
R - Non-linear Effect Analysis

Non-linear Analysis with R. where: the quadratic is indicated by the power 2 (predictor1^2) As power has a meaning in this formula, the identity function (I) is used to protect it....
Card Puncher Data Processing
R - Simple Linear Regression

simple linear regression with R function such as lm Unstandardized Simple Regression Regression analyses, standardized (in the z scale). In simple regression, the standardized regression coefficient...

Share this page:
Follow us:
Task Runner