ataCadamia
Subscribe
Search Term
This is a sitemap over all available pages ordered by
namespaces
.
android
ant
antlr
apache
application
application_server
architecture
automata
aws
axon
azure
backup
bdd
bics
book
business
cassandra
code
communication_system
company
computer
counter
crypto
dat
data
data_mining
The 1 Percent Rule
Statistics - (Absolute|True) Zero
Data Mining - (Parameters | Model) (Accuracy | Precision | Fit | Performance) Metrics
Statistics - Adjusted R^2
Statistics - Akaike information criterion (AIC)
Data Mining - Algorithms
Data Mining - (Anomaly|outlier) Detection
Data Mining - Apriori algorithm
Data Mining - Association (Rules Function|Model) - Market Basket Analysis
Data Mining - Attribute (Importance|Selection) - Affinity Analysis
Machine Learning - Area under the curve (AUC)
Data Mining - Automatic Discovery
Machine learning - Bootstrap aggregating (bagging)
Statistics - (Base rate fallacy|Bonferroni's principle)
Machine Learning - (Baseline|Naive) classification (Zero R)
Statistics - Bayes’ Theorem (Probability)
Bayesian
Statistics - Benford's law (frequency distribution of digits)
Statistics - Best Subset Selection Regression
Statistics - Bias-variance trade-off (between overfitting and underfitting)
Statistics - Bias (Sampling error)
Statistics - Bayesian Information Criterion (BIC)
Statistics - R (Big R)
Statistics - Bimodal Distribution
Statistics - Binary logistic regression
Mathematics - Combination (Binomial coefficient|n choose k)
(Probability|Statistics) - Binomial Distribution
Data Mining, Machine Learning - Book
Data Mining - (Boosting|Gradient Boosting|Boosting trees)
Data Mining - Decision boundary Visualization
Machine Learning - (C4.5|J48) algorithm
Statistics - (Case-control|retrospective) sampling
Statistics - Causation - Causality (Cause and Effect) Relationship
Statistics - Cumulative Distribution Function (CDF)
Statistics - Centering Continous Predictors
Statistics - Central limit theorem (CLT)
Statistics - centroid (center of gravity)
Statistics - Chance
Data-Science - Cheatsheet
Data Mining - (Class|Category|Label) Target
Data Mining - (Classifier|Classification Function)
Data Mining - Clustering (Function|Model)
Coin Flipping
(Prediction|Recommender System) - Collaborative filtering
Data Mining - Competitions (Kaggle and others)
confidence_interval
Statistics - (Confidence|likelihood) (Prediction probabilities|Probability classification)
Statistics - Confounding (factor|variable) - (Confound|Confounder)
Machine Learning - Confusion Matrix
Data Mining - Content Analysis and Acquisition
Statistics - Continuous Variable
Convex
Statistics - Correlation (Coefficient analysis)
What is the Cosine Similarity or Cosine Distance? (Measure of Angle)
Statistics - Covariance
Statistics - Mallow's Cp
Statistics - Cross Product (of X and Y) (CP|SP)
(Statistics|Data Mining) - (K-Fold) Cross-validation (rotation estimation)
Statistics - (Periodicity|Periodic phenomena|Cycle)
Data Mining - Data Mining - (Data|Knowledge) Discovery - Statistical Learning
Data Mining - Data Point
Data Mining - Data (Preparation | Wrangling | Munging)
Data Mining - Data Product
Data - Science
Data Mining - Data Scientist
Data Mining - Decision Tree (DT) Algorithm
Machine Learning - Decision Stump
Machine Learning - Deep Learning (Network)
Statistics - (Degree|Level) of confidence
Statistics - Degree of freedom (df)
Statistics - (dependent|paired sample) t-test
Math - Derivative (Sensitivity to Change, Differentiation)
Statistics - Design Matrix (X)
Statistics - Deviance
Statistics - Deviation Score (for one observation)
Rolling a die (many dice)
Data Mining - (Dimension|Feature) (Reduction)
Data Mining - Dimensionality (number of variable, parameter) (P)
(Data|Text) Mining - Word-sense disambiguation (WSD)
discretization
Statistics - Quadratic discriminant analysis (QDA)
Statistics Learning - Discriminant analysis
Data Mining - (Discriminative|conditional) models
What is a Distance?
Statistics / Probability - Distribution - (Function)
Statistics - Dummy (Coding|Variable) - One-hot-encoding (OHE)
Statistics - Effect Size
Statistics - Effects (between predictor variable)
Data Mining - Elastic Net Model
Data Mining - Ensemble Learning (meta set)
Data Mining - Entropy (Information Gain)
Statistics Learning - (Error|misclassification) Rate - false (positives|negatives)
Statistics Learning - Prediction Error (Training versus Test)
estimation
Statistics - (Estimator|Point Estimate) - Predicted (Score|Target|Outcome| )
What is Event Detection or Event mining?
Statistics - Exponential Distribution
Statistics - (F-Statistic|F-test|F-ratio)
Statistics - F-distributions
Face Recognition
Statistics - Factor Analysis
Statistics - (Factor Variable|Qualitative Predictor)
Statistics - Factorial Anova
Feature Engineering
Data Mining - (Feature|Attribute) Extraction Function
Data Mining - Feature Hashing
Data Mining - (Attribute|Feature) (Selection|Importance)
Data Mining - Fraud Detection
Frequency Distribution
Statistics - (Frequency|Rate)
Data Mining - (Frequent itemsets|co-occurring items)
Statistics - Frequentist
Data Model - Fudge factor
Fuzzy Logic (Partial Truth)
Galton board
Statistics - Generalized additive model (GAM)
Statistics / Probability - Gaussian function ( )
Statistics - Gaussian processes (modelling probability distributions over functions)
Generalized Boosted Regression Models
Data Mining - Generative Model
getting_started
Statistics - Generalized Linear Models (GLM) - Extensions of the Linear Model
Data Mining - (Stochastic) Gradient descent (SGD)
Data Mining - User Group
Data Mining - Grouping (Classification)
Statistics - Head
Hierarchical Clustering
Data Mining - Hierarchy
Data Mining - High Dimension (Curse of Dimensionality)
Data Science - History
Statistics - Homoscedasticity
Statistics - Hypothesis (Tests|Testing)
Machine Learning - ID3 Algorithm
Data Mining - Intrusion detection systems (IDS) / Intrusion Prevention / Misuse
Machine Learning - Image classification
Statistics - independent t-test
Statistical - Inference
Data Mining - Information Gain
Information Retrieval
Statistics - (Interaction|Synergy) effect
Statistics - Intercept - Regression (coefficient|constant)
Data Mining - Model Interpretation
Statistics - (Interval|Delta) (Measurement)
Data Mining - Java API for data mining (JDM)
k-means
Statistics - Kernel
Machine Learning - K-Nearest Neighbors (KNN) algorithm - Instance based learning
Statistics - Knots (Cut points)
Statistics - Kurtosis (Distribution Tail extremity)
Statistical Learning - Lasso
Statistics - Standard Least Squares Fit (Gaussian linear model)
Statistics - Leptokurtic distribution
Statistics - (Level|Label)
Statistics - (Lying|Lie)
Data Mining - (Life cycle|Project|Data Pipeline)
Data Mining - Lift Chart
Statistics - Fisher (Multiple Linear Discriminant Analysis|multi-variant Gaussian)
Statistical Learning - Simple Linear Discriminant Analysis (LDA)
Machine Learning - Linear (Regression|Model)
Statistics - (Linear spline|Piecewise linear function)
Statistics - Little r - (Pearson product-moment Correlation coefficient)
Statistics - LOcal (Weighted) regrESSion (LOESS|LOWESS)
Data Mining - Global vs Local
Statistics - log-likelihood function (cross-entropy)
Machine Learning - Logistic regression (Classification Algorithm)
Statistics - (Logit|Logistic) (Function|Transformation)
Loss functions (Incorrect predictions penalty)
Data Science - (Kalman Filtering|Linear quadratic estimation (LQE))
Machine Learning
Statistics - Main Effect
Statistics - Probability mass function (PMF)
Data Mining - Maximum Entropy Algorithm
Statistics - Maximum likelihood
Statistics - Measure
Statistics - Measurement
Data Mining - (Missing Value|Not Available) NA
Data Mining - Model Size (d)
Model vs Expert
Statistics - Moderator Variable (Z) - Moderation
Statistics - (Average|Mean) Squared (MS) prediction error (MSE)
Data Mining - Multi-class (classification|problem)
Statistics Learning - Multi-variant logistic regression
Statistics - (Multiclass Logistic|multinomial) Regression
Multidimensional scaling ( similarity of individual cases in a dataset)
Statistics - Multiple Linear Regression
Data Mining - Naive Bayes (NB)
Machine Learning - (Probabilistic?) Neural Network (PNN)
Statistics - (No Predictor|Mean|Null) Model
Data Mining - Noise (Unwanted variation)
Machine Learning - Multi-response linear regression (Linear Decision trees)
Statistics - Non-linear (effect|function|model)
Data Mining - Non-Negative Matrix Factorization (NMF) Algorithm
Statistics - (Normal|Gaussian) Distribution - Bell Curve
Data Mining - Orthogonal Partitioning Clustering (O-Cluster or OC) algorithm
Probability - Odds (Ratio)
Machine Learning - (One|Simple) Rule - (One Level Decision Tree)
Data Mining - Outliers Cases
Machine Learning - (Overfitting|Overtraining|Robust|Generalization) (Underfitting)
Data Science - Over-generalization
Statistics - (Paretian|Power law) distribution
Statistics - Pareto ( Principle | Distribution )
Pascal Triangle
What is a Pattern ?
Data Mining - Principal Component (Analysis|Regression) (PCA|PCR)
Statistics - (Probability) Density Function (PDF)
Mathematics - Permutation (Ordered Combination)
Statistics - Piecewise polynomials
Data Mining - Partial least squares (PLS)
Data Mining - Predictive Model Markup Language (PMML)
Statistics - Poisson Distribution
Data Mining - (Global) Polynomial Regression (Degree)
Statistics - Population Parameter
Statistics - Post-hoc test
Statistics - Power of a test
prediction
Data Mining - Predictive Model Markup Language (PMML)
(Machine|Statistical) Learning - (Predictor|Feature|Regressor|Characteristic) - (Independent|Explanatory) Variable (X)
Statistics - Probability (of an event) / Likelihood
Data Mining - Probit Regression (probability on binary problem)
Data Mining - Problem
Statistics - Process control (SPC)
Data Mining - Pruning (a decision tree, decision rules)
pytorch
r_squared
random_forest
Statistics - Random Variable (Random quantity|Aleatory variable|Stochastic variable)
Statistics - Range
Data Mining - Rare Event
ratio
Statistics - Raw score
recommendations
Statistics - (Regression Coefficient|Weight|Slope) (B)
Statistics - Assumptions underlying correlation and regression analysis (Never trust summary statistics alone)
Statistics - Regression
(Machine learning|Inverse problems) - Regularization
Machine Learning - Reinforcement learning
Sampling - Sampling (With|without) replacement (WR|WOR)
Statistics - Research
residual
Statistics - Resistant
Data Mining - Result Considerations
Statistics - Ridge regression
Root Mean Square (RMS)
rmse
Statistics - ROC Plot and Area under the curve (AUC)
Machine Learning - Rote Classifier
rss
Data Mining - (Decision) Rule
sampling_distribution
Statistics - Sampling Error
Statistics - Sampling
Statistics - (Scales of measurement|Type of variables)
Statistics - Scale
Data Mining - Scoring (Applying)
Statistics - (Shrinkage|Regularization) of Regression Coefficients
Data Mining - Signal (Wanted Variation)
Statistics - Significance level
Statistics - (Significance | Significant) Effect
What is Similarity?
Machine Learning - (Univariate|Simple) Logistic regression
simple_regression
Statistics - Simple Effect
Statistics - Skew (-ed Distribution|Variable)
Machine Learning - Software
Statistics - ( Spread | Variability ) of a sample
Data Mining - Stacking
Statistics - Standard Deviation (SD|s| |RMS width)
Statistics - Standard Error (SE)
What is Normalize or Standardize?
(Statistics|Probability|Machine Learning|Data Mining|Data and Knowledge Discovery|Pattern Recognition|Data Science|Data Analysis)
Statistics - Statistic
Statistics - Forward and Backward Stepwise (Selection|Regression)
Machine Learning - (Supervised|Directed) Learning ( Training ) (Problem)
Data Mining - Support Vector Machines (SVM) algorithm
Statistics - Singular Value Decomposition (SVD)
Statistics - t-distributions
Statistics - (Student's) t-test (Mean Comparison)
Statistics - Tail
(Machine|Statistical) Learning - (Target|Learned|Outcome|Dependent|Response) (Attribute|Variable) (Y|DV)
test_error
Data Mining - Test Set
Test
Statistics - (Threshold|Cut-off) of binary classification
Titanic Data Set
Data Mining - Training Error
Data Mining - Training (Data|Set)
Data Mining - Nested (Transactional|Historical) Data
Statistics - Transform
Statistics - Treatments (Combination of factor level)
Statistics - True score (Classical test theory)
Data Mining - (True Function|Truth)
Statistics - (Total) Sum of the square (TSS|SS)
Statistics - Tuning Parameter
two_class
Statistical Learning - Two-fold validation
Data - Uncertainty
Statistics - Uniform Distribution (platykurtic)
Machine Learning - Unsupervised Learning ( Mining )
Statistics - Resampling through Random Percentage Split
Statistics - Validity (Valid Measures)
Statistics - (Variance|Dispersion|Mean Square) (MS)
Data Mining - Variation (Change?)
viz
Statistics vs (Machine Learning|Data Mining)
Statistics - Random Walk
weather
Statistics - Z Scale
Statistics - Z Score (Zero Mean) or Standard Score
data_storage
db
dit
dokuwiki
ebs
electronic
epm
exadata
exalytics
file
geometry
glassfish
gradle
iam
ide
infra
io
jenkins
jmeter
jpa
lang
legal
linear_algebra
linkedin
management
mapr
mapviewer
marketing
markup
mathematics
maven
ml
monitoring
natural_language
netbeans
network
obia
observable
odbc
oracle_spatial
oracle_sql_developer_data_modeler
os
pdf
playground
plotly
powerdesigner
presentation
problem
process
project_management
protocol
resume
salesforce
sap
signup
soft_skills
spatial
sport
sql_loader
ssh
statistics
system
test
tomcat
tool
trigonometry
ui
vagrant
video
virtualbox
viz
vlc
vm
web
web_server
weblog
weblogic
windows
winscp
Anne Frank
Arduino
European Article Number Check PlSql
Business Intelligence Application
BI Technologie Map
Calibre
Camera
Business Intelligence - Competitive Intelligence
Conference
Dataiku
Ditto
DLNA
Dot File
Elearning - MOOC
Engineering
Equation
Exchange Web Service (Outlook)
Family Link
Foot
Gamification / Human Computation (Captcha, )
Gdb
guitare
WebSite Heatmap Index
Home Automation
Icalendar (Ical)
Inertia
Intelligence Artificial (IA) - Laws of Robotics
Internet of Things (IoT) / machine-to-machine (M2M) - Sensor
Internet Relay Chat Protocol ( IRC )
Keepass - Installation Configuration
Kid
Kindle Format
Lego
Magnet (Blackboard)
Data Operation - (Approximate|exact) Match
Mind Map
My Laptop / My Ipad
Nerd
Newsfeed
Icon - Nicon
Windows NT LAN Manager (NTLM)
Open Document Format (ODF)
Openshot
Microsoft - Outlook Configuration
Patent
Raspberry Pi
Robot
RTFM - read the fuck manual
Screencast Software (Screenrecorder)
Sharepoint
Business Method - Six Sigma
Sonos
Sound / Audio
Datacadamia - data all the way
Company - Strategy
Tom Kyte
Trading (Trading Financial Model)
V380 Video Camera
VPS - Virtual Private Server