Recent Articles Changed
Time - (Unix|POSIX|Epoch) time

Modified Wednesday 18-06 07:50

Unix time (also known as POSIX time or Epoch time) is a system for describing instants in time. It's the number of milliseconds / second since a start date (generally Thursday, 1 January 1970) but if you start your own epoch, you may change it to get...

Excel Transpose Pivot
Dimensional Data Operation - (Pivot|Transpose|Cross-tab|Matrix)

Modified Monday 16-06 01:34

The pivot is an dimensional data operation where rows and columns are interchanged. Pivot is also known as: Transpose in the linear algebra word Matrix in the Microsoft world Cross-tab This operation is typically performed by the visualization...

Card Puncher Data Processing
Oracle Database - (Pivot|Unpivot) Statement

Modified Monday 16-06 01:31

Pivot statement in Oracle The column from the first select will be created with as prefix the value of the column in the for statement of the pivot statement. Unpivot can be used to see all column value in one column. You define the new column...

Card Puncher Data Processing
Data Processing - (Pipeline | Compose | Chain)

Modified Monday 16-06 07:56

A pipeline is a finite or infinite automata known as a stream) where: the data transition (pipe) from one state to another via a series of transformations (work) A pipeline creates a composition relationship. A pipeline is also known as: ...

Event Centric Thinking
What is a Stream? Also known as Pipe, Message Queue or Event Processing

Modified Monday 16-06 07:49

A stream is: a sequence of aninfinite cardinality (size) delivered atunknown time intervals. list In processing, a stream is a pipeline where the source cardinality is infinite (ie polled continuously to see if there is anything) Streams...

Event Centric Thinking
What is Stream Processing?

Modified Monday 16-06 07:45

Stream processing is the reactive design of data processing. Something happened (Event), Subscribe to it (Streams) Streaming Processing is also known as : Incremental Processing. or reactive stream (the reactive specification) It's called...

Stream Vs Batch
Stream vs Batch

Modified Monday 16-06 07:43

This article talks Stream Processing vs Batch Processing. The most important difference is that: in batch processing the size (cardinality) of the data to process is finite. The cardinality may be not known but will end at some time in the future,...

Card Puncher Data Processing
Functional Programming - Reduce - Reduction Operation (fold)

Modified Sunday 08-06 03:01

A (reduction|reduce) operation (also called a fold) is a functional programming function. Reduction operations are terminal operations. They takes a sequence of input elements and returns: a single summary result a collection or a map Example:...


Recent Articles Created
What is a Mail eXchange (or MX)?

Created Wednesday 05-03 01:45

A Mail eXchange (MX) is a smtp server that is responsible for : receiving emails for one or more domain and delivering them to a mail delivery agent (MDA) for final delivery The email servers IP are set in the DNS via the MX record. This is how...

Compiler
Compiler - Evaluate

Created Monday 03-02 00:22

To evaluate means to: transform an expression into a value according to the language rules

X.509 Public Key Infrastructure / Hierarchical PKI

Created Monday 03-02 09:04

The X.509 Public Key Infrastructure is a Public key infrastructure (PKI) that uses a hierarchical system of Certificate authorities that brings trust to the transactions by signing certificate. A Hierarchical PKI consists of: A certificate authority...

Card Puncher Data Processing
CoreUtils

Created Friday 01-11 01:48

Corutils is a package of essential executable command line client (used greatly in shell script) See also Utils Linux Name Description arch Print machine hardware name. b2sum Print or check BLAKE2 digests....

Card Puncher Data Processing
NeoVIM VI editor

Created Friday 11-10 08:38

NeoVIM is the latest VI editors in the series that enhanced VIM Installation Once installed the NeoVim editor is called via the nvim executable.

Card Puncher Data Processing
Ansible - Action Plugin Module Type

Created Friday 30-08 10:11

An Action Plugin is a type of module that: set up some values on the controller, (ie create a start state) then invoke an other module on the managed node that does something with these values.

Card Puncher Data Processing
Ansible - Ad-hoc command

Created Friday 30-08 10:11

Adhoc command are command executed as in the shell via ansible. You can therefore execute command on a whole cluster of server. This is because the default module of the ansible command line is command We define the servers in a ini inventory...

Card Puncher Data Processing
Ansible - ansible-playbook cli

Created Friday 30-08 10:11

You may get this problem when one of your file that defines ansible property has no content (ie is empty)


Data all the way

Always keep one hand firmly on data, Amos liked to say.

Data is what set psychology apart from philosophy, and physics from metaphysics.

Amos Tversky (The undoing project)
Data Warehouse Subsystem

Data Warehouse Subsystems

Developing a data warehouse means assembling a lot of subsystems in order to create a whole and coherent data application.

Discovers the 34 Kimball Subsystems and drill down to discover them

Data Warehouse Subsystems

Card Puncher Data Processing

Data Processing

Data processing has changed a lot since 1940 :) and luckily, we are not using punch card anymore.

Stream processing is becoming the norm in data integration task while batch processing stays the king in data analysis.

Data Processing

ENIAC, 1946

The first fully electronic digital computer

Eniac 1946

Table Recursion Rectangle

Data Modeling

Data Modeling is the translation of process into the world of data. The process is tricky but not insurmountable.

Data Modeling

Snowflake
Dimensional Data Modeling

Dimensional Data Modeling permits you to model a (hyper)cube and analyse a process through different perspective. You define dimensions, measures and metrics.

Dimensional Data Modeling

Most programs process some input to produce some output; that’s pretty much the definition of computing.

Function Home

Function

A Function is the basic block of all reusable code component. It is also becoming the central component of any serverless architecture.

Function

Knowledge isn't free. You have to pay attention.

Computing Division at the Department of the Treasury, mid 1920s

Computing Division At The Dept Of The Treasur 1920

Remington Rand Univac

Medieval Counting Table

Table / Relation

The table format (known also as a relation) is the most important data structure used in data analysis. Did you know that its name originate from the medieval counting table ?

Table

Education isn't something you can finish.