Text - Structure

Data System Architecture


When you find a structure in a string, there is a language.

It can be a loosely language such as:

When the string describes a computer format, the string/text is called a document


The schema is expressed generally via Regular Expressions.

For computer language, a grammar is composed of lexer/parser rule that are just Regular Expressions

But you may find parser that defines the structure manually at hand in the code for more grain control.

A regular expression is implemented as finite automata that's why when parsing any type of text (xml, json,..) with a library you are going from state to state.

Recommended Pages
Data System Architecture

A character is an atomic unit of text as specified by ISO/IEC 10646:2000 [ISO/IEC 10646] Every unit of text (character) is assigned a unique integer known as a code point. All the characters within a...
Card Puncher Data Processing
Markup Language (ML)

Markup language (xML) are rich text language (ie to turn text into visual) such as: images, links, tables, lists, and much more. By marking up the text, they give it a structure that:...
Data System Architecture
What is a document ?

The concept of document can be difficult to grasp. This articles gives an easy definition that fits the computer science world.

Share this page:
Follow us:
Task Runner