Text - Lowercase characters

Data System Architecture

About

lowercase characters are characters with a lower case.

Management

Unicode

In unicode, lowercase is a category. See Lowercase

Example from a u+0061 to z u+007A

Regexp capture

lowercase characters can be captured with:

[:lower:]  # for a to z
  • or expression
[a-z]+





Discover More
Card Puncher Data Processing
Antlr - Token

org/antlr/v4/runtime/Token A token can be defined via: a or the A token is primarily defined via a lexer rule (Lexical rule) Example: the lexical rule LOWERCASE that capture a string...
Text Mining
Natural Language Processing - (Tokenization|Parser|Text Segmentation|Word Break rules|Text Analysis)

Tokenization is the process of breaking input text into small indexing elements – tokens. Parsing and Tokenization are often call Text Analysis or Analysis in NLP. The tokens (or terms) are used either:...
Regexp
Regexp - Character Class (Character Set)

A character class defines a domain of permitted characters. character set ASCII characters with square brackets where: [ is the start character class definition ] is the end character class...
Data System Architecture
Text - String

A string is a finite sequence of characters with the same coding representation (ie character set). As a sequence is, a string is also characterized as an abstract data structure case...



Share this page:
Follow us:
Task Runner