XML - Character

Card Puncher Data Processing


character in XML.

Characters are denoted using the notation used in the Unicode Standard, that is, an optional U+ followed by their hexadecimal number, using at least 4 digits, such as “U+1234” or “U+10FFFD”. In XML or HTML this could be expressed as “ሴ” or “􏿽”.

Legal characters are:

XML processors accept any character in the range specified for Char. All XML processors accept the UTF-8 and UTF-16 encodings of Unicode

Type of character


A character reference refers to a specific character in the unicode character set, for example one not directly accessible from available input devices.


'&#' [0-9]+ ';'


'&#x' [0-9a-fA-F]+ ';'


of character reference:

Type <key>less-than</key> (&#x3C;) to save options.

where 3C is the LESS-THAN SIGN Math Symbol


Character data are all text that is not XML markup or comment.

You can define explicitly markup or comment as being character data with a character data section.


The ampersand character (&) and the left angle bracket (<) MUST NOT appear in their literal form, except when used as:

If they are needed elsewhere, they MUST be escaped using either:

The right angle bracket (>) may be represented using the string “ &gt; ”, and MUST, for compatibility, be escaped using either “ &gt; ” or a character reference when it appears in the string “ ]]> ” in content, when that string is not marking the end of a CDATA section.

Documentation / Reference

Discover More
Card Puncher Data Processing
XML - Character data (CDATA) - Escape

All text that is not markup or comment constitutes the character data of the document (known as CDATA). CDATAPCDATAchild If you want to embedded reserved characters xml character in your document,...
Card Puncher Data Processing
XML - Comment

Comments may appear anywhere in a document outside other markup. They are not part of the document's character data Note that the grammar does not allow a comment ending in --->. The following example...
Card Puncher Data Processing
XML - Entity (Physical Storage)

An XML document may consist of one or many physical units (storage) that are called entities. Entities may be either parsed or unparsed. A parsed entity contains text, a sequence of characters,...
Card Puncher Data Processing
XML - Markup

The function of the markup in an XML document is to describe its structure: storage (physical) and logical structure (and to associate attribute name-value pairs with its logical structures). Markup...
Card Puncher Data Processing
XML - Text

Text consists of intermingled: character data and markup. text is the data that happens between Element tags In the DOM, text are stored in the text node
Card Puncher Data Processing
Xml - Infoset

The XML Infoset: is a tree-based hierarchical representation of an XML document. is the abstract data and metadata (ie abstract means independently of this representation, independent of the actual...

Share this page:
Follow us:
Task Runner