About
A document is:
- a text
- that may be located
- in the content of a file,
- in the body of an HTTP request
- in a database
- in a string (memory)
- … and more
Location
Web
Documents on the web (saved in a body request) have generally by nature a hierarchical nature (as opposed to tabular)
This is generally a HTML document but it can be also other type of document such as:
- an email
- an XML document
- an SGML instance.
- an SVG document
- XUL
- …..
This document (file) can be rendered by the user agent (Generally a browser)
File System
The term document in a file system is generally a file including its structure and format, e.g. fonts, colors, and images.
Database
Database can store document in a field or as a row and offers function for document processing. See Database - Document Database
Natural Language Process / Full Text Search
In natural language processing / full text search, a document is composed of word (term) known also as bag of word. See Natural Language - Document