Data - Unstructured data - (structure-later|schema-later|schema on read) data


With schema-later data (as semi-structured data), we apply a schema after we read the data.

The knowledge of the schema is delegated at the code.

(Unstructured|Schema never)

Structured data is data organized into a schema such as tabular (rows, columns) whereas unstructured data has no pre-defined schema and therefore does not fit well into relational model. Unstructured data is typically text that you found in various forms.

Unstructured data is typically more lengthy and “verbose” than structured data. This verbosity of data can lead to loss of context when viewing results. Unstructured data search explores a number of facets and attributes, not just a single one. Also, unstructured data is often geared towards concepts not numbers.

Example of unstructured data container

  • Email,
  • documents,
    • Extensible Markup Language content
  • presentations,
  • web content
  • and social media.

Documentation / Reference

Powered by ComboStrap