Data Analysis - ad-hoc queries (exploratory)

Data System Architecture

About

An ad-hoc query is a query created on the fly by an analyst in order to answer specific business questions.

It's also known as:

  • speed-of-thought analysis
  • Impromptu query

This queries are:

  • constructed to answer immediate and specific business questions.
  • not known in advance.
  • exploratory ad-hoc queries

Performance

The central difference between ad hoc queries and reporting queries is the limited degree of foreknowledge that is available in order to tune the database administrator for them

As this queries are not known in advance, the database administrator (DBA) cannot optimize the system specifically for this set of queries. Consequently, execution time for those queries can be very long.

For instance, it doesn’t make much sense to use materialized views — for those kinds of queries it’s better to just keep all the raw events, and to build databases which can scan over them very quickly. Modern column stores have become very good at that.

Poke SQL - Query Performance

Type

Olap

OLAP queries allow for the exploration and analysis of business data to discover new and meaningful relationships and trends.

While this class of queries is similar to the “Ad hoc Queries” class, it is distinguished by a scenario-based user session in which a sequence of queries is submitted. Such a sequence may include both complex and simple queries.

Data mining

joins and large aggregations that return large data result sets for possible extraction.

Documentation / Reference





Discover More
Analytics Functions Snapshot
Analytic (Data Analyst)

A Data Analyst sees what data he has. A Data Scientist imagines what data he is lacking. Information analysis provides insight into : the past the present and the future of the business. See...
Data System Architecture
Cube - Multi-dimensional query (OLAP query)

An multi-dimensional query is a query that runs against a multi-dimensional structure (ie cube). It is also known as: an OLAP query. a business query multi-dimensional query are analytics query...
Data System Architecture
Data Analysis - Exploratory Data Analysis (EDA|Data exploration|Discovery)

Data exploration aims to have first look of the data. This is an interactive process (iterative) that is based on a serie of ad-hoc queries. The inverse of exploratory is Explanatory boxuancui/DataExplorer/DataExplorer...
Data System Architecture
Data Analysis - Reporting Query

Reporting queries are query: executed periodically to create well-known, pre-defined metrics that will answer questions the financial and operational health of a system (business). Although...
Cost Genome Sequencing Vs Moore Laws
Data Science - Big Data

Big Data describes data defined in terms of the 3Vs: volume, (A lot, Internet-scale data set.) velocity, (Quick) and variety. (In a lot of structure) Doug Laney of Gartner originally defined the...
Data System Architecture
SQL - Query (SELECT)

A query is a SQL statement that begins with the SELECT word. A stored query is called a view. The result of a query is stored in a result table, called the result-set. A query is itself a Relations...
Tpc H Schema
TPC - TPC-H Decision Support Benchmark Sample Schema

TPC-H is an ad-hoc, decision support benchmark. It consists of a suite of business oriented ad-hoc queries and concurrent data modifications. The TPC-H benchmark...
Card Puncher Data Processing
What is Data Processing (Data Integration)?

Card puncher Data processing is a more general term for manipulating data whereas data integration is the integration...



Share this page:
Follow us:
Task Runner