Table of Contents

Web Robot - Crawler

What is a web crawler ?

A web crawler is an crawler application that reads web resources (mostly a web page) and parse them to extract meaningful information.

Steps

A crawl cycle consists of 4 steps:

Crawler needs to respect the rate limiting configuration.

Implementation

Crawler are build with a headless browser library

Example:

List

Documentation / Reference