This repository contains the source code for a webcrawler developed for extracting data regardin identity documents from PRADO open source database In practice, for each page containing documents informations, these are stored in a JSON object and later persisted into a database.