Scrapes books.toscrape.com with BeautifulSoup4 and Requests, exports the data to .csv files, and downloads the cover images.
Implementation of the ETL process:
- Extract relevant and specific data from the source website;
- Transform, filter and clean data;
- Load data into searchable and retrievable files.
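The three steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names (`extract`, `transform`, `load`), the `RATING_WORDS` mapping, the CSS selectors, and the chosen columns are all assumptions about how a listing page might be handled.

```python
import csv
import io
import re

# Hypothetical mapping: assumes ratings appear as English words in the markup.
RATING_WORDS = {"One": 1, "Two": 2, "Three": 3, "Four": 4, "Five": 5}


def extract(html):
    """Extract raw fields from a listing page (the selectors are assumptions)."""
    # Third-party import kept local so transform/load run without bs4 installed.
    from bs4 import BeautifulSoup

    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for pod in soup.select("article.product_pod"):
        rows.append({
            "title": pod.h3.a["title"],
            "price": pod.select_one("p.price_color").get_text(),
            # Assumes a class attribute such as "star-rating Three".
            "rating": pod.select_one("p.star-rating")["class"][1],
        })
    return rows


def transform(raw):
    """Clean one raw row: trim the title, parse the price, map the rating."""
    price_match = re.search(r"\d+(?:\.\d+)?", raw["price"])
    return {
        "title": raw["title"].strip(),
        "price": float(price_match.group()) if price_match else None,
        "rating": RATING_WORDS.get(raw["rating"]),
    }


def load(rows, stream):
    """Write the cleaned rows as CSV to any text stream (file or StringIO)."""
    writer = csv.DictWriter(stream, fieldnames=["title", "price", "rating"])
    writer.writeheader()
    writer.writerows(rows)


# Usage with an in-memory buffer instead of a real file.
raw_row = {"title": " A Light in the Attic ", "price": "£51.77", "rating": "Three"}
buffer = io.StringIO()
load([transform(raw_row)], buffer)
```

In a real run, the HTML passed to `extract` would typically come from something like `requests.get(url).text`, and `load` would receive a file opened with `newline=""` rather than a `StringIO` buffer.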
Tested on Windows 10, Python 3.10.6.
Create a virtual environment: `python<version> -m venv mon_env_virtuel`
Activate the environment: `mon_env_virtuel\Scripts\activate` (Windows)
Install the dependencies: `pip<version> install -r requirements.txt`