Para acessar a versão em português, clique aqui.
This project aims to extract, refine, and geocode data about Brazilian biofuel plants obtained through Certificates of Efficient Biofuel Production
issued by accredited inspection firms under RenovaBio, approved by the National Agency for Petroleum, Natural Gas and Biofuels (ANP).
The data generated after all processing can be used in geospatial analysis, with a certain margin of error in relation to the plants' locations.
The data pipeline was built using kedro 0.18.12.
- Extraction of raw data from efficient biofuel production certificates issued by ANP.
- Refinement of raw data, including data type standardization and arrangement.
- Geocoding of refined data using Selenium and Google Maps.
- Map visualization of the geocoded data using Folium.
The required libraries for project execution are listed in the src/requirements.txt
file. To install them, use the following command:
pip install -r src/requirements.txt
Additionally, you need to have the latest version of geckodriver in a known directory.
To execute the project, use the command:
kedro run
To execute a specific pipeline, use the command:
kedro run --pipeline <pipeline-name>
LinkedIn: Daniel Rodrigues