- This project requires Ollama, so ensure it is installed and running.
- For web scraping, you need to have Mozilla Firefox installed.
- Install requirements by running
pip install -r requirements.txt
in the directory of this project. It is recommended you do this in a virtual environment. - To replicate our tests, first delete the
Data
,Results
, andResponses
folders. - Run
main.py
to perform the experiments. Sometimes, during the web scraping Selenium using Mozilla Firefox may hang, and not move onto the next news article. In this case, simply restart the script, and it will continue from where it left off.-f
or--forecast
- Number of weeks to forecast. Defaults to twelve.-w
or--width
- The width of the figures. Defaults to five.-l
or--height
- The height of the figures. Defaults to four.-d
or--decimals
- The number of decimal spaces. Defaults to two.
- Under the
Data
folder, you will see the summaries of the news articles produced by the large language model. Under theResults
folder, you will see the results charts and plots.Actual.csv
- The actual COVID-19 hospitalizations to occur over the next given weeks from a given week.Baseline.csv
andFull Model.csv
- The baseline model and full model predictions of how many COVID-19 hospitalizations to occur over the next given weeks from a given week.Difference Baseline.csv
andDifference Full Model.csv
- The difference between the actual results and each of the model results.Success Rate.csv
- The success rate of each model, where success was determined if a forecast met or exceeded the actual amounts of hospitalizations to occur over a period.Average Difference.csv
- The average difference each model had from the actual amounts of hospitalizations to occur over a given period.Total Failures.csv
- The total failures which occurred for each model over a forecasting period.Total Excess.csv
- The total excess which occurred for each model over a forecasting period.
The data used for this experiment is taken from Covid Timeline Canada.