Please find in this repo:
-
Logbooks
-
Runs for sprint 1,2 and 3
-
The Doc2Vec notebook
-
the lemmatization script
-
The main notebook (main_notebook_ml_project.ipynb)
-
The final presentation (ml_final_presentation.pptx)
-
The 3 notebooks used to balance and clean the data set (rbc_notebook_sprintX.ipynb)
-
Notebooks for BERT can be found under runs_Sprint3/BERT