Machine learning tool

This repo is no longer maintained.

Machine learning tool

Flask api and the code to retrain the model, which requires data, both extracted out of SIA and some dumps out of old systems. For data, contact: m.sukel@amsterdam.nl

installation (ML train tool)

pip install -r requirements-train.txt

installation

Use the requirements.txt to run (flask) endpoint locally. This step can be skipped if you are using the docker container.

pip install -r requirements.txt

input data

csv input file with at least the following columns:

column	description
Main	Main category
Middle	Middle category
Sub	Sub category
Text	message

training model

navigate to app folder See python train.py for all options.

To train Middle and Sub categoeries use:

python train.py --csv file.csv --columns Middle,Sub

This step will generate a categories json file. Use this file to load the categories in the backend.

python manage.py load_categories <file.json>

To train Middle category use:

python train.py --csv file.csv --columns Middle

Rename resulting files to "main_model.pkl, sub_model.pkl, main_slugs.pkl, sub_slugs.pkl" and copy the pkl files into the classification endpoint.

running service

To load new model into flask (copy into app folder)

file	description
main_model.pkl	model for main category
sub_model.pkl	model for sub category
main_slugs.pkl	slugs for main category
sub_slugs.pkl	slugs for sub category

run docker-compose build

To activate the flask api run:

docker-compose up -d

To test the current loaded model, open web_pages/index.html or POST "text" to http://localhost:8140/signals_mltool/predict with the flask app running.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
app		app
notebook		notebook
web_pages		web_pages
.gitignore		.gitignore
Dockerfile		Dockerfile
Jenkinsfile		Jenkinsfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements-train.txt		requirements-train.txt
requirements.txt		requirements.txt
test.sh		test.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine learning tool

installation (ML train tool)

installation

input data

training model

running service

About

Releases 1

Packages

Contributors 4

Languages

License

Signalen/classification

Folders and files

Latest commit

History

Repository files navigation

Machine learning tool

installation (ML train tool)

installation

input data

training model

running service

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 4

Languages

Packages