bert4srl

This repository shows a simple way to fine-tune BERT for Semantic Role Labeling (SRL). It is meant primarily as a minimal, readable example of the fine-tuning workflow, using SRL as the target task.

Requirements

  • Python version >= 3.8
  • Hugging Face Transformers >= 4.17.0

Data

This model was tested on the Universal Proposition Banks dataset. Compatibility with CoNLL-05, CoNLL-09, and CoNLL-12 (all of which are licensed datasets) can be added by creating the appropriate token objects for data pre_processing, as sketched below.
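
To give an idea of what such a pre-processing object could look like, here is a purely hypothetical sketch of a token class for the CoNLL-09 column format. The actual base class, field names, and parsing logic used by this repository live in pre_processing/ (compare the EN_CoNLLUP_Token type used in the command below) and will differ.

from dataclasses import dataclass

@dataclass
class CoNLL09Token:
    """One row of a CoNLL-09 file, keeping only the fields needed for SRL (hypothetical)."""
    idx: int            # token position in the sentence
    form: str           # surface word
    lemma: str
    pos: str
    is_predicate: bool  # True if this token fills a predicate slot
    arg_labels: list    # one argument label per predicate in the sentence

    @classmethod
    def from_line(cls, line: str) -> "CoNLL09Token":
        cols = line.rstrip("\n").split("\t")
        # Column positions follow the official CoNLL-09 layout
        # (ID, FORM, LEMMA, PLEMMA, POS, ..., FILLPRED, PRED, APREDs).
        return cls(
            idx=int(cols[0]),
            form=cols[1],
            lemma=cols[2],
            pos=cols[4],
            is_predicate=cols[12] == "Y",
            arg_labels=cols[14:],
        )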

Usage

Pre-processing

python pre_processing/conll2json.py \
            --source_file data/en_ewt-up-dev.conllu \
            --output_file data/en_ewt-up-dev.jsonl \
            --src_lang "<EN>" \
            --token_type EN_CoNLLUP_Token
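
A quick way to sanity-check the conversion is to load the first record of the resulting JSONL file. The field names are whatever conll2json.py emits and are not assumed here; the snippet only prints the keys it finds.

import json

# Inspect the first pre-processed record; the schema comes from conll2json.py.
with open("data/en_ewt-up-dev.jsonl", encoding="utf-8") as f:
    first_record = json.loads(next(f))

print(sorted(first_record.keys()))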

Train a Model

python3 finetune_bert.py \
            --train_path data/en_ewt-up-train.jsonl \
            --dev_path data/en_ewt-up-dev.jsonl \
            --save_model_dir saved_models/MBERT_SRL \
            --epochs 10 \
            --batch_size 16 \
            --info_every 100 \
            --bert_model bert-base-multilingual-cased
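
Under the hood, fine-tuning BERT for SRL labels is an instance of token classification. The following self-contained sketch illustrates that general recipe with the Transformers API on a toy sentence; it is not the repository's actual training loop, and the label set, sentence, and hyper-parameters are made up for illustration.

import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

labels = ["O", "ARG0", "ARG1", "V"]                 # toy label inventory
model_name = "bert-base-multilingual-cased"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForTokenClassification.from_pretrained(model_name, num_labels=len(labels))

words = ["The", "cat", "chased", "the", "mouse"]
word_labels = [1, 1, 3, 2, 2]                       # indices into `labels`

# Tokenize pre-split words and align word-level labels to sub-word pieces,
# masking extra pieces with -100 so they are ignored by the loss.
enc = tokenizer(words, is_split_into_words=True, return_tensors="pt")
aligned = [-100 if wid is None else word_labels[wid] for wid in enc.word_ids()]
enc["labels"] = torch.tensor([aligned])

optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
out = model(**enc)          # forward pass returns the cross-entropy loss
out.loss.backward()         # one illustrative optimization step
optimizer.step()
print(float(out.loss))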

Make Predictions

python3 predict.py -m saved_models/EN_BERT_SRL --epoch 10 --test_path data/en_ewt-up-test.jsonl
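
Conceptually, prediction amounts to loading the fine-tuned checkpoint and taking the arg-max over the token-classification logits for each word. The sketch below assumes the saved directory can be loaded with from_pretrained; the actual checkpoint layout and label mapping produced by finetune_bert.py may differ.

import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

model_dir = "saved_models/EN_BERT_SRL"              # assumed to be loadable via from_pretrained
tokenizer = AutoTokenizer.from_pretrained(model_dir)
model = AutoModelForTokenClassification.from_pretrained(model_dir)
model.eval()

words = ["The", "cat", "chased", "the", "mouse"]
enc = tokenizer(words, is_split_into_words=True, return_tensors="pt")
with torch.no_grad():
    logits = model(**enc).logits
pred_ids = logits.argmax(-1)[0].tolist()

# Map sub-word predictions back to words (keep the first piece of each word).
seen = set()
for wid, pid in zip(enc.word_ids(), pred_ids):
    if wid is not None and wid not in seen:
        seen.add(wid)
        print(words[wid], model.config.id2label[pid])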
