NLI Task with Adversarial Data Augmentation

This is part of the Multi Lingual Natural Language Processing exam of year 2023/2024 in M.Sc. Artificial Intelligence and Robotics.

Given Task

Design and implement a transformer-based model to perform Natural Language Inference on a subset of FEVER Dataset and in Adversarial Test set.

Report

To have a more comprehensive insight on the proposed solution and data augmentation pipeline please refer to MLNLP Adversarial Task Report

Approach

Model based on a finetuned distilBERT model (encoding head) along with a MLP classifier. It is also required to augment the data in order to perform better on the adversarial test.

Data Augmentation

The data augmentation pipeline consists of two steps:

Premises and Hypotheses editing with synonyms substitution of adjectives, nouns, verbs, and adverbs;
Neutral hypotheses generation with GPT-2 pretrained model

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
src		src
LICENSE		LICENSE
README.md		README.md
augment.py		augment.py
main.py		main.py
report.pdf		report.pdf
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLI Task with Adversarial Data Augmentation

Given Task

Report

Approach

Data Augmentation

About

Languages

License

dan-crdll/nli_adversarial_FEVER

Folders and files

Latest commit

History

Repository files navigation

NLI Task with Adversarial Data Augmentation

Given Task

Report

Approach

Data Augmentation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages