Updated Jun 27, 2023
nlp-dataset
Here are 13 public repositories matching this topic...
A novel Romanian-language dataset for offensive message detection, with comments from a local Romanian news website (Stiri de Cluj) manually annotated into five classes.
Updated Jun 13, 2023
RO-Offense: A Novel Romanian Dataset for Offensive Language in Online Comments
Updated Feb 20, 2023 - Python
Repository for the LREC-COLING 2024 Paper: Persona-Based Corpus in the Diabetes Mellitus Domain – Applying a Human-Centered Approach to a Low-Resource Context
Updated Mar 24, 2024
Persian News Dataset
Updated Aug 15, 2022
Persian SMS dataset
Updated Aug 15, 2022
Persian Slang Words (dataset)
Updated Aug 15, 2022
A metadata-enriched dataset of German parliamentary debates covering 74 years of plenary protocols.
Updated Mar 4, 2024 - Python
Get a pragmatic assessment of how understandable a German text is.
Updated Sep 12, 2024 - Jupyter Notebook
Dataset for web-scale information extraction.
Updated Jul 26, 2023 - Python
A list of Romanian NLP Datasets
Updated Oct 18, 2024
AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African Languages: https://afrisenti-semeval.github.io/
Updated Jan 10, 2024 - Jupyter Notebook
Persian Swear Dataset: a dataset of inappropriate and offensive Persian words that you can use in production to filter unwanted content from text.
Updated Sep 30, 2024 - C#