German NLP dataset for classification models in risk management

The dataset provides natural language data for training binary risk classification models in the German language. It contains labelled text data for both risks and chances (opportunities).

License: CC-BY-4.0, see https://choosealicense.com/licenses/cc-by-4.0/

Purpose and characteristics of the data set

Goal: Provide natural language text data for training binary classification models in risk management applications
Motivation: Too few publicly available data sets cover both the German language and risk classes
Labelling: Each record of the text data set carries one label: 1 = the issue is a risk, 0 = the issue is a chance (opportunity)
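
The labelling scheme above can be sketched in Python. This is a minimal sketch, not the repository's own loader: the on-disk layout of the data file is not described here, so the tab-separated "text, then label" layout, the `parse_records` helper, and the two German sample sentences are all assumptions made up for illustration. Adjust the separator and column order to the actual file.

```python
from typing import Iterable, List, Tuple

# Label scheme taken from the README: 1 = risk, 0 = chance.
LABEL_NAMES = {1: "risk", 0: "chance"}


def parse_records(lines: Iterable[str], sep: str = "\t") -> List[Tuple[str, int]]:
    """Parse lines of the ASSUMED form '<text><sep><label>' into (text, label) pairs."""
    records = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Split on the last separator so the text itself may contain the separator.
        text, _, raw_label = line.rpartition(sep)
        label = int(raw_label)  # raises ValueError if the label is missing or malformed
        if label not in LABEL_NAMES:
            raise ValueError(f"unexpected label {label!r} in line: {line!r}")
        records.append((text, label))
    return records


# Hypothetical sample records, invented for illustration (not taken from the data set).
sample = [
    "Der Lieferant könnte ausfallen.\t1",
    "Ein neuer Markt könnte erschlossen werden.\t0",
]
records = parse_records(sample)
for text, label in records:
    print(LABEL_NAMES[label], "|", text)
```

To apply the same helper to a real file, pass an open file handle (opened with `encoding="utf-8"`) instead of the `sample` list.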

Current status of the data set

Number of text data records labelled as "risk": 503
Number of text data records labelled as "chance": 503
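
Because both classes have the same number of records, a train/test split that samples each class separately keeps that balance in both parts. The following is a minimal sketch using only the Python standard library. The `stratified_split` helper, the 20% test fraction, and the seed are assumptions for illustration, and the label list is a synthetic stand-in that only mirrors the counts stated above.

```python
import random
from collections import Counter
from typing import Dict, List, Sequence, Tuple


def stratified_split(
    labels: Sequence[int], test_fraction: float = 0.2, seed: int = 0
) -> Tuple[List[int], List[int]]:
    """Return (train_indices, test_indices), sampling each class separately."""
    rng = random.Random(seed)
    by_class: Dict[int, List[int]] = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    train: List[int] = []
    test: List[int] = []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_test = round(len(indices) * test_fraction)
        test.extend(indices[:n_test])
        train.extend(indices[n_test:])
    return sorted(train), sorted(test)


# Synthetic labels mirroring the stated counts: 503 "risk" (1) and 503 "chance" (0).
labels = [1] * 503 + [0] * 503
counts = Counter(labels)
train_idx, test_idx = stratified_split(labels)
test_counts = Counter(labels[i] for i in test_idx)
print(counts, len(train_idx), len(test_idx), test_counts)
```

With 503 records per class and a 20% test fraction, each class contributes 101 records to the test part, so the test part stays balanced.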

Files in the repository: README.md, nlp-risk-management-data-set-german-language.txt

Repository: michael-eble/german-nlp-dataset-risk-management

Folders and files

Latest commit

History

Repository files navigation

German NLP dataset for classification models in risk management

Purpose and characteristics of the data set

Current status of the data set

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages