RuShiftEval public data

This repository contains public data and code for the RuShiftEval'21 shared task. The shared task evaluated how well existing NLP systems can detect the degree of diachronic semantic change for Russian nouns.

The shared task is now officially finished.

We are publishing the gold annotations and the leaderboard scores.

Data

The RuShiftEval dataset provides semantic change scores for three pairs of time periods (more details in the paper):

  • pre-Soviet VS Soviet (RuShiftEval-1)

  • Soviet VS post-Soviet (RuShiftEval-2)

  • pre-Soviet VS post-Soviet (RuShiftEval-3)

The following files are published:

  • annotated_devset.tsv: gold semantic change estimates for the Development phase (aggregation of manually annotated data)

  • annotated_testset.tsv: gold semantic change estimates for the Evaluation phase (aggregation of manually annotated data)

  • leaderboard_results.tsv: a list of all Evaluation phase submissions with their corresponding scores.

The raw_annotations directory contains raw per-sentence annotator judgments.
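
As a starting point for working with the aggregated TSV files listed above, here is a minimal sketch that reads a tab-separated file using only the Python standard library. The helper name `read_tsv` is hypothetical, and the sketch deliberately assumes nothing about the column layout or the presence of a header row; inspect the first rows of each file before relying on column positions.

```python
import csv
from pathlib import Path


def read_tsv(path):
    """Read a tab-separated file into a list of rows (each row a list of strings).

    Quoting is disabled so that quotation marks inside a field are kept
    verbatim instead of being interpreted as CSV quoting.
    """
    with Path(path).open(encoding="utf-8", newline="") as handle:
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        # Skip completely empty lines, keep everything else untouched.
        return [row for row in reader if row]
```

For example, `rows = read_tsv("annotated_testset.tsv")` run from the repository root would return every non-empty line of that file as a list of string fields.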

Inter-rater agreement

Test set

Dataset         Krippendorff's alpha   Spearman rho
RuShiftEval-1   0.506                  0.521
RuShiftEval-2   0.549                  0.559
RuShiftEval-3   0.544                  0.556

Development set

Dataset         Krippendorff's alpha   Spearman rho
RuShiftEval-1   0.592                  0.613
RuShiftEval-2   0.609                  0.627
RuShiftEval-3   0.597                  0.632
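
To make the Spearman rho column more concrete, the sketch below computes Spearman's rank correlation between two sequences of judgments in plain Python. It only illustrates the statistic itself: how the figures in the tables were aggregated over annotators is not described in this README, and the function names `average_ranks` and `spearman_rho` are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the window over all items tied with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for position in range(start, end + 1):
            ranks[order[position]] = mean_rank
        start = end + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho, computed as the Pearson correlation of the rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equally long sequences with at least 2 items")
    rx, ry = average_ranks(x), average_ranks(y)
    mean_x, mean_y = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one sequence is constant")
    return cov / (var_x * var_y) ** 0.5
```

With made-up judgments from two annotators, `spearman_rho([1, 2, 3, 4], [1, 3, 2, 4])` evaluates to approximately 0.8, since the two rankings differ only by one swapped pair.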

Starting kits

In this directory, you will find submission templates for different phases of the shared task:

  • Practice phase
  • Development phase
  • Evaluation phase (will be published after February 22)

In particular, the templates provide you with the target words for which semantic change scores have to be predicted.
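
Filling in a template amounts to attaching predicted scores to each target word. The sketch below shows one way this could look. The layout it writes (one word per line followed by three tab-separated scores, one per period pair) is an assumption made for illustration, and `format_submission` and `score_fn` are hypothetical names; the templates in the starting kits remain the authoritative reference for the expected format.

```python
def format_submission(target_words, score_fn):
    """Return tab-separated lines: a word followed by one score per period pair.

    score_fn is any callable that maps a word to three numbers, assumed here
    to correspond to RuShiftEval-1, RuShiftEval-2 and RuShiftEval-3.
    """
    lines = []
    for word in target_words:
        scores = score_fn(word)
        if len(scores) != 3:
            raise ValueError(f"expected 3 scores for {word!r}, got {len(scores)}")
        lines.append("\t".join([word] + [f"{score:.3f}" for score in scores]))
    return "\n".join(lines) + "\n"
```

A system would pass its own prediction function as `score_fn`; a constant function such as `lambda word: (1.0, 1.0, 1.0)` is enough to check that the output has the intended shape.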

Baselines

In this directory, you will find code for various baseline approaches for semantic change detection.

References

@inproceedings{rushifteval_dataset,
title = "Three-part diachronic semantic change dataset for {R}ussian",
author = "Kutuzov, Andrey and Pivovarova, Lidia",
booktitle = "Proceedings of the 2nd International Workshop on Computational Approaches to Historical Language Change",
year = "2021",
address = "online",
publisher = "Association for Computational Linguistics",
}

@inproceedings{rushifteval2021,
title = "Ru{S}hift{E}val: a shared task on semantic shift detection for {R}ussian",
author = "Kutuzov, Andrey and Pivovarova, Lidia",
booktitle = "Computational linguistics and intellectual technologies: Papers from the annual conference Dialogue",
year = "2021",
}
