UD_Czech-Poetry contains random samples of Czech 19th-century poetry from the Corpus of Czech Verse parsed with UDPipe2 (trained on UD Czech-PDT 2.11) and manually corrected.
The treebank consists of 29 randomly selected poems from the Corpus of Czech Verse parsed with UDPipe 2 (trained on UD Czech-PDT 2.11) and manually corrected to comply with the UD release of the FicTree treebank.
This work was supported by the Czech Science Foundation grant No. 23-07727S, European Poetry: Distant Reading. This work has used the tools and data provided by the LINDAT/CLARIAH-CZ project LM2023062; formerly LM2010013, LM2015071, LM2018101, supported by the Czech Ministry of Education, Sports and Youth under the programme LM of "Large Infrastructures".
- Tomáš Jelínek / Daniel Zeman (2022): UD_Czech-FicTree (v2.10) https://github.com/UniversalDependencies/UD_Czech-FicTree .
- Jelínek, Tomáš (2017): FicTree: A Manually Annotated Treebank of Czech Fiction. in: ITAT (= CEUR Workshop Proceedings). CEUR-WS.org. 181–185. (= CEUR Workshop Proceedings).
- Plecháč, Petr / Kolár, Robert (2015): "The corpus of Czech verse", in: Studia metrica et poetica 2 (1): 107–118.
- 2024-11-15 v2.15
- Nouns no longer distinguish Polarity. Negative nouns have negative lemmas.
- Conditional auxiliary "by" does not have Person (besides 3, it could be also 2).
- Short forms of adjectives now have Degree=Pos (instead of no Degree).
- Disambiguated NumType=Mult,Sets.
- 2023-11-15 v2.13
- Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================ Data available since: UD v2.13 License: CC BY-SA 4.0 Includes text: yes Genre: poetry Lemmas: manual native UPOS: manual native XPOS: automatic Features: manual native Relations: manual native Contributors: Cinková, Silvie Contributing: here Contact: cinkova@ufal.mff.cuni.cz ===============================================================================