🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
-
Updated
Nov 29, 2024 - Python
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
🦜 NLP for Tibetan, in Python.
repo for Tibetan corpora
[ACL'24] MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
Tibetan phonetics engine in Python
Basic Universal Dependencies Part-of-Speech Tagger for Tibetan
This Tibetan tokenizer based on Bi-LSTM+CRF methods, it was created with the aim of aiding researchers in the field of Tibetan natural language processing.
This app is a first step toward providing effective machine translation for the Classical Tibetan corpus of important religious, philosophical, and historical texts that were nearly lost during the invasion of Tibet.
An application of PyBo to Tibetan Spell-Checking
syllable-based diffs that make use of google's diff-match-patch and pybo's preprocess
Add a description, image, and links to the tibetan-nlp topic page so that developers can more easily learn about it.
To associate your repository with the tibetan-nlp topic, visit your repo's landing page and select "manage topics."