Skip to content

Language Specific Application

Nicolay Rusnachenko edited this page Oct 11, 2023 · 13 revisions

Russian Language 🇷🇺

Infer sentiment attitudes from text file with further D3JS-based demo launch:

python3 -m arelight.run.infer  \
    --sampling-framework "arekit" \
    --ner-model-name "ner_ontonotes_bert_mult" \
    --ner-types "ORG|PERSON|LOC|GPE" \
    --terms-per-context 50 \
    --sentence-parser "ru" \
    --text-b-type "nli_m" \
    --tokens-per-context 128 \
    --bert-framework "opennre" \
    --batch-size 10 \
    --pretrained-bert "DeepPavlov/rubert-base-cased" \
    --bert-torch-checkpoint "ra4-rsr1_DeepPavlov-rubert-base-cased_cls.pth.tar" \
    --backend "d3js_graphs" \
    --d3js-host 8000 \
    -o "output" \
    --from-files "<PATH-TO-TEXT-FILE>"

Sentiment Analysis Pipeline: ARElight core is powered by AREkit framework, responsible for raw text sampling. To annotate objects in text, we use BERT-based models trained on OntoNotes5 (powered by DeepPavlov) For relations annotation, we support OpenNRE BERT models. The default inference is pretrained BERT with transfer learning based on RuSentRel and RuAttitudes collections, that were sampled and translated into English via arekit-ss.

Clone this wiki locally