Natural Language Inference: BOW, Word Embeddings, and Symbolic Experiments on the ASSIN-2 Dataset

In this repo, we conduct a preliminary analysis of different methods for the Recognizing Textual Entailment (RTE) task in Portuguese. We use the ASSIN-2 dataset as a benchmark to evaluate our models. Our work combines several text representation approaches, including bag of words and word embeddings, with machine learning models. We also present a rule-based approach. Our best result came from the BERTimbau-large model fine-tuned on ASSIN-2, which reached an F1 score of 0.89, about one percentage point below the current state of the art. Our ongoing experiment aims to combine these approaches to leverage their full potential.
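To make the bag-of-words and rule-based ideas concrete, here is a minimal sketch using only the Python standard library. It is not the code from this repo: the function names (`bag_of_words`, `hypothesis_coverage`, `predict_entailment`), the 0.7 threshold, and the four toy sentence pairs are all assumptions made for illustration, and the pairs are not taken from ASSIN-2. The rule labels a pair as entailment when most hypothesis tokens also appear in the premise, and the result is scored with F1 on the entailment class.

```python
from collections import Counter


def bag_of_words(sentence: str) -> Counter:
    """Lowercase, split on whitespace, and count tokens."""
    return Counter(sentence.lower().split())


def hypothesis_coverage(premise: str, hypothesis: str) -> float:
    """Fraction of hypothesis tokens (with multiplicity) found in the premise."""
    p, h = bag_of_words(premise), bag_of_words(hypothesis)
    total = sum(h.values())
    if total == 0:
        return 0.0
    shared = sum((p & h).values())  # multiset intersection of the two bags
    return shared / total


def predict_entailment(premise: str, hypothesis: str, threshold: float = 0.7) -> int:
    """Hypothetical rule: 1 (entailment) if coverage reaches the threshold, else 0."""
    return int(hypothesis_coverage(premise, hypothesis) >= threshold)


def f1_score(gold: list, pred: list) -> float:
    """F1 for the positive (entailment) class."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy pairs invented for this sketch (NOT from ASSIN-2): (premise, hypothesis, gold label).
pairs = [
    ("uma mulher está cortando cebola na cozinha", "uma mulher está cortando cebola", 1),
    ("um cachorro corre na praia", "um gato dorme no sofá", 0),
    ("o menino está jogando futebol", "o menino não está jogando futebol", 0),
    ("um homem toca violão", "uma pessoa toca um instrumento", 1),
]

gold = [label for _, _, label in pairs]
pred = [predict_entailment(premise, hypothesis) for premise, hypothesis, _ in pairs]

print("predictions:", pred)
print("F1:", f1_score(gold, pred))
```

On these toy pairs the rule misses the negation in the third pair and the paraphrase in the fourth. Gaps of this kind are what learned representations such as word embeddings or a fine-tuned BERTimbau model are meant to close.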

This repo was created as part of an assignment for the Natural Language Processing course at ICMC-USP.

Repository contents:
data
metricas
notebooks
src
.gitignore
CITATION.cff
PLN_Atividade3.pdf
README.md
escopo_atividade.pdf
requirements.txt

jmssouza/nlp_entailment
