This repo contains the experimental code for the paper: "Joint learning of text alignment and abstractive summarization for long documents via unbalanced optimal transport", published in "Natural Language Engineering" journal.
preprocess_with_sections.py: preprocess the datasets PubMed, arXiv, GovReport, and BillSum.
Folder of uot_summ_pgnet: PG-Net version of UOT_Summ
Folder of uot_summ_bart: BART version of UOT_Summ
More detailed usage descriptions will be added later.