There are currently three preprocessing approaches for the LCSTS dataset.
This repository holds the preprocessing output produced with https://github.com/atulkum/pointer_summarizer:
LCSTS_train_text dataset: https://pan.baidu.com/s/1BmE_CKYu6F2nXexiMaTW8g (extraction code: 8azs)
Full LCSTS dataset: https://pan.baidu.com/s/1L9LDzDgPJNPtDgOwup0jjA (extraction code: z871)
FINISHED_FILES: https://pan.baidu.com/s/1QtyRmETIQiiO-nd_P-QwbQ (extraction code: g20i)
This repository holds the preprocessing output produced with https://github.com/lancopku/Global-Encoding:
(To be continued)
This repository holds the preprocessing output produced with https://github.com/kururuken/BERT-Transformer-for-Summarization:
LCSTS_train_text dataset: https://pan.baidu.com/s/1VZVtweiR837npHezXPLirA (extraction code: c19n)
Processed LCSTS_train_text dataset: https://pan.baidu.com/s/1A4Gl6wTiGLKt_gTvxMBKeQ (extraction code: 103f)