This is the dataset repo for our paper, "A System for Worldwide COVID-19 Information Aggregation".
The reliable-websites-dataset contains websites that provide reliable information about COVID-19, annotated by crowdsourcing workers with reasons to trust them and the topics they cover.
The article-topic-dataset contains COVID-19-related articles with topic labels.
GitHub repo containing the code of each module.
Website: Japanese version, English version
Please cite our paper if you use our dataset or code:
@misc{2008.01523,
Author = {Akiko Aizawa and Frederic Bergeron and Junjie Chen and Fei Cheng and Katsuhiko Hayashi and Kentaro Inui and Hiroyoshi Ito and Daisuke Kawahara and Masaru Kitsuregawa and Hirokazu Kiyomaru and Masaki Kobayashi and Takashi Kodama and Sadao Kurohashi and Qianying Liu and Masaki Matsubara and Yusuke Miyao and Atsuyuki Morishima and Yugo Murawaki and Kazumasa Omura and Haiyue Song and Eiichiro Sumita and Shinji Suzuki and Ribeka Tanaka and Yu Tanaka and Masashi Toyoda and Nobuhiro Ueda and Honai Ueoka and Masao Utiyama and Ying Zhong},
Title = {A System for Worldwide COVID-19 Information Aggregation},
Year = {2020},
Eprint = {arXiv:2008.01523},
}
If you have any questions, please contact song@nlp.ist.i.kyoto-u.ac.jp