annotations_creators | language | language_creators | license | multilinguality | pretty_name | size_categories | source_datasets | tags | task_categories | task_ids | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
PosterErase |
|
|
|
- Dataset Card Creation Guide
- Homepage: https://github.com/alimama-creative/Self-supervised-Text-Erasing
- Repository: https://github.com/shunk031/huggingface-datasets_PosterErase
- Paper (Preprint): https://arxiv.org/abs/2204.12743
- Paper (ACMMM2022): https://dl.acm.org/doi/abs/10.1145/3503161.3547905
[More Information Needed]
The language data in PKU-PosterLayout is in Chinese (BCP-47 zh).
To use PosterErase dataset, you need to download the dataset via Alibaba Cloud. Then place the downloaded files in the following structure and specify its path.
/path/to/datasets
├── erase_1.zip
├── erase_2.zip
├── erase_3.zip
├── erase_4.zip
├── erase_5.zip
└── erase_6.zip
import datasets as ds
dataset = ds.load_dataset(
path="shunk031/PosterErase",
data_dir="/path/to/datasets/",
)
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
You can find the following statement in the license section of the dataset distribution location.
The dataset is distributed under the CC BY-SA 4.0 license.
However, the license setting on that page appears to be set to CC-BY-SA-NC 4.0.
@inproceedings{jiang2022self,
title={Self-supervised text erasing with controllable image synthesis},
author={Jiang, Gangwei and Wang, Shiyao and Ge, Tiezheng and Jiang, Yuning and Wei, Ying and Lian, Defu},
booktitle={Proceedings of the 30th ACM International Conference on Multimedia},
pages={1973--1983},
year={2022}
}
Thanks to alimama-creative for creating this dataset.