Laion art is an 8M-sample subset of laion5B, filtered to aesthetic > 8, pwatermark < 0.8, and punsafe < 0.5.
It is available at https://huggingface.co/datasets/laion/laion-art
A good use case is to train an image generation model.
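The selection criteria above can be read as a simple per-row predicate. The sketch below is illustrative only: the sample rows are made up, the column names `aesthetic`, `pwatermark`, and `punsafe` are taken from the metadata columns saved by the download command, and the helper `is_laion_art_row` is a hypothetical name, not the pipeline LAION actually ran.

```python
# Thresholds taken from the dataset description.
MIN_AESTHETIC = 8.0
MAX_PWATERMARK = 0.8
MAX_PUNSAFE = 0.5


def is_laion_art_row(row: dict) -> bool:
    """Return True if a metadata row passes all three laion-art thresholds."""
    return (
        row["aesthetic"] > MIN_AESTHETIC
        and row["pwatermark"] < MAX_PWATERMARK
        and row["punsafe"] < MAX_PUNSAFE
    )


# Made-up rows for demonstration; real rows come from the laion5B metadata.
sample_rows = [
    {"URL": "https://example.com/a.jpg", "aesthetic": 8.4, "pwatermark": 0.1, "punsafe": 0.01},
    {"URL": "https://example.com/b.jpg", "aesthetic": 7.9, "pwatermark": 0.1, "punsafe": 0.01},
    {"URL": "https://example.com/c.jpg", "aesthetic": 8.9, "pwatermark": 0.9, "punsafe": 0.01},
]

# Keep only the URLs of rows that pass every threshold.
kept = [row["URL"] for row in sample_rows if is_laion_art_row(row)]
print(kept)
```

Only the first row passes: the second fails the aesthetic threshold and the third fails the watermark threshold.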
Download the metadata:
wget https://huggingface.co/datasets/laion/laion-art/resolve/main/laion-art.parquet
Download the images:
img2dataset --url_list laion-art.parquet --input_format "parquet" \
  --url_col "URL" --caption_col "TEXT" --output_format webdataset \
  --output_folder laion-high-resolution --processes_count 16 --thread_count 64 --image_size 384 \
  --resize_only_if_bigger=True --resize_mode="keep_ratio" --skip_reencode=True \
  --save_additional_columns '["similarity","hash","punsafe","pwatermark","aesthetic","LANGUAGE"]' --enable_wandb True
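
The same download can also be driven from Python through img2dataset's `download` function. The sketch below mirrors the flags of the command above as keyword arguments. It assumes the metadata file sits in the working directory as `laion-art.parquet` (the name wget saves it under); the wrapper `run_download` and the constant `DOWNLOAD_KWARGS` are names invented for this example.

```python
# Keyword arguments mirroring the CLI flags of the img2dataset command.
DOWNLOAD_KWARGS = {
    "url_list": "laion-art.parquet",  # assumption: file saved by wget in the working directory
    "input_format": "parquet",
    "url_col": "URL",
    "caption_col": "TEXT",
    "output_format": "webdataset",
    "output_folder": "laion-high-resolution",  # same folder name as in the command
    "processes_count": 16,
    "thread_count": 64,
    "image_size": 384,
    "resize_only_if_bigger": True,
    "resize_mode": "keep_ratio",
    "skip_reencode": True,
    "save_additional_columns": [
        "similarity", "hash", "punsafe", "pwatermark", "aesthetic", "LANGUAGE",
    ],
    "enable_wandb": True,  # requires the wandb package and a configured account
}


def run_download(kwargs: dict = DOWNLOAD_KWARGS) -> None:
    """Start the download; needs img2dataset installed and network access."""
    # Imported lazily so the configuration can be inspected without img2dataset.
    from img2dataset import download

    download(**kwargs)
```

Calling `run_download()` starts the actual download. Keeping the arguments in a dictionary makes it easy to override a single value, for example a smaller `processes_count` on a laptop.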