Impossible to ZIM files from upload.wikimedia.org #101

benoit74 · 2024-11-10T21:47:20Z

Since files are hosted on upload.wikimedia.org, we must comply with their User-Agent policy at https://meta.wikimedia.org/wiki/User-Agent_policy

I suggest we add a CLI option to pass a custom User-Agent to be used when downloading the nautilus files.

rgaudin · 2024-11-11T08:40:47Z

Indeed ; shouldn't scraperlib do this by default for stream_file?

benoit74 · 2024-11-11T09:14:51Z

shouldn't scraperlib do this by default for stream_file?

yes for upload.wikimedia.org ; CLI argument would still help for "less known" websites.

benoit74 added enhancement New feature or request question Further information is requested labels Nov 10, 2024

rgaudin added good first issue Good for newcomers and removed question Further information is requested labels Nov 11, 2024

benoit74 mentioned this issue Nov 22, 2024

Pass proper user-agent openzim/mindtouch#79

Open

Provide feedback