Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Including upstream list as submodule/subtree would be more transparent #13

Open
dvzrv opened this issue Jan 26, 2020 · 3 comments
Open

Comments

@dvzrv
Copy link

dvzrv commented Jan 26, 2020

During packaging of python-publicsuffix2 I realized, that downloading the publicsuffix list during build time makes it unreproducible (any time the package is rebuilt, it will have a different list).

My suggestion would be to include the publicsuffix list from upstream directly as e.g. a git submodule or git subtree (the latter is preferred as this way the files actually end up in an automatically generated tarball on github when tagging a release) and not download it during build time at all to ensure reproducibility raise transparency.
The data lives in this repository already, so it could also be copied manually, but IMHO a subtree or submodule is the more transparent way of dealing with this.

Currently only the wheel on pypi.org is really ensured to carry the currently bundled version of the publicsuffix list. For anyone else building this package, this assumption is not valid.

@dvzrv
Copy link
Author

dvzrv commented Jan 26, 2020

Oops, I just realized, that the list is actually not downloaded during build. Sorry for the noise.

However, including the list via git submodule/subtree would be more transparent nonetheless. Modifying this ticket accordingly!

@dvzrv dvzrv changed the title Downloading the list during build is unreproducible Including upstream list as submodule/subtree would be more transparent Jan 26, 2020
@pombredanne
Copy link
Member

pombredanne commented Jan 27, 2020

@dvzrv Thank you... I have never been a big fan of submodules because of principles that I can no longer articulate and I have forgotten...

Therefore I would be quite fine to have a PR to use submodules instead. You will have to provide some minimal doc to help me update the PSL!

(NB: as you noted the PSL is NOT fetched automatically during a build. You have to issue a python setup.py update_psl to do the update.)

@pombredanne
Copy link
Member

@dvzrv Could I interest you in a PR for the submodule?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants