Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generate Data Quality Report for the editorial team #31

Open
membranepotential opened this issue Jun 10, 2020 · 2 comments
Open

Generate Data Quality Report for the editorial team #31

membranepotential opened this issue Jun 10, 2020 · 2 comments
Assignees
Labels
backend Python stuff idea New feature or request reprioritize

Comments

@membranepotential
Copy link
Collaborator

membranepotential commented Jun 10, 2020

Maybe we can automatically find issues like

  • missing author first names
  • missing years, disciplines
  • All caps in titles
  • Misspelled, rare disciplines
  • Disc:* tags in keywords
  • Excessive amount of special chars in abstract
  • Inconsistent naming, i. e. Robin Carhartt-Harris, Robin L. Carhartt-Harris, Robin Lester Carhartt-Harris
  • ...?

And compile a list to send to the editorial team or maybe fix ourselves 🙀

Lets use this issue to keep track of common things we find and we could discover automatically in the future

@membranepotential membranepotential added the idea New feature or request label Jun 10, 2020
@membranepotential
Copy link
Collaborator Author

Best done in backend

@EloiMarin
Copy link

EloiMarin commented Mar 6, 2021

This is an example where the publication.year is undefined: psychedelic-strategies-alternative-phenomenologies-and-of-in-e039bfc5

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backend Python stuff idea New feature or request reprioritize
Projects
None yet
Development

No branches or pull requests

3 participants