-
Notifications
You must be signed in to change notification settings - Fork 7
Home
Ludwig Hülk edited this page Sep 14, 2018
·
10 revisions
- Write data to the database schema sandbox using the OEP-API (template here)
- Make sure the table name follows the OEP Naming Conventions
- Write the metadata string (examples and templates here)
- Create an issue in this repository with tag review
- Get in contact with OEP reviewers and find responsible person
- If necessary, revise the data and metadata
- Get familiar with the OEP community and become a OEP data reviewer!
- only use lower case
- use underscores
- use ASCII characters only
- no points, no commas
- no spaces
- avoid dates
- name starts with the copyright owner, source, project or model name (e.g. zensus, eGo, oemof)
- main value (e.g. population)
- if separated by [attribute] (e.g. by_gender)
- with resolution [tupel] (e.g. per_mun)
Example: zensus_population_by_gender_per_mun Remember to add new energy-related abbreviations to the Glossary
- Data, metadata and additional material (e.g., documentation, article) has been provided
- User rights are set
- Include metadata string to repo
- Metadata file has header
- Metadata licensed with Public Domain (CC0)
- Authors included
- Metadata follows additional information
- All columns are described in resources/fields
- All languages are listed
- Open Data
- Suitable open license
- All sources included (all attributions correct)
- All links to sources included
- Add appropriate OEP tags
- Primary Key is set
- (PostGIS)-Geometry is in column named geom (vector) or rast (raster)
- Data type is geometry (or raster)
- One of the Geometric Types is defined
- The CRS (SRID) defined is defined as EPSG
- Original data stays with the original CRS
- Prefered CRS of the oedb are
- Spacial Index (GIST) on column geom
- All geometries are valid (ST_IsValid)
To get a basic understanding of CRS, see e.g. QGIS docs.
The database set-up of the OEP is designed to support users in achieving good data quality:
- Plausibility and integration tests are applied to identify mistakes in the data.
- When the number of users and reviewers becomes large enough, user evaluations and ratings on data quality will be implemented.
Further information and guidelines regarding data management and data publication can be found here: Open Knowledge Foundation, Open Data Foundation and Software Carpentry (e.g. here).
The quality of data is indicated by a badge, e.g.
- Bronze
- Silver
- Gold
- Platin
A certain badge implies that defined criteria are fulfilled, including subordinate ones (e.g. datasets holding a gold badge also fulfill criteria of bronze and silver).
- Bronze (must-have)
- Primary key
- Follows naming conventions
- Meta data exist
- ...
- Silver (should-have)
- Meta data exhaustive
- Spatial index defined
- ...
- Gold (good-to-have)
- Plausibility and integrity -> a testing script is provided for verification
- ...
- Platin (best-practice)
- Approved/rated positively by XX users
- ...