Use pip to install this plugin. This example installs it in /vagrant:
source /home/www-data/pyenv/bin/activate
pip install -e git+https://github.com/openresearchdata/ckanext-nesstar.git#egg=ckanext-nesstar --src /vagrant
cd /vagrant/ckanext-nesstar
pip install -r requirements.txt
python setup.py develop
Make sure the ckanext-oaipmh and ckanext-harvest extensions are installed as well.
- add nesstar_harvester to ckan.plugins in development.ini (or production.ini)
- restart your webserver
- with the web browser go to <your ckan url>/harvest/new
- as URL fill in the base URL of an OAI-PMH conforming repository, e.g. http://boris.unibe.ch/cgi/oai2 (for more repositories, see http://www.openarchives.org/Register/BrowseSites)
- select Source type NESSTAR
- if your OAI-PMH needs credentials, add the following to the "Configuration" section:
{"username": "foo", "password": "bar" }
- if you only want to harvest a specific set, add the following to the "Configuration" section:
{"set": "baz"}
- Save
- on the harvest admin click Reharvest
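
Before saving the source, it can help to confirm that the base URL really answers OAI-PMH requests. The following is a minimal sketch, not part of the plugin: `oai_url` is a hypothetical helper that only builds request URLs for the standard OAI-PMH verbs `Identify` and `ListSets`. The `ListSets` response is presumably where a value for the "set" option can be looked up.

```shell
# Base URL taken from the example above; replace it with your repository.
BASE_URL="http://boris.unibe.ch/cgi/oai2"

# oai_url is a hypothetical helper (not provided by ckanext-nesstar).
# It builds an OAI-PMH request URL from a base URL and a verb.
oai_url() {
    printf '%s?verb=%s\n' "$1" "$2"
}

# Print the URLs to open in a browser or pass to an HTTP client.
oai_url "$BASE_URL" Identify
oai_url "$BASE_URL" ListSets

# To query the repository directly (assumes curl is installed):
# curl -s "$(oai_url "$BASE_URL" Identify)"
```

A repository that conforms to OAI-PMH should answer the `Identify` request with an XML document describing itself. If it returns an error page instead, the harvest source URL is probably not the base URL.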
On the command line do this:
- activate the python environment
- cd to the ckan directory, e.g. /usr/lib/ckan/default/src/ckan
- start the consumers (NOTE: only run 1 gather and 1 fetch consumer per server):
paster --plugin=ckanext-nesstar harvester gather_consumer &
paster --plugin=ckanext-nesstar harvester fetch_consumer &
- run the job:
paster --plugin=ckanext-nesstar harvester run
The harvester should now start and import the OAI-PMH metadata.
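
The three paster calls above share the same prefix, so they can be wrapped in a small shell function. This is only a sketch under stated assumptions: `harvester` and the `RUNNER` variable are hypothetical names introduced here, and setting `RUNNER=echo` prints each command instead of running it, which is handy for checking the invocation before starting anything.

```shell
# RUNNER is a hypothetical switch: leave it empty to run paster for real,
# or set it to "echo" for a dry run that only prints the command.
RUNNER="${RUNNER:-}"

# harvester is a hypothetical wrapper around the paster calls shown above.
harvester() {
    $RUNNER paster --plugin=ckanext-nesstar harvester "$@"
}

# Dry run in a subshell: prints the command that "harvester run" would execute.
(RUNNER=echo; harvester run)
```

With the function defined and `RUNNER` left empty, `harvester gather_consumer &` and `harvester fetch_consumer &` start the consumers, and `harvester run` triggers the job, exactly as in the steps above.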
To make development easier, tests are set up. Activate the environment and change to the plugin directory to run them:
. ~/default/bin/activate
cd /vagrant/ckanext-nesstar
In this example the logging filter is used to show only messages from the harvester.