A CKAN extension for the Natural History Museum’s Data Portal.
This extension provides theming and specific functionality for the Natural History Museum’s Data Portal.
The codebase shows how to implement various plugins created by the Museum’s developers; notably our new ElasticSearch datastore with versioned records.
Installation is not going to be useful for anyone not working on the Data Portal, but the steps are left here for reference.
Path variables used below:
$INSTALL_FOLDER
(i.e. where CKAN is installed), e.g.
/usr/lib/ckan/default
$CONFIG_FILE
, e.g. /etc/ckan/default/development.ini
src
folder:cd $INSTALL_FOLDER/src
git clone https://github.com/NaturalHistoryMuseum/ckanext-nhm.git
. $INSTALL_FOLDER/bin/activate
cd $INSTALL_FOLDER/src/ckanext-nhm
pip install -r requirements.txt
cd $INSTALL_FOLDER/src/ckanext-nhm
python setup.py develop
$CONFIG_FILE
:ckan.plugins = ... nhm
These are the options that can be specified in to configure this extension.
Name | Description | Options |
---|---|---|
ckanext.nhm.specimen_resource_id |
ID for the specimens dataset | |
ckanext.nhm.indexlot_resource_id |
ID for the index lots dataset | |
ckanext.nhm.artefact_resource_id |
ID for the artefacts dataset | |
ckanext.nhm.abyssline_resource_id |
ID for the abyssline dataset | |
ckanext.nhm.mam.username |
Username for the Museum’s image server | |
ckanext.nhm.mam.password |
Password for the Museum’s image server |
record_show
Retrieve an individual record.
from ckan.plugins import toolkit
data_dict = {
'resource_id': RESOURCE_ID,
'record_id': RECORD_ID,
'version': OPTIONAL_RECORD_VERSION
}
toolkit.get_action('record_show')(
context,
data_dict
)
download_original_image
Request original image from the Museum’s image server.
from ckan.plugins import toolkit
data_dict = {
'resource_id': RESOURCE_ID,
'record_id': RECORD_ID,
'asset_id': IMAGE_ID,
'email': REQUESTER_EMAIL
}
toolkit.get_action('download_original_image')(
context,
data_dict
)
object_rdf
Get record RDF from its occurrence ID.
from ckan.plugins import toolkit
data_dict = {
'uuid': OCCURRENCE_ID,
'version': OPTIONAL_RECORD_VERSION
}
toolkit.get_action('object_rdf')(
context,
data_dict
)
create-dataset-vocabulary
Ensures the default dataset vocabulary and categories exists.
ckan -c $CONFIG_FILE nhm create-dataset-vocabulary
add-dataset-category
Adds the given category to the dataset category vocabulary.
ckan -c $CONFIG_FILE nhm delete-dataset-category $NAME
delete-dataset-category
Deletes the given dataset category from the vocabulary.
ckan -c $CONFIG_FILE nhm create-dataset-vocabulary $NAME
replace-resource-file
Replaces the file associated with $RESOURCE_ID
with $PATH
, e.g. to
replace a small dummy file with a large one that was too big to upload
initially.
ckan -c $CONFIG_FILE nhm replace-resource-file $RESOURCE_ID $PATH
Test coverage is currently super duper extremely limited.
To run the tests in this extension, there is a Docker compose configuration available in this repository to make it easy.
To run the tests against ckan 2.9.x on Python3:
docker-compose build
docker-compose run ckan
The ckan image uses the Dockerfile in the docker/
folder which is
based on openknowledge/ckan-dev:2.9
.