One click data dump
timrdf edited this page Jan 7, 2013
·
35 revisions
- Vocabulary of Interlinked Datasets (VoID) is an RDF vocabulary to describe sets of RDF graphs -- including where to download them.
- We use the Aggregating subsets of converted datasets pattern to produce the "one click data dump" within csv2rdf4lod-automation.
cr-full-dump.sh gathers all versioned dataset dump files into a single gzipped N-Triples file that contains all RDF data in a csv2rdf4lod node. The URL of the "one-click data download" will be listed in the VoID description of the csv2rdf4lod node (e.g. http://healthdata.tw.rpi.edu/void.ttl). (TODO: cron already creates the file, but it is not yet published or mentioned in the VoID file.)
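The gathering step can be pictured with a minimal Python sketch. It is not the actual cr-full-dump.sh; the function name `build_full_dump`, the directory layout, and the `.nt` / `.nt.gz` suffixes are assumptions made for illustration only.

```python
import gzip
import os


def build_full_dump(root_dir, out_path, suffixes=(".nt", ".nt.gz")):
    """Concatenate every N-Triples dump found under root_dir into one gzipped file.

    Hypothetical helper: the real script may locate dump files differently.
    Returns the sorted list of source files that were merged.
    """
    out_abs = os.path.abspath(out_path)
    sources = []
    for dirpath, _dirnames, filenames in os.walk(root_dir):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip the output file itself if it already sits under root_dir.
            if name.endswith(suffixes) and os.path.abspath(path) != out_abs:
                sources.append(path)
    sources.sort()

    with gzip.open(out_path, "wb") as out:
        for path in sources:
            opener = gzip.open if path.endswith(".gz") else open
            with opener(path, "rb") as src:
                for line in src:
                    # N-Triples is line-based, so make sure a file without a
                    # trailing newline does not fuse with the next file.
                    out.write(line if line.endswith(b"\n") else line + b"\n")
    return sources
```

Because N-Triples has no prefixes or shared header, plain concatenation of valid files yields another valid file, which is what makes a single dump of this kind cheap to build.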
Produced by aggregate-source-rdf.sh:
<http://purl.org/twc/health/void>
void:subset <http://purl.org/twc/health/source/healthdata-tw-rpi-edu/dataset/cr-full-dump/version/latest> .
<http://purl.org/twc/health/source/healthdata-tw-rpi-edu/dataset/cr-full-dump/version/latest>
a void:Dataset;
void:dataDump <SOME_FILE>
.
The cr-full-dump dataset consists mostly of void:inDataset links from every resource in the csv2rdf4lod node to the cr-full-dump dataset itself. It also includes a void:dataDump from the top-level dataset to the dump file that we created (it just reuses the dump file from cr-full-dump).
<cowboy> void:inDataset <http://purl.org/twc/health/source/healthdata-tw-rpi-edu/dataset/cr-full-dump> .
dbpedia:Montana void:inDataset <http://purl.org/twc/health/source/healthdata-tw-rpi-edu/dataset/cr-full-dump> .
...
<http://purl.org/twc/health/void>
void:dataDump <SOME_FILE>
.
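As a rough illustration of how void:inDataset links like those above could be derived, the sketch below emits one link per distinct subject URI found in N-Triples input. It is an assumption-laden simplification rather than the project's method: the function name `in_dataset_triples` is hypothetical, only subjects are considered (not objects), and blank nodes are skipped.

```python
VOID_IN_DATASET = "http://rdfs.org/ns/void#inDataset"


def in_dataset_triples(ntriples_lines, dataset_uri):
    """Yield one void:inDataset triple per distinct subject URI.

    Hypothetical helper: takes an iterable of N-Triples lines (text) and the
    URI of the dataset that every subject should be linked to.
    """
    seen = set()
    for line in ntriples_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        subject = line.split(None, 1)[0]
        # Only URI subjects (<...>) get a link; blank nodes (_:x) are ignored.
        if not subject.startswith("<") or subject in seen:
            continue
        seen.add(subject)
        yield "%s <%s> <%s> ." % (subject, VOID_IN_DATASET, dataset_uri)
```

For example, passing the lines of a dump file and the cr-full-dump dataset URI produces N-Triples lines that could be appended to the dump.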