conversion:links_via

timrdf edited this page Feb 22, 2011 · 57 revisions

See conversion:Enhancement.

Example

diagram showing two datasets lod-linked to the same entity

What datasets mention Maine (results)?

PREFIX owl:        <http://www.w3.org/2002/07/owl#>
PREFIX conversion: <http://purl.org/twc/vocab/conversion/>

SELECT ?g ?maine
WHERE {
  GRAPH ?g {
    ?maine owl:sameAs <http://dbpedia.org/resource/Maine>
  }
  filter(?g != conversion:SameAsDataset)
}
order by ?g ?maine

Introduction

Objects that are promoted to resources (see conversion:range rdfs:Resource) may be linked to external resources. To do so, specify a list of RDF files containing mappings with conversion:links_via and a list of predicates used to query those mappings with conversion:subject_of. Every predicate listed by conversion:subject_of is applied to every file listed by conversion:links_via. For more granular control, use multiple ObjectSameAsEnhancements that list different files and predicates.

Relationship to owl:InverseFunctionalProperty

conversion:links_via cites the lod-link files used to perform locally scoped owl:InverseFunctionalProperty or owl:FunctionalProperty reasoning, which derives an owl:sameAs for the subject or object of a triple created during conversion.
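
The reasoning described above can be sketched in Python. This is a minimal illustration, not the converter's actual implementation: the in-memory `LOD_LINK_FILES` dictionary is a stand-in for parsed lod-link files, and `build_index` and `same_as_targets` are hypothetical helper names. Every predicate in `subject_of` is checked in every file in `links_via`, and any external subject that carries the cell value is treated as an owl:sameAs target.

```python
from collections import defaultdict

DCTERMS_IDENTIFIER = "http://purl.org/dc/terms/identifier"

# Hypothetical stand-in for the triples parsed from two lod-link files.
LOD_LINK_FILES = {
    "state-fips-dbpedia.ttl": [
        ("http://dbpedia.org/resource/Alabama", DCTERMS_IDENTIFIER, "AL"),
        ("http://dbpedia.org/resource/Alabama", DCTERMS_IDENTIFIER, "01"),
    ],
    "state-fips-geonames.ttl": [
        ("http://sws.geonames.org/4829764/", DCTERMS_IDENTIFIER, "AL"),
        ("http://sws.geonames.org/4829764/", DCTERMS_IDENTIFIER, "01"),
    ],
}


def build_index(files, links_via, subject_of):
    """Map each literal value to the external subjects that carry it."""
    index = defaultdict(list)
    for file_name in links_via:              # every cited file ...
        for subject, predicate, value in files[file_name]:
            if predicate in subject_of:      # ... with every listed predicate
                if subject not in index[value]:
                    index[value].append(subject)
    return index


def same_as_targets(index, cell_value):
    """Return the external resources to assert owl:sameAs against."""
    return list(index.get(cell_value, []))


index = build_index(
    LOD_LINK_FILES,
    links_via=["state-fips-dbpedia.ttl", "state-fips-geonames.ttl"],
    subject_of={DCTERMS_IDENTIFIER},
)
print(same_as_targets(index, "01"))
```

A value that appears in none of the cited files yields an empty list, so no owl:sameAs is asserted for it.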

Writing the enhancement parameters

For example, data-gov's dataset 1147:

countyoutflow0708.csv.e1.params.ttl:

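@prefix rdfs:    <http://www.w3.org/2000/01/rdf-schema#> .
@prefix xsd:     <http://www.w3.org/2001/XMLSchema#> .
@prefix dcterms: <http://purl.org/dc/terms/> .
@prefix void:    <http://rdfs.org/ns/void#> .
@prefix ov:      <http://open.vocab.org/terms/> .
@prefix conv:    <http://purl.org/twc/vocab/conversion/> .
@prefix :        <http://logd.tw.rpi.edu/source/data-gov/dataset/1147/params/enhancement/1/> .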
 
:dataset a void:Dataset;
 
   conv:base_uri           "http://logd.tw.rpi.edu"^^xsd:anyURI;
   conv:source_identifier  "data-gov";
   conv:dataset_identifier "1147";
   conv:dataset_version    "2009-Oct-08";
   conv:conversion_process [
      a conv:RawConversionProcess;
      conv:enhancement_identifier "1";
      conv:enhance [
         ov:csvCol           1;
         ov:csvHeader       "State_Code_Origin";
         conv:property_name "state_code_origin";
         conv:range          rdfs:Resource;
         # Promote the object to a typed resource and link it via the lod-link files.
         a conv:TypedResourcePromotionEnhancement;
         conv:range_name    "state";
 
         a conv:ObjectSameAsEnhancement;
         conv:links_via <http://www.rpi.edu/~lebot/lod-links/state-fips-dbpedia.ttl>,
                        <http://www.rpi.edu/~lebot/lod-links/state-fips-geonames.ttl>,
                        <http://www.rpi.edu/~lebot/lod-links/state-fips-govtrack.ttl>;
         conv:subject_of dcterms:identifier;
      ];
   ];
.

The lod-link files state-fips-dbpedia.ttl, state-fips-geonames.ttl, state-fips-govtrack.ttl:

@prefix dcterms: <http://purl.org/dc/terms/> .

# state-fips-dbpedia.ttl:
<http://dbpedia.org/resource/Alabama> dcterms:identifier "AL", "01", "Alabama", "ALABAMA", "alabama" .

# state-fips-geonames.ttl:
<http://sws.geonames.org/4829764/> dcterms:identifier "AL", "01", "Alabama", "ALABAMA", "alabama" .

# state-fips-govtrack.ttl:
<http://www.rdfabout.com/rdf/usgov/geo/us/AL> dcterms:identifier "01", "AL", "Alabama", "ALABAMA", "alabama" .

With these files, the input:

 @prefix ds1147: <http://logd.tw.rpi.edu/source/data-gov/dataset/1147/version/2009-Oct-08/> .
 
 ds1147:thing_1 raw:state_code_origin  "01".

becomes, in enhancement 1 (e1):

@prefix e1: <http://logd.tw.rpi.edu/source/data-gov/dataset/1147/vocab/enhancement/1/> .
 
ds1147:thing_1  e1:state_code_origin <http://logd.tw.rpi.edu/source/data-gov/dataset/1147/type/state/01> .
 
<http://logd.tw.rpi.edu/source/data-gov/dataset/1147/type/state/01>
   rdfs:label "01";
   owl:sameAs <http://dbpedia.org/resource/Alabama>, 
              <http://sws.geonames.org/4829764/>, 
              <http://www.rdfabout.com/rdf/usgov/geo/us/AL>;
.
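
The transformation above can be sketched in Python as follows. This is an illustration under stated assumptions: the URI pattern (base URI, source, dataset, `type`, range name, cell value) is inferred from the example output, and `promoted_uri` and `enhance_cell` are hypothetical names rather than part of the converter.

```python
def promoted_uri(base_uri, source_id, dataset_id, range_name, value):
    """Mint the local URI for a promoted cell value (pattern inferred from the example)."""
    return f"{base_uri}/source/{source_id}/dataset/{dataset_id}/type/{range_name}/{value}"


def enhance_cell(subject, predicate, value, local_uri, targets):
    """Return the triples for one enhanced cell as (s, p, o) tuples."""
    triples = [
        (subject, predicate, local_uri),    # the object is now a resource
        (local_uri, "rdfs:label", value),   # the original literal is kept as a label
    ]
    # One owl:sameAs per external resource found in the lod-link files.
    triples += [(local_uri, "owl:sameAs", target) for target in targets]
    return triples


state = promoted_uri("http://logd.tw.rpi.edu", "data-gov", "1147", "state", "01")
triples = enhance_cell(
    "ds1147:thing_1",
    "e1:state_code_origin",
    "01",
    state,
    targets=[
        "http://dbpedia.org/resource/Alabama",
        "http://sws.geonames.org/4829764/",
        "http://www.rdfabout.com/rdf/usgov/geo/us/AL",
    ],
)
for triple in triples:
    print(triple)
```

The `targets` list is passed in explicitly here; in practice it would come from looking up the cell value in the lod-link files.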

Queries

What predicates point to something that is owl:sameAs something else (results)?

PREFIX owl:        <http://www.w3.org/2002/07/owl#>
PREFIX dcterms:    <http://purl.org/dc/terms/>
PREFIX conversion: <http://purl.org/twc/vocab/conversion/>
PREFIX ov:         <http://open.vocab.org/terms/>
PREFIX e1:         <http://logd.tw.rpi.edu/source/epa-gov-mcmahon-ethan/dataset/environmental-reports/vocab/enhancement/1/>

SELECT distinct ?p
WHERE {
  GRAPH <http://logd.tw.rpi.edu/source/epa-gov-mcmahon-ethan/dataset/environmental-reports/version/2011-Jan-12>  {
    ?s ?p ?t .
    ?t owl:sameAs ?o .
    optional { ?o dcterms:isReferencedBy ?b } filter(!bound(?b))
  }
}

Quality Assurance: For a predicate that we expect to point to a lod-linked resource, which objects did not link (results)?

PREFIX owl:        <http://www.w3.org/2002/07/owl#>
PREFIX dcterms:    <http://purl.org/dc/terms/>
PREFIX conversion: <http://purl.org/twc/vocab/conversion/>
PREFIX ov:         <http://open.vocab.org/terms/>
PREFIX roe:        <http://logd.tw.rpi.edu/source/epa-gov-mcmahon-ethan/dataset/environmental-reports/vocab/>
PREFIX e1:         <http://logd.tw.rpi.edu/source/epa-gov-mcmahon-ethan/dataset/environmental-reports/vocab/enhancement/1/>

SELECT distinct ?taxon ?id
WHERE {
  GRAPH <http://logd.tw.rpi.edu/source/epa-gov-mcmahon-ethan/dataset/environmental-reports/version/2011-Jan-12>  {
    [] roe:epa_web_taxonomy_term ?taxon .
    optional { ?taxon dcterms:identifier ?id }
    optional { ?taxon owl:sameAs ?o } filter(!bound(?o))
  }
}

How many conversion:links_via citations does each dataset's enhancement have?

prefix conversion: <http://purl.org/twc/vocab/conversion/>

select ?dataset ?e (count(*) as ?count)
where {
  graph <http://purl.org/twc/vocab/conversion/ConversionProcess> {
    ?dataset conversion:conversion_process [
      conversion:enhancement_identifier ?e;
      conversion:enhance [
        conversion:links_via ?lod_links
      ]
    ]
  }
}
group by ?dataset ?e
order by ?count

DRAFT

Regarding materializing triples from the row directly to the external owl:sameAs resources:

 :thing_1 e1:state_code_origin <http://dbpedia.org/resource/Alabama> .

On one hand, it would allow loading this RDF into a store alongside DBpedia data and querying across both without having to know that the owl:sameAs links exist. On the other hand, a query for just ds1147:thing_1 e1:state_code_origin ?origin would then return several results instead of the single one that might be expected.
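
A small Python sketch of this trade-off, assuming triples are held as plain tuples with prefixed names. `materialize` and `objects` are hypothetical helpers written for illustration, and adding the external links alongside the local resource (rather than replacing it) is only one possible reading of the draft.

```python
SAME_AS = "owl:sameAs"
LOCAL_STATE = "http://logd.tw.rpi.edu/source/data-gov/dataset/1147/type/state/01"

triples = [
    ("ds1147:thing_1", "e1:state_code_origin", LOCAL_STATE),
    (LOCAL_STATE, SAME_AS, "http://dbpedia.org/resource/Alabama"),
    (LOCAL_STATE, SAME_AS, "http://sws.geonames.org/4829764/"),
    (LOCAL_STATE, SAME_AS, "http://www.rdfabout.com/rdf/usgov/geo/us/AL"),
]


def materialize(graph):
    """Add a copy of each triple for every owl:sameAs target of its object."""
    same_as = {}
    for s, p, o in graph:
        if p == SAME_AS:
            same_as.setdefault(s, []).append(o)
    extra = [
        (s, p, target)
        for s, p, o in graph
        if p != SAME_AS
        for target in same_as.get(o, [])
    ]
    return graph + extra


def objects(graph, subject, predicate):
    """Answer the pattern `subject predicate ?origin`."""
    return [o for s, p, o in graph if s == subject and p == predicate]


before = objects(triples, "ds1147:thing_1", "e1:state_code_origin")
after = objects(materialize(triples), "ds1147:thing_1", "e1:state_code_origin")
print(len(before), len(after))
```

Without materialization the pattern matches only the local resource; with it, the three external resources match as well.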

Other datasets that benefit from this enhancement include Dataset 1330.

SubjectSameAsEnhancement parameter

conv:enhance [
   a conv:SubjectSameAsEnhancement, conv:TypedResourcePromotionEnhancement;
   ov:csvCol           7;
   conv:property_name "state";
   conv:range          rdfs:Resource;
   conv:links_via      <http://url.to/my_mappings.rdf>;
   conv:subject_of      dcterms:identifier;
   # For TypedResourcePromotionEnhancement
   conv:type          "state";
];

As with ObjectSameAsEnhancement, the enhancement can be multi-typed (combined with other enhancement types).

e.g., nuclear reactor 957?
