Date: (Created) 17 February 2012; (Last modified) 19 Feb 2014
Document Status: Not part of any standard (Type 3 document)
Permanent URL: http://code.google.com/p/tdwg-rdf/wiki/Beginners
TDWG Task Group: TDWG RDF/OWL Task Group
Contributors: Steve Baskauf (TDWG RDF/OWL Task Group)
Abstract: This document provides concise information about topics related to RDF and OWL in the context of the biodiversity informatics community. It is intended as an introduction for persons who are not already familiar with RDF and OWL and as a reference for persons who are familiar but would like organized access to additional reference material.
Licensed under Creative Commons Attribution 3.0 License
Table of Contents:
0. Introduction (this page)
2. What is RDF and what is it for?
4. Vocabularies, DCAM, and RDFS
5. Discovery, Transmission, and Storage of RDF data
Contents
The purpose of this guide is to provide concise information about topics related to RDF and OWL which are significant to the work of the TDWG RDF/OWL Task Group (RDF/OWL TG) in particular and to the biodiversity informatics community in general. It is intended as an introduction for persons who are not already familiar with RDF and OWL and as a reference for persons who are familiar but would like organized access to additional reference material. Although it is intended that the information provided be correct, because it is intended as an introduction the amount of technical jargon and formal language has been kept to a minimum. For those who are interested in normative descriptions of the concepts and technologies discussed here, footnotes are provided to hyperlinks to more detailed information.
It should also be noted that in almost every instance, the URIs that are used in the examples are "real" URIs (no fake "example.org" URIs) from the wild which in most cases will actually dereference (at least in the case of HTTP URIs and LSIDs). This is not to imply that there is something special about those URIs or that the examples that they provide are a model of any sort, but rather it is to give the reader an opportunity to actually try out the tools that have been provided for validating, describing, browsing, etc.
The guide is laid out into sections, each of which deals with a major issue relevant to the work of the RDF/OWL TG. This allows users who wish to access the guide as a reference to jump quickly to the point of interest. However, the later sections presuppose knowledge of concepts discussed in earlier sections, so a beginner may have difficulty starting at a later section.
Within each section, subsections are laid out hierarchically and linked in a Table of Contents at the top of the section page. At the end of each section is a numbered reference list which is also linked to the Table of Contents as well as the actual references in the text.
A beginner who wants to gain a basic understanding of RDF and OWL should start with section 1 and work through the guide sequentially. However, it is not intended that this guide be a tutorial capable of enabling a beginner to write RDF or OWL. Consulting additional references will undoubtedly be required.
The mention of any document or tool in this guide does not imply the endorsement of TDWG or the RDF/OWL TG. (The exception to this is ratified TDWG standards which are official recommendations of TDWG.) The referenced items below are simply ones which have been found to be helpful. The lists are not exhaustive; resources may be added when discovered or removed if they cease to be available.
An important general reference is Architecture of the World Wide Web, a W3C Recommendation.
The document Linked Data: Evolving the Web into a Global Data Space provides a general overview of Linked Data issues and implementation strategies.
See also a list of design issues intended to avoid repetition of discussions once resolved.
The Pedantic Web group is concerned with the quality of RDF data and has created a list of Frequently Observed Problems on the Web of Data.
Information Standards Quarterly had a special issue (Vol. 24, No. 2/3, doi:10.3789/isqv24n2-3.2012 devoted to Linked Data in Libraries, Archives, and Museums.
A PowerPoint presentation for an RDF Primer session at TDWG 2013 is here. The background video for the session is here.
RFC 3986. Uniform Resource Identifier (URI): Generic Syntax (section 1.2.2. defining "dereference", "resolution", and "representation")
RFC 3987. Internationalized Resource Identifiers (IRIs)
Compact URI (CURIE) Syntax 1.0 W3C Working Group Note
Cool URIs for the Semantic Web
RFC 2616. Hypertext Transfer Protocol -- HTTP/1.1 (section 1.3 Terminology including definition of "resource" and "representation") (section 12 on Content Negotiation)
Apache mod_rewrite (URL rewriting)
Best Practice Recipes for Publishing RDF Vocabularies (content negotiation and URI dereferencing)
GUID Applicability Statement - Current TDWG Standard (click here for PDF viewable via browser)
A Beginner's Guide to Persistent Identifiers published by the Global Biodiversity Information Facility(GBIF)
Integrated Digital Biocollections (iDigBio) GUID Statement
Life Sciences Identifiers Applicability Statement - Current TDWG Standard (click here for PDF viewable via browser)
Jones et al. 2011. Identifying and relating biological concepts in the Catalogue of Life. Journal of Biomedical Semantics 2011,2:7. doi:10.1186/2041-1480-2-7
To locate DOIs for titles (books, monographs, or whole journal runs) in the Biodiversity Heritage Library (BHL), locate the title through searching, then access the URL
http://www.biodiversitylibrary.org/bibliography/[title_number]
or
http://www.biodiversitylibrary.org/title/[title_number]
where [title_number]
is the BHL-assigned identifying number for the title. Currently, BHL does not assign DOIs to individual articles unless they happen to correspond to a monograph.
DOIs can be exported en masse through BHLs data export at http://biodivlib.wikispaces.com/Data+Exports and are exposed at the BHL API and OpenURL resolver http://biodivlib.wikispaces.com/Developer+Tools+and+API .
Thanks to Rod Page and Chris Freeland for this information.
An example of DOIs used with specimens
RDF Primer (intended for beginners)
RDF/XML Syntax Specification (normative)
Terse RDF Triple Language (Turtle)
JSON-LD JSON-based Serialization for Linked Data
RDF Vocabulary Description Language 1.0: RDF Schema (RDFS)
RSS specification (view source to see RDF)
SPARQL 1.1 Query Language (communication protocol)
W3C Interest Group note: Mapping and linking life science data using RDF (Thanks to Hilmar Lapp for providing the link.)
Early draft of expressing Darwin Core taxonomies in SKOS (Thomas Bandholtz)
Open Annotation Core Data Model (for creating associations between distant pieces of information)
Proposal for linking metadata to the web pages they describe
Information about the draft Shape Expressions language for validating RDF Shape Expressions primer Shape Expressions definition
DCMI Abstract Model (DCAM) specification (Relationship between DCAM and RDFS)
Representing DCAM constructs using the RDF Model
Notes on DCMI specifications for Dublin Core metadata in RDF
Domains and Ranges for DCMI Properties
Darwin Core abstract model (vs. DCAM)
OWL 2 Web Ontology Language Primer (for beginners)
OWL 2 Web Ontology Language Document Overview
OWL 2 Web Ontology Language RDF-Based Semantics
A Practical Guide To Building OWL Ontologies Using Protégé 4 and CO-ODE Tools
Hogan et al. 2011 Scalable OWL 2 Reasoning for Linked Data
Ontobee web server for Ontologies Can be set up with PURL redirection to dereference ontology terms. See this email for further information.
Some rule-based extensions to OWL/SPARQL/RDF
SWRL submission to W3C (stalled?)
SPIN submission to W3C (SPARQL-based)
W3C Rule Interchange Format (RIF) FAQ page
rdfEditor (syntax highlighting, validates and converts)
jEdit (plugin for XML will validate, no RDF plugin)
Protégé ontology editor (guide)
SWOOP tool for creating, editing, and debugging OWL ontologies
Pellet OWL Reasoner (Pellet tutorial)
ELK reasoner (supports the OWL 2 EL profile)
See also http://www.w3.org/RDF/ for a listing of tools relevant to RDF
mx - a Ruby-based platform that consists of a Ruby on Rails application and various supporting gems/libraries developed as a collaborative web-based content management system for biodiversity informatics.
Vapour Linked Data Validator for examining the dereferencing process for a URI
URI Debugger to dereference URIs and visualize the HTTP response of the server
Rod Page's OpenURL Resolver which can be used to proxy some identifiers as HTTP URIs which return RDF Paper: "bioGUID: resolving, discovering, and minting identifiers for biodiversity informatics" doi:10.1186/1471-2105-10-S14-S5
W3C RDF Validation Service (includes graphical display)
rdfabout.com RDF Validator and Converter (converts from XML to N3 and vice versa)
RSS validator (also XML validation)
WonderWeb OWL Ontology Validator
Mindswap OWL Consistency Checker
Live OWL Documentation Environment generates human-readable versions of OWL ontologies (see this documentation
Rod Page's "Status of Biodiversity Services" checker (takes a long time to run so be patient)
OpenLink Data Explorer Linked Data Browser
Semantic Information Mashup (sig.ma) using Sindice's semantic web index
URIBurner test basic SPARQL endpoint
URIBurner enhanced SPARQL endpoint
Audubon Core (AC) draft TDWG standard (draft term list)
Australian Plant Name Index (APNI) vocabulary (in RDF)
Bibliographic Ontology Specification (BIBO) (in RDF/N3)
Content-in-RDF10 (Representing Content in RDF 1.0 vocabulary)
Creative Commons Rights Expression Language (CC REL) (in RDF)
Darwin Core (DwC) TDWG Standard (homepage) (term quick reference guide) (term definitions in RDF) (type vocabulary) (type definitions in RDF)
darwin-sw (DSW) ontology (in RDF)
Dublin Core (DC) terms (in RDF) (type vocabulary) (in RDF)
FOAF Vocabulary Specification 0.98 (in RDF)
Basic Geo WGS84 lat/long (GEO) Vocabulary (in RDF)
GeoSciML Geologic Timescale model Ontology (in RDF)
Library of Congress Authorities and Vocabularies: ISO639 languages, MARC countries and geographic areas (Use view page source: ISO639, MARC geographic areas, etc. in RDF)
Linked Open Data of Ecology (LODE) vocabulary (Ecoinformatics Working Group of the TFRI)
MARC (Library of Congress) Relators list (terms are subPropertyOf dc:contributor and possibly other dc:terms) see also MARC Relator terms and Dublin Core
Natural Collections Descriptions (NCD) draft TDWG Standard (PDF non-normative document viewable in browser) (PDF normative document viewable in browser) (Collection in RDF) (Institution in RDF) (InstitutionType in RDF) (ContactDetails in RDF)
Open Annotation Core Data Model
OBO foundry (http://www.obofoundry.org/ ) ontologies:
-
Basic Formal Ontology (BFO) http://www.ifomis.org/bfo . Includes definition of "process" and "material entity".
-
Biological Collections Ontology (BCO) (browse draft here) (view draft OWL as RDF). Includes definition of "material sample", "sampling process", and "observing process".
-
Common Anatomy Reference Ontology (CARO) (browse here or here) (OWL as RDF). Includes definition of "multicellular organism" and "organism".
-
Environment Ontology (EnvO) http://www.environmentontology.org/ . Includes definition of "biome" and "environmental feature".
-
Gene Ontology (GO) http://www.geneontology.org/ . (browse). Includes definition of "biological process".
-
Information Artifact Ontology (IAO) http://code.google.com/p/information-artifact-ontology/ . Includes definition of "information content entity" and "data item".
-
Ontology for Biomedical Investigations (OBI) http://obi-ontology.org/ (OWL as RDF). Includes definition of "planned process".
-
Plant Ontology (PO) http://www.plantontology.org/ (OWL as RDF)
-
Population and Community Ontology (PCO) http://code.google.com/p/popcomm-ontology/ (http://code.google.com/p/popcomm-ontology/source/browse/trunk/pco.owl view OWL as RDF]). Includes definition of "species", "population", "species", and "community".
PROV (Provenance) Model primer
Publishing Requirements for Industry Standard Metadata (PRISM) (RDF???)
SKOS Simple Knowledge Organization System Namespace Document (in RDF)
Taxon Meta-Ontology TaxMeOn Associated paper: Making species checklists understandable to machines - a shift from relational databases to ontologies http://dx.doi.org/10.1186/2041-1480-5-40
Taxonomic Concept Transfer Schema (TCS) (TDWG Standard) (Users Guide PDF viewable in browser) (XML schema details PDF viewable in browser) (TaxonConcept in RDF) (TaxonName in RDF) (Common in RDF) Journal of Biomedical Semantics 2:7 article on application of LSIDs and TCS at Catalogue of Life
TaxonConcept Ontologies (in RDF)
TDWG Ontology (human readable) (in RDF)
Atlas of Living Australia (ALA) National Species List (NSL) Linked Data Services; uses DC, TCS, APNI (example resource)
Biodiversity Collections Index; uses DC, vCard, and NCD (example resource)
Bioimages; uses DC, DwC, TCS, FOAF, AC, DSW, and others (example resource) (example resource)
DOIs dereferenced as RDF; uses DC, BIBO, PRISM (example resource URI is http://dx.doi.org/10.1525/auk.2009.09022
but RDF can't be viewed in a browser directly)
International Chronostratigraphic Chart (2012) (Example resource as RDF: Middle Jurassic; URI=http://resource.geosciml.org/classifier/ics/ischart/MiddleJurassic)
Linked Open Data of Ecology; uses DwC, FOAF, GEO, LODE (example resource)
TaxonConcept; uses DC, FOAF, BIBO, TaxonConcept, SKOS, and others (example resource)
uBio; uses DC and locally defined predicates (example resource)
Virtual International Authority File (VIAF); URIs for persons (http://viaf.org/viaf/27063124 example resource)
Questions? Comments? Contact Steve Baskauf