Canberra Semantic Web Meetup, 2 August 2010
The talk objective is to encourage the Meetup members to participate and prepare the Sydney Amped Hack Day (October 16 in Sydney: http://ampedweb.org/ ).
1. Photo by JJ Halans
Semantic Mashups for Hack Days
Laurent Lefort, CSIRO ICT Centre, Information Engineering Laboratory
Canberra Semantic Meetup, 2 August 2010
2. Bio
• Over 25 years ago: Computer science
engineer
• Almost 10 years ago: CSIRO ICT Centre,
Canberra
• Over 5 years ago: Semantic Web
researcher
• Ontology design patterns
• 1 year ago: W3C Australia Office
manager & guest programmer for the
W3C Track at Web Directions South
• Armin Haller (Meetup co-organiser) is now
in charge of the Australian Office
• About 1 year ago: GovHack host
• Now: W3C Semantic Sensor Network XG
co-chair
• To be continued …
CSIRO. Semantic Mashups for Hack days
3. Taste le …
I use
1
(Gloze)
I want to use more PURLz
2
Linked Data Api
Callimachus
I don’t use (yet)
CSIRO. Semantic Mashups for Hack days
4. My semantic web journey
1995 2004 2006 2007 2008
Surfing www OWL-ifying EL+-ifying OWL-ifying OWL-ifying
to learn the “mother aircraft data to water vocabs O&M-based stds:
about o. from of all color test reasoners (4-in-one o.) uml, xsd, vocabs
Gio W.? charts” over large o.
1000
Process
IN
WHERE Sampling Sampled Feature
Observation WHAT
100 Action
W
Feature Feature property
Sensor HE Station Water
N Site
(Instrument) Platform Feature
Gauge/weir
10 Fact++ layout/profile Temporal types
Time* Geometrical types
RACER Event Result type
Pellet(Direct) Result
Survey type
1 Coord. Sys
CEL Vertical
Coord. Sys
0.1 Multi-dependent concepts
Chemistry
Observed Taxa
Feature-dep. parameters
Features types
property
Quantities
Feature-indep. parameters
0.01 Abstract concepts
Units
Semi-abstract concepts
Procedure Medium
1
2
5
6
3
2
0
9
4
7
7
Fraction
o1
o1
o6
o5
o2
o2
o3
o4
o2
o5
o2
Semi-primitive concepts
Primitive concepts Metadata ?
?
Institution Security classif.
O&M amd GFM stereotypes and project Processing & ?
Transaction type
Classes w/ ident. instances System interpolation ?
and author Missing data
Simple classes Validation & ?
Onto category to be defined quality flag ?
Time* : two O&M stereotypes (sampling time and result time)
2009 2009 2009 2009 2010
SKOS-ifying Triplifying Semantic Semantic Linking Open
UoM and UoM and Sensor Markup for Standards !
quantities quantities ontology XML-based stds
=
xlink:role C
=
xlink:href myC
CSIRO. Semantic Mashups for Hack days
5. Semantic Web and Web of Data
Linked Open Data
SKOS with data user
Also uses SPARQL
Ontologist
SKOS without data
Will use OWLLink
I want both
I need loose coupling (follow my nose)
I need tight coupling (reasoner-boosted links)
CSIRO. Semantic Mashups for Hack days
6. The Semantic Intensity scale
• RIF: to exchange rules among rule High Intensity
systems (which use a logic which is
different to what OWL uses)
• ---- Ontologies stop here ----
• OWL2: description logic stretched as far
as possible without breaking it: richer
axioms, meta-modelling, additional
2
capabilities to handle data
• OWL 1: triples arranged according to 1
description logic principles
• SPARQL: triples arranged according to
“web-ready schema”
• SKOS: triples structured as thesaurus,
simple taxonomies
• RDF: triples
Low Intensity apps
CSIRO. Semantic Mashups for Hack days
7. Producing good quality ontologies can
help to get good linked data pathways
• An ontology is always an
approximation of the intended
model
• To avoid mismatches between
the intent and the result, it is
better to have:
• domain experts with a broader
knowledge of the domain
• experts with a narrower field of
expertise
• But for experts to work
efficiently together, you’ll need
to solve the linguistic issues
first
Figure by Krzysztof Janowicz, “invited expert”, SSN XG
CSIRO. Semantic Mashups for Hack days
8. Solving linguistic issues
(W3C Semantic Sensor Network XG example)
• VIM_terms
• Measurement (VIM 2.1): process of experimentally obtaining one or more
quantity values that can reasonably be attributed to a quantity
• Measurement (MINET): Process of associating numbers, in an empirical
and objective way, to characteristics of objects and events of the real world
in a way so as to describe them
• SWE_terms
• Measurement (noun) (SensorML): an observation whose result is a
measure
• Measurement (verb) (SensorML): An instance of a procedure to estimate
the value of a natural phenomenon, typically involving an instrument or
sensor. This is implemented as a dynamic feature type, which has a
property containing the result of the measurement. The measurement
feature also has a location, time, and reference to the method used to
determine the value. A measurement feature effectively binds a value to a
location and to a method or instrument.
• Wikipedia_terms
• Measurement (Wikipedia) In science, measurement is the process of
obtaining the magnitude of a quantity, such as length or mass, relative to a
unit of measurement, such as a meter or a kilogram. The term can also be
used to refer to the result obtained after performing the process.
• Tip: check if Wikipedia has a disambiguation list
CSIRO. Semantic Mashups for Hack days
9. OWL in Protege
Protégé is coupled to a
description logic reasoner
which can infer
subsumption relationships
CSIRO. Semantic Mashups for Hack days
10. SKOS example in SESAME (with data)
RDF data can
be queried by
others using
a REST-
based service
(what is know
as a SPARQL
endpoint)
CSIRO. Semantic Mashups for Hack days
11. OWL with data in Protégé
OWL skeleton
populated with
data (here
SysML-QUDV)
Must have: a def. and a citation
CSIRO. Semantic Mashups for Hack days
12. Semantic Mashup options across the 3 main families
of W3C standards: XML, Semantic Web and HTML
XML + HTML HTML + content
(e.g. KML) from SPARQL
endpoints: OWL,
SKOS, RDF
HTML + RDFa
HTML + RDFa + content
sourced from annotated
XML, SPARQL endpoints
or even other RDFa pages
“Lifted” XML
XML annotated with
OWL, SKOS, RDF
CSIRO. Semantic Mashups for Hack days Semantic Web resources
13. Linking Open Standards
• Watch for the transition from XML-based standards to either
RDF-only or hybrid XML-RDF standards
• Watch for the transition from HTML-based standards to either
hybrid XML-RDF-HTML standards
• Standard Development Organisations should work together to
agree on common approaches to:
• Develop and apply a core set of principles and methods to mix
specifications snippets from XML, RDF and HTML
• Including extensible APIs like the SPARQL and RDFa
• Allow tight and loose couplings
• between ontologies
• between ontologies and linked open data resources
• between linked open data resources
• Split ontologies into manageable modules: the issue here is that
the definition of a module can vary a lot …
CSIRO. Semantic Mashups for Hack days
14. For the semantic hack fest, we have the Open
Data, but it’s not linked (not avail. as SPARQL)!
CSIRO. Semantic Mashups for Hack days
15. LOD constellation: 13.1 billions triples and
142 millions RDF links (late 2009)
CSIRO. Semantic Mashups for Hack days
16. Adding Australian content to the LOD constellation
Geo/Gov, People/communities, Science
Doap-
space Flickr
TO DO: exporter
Australian clouds
Crunch FOAF
base Profiles SIOC Revyu
Sites Open-
Guides
flickr
? DBLP
RKBEx
Geo- Project
Pub
Guide names Euro- Guten-
berg
wrappr
? Virtuoso
Sponger
plorer
stat
eprints
Open
Calais
?
Linked RKBECS
riese World
MDB South-
Fact- ampton IEEE
book
Linked RDF Book
GeoData DBpedia Mashup
US
Census lingvoj Freebase
W3C DBLP CiteSeer
Data Word Hann
Net over
Gov- DBLP
UMBEL
Track Berlin
Open
Cyc Yago
Also to consider …
TO DO: Australian links all the cultural clouds (e.g. Music)
All the health & life sciences clouds
CSIRO. Semantic Mashups for Hack days
17. Class-to-Class link pathways in the UMBEL LOD
Constellation (Oct 2008)
damltime -- Time Zone o.
doap -- Description of a
Project o.
foaf -- Friend-of-a-Friend o.
geo -- Geo wgs84 o.
geonames -- GeoNames o.
opencyc -- OpenCyc
knowledge base
sioc -- Socially Interlinked
Online Communities o.
sioc_types -- SIOC extension
umbel -- Upper Mapping and
Binding Exchange Layer o.
wordnet -- WordNet lexical o.
CSIRO. Semantic Mashups for Hack days
18. Why a semantic hack fest?
To ease the transition from Web 2.0 to Web 3.0
• Mashups developed for govhack, apps4nsw, app-my-state VIC
have used semantic web technos very rarely
• Report of the Gov 2.0 Taskforce
• […] Information should be: free, easily discoverable, based on
open standards and therefore machine-readable, properly
documented and therefore understandable, licensed to permit free
reuse and transformation by others
• […] governments have a role to play in leading and encouraging
the uptake of Web 3.0 technologies in support of greater
innovation based on the reuse of public sector information and
enhanced citizen/government interaction.
• e.g. RDFa usage in data.gov.au
• Having a semantic hack fest can help:
• To demonstrate benefits on concrete examples: hack days are a
great way to get awesome results
• To accelerate adoption at a larger scale by industry
CSIRO. Semantic Mashups for Hack days
19. The Amped Hack day
• Two free events organised by Web Directions South at the end
of the Web Directions Conference
• Atlanta September 25th
• Sydney October 16th
• Sydney event
• Hosted by the PowerHouse Museum
• Not yet fully booked: http://ampedweb.org/
• John Allsopp “likes the idea”
• Semantic Geeks! I love it. I think this is a great addition.
CSIRO. Semantic Mashups for Hack days
20. Semantic Mashup for Hack days: What’s required
• Ontologies which could add value to mashups: e.g. my colour
ontology for images,, …
• Australian content to play with: triplified (linkable) datasets for
different domains: .gov (and .com, .org), geo + …, science,
libraries/museums
• Lots of tools with hackable examples
• More publicity:
• Semantic hackfest talk scheduled in the W3C Australia track at the
Web Directions South Conference (just before Amped).
• Public announcement elsewhere: SW Meetup, THATCamp,
others?
• A workspace or two: one to prepare and one for the hack fest
participants
• Helpers (before the hack day) and mentors (during the event)
for all of the above
CSIRO. Semantic Mashups for Hack days
21. Ideas for the hack day
• 1st idea. Mix:
• Paul Hagon Searching by Colour Metadata Australia 2010
• My colour ontology (need some rework)
• If we put the two approaches together, we should learn if:
• We can build a better colour ontology
• We can build better links between colours and tags or topics
• We can demonstrate that ontologies are really adding value
• 2nd idea. Mix:
• Govhack’s mashups based on data about organisations
(companies, government agencies)
• The organisation ontology published by Epimorphics
CSIRO. Semantic Mashups for Hack days
22. Paul’s idea: correlate the flickr tags and the colours of
images selected with image magick
Paul Hagon Searching by Colour
Metadata Australia 2010
CSIRO. Semantic Mashups for Hack days
23. Bonus we can have a colour ontology which provides
the true tags
Color Quale
Spatial Region
• HTML tables
• 267 centroids
• 1000’s of individual
colours Munsell http://tx4.us/
Hue
NBS ISCC
centroids
Individual
colors
CSIRO. Semantic Mashups for Hack days
24. It is also citeable: the original references are provided
Kelly and Judd (1976)
Color Universal Language and Dictionary of Names.
National Bureau of Standards special publication 440
Maerz and Paul, Dictionary of Color
Plochere Color System
Ridgway, Color Standards and Color Nomenclature
Taylor, Knoche & Granville, Descriptive Color Names Dictionary
Textile Color Card Association Standard Color Card of America and U.S. Army Color Card
American Association of Textile Chemists and Colorists and Society of Dyers and Colourists
Colour Terms in Biology, H.A. Dade
Federal Specification TT-C-595, Colors; (for) Ready-Mixed Paints
Horticultural Colour Charts, R. F. Wilson
Commercial Standard CS147-47, Colors for Molded Urea Plastics
Commercial Standard CS156-49, Colors for Polystyrene Plastics
National Research Council, Rock-Color Chart
Postage-Stamp Color Names, William H. Beck
U.S. Department of Agriculture Soil Charts
House and Garden magazine, in Supplementary Index of Color names (Dictionary of Color)
CSIRO. Semantic Mashups for Hack days
25. Random example
Amber Yellow
http://tx4.us/nbs/nbs-a.htm
CSIRO. Semantic Mashups for Hack days
26. Another example: organizations (.gov, .com and .org.)
• Some effort required to fix the missing links: ontology then data
• Company codes: ANZSIC vs. UNSPCC
• Government codes: AGIFT thesaurus vs. UK’s Government
Category List
• Epimorphics (UK) and W3C eGov IG working on this topic
• Organization ontology developed to be LOD-ready
CSIRO. Semantic Mashups for Hack days
27. Linked Open Data (LOD) opportunities
• W3C
• eGov IG or incubators
• local initiative (W3C Australia to help – status to be defined)
• W3C track speakers (Web Directions South)
• Linked Open Data Around the Clock project (EU)
• Contact: Michael Hausenblas (DERI)
• Semantic Web Atlanta already involved (advisory committee)
• Amped hack day
• Opportunity to engage with front end developers
• Opportunity to liaise with other SW Meetups (Sydney in Australia,
Atlanta and maybe Austin in the US)
CSIRO. Semantic Mashups for Hack days
28. CSIRO ICT Centre
Laurent Lefort @laurentlefort
Ontologist, Information Engineering Lab
W3C Australia Office @W3CAustralia
Photo: Web Directions
CC BY-NC 2.0
CSIRO. Semantic Mashups for Hack days