Taxonomies are well known means to create knowledge models. With the rise of Linked data, more expressive ways to create knowledge graphs can be introduced on a cost-efficient basis. Thesauri or ontologies help to create applications like Dynamic Semantic Publishing (DSP) or knowledge bases in enterprises.
1. Introduction to the Semantic Web
From Taxonomies to Linked Open Data
Andreas Blumauer
Semantic Web Company
www.semantic-web.at
www.poolparty.biz
2. Welcome!
• CEO and managing partner of
Semantic Web Company
• Master's degree in business
informatics (University of
Vienna)
• Experienced consultant in the
areas of knowledge
management systems, linked
data, data analytics and
semantic technologies
• Product management of
PoolParty Semantic Suite
• Lecturer at several Universities
of Applied Sciences and at
Danube University Krems
• Editor of one the first
comprehensive books in the
area of the semantic web for
German speaking community
Semantic Web Knowledge
Management Text Analytics
Data Analytics Recommender Systems
Semantic Search Taxonomies
Thesaurus Management SKOS
Enterprise Data Integration Linked
Data Social Tagging Business
Intelligence Confluence Semantic
Sharepoint
3. What is the Semantic Web?
What is Linked Data?
Which knowledge models are used
on the Semantic Web?
Linked Data based Text Mining
PoolParty Semantic Suite
Examples & Demos
Agenda
4. Uniform Resource Identifiers
1. Give things / objects / resources a URI,
not only webpages!
http://www.bbc.co.uk/nature/species/Red_Squirrel
5. Semantic relations (‘triples’)
Sciurus vulgaris
Red squirrel
http://www.bbc.co.uk/nature/life/Sciuridae
2. Assign names to the resources and link resources to
others in a meaningful way!
http://www.bbc.co.uk/natu
re/habitats/Taiga
is a
lives in has label
other label
6. Establish links across data silos
3. Reuse existing information from all over the places,
internally and externally!
7. Resource Description Framework (RDF)
4. Use Semantic Web Standards to describe your
resources in a machine-readable format!
8. Mashups: Semantic Dynamic Publishing
5. Reuse assets and save production costs while
providing highly dynamic content
9. Semantic ‘Search’
6. Understand the user’s query on a conceptual level!
sciurus vulgaris search
‘sciurus vulgaris’ OR ‘red suirrel’
Show stories from the East of
England dealing with
mammals which also live in
the Taiga
go
10. Content recommendation
7. Provide users with recommendations from a
bird’s eye perspective!
Animals with same
• behaviour
• habitats
• family etc.
‘Similar’ news containing
• related places
• related animals etc.
• related videos (YouTube)
• related images (Flickr)
11. So, what exactly is ‘Linked Data’?
• ‘Linked Data’ is based on the
Semantic Web standards by
the World Wide Web
Consortium (W3C)
• Linked Data is a way to link &
integrate data based on web
technologies
• Linked Data IS NOT Open Data
automatically
• Linked Data can be used on
the web AND on the intranet
• The Linked Open Data Cloud
(LOD) consists of over 62 bio.
Facts/triples
(http://stats.lod2.eu/)
• Linked Data is an emerging
technology for enterprise
information integration
Data is no longer locked up in databases and its
non-intuitive schemas.
To access linked data, simply
use the web as your platform.
Graph-based models and SPARQL are in many
cases much more powerful than the relational
models and SQL.
12. What is the Semantic Web?
What is Linked Data?
Which knowledge models are used
on the Semantic Web?
Linked Data based Text Mining
PoolParty Semantic Suite
Examples & Demos
Agenda
17. Some advanced features of SPARQL
Shortest path between two nodes in a graph
Federated queries
Transitivities
Can also be used to query relational databases
See: RDB to RDF Mapping Language
18. What is the Semantic Web?
What is Linked Data?
Which knowledge models are used
on the Semantic Web?
Linked Data based Text Mining
PoolParty Semantic Suite
Examples & Demos
Agenda
19. Entity extraction based on Linked Data
http://www.bbc.co.uk/nature/
life/Red_Squirrel
Bushy-tailed squirrels
Squirrels
is a
is a
http://sws.geonames.org/2641455
England
United Kingdom
part of
part of
- population: 832500
- long: 1
- lat: 52.66667
21. What is the Semantic Web?
What is Linked Data?
Which knowledge models are used
on the Semantic Web?
Linked Data based Text Mining
PoolParty Semantic Suite
Examples & Demos
Agenda
25. Thesaurus Server & Extractor
1. Create Thesaurus
2. Get documents annotated
3. Receive RDF for further processing
26. PoolParty API
All RESTful services based on standards:
Future-proof application development
Thesaurus as a Service:
Query thesauri & embed them into your apps
Query thesauri + annotated documents:
Semantic Search
Text Mining as a Service:
All documents converted in RDF based on controlled
vocabularies
SPARQL-endpoints for all knowledge graphs
27. What is the Semantic Web?
What is Linked Data?
Which knowledge models are used
on the Semantic Web?
Linked Data based Text Mining
PoolParty Semantic Suite
Examples & Demos
Agenda
35. Classification based on Wikipedia
Nanotechnology researchers use graphene in
nanoelectronic chip breakthrough
(Nanowerk News) Electronic components built from
single molecules using chemical synthesis could pave
the way for smaller, faster and more green and
sustainable electronic devices. Now for the first time, a
transistor made from just one molecular monolayer has
been made to work where it really counts. On a
computer chip.
The molecular integrated circuit was created by a
group of chemists and physicists from the Department
of Chemistry Nano-Science Center at the University of
Copenhagen and Chinese Academy of Sciences,
Beijing. Their discovery has just been published online in
Advanced Materials ("Ultrathin Reduced Graphene
Oxide Films as Transparent Top-Contacts for Light
Switchable Solid-State Molecular Junctions"). The
breakthrough was made possible through an
innovative use of the two dimensional carbon material
graphene.
Kasper Nørgaard is an associate professor in chemistry
at the University of Copenhagen. He believes that the
first advantage of the newly developed graphene
chip will be to ease the testing of coming molecular
electronic components. But he is also confident, that it
represents a first step towards proper integrated
molecular circuits.
Read more:
http://www.nanowerk.com/news2/newsid=30975.php
#ixzz2WqZ2KyHW
36. PoolParty Enterprise Solutions
- Automatic content classification
based on linked open data sources like DBpedia and Geonames
- Targeted content distribution inside the intranet
support knowledge workers; provide semantic search
- Enterprise vocabulary management
understand the context, enrich content with definitions etc.
- Semantic knowledge bases
Use PoolParty as a linked data management system and
provide knowledge graphs in enterprise systems like Sharepoint
Several integrations with Sharepoint, Confluence, Drupal, Oracle Text, and
WordPress were implemented.