SlideShare ist ein Scribd-Unternehmen logo
1 von 95
The Semantic Web LandscapeA Practical Introduction Lee Feigenbaum VP Technology & Standards, Cambridge Semantics Co-chair, W3C SPARQL Working Group For CSHALS 2010 Tutorial Attendees February 24, 2010
The W3C HCLS interest group set out to use Semantic Web technologies to receive precise answers to a complex question: A Motivating Example: Drug Discovery Find me genes involved in signal transduction that are related to pyramidal neurons.
General search 223,000 hits, 0 results
Domain-limited search 2,580 potential results
Specific databases Too many silos!
A Semantic Web Approach Integrate disparate databases… MeSH PubMed Entrez Gene Gene Ontology …
A Semantic Web Approach (cont’d) …so that onequery…
A Semantic Web Approach (cont’d) …(trivially) spans several databases…
A Semantic Web Approach (cont’d) …to deliver targeted results…
Agreement on common terms and relationships Incremental, flexible data structure Good-enough modeling Query interface tailored to the data model What’s the trick?
What is the semantic web?
Names
Semantic Web Web of Data Giant Global Graph Data Web Web 3.0 Linked Data Web Semantic Data Web Branding
“The Semantic Web” a.k.a “Linked Open Data” Augments the World Wide Web Represents the Web’s information in a machine-readable fashion Enables… …targeted search …data browsing …automated agents What is it & why do we care? (1) World Wide Web : Web pages  ::  The Semantic Web : Data
“Semantic Web technologies” A family of technology standards that ‘play nice together’, including: Flexible data model Expressive ontology language Distributed query language Drive Web sites, enterprise applications What is it & why do we care? (2) The technologies enable us to build applications and solutions that were not possible, practical, or feasible traditionally.
A common set of technologies: ...enables diverse uses ...encourages interoperability A coherent set of technologies: …encourage incremental application …provide a substantial base for innovation A standard set of technologies: ...reduces proprietary vendor lock-in ...encourages many choices for tool sets A Common & Coherent Set of Technology Standards
The (In)Famous Layer Cake
Semantic Web Technology Timeline 2001 2004 2008 2010 2007 1999 RIF HCLS
As technologies & tools have evolved, Semantic Web advocates have progressed through stages: 2010: Where we are
2010: Where we’re not Image from Trey Ideker via Enoch Huang Semantic Web technologies are not a ‘magic crank’ for discovering new drugs (or solving other problems, for that matter)!
2010: Where we’re not (cont’d) XML vs. RDF? “Ontology” vs. “ontology”? Data integration vs. reasoning vs. KBs vs. search vs. app. development vs. … Semantic Web vs. Linked Data? The Semantic Web still suffers from confusing and conflicting messaging, each of which asserts it’s “correct”.
2010: Where we’re not (cont’d) People with appropriate skill sets for designing & building Semantic Web solutions are not widely available.
2010: Where we’re not (cont’d) We don’t yet have standard solutions for privacy, trust, probability, and other elements of the Semantic Web vision.
What do Semantic Web solutions look like?
RDF is… Resource Description Framework
RDF is… The data model of the Semantic Web.
RDF is… A schema-less data model that features unambiguous identifiers and named relations between pairs of resources.
RDF is… A labeled, directed graph of relations between resources and literal values. RDF graphs are collections of triples Triples are made up of a subject, a predicate, and an object Resources and relationships are named with URIs predicate subject object
“Lee Feigenbaum works for Cambridge Semantics” “Lee Feigenbaum was born in 1978” “Cambridge Semantics is headquartered in Massachusetts” Example RDF triples works  for born in headquartered Lee Feigenbaum Cambridge Semantics Lee Feigenbaum Cambridge Semantics 1978 Massachusetts
Triples connect to form graphs headquartered lives in Massachusetts born in capital works  for Lee Feigenbaum Cambridge Semantics Boston 1978
The graph data structure makes merging datawith shared identifiers trivial Triples act as a least common denominatorfor expressing data URIs for naming remove ambiguity …the same identifier means the same thing Why RDF? What’s different here?
Why RDF? Incremental Integration RelationalDatabase RDF
RDF is the model, for which there are several concrete syntaxes: RDF/XML – standard, complex XML syntax Turtle – common, textual, triples-oriented syntax N3 – more expressive superset of Turtle N-Triples – textual, line-oriented, useful for streaming What does RDF look like? When writing RDF by hand and in many guides, examples, and discussions these days, you’ll see Turtle most often.
Write a triple by writing its parts separated by spaces (subject predicate object) A Bit of Turtle @prefix ex: <http://example.org/myvocab/> . @prefix geo: <http://geonames.example/> . ex:LeeFeigenbaumex:employerex:CambridgeSemantics . ex:LeeFeigenbaumex:birthYear    1978 . ex:CambridgeSemanticsex:headquartersgeo:BostonMA . geo:BostonMAex:population   574000 .
SPARQL is… SPARQL Protocol And RDF Query Language
SPARQL is… The query language of the Semantic Web.
SPARQL is… A SQL-like language for querying sets of RDF graphs.
SPARQL is… A simple protocol for issuing queries and receiving results over HTTP. So… Every SPARQL client works with every SPARQL server!
SPARQL lets us: Pull information from structured and semi-structured data. Explore data by discovering unknown relationships. Query and search an integrated view of disparate data sources. Glue separate software applications together by transforming data from one vocabulary to another. Why SPARQL?
Dealer 1 Dealer 2 Dealer 3 Employee Directory ERP / Budget System Web EPA Fuel Efficiency Spreadsheet SPARQL Query Engine What automobiles  get more than 25 miles per gallon, fit within my department’s budget, and can be purchased at a dealer located within 10 miles of one of my employees? SELECT ?automobile WHERE {   ?automobile a ex:Car ; epa:mpg ?mpg ; ex:dealer ?dealer .    ?employee a ex:Employee ; geo:loc ?loc .    ?dealer geo:loc ?dealerloc .    FILTER(?mpg > 25 &&  geo:dist(?loc,  ?dealerloc) <= 10) . } Web dashboard SPARQL query
bio2rdf.org – querying life sciences data
bio2rdf.org – querying life sciences data
3 pieces of the Semantic Web technology stack are about describing a domain well enough to capture (some of) the meaning of resources and relationships in the domain RDF Schema OWL RIF From the explicit to the inferred Apply knowledge to data to get more data.
RDFS is… RDF Schema
Elements of: Vocabulary (defining terms) I define a relationship called “prescribed dose.” Schema (defining types) “prescribed dose” relates “treatments” to “dosages” (my prescribed dose is 2mg; therefore 2mg is a dosage) Taxonomy (defining hierarchies) Any “doctor” is a “medical professional” (therefore Dr. Brown is a medical professional) RDF Schema is…
WOL OWL is… Web Ontology Language
Elements of ontology Same/different identity “author” and “auteur” are the same relation two resources with the same “ISBN” are the same “book” More expressive type definitions A “cycle” is a “vehicle” with at least one “wheel” A “bicycle” is a “cycle” with exactly two “wheels” More expressive relation definitions “sibling” is a symmetric predicate the value of the “favorite dwarf” relation must be one of “happy”, “sleepy”, “sneezy”, “grumpy”,  “dopey”, “bashful”, “doc” OWL is…
A class is a (named) collection of things with similar attributes OWL: Rich Class Definitions
A class is a (named) collection of things with similar attributes OWL: Rich Class Definitions
A class is a (named) collection of things with similar attributes OWL: Rich Class Definitions
OWL: Rich Class Definitions
RIF is… Rules Interchange Format
Standard representation for exchanging sets of logical and business rules Logical rules A buyer buys an item from a seller if the seller sells the item to the buyer A customer becomes a "Gold" customer as soon as his cumulative purchases during the current year top $5000 Production rules Customers that become "Gold" customers must be notified immediately, and a golden customer card will be printed and sent to them within one week For shopping carts worth more than $1000, "Gold" customers receive an additional discount of 10% of the total amount RIF is…
Fantasy Land Architecture Ontology / Schema + Custom UI Custom UI Custom UI Custom UI Custom UI Custom UI
Reality Internet DB2 XML LDAP Directory Oracle RDB Custom UI Custom UI Custom UI Custom UI Custom UI Custom UI
GRDDL is… Gleaning Resource Descriptions from Dialects of Language
GRDDL is… A method for authoritatively getting RDF data from XML and XHTML documents.
GRDDL is… A mechanism for authoritatively deriving RDF data from families of XML and XHTML documents.
RDB2RDF is… Relational Database toRDF
RDB2RDF is… A W3C Working Group to define a standard way to map from relational databases to RDF (and SPARQL).
A simple set of 4 guidelines for publishing RDF data on the Web (over HTTP) Developed by Tim Berners-Lee in 2006 Use URIs as names for things ,[object Object],Use HTTP URIs  ,[object Object],When someone looks up a URI, provide useful information ,[object Object],Include links to other URIs ,[object Object],Linked Data is…
The LOD “cloud”, March 2009
Application specific portions of the cloud ,[object Object]
some by the W3C “Linking Open Drug Data” task force,[object Object]
RDFa is… A collection of HTML attributes that allow RDF to be embedded directly in Web pages.
Don’t Repeat Yourself (DRY) In-context metadata (copy & paste) Authoritative (no screen scrapig) Why RDFa?
RDFa in action
Semantic Web landscape today
Semantic Web Tools In 2010, there are a wide variety of open-source and commercial Semantic Web tools available.
Triple stores Built on relational database Native RDF store Development libraries Full-featured application servers Types of RDF Tools Most RDF tools contain some elements of each of these.
Community-maintained lists http://esw.w3.org/topic/SemanticWebTools Emphasis on large triple stores http://esw.w3.org/topic/LargeTripleStores Michael Bergman’s Sweet Tools searchable list: http://www.mkbergman.com/?page_id=325 Finding RDF Tools
Query engines Things that can run queries Most RDF stores provide a SPARQL engine Query rewriters E.g. to query relational databases (more later) Endpoints Things that accept queries on the Web and return results Client libraries Things that make it easy to ask queries Types of SPARQL Tools
Community-maintained list of query engines http://esw.w3.org/topic/SparqlImplementations Publicly accessible SPARQL endpoints http://esw.w3.org/topic/SparqlEndpoints Michael Bergman’s Sweet Tools searchable list: http://www.mkbergman.com/?page_id=325 Finding SPARQL Tools
Editors/environments Oiled, Protégé, Swoop, TopBraid, Ontotrack, … Developing Tools and Infrastructure
Editors/environments Oiled, Protégé, Swoop, TopBraid, Ontotrack, … Reasoning systems Cerebra, FaCT++, Kaon2, Pellet, Racer, CEL, … Developing Tools and Infrastructure Pellet KAON2 CEL
Visualizing and Publishing Vocabularies
Reusable, public ontologies FOAF The Event Ontology Measurement Units Ontology
Community-maintained list: http://esw.w3.org/topic/GrddlImplementations GRDDL tools Most GRDDL tools are adapters to existing RDF stores or SPARQL engines to allow loading or querying data from XML and XHTML sources.
What about… everything else? Standards don’t yet exist, but many tools exist to derive RDF and/or run SPARQL queries against other sources of data.
LDAP Directories Squirrel RDF http://jena.sourceforge.net/SquirrelRDF/
Excel spreadsheets Anzo for Excel http://www.cambridgesemantics.com/products/anzo_for_excel
Web-based data sources Virtuoso Sponger Cartridges http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtSponger
Unstructured Text Calais http://www.opencalais.com/
Unstructured Text Zemanta Web Service http://developer.zemanta.com/
On the Web Google, Yahoo! Best Buy NY Times US Government UK Government Where is it being used?
Industries Oil & Gas (integration, classification) Finance (structured data, ontologies, XBRL) Publishing (metadata) Government (structured data, metadata, classification) Libraries & museums (metadata, classification) IT (rapid application development & evolution) Where is it being used?
Health Care Cleveland Clinic Clinical research Data integration, classification (= better search) UT School of Health Public health surveillance SAPPHIRE—classification, ontology-driven development Various Clinical Decision Support Agile, rule-driven, scalable in the face of change Where is it being used?
Life Sciences Agile knowledgebases at Pfizer Target assessment at Eli Lilly Integrated information links at Novartis Astra Zeneca, J&J, UCB, …	 Where is it being used? CSHALS chronicles many of these uses and many more.
Take-away Advice
These are horizontal, enabling technologies. But they apply particularly well to problems with these characteristics: Heterogeneous data from multiple sources Increasing reliance on connections within this data Rapidly changing information needs Significant early-mover advantage Large amounts of data that would benefit from classification Why are Semantic Web technologies appropriate for the life sciences? Many tactical and strategic challenges in the life sciences industry feature these traits.
Getting Started with Semantic Web technologies  Don’t boil the ocean.
Getting Started with Semantic Web technologies  Goal: quick tactical wins on the path to large strategic value Be sure to consider the operational ramifications Who does what differently? Ideal Semantic Web projects/applications have an incremental path towards broad deployment that generates demonstrable value along the way
Look beyond the core Semantic Web capabilities and consider: integration with existing enterprise systems development & extension models deployment, logging, maintenance, backup tooling user experience Choose practical, enterprise-ready tools If you choose to build new components and assemble existing components together, it’s quite likely you’ll end up reinventing the wheel.
What level of expertise is necessary? Technologies only? Technologies + API? Technologies + tooling? Tooling only? … How will we acquire the expertise? In-house (and if so, how?) Vendor services 3rd-party services Open-source community Plan for Acquiring Expertise
I’m always happy to field questions & engage in discussion: lee@cambridgesemantics.com Thanks & Discussion

Weitere ähnliche Inhalte

Was ist angesagt?

Analytics on Big Knowledge Graphs Deliver Entity Awareness and Help Data Linking
Analytics on Big Knowledge Graphs Deliver Entity Awareness and Help Data LinkingAnalytics on Big Knowledge Graphs Deliver Entity Awareness and Help Data Linking
Analytics on Big Knowledge Graphs Deliver Entity Awareness and Help Data LinkingOntotext
 
[Conference] Cognitive Graph Analytics on Company Data and News
[Conference] Cognitive Graph Analytics on Company Data and News[Conference] Cognitive Graph Analytics on Company Data and News
[Conference] Cognitive Graph Analytics on Company Data and NewsOntotext
 
Contributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library DataContributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library DataMarcia Zeng
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic WebIvan Herman
 
Inference on the Semantic Web
Inference on the Semantic WebInference on the Semantic Web
Inference on the Semantic WebMyungjin Lee
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesMax Irwin
 
From the Semantic Web to the Web of Data: ten years of linking up
From the Semantic Web to the Web of Data: ten years of linking upFrom the Semantic Web to the Web of Data: ten years of linking up
From the Semantic Web to the Web of Data: ten years of linking upDavide Palmisano
 
The Next Generation of AI-powered Search
The Next Generation of AI-powered SearchThe Next Generation of AI-powered Search
The Next Generation of AI-powered SearchTrey Grainger
 
RDF SHACL, Annotations, and Data Frames
RDF SHACL, Annotations, and Data FramesRDF SHACL, Annotations, and Data Frames
RDF SHACL, Annotations, and Data FramesKurt Cagle
 
Natural Language Search with Knowledge Graphs (Haystack 2019)
Natural Language Search with Knowledge Graphs (Haystack 2019)Natural Language Search with Knowledge Graphs (Haystack 2019)
Natural Language Search with Knowledge Graphs (Haystack 2019)Trey Grainger
 
The Semantic Knowledge Graph
The Semantic Knowledge GraphThe Semantic Knowledge Graph
The Semantic Knowledge GraphTrey Grainger
 
How to Build a Semantic Search System
How to Build a Semantic Search SystemHow to Build a Semantic Search System
How to Build a Semantic Search SystemTrey Grainger
 

Was ist angesagt? (17)

Analytics on Big Knowledge Graphs Deliver Entity Awareness and Help Data Linking
Analytics on Big Knowledge Graphs Deliver Entity Awareness and Help Data LinkingAnalytics on Big Knowledge Graphs Deliver Entity Awareness and Help Data Linking
Analytics on Big Knowledge Graphs Deliver Entity Awareness and Help Data Linking
 
Sindice warehousing meetup
Sindice warehousing meetupSindice warehousing meetup
Sindice warehousing meetup
 
[Conference] Cognitive Graph Analytics on Company Data and News
[Conference] Cognitive Graph Analytics on Company Data and News[Conference] Cognitive Graph Analytics on Company Data and News
[Conference] Cognitive Graph Analytics on Company Data and News
 
Contributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library DataContributing to the Smart City Through Linked Library Data
Contributing to the Smart City Through Linked Library Data
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
Inference on the Semantic Web
Inference on the Semantic WebInference on the Semantic Web
Inference on the Semantic Web
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
 
From the Semantic Web to the Web of Data: ten years of linking up
From the Semantic Web to the Web of Data: ten years of linking upFrom the Semantic Web to the Web of Data: ten years of linking up
From the Semantic Web to the Web of Data: ten years of linking up
 
The Next Generation of AI-powered Search
The Next Generation of AI-powered SearchThe Next Generation of AI-powered Search
The Next Generation of AI-powered Search
 
RDF SHACL, Annotations, and Data Frames
RDF SHACL, Annotations, and Data FramesRDF SHACL, Annotations, and Data Frames
RDF SHACL, Annotations, and Data Frames
 
Scaling the (evolving) web data –at low cost-
Scaling the (evolving) web data –at low cost-Scaling the (evolving) web data –at low cost-
Scaling the (evolving) web data –at low cost-
 
Extracting keywords from texts - Sanda Martincic Ipsic
Extracting keywords from texts - Sanda Martincic IpsicExtracting keywords from texts - Sanda Martincic Ipsic
Extracting keywords from texts - Sanda Martincic Ipsic
 
Natural Language Search with Knowledge Graphs (Haystack 2019)
Natural Language Search with Knowledge Graphs (Haystack 2019)Natural Language Search with Knowledge Graphs (Haystack 2019)
Natural Language Search with Knowledge Graphs (Haystack 2019)
 
Jpl presentation
Jpl presentationJpl presentation
Jpl presentation
 
The Semantic Knowledge Graph
The Semantic Knowledge GraphThe Semantic Knowledge Graph
The Semantic Knowledge Graph
 
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
 
How to Build a Semantic Search System
How to Build a Semantic Search SystemHow to Build a Semantic Search System
How to Build a Semantic Search System
 

Ähnlich wie CSHALS 2010 W3C Semanic Web Tutorial

Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data TutorialSören Auer
 
Linked data HHS 2015
Linked data HHS 2015Linked data HHS 2015
Linked data HHS 2015Cason Snow
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark GreavesMediabistro
 
Understanding RDF: the Resource Description Framework in Context (1999)
Understanding RDF: the Resource Description Framework in Context  (1999)Understanding RDF: the Resource Description Framework in Context  (1999)
Understanding RDF: the Resource Description Framework in Context (1999)Dan Brickley
 
Lodlam saa 2011_jenelfarrell_2
Lodlam saa 2011_jenelfarrell_2Lodlam saa 2011_jenelfarrell_2
Lodlam saa 2011_jenelfarrell_2Jenel Farrell
 
RDFa Semantic Web
RDFa Semantic WebRDFa Semantic Web
RDFa Semantic WebRob Paok
 
Hack U Barcelona 2011
Hack U Barcelona 2011Hack U Barcelona 2011
Hack U Barcelona 2011Peter Mika
 
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...Semantic - Based Querying Using Ontology in Relational Database of Library Ma...
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...dannyijwest
 
Linked Data and Locah, UKSG2011
Linked Data and Locah, UKSG2011 Linked Data and Locah, UKSG2011
Linked Data and Locah, UKSG2011 Jane Stevenson
 
Publishing data on the Semantic Web
Publishing data on the Semantic WebPublishing data on the Semantic Web
Publishing data on the Semantic WebPeter Mika
 
Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?Museums Computer Group
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedSören Auer
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of DataRinke Hoekstra
 
Web 3.0 & IoT (English)
Web 3.0 & IoT (English)Web 3.0 & IoT (English)
Web 3.0 & IoT (English)Peter Waher
 

Ähnlich wie CSHALS 2010 W3C Semanic Web Tutorial (20)

Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
 
Semantic Web and Linked Open Data
Semantic Web and Linked Open DataSemantic Web and Linked Open Data
Semantic Web and Linked Open Data
 
Linked data and voyager
Linked data and voyagerLinked data and voyager
Linked data and voyager
 
Linked data HHS 2015
Linked data HHS 2015Linked data HHS 2015
Linked data HHS 2015
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark Greaves
 
Understanding RDF: the Resource Description Framework in Context (1999)
Understanding RDF: the Resource Description Framework in Context  (1999)Understanding RDF: the Resource Description Framework in Context  (1999)
Understanding RDF: the Resource Description Framework in Context (1999)
 
Semantic web
Semantic web Semantic web
Semantic web
 
Lodlam saa 2011_jenelfarrell_2
Lodlam saa 2011_jenelfarrell_2Lodlam saa 2011_jenelfarrell_2
Lodlam saa 2011_jenelfarrell_2
 
Jpl presentation
Jpl presentationJpl presentation
Jpl presentation
 
Jpl presentation
Jpl presentationJpl presentation
Jpl presentation
 
RDFa Semantic Web
RDFa Semantic WebRDFa Semantic Web
RDFa Semantic Web
 
Hack U Barcelona 2011
Hack U Barcelona 2011Hack U Barcelona 2011
Hack U Barcelona 2011
 
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...Semantic - Based Querying Using Ontology in Relational Database of Library Ma...
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...
 
Semantics
SemanticsSemantics
Semantics
 
Linked Data and Locah, UKSG2011
Linked Data and Locah, UKSG2011 Linked Data and Locah, UKSG2011
Linked Data and Locah, UKSG2011
 
Publishing data on the Semantic Web
Publishing data on the Semantic WebPublishing data on the Semantic Web
Publishing data on the Semantic Web
 
Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
 
Querying the Web of Data
Querying the Web of DataQuerying the Web of Data
Querying the Web of Data
 
Web 3.0 & IoT (English)
Web 3.0 & IoT (English)Web 3.0 & IoT (English)
Web 3.0 & IoT (English)
 

Mehr von LeeFeigenbaum

Data Segmenting in Anzo
Data Segmenting in AnzoData Segmenting in Anzo
Data Segmenting in AnzoLeeFeigenbaum
 
Intro to the Semantic Web Landscape - 2011
Intro to the Semantic Web Landscape - 2011Intro to the Semantic Web Landscape - 2011
Intro to the Semantic Web Landscape - 2011LeeFeigenbaum
 
Evolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic WebEvolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic WebLeeFeigenbaum
 
Taking the Tech out of SemTech
Taking the Tech out of SemTechTaking the Tech out of SemTech
Taking the Tech out of SemTechLeeFeigenbaum
 
What;s Coming In SPARQL2?
What;s Coming In SPARQL2?What;s Coming In SPARQL2?
What;s Coming In SPARQL2?LeeFeigenbaum
 
Semantic Web Landscape 2009
Semantic Web Landscape 2009Semantic Web Landscape 2009
Semantic Web Landscape 2009LeeFeigenbaum
 

Mehr von LeeFeigenbaum (8)

Data Segmenting in Anzo
Data Segmenting in AnzoData Segmenting in Anzo
Data Segmenting in Anzo
 
Intro to the Semantic Web Landscape - 2011
Intro to the Semantic Web Landscape - 2011Intro to the Semantic Web Landscape - 2011
Intro to the Semantic Web Landscape - 2011
 
Evolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic WebEvolution Towards Web 3.0: The Semantic Web
Evolution Towards Web 3.0: The Semantic Web
 
Taking the Tech out of SemTech
Taking the Tech out of SemTechTaking the Tech out of SemTech
Taking the Tech out of SemTech
 
What;s Coming In SPARQL2?
What;s Coming In SPARQL2?What;s Coming In SPARQL2?
What;s Coming In SPARQL2?
 
SPARQL 1.1 Status
SPARQL 1.1 StatusSPARQL 1.1 Status
SPARQL 1.1 Status
 
SPARQL Cheat Sheet
SPARQL Cheat SheetSPARQL Cheat Sheet
SPARQL Cheat Sheet
 
Semantic Web Landscape 2009
Semantic Web Landscape 2009Semantic Web Landscape 2009
Semantic Web Landscape 2009
 

Kürzlich hochgeladen

Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 

Kürzlich hochgeladen (20)

Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 

CSHALS 2010 W3C Semanic Web Tutorial

  • 1. The Semantic Web LandscapeA Practical Introduction Lee Feigenbaum VP Technology & Standards, Cambridge Semantics Co-chair, W3C SPARQL Working Group For CSHALS 2010 Tutorial Attendees February 24, 2010
  • 2. The W3C HCLS interest group set out to use Semantic Web technologies to receive precise answers to a complex question: A Motivating Example: Drug Discovery Find me genes involved in signal transduction that are related to pyramidal neurons.
  • 3. General search 223,000 hits, 0 results
  • 4. Domain-limited search 2,580 potential results
  • 6. A Semantic Web Approach Integrate disparate databases… MeSH PubMed Entrez Gene Gene Ontology …
  • 7. A Semantic Web Approach (cont’d) …so that onequery…
  • 8. A Semantic Web Approach (cont’d) …(trivially) spans several databases…
  • 9. A Semantic Web Approach (cont’d) …to deliver targeted results…
  • 10. Agreement on common terms and relationships Incremental, flexible data structure Good-enough modeling Query interface tailored to the data model What’s the trick?
  • 11. What is the semantic web?
  • 12. Names
  • 13. Semantic Web Web of Data Giant Global Graph Data Web Web 3.0 Linked Data Web Semantic Data Web Branding
  • 14. “The Semantic Web” a.k.a “Linked Open Data” Augments the World Wide Web Represents the Web’s information in a machine-readable fashion Enables… …targeted search …data browsing …automated agents What is it & why do we care? (1) World Wide Web : Web pages :: The Semantic Web : Data
  • 15. “Semantic Web technologies” A family of technology standards that ‘play nice together’, including: Flexible data model Expressive ontology language Distributed query language Drive Web sites, enterprise applications What is it & why do we care? (2) The technologies enable us to build applications and solutions that were not possible, practical, or feasible traditionally.
  • 16. A common set of technologies: ...enables diverse uses ...encourages interoperability A coherent set of technologies: …encourage incremental application …provide a substantial base for innovation A standard set of technologies: ...reduces proprietary vendor lock-in ...encourages many choices for tool sets A Common & Coherent Set of Technology Standards
  • 18. Semantic Web Technology Timeline 2001 2004 2008 2010 2007 1999 RIF HCLS
  • 19. As technologies & tools have evolved, Semantic Web advocates have progressed through stages: 2010: Where we are
  • 20. 2010: Where we’re not Image from Trey Ideker via Enoch Huang Semantic Web technologies are not a ‘magic crank’ for discovering new drugs (or solving other problems, for that matter)!
  • 21. 2010: Where we’re not (cont’d) XML vs. RDF? “Ontology” vs. “ontology”? Data integration vs. reasoning vs. KBs vs. search vs. app. development vs. … Semantic Web vs. Linked Data? The Semantic Web still suffers from confusing and conflicting messaging, each of which asserts it’s “correct”.
  • 22. 2010: Where we’re not (cont’d) People with appropriate skill sets for designing & building Semantic Web solutions are not widely available.
  • 23. 2010: Where we’re not (cont’d) We don’t yet have standard solutions for privacy, trust, probability, and other elements of the Semantic Web vision.
  • 24. What do Semantic Web solutions look like?
  • 25. RDF is… Resource Description Framework
  • 26. RDF is… The data model of the Semantic Web.
  • 27. RDF is… A schema-less data model that features unambiguous identifiers and named relations between pairs of resources.
  • 28. RDF is… A labeled, directed graph of relations between resources and literal values. RDF graphs are collections of triples Triples are made up of a subject, a predicate, and an object Resources and relationships are named with URIs predicate subject object
  • 29. “Lee Feigenbaum works for Cambridge Semantics” “Lee Feigenbaum was born in 1978” “Cambridge Semantics is headquartered in Massachusetts” Example RDF triples works for born in headquartered Lee Feigenbaum Cambridge Semantics Lee Feigenbaum Cambridge Semantics 1978 Massachusetts
  • 30. Triples connect to form graphs headquartered lives in Massachusetts born in capital works for Lee Feigenbaum Cambridge Semantics Boston 1978
  • 31. The graph data structure makes merging datawith shared identifiers trivial Triples act as a least common denominatorfor expressing data URIs for naming remove ambiguity …the same identifier means the same thing Why RDF? What’s different here?
  • 32. Why RDF? Incremental Integration RelationalDatabase RDF
  • 33. RDF is the model, for which there are several concrete syntaxes: RDF/XML – standard, complex XML syntax Turtle – common, textual, triples-oriented syntax N3 – more expressive superset of Turtle N-Triples – textual, line-oriented, useful for streaming What does RDF look like? When writing RDF by hand and in many guides, examples, and discussions these days, you’ll see Turtle most often.
  • 34. Write a triple by writing its parts separated by spaces (subject predicate object) A Bit of Turtle @prefix ex: <http://example.org/myvocab/> . @prefix geo: <http://geonames.example/> . ex:LeeFeigenbaumex:employerex:CambridgeSemantics . ex:LeeFeigenbaumex:birthYear 1978 . ex:CambridgeSemanticsex:headquartersgeo:BostonMA . geo:BostonMAex:population 574000 .
  • 35. SPARQL is… SPARQL Protocol And RDF Query Language
  • 36. SPARQL is… The query language of the Semantic Web.
  • 37. SPARQL is… A SQL-like language for querying sets of RDF graphs.
  • 38. SPARQL is… A simple protocol for issuing queries and receiving results over HTTP. So… Every SPARQL client works with every SPARQL server!
  • 39. SPARQL lets us: Pull information from structured and semi-structured data. Explore data by discovering unknown relationships. Query and search an integrated view of disparate data sources. Glue separate software applications together by transforming data from one vocabulary to another. Why SPARQL?
  • 40. Dealer 1 Dealer 2 Dealer 3 Employee Directory ERP / Budget System Web EPA Fuel Efficiency Spreadsheet SPARQL Query Engine What automobiles get more than 25 miles per gallon, fit within my department’s budget, and can be purchased at a dealer located within 10 miles of one of my employees? SELECT ?automobile WHERE { ?automobile a ex:Car ; epa:mpg ?mpg ; ex:dealer ?dealer . ?employee a ex:Employee ; geo:loc ?loc . ?dealer geo:loc ?dealerloc . FILTER(?mpg > 25 && geo:dist(?loc, ?dealerloc) <= 10) . } Web dashboard SPARQL query
  • 41. bio2rdf.org – querying life sciences data
  • 42. bio2rdf.org – querying life sciences data
  • 43. 3 pieces of the Semantic Web technology stack are about describing a domain well enough to capture (some of) the meaning of resources and relationships in the domain RDF Schema OWL RIF From the explicit to the inferred Apply knowledge to data to get more data.
  • 44. RDFS is… RDF Schema
  • 45. Elements of: Vocabulary (defining terms) I define a relationship called “prescribed dose.” Schema (defining types) “prescribed dose” relates “treatments” to “dosages” (my prescribed dose is 2mg; therefore 2mg is a dosage) Taxonomy (defining hierarchies) Any “doctor” is a “medical professional” (therefore Dr. Brown is a medical professional) RDF Schema is…
  • 46. WOL OWL is… Web Ontology Language
  • 47. Elements of ontology Same/different identity “author” and “auteur” are the same relation two resources with the same “ISBN” are the same “book” More expressive type definitions A “cycle” is a “vehicle” with at least one “wheel” A “bicycle” is a “cycle” with exactly two “wheels” More expressive relation definitions “sibling” is a symmetric predicate the value of the “favorite dwarf” relation must be one of “happy”, “sleepy”, “sneezy”, “grumpy”, “dopey”, “bashful”, “doc” OWL is…
  • 48. A class is a (named) collection of things with similar attributes OWL: Rich Class Definitions
  • 49. A class is a (named) collection of things with similar attributes OWL: Rich Class Definitions
  • 50. A class is a (named) collection of things with similar attributes OWL: Rich Class Definitions
  • 51. OWL: Rich Class Definitions
  • 52. RIF is… Rules Interchange Format
  • 53. Standard representation for exchanging sets of logical and business rules Logical rules A buyer buys an item from a seller if the seller sells the item to the buyer A customer becomes a "Gold" customer as soon as his cumulative purchases during the current year top $5000 Production rules Customers that become "Gold" customers must be notified immediately, and a golden customer card will be printed and sent to them within one week For shopping carts worth more than $1000, "Gold" customers receive an additional discount of 10% of the total amount RIF is…
  • 54. Fantasy Land Architecture Ontology / Schema + Custom UI Custom UI Custom UI Custom UI Custom UI Custom UI
  • 55. Reality Internet DB2 XML LDAP Directory Oracle RDB Custom UI Custom UI Custom UI Custom UI Custom UI Custom UI
  • 56. GRDDL is… Gleaning Resource Descriptions from Dialects of Language
  • 57. GRDDL is… A method for authoritatively getting RDF data from XML and XHTML documents.
  • 58. GRDDL is… A mechanism for authoritatively deriving RDF data from families of XML and XHTML documents.
  • 59. RDB2RDF is… Relational Database toRDF
  • 60. RDB2RDF is… A W3C Working Group to define a standard way to map from relational databases to RDF (and SPARQL).
  • 61.
  • 62. The LOD “cloud”, March 2009
  • 63.
  • 64.
  • 65. RDFa is… A collection of HTML attributes that allow RDF to be embedded directly in Web pages.
  • 66. Don’t Repeat Yourself (DRY) In-context metadata (copy & paste) Authoritative (no screen scrapig) Why RDFa?
  • 69. Semantic Web Tools In 2010, there are a wide variety of open-source and commercial Semantic Web tools available.
  • 70. Triple stores Built on relational database Native RDF store Development libraries Full-featured application servers Types of RDF Tools Most RDF tools contain some elements of each of these.
  • 71. Community-maintained lists http://esw.w3.org/topic/SemanticWebTools Emphasis on large triple stores http://esw.w3.org/topic/LargeTripleStores Michael Bergman’s Sweet Tools searchable list: http://www.mkbergman.com/?page_id=325 Finding RDF Tools
  • 72. Query engines Things that can run queries Most RDF stores provide a SPARQL engine Query rewriters E.g. to query relational databases (more later) Endpoints Things that accept queries on the Web and return results Client libraries Things that make it easy to ask queries Types of SPARQL Tools
  • 73. Community-maintained list of query engines http://esw.w3.org/topic/SparqlImplementations Publicly accessible SPARQL endpoints http://esw.w3.org/topic/SparqlEndpoints Michael Bergman’s Sweet Tools searchable list: http://www.mkbergman.com/?page_id=325 Finding SPARQL Tools
  • 74. Editors/environments Oiled, Protégé, Swoop, TopBraid, Ontotrack, … Developing Tools and Infrastructure
  • 75. Editors/environments Oiled, Protégé, Swoop, TopBraid, Ontotrack, … Reasoning systems Cerebra, FaCT++, Kaon2, Pellet, Racer, CEL, … Developing Tools and Infrastructure Pellet KAON2 CEL
  • 77. Reusable, public ontologies FOAF The Event Ontology Measurement Units Ontology
  • 78. Community-maintained list: http://esw.w3.org/topic/GrddlImplementations GRDDL tools Most GRDDL tools are adapters to existing RDF stores or SPARQL engines to allow loading or querying data from XML and XHTML sources.
  • 79. What about… everything else? Standards don’t yet exist, but many tools exist to derive RDF and/or run SPARQL queries against other sources of data.
  • 80. LDAP Directories Squirrel RDF http://jena.sourceforge.net/SquirrelRDF/
  • 81. Excel spreadsheets Anzo for Excel http://www.cambridgesemantics.com/products/anzo_for_excel
  • 82. Web-based data sources Virtuoso Sponger Cartridges http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtSponger
  • 83. Unstructured Text Calais http://www.opencalais.com/
  • 84. Unstructured Text Zemanta Web Service http://developer.zemanta.com/
  • 85. On the Web Google, Yahoo! Best Buy NY Times US Government UK Government Where is it being used?
  • 86. Industries Oil & Gas (integration, classification) Finance (structured data, ontologies, XBRL) Publishing (metadata) Government (structured data, metadata, classification) Libraries & museums (metadata, classification) IT (rapid application development & evolution) Where is it being used?
  • 87. Health Care Cleveland Clinic Clinical research Data integration, classification (= better search) UT School of Health Public health surveillance SAPPHIRE—classification, ontology-driven development Various Clinical Decision Support Agile, rule-driven, scalable in the face of change Where is it being used?
  • 88. Life Sciences Agile knowledgebases at Pfizer Target assessment at Eli Lilly Integrated information links at Novartis Astra Zeneca, J&J, UCB, … Where is it being used? CSHALS chronicles many of these uses and many more.
  • 90. These are horizontal, enabling technologies. But they apply particularly well to problems with these characteristics: Heterogeneous data from multiple sources Increasing reliance on connections within this data Rapidly changing information needs Significant early-mover advantage Large amounts of data that would benefit from classification Why are Semantic Web technologies appropriate for the life sciences? Many tactical and strategic challenges in the life sciences industry feature these traits.
  • 91. Getting Started with Semantic Web technologies Don’t boil the ocean.
  • 92. Getting Started with Semantic Web technologies Goal: quick tactical wins on the path to large strategic value Be sure to consider the operational ramifications Who does what differently? Ideal Semantic Web projects/applications have an incremental path towards broad deployment that generates demonstrable value along the way
  • 93. Look beyond the core Semantic Web capabilities and consider: integration with existing enterprise systems development & extension models deployment, logging, maintenance, backup tooling user experience Choose practical, enterprise-ready tools If you choose to build new components and assemble existing components together, it’s quite likely you’ll end up reinventing the wheel.
  • 94. What level of expertise is necessary? Technologies only? Technologies + API? Technologies + tooling? Tooling only? … How will we acquire the expertise? In-house (and if so, how?) Vendor services 3rd-party services Open-source community Plan for Acquiring Expertise
  • 95. I’m always happy to field questions & engage in discussion: lee@cambridgesemantics.com Thanks & Discussion

Hinweis der Redaktion

  1. One of the goals of this tutorial is to de-mystify the all of the names of technologies, tools, projects, etc. that swirl around the Semantic Web story.And since I saw that as I researched this presentation, everyone seems to like this particular Gary Larson cartoon, it behooved me to include it.
  2. The good – emphasize the importance of the foundational layers (URIs and RDF) ; emphasizes the long-term roadmap/vision of what’s needed for the Semantic WebThe bad – implies that perhaps things can’t be taken serious until all the pieces are in place ; implies an order to the research ; various versions of the cake tell different stories (importance of XML, absence of query, lack of UI/application layer, …)Valentin Zacharias wrote about the “infamy” part of the layer cake here: http://www.valentinzacharias.de/blog/2007/04/ban-semantic-web-layer-cake.html
  3. The Ontology/ontology dichotomy is captured well by Jim Hendler at http://www.cs.rpi.edu/%7Ehendler/presentations/SemTech2008-2Towers.pdf
  4. Definition.
  5. Prescriptive.
  6. Descriptive.
  7. Formal.
  8. The first is as opposed to relational tables or XML schemas where the schema needs to be explicitly adjusted to accommodate whatever data is being merged.The second is due to the expressivity of the model – can handle lists, trees, n-ary relations, etc.The third is as opposed to table &amp; column identifiers or XML attribute names.
  9. Definition.
  10. Prescriptive.
  11. Descriptive.
  12. Descriptive (part 2). This is leagues ahead of the situation with SQL!
  13. http://bio2rdf.org/
  14. http://bio2rdf.org/
  15. Definition.
  16. Definition.
  17. Definition.