SlideShare ist ein Scribd-Unternehmen logo
1 von 35
Downloaden Sie, um offline zu lesen
The Web of Data
NISO Virtual Conference
19 February 2014
Ralph Swick, W3C
Agenda
• Data is changing our lives
• W3C’s traditional focus
• Expanding scope of W3C’s data activities
Web has transformed our relation
to computers and to data
• A computer in every pocket
• Apps leveraging context
– geolocation and other sensors
– social context (“I’m at the conference, too!”)
• Change in the use of search
– people search for answers, not sites
– answers from aggregated data
(Siri, Google Now, Wolfram Alpha)
Apps are using data from many
sources
•
•
•
•

Social networking
Mobile devices
Sensors
Open data
Imagine…
• A “Web” where
– documents are available for download
on the Internet
– but there would be no hyperlinks
among them
Data on the Web is not enough…
• We need a proper infrastructure for a
real Web of Data where:
– data are available on the Web
• accessible via standard Web technologies

– data are interlinked over the Web
– data can be integrated over the Web

• This is Linked Data
Agenda
• Data is changing our lives
• W3C’s traditional focus
• Expanding scope of W3C’s data activities
Semantic Web Core
•
•
•
•
•
•
•
•
•
•

RDF
RDF Schema
RDB2RDF
SPARQL
SKOS
OWL
RIF
LDP
POWDER
GRDDL

data model
vocabulary design
relational DB export
query
vocabulary description
ontological inference
rules interchange
read-write Web of Data
description resources
app-specific XML
Need for RDF schemas
• First step towards the “extra knowledge”:
– define the terms we can use
– what restrictions apply
– what extra relationships are there?

• “RDF Vocabulary Description Language”
– the term “Schema” is retained for historical
reasons…
Vocabularies
• There is a need for “languages” to
define such vocabularies
– to define those vocabularies
– to assign clear “semantics” on how new
relationships can be deduced
SKOS
• SKOS provides a simple bridge
between the “print world” and the
(Semantic) Web
• Thesauri, glossaries, etc., from the
library community can be made
available
• SKOS can also be used to organize,
e.g., tags, annotate other vocabularies,
…
Semantic Web/Linked Data Today
• Standards are mature
– some level of maintenance work is always needed

• Server-side applications dominate
• Commercial applications exist, e.g.:
– direct integration/usage of linked data on the Web
– consumption of other formats converted internally to a
common format (RDF)
Challenge: leverage data in
interoperable apps
• Public, private, behind enterprise firewalls
• From informal to highly curated
• From machine readable to human readable
– HTML tables, twitter feeds, local vocabularies,
spreadsheets, …
• Expressed in diverse data models
– tree, graph, table, …
• Serialized in many ways
– XML, CSV, RDF, PDF, JSON, HTML Tables,…
The Linking Open Data Project
Linked Data Principles
Is your data 5 Star?
Available on the Web in some format (i.e., use URI to access the
data)
Available as machine-readable structured data (e.g., excel instead
of an image scan)
As before, but using a non-proprietary format (e.g., CSV instead of
excel)
All the above, plus use open standards (RDF & Co.) to identify
things, so that people could point at your stuff
All the above, plus link your data to other people’s data to provide
context
A Three Star Example
The importance of Linked Data
• Provide a core set of data that
applications can build on
– stable references for “things”,
• e.g., http://dbpedia.org/resource/Kolkata/

– many many relationships that applications
may reuse
– a “nucleus” for a larger, semantically
enabled Web!
Linked Data Platform (LDP)
• Define an HTTP/RESTful based
infrastructure to publish, read, write, or
modify linked data
– typical usage: data intensive application in a
browser, application integration using shared
data…

• The infrastructure should be easy to
implement and install
– provides an “entry point” for Linked Data
applications!

• The work is nearing completion
RDF with HTML: RDFa
• By adding some “meta” information, the
same source can be reused
– typical example: your personal information,
like address, should be readable for humans
and processable by machines

• Some solutions have emerged:
– add extra statements in microdata or RDFa
that can be converted to RDF
• microdata can be used for a (useful) subset of RDF
• RDFa is, essentially, a complete serialization of
RDF
schema.org
• Schema.org is a cooperation of search engines
(Bing, Google, Yahoo!, and Yandex)
• It is a large vocabulary that they all understand
• The terms are extracted from
HTML5+microdata or HTML5+RDFa
– the various partners use it for different purposes
– it can be used by anyone outside of the search
world!
The Web of Data: The W3C Semantic Web Initiative
Some things to remember when
you publish data
• Publish your data first, do user interfaces later!
– the “raw data” can become useful on its own right
and others may use it
– you can add your added value later by providing nice
user access

• If possible, publish your data in RDF but if you
cannot, others may help you in conversions
– trust the community…

• Add links to other data. “Just” publishing isn’t
enough…
Some things to remember when
you publish data (2)
• Think about persistence and versioning
– others may depend on the data you publish…

• Be thoughtful about the URIs you choose
• Try to avoid reinventing the wheel when
choosing vocabularies
Some things to remember when
you publish data (3)
• Document your data, i.e., provide
metadata
– there are vocabularies to do this
•
•
•
•

Data Catalog Vocabulary (DCAT)
Vocabulary of Interlinked Datasets (VoID)
DCTERMS
vocabularies for licensing (Open Data Commons,
government licenses)
– this area is still very much in development…
Agenda
• Data is changing our lives
• W3C’s work on data integration
• Expanding scope of W3C’s data activities
New work underway
• CSV on the Web
• Data on the Web Best Practices

• Vocabulary management
What we are hearing
• CSV is everywhere
– can be huge data sets, not easily readable in a spreadsheet
or Google refine
– meaning of data not in machine-readable form
– data is not necessarily used for web-scale integration but
rather immediate usage

• Metadata is essential
• Conversion is an issue
• European Commission Study on business models
for Linked Open Government Data (BM4LOGD)
Linked Data Benefits (BM4LOD)
• Flexible data integration
– Streamlined internal processes
– Where working relationships already exist, much easier to
share
– Linking reference collections; discovery of new relationships

• Increase in data quality
– More use of data internally brings errors to light
– Use of open standards increases quality of system

• New services
• Cost reduction
– Increased efficiency
– Increase in data usage due to LOD enrichment
CSV on the Web
• How W3C can help
– metadata vocabulary to describe CSV data (structure,
reference to access rights, annotations, etc.)
– metadata discovery (e.g., part of an HTTP header, special
rows and columns, packaging formats…)
– mapping content to RDF, JSON, XML
Best practices
• Document best practices for the data publishers
– URI design, management of persistence, versioning
– business models
– use of core metadata vocabularies (provenance, access
control, ownership)

• Specific vocabularies
– quality, application descriptions, …
Vocabulary management:
challenge
• Interoperable vocabularies are key for (meta)data
• At the moment, it is a fairly chaotic world…
– many, possibly overlapping vocabularies
– difficult to locate the one that is needed
– vocabularies may not be properly managed, maintained,
versioned, provided persistence…
Vocabulary management: how
W3C can help
• Provide a space where
– communities can develop vocabularies (through, e.g.,
CGs, possibly WGs)
– host vocabularies at W3C if requested
– annotate vocabularies with a proper set of metadata terms
– establish a vocabulary directory

• The exact structure is still being discussed
Summary
• Data-driven smart apps are one of the major growth
engines for the worldwide software market.
• We need to meet developers where they are.
• 5 Star Benefits of LOD
–
–
–
–
–

Greater efficiency, better provision of the task
Greater flexibility leads to lower costs for future projects
New services, new connections, new discoveries
Improved navigation within and between datasets
Others can build apps based on your data
Available specifications:
Primers, Guides`
• Primers:
– RDF Primer
– OWL Guide

– SKOS Primer
– GRDDL Primer
– RDFa Primer

• The W3C Semantic Web Activity Wiki has links to all
the specifications
These slides are in the Web at
http://www.w3.org/2014/Talks
/0219-NISO-RRS
with thanks to Ivan Herman, W3C
and Phil Archer, W3C

Más contenido relacionado

Was ist angesagt?

ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...eswcsummerschool
 
Hide the Stack: Toward Usable Linked Data
Hide the Stack:Toward Usable Linked DataHide the Stack:Toward Usable Linked Data
Hide the Stack: Toward Usable Linked Dataaba-sah
 
Designing Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for LibrariesDesigning Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for LibrariesRichard Wallis
 
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...DuraSpace
 
A review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebA review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebSimon Price
 
Entification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataEntification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataRichard Wallis
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataShenghui Wang
 
Linked Data Best Practices and BibFrame
Linked Data Best Practices and BibFrameLinked Data Best Practices and BibFrame
Linked Data Best Practices and BibFrameRobert Sanderson
 
Linked Data for Libraries: Experiments between Cornell, Harvard and Stanford
Linked Data for Libraries: Experiments between Cornell, Harvard and StanfordLinked Data for Libraries: Experiments between Cornell, Harvard and Stanford
Linked Data for Libraries: Experiments between Cornell, Harvard and StanfordSimeon Warner
 
Envisioning Social Applications of Library Linked Data
Envisioning Social Applications of Library Linked DataEnvisioning Social Applications of Library Linked Data
Envisioning Social Applications of Library Linked DataUldis Bojars
 
Exploring the Networks in Open Public Data
Exploring the Networks in Open Public DataExploring the Networks in Open Public Data
Exploring the Networks in Open Public DataUldis Bojars
 
Linked data for Ebook discovery
Linked data for Ebook discoveryLinked data for Ebook discovery
Linked data for Ebook discoveryRichard Wallis
 

Was ist angesagt? (20)

ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
 
Todd Carpenter Presentation at Project Muse Publishers Meeting - April 24, 2014
Todd Carpenter Presentation at Project Muse Publishers Meeting - April 24, 2014Todd Carpenter Presentation at Project Muse Publishers Meeting - April 24, 2014
Todd Carpenter Presentation at Project Muse Publishers Meeting - April 24, 2014
 
Embedding Linked Data Invisibly into Web Pages: Strategies and Workflows for ...
Embedding Linked Data Invisibly into Web Pages: Strategies and Workflows for ...Embedding Linked Data Invisibly into Web Pages: Strategies and Workflows for ...
Embedding Linked Data Invisibly into Web Pages: Strategies and Workflows for ...
 
Hide the Stack: Toward Usable Linked Data
Hide the Stack:Toward Usable Linked DataHide the Stack:Toward Usable Linked Data
Hide the Stack: Toward Usable Linked Data
 
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
 
Designing Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for LibrariesDesigning Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for Libraries
 
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
4.16.15 Slides, “Enhancing Early Career Researcher Profiles: VIVO & ORCID Int...
 
A review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebA review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic Web
 
NISO Webinar: Content on the Go: Mobile Access to E-Resources
NISO Webinar: Content on the Go: Mobile Access to E-Resources NISO Webinar: Content on the Go: Mobile Access to E-Resources
NISO Webinar: Content on the Go: Mobile Access to E-Resources
 
Thompson 6-jun15-final
Thompson 6-jun15-finalThompson 6-jun15-final
Thompson 6-jun15-final
 
Entification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataEntification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library Data
 
April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...
April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...
April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadata
 
Linked Data Best Practices and BibFrame
Linked Data Best Practices and BibFrameLinked Data Best Practices and BibFrame
Linked Data Best Practices and BibFrame
 
Linked Data for Libraries: Experiments between Cornell, Harvard and Stanford
Linked Data for Libraries: Experiments between Cornell, Harvard and StanfordLinked Data for Libraries: Experiments between Cornell, Harvard and Stanford
Linked Data for Libraries: Experiments between Cornell, Harvard and Stanford
 
Envisioning Social Applications of Library Linked Data
Envisioning Social Applications of Library Linked DataEnvisioning Social Applications of Library Linked Data
Envisioning Social Applications of Library Linked Data
 
McDanold-1-jun15
McDanold-1-jun15McDanold-1-jun15
McDanold-1-jun15
 
Exploring the Networks in Open Public Data
Exploring the Networks in Open Public DataExploring the Networks in Open Public Data
Exploring the Networks in Open Public Data
 
Sept 16 NISO Two Part Webinar: The Practicality of Managing E, Part 2: Licensing
Sept 16 NISO Two Part Webinar: The Practicality of Managing E, Part 2: LicensingSept 16 NISO Two Part Webinar: The Practicality of Managing E, Part 2: Licensing
Sept 16 NISO Two Part Webinar: The Practicality of Managing E, Part 2: Licensing
 
Linked data for Ebook discovery
Linked data for Ebook discoveryLinked data for Ebook discovery
Linked data for Ebook discovery
 

Ähnlich wie The Web of Data: The W3C Semantic Web Initiative

Sands Fish - Knowing in the Age of Networked Knowledge
Sands Fish - Knowing in the Age of Networked KnowledgeSands Fish - Knowing in the Age of Networked Knowledge
Sands Fish - Knowing in the Age of Networked Knowledgesandsfish
 
Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) robin fay
 
How does semantic technology work?
How does semantic technology work? How does semantic technology work?
How does semantic technology work? Graeme Wood
 
An Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAn Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAnkur Biswas
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
Overview of Big Data by Sunny
Overview of Big Data by SunnyOverview of Big Data by Sunny
Overview of Big Data by SunnyDignitasDigital1
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentPeter Haase
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of librariesRegan Harper
 
Identity Management: Tools, processes & services
Identity Management: Tools, processes & servicesIdentity Management: Tools, processes & services
Identity Management: Tools, processes & servicesJISC Netskills
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Gautier Poupeau
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
Linked open data project
Linked open data projectLinked open data project
Linked open data projectFaathima Fayaza
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Denodo
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataMarin Dimitrov
 
What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? Debra Shapiro
 

Ähnlich wie The Web of Data: The W3C Semantic Web Initiative (20)

Sands Fish - Knowing in the Age of Networked Knowledge
Sands Fish - Knowing in the Age of Networked KnowledgeSands Fish - Knowing in the Age of Networked Knowledge
Sands Fish - Knowing in the Age of Networked Knowledge
 
Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries)
 
How does semantic technology work?
How does semantic technology work? How does semantic technology work?
How does semantic technology work?
 
An Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAn Introduction to Semantic Web Technology
An Introduction to Semantic Web Technology
 
unit 1 big data.pptx
unit 1 big data.pptxunit 1 big data.pptx
unit 1 big data.pptx
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
Overview of Big Data by Sunny
Overview of Big Data by SunnyOverview of Big Data by Sunny
Overview of Big Data by Sunny
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of libraries
 
Identity Management: Tools, processes & services
Identity Management: Tools, processes & servicesIdentity Management: Tools, processes & services
Identity Management: Tools, processes & services
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...
 
NoSQL
NoSQLNoSQL
NoSQL
 
Semantic web
Semantic webSemantic web
Semantic web
 
Architecting Your First Big Data Implementation
Architecting Your First Big Data ImplementationArchitecting Your First Big Data Implementation
Architecting Your First Big Data Implementation
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Linked open data project
Linked open data projectLinked open data project
Linked open data project
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
 
Implementing Linked Data in Low-Resource Conditions
Implementing Linked Data in Low-Resource ConditionsImplementing Linked Data in Low-Resource Conditions
Implementing Linked Data in Low-Resource Conditions
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection? What flavor of linked data is best for your collection?
What flavor of linked data is best for your collection?
 

Mehr von National Information Standards Organization (NISO)

Mehr von National Information Standards Organization (NISO) (20)

Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
 
Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"
 
Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"
 
Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"
 
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
Ratner "Enhancing Open Science: Assessing Tools & Charting Progress"
 
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
Pfeiffer "Enhancing Open Science: Assessing Tools & Charting Progress"
 
Hahnel “Mapping Progress: Reflections and Charting Future Pathways"
Hahnel “Mapping Progress: Reflections and Charting Future Pathways"Hahnel “Mapping Progress: Reflections and Charting Future Pathways"
Hahnel “Mapping Progress: Reflections and Charting Future Pathways"
 
Stall "Open Science: The Journey of a Scholarly Society"
Stall "Open Science: The Journey of a Scholarly Society"Stall "Open Science: The Journey of a Scholarly Society"
Stall "Open Science: The Journey of a Scholarly Society"
 
Hrynaszkiewicz "A Publisher's Perspective on Open Science"
Hrynaszkiewicz "A Publisher's Perspective on Open Science"Hrynaszkiewicz "A Publisher's Perspective on Open Science"
Hrynaszkiewicz "A Publisher's Perspective on Open Science"
 
Carter "Higher Education Leadership Initiative for Open Scholarship (HELIOS)"
Carter "Higher Education Leadership Initiative for Open Scholarship (HELIOS)"Carter "Higher Education Leadership Initiative for Open Scholarship (HELIOS)"
Carter "Higher Education Leadership Initiative for Open Scholarship (HELIOS)"
 
Webster "NISO Roundtable: Open Science and Libraries"
Webster "NISO Roundtable: Open Science and Libraries"Webster "NISO Roundtable: Open Science and Libraries"
Webster "NISO Roundtable: Open Science and Libraries"
 
Ratan "Are we there yet? Keeping the promise of open science"
Ratan "Are we there yet?  Keeping the promise of open science"Ratan "Are we there yet?  Keeping the promise of open science"
Ratan "Are we there yet? Keeping the promise of open science"
 
Tananbaum "NISO Tech Summit: Reflections on Reflections Upon The Year of Open...
Tananbaum "NISO Tech Summit: Reflections on Reflections Upon The Year of Open...Tananbaum "NISO Tech Summit: Reflections on Reflections Upon The Year of Open...
Tananbaum "NISO Tech Summit: Reflections on Reflections Upon The Year of Open...
 

Último

2024 March 11, Telehealth Billing- Current Telehealth CPT Codes & Telehealth ...
2024 March 11, Telehealth Billing- Current Telehealth CPT Codes & Telehealth ...2024 March 11, Telehealth Billing- Current Telehealth CPT Codes & Telehealth ...
2024 March 11, Telehealth Billing- Current Telehealth CPT Codes & Telehealth ...Marlene Maheu
 
Riti theory by Vamana Indian poetics.pptx
Riti theory by Vamana Indian poetics.pptxRiti theory by Vamana Indian poetics.pptx
Riti theory by Vamana Indian poetics.pptxDhatriParmar
 
30-de-thi-vao-lop-10-mon-tieng-anh-co-dap-an.doc
30-de-thi-vao-lop-10-mon-tieng-anh-co-dap-an.doc30-de-thi-vao-lop-10-mon-tieng-anh-co-dap-an.doc
30-de-thi-vao-lop-10-mon-tieng-anh-co-dap-an.docdieu18
 
POST ENCEPHALITIS case study Jitendra bhargav
POST ENCEPHALITIS case study  Jitendra bhargavPOST ENCEPHALITIS case study  Jitendra bhargav
POST ENCEPHALITIS case study Jitendra bhargavJitendra Bhargav
 
Arti Languages Pre Seed Send Ahead Pitchdeck 2024.pdf
Arti Languages Pre Seed Send Ahead Pitchdeck 2024.pdfArti Languages Pre Seed Send Ahead Pitchdeck 2024.pdf
Arti Languages Pre Seed Send Ahead Pitchdeck 2024.pdfwill854175
 
3.12.24 Freedom Summer in Mississippi.pptx
3.12.24 Freedom Summer in Mississippi.pptx3.12.24 Freedom Summer in Mississippi.pptx
3.12.24 Freedom Summer in Mississippi.pptxmary850239
 
BBA 205 BE UNIT 2 economic systems prof dr kanchan.pptx
BBA 205 BE UNIT 2 economic systems prof dr kanchan.pptxBBA 205 BE UNIT 2 economic systems prof dr kanchan.pptx
BBA 205 BE UNIT 2 economic systems prof dr kanchan.pptxProf. Kanchan Kumari
 
ICS2208 Lecture4 Intelligent Interface Agents.pdf
ICS2208 Lecture4 Intelligent Interface Agents.pdfICS2208 Lecture4 Intelligent Interface Agents.pdf
ICS2208 Lecture4 Intelligent Interface Agents.pdfVanessa Camilleri
 
LEAD6001 - Introduction to Advanced Stud
LEAD6001 - Introduction to Advanced StudLEAD6001 - Introduction to Advanced Stud
LEAD6001 - Introduction to Advanced StudDr. Bruce A. Johnson
 
LEAD6001 - Introduction to Advanced Stud
LEAD6001 - Introduction to Advanced StudLEAD6001 - Introduction to Advanced Stud
LEAD6001 - Introduction to Advanced StudDr. Bruce A. Johnson
 
AUDIENCE THEORY - PARTICIPATORY - JENKINS.pptx
AUDIENCE THEORY - PARTICIPATORY - JENKINS.pptxAUDIENCE THEORY - PARTICIPATORY - JENKINS.pptx
AUDIENCE THEORY - PARTICIPATORY - JENKINS.pptxiammrhaywood
 
AI Uses and Misuses: Academic and Workplace Applications
AI Uses and Misuses: Academic and Workplace ApplicationsAI Uses and Misuses: Academic and Workplace Applications
AI Uses and Misuses: Academic and Workplace ApplicationsStella Lee
 
3.14.24 The Selma March and the Voting Rights Act.pptx
3.14.24 The Selma March and the Voting Rights Act.pptx3.14.24 The Selma March and the Voting Rights Act.pptx
3.14.24 The Selma March and the Voting Rights Act.pptxmary850239
 
The basics of sentences session 8pptx.pptx
The basics of sentences session 8pptx.pptxThe basics of sentences session 8pptx.pptx
The basics of sentences session 8pptx.pptxheathfieldcps1
 
Dhavni Theory by Anandvardhana Indian Poetics
Dhavni Theory by Anandvardhana Indian PoeticsDhavni Theory by Anandvardhana Indian Poetics
Dhavni Theory by Anandvardhana Indian PoeticsDhatriParmar
 
Material Remains as Source of Ancient Indian History & Culture.ppt
Material Remains as Source of Ancient Indian History & Culture.pptMaterial Remains as Source of Ancient Indian History & Culture.ppt
Material Remains as Source of Ancient Indian History & Culture.pptBanaras Hindu University
 
The First National K12 TUG March 6 2024.pdf
The First National K12 TUG March 6 2024.pdfThe First National K12 TUG March 6 2024.pdf
The First National K12 TUG March 6 2024.pdfdogden2
 

Último (20)

2024 March 11, Telehealth Billing- Current Telehealth CPT Codes & Telehealth ...
2024 March 11, Telehealth Billing- Current Telehealth CPT Codes & Telehealth ...2024 March 11, Telehealth Billing- Current Telehealth CPT Codes & Telehealth ...
2024 March 11, Telehealth Billing- Current Telehealth CPT Codes & Telehealth ...
 
Riti theory by Vamana Indian poetics.pptx
Riti theory by Vamana Indian poetics.pptxRiti theory by Vamana Indian poetics.pptx
Riti theory by Vamana Indian poetics.pptx
 
30-de-thi-vao-lop-10-mon-tieng-anh-co-dap-an.doc
30-de-thi-vao-lop-10-mon-tieng-anh-co-dap-an.doc30-de-thi-vao-lop-10-mon-tieng-anh-co-dap-an.doc
30-de-thi-vao-lop-10-mon-tieng-anh-co-dap-an.doc
 
t-test Parametric test Biostatics and Research Methodology
t-test Parametric test Biostatics and Research Methodologyt-test Parametric test Biostatics and Research Methodology
t-test Parametric test Biostatics and Research Methodology
 
POST ENCEPHALITIS case study Jitendra bhargav
POST ENCEPHALITIS case study  Jitendra bhargavPOST ENCEPHALITIS case study  Jitendra bhargav
POST ENCEPHALITIS case study Jitendra bhargav
 
Arti Languages Pre Seed Send Ahead Pitchdeck 2024.pdf
Arti Languages Pre Seed Send Ahead Pitchdeck 2024.pdfArti Languages Pre Seed Send Ahead Pitchdeck 2024.pdf
Arti Languages Pre Seed Send Ahead Pitchdeck 2024.pdf
 
3.12.24 Freedom Summer in Mississippi.pptx
3.12.24 Freedom Summer in Mississippi.pptx3.12.24 Freedom Summer in Mississippi.pptx
3.12.24 Freedom Summer in Mississippi.pptx
 
BBA 205 BE UNIT 2 economic systems prof dr kanchan.pptx
BBA 205 BE UNIT 2 economic systems prof dr kanchan.pptxBBA 205 BE UNIT 2 economic systems prof dr kanchan.pptx
BBA 205 BE UNIT 2 economic systems prof dr kanchan.pptx
 
ICS2208 Lecture4 Intelligent Interface Agents.pdf
ICS2208 Lecture4 Intelligent Interface Agents.pdfICS2208 Lecture4 Intelligent Interface Agents.pdf
ICS2208 Lecture4 Intelligent Interface Agents.pdf
 
LEAD6001 - Introduction to Advanced Stud
LEAD6001 - Introduction to Advanced StudLEAD6001 - Introduction to Advanced Stud
LEAD6001 - Introduction to Advanced Stud
 
LEAD6001 - Introduction to Advanced Stud
LEAD6001 - Introduction to Advanced StudLEAD6001 - Introduction to Advanced Stud
LEAD6001 - Introduction to Advanced Stud
 
AUDIENCE THEORY - PARTICIPATORY - JENKINS.pptx
AUDIENCE THEORY - PARTICIPATORY - JENKINS.pptxAUDIENCE THEORY - PARTICIPATORY - JENKINS.pptx
AUDIENCE THEORY - PARTICIPATORY - JENKINS.pptx
 
AI Uses and Misuses: Academic and Workplace Applications
AI Uses and Misuses: Academic and Workplace ApplicationsAI Uses and Misuses: Academic and Workplace Applications
AI Uses and Misuses: Academic and Workplace Applications
 
3.14.24 The Selma March and the Voting Rights Act.pptx
3.14.24 The Selma March and the Voting Rights Act.pptx3.14.24 The Selma March and the Voting Rights Act.pptx
3.14.24 The Selma March and the Voting Rights Act.pptx
 
The basics of sentences session 8pptx.pptx
The basics of sentences session 8pptx.pptxThe basics of sentences session 8pptx.pptx
The basics of sentences session 8pptx.pptx
 
Dhavni Theory by Anandvardhana Indian Poetics
Dhavni Theory by Anandvardhana Indian PoeticsDhavni Theory by Anandvardhana Indian Poetics
Dhavni Theory by Anandvardhana Indian Poetics
 
Problems on Mean,Mode,Median Standard Deviation
Problems on Mean,Mode,Median Standard DeviationProblems on Mean,Mode,Median Standard Deviation
Problems on Mean,Mode,Median Standard Deviation
 
Least Significance Difference:Biostatics and Research Methodology
Least Significance Difference:Biostatics and Research MethodologyLeast Significance Difference:Biostatics and Research Methodology
Least Significance Difference:Biostatics and Research Methodology
 
Material Remains as Source of Ancient Indian History & Culture.ppt
Material Remains as Source of Ancient Indian History & Culture.pptMaterial Remains as Source of Ancient Indian History & Culture.ppt
Material Remains as Source of Ancient Indian History & Culture.ppt
 
The First National K12 TUG March 6 2024.pdf
The First National K12 TUG March 6 2024.pdfThe First National K12 TUG March 6 2024.pdf
The First National K12 TUG March 6 2024.pdf
 

The Web of Data: The W3C Semantic Web Initiative

  • 1. The Web of Data NISO Virtual Conference 19 February 2014 Ralph Swick, W3C
  • 2. Agenda • Data is changing our lives • W3C’s traditional focus • Expanding scope of W3C’s data activities
  • 3. Web has transformed our relation to computers and to data • A computer in every pocket • Apps leveraging context – geolocation and other sensors – social context (“I’m at the conference, too!”) • Change in the use of search – people search for answers, not sites – answers from aggregated data (Siri, Google Now, Wolfram Alpha)
  • 4. Apps are using data from many sources • • • • Social networking Mobile devices Sensors Open data
  • 5. Imagine… • A “Web” where – documents are available for download on the Internet – but there would be no hyperlinks among them
  • 6. Data on the Web is not enough… • We need a proper infrastructure for a real Web of Data where: – data are available on the Web • accessible via standard Web technologies – data are interlinked over the Web – data can be integrated over the Web • This is Linked Data
  • 7. Agenda • Data is changing our lives • W3C’s traditional focus • Expanding scope of W3C’s data activities
  • 8. Semantic Web Core • • • • • • • • • • RDF RDF Schema RDB2RDF SPARQL SKOS OWL RIF LDP POWDER GRDDL data model vocabulary design relational DB export query vocabulary description ontological inference rules interchange read-write Web of Data description resources app-specific XML
  • 9. Need for RDF schemas • First step towards the “extra knowledge”: – define the terms we can use – what restrictions apply – what extra relationships are there? • “RDF Vocabulary Description Language” – the term “Schema” is retained for historical reasons…
  • 10. Vocabularies • There is a need for “languages” to define such vocabularies – to define those vocabularies – to assign clear “semantics” on how new relationships can be deduced
  • 11. SKOS • SKOS provides a simple bridge between the “print world” and the (Semantic) Web • Thesauri, glossaries, etc., from the library community can be made available • SKOS can also be used to organize, e.g., tags, annotate other vocabularies, …
  • 12. Semantic Web/Linked Data Today • Standards are mature – some level of maintenance work is always needed • Server-side applications dominate • Commercial applications exist, e.g.: – direct integration/usage of linked data on the Web – consumption of other formats converted internally to a common format (RDF)
  • 13. Challenge: leverage data in interoperable apps • Public, private, behind enterprise firewalls • From informal to highly curated • From machine readable to human readable – HTML tables, twitter feeds, local vocabularies, spreadsheets, … • Expressed in diverse data models – tree, graph, table, … • Serialized in many ways – XML, CSV, RDF, PDF, JSON, HTML Tables,…
  • 14. The Linking Open Data Project
  • 15. Linked Data Principles Is your data 5 Star? Available on the Web in some format (i.e., use URI to access the data) Available as machine-readable structured data (e.g., excel instead of an image scan) As before, but using a non-proprietary format (e.g., CSV instead of excel) All the above, plus use open standards (RDF & Co.) to identify things, so that people could point at your stuff All the above, plus link your data to other people’s data to provide context
  • 16. A Three Star Example
  • 17. The importance of Linked Data • Provide a core set of data that applications can build on – stable references for “things”, • e.g., http://dbpedia.org/resource/Kolkata/ – many many relationships that applications may reuse – a “nucleus” for a larger, semantically enabled Web!
  • 18. Linked Data Platform (LDP) • Define an HTTP/RESTful based infrastructure to publish, read, write, or modify linked data – typical usage: data intensive application in a browser, application integration using shared data… • The infrastructure should be easy to implement and install – provides an “entry point” for Linked Data applications! • The work is nearing completion
  • 19. RDF with HTML: RDFa • By adding some “meta” information, the same source can be reused – typical example: your personal information, like address, should be readable for humans and processable by machines • Some solutions have emerged: – add extra statements in microdata or RDFa that can be converted to RDF • microdata can be used for a (useful) subset of RDF • RDFa is, essentially, a complete serialization of RDF
  • 20. schema.org • Schema.org is a cooperation of search engines (Bing, Google, Yahoo!, and Yandex) • It is a large vocabulary that they all understand • The terms are extracted from HTML5+microdata or HTML5+RDFa – the various partners use it for different purposes – it can be used by anyone outside of the search world!
  • 22. Some things to remember when you publish data • Publish your data first, do user interfaces later! – the “raw data” can become useful on its own right and others may use it – you can add your added value later by providing nice user access • If possible, publish your data in RDF but if you cannot, others may help you in conversions – trust the community… • Add links to other data. “Just” publishing isn’t enough…
  • 23. Some things to remember when you publish data (2) • Think about persistence and versioning – others may depend on the data you publish… • Be thoughtful about the URIs you choose • Try to avoid reinventing the wheel when choosing vocabularies
  • 24. Some things to remember when you publish data (3) • Document your data, i.e., provide metadata – there are vocabularies to do this • • • • Data Catalog Vocabulary (DCAT) Vocabulary of Interlinked Datasets (VoID) DCTERMS vocabularies for licensing (Open Data Commons, government licenses) – this area is still very much in development…
  • 25. Agenda • Data is changing our lives • W3C’s work on data integration • Expanding scope of W3C’s data activities
  • 26. New work underway • CSV on the Web • Data on the Web Best Practices • Vocabulary management
  • 27. What we are hearing • CSV is everywhere – can be huge data sets, not easily readable in a spreadsheet or Google refine – meaning of data not in machine-readable form – data is not necessarily used for web-scale integration but rather immediate usage • Metadata is essential • Conversion is an issue • European Commission Study on business models for Linked Open Government Data (BM4LOGD)
  • 28. Linked Data Benefits (BM4LOD) • Flexible data integration – Streamlined internal processes – Where working relationships already exist, much easier to share – Linking reference collections; discovery of new relationships • Increase in data quality – More use of data internally brings errors to light – Use of open standards increases quality of system • New services • Cost reduction – Increased efficiency – Increase in data usage due to LOD enrichment
  • 29. CSV on the Web • How W3C can help – metadata vocabulary to describe CSV data (structure, reference to access rights, annotations, etc.) – metadata discovery (e.g., part of an HTTP header, special rows and columns, packaging formats…) – mapping content to RDF, JSON, XML
  • 30. Best practices • Document best practices for the data publishers – URI design, management of persistence, versioning – business models – use of core metadata vocabularies (provenance, access control, ownership) • Specific vocabularies – quality, application descriptions, …
  • 31. Vocabulary management: challenge • Interoperable vocabularies are key for (meta)data • At the moment, it is a fairly chaotic world… – many, possibly overlapping vocabularies – difficult to locate the one that is needed – vocabularies may not be properly managed, maintained, versioned, provided persistence…
  • 32. Vocabulary management: how W3C can help • Provide a space where – communities can develop vocabularies (through, e.g., CGs, possibly WGs) – host vocabularies at W3C if requested – annotate vocabularies with a proper set of metadata terms – establish a vocabulary directory • The exact structure is still being discussed
  • 33. Summary • Data-driven smart apps are one of the major growth engines for the worldwide software market. • We need to meet developers where they are. • 5 Star Benefits of LOD – – – – – Greater efficiency, better provision of the task Greater flexibility leads to lower costs for future projects New services, new connections, new discoveries Improved navigation within and between datasets Others can build apps based on your data
  • 34. Available specifications: Primers, Guides` • Primers: – RDF Primer – OWL Guide – SKOS Primer – GRDDL Primer – RDFa Primer • The W3C Semantic Web Activity Wiki has links to all the specifications
  • 35. These slides are in the Web at http://www.w3.org/2014/Talks /0219-NISO-RRS with thanks to Ivan Herman, W3C and Phil Archer, W3C