SlideShare ist ein Scribd-Unternehmen logo
1 von 47
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 1
Prof. Dr. Christian Bizer
Evolving the Web into a global
Database
- Advances and Applications -
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 2
Data and Web Science Group @ University of Mannheim
− 3 Professors
• Prof. Dr. Heiner Stuckenschmidt
• Prof. Dr. Simone Paolo Ponzetto
• Prof. Dr. Christian Bizer
− 5 Post-Doctoral Researchers
− 18 PhD Students
− http://dws.informatik.uni-mannheim.de/
1. Research methods for integrating and mining large
amounts of heterogeneous information within
enterprise and open Web contexts.
2. Empirically analyze the content and structure of the
Web.
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 3
Querying the classic Web
DB
HTML
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 4
Long standing Goal
Query the Web like
a single, global
database
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 5
2001 Article: The Semantic Web
Envisions three things to happen:
1.people publish data in structured form
in addition to HTML pages on the Web
2.common vocabularies / ontologies are used
to represent data
3.people implement cool applications that
do smart things with the available data.
Tim Berners-Lee, James Hendler and Ora Lassila:
The Semantic Web. Scientific American, May 2001.
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 6
13 Years Later
There are 1.3 million publications about the
Semantic Web on Google Scholar, but
1. Do people publish structured data on the Web?
2. Do people agree on common vocabularies / ontologies?
3. What are the cool applications that exploit the data?
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 7
Outline
1. Linked Data
2. HTML-embedded Data
3. The Role of Wikipedia
4. Conclusions
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 8
1. Linked Data
B C
RDF
RDF
link
A D E
RDF
links
RDF
links
RDF
links
RDF
RDF
RDF
RDF
RDF RDF
RDF
RDF
RDF
• by using RDF to publish structured data on the Web
• by setting links between data items within different data sources.
Set of best practices for publishing structured data on
the Web in the form of a single global data graph.
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 9
Global Identifiers and Links as Integration Hints
 publishing Identity Links on the Web
 publishing Vocabulary Links on the Web
<http://www4.wiwiss.fu-berlin.de/is-group/resource/persons/Person4>
owl:sameAs
<http://dblp.l3s.de/d2r/resource/authors/Christian_Bizer> .
<http://xmlns.com/foaf/0.1/Person>
owl:equivalentClass
<http://dbpedia.org/ontology/Person> .
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 10
Effort Distribution between Publisher and Consumer
Publishers or third
parties provides
identity/vocabulary links
Consumer mines missing
identity/vocabulary links
Effort
Distribution
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 11
W3C Linking Open Data Project
− Grassroots community effort started in 2007 to
• publish existing open license datasets as Linked Data on the Web
• interlink things between different data sources
• maintain a data set catalog on the CKAN DataHub
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 12
LOD Datasets on the Web: September 2011
295 data sets
31,6 billion RDF triples
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 13
Newer statistics
− LODstats (University of Leipzig, 2014): 928 data sets
− LDspider Crawl (University of Mannheim, 2013): 850 data sets
Distribution by Topical Domain (September 2011)
Domain Data Sets Triples Percent RDF Links Percent
Media 25 1,841,852,061 5.82 % 50,440,705 10.01 %
Geographic 31 6,145,532,484 19.43 % 35,812,328 7.11 %
Government 49 13,315,009,400 42.09 % 19,343,519 3.84 %
Library 87 2,950,720,693 9.33 % 139,925,218 27.76 %
Cross-domain 41 4,184,635,715 13.23 % 63,183,065 12.54 %
Life sciences 41 3,036,336,004 9.60 % 191,844,090 38.06 %
User content 20 134,127,413 0.42 % 3,449,143 0.68 %
SUM 295 31,634,213,770 503,998,829
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 14
Ontological Agreement
− Out of the 295 data sources
• 102 (35%) only use terms from common vocabularies
• 105 (36%) only use proprietary terms
• 88 (29%) mix common and proprietary terms
− Popular Vocabularies
Vocabulary # Data Sets
Dublin Core 92 (31.19 %)
FOAF 81 (27.46 %)
SKOS 58 (19.66 %)
GEO 25 (8.47 %)
AKT 17 (5.76 %)
BIBO 14 (4.75 %)
Music Ontology 13 (4.41 %)
SIOC 10 (3.39 %)
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 15
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 16
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 17
Uptake in the Government Domain
− Goals
• Make data available to the public and other government agencies
• Ease data integration by providing unique identifiers and by setting links
− W3C Government Linked Data Working Group
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 18
Uptake in the Libraries Community
− Institutions publishing Linked Data
• Library of Congress (subject headings)
• German National Library (PND dataset and subject headings)
• Swedish National Library (Libris - catalog)
• Hungarian National Library (OPAC and Digital Library)
• Europeana Digital Library (4 million artifacts)
− Goals:
1. Integrate Library Catalogs on global scale
2. Interconnect resources between repositories
(by topic, by location, by historical period, by ...)
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 19
Industry Uptake
− Media Industry
• British Broadcasting Corporation
• New York Times
• Wolters Kluwer
• Springer
− Pharmaceutical Industry
• Johnson & Johnson
• Eli Lilly and Company
• AstraZeneca
− IT Industry
• IBM
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 20
2. HTML-embedded Data
Microformats
Microdata
RDFa
Websites semantically markup the
content of their HTML pages using:
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 21
Schema.org
− ask site owners since 2011
to markup data to enrich search results.
− 200+ Types: Event, Organization, Person, Place, Product, Review
− Encoding: Microdata or alternatively RDFa
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 22
Open Graph Protocol
− allows site owners to determine how
entities are described in Facebook
− relies on RDFa for encoding data in HTML pages
− available since April 2010
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 23
The Common Crawl
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 24
The WebDataCommons.org Project
− extracts all Microformat, Microdata, RDFa data from the
Common Crawl
− analyzes and provides the extracted data for download
− Two extractions runs
• 2009/2010 CC Corpus: 2.5 billion HTML pages  5.1 billion RDF triples
• 2012 CC Corpus: 3.0 billion HTML pages  7.3 billion RDF triples
− used 100 machines on Amazon EC2
• approx. 3000 machine/hours
(spot instances of type c1.xlarge)  550 EUR
− Jointed effort in the context of the EU project
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 25
Websites providing Structured Data (2012)
2.29 million websites (PLDs) out of 40 million
provide Microformat, Microdata or RDFa data
(5.65%)
369 million of the 3 billion pages contain
Microformat, Microdata or RDFa data (12.3%)
Google, October 2013:
15% of all websites provide structured data.
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 26
Breakdown by Encoding Format and Site Popularity
Grouped by Alexa Website Popularity Rank
(rank based on amount of page views)
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 27
− Top Classes:
− Topics
• CMS and Blog
metadata
• Product data
• Ratings/Reviews
• Company listings
RDFa Topics (CC 2012)
og = Facebook‘s Open
Graph Protocol
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 28
− Top Classes:
− Topics
• CMS and Blog
metadata
• Navigational
metadata
• Products and offers
• Business listings
• Ratings
• Places
• Events
Microdata Topics (CC 2012)
schema = Schema.org
datavoc = Google‘s
Rich Snippet Vocabulary
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 29
Class / Property Distribution
 A small set of
classes / properties
is used.
 Strong focus on
Schema.org and
Facebook vocabularies
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 30
Looking Deeper into the E-Commerce Data
Microdata
(2012)
Example Names:
• AppleMacBook Air MC968/A 11.6-Inch Laptop
• Apple MacBook Air 11-in, Intel Core i5 1.60GHz, 64 GB, Lion 10.7
Example Description:
• Faster Flash Storage with 64 GB Solid State Drive and USB 3.0 …
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 31
Usage of Schema.org Data @ Google
Rich snippets
within
search results
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 32
Usage of Open Graph Protocol Data @ Facebook
− allows site owners to determine how
entities are described in Facebook
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 33
Valuable Resource for Comparison Shopping Sites
− We analyzed 1.9 million product offers from 9200 shops
− We trained classifier for 9 product categories on product descriptions
from Amazon.
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 34
Identity Resolution for Electronic Products
− We trained parser for product descriptions on offers for electronic
products from Amazon.
− We used Silk framework for identity resolution.
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 35
Linked Data vs. HTML-embeded Data
LOD Cloud Microdata, Microformats, RDFa
< 1000 sources millions of sources
covers wider range of specific topics
focused on search engines and
Facebook
contains more complex
data structures
very simple and shallow
data structures
partial ontology agreement strong ontology agreement
data integration eased by RDF links
data integration requires NLP
techniques
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 36
Title
Description
Cross
Language
Links
Geo-
Coordinates
Images
Infoboxes
3. The Role of Wikipedia
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 37
Extracting Knowledge from Wikipedia
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 38
The DBpedia Knowledge Base - Version 3.9
− describes 4.00 million things, out of which
3.22 million are classified in a consistent ontology
using 529 classes and 2217 different properties
• 832,000 persons
• 639,000 places
• 209,000 organizations
• 116,000 music albums
− Altogether 2.46 billion pieces of information (RDF triples)
• 24,000,000 links to external web pages
• 27,200,000 external links into other RDF datasets
− DBpedia Internationalization
• provide data from 119 Wikipedia language editions for download
• 24 popular languages we provide cleaned infobox data
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 39
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 40
1. Answer fact queries: “birthdate michael douglas”
2. Compare things: „compare eiffel tower vs empire state building”
Applications of Google‘s Knowledge Graph
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 41
Applications of Google‘s Knowledge Graph
3. Enrich search results with infoboxes and lists
• Infoboxes might also contain Microdata/RDFa data, e.g. concerts of a band
3. Rank of search results using new Hummingbird ranking algorithm
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 42
DBpedia as Background Knowledge for Data Mining
− Which factors correlate with unemployment in France?
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 43
Unemployment Table with additional Attributes
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 44
RapidMiner Linked Open Data Extension
Allows you to
1. link local table to DBpedia and other LOD data sources
2. extend local table with additional attributes
3. mine extended tables using all Rapidminer features
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 45
Finding Correlations
− Use additional attributes to find interesting correlations
− Example correlation for unemployment in France:
• African islands, Islands in the Indian Ocean,
Outermost regions of the EU (positive)
• Population growth (positive)
• Disposable income (negative)
• Energy consumption (negative)
• Fast food restaurants (positive)
• Hospital beds/inhabitants
(negative)
• Police stations (positive)
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 46
Conclusions
1. Publication of Structured Data
• There is more data than most people from research and industry like
• Exciting test-bed for data profiling and data integration techniques
• Not even the research focus has moved to the integration of 1000s of
sources
1. Ontological Agreement
• Application-pull helps (Google et al.)
• But data source-specific attributes are also important
(e.g. in life science or statistics domain)
1. Applications
• the big players are moving
• there is a lot of experimentation in industry, but many efforts are still in the
prototype stage
Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 47
Thanks
Mannheim Linked Open Data Meetup
−Free beer and food
−Talks by Springer, Wolters Kluwer, Semantic Web
Company, LOD2 project participants, DWS group
members
−Sunday, February 23, 2014, 6:30 PM
−http://www.meetup.com/OpenKnowledgeFoundation/M
annheim-DE/1092882/
Advertisement

Weitere ähnliche Inhalte

Was ist angesagt?

Linking Spatial Data From The Web
Linking Spatial Data From The WebLinking Spatial Data From The Web
Linking Spatial Data From The Webchristianhbecker
 
Opening Up The BL's Metadata
Opening Up The BL's MetadataOpening Up The BL's Metadata
Opening Up The BL's Metadatanw13
 
TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...Peter Löwe
 
Online Learning and Linked Data: An Introduction
Online Learning and Linked Data: An IntroductionOnline Learning and Linked Data: An Introduction
Online Learning and Linked Data: An IntroductionEUCLID project
 
New approaches for data acquisition at europeana iiif, sitemaps and schema.o...
New approaches for data acquisition at europeana  iiif, sitemaps and schema.o...New approaches for data acquisition at europeana  iiif, sitemaps and schema.o...
New approaches for data acquisition at europeana iiif, sitemaps and schema.o...Nuno Freire
 
Building a Collection of the Historical UK Web for scholarly use
Building a Collection of the Historical UK Web for scholarly useBuilding a Collection of the Historical UK Web for scholarly use
Building a Collection of the Historical UK Web for scholarly useALISS
 
Delivering Linked Data Training to Data Science Practitioners
Delivering Linked Data Training to Data Science PractitionersDelivering Linked Data Training to Data Science Practitioners
Delivering Linked Data Training to Data Science PractitionersMarin Dimitrov
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesEDINA, University of Edinburgh
 
LAK Dataset and Challenge (April 2013)
LAK Dataset and Challenge (April 2013)LAK Dataset and Challenge (April 2013)
LAK Dataset and Challenge (April 2013)Stefan Dietze
 
Decentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic WebDecentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic Webhala Skaf
 
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations EDINA, University of Edinburgh
 
Open Educational Data - Datasets and APIs (Athens Green Hackathon 2012)
Open Educational Data - Datasets and APIs (Athens Green Hackathon 2012)Open Educational Data - Datasets and APIs (Athens Green Hackathon 2012)
Open Educational Data - Datasets and APIs (Athens Green Hackathon 2012)Stefan Dietze
 
Maximising (Re)Usability of Resources using Linked Data
Maximising (Re)Usability of Resources using Linked DataMaximising (Re)Usability of Resources using Linked Data
Maximising (Re)Usability of Resources using Linked DataAsuncion Gomez-Perez
 
Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014EDINA, University of Edinburgh
 
Demo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataDemo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataStefan Dietze
 

Was ist angesagt? (20)

Linking Spatial Data From The Web
Linking Spatial Data From The WebLinking Spatial Data From The Web
Linking Spatial Data From The Web
 
Opening Up The BL's Metadata
Opening Up The BL's MetadataOpening Up The BL's Metadata
Opening Up The BL's Metadata
 
TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...
 
AddressingHistory - Tracing the Past
AddressingHistory - Tracing the PastAddressingHistory - Tracing the Past
AddressingHistory - Tracing the Past
 
Online Learning and Linked Data: An Introduction
Online Learning and Linked Data: An IntroductionOnline Learning and Linked Data: An Introduction
Online Learning and Linked Data: An Introduction
 
3e Studiedag Webarchivering - Promise
3e Studiedag Webarchivering - Promise3e Studiedag Webarchivering - Promise
3e Studiedag Webarchivering - Promise
 
New approaches for data acquisition at europeana iiif, sitemaps and schema.o...
New approaches for data acquisition at europeana  iiif, sitemaps and schema.o...New approaches for data acquisition at europeana  iiif, sitemaps and schema.o...
New approaches for data acquisition at europeana iiif, sitemaps and schema.o...
 
Building a Collection of the Historical UK Web for scholarly use
Building a Collection of the Historical UK Web for scholarly useBuilding a Collection of the Historical UK Web for scholarly use
Building a Collection of the Historical UK Web for scholarly use
 
Delivering Linked Data Training to Data Science Practitioners
Delivering Linked Data Training to Data Science PractitionersDelivering Linked Data Training to Data Science Practitioners
Delivering Linked Data Training to Data Science Practitioners
 
Edinburgh DataShare - DSpace for Data
Edinburgh DataShare - DSpace for DataEdinburgh DataShare - DSpace for Data
Edinburgh DataShare - DSpace for Data
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data services
 
Ukla uksg 2013_final
Ukla uksg 2013_finalUkla uksg 2013_final
Ukla uksg 2013_final
 
LAK Dataset and Challenge (April 2013)
LAK Dataset and Challenge (April 2013)LAK Dataset and Challenge (April 2013)
LAK Dataset and Challenge (April 2013)
 
Decentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic WebDecentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic Web
 
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations
 
Open Educational Data - Datasets and APIs (Athens Green Hackathon 2012)
Open Educational Data - Datasets and APIs (Athens Green Hackathon 2012)Open Educational Data - Datasets and APIs (Athens Green Hackathon 2012)
Open Educational Data - Datasets and APIs (Athens Green Hackathon 2012)
 
Maximising (Re)Usability of Resources using Linked Data
Maximising (Re)Usability of Resources using Linked DataMaximising (Re)Usability of Resources using Linked Data
Maximising (Re)Usability of Resources using Linked Data
 
Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014Geospatial metadata and spatial data workshop: 19 June 2014
Geospatial metadata and spatial data workshop: 19 June 2014
 
Demo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataDemo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open Data
 
Geoservices Activities at EDINA
Geoservices Activities at EDINAGeoservices Activities at EDINA
Geoservices Activities at EDINA
 

Andere mochten auch

database slide
database slidedatabase slide
database slideAkhil Nair
 
J2EE - Practical Overview
J2EE - Practical OverviewJ2EE - Practical Overview
J2EE - Practical OverviewSvetlin Nakov
 
Public Cloud Platforms for .NET Developers
Public Cloud Platforms for .NET DevelopersPublic Cloud Platforms for .NET Developers
Public Cloud Platforms for .NET DevelopersSvetlin Nakov
 
Writing High Quality Code in C#
Writing High Quality Code in C#Writing High Quality Code in C#
Writing High Quality Code in C#Svetlin Nakov
 
Nakov - .NET Framework Overview - English
Nakov - .NET Framework Overview - EnglishNakov - .NET Framework Overview - English
Nakov - .NET Framework Overview - EnglishSvetlin Nakov
 
RMAN best practices for RAC
RMAN best practices for RACRMAN best practices for RAC
RMAN best practices for RACSyed Hussain
 
14. Defining Classes
14. Defining Classes14. Defining Classes
14. Defining ClassesIntro C# Book
 
Payroll system
Payroll systemPayroll system
Payroll systemWirat Mojo
 
0. Course Introduction
0. Course Introduction0. Course Introduction
0. Course IntroductionIntro C# Book
 
Oracle database 12c new features
Oracle database 12c new featuresOracle database 12c new features
Oracle database 12c new featuresJakkrapat S.
 
Thesis about Computerized Payroll System for Barangay Hall, Dita
Thesis about Computerized Payroll System for Barangay Hall, DitaThesis about Computerized Payroll System for Barangay Hall, Dita
Thesis about Computerized Payroll System for Barangay Hall, DitaAcel Carl David O, Dolindo
 
HBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL database
HBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL databaseHBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL database
HBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL databaseEdureka!
 
Computerized payroll system
Computerized payroll systemComputerized payroll system
Computerized payroll systemFrancis Genavia
 
Payroll Management System
Payroll Management SystemPayroll Management System
Payroll Management SystemDheeraj Jha
 
Database Design Slide 1
Database Design Slide 1Database Design Slide 1
Database Design Slide 1ahfiki
 
Computer science project work
Computer science project workComputer science project work
Computer science project workrahulchamp2345
 
Payroll Management System SRS
Payroll Management System SRSPayroll Management System SRS
Payroll Management System SRSShubham Modi
 

Andere mochten auch (20)

database slide 1
database slide 1database slide 1
database slide 1
 
database slide
database slidedatabase slide
database slide
 
J2EE - Practical Overview
J2EE - Practical OverviewJ2EE - Practical Overview
J2EE - Practical Overview
 
Database slide
Database slideDatabase slide
Database slide
 
Public Cloud Platforms for .NET Developers
Public Cloud Platforms for .NET DevelopersPublic Cloud Platforms for .NET Developers
Public Cloud Platforms for .NET Developers
 
Writing High Quality Code in C#
Writing High Quality Code in C#Writing High Quality Code in C#
Writing High Quality Code in C#
 
Nakov - .NET Framework Overview - English
Nakov - .NET Framework Overview - EnglishNakov - .NET Framework Overview - English
Nakov - .NET Framework Overview - English
 
RMAN best practices for RAC
RMAN best practices for RACRMAN best practices for RAC
RMAN best practices for RAC
 
14. Defining Classes
14. Defining Classes14. Defining Classes
14. Defining Classes
 
Payroll system
Payroll systemPayroll system
Payroll system
 
0. Course Introduction
0. Course Introduction0. Course Introduction
0. Course Introduction
 
Oracle database 12c new features
Oracle database 12c new featuresOracle database 12c new features
Oracle database 12c new features
 
Thesis about Computerized Payroll System for Barangay Hall, Dita
Thesis about Computerized Payroll System for Barangay Hall, DitaThesis about Computerized Payroll System for Barangay Hall, Dita
Thesis about Computerized Payroll System for Barangay Hall, Dita
 
HBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL database
HBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL databaseHBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL database
HBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL database
 
Computerized payroll system
Computerized payroll systemComputerized payroll system
Computerized payroll system
 
Payroll Management System
Payroll Management SystemPayroll Management System
Payroll Management System
 
Database Design Slide 1
Database Design Slide 1Database Design Slide 1
Database Design Slide 1
 
Computer science project work
Computer science project workComputer science project work
Computer science project work
 
Payroll management
Payroll   managementPayroll   management
Payroll management
 
Payroll Management System SRS
Payroll Management System SRSPayroll Management System SRS
Payroll Management System SRS
 

Ähnlich wie Evolving the Web into a Global Database - Advances and Applications.

Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Anja Jentzsch
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?Anna Fensel
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Peter Löwe
 
Linked Data - Overview and Potentials
Linked Data - Overview and PotentialsLinked Data - Overview and Potentials
Linked Data - Overview and PotentialsTobias Bürger
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data GenerationFilip Radulovic
 
Deployment of rd_fa_microdata_microformats_on_the_web
Deployment of rd_fa_microdata_microformats_on_the_webDeployment of rd_fa_microdata_microformats_on_the_web
Deployment of rd_fa_microdata_microformats_on_the_webSTIinnsbruck
 
Linked Open Data for Cultural Heritage
Linked Open Data for Cultural HeritageLinked Open Data for Cultural Heritage
Linked Open Data for Cultural HeritageNoreen Whysel
 
Exploring the Use of Linked Data to Bridge State and Federal Archives
Exploring the Use of Linked Data to Bridge State and Federal ArchivesExploring the Use of Linked Data to Bridge State and Federal Archives
Exploring the Use of Linked Data to Bridge State and Federal ArchivesJon Voss
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
 
The Climate Tagger - a tagging and recommender service for climate informatio...
The Climate Tagger - a tagging and recommender service for climate informatio...The Climate Tagger - a tagging and recommender service for climate informatio...
The Climate Tagger - a tagging and recommender service for climate informatio...Martin Kaltenböck
 
DBpedia Tutorial - Feb 2015, Dublin
DBpedia Tutorial - Feb 2015, DublinDBpedia Tutorial - Feb 2015, Dublin
DBpedia Tutorial - Feb 2015, Dublinm_ackermann
 
Using DBpedia for Thesaurus Management and Linked Open Data Integration
Using DBpedia for Thesaurus Management and Linked Open Data IntegrationUsing DBpedia for Thesaurus Management and Linked Open Data Integration
Using DBpedia for Thesaurus Management and Linked Open Data IntegrationMartin Kaltenböck
 
Towards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA projectTowards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA projectPRELIDA Project
 

Ähnlich wie Evolving the Web into a Global Database - Advances and Applications. (20)

Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?
 
Here Comes Everything
Here Comes EverythingHere Comes Everything
Here Comes Everything
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
 
Linked Data - Overview and Potentials
Linked Data - Overview and PotentialsLinked Data - Overview and Potentials
Linked Data - Overview and Potentials
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data Generation
 
The Semantic Data Web, Sören Auer, University of Leipzig
The Semantic Data Web, Sören Auer, University of LeipzigThe Semantic Data Web, Sören Auer, University of Leipzig
The Semantic Data Web, Sören Auer, University of Leipzig
 
KEDL DBpedia 2019
KEDL DBpedia  2019KEDL DBpedia  2019
KEDL DBpedia 2019
 
Deployment of rd_fa_microdata_microformats_on_the_web
Deployment of rd_fa_microdata_microformats_on_the_webDeployment of rd_fa_microdata_microformats_on_the_web
Deployment of rd_fa_microdata_microformats_on_the_web
 
Linked Open Data for Cultural Heritage
Linked Open Data for Cultural HeritageLinked Open Data for Cultural Heritage
Linked Open Data for Cultural Heritage
 
Exploring the Use of Linked Data to Bridge State and Federal Archives
Exploring the Use of Linked Data to Bridge State and Federal ArchivesExploring the Use of Linked Data to Bridge State and Federal Archives
Exploring the Use of Linked Data to Bridge State and Federal Archives
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
 
The Climate Tagger - a tagging and recommender service for climate informatio...
The Climate Tagger - a tagging and recommender service for climate informatio...The Climate Tagger - a tagging and recommender service for climate informatio...
The Climate Tagger - a tagging and recommender service for climate informatio...
 
CAEPIA 2011
CAEPIA 2011CAEPIA 2011
CAEPIA 2011
 
Linked Data
Linked DataLinked Data
Linked Data
 
LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz
 
DBpedia Tutorial - Feb 2015, Dublin
DBpedia Tutorial - Feb 2015, DublinDBpedia Tutorial - Feb 2015, Dublin
DBpedia Tutorial - Feb 2015, Dublin
 
Using DBpedia for Thesaurus Management and Linked Open Data Integration
Using DBpedia for Thesaurus Management and Linked Open Data IntegrationUsing DBpedia for Thesaurus Management and Linked Open Data Integration
Using DBpedia for Thesaurus Management and Linked Open Data Integration
 
Towards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA projectTowards long-term preservation of linked data - the PRELIDA project
Towards long-term preservation of linked data - the PRELIDA project
 

Mehr von Chris Bizer

GPT4 versus BERT: Which Foundation Model is better for Web Data Integration?
GPT4 versus BERT: Which Foundation Model is better for Web Data Integration?GPT4 versus BERT: Which Foundation Model is better for Web Data Integration?
GPT4 versus BERT: Which Foundation Model is better for Web Data Integration?Chris Bizer
 
Integrating Product Data from the Semantic Web using Deep Learning Techniques
Integrating Product Data from the Semantic Web using Deep Learning TechniquesIntegrating Product Data from the Semantic Web using Deep Learning Techniques
Integrating Product Data from the Semantic Web using Deep Learning TechniquesChris Bizer
 
Using the Semantic Web as Training Data for Product Matching
Using the Semantic Web as Training Data for Product MatchingUsing the Semantic Web as Training Data for Product Matching
Using the Semantic Web as Training Data for Product MatchingChris Bizer
 
JIST2019 Keynote: Completing Knowledge Graphs using Data from the Open Web
JIST2019 Keynote: Completing Knowledge Graphs using Data from the Open WebJIST2019 Keynote: Completing Knowledge Graphs using Data from the Open Web
JIST2019 Keynote: Completing Knowledge Graphs using Data from the Open WebChris Bizer
 
Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the...
Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the...Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the...
Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the...Chris Bizer
 
Is the Semantic Web what we expected? Adoption Patterns and Content-driven Ch...
Is the Semantic Web what we expected? Adoption Patterns and Content-driven Ch...Is the Semantic Web what we expected? Adoption Patterns and Content-driven Ch...
Is the Semantic Web what we expected? Adoption Patterns and Content-driven Ch...Chris Bizer
 
Data Search and Search Joins (Universität Heidelberg 2015)
Data Search and Search Joins (Universität Heidelberg 2015)Data Search and Search Joins (Universität Heidelberg 2015)
Data Search and Search Joins (Universität Heidelberg 2015)Chris Bizer
 
Exploring the Application Potential of Relational Web Tables
Exploring the Application Potential of Relational Web TablesExploring the Application Potential of Relational Web Tables
Exploring the Application Potential of Relational Web TablesChris Bizer
 
Evolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and ApplicationsEvolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and ApplicationsChris Bizer
 
Extending Tables with Data from over a Million Websites
 Extending Tables with Data from over a Million Websites Extending Tables with Data from over a Million Websites
Extending Tables with Data from over a Million WebsitesChris Bizer
 
Adoption of the Linked Data Best Practices in Different Topical Domains
Adoption of the Linked Data Best Practices in Different Topical DomainsAdoption of the Linked Data Best Practices in Different Topical Domains
Adoption of the Linked Data Best Practices in Different Topical DomainsChris Bizer
 
Graph Structure in the Web - Revisited. WWW2014 Web Science Track
Graph Structure in the Web - Revisited. WWW2014 Web Science TrackGraph Structure in the Web - Revisited. WWW2014 Web Science Track
Graph Structure in the Web - Revisited. WWW2014 Web Science TrackChris Bizer
 
Search Joins with the Web - ICDT2014 Invited Lecture
Search Joins with the Web - ICDT2014 Invited LectureSearch Joins with the Web - ICDT2014 Invited Lecture
Search Joins with the Web - ICDT2014 Invited LectureChris Bizer
 
DBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of DataDBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of DataChris Bizer
 

Mehr von Chris Bizer (14)

GPT4 versus BERT: Which Foundation Model is better for Web Data Integration?
GPT4 versus BERT: Which Foundation Model is better for Web Data Integration?GPT4 versus BERT: Which Foundation Model is better for Web Data Integration?
GPT4 versus BERT: Which Foundation Model is better for Web Data Integration?
 
Integrating Product Data from the Semantic Web using Deep Learning Techniques
Integrating Product Data from the Semantic Web using Deep Learning TechniquesIntegrating Product Data from the Semantic Web using Deep Learning Techniques
Integrating Product Data from the Semantic Web using Deep Learning Techniques
 
Using the Semantic Web as Training Data for Product Matching
Using the Semantic Web as Training Data for Product MatchingUsing the Semantic Web as Training Data for Product Matching
Using the Semantic Web as Training Data for Product Matching
 
JIST2019 Keynote: Completing Knowledge Graphs using Data from the Open Web
JIST2019 Keynote: Completing Knowledge Graphs using Data from the Open WebJIST2019 Keynote: Completing Knowledge Graphs using Data from the Open Web
JIST2019 Keynote: Completing Knowledge Graphs using Data from the Open Web
 
Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the...
Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the...Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the...
Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the...
 
Is the Semantic Web what we expected? Adoption Patterns and Content-driven Ch...
Is the Semantic Web what we expected? Adoption Patterns and Content-driven Ch...Is the Semantic Web what we expected? Adoption Patterns and Content-driven Ch...
Is the Semantic Web what we expected? Adoption Patterns and Content-driven Ch...
 
Data Search and Search Joins (Universität Heidelberg 2015)
Data Search and Search Joins (Universität Heidelberg 2015)Data Search and Search Joins (Universität Heidelberg 2015)
Data Search and Search Joins (Universität Heidelberg 2015)
 
Exploring the Application Potential of Relational Web Tables
Exploring the Application Potential of Relational Web TablesExploring the Application Potential of Relational Web Tables
Exploring the Application Potential of Relational Web Tables
 
Evolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and ApplicationsEvolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and Applications
 
Extending Tables with Data from over a Million Websites
 Extending Tables with Data from over a Million Websites Extending Tables with Data from over a Million Websites
Extending Tables with Data from over a Million Websites
 
Adoption of the Linked Data Best Practices in Different Topical Domains
Adoption of the Linked Data Best Practices in Different Topical DomainsAdoption of the Linked Data Best Practices in Different Topical Domains
Adoption of the Linked Data Best Practices in Different Topical Domains
 
Graph Structure in the Web - Revisited. WWW2014 Web Science Track
Graph Structure in the Web - Revisited. WWW2014 Web Science TrackGraph Structure in the Web - Revisited. WWW2014 Web Science Track
Graph Structure in the Web - Revisited. WWW2014 Web Science Track
 
Search Joins with the Web - ICDT2014 Invited Lecture
Search Joins with the Web - ICDT2014 Invited LectureSearch Joins with the Web - ICDT2014 Invited Lecture
Search Joins with the Web - ICDT2014 Invited Lecture
 
DBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of DataDBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of Data
 

Kürzlich hochgeladen

Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...SofiyaSharma5
 
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya Shirtrahman018755
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Sheetaleventcompany
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxellan12
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersMoving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersDamian Radcliffe
 
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebGDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebJames Anderson
 
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$kojalkojal131
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...tanu pandey
 
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGAPNIC
 
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Servicesexy call girls service in goa
 
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Call Girls in Nagpur High Profile
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024APNIC
 
SEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization SpecialistSEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization SpecialistKHM Anwar
 

Kürzlich hochgeladen (20)

Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
 
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya ShirtChallengers I Told Ya Shirt
Challengers I Told Ya ShirtChallengers I Told Ya Shirt
 
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
 
Call Girls In South Ex 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In South Ex 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICECall Girls In South Ex 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In South Ex 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
 
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersMoving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
 
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark WebGDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
GDG Cloud Southlake 32: Kyle Hettinger: Demystifying the Dark Web
 
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
 
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Pratap Nagar Delhi 💯Call Us 🔝8264348440🔝
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOG
 
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Ashram Chowk Delhi 💯Call Us 🔝8264348440🔝
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
 
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
 
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
DDoS In Oceania and the Pacific, presented by Dave Phelan at NZNOG 2024
 
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
SEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization SpecialistSEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization Specialist
 

Evolving the Web into a Global Database - Advances and Applications.

  • 1. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 1 Prof. Dr. Christian Bizer Evolving the Web into a global Database - Advances and Applications -
  • 2. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 2 Data and Web Science Group @ University of Mannheim − 3 Professors • Prof. Dr. Heiner Stuckenschmidt • Prof. Dr. Simone Paolo Ponzetto • Prof. Dr. Christian Bizer − 5 Post-Doctoral Researchers − 18 PhD Students − http://dws.informatik.uni-mannheim.de/ 1. Research methods for integrating and mining large amounts of heterogeneous information within enterprise and open Web contexts. 2. Empirically analyze the content and structure of the Web.
  • 3. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 3 Querying the classic Web DB HTML
  • 4. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 4 Long standing Goal Query the Web like a single, global database
  • 5. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 5 2001 Article: The Semantic Web Envisions three things to happen: 1.people publish data in structured form in addition to HTML pages on the Web 2.common vocabularies / ontologies are used to represent data 3.people implement cool applications that do smart things with the available data. Tim Berners-Lee, James Hendler and Ora Lassila: The Semantic Web. Scientific American, May 2001.
  • 6. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 6 13 Years Later There are 1.3 million publications about the Semantic Web on Google Scholar, but 1. Do people publish structured data on the Web? 2. Do people agree on common vocabularies / ontologies? 3. What are the cool applications that exploit the data?
  • 7. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 7 Outline 1. Linked Data 2. HTML-embedded Data 3. The Role of Wikipedia 4. Conclusions
  • 8. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 8 1. Linked Data B C RDF RDF link A D E RDF links RDF links RDF links RDF RDF RDF RDF RDF RDF RDF RDF RDF • by using RDF to publish structured data on the Web • by setting links between data items within different data sources. Set of best practices for publishing structured data on the Web in the form of a single global data graph.
  • 9. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 9 Global Identifiers and Links as Integration Hints  publishing Identity Links on the Web  publishing Vocabulary Links on the Web <http://www4.wiwiss.fu-berlin.de/is-group/resource/persons/Person4> owl:sameAs <http://dblp.l3s.de/d2r/resource/authors/Christian_Bizer> . <http://xmlns.com/foaf/0.1/Person> owl:equivalentClass <http://dbpedia.org/ontology/Person> .
  • 10. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 10 Effort Distribution between Publisher and Consumer Publishers or third parties provides identity/vocabulary links Consumer mines missing identity/vocabulary links Effort Distribution
  • 11. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 11 W3C Linking Open Data Project − Grassroots community effort started in 2007 to • publish existing open license datasets as Linked Data on the Web • interlink things between different data sources • maintain a data set catalog on the CKAN DataHub
  • 12. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 12 LOD Datasets on the Web: September 2011 295 data sets 31,6 billion RDF triples
  • 13. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 13 Newer statistics − LODstats (University of Leipzig, 2014): 928 data sets − LDspider Crawl (University of Mannheim, 2013): 850 data sets Distribution by Topical Domain (September 2011) Domain Data Sets Triples Percent RDF Links Percent Media 25 1,841,852,061 5.82 % 50,440,705 10.01 % Geographic 31 6,145,532,484 19.43 % 35,812,328 7.11 % Government 49 13,315,009,400 42.09 % 19,343,519 3.84 % Library 87 2,950,720,693 9.33 % 139,925,218 27.76 % Cross-domain 41 4,184,635,715 13.23 % 63,183,065 12.54 % Life sciences 41 3,036,336,004 9.60 % 191,844,090 38.06 % User content 20 134,127,413 0.42 % 3,449,143 0.68 % SUM 295 31,634,213,770 503,998,829
  • 14. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 14 Ontological Agreement − Out of the 295 data sources • 102 (35%) only use terms from common vocabularies • 105 (36%) only use proprietary terms • 88 (29%) mix common and proprietary terms − Popular Vocabularies Vocabulary # Data Sets Dublin Core 92 (31.19 %) FOAF 81 (27.46 %) SKOS 58 (19.66 %) GEO 25 (8.47 %) AKT 17 (5.76 %) BIBO 14 (4.75 %) Music Ontology 13 (4.41 %) SIOC 10 (3.39 %)
  • 15. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 15
  • 16. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 16
  • 17. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 17 Uptake in the Government Domain − Goals • Make data available to the public and other government agencies • Ease data integration by providing unique identifiers and by setting links − W3C Government Linked Data Working Group
  • 18. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 18 Uptake in the Libraries Community − Institutions publishing Linked Data • Library of Congress (subject headings) • German National Library (PND dataset and subject headings) • Swedish National Library (Libris - catalog) • Hungarian National Library (OPAC and Digital Library) • Europeana Digital Library (4 million artifacts) − Goals: 1. Integrate Library Catalogs on global scale 2. Interconnect resources between repositories (by topic, by location, by historical period, by ...)
  • 19. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 19 Industry Uptake − Media Industry • British Broadcasting Corporation • New York Times • Wolters Kluwer • Springer − Pharmaceutical Industry • Johnson & Johnson • Eli Lilly and Company • AstraZeneca − IT Industry • IBM
  • 20. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 20 2. HTML-embedded Data Microformats Microdata RDFa Websites semantically markup the content of their HTML pages using:
  • 21. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 21 Schema.org − ask site owners since 2011 to markup data to enrich search results. − 200+ Types: Event, Organization, Person, Place, Product, Review − Encoding: Microdata or alternatively RDFa
  • 22. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 22 Open Graph Protocol − allows site owners to determine how entities are described in Facebook − relies on RDFa for encoding data in HTML pages − available since April 2010
  • 23. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 23 The Common Crawl
  • 24. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 24 The WebDataCommons.org Project − extracts all Microformat, Microdata, RDFa data from the Common Crawl − analyzes and provides the extracted data for download − Two extractions runs • 2009/2010 CC Corpus: 2.5 billion HTML pages  5.1 billion RDF triples • 2012 CC Corpus: 3.0 billion HTML pages  7.3 billion RDF triples − used 100 machines on Amazon EC2 • approx. 3000 machine/hours (spot instances of type c1.xlarge)  550 EUR − Jointed effort in the context of the EU project
  • 25. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 25 Websites providing Structured Data (2012) 2.29 million websites (PLDs) out of 40 million provide Microformat, Microdata or RDFa data (5.65%) 369 million of the 3 billion pages contain Microformat, Microdata or RDFa data (12.3%) Google, October 2013: 15% of all websites provide structured data.
  • 26. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 26 Breakdown by Encoding Format and Site Popularity Grouped by Alexa Website Popularity Rank (rank based on amount of page views)
  • 27. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 27 − Top Classes: − Topics • CMS and Blog metadata • Product data • Ratings/Reviews • Company listings RDFa Topics (CC 2012) og = Facebook‘s Open Graph Protocol
  • 28. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 28 − Top Classes: − Topics • CMS and Blog metadata • Navigational metadata • Products and offers • Business listings • Ratings • Places • Events Microdata Topics (CC 2012) schema = Schema.org datavoc = Google‘s Rich Snippet Vocabulary
  • 29. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 29 Class / Property Distribution  A small set of classes / properties is used.  Strong focus on Schema.org and Facebook vocabularies
  • 30. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 30 Looking Deeper into the E-Commerce Data Microdata (2012) Example Names: • AppleMacBook Air MC968/A 11.6-Inch Laptop • Apple MacBook Air 11-in, Intel Core i5 1.60GHz, 64 GB, Lion 10.7 Example Description: • Faster Flash Storage with 64 GB Solid State Drive and USB 3.0 …
  • 31. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 31 Usage of Schema.org Data @ Google Rich snippets within search results
  • 32. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 32 Usage of Open Graph Protocol Data @ Facebook − allows site owners to determine how entities are described in Facebook
  • 33. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 33 Valuable Resource for Comparison Shopping Sites − We analyzed 1.9 million product offers from 9200 shops − We trained classifier for 9 product categories on product descriptions from Amazon.
  • 34. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 34 Identity Resolution for Electronic Products − We trained parser for product descriptions on offers for electronic products from Amazon. − We used Silk framework for identity resolution.
  • 35. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 35 Linked Data vs. HTML-embeded Data LOD Cloud Microdata, Microformats, RDFa < 1000 sources millions of sources covers wider range of specific topics focused on search engines and Facebook contains more complex data structures very simple and shallow data structures partial ontology agreement strong ontology agreement data integration eased by RDF links data integration requires NLP techniques
  • 36. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 36 Title Description Cross Language Links Geo- Coordinates Images Infoboxes 3. The Role of Wikipedia
  • 37. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 37 Extracting Knowledge from Wikipedia
  • 38. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 38 The DBpedia Knowledge Base - Version 3.9 − describes 4.00 million things, out of which 3.22 million are classified in a consistent ontology using 529 classes and 2217 different properties • 832,000 persons • 639,000 places • 209,000 organizations • 116,000 music albums − Altogether 2.46 billion pieces of information (RDF triples) • 24,000,000 links to external web pages • 27,200,000 external links into other RDF datasets − DBpedia Internationalization • provide data from 119 Wikipedia language editions for download • 24 popular languages we provide cleaned infobox data
  • 39. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 39
  • 40. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 40 1. Answer fact queries: “birthdate michael douglas” 2. Compare things: „compare eiffel tower vs empire state building” Applications of Google‘s Knowledge Graph
  • 41. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 41 Applications of Google‘s Knowledge Graph 3. Enrich search results with infoboxes and lists • Infoboxes might also contain Microdata/RDFa data, e.g. concerts of a band 3. Rank of search results using new Hummingbird ranking algorithm
  • 42. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 42 DBpedia as Background Knowledge for Data Mining − Which factors correlate with unemployment in France?
  • 43. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 43 Unemployment Table with additional Attributes
  • 44. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 44 RapidMiner Linked Open Data Extension Allows you to 1. link local table to DBpedia and other LOD data sources 2. extend local table with additional attributes 3. mine extended tables using all Rapidminer features
  • 45. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 45 Finding Correlations − Use additional attributes to find interesting correlations − Example correlation for unemployment in France: • African islands, Islands in the Indian Ocean, Outermost regions of the EU (positive) • Population growth (positive) • Disposable income (negative) • Energy consumption (negative) • Fast food restaurants (positive) • Hospital beds/inhabitants (negative) • Police stations (positive)
  • 46. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 46 Conclusions 1. Publication of Structured Data • There is more data than most people from research and industry like • Exciting test-bed for data profiling and data integration techniques • Not even the research focus has moved to the integration of 1000s of sources 1. Ontological Agreement • Application-pull helps (Google et al.) • But data source-specific attributes are also important (e.g. in life science or statistics domain) 1. Applications • the big players are moving • there is a lot of experimentation in industry, but many efforts are still in the prototype stage
  • 47. Bizer: Evolving the Web into a global Database – Advances and Applications, 30.1.2014 Slide 47 Thanks Mannheim Linked Open Data Meetup −Free beer and food −Talks by Springer, Wolters Kluwer, Semantic Web Company, LOD2 project participants, DWS group members −Sunday, February 23, 2014, 6:30 PM −http://www.meetup.com/OpenKnowledgeFoundation/M annheim-DE/1092882/ Advertisement

Hinweis der Redaktion

  1. Hier noch applikationen dazu machen