SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Downloaden Sie, um offline zu lesen
Scientific Data Publishing
Tobias Kuhn
http://www.tkuhn.org
@txkuhn
ETH Zurich
COSS Team Retreat
Santorini, Greece
27 June 2015
Increasing Importance of Scientific Data
London Underground staff sorting 4M used tickets to analyse line use in 1939
Image from: http://www.telegraph.co.uk/travel/picturegalleries/9791007/
The-history-of-the-Tube-in-pictures-150-years-of-London-Underground.html?frame=2447159
Tobias Kuhn, ETH Zurich Scientific Data Publishing 2 / 24
Problem:
Replication and Re-Use of Research Results
Exemplary Situation: Sue publishes a script that should allow
everybody to replicate her scientific analysis:
# Download data:
wget http://some-third-party.org/dataset/1.4
# Analyze data:
...
Problems:
• What if the resource becomes unavailable at this location?
• What if the third party silently changes that version of the
dataset?
• What if the web site gets hacked and the data manipulated?
Tobias Kuhn, ETH Zurich Scientific Data Publishing 3 / 24
Data Publishing, Archiving, and Re-Use
Scientific datasets become increasingly important, and these data are
increasingly produced and consumed directly by software.
Published data should therefore be:
• Verifiable (Is this really the data I am looking for?)
• Immutable (Can I be sure that it hasn’t been modified?)
• Permanent (Will it be available in 1, 5, 20 years from now?)
• Reliable (Can it be efficiently retrieved whenever needed?)
• Granular (Can I refer to individual data entries?)
• Semantic (Can it be automatically interpreted?)
• Linked (Does it use established identifiers and ontologies?)
• Trustworthy (Can I trust the source?)
Tobias Kuhn, ETH Zurich Scientific Data Publishing 4 / 24
Nanopublications:
Provenance-Aware Semantic Publishing
(based on RDF)
assertion
provenance
publication info
nanopublication
http://nanopub.org / @nanopub org
Tobias Kuhn, ETH Zurich Scientific Data Publishing 5 / 24
Identifiability Problem of URLs (Web Links)
http://some-third-party.org/dataset/1.4
?
Given a URL for a digital artifact, there is no reliable standard
procedure of checking whether a retrieved file really represents the
correct and original state of that artifact.
Solution: Identifiers that include (iterative) cryptographic hash
values (as applied, for example, by Git and Bitcoin)
Tobias Kuhn, ETH Zurich Scientific Data Publishing 6 / 24
Cryptographic Hash Values
A cryptographic hash value is a short random-looking sequence of
bytes calculated on a given input:
This is some text. ⇒ hRUv0M
The same input always leads to exactly the same value:
This is some text. ⇒ hRUv0M
Even a minimally modified input leads to a completely different value:
This is xome text. ⇒ sCtYbf
The input is not reconstructible from the hash value (in practice):
This is some text. hRUv0M
Given an input and a matching hash value, we can therefore be
perfectly sure that it was exactly that input that led to the hash.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 7 / 24
Iterative Hashing
Hash values can be used as identifiers in an iterative fashion:
This is some text. ⇒ hRUv0M
This text is based on hRUv0M. ⇒ LwGqwX
This depends on LwGqwX. ⇒ civRbq
From a single identifier (such as civRbq), the entire reference tree
can be verified:
This is some text. ⇒ hRUv0M
This text is based on hRUv0M. ⇒ LwGqwX
This depends on LwGqwX. ⇒ civRbq
And any modification can be noticed:
This is xome text. ⇒  hRUv0M
This text is based on hRUv0M. ⇒ LwGqwX
This depends on LwGqwX. ⇒ civRbq
Tobias Kuhn, ETH Zurich Scientific Data Publishing 8 / 24
Trusty URIs: Cryptographic Hash Values for
Verifiable and Immutable Web Identifiers
Basic idea: Use of cryptographic hash values together with URIs
(˜= URLs) as identifiers for digital artifacts such as nanopublications.
Requirements:
• To allow for the verification of entire reference trees, the hash
should be part of the reference (i.e. the URI)
• Format-independent hash for different kinds of content
• To allow for meta-data, digital artifacts should be allowed to
contain self-references (i.e. their own URI)
.trighttp://example.org/r1. RA 5AbXdpz5DcaYXCh9l3eI9ruBosiL5XDU3rxBbBaUO70
Kuhn, Dumontier. Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linked Data. ESWC 2014.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 9 / 24
Identifiers with Cryptographic Hash Values
Nanopublications with Trusty URIs are ...
Verifiable
+
Immutable
+
Permanent
.trighttp://example.org/r1. RA 5AbXdpz5DcaYXCh9l3eI9ruBosiL5XDU3rxBbBaUO70
http://trustyuri.net/
Kuhn, Dumontier. Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linked Data. ESWC 2014.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 10 / 24
Extended Range of Verifiability Through
Iterative Hashing
http://...RAcbjcRI...
http://...RAQozo2w...
http://...RABMq4Wc...
http://...RAcbjcRI...
http://...RAQozo2w...
http://.../resource23
http://.../resource23
...
http://...RAUx3Pqu...
http://.../resource55
http://...RABMq4Wc...
http://.../resource55
http://...RARz0AX-...
...
http://...RAUx3Pqu...
...
http://...RARz0AX...
...
range of
verifiability
http://trustyuri.net
Kuhn, Dumontier. Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linked Data. ESWC 2014.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 11 / 24
Defining Datasets with Nanopublication Indexes
(which are themselves Nanopublications)
appends
has sub-index
has
element
(a) (b)
(c) (f)
(d) (e)
Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of
Data. ISWC 2015.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 12 / 24
A Multi-Layer Architecture for
Reliable Scientific Data Publishing?
User Interfaces
Services:
Finding, querying, filtering,
and aggregating data
Tobias Kuhn, ETH Zurich Scientific Data Publishing 13 / 24
A Multi-Layer Architecture for
Reliable Scientific Data Publishing?
User Interfaces
Decentralized
Data Publishing
Network
hypotheses
Services:
Finding, querying, filtering,
and aggregating data
facts
measurements
opinions
meta-data
assessments
annotations
...
Tobias Kuhn, ETH Zurich Scientific Data Publishing 14 / 24
A Multi-Layer Architecture for
Reliable Scientific Data Publishing?
User Interfaces
Decentralized
Data Publishing
Network
hypotheses
Services:
Finding, querying, filtering,
and aggregating data
Science Bots:
Automated Tasks
facts
measurements
opinions
meta-data
assessments
annotations
...
Tobias Kuhn, ETH Zurich Scientific Data Publishing 15 / 24
Decentralized and Reliable Publishing with a
Nanopublication Server Network
Nanopublications
with Trusty URIs
Publication
Retrieval
Propagation /
Archiving
http://npmonitor.inn.ac
Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of
Data. ISWC 2015.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 16 / 24
Decentralized — Open — Real-time
• No a central authority: Everybody can set up a server and join
the network
• No restrictions on publication: Everybody can upload
nanopublications
• No delay between submission and publication: Nanopublications
are made public immediately
• No updates: If a nanopublication is modified, that makes it a
new nanopublication (enforced by trusty URIs)
• No queries: Only simple identifier-based lookup
Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of
Data. ISWC 2015.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 17 / 24
Using Nanopublication Datasets
Once published in the network, nanopublication indexes can be cited:
[7] Nanopubs converted from OpenBEL’s Small and Large Corpus 20131211.
Nanopublication index, 4 March 2014,
http://np.inn.ac/RAR5dwELYLKGSfrOclnWhjOj-2nGZN_8BW1JjxwFZINHw
Researchers can then fetch and reuse the data in a reliable and
prefectly reproducible manner:
# Download data:
np get -c RAR5dwELYLKGSfrOclnWhjOj-2nGZN 8BW1JjxwFZINHw
# Analyze data:
...
Existing data can be recombined in new indexes; and researchers can
unambiguously refer to the used datasets for new results:
this: prov:wasDerivedFrom nps:RAR5dwELYLKGSfrOclnWhjOj-2nGZN 8BW1Jjx
Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of
Data. ISWC 2015.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 18 / 24
Could these techniques and infrastructures allow us to
make a step forward in terms of automation in science?
S C I E N C E B O T S
Tobias Kuhn, ETH Zurich Scientific Data Publishing 19 / 24
Science Bots —
Scientists’ Little Helpers in the Future?
“Science bots” that autonomously publish results in their own name
could cover a wide variety of applications, for example:
nanopub
-lications
PubMed
abstracts
nanopub
-lications
sensor
data
nanopub
-lications
nanopub
-lications
text mining bot
inference bot
sensor bot
Kuhn. Science Bots: A Model for the Future of Scientific Computation? SAVE-SD, WWW 2015 Companion
Proceedings.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 20 / 24
Quality Control with Reputation
Mechanisms and Network Metrics?
Robust automatic calculation of reputation metrics in a decentralized
and open system:
is contributed by
Kuhn. Science Bots: A Model for the Future of Scientific Computation? SAVE-SD, WWW 2015 Companion
Proceedings.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 21 / 24
Quality Control with Reputation
Mechanisms and Network Metrics?
Robust automatic calculation of reputation metrics in a decentralized
and open system:
gives positive
assessment for
is contributed by
Kuhn. Science Bots: A Model for the Future of Scientific Computation? SAVE-SD, WWW 2015 Companion
Proceedings.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 22 / 24
Quality Control with Reputation
Mechanisms and Network Metrics?
Robust automatic calculation of reputation metrics in a decentralized
and open system:
gives positive
assessment for
is contributed by
Eigenvector
centrality (0-100)
77
100
71 1
0
85
4
0.0
0 0
50
50 0
0.4 0.4
0
Kuhn. Science Bots: A Model for the Future of Scientific Computation? SAVE-SD, WWW 2015 Companion
Proceedings.
Tobias Kuhn, ETH Zurich Scientific Data Publishing 23 / 24
Thank you for your attention!
Questions?
Further information:
• Trusty URIs: http://trustyuri.net
• Nanopublications: http://nanopub.org
• Nanopublication Server Network:
http://arxiv.org/abs/1411.2749
• Science Bots: http://arxiv.org/abs/1503.04374
Tobias Kuhn, ETH Zurich Scientific Data Publishing 24 / 24

Weitere ähnliche Inhalte

Ähnlich wie Scientific Data Publishing

A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...Tobias Kuhn
 
Science Bots: A Model for the Future of Scientific Computation?
Science Bots: A Model for the Future of Scientific Computation?Science Bots: A Model for the Future of Scientific Computation?
Science Bots: A Model for the Future of Scientific Computation?Tobias Kuhn
 
Nanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingNanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingTobias Kuhn
 
Semantic Publishing with Nanopublications
Semantic Publishing with Nanopublications Semantic Publishing with Nanopublications
Semantic Publishing with Nanopublications Tobias Kuhn
 
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...Tobias Kuhn
 
A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of DataA Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of DataTobias Kuhn
 
Data Publishing and Post-Publication Reviews
Data Publishing and Post-Publication ReviewsData Publishing and Post-Publication Reviews
Data Publishing and Post-Publication ReviewsTobias Kuhn
 
HathiTrust Research Center Secure Commons
HathiTrust Research Center Secure CommonsHathiTrust Research Center Secure Commons
HathiTrust Research Center Secure CommonsBeth Plale
 
Semantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsSemantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsTobias Kuhn
 
Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Beth Plale
 
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...Enrico Motta
 
APIS. Digitale biographische Blütenlese
APIS. Digitale biographische BlütenleseAPIS. Digitale biographische Blütenlese
APIS. Digitale biographische Blütenleseeveline wandl-vogt
 
Introduction to OpenAIRE services and the OpenAIRE Research Graph
Introduction to OpenAIRE services and the OpenAIRE Research GraphIntroduction to OpenAIRE services and the OpenAIRE Research Graph
Introduction to OpenAIRE services and the OpenAIRE Research GraphOpenAIRE
 
Enabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data InfrastructuresEnabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data InfrastructuresLIBER Europe
 
HathiTrust Research Center Data Capsule Overview 09.10.14
HathiTrust Research Center Data Capsule Overview 09.10.14HathiTrust Research Center Data Capsule Overview 09.10.14
HathiTrust Research Center Data Capsule Overview 09.10.14Robert H. McDonald
 
Data Mining on Web URL Using Base 64 Encoding to Generate Secure URN
Data Mining on Web URL Using Base 64 Encoding to Generate Secure URNData Mining on Web URL Using Base 64 Encoding to Generate Secure URN
Data Mining on Web URL Using Base 64 Encoding to Generate Secure URNIJMTST Journal
 
AI Sensors and Dashboards: Gauging and Monitoring the Inferences Capabilities...
AI Sensors and Dashboards: Gauging and Monitoring the Inferences Capabilities...AI Sensors and Dashboards: Gauging and Monitoring the Inferences Capabilities...
AI Sensors and Dashboards: Gauging and Monitoring the Inferences Capabilities...Huber Flores
 
How to improve the acceptance of AltMetrics
How to improve the acceptance of AltMetricsHow to improve the acceptance of AltMetrics
How to improve the acceptance of AltMetricsuherb
 
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...Hong-Linh Truong
 

Ähnlich wie Scientific Data Publishing (20)

A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
A Decentralized Network for Publishing Linked Data — Nanopublications, Trusty...
 
Science Bots: A Model for the Future of Scientific Computation?
Science Bots: A Model for the Future of Scientific Computation?Science Bots: A Model for the Future of Scientific Computation?
Science Bots: A Model for the Future of Scientific Computation?
 
Nanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingNanopublications and Decentralized Publishing
Nanopublications and Decentralized Publishing
 
Semantic Publishing with Nanopublications
Semantic Publishing with Nanopublications Semantic Publishing with Nanopublications
Semantic Publishing with Nanopublications
 
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...
Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linke...
 
A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of DataA Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data
 
Data Publishing and Post-Publication Reviews
Data Publishing and Post-Publication ReviewsData Publishing and Post-Publication Reviews
Data Publishing and Post-Publication Reviews
 
Nanopubs
NanopubsNanopubs
Nanopubs
 
HathiTrust Research Center Secure Commons
HathiTrust Research Center Secure CommonsHathiTrust Research Center Secure Commons
HathiTrust Research Center Secure Commons
 
Semantic Publishing and Nanopublications
Semantic Publishing and NanopublicationsSemantic Publishing and Nanopublications
Semantic Publishing and Nanopublications
 
Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014Plale HathiTrust El Colegio de Mexico May2014
Plale HathiTrust El Colegio de Mexico May2014
 
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
 
APIS. Digitale biographische Blütenlese
APIS. Digitale biographische BlütenleseAPIS. Digitale biographische Blütenlese
APIS. Digitale biographische Blütenlese
 
Introduction to OpenAIRE services and the OpenAIRE Research Graph
Introduction to OpenAIRE services and the OpenAIRE Research GraphIntroduction to OpenAIRE services and the OpenAIRE Research Graph
Introduction to OpenAIRE services and the OpenAIRE Research Graph
 
Enabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data InfrastructuresEnabling Data-Intensive Science Through Data Infrastructures
Enabling Data-Intensive Science Through Data Infrastructures
 
HathiTrust Research Center Data Capsule Overview 09.10.14
HathiTrust Research Center Data Capsule Overview 09.10.14HathiTrust Research Center Data Capsule Overview 09.10.14
HathiTrust Research Center Data Capsule Overview 09.10.14
 
Data Mining on Web URL Using Base 64 Encoding to Generate Secure URN
Data Mining on Web URL Using Base 64 Encoding to Generate Secure URNData Mining on Web URL Using Base 64 Encoding to Generate Secure URN
Data Mining on Web URL Using Base 64 Encoding to Generate Secure URN
 
AI Sensors and Dashboards: Gauging and Monitoring the Inferences Capabilities...
AI Sensors and Dashboards: Gauging and Monitoring the Inferences Capabilities...AI Sensors and Dashboards: Gauging and Monitoring the Inferences Capabilities...
AI Sensors and Dashboards: Gauging and Monitoring the Inferences Capabilities...
 
How to improve the acceptance of AltMetrics
How to improve the acceptance of AltMetricsHow to improve the acceptance of AltMetrics
How to improve the acceptance of AltMetrics
 
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
 

Mehr von Tobias Kuhn

Linked Data Publishing with Nanopublications
Linked Data Publishing with NanopublicationsLinked Data Publishing with Nanopublications
Linked Data Publishing with NanopublicationsTobias Kuhn
 
Genuine semantic publishing
Genuine semantic publishingGenuine semantic publishing
Genuine semantic publishingTobias Kuhn
 
The Controlled Natural Language of Randall Munroe’s Thing Explainer
The Controlled Natural Language of Randall Munroe’s Thing Explainer The Controlled Natural Language of Randall Munroe’s Thing Explainer
The Controlled Natural Language of Randall Munroe’s Thing Explainer Tobias Kuhn
 
nanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for Nanopublicationsnanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for NanopublicationsTobias Kuhn
 
Meme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation NetworksMeme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation NetworksTobias Kuhn
 
A Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural LanguageA Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural LanguageTobias Kuhn
 
Citation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureCitation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureTobias Kuhn
 
Automatische Übersetzung in einem multilingualen, semantischen Wiki
Automatische Übersetzung in einem multilingualen, semantischen WikiAutomatische Übersetzung in einem multilingualen, semantischen Wiki
Automatische Übersetzung in einem multilingualen, semantischen WikiTobias Kuhn
 
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...Tobias Kuhn
 
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...Tobias Kuhn
 
Improving Text Mining with Controlled Natural Language: A Case Study for Prot...
Improving Text Mining with Controlled Natural Language: A Case Study for Prot...Improving Text Mining with Controlled Natural Language: A Case Study for Prot...
Improving Text Mining with Controlled Natural Language: A Case Study for Prot...Tobias Kuhn
 
AceWiki: A Natural and Expressive Semantic Wiki
AceWiki: A Natural and Expressive Semantic WikiAceWiki: A Natural and Expressive Semantic Wiki
AceWiki: A Natural and Expressive Semantic WikiTobias Kuhn
 
AceWiki: Controlled English in a Semantic Wiki
AceWiki: Controlled English in a Semantic WikiAceWiki: Controlled English in a Semantic Wiki
AceWiki: Controlled English in a Semantic WikiTobias Kuhn
 
How to Evaluate Controlled Natural Languages
How to Evaluate Controlled Natural LanguagesHow to Evaluate Controlled Natural Languages
How to Evaluate Controlled Natural LanguagesTobias Kuhn
 
Wissensrepräsentation in kontrolliertem Englisch
Wissensrepräsentation in kontrolliertem EnglischWissensrepräsentation in kontrolliertem Englisch
Wissensrepräsentation in kontrolliertem EnglischTobias Kuhn
 
An Introduction to AceWiki
An Introduction to AceWikiAn Introduction to AceWiki
An Introduction to AceWikiTobias Kuhn
 
Codeco: A Grammar Notation for Controlled Natural Language in Predictive Editors
Codeco: A Grammar Notation for Controlled Natural Language in Predictive EditorsCodeco: A Grammar Notation for Controlled Natural Language in Predictive Editors
Codeco: A Grammar Notation for Controlled Natural Language in Predictive EditorsTobias Kuhn
 

Mehr von Tobias Kuhn (17)

Linked Data Publishing with Nanopublications
Linked Data Publishing with NanopublicationsLinked Data Publishing with Nanopublications
Linked Data Publishing with Nanopublications
 
Genuine semantic publishing
Genuine semantic publishingGenuine semantic publishing
Genuine semantic publishing
 
The Controlled Natural Language of Randall Munroe’s Thing Explainer
The Controlled Natural Language of Randall Munroe’s Thing Explainer The Controlled Natural Language of Randall Munroe’s Thing Explainer
The Controlled Natural Language of Randall Munroe’s Thing Explainer
 
nanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for Nanopublicationsnanopub-java: A Java Library for Nanopublications
nanopub-java: A Java Library for Nanopublications
 
Meme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation NetworksMeme Extraction from Corpora of Scientific Literature using Citation Networks
Meme Extraction from Corpora of Scientific Literature using Citation Networks
 
A Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural LanguageA Multilingual Semantic Wiki Based on Controlled Natural Language
A Multilingual Semantic Wiki Based on Controlled Natural Language
 
Citation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific LiteratureCitation Graph Analysis to Identify Memes in Scientific Literature
Citation Graph Analysis to Identify Memes in Scientific Literature
 
Automatische Übersetzung in einem multilingualen, semantischen Wiki
Automatische Übersetzung in einem multilingualen, semantischen WikiAutomatische Übersetzung in einem multilingualen, semantischen Wiki
Automatische Übersetzung in einem multilingualen, semantischen Wiki
 
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
 
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
 
Improving Text Mining with Controlled Natural Language: A Case Study for Prot...
Improving Text Mining with Controlled Natural Language: A Case Study for Prot...Improving Text Mining with Controlled Natural Language: A Case Study for Prot...
Improving Text Mining with Controlled Natural Language: A Case Study for Prot...
 
AceWiki: A Natural and Expressive Semantic Wiki
AceWiki: A Natural and Expressive Semantic WikiAceWiki: A Natural and Expressive Semantic Wiki
AceWiki: A Natural and Expressive Semantic Wiki
 
AceWiki: Controlled English in a Semantic Wiki
AceWiki: Controlled English in a Semantic WikiAceWiki: Controlled English in a Semantic Wiki
AceWiki: Controlled English in a Semantic Wiki
 
How to Evaluate Controlled Natural Languages
How to Evaluate Controlled Natural LanguagesHow to Evaluate Controlled Natural Languages
How to Evaluate Controlled Natural Languages
 
Wissensrepräsentation in kontrolliertem Englisch
Wissensrepräsentation in kontrolliertem EnglischWissensrepräsentation in kontrolliertem Englisch
Wissensrepräsentation in kontrolliertem Englisch
 
An Introduction to AceWiki
An Introduction to AceWikiAn Introduction to AceWiki
An Introduction to AceWiki
 
Codeco: A Grammar Notation for Controlled Natural Language in Predictive Editors
Codeco: A Grammar Notation for Controlled Natural Language in Predictive EditorsCodeco: A Grammar Notation for Controlled Natural Language in Predictive Editors
Codeco: A Grammar Notation for Controlled Natural Language in Predictive Editors
 

Kürzlich hochgeladen

Loudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptxLoudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptxpriyankatabhane
 
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11GelineAvendao
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPRPirithiRaju
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxtuking87
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Christina Parmionova
 
final waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterfinal waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterHanHyoKim
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfGABYFIORELAMALPARTID1
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxkumarsanjai28051
 
complex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfcomplex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfSubhamKumar3239
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxfarhanvvdk
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionJadeNovelo1
 
Explainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenariosExplainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenariosZachary Labe
 
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...Sérgio Sacani
 
whole genome sequencing new and its types including shortgun and clone by clone
whole genome sequencing new  and its types including shortgun and clone by clonewhole genome sequencing new  and its types including shortgun and clone by clone
whole genome sequencing new and its types including shortgun and clone by clonechaudhary charan shingh university
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests GlycosidesNandakishor Bhaurao Deshmukh
 
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPRPirithiRaju
 

Kürzlich hochgeladen (20)

Loudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptxLoudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptx
 
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
WEEK 4 PHYSICAL SCIENCE QUARTER 3 FOR G11
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
 
Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?Let’s Say Someone Did Drop the Bomb. Then What?
Let’s Say Someone Did Drop the Bomb. Then What?
 
final waves properties grade 7 - third quarter
final waves properties grade 7 - third quarterfinal waves properties grade 7 - third quarter
final waves properties grade 7 - third quarter
 
Interferons.pptx.
Interferons.pptx.Interferons.pptx.
Interferons.pptx.
 
PLASMODIUM. PPTX
PLASMODIUM. PPTXPLASMODIUM. PPTX
PLASMODIUM. PPTX
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptx
 
complex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdfcomplex analysis best book for solving questions.pdf
complex analysis best book for solving questions.pdf
 
Oxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptxOxo-Acids of Halogens and their Salts.pptx
Oxo-Acids of Halogens and their Salts.pptx
 
AZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTXAZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTX
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and Function
 
Explainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenariosExplainable AI for distinguishing future climate change scenarios
Explainable AI for distinguishing future climate change scenarios
 
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
 
whole genome sequencing new and its types including shortgun and clone by clone
whole genome sequencing new  and its types including shortgun and clone by clonewhole genome sequencing new  and its types including shortgun and clone by clone
whole genome sequencing new and its types including shortgun and clone by clone
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
 
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR
6.1 Pests of Groundnut_Binomics_Identification_Dr.UPR
 

Scientific Data Publishing

  • 1. Scientific Data Publishing Tobias Kuhn http://www.tkuhn.org @txkuhn ETH Zurich COSS Team Retreat Santorini, Greece 27 June 2015
  • 2. Increasing Importance of Scientific Data London Underground staff sorting 4M used tickets to analyse line use in 1939 Image from: http://www.telegraph.co.uk/travel/picturegalleries/9791007/ The-history-of-the-Tube-in-pictures-150-years-of-London-Underground.html?frame=2447159 Tobias Kuhn, ETH Zurich Scientific Data Publishing 2 / 24
  • 3. Problem: Replication and Re-Use of Research Results Exemplary Situation: Sue publishes a script that should allow everybody to replicate her scientific analysis: # Download data: wget http://some-third-party.org/dataset/1.4 # Analyze data: ... Problems: • What if the resource becomes unavailable at this location? • What if the third party silently changes that version of the dataset? • What if the web site gets hacked and the data manipulated? Tobias Kuhn, ETH Zurich Scientific Data Publishing 3 / 24
  • 4. Data Publishing, Archiving, and Re-Use Scientific datasets become increasingly important, and these data are increasingly produced and consumed directly by software. Published data should therefore be: • Verifiable (Is this really the data I am looking for?) • Immutable (Can I be sure that it hasn’t been modified?) • Permanent (Will it be available in 1, 5, 20 years from now?) • Reliable (Can it be efficiently retrieved whenever needed?) • Granular (Can I refer to individual data entries?) • Semantic (Can it be automatically interpreted?) • Linked (Does it use established identifiers and ontologies?) • Trustworthy (Can I trust the source?) Tobias Kuhn, ETH Zurich Scientific Data Publishing 4 / 24
  • 5. Nanopublications: Provenance-Aware Semantic Publishing (based on RDF) assertion provenance publication info nanopublication http://nanopub.org / @nanopub org Tobias Kuhn, ETH Zurich Scientific Data Publishing 5 / 24
  • 6. Identifiability Problem of URLs (Web Links) http://some-third-party.org/dataset/1.4 ? Given a URL for a digital artifact, there is no reliable standard procedure of checking whether a retrieved file really represents the correct and original state of that artifact. Solution: Identifiers that include (iterative) cryptographic hash values (as applied, for example, by Git and Bitcoin) Tobias Kuhn, ETH Zurich Scientific Data Publishing 6 / 24
  • 7. Cryptographic Hash Values A cryptographic hash value is a short random-looking sequence of bytes calculated on a given input: This is some text. ⇒ hRUv0M The same input always leads to exactly the same value: This is some text. ⇒ hRUv0M Even a minimally modified input leads to a completely different value: This is xome text. ⇒ sCtYbf The input is not reconstructible from the hash value (in practice): This is some text. hRUv0M Given an input and a matching hash value, we can therefore be perfectly sure that it was exactly that input that led to the hash. Tobias Kuhn, ETH Zurich Scientific Data Publishing 7 / 24
  • 8. Iterative Hashing Hash values can be used as identifiers in an iterative fashion: This is some text. ⇒ hRUv0M This text is based on hRUv0M. ⇒ LwGqwX This depends on LwGqwX. ⇒ civRbq From a single identifier (such as civRbq), the entire reference tree can be verified: This is some text. ⇒ hRUv0M This text is based on hRUv0M. ⇒ LwGqwX This depends on LwGqwX. ⇒ civRbq And any modification can be noticed: This is xome text. ⇒ hRUv0M This text is based on hRUv0M. ⇒ LwGqwX This depends on LwGqwX. ⇒ civRbq Tobias Kuhn, ETH Zurich Scientific Data Publishing 8 / 24
  • 9. Trusty URIs: Cryptographic Hash Values for Verifiable and Immutable Web Identifiers Basic idea: Use of cryptographic hash values together with URIs (˜= URLs) as identifiers for digital artifacts such as nanopublications. Requirements: • To allow for the verification of entire reference trees, the hash should be part of the reference (i.e. the URI) • Format-independent hash for different kinds of content • To allow for meta-data, digital artifacts should be allowed to contain self-references (i.e. their own URI) .trighttp://example.org/r1. RA 5AbXdpz5DcaYXCh9l3eI9ruBosiL5XDU3rxBbBaUO70 Kuhn, Dumontier. Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linked Data. ESWC 2014. Tobias Kuhn, ETH Zurich Scientific Data Publishing 9 / 24
  • 10. Identifiers with Cryptographic Hash Values Nanopublications with Trusty URIs are ... Verifiable + Immutable + Permanent .trighttp://example.org/r1. RA 5AbXdpz5DcaYXCh9l3eI9ruBosiL5XDU3rxBbBaUO70 http://trustyuri.net/ Kuhn, Dumontier. Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linked Data. ESWC 2014. Tobias Kuhn, ETH Zurich Scientific Data Publishing 10 / 24
  • 11. Extended Range of Verifiability Through Iterative Hashing http://...RAcbjcRI... http://...RAQozo2w... http://...RABMq4Wc... http://...RAcbjcRI... http://...RAQozo2w... http://.../resource23 http://.../resource23 ... http://...RAUx3Pqu... http://.../resource55 http://...RABMq4Wc... http://.../resource55 http://...RARz0AX-... ... http://...RAUx3Pqu... ... http://...RARz0AX... ... range of verifiability http://trustyuri.net Kuhn, Dumontier. Trusty URIs: Verifiable, Immutable, and Permanent Digital Artifacts for Linked Data. ESWC 2014. Tobias Kuhn, ETH Zurich Scientific Data Publishing 11 / 24
  • 12. Defining Datasets with Nanopublication Indexes (which are themselves Nanopublications) appends has sub-index has element (a) (b) (c) (f) (d) (e) Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of Data. ISWC 2015. Tobias Kuhn, ETH Zurich Scientific Data Publishing 12 / 24
  • 13. A Multi-Layer Architecture for Reliable Scientific Data Publishing? User Interfaces Services: Finding, querying, filtering, and aggregating data Tobias Kuhn, ETH Zurich Scientific Data Publishing 13 / 24
  • 14. A Multi-Layer Architecture for Reliable Scientific Data Publishing? User Interfaces Decentralized Data Publishing Network hypotheses Services: Finding, querying, filtering, and aggregating data facts measurements opinions meta-data assessments annotations ... Tobias Kuhn, ETH Zurich Scientific Data Publishing 14 / 24
  • 15. A Multi-Layer Architecture for Reliable Scientific Data Publishing? User Interfaces Decentralized Data Publishing Network hypotheses Services: Finding, querying, filtering, and aggregating data Science Bots: Automated Tasks facts measurements opinions meta-data assessments annotations ... Tobias Kuhn, ETH Zurich Scientific Data Publishing 15 / 24
  • 16. Decentralized and Reliable Publishing with a Nanopublication Server Network Nanopublications with Trusty URIs Publication Retrieval Propagation / Archiving http://npmonitor.inn.ac Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of Data. ISWC 2015. Tobias Kuhn, ETH Zurich Scientific Data Publishing 16 / 24
  • 17. Decentralized — Open — Real-time • No a central authority: Everybody can set up a server and join the network • No restrictions on publication: Everybody can upload nanopublications • No delay between submission and publication: Nanopublications are made public immediately • No updates: If a nanopublication is modified, that makes it a new nanopublication (enforced by trusty URIs) • No queries: Only simple identifier-based lookup Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of Data. ISWC 2015. Tobias Kuhn, ETH Zurich Scientific Data Publishing 17 / 24
  • 18. Using Nanopublication Datasets Once published in the network, nanopublication indexes can be cited: [7] Nanopubs converted from OpenBEL’s Small and Large Corpus 20131211. Nanopublication index, 4 March 2014, http://np.inn.ac/RAR5dwELYLKGSfrOclnWhjOj-2nGZN_8BW1JjxwFZINHw Researchers can then fetch and reuse the data in a reliable and prefectly reproducible manner: # Download data: np get -c RAR5dwELYLKGSfrOclnWhjOj-2nGZN 8BW1JjxwFZINHw # Analyze data: ... Existing data can be recombined in new indexes; and researchers can unambiguously refer to the used datasets for new results: this: prov:wasDerivedFrom nps:RAR5dwELYLKGSfrOclnWhjOj-2nGZN 8BW1Jjx Kuhn et al. Publishing without Publishers: a Decentralized Approach to Dissemination, Retrieval, and Archiving of Data. ISWC 2015. Tobias Kuhn, ETH Zurich Scientific Data Publishing 18 / 24
  • 19. Could these techniques and infrastructures allow us to make a step forward in terms of automation in science? S C I E N C E B O T S Tobias Kuhn, ETH Zurich Scientific Data Publishing 19 / 24
  • 20. Science Bots — Scientists’ Little Helpers in the Future? “Science bots” that autonomously publish results in their own name could cover a wide variety of applications, for example: nanopub -lications PubMed abstracts nanopub -lications sensor data nanopub -lications nanopub -lications text mining bot inference bot sensor bot Kuhn. Science Bots: A Model for the Future of Scientific Computation? SAVE-SD, WWW 2015 Companion Proceedings. Tobias Kuhn, ETH Zurich Scientific Data Publishing 20 / 24
  • 21. Quality Control with Reputation Mechanisms and Network Metrics? Robust automatic calculation of reputation metrics in a decentralized and open system: is contributed by Kuhn. Science Bots: A Model for the Future of Scientific Computation? SAVE-SD, WWW 2015 Companion Proceedings. Tobias Kuhn, ETH Zurich Scientific Data Publishing 21 / 24
  • 22. Quality Control with Reputation Mechanisms and Network Metrics? Robust automatic calculation of reputation metrics in a decentralized and open system: gives positive assessment for is contributed by Kuhn. Science Bots: A Model for the Future of Scientific Computation? SAVE-SD, WWW 2015 Companion Proceedings. Tobias Kuhn, ETH Zurich Scientific Data Publishing 22 / 24
  • 23. Quality Control with Reputation Mechanisms and Network Metrics? Robust automatic calculation of reputation metrics in a decentralized and open system: gives positive assessment for is contributed by Eigenvector centrality (0-100) 77 100 71 1 0 85 4 0.0 0 0 50 50 0 0.4 0.4 0 Kuhn. Science Bots: A Model for the Future of Scientific Computation? SAVE-SD, WWW 2015 Companion Proceedings. Tobias Kuhn, ETH Zurich Scientific Data Publishing 23 / 24
  • 24. Thank you for your attention! Questions? Further information: • Trusty URIs: http://trustyuri.net • Nanopublications: http://nanopub.org • Nanopublication Server Network: http://arxiv.org/abs/1411.2749 • Science Bots: http://arxiv.org/abs/1503.04374 Tobias Kuhn, ETH Zurich Scientific Data Publishing 24 / 24