2015-06-10 InFoLiS II – Making Data citations a reality
InFoLiS II
Making data citations a reality
Stockholm, 2015-06-10
Dominique Ritze, Konstantin Baierer
Current Situation
?
Searchability and Links
Reproducibility and Traceability
Comparability
Infolink: Detecting dataset patterns
1) Search for study term in text
2) Deduce pattern from context of term
3) Apply pattern to learn new study term
4) GOTO 1)
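The loop above can be sketched as a small bootstrapping routine. This is a minimal illustration, not the Infolink implementation: the function names, the fixed-width context window, the pattern representation (a literal left context plus one right-hand character), and the toy corpus are all assumptions made for this example.

```python
import re

def learn_patterns(texts, seeds, window=12):
    """Steps 1-2: find each known study term and keep its surrounding context."""
    patterns = set()
    for text in texts:
        for seed in seeds:
            for m in re.finditer(re.escape(seed), text):
                left = text[max(0, m.start() - window):m.start()]
                right = text[m.end():m.end() + 1]
                if left and right:
                    patterns.add((left, right))
    return patterns

def apply_patterns(texts, patterns):
    """Step 3: use the learned contexts to pick up new study terms."""
    found = set()
    for left, right in patterns:
        # Assumption: a study term is one capitalized token.
        regex = re.compile(re.escape(left) + r"([A-Z][\w-]*)" + re.escape(right))
        for text in texts:
            found.update(regex.findall(text))
    return found

def bootstrap(texts, seeds, max_rounds=5):
    """Step 4: repeat until no new study terms are learned."""
    known = set(seeds)
    for _ in range(max_rounds):
        new_terms = apply_patterns(texts, learn_patterns(texts, known)) - known
        if not new_terms:
            break
        known |= new_terms
    return known

# Toy corpus (invented for illustration).
texts = [
    "(Datenbasis: ALLBUS, 1980)",
    "(Datenbasis: SOEP, 2003)",
    "Quelle: SOEP; eigene Berechnung",
    "Quelle: Eurobarometer; 2007",
]
print(sorted(bootstrap(texts, {"ALLBUS"})))
# → ['ALLBUS', 'Eurobarometer', 'SOEP']
```

Starting from the single seed "ALLBUS", the first round learns the "Datenbasis: …," context and finds "SOEP"; the second round learns the "Quelle: …;" context from "SOEP" and finds "Eurobarometer".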
Reference Extraction
study1
study2
(Datenbasis: ALLBUS, SOEP, ZUMA-Standarddemografie, 1976–2002)
(Datenbasis: SOEP, Jugendliche der Befragungsjahre 2000 bis 2003)
(Datenbasis: ALLBUS, Eurobarometer 2007)
.*(Datenbasis: ,.*)
Link Generation
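Applying a learned pattern to the three example contexts could look like the sketch below. The pattern as transcribed has lost its study-name slot, so the regular expression here is an assumed reconstruction that captures whatever stands between "Datenbasis: " and the first comma; the name `extract_references` is hypothetical.

```python
import re

CONTEXTS = [
    "(Datenbasis: ALLBUS, SOEP, ZUMA-Standarddemografie, 1976–2002)",
    "(Datenbasis: SOEP, Jugendliche der Befragungsjahre 2000 bis 2003)",
    "(Datenbasis: ALLBUS, Eurobarometer 2007)",
]

# Assumed form of the learned pattern: the capture group marks the study-name slot.
PATTERN = re.compile(r"\(Datenbasis: ([^,)]+),")

def extract_references(text):
    """Return every study name the pattern captures in the given text."""
    return [m.group(1).strip() for m in PATTERN.finditer(text)]

for context in CONTEXTS:
    print(extract_references(context))
```

This deliberately captures only the first study in each list; picking up the later entries ("SOEP", "Eurobarometer 2007") would need further patterns or list handling.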
Infrastructure
Internal API
Text Extraction
Pattern Learning
Reference Extraction
Link Generation
File Storage
Public API
JSON-LD ↔ RDF
REST API
Simple HTTP API
Resource Storage
HTTP
(JSON)
Command Line
Indexing System
Linked Data Agent
HTTP (Turtle) (native)
Browser Plugin
HTTP (RDF/XML)
API Playground
HTTP
(JSON-LD)
HTTP
(JSON)
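The diagram names four serializations served over HTTP: JSON, JSON-LD, Turtle, and RDF/XML. One common way for a single endpoint to offer several formats is content negotiation on the `Accept` header. Whether the InFoLiS API works this way is an assumption, not something the slide states. The sketch below only shows the selection logic, with a hypothetical `negotiate` function and JSON as an assumed default.

```python
# Standard media types for the serializations named in the diagram.
FORMATS = {
    "application/json": "JSON",
    "application/ld+json": "JSON-LD",
    "text/turtle": "Turtle",
    "application/rdf+xml": "RDF/XML",
}

def negotiate(accept_header, default="application/json"):
    """Pick the supported media type with the highest q-value in an Accept header."""
    best_type, best_q = default, 0.0
    for part in accept_header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_type, q = fields[0].lower(), 1.0
        for param in fields[1:]:
            if param.startswith("q="):
                try:
                    q = float(param[2:])
                except ValueError:
                    q = 0.0  # treat a malformed q-value as "not acceptable"
        if media_type in FORMATS and q > best_q:
            best_type, best_q = media_type, q
    return best_type

print(FORMATS[negotiate("text/html, application/rdf+xml;q=0.9, application/ld+json;q=0.5")])
# → RDF/XML
```

Wildcards such as `*/*` are not handled and simply fall through to the default.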
Integration
Transformation
link1
link2
compatible format
Integration
Integration
OAI-PMH
Primo
Enrichments
DDI
DC
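As an illustration of the transformation step, the sketch below turns one publication-to-dataset link into a Dublin Core fragment using only the Python standard library. The wrapping `record` element, the function name `link_to_dc`, and the choice of `dc:relation` to carry the dataset identifier are assumptions made for the example; the slide does not specify the mapping.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"  # Dublin Core element set namespace
ET.register_namespace("dc", DC_NS)

def link_to_dc(publication_id, dataset_id):
    """Serialize one link as a small Dublin Core record (hypothetical layout)."""
    record = ET.Element("record")
    identifier = ET.SubElement(record, f"{{{DC_NS}}}identifier")
    identifier.text = publication_id
    relation = ET.SubElement(record, f"{{{DC_NS}}}relation")
    relation.text = dataset_id
    return ET.tostring(record, encoding="unicode")

print(link_to_dc("urn:nbn:de:0168-ssoar-206773", "10.5684/soep.v27.2"))
```

A record in this shape could then be exposed through an OAI-PMH endpoint or loaded as an enrichment, depending on what the target system accepts.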
Link Generation
reference linking
study1
study2
|urn:nbn:de:0168-ssoar-206773|Publication|URN|Sozio-oekonmisches Panel (SOEP)|SOEP|10.5684/soep.v27.2|Study|DOI|0.8|LitStudy automatic
link1
link2
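The pipe-delimited link record shown on this slide can be read into a typed structure as sketched below. The field names are guesses based on the values in the example (publication identifier, matched reference text, dataset identifier, a confidence of 0.8, and a provenance label); the slide does not name the columns.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Link:
    # All field names are hypothetical labels for the ten columns.
    source_id: str
    source_type: str
    source_id_type: str
    reference_text: str
    study_name: str
    target_id: str
    target_type: str
    target_id_type: str
    confidence: float
    provenance: str

def parse_link(line):
    """Split one pipe-delimited record into a Link."""
    fields = line.strip().strip("|").split("|")
    if len(fields) != 10:
        raise ValueError(f"expected 10 fields, got {len(fields)}")
    return Link(*fields[:8], float(fields[8]), fields[9])

RECORD = ("|urn:nbn:de:0168-ssoar-206773|Publication|URN|Sozio-oekonmisches Panel (SOEP)"
          "|SOEP|10.5684/soep.v27.2|Study|DOI|0.8|LitStudy automatic")

link = parse_link(RECORD)
print(link.source_id, "->", link.target_id, link.confidence)
```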
Challenge – Granularity
Solution:
– Build on DDI to describe parts and sets of research data
– Use contextual information for resolving
– Keep provenance
[Figure: the reference “ALLBUS 2000” surrounded by question marks]
|urn:nbn:de:0168-ssoar-206773|Publication|URN|Sozio-oekonmisches Panel (SOEP)|SOEP|10.5684/soep.v27.2|Study|DOI|0.8|LitStudy automatic
Challenge – Provenance
Solution:
– Retain configuration of algorithms (esp. Learning and Matching)
– Immutable Linked Data resources with resolvable URI
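One way to picture both points is a link record that cannot be changed after creation and whose URI is derived from its content, including the algorithm configuration that produced it. This is only a sketch under stated assumptions: the base URI, the class and function names, the algorithm label, and the hash-based identifier scheme are all hypothetical and not taken from the InFoLiS code base.

```python
import hashlib
import json
from dataclasses import asdict, dataclass

BASE_URI = "http://example.org/link/"  # hypothetical namespace

@dataclass(frozen=True)  # frozen: attributes cannot be reassigned after creation
class LinkRecord:
    publication: str
    dataset: str
    confidence: float
    algorithm: str
    config: tuple  # sorted (key, value) pairs of the algorithm configuration

    @property
    def uri(self):
        """Derive a stable URI from the full record content."""
        payload = json.dumps(asdict(self), sort_keys=True)
        return BASE_URI + hashlib.sha256(payload.encode("utf-8")).hexdigest()[:16]

def make_link(publication, dataset, confidence, algorithm, config):
    """Build a record, storing the configuration in a canonical order."""
    return LinkRecord(publication, dataset, confidence, algorithm,
                      tuple(sorted(config.items())))

a = make_link("urn:nbn:de:0168-ssoar-206773", "10.5684/soep.v27.2", 0.8,
              "pattern-learning", {"window": 12})
b = make_link("urn:nbn:de:0168-ssoar-206773", "10.5684/soep.v27.2", 0.8,
              "pattern-learning", {"window": 20})
print(a.uri != b.uri)  # a different configuration yields a different resource
```

Because the configuration is part of the hashed content, re-running the algorithm with changed settings creates a new resource instead of overwriting the old one.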
Challenge – Broaden scope
● Support more languages and fields
– English language
– Economic Research
● Subtle differences
– Punctuation
– Capitalization
– Footnotes / Endnotes
– Finding the right seeds
● Solution:
– Tweak algorithms and configurations
– Communicate results and repeat
Challenge – Integration
• Integration into other systems
• Keeping data up to date
• Provenance
Next steps (Q3 / Q4 2015)
● Stress test the API http://infolis.gesis.org
● Integrate full set of ICPSR documents
● Integrate data from partner institutions and companies
● Develop reliability-based bootstrapping algorithm further
● Test workflows with research prototype
http://www.bib.uni-mannheim.de/vufind/
● Develop browser plugins/JS libraries to make use of Infolink
Thank you for your attention!
Questions?
Keep in touch:
http://infolis.github.io
All Software is Open Source:
http://github.com/infolis

Infolis II @ ELAG2015
