SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Downloaden Sie, um offline zu lesen
Zaven Akopov (DESY -L-)
For the INSPIRE Collaboration
DESY Computing Seminar
Joint Project of CERN, DESY, Fermilab
and SLAC
SPIRES: wonderful system, largest HEP
database, best-curated content, but..old
engine (>30 years):
need a modern open-source multimedia digital
library
Unify SPIRES content with Invenio
platform
Invenio = Open source digital library
○ http://invenio-software.org
SPIRES + Invenio = InSpire
Invenio
Integrated digital library system
written largely in Python
MySQL database
modular built
Navigable collection tree
Documents organized in collections
Regular and virtual collection trees
Customizable portal-boxes for each collection
Powerful search engine
Specially designed indexes to provide fast search speed for
repositories of up to 2,000,000 records
Customizable simple and advanced search interfaces
Flexible metadata
Standard metadata format (MARC)
Handling articles, books, theses, photos, videos, museum
objects and more
User personalization
Baskets, e-mail notifications, comments, etc.
DESY participation
Input of Journal/Article Data
HEP Ontology (Keywords) Input
Hierarchy of HEP concepts based on
DESY HEP Thesaurus
DESY assigns keywords and
classification to HEP Articles since 1964
SPIRES/InSPIRE mirror website
Where are we?
First Beta site released April 2010
Production Beta released a week ago
http://inspirebeta.net
Live Now
Populated with SPIRES content daily
Additional features
Bugs are getting ironed out, but
already:
Figures/Plots extraction
Full-text search
More to come
Personal libraries, alerts
Claim my papers (with arXiv and ORCID
(Open Researcher and Contributor ID))
Submit theses and old non-arXiv
material
Attach non-text material
OCR of older materials
Even better feeds (with ADS, arXiv,
Publishers)
Automatic Disambiguation
Henning Weiler - PhD student@CERN
On 963 documents, 21 real authors
could be identified for the query
"Chen, G".
22 orphans remain
98% identified
User Accounts
Tied to academic affiliation
Ability to correct information and
claim papers
Corrections still vetted by staff
Add “corporate accounts” for
collaborations
Data - Soon
Partnership and interlinking with HEPData
HepData reloaded: reinventing the HEP data
archive.
Andy Buckley, Mike Whalley. Jun 2010.
e-Print: arXiv:1006.0517 [hep-ex]
http://hepdata.cedar.ac.uk/
HEPData+INSPIRE working with LHC and other
experiments to ease submission process and
interlinking
Move towards citation/tracking use – reputation…
Storage for other objects like ROOT, Mathematica,
etc.
Non-text material
Full-cycle of a publication
Up to now, we've captured product:
Papers
Considering Data
Currently, through DPHEP, opportunity to
build infrastructure for capturing the
process:
Internal Notes
Technical/Software Documentation
Logbooks
Wikis
Increasingly popular central place to
aggregate documentation
Users structure the data for us
Backups and 'dumps' are generally easy
to make
And usually in an easily digestible
format (like XML)
Tools
For MediaWiki, most of the essential
tools already exist.
Wikimedia Foundation (Wikipedia) is
interested in seeing what we do with them.
From discussions with them, they are
supportive of what we're trying to do
Nascent BaBar Wiki
MediaWiki Instance with:
162 content pages
201 total pages (talk, redirects, etc.)
22 registered users
Simple script can easily produce dumps.
Scenarios
Level 0 Service: Basic Preservation
Index and store wiki snapshot data as if it
were a scientific publication (with many
authors)
Level 1 Service: Readable Snapshots
Level 0 + read-only final version
respecting formatting, etc.
Level 2: Multiple Snapshots
Level 0 + Level 1 for each of multiple wiki
“release points”, with full(?) metadata
Linking with Papers
Publication/Drafting History: H1
Example
A publication history includes:
Set of preliminary results (typically, prepared
for/as conference reports), short papers with
associated figures.
Actual publication process which begins with a
pre-T0 report, which goes then through T0 talk
to First/Second/… draft.
Each draft stage has it’s set of answers
(comments by collaboration and answers to
them); typically a referee report
And a final version that goes to the journal.
Mock-Up
How does it work?
External Users can see the links from
Conference talks to final papers, but
nothing in between
Access control – must be registered and
validated (e-mail ping): already planned
“Corporate” accounts for collaboration to
update page
Individual access via connection with
collaboration…(Any paper? Current
membership? What about long-term?)
In development
Access
Main challenge: Access policies and their
technical implementation
Need input from collaborations to create policies.
One size does not fit all.
Easy – master access file maintained by coll.
But not long-term…
Medium – Computation based on author lists
(not always correct?)
Harder – Individual access lists depending on
date of object and date of access
OAIS (ISO standard) etc. can help us implement
these in line with archival best practices
Questions?
For more information on INSPIRE see
http://www.projecthepinspire.net
Just try it out!
http://inspirebeta.net

Weitere ähnliche Inhalte

Was ist angesagt?

The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to ReuseAnita de Waard
 
Enabling better science - Results and vision of the OpenAIRE infrastructure a...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...Enabling better science - Results and vision of the OpenAIRE infrastructure a...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...Paolo Manghi
 
A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...Leon Osinski
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13DataDryad
 
Riding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessRiding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessdatacite
 
Clipper, research data network
Clipper, research data networkClipper, research data network
Clipper, research data networkJisc RDM
 
The University of Edinburgh Research Data Management Service Suite
The University of Edinburgh Research Data Management Service SuiteThe University of Edinburgh Research Data Management Service Suite
The University of Edinburgh Research Data Management Service SuiteRobin Rice
 
The University of Edinburgh Research Data Management Service Suite
The University of Edinburgh Research Data Management Service SuiteThe University of Edinburgh Research Data Management Service Suite
The University of Edinburgh Research Data Management Service SuiteRobin Rice
 
6.15.17 DSpace-Cris Webinar Presentation Slides
6.15.17 DSpace-Cris Webinar Presentation Slides6.15.17 DSpace-Cris Webinar Presentation Slides
6.15.17 DSpace-Cris Webinar Presentation SlidesDuraSpace
 
Access the world’s research outputs through the CORE API
Access the world’s research outputs through the CORE API Access the world’s research outputs through the CORE API
Access the world’s research outputs through the CORE API Matteo Cancellieri
 
Dataverse for Journals
Dataverse for JournalsDataverse for Journals
Dataverse for JournalsMerce Crosas
 
Visualizing Co-authorship Networks for Actionable Insights: Action Design Res...
Visualizing Co-authorship Networks for Actionable Insights: Action Design Res...Visualizing Co-authorship Networks for Actionable Insights: Action Design Res...
Visualizing Co-authorship Networks for Actionable Insights: Action Design Res...Jukka Huhtamäki
 
ElN - repository integration at the University of Goettingen
ElN - repository integration at the University of GoettingenElN - repository integration at the University of Goettingen
ElN - repository integration at the University of Goettingenrmacneil88
 
Who will use the open data? Mark Humphries keynote
Who will use the open data? Mark Humphries keynoteWho will use the open data? Mark Humphries keynote
Who will use the open data? Mark Humphries keynoteJisc RDM
 
DataCite How To: Use the MDS
DataCite How To: Use the MDSDataCite How To: Use the MDS
DataCite How To: Use the MDSFrauke Ziedorn
 
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013Frauke Ziedorn
 

Was ist angesagt? (20)

The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to Reuse
 
Enabling better science - Results and vision of the OpenAIRE infrastructure a...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...Enabling better science - Results and vision of the OpenAIRE infrastructure a...
Enabling better science - Results and vision of the OpenAIRE infrastructure a...
 
A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
 
Riding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessRiding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information access
 
Jan Brase: Data and Libraries - the DataCite consortium
Jan Brase: Data and Libraries - the DataCite consortiumJan Brase: Data and Libraries - the DataCite consortium
Jan Brase: Data and Libraries - the DataCite consortium
 
Clipper, research data network
Clipper, research data networkClipper, research data network
Clipper, research data network
 
The University of Edinburgh Research Data Management Service Suite
The University of Edinburgh Research Data Management Service SuiteThe University of Edinburgh Research Data Management Service Suite
The University of Edinburgh Research Data Management Service Suite
 
The University of Edinburgh Research Data Management Service Suite
The University of Edinburgh Research Data Management Service SuiteThe University of Edinburgh Research Data Management Service Suite
The University of Edinburgh Research Data Management Service Suite
 
6.15.17 DSpace-Cris Webinar Presentation Slides
6.15.17 DSpace-Cris Webinar Presentation Slides6.15.17 DSpace-Cris Webinar Presentation Slides
6.15.17 DSpace-Cris Webinar Presentation Slides
 
Access the world’s research outputs through the CORE API
Access the world’s research outputs through the CORE API Access the world’s research outputs through the CORE API
Access the world’s research outputs through the CORE API
 
Dataverse for Journals
Dataverse for JournalsDataverse for Journals
Dataverse for Journals
 
Visualizing Co-authorship Networks for Actionable Insights: Action Design Res...
Visualizing Co-authorship Networks for Actionable Insights: Action Design Res...Visualizing Co-authorship Networks for Actionable Insights: Action Design Res...
Visualizing Co-authorship Networks for Actionable Insights: Action Design Res...
 
Executable papers
Executable papersExecutable papers
Executable papers
 
ElN - repository integration at the University of Goettingen
ElN - repository integration at the University of GoettingenElN - repository integration at the University of Goettingen
ElN - repository integration at the University of Goettingen
 
Who will use the open data? Mark Humphries keynote
Who will use the open data? Mark Humphries keynoteWho will use the open data? Mark Humphries keynote
Who will use the open data? Mark Humphries keynote
 
DataCite How To: Use the MDS
DataCite How To: Use the MDSDataCite How To: Use the MDS
DataCite How To: Use the MDS
 
Pieper NISO Virtual Conf Feb17
Pieper NISO Virtual Conf Feb17Pieper NISO Virtual Conf Feb17
Pieper NISO Virtual Conf Feb17
 
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
 
Brooking Ingesting Metadata - FINAL
Brooking Ingesting Metadata - FINALBrooking Ingesting Metadata - FINAL
Brooking Ingesting Metadata - FINAL
 

Andere mochten auch

ORGANIZADORES GRÁFICOS TICS
ORGANIZADORES GRÁFICOS TICSORGANIZADORES GRÁFICOS TICS
ORGANIZADORES GRÁFICOS TICSJessica Cruz
 
Mature, Episode 5: Groceries
Mature, Episode 5: GroceriesMature, Episode 5: Groceries
Mature, Episode 5: Groceriesstealmyscripts
 
2 summary plan description
2 summary plan description2 summary plan description
2 summary plan descriptionErin Kerrigan
 
Game Monetization Tips & Techniques
Game Monetization Tips & TechniquesGame Monetization Tips & Techniques
Game Monetization Tips & TechniquesMochammad Masbuchin
 
Mary C6 Evaluation Question 1
Mary C6 Evaluation Question 1Mary C6 Evaluation Question 1
Mary C6 Evaluation Question 1salesian2014as
 
Rethink mental illness
Rethink mental illnessRethink mental illness
Rethink mental illnessBecca Burrell
 
Mark Question 3 media studies evaluation
Mark Question 3 media studies evaluationMark Question 3 media studies evaluation
Mark Question 3 media studies evaluationsalesian2014as
 
Challenges and Advances in Large-scale DFT Calculations on GPUs using TeraChem
Challenges and Advances in Large-scale DFT Calculations on GPUs using TeraChemChallenges and Advances in Large-scale DFT Calculations on GPUs using TeraChem
Challenges and Advances in Large-scale DFT Calculations on GPUs using TeraChemCan Ozdoruk
 
How to make a hermes handbag, hermes birkin bag, kelly bags, shoulder bags
How to make a hermes handbag, hermes birkin bag, kelly bags, shoulder bags How to make a hermes handbag, hermes birkin bag, kelly bags, shoulder bags
How to make a hermes handbag, hermes birkin bag, kelly bags, shoulder bags eloger123
 
Apptividia product introduction
Apptividia product introductionApptividia product introduction
Apptividia product introductionapptividia
 
Limas 131127003659-phpapp02
Limas 131127003659-phpapp02Limas 131127003659-phpapp02
Limas 131127003659-phpapp02Phond Sarsen
 

Andere mochten auch (19)

ORGANIZADORES GRÁFICOS TICS
ORGANIZADORES GRÁFICOS TICSORGANIZADORES GRÁFICOS TICS
ORGANIZADORES GRÁFICOS TICS
 
A c rezumat
A c rezumatA c rezumat
A c rezumat
 
Mature, Episode 5: Groceries
Mature, Episode 5: GroceriesMature, Episode 5: Groceries
Mature, Episode 5: Groceries
 
Notice
NoticeNotice
Notice
 
Rextone engineering
Rextone engineeringRextone engineering
Rextone engineering
 
2 summary plan description
2 summary plan description2 summary plan description
2 summary plan description
 
Game Monetization Tips & Techniques
Game Monetization Tips & TechniquesGame Monetization Tips & Techniques
Game Monetization Tips & Techniques
 
Mary C6 Evaluation Question 1
Mary C6 Evaluation Question 1Mary C6 Evaluation Question 1
Mary C6 Evaluation Question 1
 
Whateverjeanne
WhateverjeanneWhateverjeanne
Whateverjeanne
 
Sinister Movie Review
Sinister Movie ReviewSinister Movie Review
Sinister Movie Review
 
Rethink mental illness
Rethink mental illnessRethink mental illness
Rethink mental illness
 
Mark Question 3 media studies evaluation
Mark Question 3 media studies evaluationMark Question 3 media studies evaluation
Mark Question 3 media studies evaluation
 
Psychology of bullying
Psychology of bullyingPsychology of bullying
Psychology of bullying
 
Challenges and Advances in Large-scale DFT Calculations on GPUs using TeraChem
Challenges and Advances in Large-scale DFT Calculations on GPUs using TeraChemChallenges and Advances in Large-scale DFT Calculations on GPUs using TeraChem
Challenges and Advances in Large-scale DFT Calculations on GPUs using TeraChem
 
How to make a hermes handbag, hermes birkin bag, kelly bags, shoulder bags
How to make a hermes handbag, hermes birkin bag, kelly bags, shoulder bags How to make a hermes handbag, hermes birkin bag, kelly bags, shoulder bags
How to make a hermes handbag, hermes birkin bag, kelly bags, shoulder bags
 
Apptividia product introduction
Apptividia product introductionApptividia product introduction
Apptividia product introduction
 
Limas 131127003659-phpapp02
Limas 131127003659-phpapp02Limas 131127003659-phpapp02
Limas 131127003659-phpapp02
 
Evaluation 1
Evaluation 1Evaluation 1
Evaluation 1
 
Perinatologi
PerinatologiPerinatologi
Perinatologi
 

Ähnlich wie SEO-Optimized Title for INSPIRE Collaboration Digital Library Project Presentation

CNI fall 2009 enhanced publications john_doove-SURFfoundation
CNI fall 2009 enhanced publications john_doove-SURFfoundationCNI fall 2009 enhanced publications john_doove-SURFfoundation
CNI fall 2009 enhanced publications john_doove-SURFfoundationJohn Doove
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchangelagoze
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumAnita de Waard
 
Academic SEO, or: How do I get my research to show up in search engines and d...
Academic SEO, or: How do I get my research to show up in search engines and d...Academic SEO, or: How do I get my research to show up in search engines and d...
Academic SEO, or: How do I get my research to show up in search engines and d...Open Knowledge Maps
 
eLanguage.net: Shifting the paradigm in Linguistics
eLanguage.net: Shifting the paradigm in LinguisticseLanguage.net: Shifting the paradigm in Linguistics
eLanguage.net: Shifting the paradigm in LinguisticsCornelius Puschmann
 
20110830 Introducing the Social Media Research Foundation
20110830 Introducing the Social Media Research Foundation20110830 Introducing the Social Media Research Foundation
20110830 Introducing the Social Media Research FoundationMarc Smith
 
Fedora Overview
Fedora OverviewFedora Overview
Fedora Overvieweposthumus
 
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and toolsOpen Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and toolsOpenAIRE
 
Modular Documentation Joe Gelb Techshoret 2009
Modular Documentation Joe Gelb Techshoret 2009Modular Documentation Joe Gelb Techshoret 2009
Modular Documentation Joe Gelb Techshoret 2009Suite Solutions
 
20080903arsenalsofnemesis 04
20080903arsenalsofnemesis 0420080903arsenalsofnemesis 04
20080903arsenalsofnemesis 04Richard Ovenden
 
Feedable, Portable, Mashable, DITAble
Feedable, Portable, Mashable, DITAbleFeedable, Portable, Mashable, DITAble
Feedable, Portable, Mashable, DITAbleMichael Priestley
 
Poster: Using Open Source Tools to Improve Access to Oral History Collections
Poster: Using Open Source Tools to Improve Access to Oral History CollectionsPoster: Using Open Source Tools to Improve Access to Oral History Collections
Poster: Using Open Source Tools to Improve Access to Oral History CollectionsBecky Yoose
 
Lombardi Wikis - collaborative information development, with DITA XML in the mix
Lombardi Wikis - collaborative information development, with DITA XML in the mixLombardi Wikis - collaborative information development, with DITA XML in the mix
Lombardi Wikis - collaborative information development, with DITA XML in the mixguest47c1f1
 
Lombardi Wikis - a CenTex DITA UG panel presentation
Lombardi Wikis - a CenTex DITA UG panel presentationLombardi Wikis - a CenTex DITA UG panel presentation
Lombardi Wikis - a CenTex DITA UG panel presentationLisa Dyer
 

Ähnlich wie SEO-Optimized Title for INSPIRE Collaboration Digital Library Project Presentation (20)

CNI fall 2009 enhanced publications john_doove-SURFfoundation
CNI fall 2009 enhanced publications john_doove-SURFfoundationCNI fall 2009 enhanced publications john_doove-SURFfoundation
CNI fall 2009 enhanced publications john_doove-SURFfoundation
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
 
Dspace
DspaceDspace
Dspace
 
Dspace
DspaceDspace
Dspace
 
Myresearchhelper
MyresearchhelperMyresearchhelper
Myresearchhelper
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
 
Academic SEO, or: How do I get my research to show up in search engines and d...
Academic SEO, or: How do I get my research to show up in search engines and d...Academic SEO, or: How do I get my research to show up in search engines and d...
Academic SEO, or: How do I get my research to show up in search engines and d...
 
eLanguage.net: Shifting the paradigm in Linguistics
eLanguage.net: Shifting the paradigm in LinguisticseLanguage.net: Shifting the paradigm in Linguistics
eLanguage.net: Shifting the paradigm in Linguistics
 
20110830 Introducing the Social Media Research Foundation
20110830 Introducing the Social Media Research Foundation20110830 Introducing the Social Media Research Foundation
20110830 Introducing the Social Media Research Foundation
 
Fedora Overview
Fedora OverviewFedora Overview
Fedora Overview
 
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and toolsOpen Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
 
Modular Documentation Joe Gelb Techshoret 2009
Modular Documentation Joe Gelb Techshoret 2009Modular Documentation Joe Gelb Techshoret 2009
Modular Documentation Joe Gelb Techshoret 2009
 
Inroduction to Dspace
Inroduction to DspaceInroduction to Dspace
Inroduction to Dspace
 
20080903arsenalsofnemesis 04
20080903arsenalsofnemesis 0420080903arsenalsofnemesis 04
20080903arsenalsofnemesis 04
 
Feedable, Portable, Mashable, DITAble
Feedable, Portable, Mashable, DITAbleFeedable, Portable, Mashable, DITAble
Feedable, Portable, Mashable, DITAble
 
Poster: Using Open Source Tools to Improve Access to Oral History Collections
Poster: Using Open Source Tools to Improve Access to Oral History CollectionsPoster: Using Open Source Tools to Improve Access to Oral History Collections
Poster: Using Open Source Tools to Improve Access to Oral History Collections
 
Lombardi Wikis - collaborative information development, with DITA XML in the mix
Lombardi Wikis - collaborative information development, with DITA XML in the mixLombardi Wikis - collaborative information development, with DITA XML in the mix
Lombardi Wikis - collaborative information development, with DITA XML in the mix
 
Lombardi Wikis - a CenTex DITA UG panel presentation
Lombardi Wikis - a CenTex DITA UG panel presentationLombardi Wikis - a CenTex DITA UG panel presentation
Lombardi Wikis - a CenTex DITA UG panel presentation
 
Library cloud abcd
Library cloud   abcdLibrary cloud   abcd
Library cloud abcd
 
Week4
Week4Week4
Week4
 

Mehr von Zaven Hakopov

Digital repositories and Knowledge Management
Digital repositories and Knowledge ManagementDigital repositories and Knowledge Management
Digital repositories and Knowledge ManagementZaven Hakopov
 
INIS Activities Main with Animations.-for Show
INIS Activities Main with Animations.-for ShowINIS Activities Main with Animations.-for Show
INIS Activities Main with Animations.-for ShowZaven Hakopov
 
General introduction to Knowledge Management
General introduction to Knowledge ManagementGeneral introduction to Knowledge Management
General introduction to Knowledge ManagementZaven Hakopov
 
Research data: what can libraries do?
Research data: what can libraries do?Research data: what can libraries do?
Research data: what can libraries do?Zaven Hakopov
 

Mehr von Zaven Hakopov (7)

DPHEP_BLUETWO_001
DPHEP_BLUETWO_001DPHEP_BLUETWO_001
DPHEP_BLUETWO_001
 
finalDIS
finalDISfinalDIS
finalDIS
 
Digital repositories and Knowledge Management
Digital repositories and Knowledge ManagementDigital repositories and Knowledge Management
Digital repositories and Knowledge Management
 
INIS E-Learning
INIS E-LearningINIS E-Learning
INIS E-Learning
 
INIS Activities Main with Animations.-for Show
INIS Activities Main with Animations.-for ShowINIS Activities Main with Animations.-for Show
INIS Activities Main with Animations.-for Show
 
General introduction to Knowledge Management
General introduction to Knowledge ManagementGeneral introduction to Knowledge Management
General introduction to Knowledge Management
 
Research data: what can libraries do?
Research data: what can libraries do?Research data: what can libraries do?
Research data: what can libraries do?
 

SEO-Optimized Title for INSPIRE Collaboration Digital Library Project Presentation

  • 1. Zaven Akopov (DESY -L-) For the INSPIRE Collaboration DESY Computing Seminar
  • 2. Joint Project of CERN, DESY, Fermilab and SLAC SPIRES: wonderful system, largest HEP database, best-curated content, but..old engine (>30 years): need a modern open-source multimedia digital library Unify SPIRES content with Invenio platform Invenio = Open source digital library ○ http://invenio-software.org SPIRES + Invenio = InSpire
  • 3. Invenio Integrated digital library system written largely in Python MySQL database modular built Navigable collection tree Documents organized in collections Regular and virtual collection trees Customizable portal-boxes for each collection Powerful search engine Specially designed indexes to provide fast search speed for repositories of up to 2,000,000 records Customizable simple and advanced search interfaces Flexible metadata Standard metadata format (MARC) Handling articles, books, theses, photos, videos, museum objects and more User personalization Baskets, e-mail notifications, comments, etc.
  • 4. DESY participation Input of Journal/Article Data HEP Ontology (Keywords) Input Hierarchy of HEP concepts based on DESY HEP Thesaurus DESY assigns keywords and classification to HEP Articles since 1964 SPIRES/InSPIRE mirror website
  • 5. Where are we? First Beta site released April 2010 Production Beta released a week ago http://inspirebeta.net Live Now Populated with SPIRES content daily Additional features Bugs are getting ironed out, but already:
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 15. More to come Personal libraries, alerts Claim my papers (with arXiv and ORCID (Open Researcher and Contributor ID)) Submit theses and old non-arXiv material Attach non-text material OCR of older materials Even better feeds (with ADS, arXiv, Publishers)
  • 16. Automatic Disambiguation Henning Weiler - PhD student@CERN On 963 documents, 21 real authors could be identified for the query "Chen, G". 22 orphans remain 98% identified
  • 17. User Accounts Tied to academic affiliation Ability to correct information and claim papers Corrections still vetted by staff Add “corporate accounts” for collaborations
  • 18. Data - Soon Partnership and interlinking with HEPData HepData reloaded: reinventing the HEP data archive. Andy Buckley, Mike Whalley. Jun 2010. e-Print: arXiv:1006.0517 [hep-ex] http://hepdata.cedar.ac.uk/ HEPData+INSPIRE working with LHC and other experiments to ease submission process and interlinking Move towards citation/tracking use – reputation… Storage for other objects like ROOT, Mathematica, etc.
  • 20. Full-cycle of a publication Up to now, we've captured product: Papers Considering Data Currently, through DPHEP, opportunity to build infrastructure for capturing the process: Internal Notes Technical/Software Documentation Logbooks
  • 21. Wikis Increasingly popular central place to aggregate documentation Users structure the data for us Backups and 'dumps' are generally easy to make And usually in an easily digestible format (like XML)
  • 22. Tools For MediaWiki, most of the essential tools already exist. Wikimedia Foundation (Wikipedia) is interested in seeing what we do with them. From discussions with them, they are supportive of what we're trying to do
  • 23. Nascent BaBar Wiki MediaWiki Instance with: 162 content pages 201 total pages (talk, redirects, etc.) 22 registered users Simple script can easily produce dumps.
  • 24. Scenarios Level 0 Service: Basic Preservation Index and store wiki snapshot data as if it were a scientific publication (with many authors) Level 1 Service: Readable Snapshots Level 0 + read-only final version respecting formatting, etc. Level 2: Multiple Snapshots Level 0 + Level 1 for each of multiple wiki “release points”, with full(?) metadata Linking with Papers
  • 25. Publication/Drafting History: H1 Example A publication history includes: Set of preliminary results (typically, prepared for/as conference reports), short papers with associated figures. Actual publication process which begins with a pre-T0 report, which goes then through T0 talk to First/Second/… draft. Each draft stage has it’s set of answers (comments by collaboration and answers to them); typically a referee report And a final version that goes to the journal.
  • 27.
  • 28. How does it work? External Users can see the links from Conference talks to final papers, but nothing in between Access control – must be registered and validated (e-mail ping): already planned “Corporate” accounts for collaboration to update page Individual access via connection with collaboration…(Any paper? Current membership? What about long-term?) In development
  • 29. Access Main challenge: Access policies and their technical implementation Need input from collaborations to create policies. One size does not fit all. Easy – master access file maintained by coll. But not long-term… Medium – Computation based on author lists (not always correct?) Harder – Individual access lists depending on date of object and date of access OAIS (ISO standard) etc. can help us implement these in line with archival best practices
  • 30. Questions? For more information on INSPIRE see http://www.projecthepinspire.net Just try it out! http://inspirebeta.net