SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Improving Library Services with
Semantic Web Technology
 - in the realm of Repository Systems
Dr. Timo Borst

Head of IT Development
German National Library for Economics /
Leibniz-Information Centre Economics
Kiel/Hamburg, Germany

ICDK 2011
14th – 16th February, Gurgaon/India

                                          Die ZBW ist Mitglied der Leibniz-Gemeinschaft
Overview
1. Current situation: Distributed (meta-)data management in library
   applications

2. Popular approaches towards aggregation and homogeneity of
   metadata

3. Our approach: Integration and aggregation of authority values
   with Semantic Web technology
         a) General idea
         b) Use case: Indexing
         c) Use case: Retrieving

4. “Lightweight” integration into existing repository systems and
   service providers

5. Conclusion

                                                                    Seite 2
Current situation

•   The rise of repository systems for academic publishing…



•   …has led to a landscape of distributed systems, each of them
    holding its own metadata…



•   …which is harvested and aggregated by service providers




                                                                   Seite 3
Popular approaches towards aggregation and
homogeneity of metadata
•   Normalization in advance (before harvesting) requires

      •   a mandatory metadata scheme to be applied by the local repositories
      •   a set of controlled vocabularies (e.g. for publication types)
      •   an automatic validation of the harvested metadata

•   Normalization afterwards (after harvesting) requires

      •   the definition of a minimum set of metadata fields
      •   the definition of a basic intermediate metadata scheme for normalizing
          the heterogeneous metadata records,
      •   optionally data cleansing strategies like name disambiguation and
          automatic indexing on the basis of thesauri


Both approaches are problematic and reveal ambiguities on the aggregation level !



                                                                                    Seite 4
Current situation

•   …sounds easy and straight, but implies
    severe problems esp. with regard to
    ambiguity of
     • author names
     • subject headings




                                             Seite 5
Current situation
„The major difficulty we have found is with DSpace’s handling of
metadata. While we feel that the number of fields in Dublin Core is
adequate for most if not all uses (DCMI Usage Board 2006), we are
troubled by the lack of authority control when completing its fields.
Without some control over uniform titles, authors and subjects
accessing the items in the future will very problematic.“
S. Chabot (http://subjectobject.net/2006/11/09/the-dspace-digital-
repository-a-project-analysis/)
       „Neither the standards nor the software unterlying
      institutional repositories anticipated performing naming
      authority control on widely disparate metadata from
      highly unreliable sources.“
      D. Salo (http://minds.wisconsin.edu/handle/1793/31735)


                                                                     Seite 6
Our approach: Integration of authority values with
Semantic Web technology

•   General idea: “Provide a framework for integrating authority
    data, which is both normative and flexible enough to tolerate
    local idiosyncrasies on a string level.”
•   Approach: Concept modelling based on Semantic Web / SKOS
    standards




                                                                    Seite 7
Our approach: Integration of authority values with
Semantic Web technology




                                                Seite 8
Our approach: Integration of authority values with
Semantic Web technology – Web service
Example queries (for concepts):




http://zbw.eu/beta/stw-ws/suggest?query=finanzkr
…delivers all terms beginning with “finanzkr”

http://zbw.eu/beta/stw-ws/stw-ws-wrapper.php?service=labels&
concept=http://zbw.eu/stw/descriptor/19664-4&lang=en
…delivers all english synonyms of the german “Finanzkrise”

                                                               Seite 9
Use case: (Self-)Indexing
•   One of the most prominent use cases especially for librarians, but also
    for scientists and active users not familiar with subject specific
    vocabularies
•   Main goals:
     •    Support the process of indexing in order to achieve a classification
          of documents which is both coherent and flexible in the sense that
          it permits local idiosyncrasies related to authority terms
     •    Align different vocabularies in the sense that indexing in one
          vocabulary is automatically linked to another vocabulary
•   Implementation: Extension of the submission interface of our repository by
    integrating the terminology web service as an autosuggest function



                                                                        Seite 10
Use case: (Self-)Indexing

Submission form https://econstor.eu




                                      Seite 11
Use case: Retrieving
•   To be considered as the most important use case

•   Often leading into the classical dilemma of precision and
    recall
•   Main goal:
     • Support the process of retrieving, so users can find the
       relevant set of documents

•   Implementation: Automatic expansion of the original query with
    synonyms, narrower and related terms




                                                                 Seite 12
Use case: Retrieving

Expanded search for „financial crisis“ http://econstor.eu




                                                            Seite 13
Use case: Retrieving

Expanded search for „financial crisis“ http://econstor.eu




                                                            Seite 14
Use case: Retrieving

Expanded search for „financial crisis“ http://econstor.eu




                                                            Seite 15
Anwendungsfall_2: Suche




                          Seite 16
Anwendungsfall_2: Suche




                          Seite 17
“Lightweight” integration into existing repository systems
and service providers




                                                             Seite 18
“Lightweight” integration into existing repository systems
and service providers
Benefits
• „Lightweight“ extension of legacy systems
• Strategy of „least intrusion“: No update or migration needed
• No changes to the core system, only some changes to the data model
  may be required:
  • Additional column for storing the URI of the authority key
  • Export resp. harvesting of the authority as a resource must be able
      (->OAI-ORE)

• Other types of library applications suitable for these adaptations:
  •   catalogues
  •   portals (e.g. to generate publication lists from an identified author or
      thematic issues)
  •   Any collaborative system with annotation system

                                                                                 Seite 19
Zusammenfassung und Fazit
• Bibliotheksanwendungen erzeugen und verwalten jeweils eigene
  idiosynkratische Datenbestände.
• Dies erschwert die Pflege, den Austausch, die Aggregation und die
  Homogenisierung der (Meta-)Daten für erweiterte Dienste.
• Vorgelagerte Webservices als Teil einer übergreifenden Normdaten-
  Infrastruktur können frühzeitig zur Homogenisierung der Metadaten
  beitragen (bei gleichzeitiger Lokalisierung).
• Wenn diese Webservices verbreitet entstehen und genutzt werden,
  besteht die Chance zu einer weitergehenden Vernetzung lokal
  gepflegter Metadaten bei gleichzeitiger Verbesserung der
  datenbasierten Services.
• Die Möglichkeit zur „leichtgewichtigen Integration“ ist ein Angebot an
  Betreiber von Bibliotheksanwendungen, diese Webservices mit
  möglichst minimalem Aufwand in ihre Anwendungen zu integrieren.
                                                                   Seite 20
Vielen Dank!


Dr. Timo Borst
Deutsche Zentralbibliothek für
Wirtschaftswissenschaften /
Leibniz-Informationszentrum
Wirtschaft (ZBW)

t.borst@zbw.eu



                                 Seite 21
Anwendungsfall_3: Erfassung von Autoren


  •Der Normalfall in Katalogen - in anderen Erfassungssystemen bisher
  der Ausnahmefall
  •Nutzergruppen: BibliothekarInnen + WissenschaftlerInnen (?) +
  BibliotheksnutzerInnen (?)
  •Vorgang: Eingabe von AutorInnen-Namen
  •Zielstellung: Den Vorgang der Autorenerfassung mit Hilfe von
  Normdaten zu verbessern, die durch Webservices bereit gestellt werden




                                                                          Seite 22
Anwendungsfall_3: Erfassung von Autoren
•Erfassungsmaske unter http://87.106.250.18/beta/econstor/




                                                             Seite 23
Bisherige Lösungsansätze zur Aggregierung &
Homogenisierung
  •Metadatensuche durch Aggregatoren
  •     Parallele Abfrage entfernt-verteilter Systeme
  •     Rückgabe und Aufbereitung des Suchergebnisses als
        zusammengesetzte Trefferliste
  •Harvesting
  •     Regelmäßiges Einsammeln von entfernt-verteilten
        Metadaten
  •     Homogenisierung ex ante oder ex post
  •Föderierte Suche
  •…

                                                            Seite 24
•[1] http://wiki.dspace.org/index.php/Authority_Control_of_Metadata_Values
Literatur
  •[2] http://minds.wisconsin.edu/handle/1793/31735
   •[3] http://dsug09.ub.gu.se/index.php/dsug/dsug09/paper/view/22/3
   •[4] http://subjectobject.net/2006/11/09/the-dspace-digital-repository-a-project-analysis/
   •[5] http://code.google.com/p/dspace-agrisap/wiki/ThesaurusAddOn
   •[6] http://edoc.hu-berlin.de/conferences/dc-2008/subirats-imma-199/PDF/subirats.pdf
   •[7] http://www.jisc.ac.uk/media/documents/programmes/sharedservices/na
   mes-phase-one-final-report,.pdf
   •[8] http://idea.library.drexel.edu/bitstream/1860/3173/1/20070051011.pdf
   •[9] http://ptsefton.com/blog/2006/06/06/the_affiliation_issue_in
   _institutional_repository_software/
   •[10] http://library.ust.hk/info/nac/nac-technical.html
   •[11] http://www.seco.tkk.fi/publications/2009/kurki-hyvonen-onki-people-2009.pdf
   •[12] http://journals.sfu.ca/archivar/index.php/archivaria/article/download/11883/12836
   •[13] http://www.dini.de/fileadmin/workshops/oa-netzwerk-
   juni2009/vernetzungstage_2009_malitz.pdf




                                                                                    Seite 25

Weitere ähnliche Inhalte

Was ist angesagt?

The network reshapes the research library collection
The network reshapes the research library collectionThe network reshapes the research library collection
The network reshapes the research library collectionlisld
 
(Big) bibliographic data @ ScaDS project meeting - 2015-06-12
(Big) bibliographic data @ ScaDS project meeting - 2015-06-12(Big) bibliographic data @ ScaDS project meeting - 2015-06-12
(Big) bibliographic data @ ScaDS project meeting - 2015-06-12Felix Lohmeier
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebPascal-Nicolas Becker
 
Going, going, gone - Can legal deposit save us from the digital black hole? -...
Going, going, gone - Can legal deposit save us from the digital black hole? -...Going, going, gone - Can legal deposit save us from the digital black hole? -...
Going, going, gone - Can legal deposit save us from the digital black hole? -...CONUL Conference
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikisSören Auer
 
The Heterogenous Zone: Six use cases for six research data collections in Edi...
The Heterogenous Zone: Six use cases for six research data collections in Edi...The Heterogenous Zone: Six use cases for six research data collections in Edi...
The Heterogenous Zone: Six use cases for six research data collections in Edi...EDINA, University of Edinburgh
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamEnno Meijers
 
DBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of DataDBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of DataChris Bizer
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Peter Löwe
 
Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Janifer Gatenby
 
Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream ...
Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream ...Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream ...
Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream ...Sandra McIntyre
 
COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015Comsode - FP7 project
 
“Selecting for Sustainability” Maine Shared Collections Strategy
“Selecting for Sustainability”Maine Shared Collections Strategy“Selecting for Sustainability”Maine Shared Collections Strategy
“Selecting for Sustainability” Maine Shared Collections StrategyMaine_SharedCollections
 
From local infrastructure to engagement - thinking about the library in the l...
From local infrastructure to engagement - thinking about the library in the l...From local infrastructure to engagement - thinking about the library in the l...
From local infrastructure to engagement - thinking about the library in the l...lisld
 
Open Archives Initiatives For Metadata Harvesting
Open Archives Initiatives For Metadata   HarvestingOpen Archives Initiatives For Metadata   Harvesting
Open Archives Initiatives For Metadata HarvestingNikesh Narayanan
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataPascal-Nicolas Becker
 

Was ist angesagt? (20)

The network reshapes the research library collection
The network reshapes the research library collectionThe network reshapes the research library collection
The network reshapes the research library collection
 
(Big) bibliographic data @ ScaDS project meeting - 2015-06-12
(Big) bibliographic data @ ScaDS project meeting - 2015-06-12(Big) bibliographic data @ ScaDS project meeting - 2015-06-12
(Big) bibliographic data @ ScaDS project meeting - 2015-06-12
 
The WSTIERIA Project – A Web of Services
The  WSTIERIA Project – A Web of ServicesThe  WSTIERIA Project – A Web of Services
The WSTIERIA Project – A Web of Services
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
 
Going, going, gone - Can legal deposit save us from the digital black hole? -...
Going, going, gone - Can legal deposit save us from the digital black hole? -...Going, going, gone - Can legal deposit save us from the digital black hole? -...
Going, going, gone - Can legal deposit save us from the digital black hole? -...
 
Cummings Level Up: Building Data Services
Cummings Level Up: Building Data ServicesCummings Level Up: Building Data Services
Cummings Level Up: Building Data Services
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikis
 
The Heterogenous Zone: Six use cases for six research data collections in Edi...
The Heterogenous Zone: Six use cases for six research data collections in Edi...The Heterogenous Zone: Six use cases for six research data collections in Edi...
The Heterogenous Zone: Six use cases for six research data collections in Edi...
 
Open Spatial Data: Sources and Tools
Open Spatial Data: Sources and ToolsOpen Spatial Data: Sources and Tools
Open Spatial Data: Sources and Tools
 
OAI and OAI-PMH
OAI and OAI-PMHOAI and OAI-PMH
OAI and OAI-PMH
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics Amsterdam
 
DBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of DataDBpedia - An Interlinking Hub in the Web of Data
DBpedia - An Interlinking Hub in the Web of Data
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
 
Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19
 
Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream ...
Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream ...Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream ...
Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream ...
 
COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015
 
“Selecting for Sustainability” Maine Shared Collections Strategy
“Selecting for Sustainability”Maine Shared Collections Strategy“Selecting for Sustainability”Maine Shared Collections Strategy
“Selecting for Sustainability” Maine Shared Collections Strategy
 
From local infrastructure to engagement - thinking about the library in the l...
From local infrastructure to engagement - thinking about the library in the l...From local infrastructure to engagement - thinking about the library in the l...
From local infrastructure to engagement - thinking about the library in the l...
 
Open Archives Initiatives For Metadata Harvesting
Open Archives Initiatives For Metadata   HarvestingOpen Archives Initiatives For Metadata   Harvesting
Open Archives Initiatives For Metadata Harvesting
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked Data
 

Andere mochten auch

Improving Patient Safety - Five years after the IOM Report
Improving Patient Safety - Five years after the IOM ReportImproving Patient Safety - Five years after the IOM Report
Improving Patient Safety - Five years after the IOM ReportISOB
 
Contact Center Report Positioning
Contact Center Report PositioningContact Center Report Positioning
Contact Center Report PositioningSpectrum
 
Improving the Development Process
Improving the Development ProcessImproving the Development Process
Improving the Development ProcessFreeBalance
 
Faculty Focus Special Report Effective Strategies for Improving College Teach...
Faculty Focus Special Report Effective Strategies for Improving College Teach...Faculty Focus Special Report Effective Strategies for Improving College Teach...
Faculty Focus Special Report Effective Strategies for Improving College Teach...Dillard University Library
 
Disaster Preparedness For People With Disabilities
Disaster Preparedness For People With  DisabilitiesDisaster Preparedness For People With  Disabilities
Disaster Preparedness For People With Disabilitieseruditemike
 

Andere mochten auch (6)

Marking report
Marking reportMarking report
Marking report
 
Improving Patient Safety - Five years after the IOM Report
Improving Patient Safety - Five years after the IOM ReportImproving Patient Safety - Five years after the IOM Report
Improving Patient Safety - Five years after the IOM Report
 
Contact Center Report Positioning
Contact Center Report PositioningContact Center Report Positioning
Contact Center Report Positioning
 
Improving the Development Process
Improving the Development ProcessImproving the Development Process
Improving the Development Process
 
Faculty Focus Special Report Effective Strategies for Improving College Teach...
Faculty Focus Special Report Effective Strategies for Improving College Teach...Faculty Focus Special Report Effective Strategies for Improving College Teach...
Faculty Focus Special Report Effective Strategies for Improving College Teach...
 
Disaster Preparedness For People With Disabilities
Disaster Preparedness For People With  DisabilitiesDisaster Preparedness For People With  Disabilities
Disaster Preparedness For People With Disabilities
 

Ähnlich wie Improving library services with semantic web technology in the realm of repositories

Cloud web scale discovery services landscape an overview
Cloud web scale discovery services landscape an overviewCloud web scale discovery services landscape an overview
Cloud web scale discovery services landscape an overviewNikesh Narayanan
 
Implementing web scale discovery services: special reference to Indian Librar...
Implementing web scale discovery services: special reference to Indian Librar...Implementing web scale discovery services: special reference to Indian Librar...
Implementing web scale discovery services: special reference to Indian Librar...Nikesh Narayanan
 
Evaluation of Web Scale Discovery Services
Evaluation of Web Scale Discovery ServicesEvaluation of Web Scale Discovery Services
Evaluation of Web Scale Discovery ServicesNikesh Narayanan
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosOCLC
 
Module 1 - Chapter1.pptx
Module 1 - Chapter1.pptxModule 1 - Chapter1.pptx
Module 1 - Chapter1.pptxSoniaDevi15
 
Session 1.4 a distributed network of heritage information
Session 1.4   a distributed network of heritage informationSession 1.4   a distributed network of heritage information
Session 1.4 a distributed network of heritage informationsemanticsconference
 
New ICT Trends and Issues of Librarianship
New ICT Trends and Issues of LibrarianshipNew ICT Trends and Issues of Librarianship
New ICT Trends and Issues of LibrarianshipLiaquat Rahoo
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
Collective Funding Models for OA Books 3 - Thoth presentation.pptx
Collective Funding Models for OA Books 3 - Thoth presentation.pptxCollective Funding Models for OA Books 3 - Thoth presentation.pptx
Collective Funding Models for OA Books 3 - Thoth presentation.pptxJisc
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Anja Jentzsch
 
RESOURCE SHARING: A LIBRARY PERCEPTIVE
RESOURCE SHARING: A LIBRARY PERCEPTIVE RESOURCE SHARING: A LIBRARY PERCEPTIVE
RESOURCE SHARING: A LIBRARY PERCEPTIVE IAEME Publication
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreAndy Powell
 
CLARIAH Toogdag 2018: A distributed network of digital heritage information
CLARIAH Toogdag 2018: A distributed network of digital heritage informationCLARIAH Toogdag 2018: A distributed network of digital heritage information
CLARIAH Toogdag 2018: A distributed network of digital heritage informationEnno Meijers
 
Open archives initiatives(final)
 Open archives initiatives(final) Open archives initiatives(final)
Open archives initiatives(final)floyd taag
 
Open archives initiatives(final)
 Open archives initiatives(final) Open archives initiatives(final)
Open archives initiatives(final)floyd taag
 
Open archives initiatives(final)
 Open archives initiatives(final) Open archives initiatives(final)
Open archives initiatives(final)floyd taag
 
-Open Archives Initiatives(final)
-Open Archives Initiatives(final)-Open Archives Initiatives(final)
-Open Archives Initiatives(final)floyd taag
 

Ähnlich wie Improving library services with semantic web technology in the realm of repositories (20)

Cloud web scale discovery services landscape an overview
Cloud web scale discovery services landscape an overviewCloud web scale discovery services landscape an overview
Cloud web scale discovery services landscape an overview
 
Implementing web scale discovery services: special reference to Indian Librar...
Implementing web scale discovery services: special reference to Indian Librar...Implementing web scale discovery services: special reference to Indian Librar...
Implementing web scale discovery services: special reference to Indian Librar...
 
Evaluation of Web Scale Discovery Services
Evaluation of Web Scale Discovery ServicesEvaluation of Web Scale Discovery Services
Evaluation of Web Scale Discovery Services
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
Module 1 - Chapter1.pptx
Module 1 - Chapter1.pptxModule 1 - Chapter1.pptx
Module 1 - Chapter1.pptx
 
Breeding 1
Breeding 1Breeding 1
Breeding 1
 
Session 1.4 a distributed network of heritage information
Session 1.4   a distributed network of heritage informationSession 1.4   a distributed network of heritage information
Session 1.4 a distributed network of heritage information
 
New ICT Trends and Issues of Librarianship
New ICT Trends and Issues of LibrarianshipNew ICT Trends and Issues of Librarianship
New ICT Trends and Issues of Librarianship
 
Digital libraries
Digital librariesDigital libraries
Digital libraries
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Collective Funding Models for OA Books 3 - Thoth presentation.pptx
Collective Funding Models for OA Books 3 - Thoth presentation.pptxCollective Funding Models for OA Books 3 - Thoth presentation.pptx
Collective Funding Models for OA Books 3 - Thoth presentation.pptx
 
Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)Linked Data (1st Linked Data Meetup Malmö)
Linked Data (1st Linked Data Meetup Malmö)
 
RESOURCE SHARING: A LIBRARY PERCEPTIVE
RESOURCE SHARING: A LIBRARY PERCEPTIVE RESOURCE SHARING: A LIBRARY PERCEPTIVE
RESOURCE SHARING: A LIBRARY PERCEPTIVE
 
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin CoreOpen for Business - Open Archives, OpenURL, RSS and the Dublin Core
Open for Business - Open Archives, OpenURL, RSS and the Dublin Core
 
Beitarie, "Toward Service-Oriented Librarianship"
Beitarie, "Toward Service-Oriented Librarianship"Beitarie, "Toward Service-Oriented Librarianship"
Beitarie, "Toward Service-Oriented Librarianship"
 
CLARIAH Toogdag 2018: A distributed network of digital heritage information
CLARIAH Toogdag 2018: A distributed network of digital heritage informationCLARIAH Toogdag 2018: A distributed network of digital heritage information
CLARIAH Toogdag 2018: A distributed network of digital heritage information
 
Open archives initiatives(final)
 Open archives initiatives(final) Open archives initiatives(final)
Open archives initiatives(final)
 
Open archives initiatives(final)
 Open archives initiatives(final) Open archives initiatives(final)
Open archives initiatives(final)
 
Open archives initiatives(final)
 Open archives initiatives(final) Open archives initiatives(final)
Open archives initiatives(final)
 
-Open Archives Initiatives(final)
-Open Archives Initiatives(final)-Open Archives Initiatives(final)
-Open Archives Initiatives(final)
 

Mehr von redsys

DSpace as publication platform
DSpace as publication platformDSpace as publication platform
DSpace as publication platformredsys
 
Einbindung von Linked Data in existierende Bibliotheksanswendungen
Einbindung von Linked Data in existierende BibliotheksanswendungenEinbindung von Linked Data in existierende Bibliotheksanswendungen
Einbindung von Linked Data in existierende Bibliotheksanswendungenredsys
 
Datenschutz für Bibliotheksanwendungen
Datenschutz für BibliotheksanwendungenDatenschutz für Bibliotheksanwendungen
Datenschutz für Bibliotheksanwendungenredsys
 
Medienkompetenz und Wikipedia an Hochschulen
Medienkompetenz und Wikipedia an HochschulenMedienkompetenz und Wikipedia an Hochschulen
Medienkompetenz und Wikipedia an Hochschulenredsys
 
Poster presentation
Poster presentationPoster presentation
Poster presentationredsys
 
Integration von Normdaten in Bibliotheksanwendungen auf der Basis von Semanti...
Integration von Normdaten in Bibliotheksanwendungen auf der Basis von Semanti...Integration von Normdaten in Bibliotheksanwendungen auf der Basis von Semanti...
Integration von Normdaten in Bibliotheksanwendungen auf der Basis von Semanti...redsys
 
Usage and impact of controlled vocabularies in a subject repository for index...
Usage and impact of controlled vocabularies in a subject repository for index...Usage and impact of controlled vocabularies in a subject repository for index...
Usage and impact of controlled vocabularies in a subject repository for index...redsys
 

Mehr von redsys (7)

DSpace as publication platform
DSpace as publication platformDSpace as publication platform
DSpace as publication platform
 
Einbindung von Linked Data in existierende Bibliotheksanswendungen
Einbindung von Linked Data in existierende BibliotheksanswendungenEinbindung von Linked Data in existierende Bibliotheksanswendungen
Einbindung von Linked Data in existierende Bibliotheksanswendungen
 
Datenschutz für Bibliotheksanwendungen
Datenschutz für BibliotheksanwendungenDatenschutz für Bibliotheksanwendungen
Datenschutz für Bibliotheksanwendungen
 
Medienkompetenz und Wikipedia an Hochschulen
Medienkompetenz und Wikipedia an HochschulenMedienkompetenz und Wikipedia an Hochschulen
Medienkompetenz und Wikipedia an Hochschulen
 
Poster presentation
Poster presentationPoster presentation
Poster presentation
 
Integration von Normdaten in Bibliotheksanwendungen auf der Basis von Semanti...
Integration von Normdaten in Bibliotheksanwendungen auf der Basis von Semanti...Integration von Normdaten in Bibliotheksanwendungen auf der Basis von Semanti...
Integration von Normdaten in Bibliotheksanwendungen auf der Basis von Semanti...
 
Usage and impact of controlled vocabularies in a subject repository for index...
Usage and impact of controlled vocabularies in a subject repository for index...Usage and impact of controlled vocabularies in a subject repository for index...
Usage and impact of controlled vocabularies in a subject repository for index...
 

Kürzlich hochgeladen

AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfOverkill Security
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 

Kürzlich hochgeladen (20)

AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Improving library services with semantic web technology in the realm of repositories

  • 1. Improving Library Services with Semantic Web Technology - in the realm of Repository Systems Dr. Timo Borst Head of IT Development German National Library for Economics / Leibniz-Information Centre Economics Kiel/Hamburg, Germany ICDK 2011 14th – 16th February, Gurgaon/India Die ZBW ist Mitglied der Leibniz-Gemeinschaft
  • 2. Overview 1. Current situation: Distributed (meta-)data management in library applications 2. Popular approaches towards aggregation and homogeneity of metadata 3. Our approach: Integration and aggregation of authority values with Semantic Web technology a) General idea b) Use case: Indexing c) Use case: Retrieving 4. “Lightweight” integration into existing repository systems and service providers 5. Conclusion Seite 2
  • 3. Current situation • The rise of repository systems for academic publishing… • …has led to a landscape of distributed systems, each of them holding its own metadata… • …which is harvested and aggregated by service providers Seite 3
  • 4. Popular approaches towards aggregation and homogeneity of metadata • Normalization in advance (before harvesting) requires • a mandatory metadata scheme to be applied by the local repositories • a set of controlled vocabularies (e.g. for publication types) • an automatic validation of the harvested metadata • Normalization afterwards (after harvesting) requires • the definition of a minimum set of metadata fields • the definition of a basic intermediate metadata scheme for normalizing the heterogeneous metadata records, • optionally data cleansing strategies like name disambiguation and automatic indexing on the basis of thesauri Both approaches are problematic and reveal ambiguities on the aggregation level ! Seite 4
  • 5. Current situation • …sounds easy and straight, but implies severe problems esp. with regard to ambiguity of • author names • subject headings Seite 5
  • 6. Current situation „The major difficulty we have found is with DSpace’s handling of metadata. While we feel that the number of fields in Dublin Core is adequate for most if not all uses (DCMI Usage Board 2006), we are troubled by the lack of authority control when completing its fields. Without some control over uniform titles, authors and subjects accessing the items in the future will very problematic.“ S. Chabot (http://subjectobject.net/2006/11/09/the-dspace-digital- repository-a-project-analysis/) „Neither the standards nor the software unterlying institutional repositories anticipated performing naming authority control on widely disparate metadata from highly unreliable sources.“ D. Salo (http://minds.wisconsin.edu/handle/1793/31735) Seite 6
  • 7. Our approach: Integration of authority values with Semantic Web technology • General idea: “Provide a framework for integrating authority data, which is both normative and flexible enough to tolerate local idiosyncrasies on a string level.” • Approach: Concept modelling based on Semantic Web / SKOS standards Seite 7
  • 8. Our approach: Integration of authority values with Semantic Web technology Seite 8
  • 9. Our approach: Integration of authority values with Semantic Web technology – Web service Example queries (for concepts): http://zbw.eu/beta/stw-ws/suggest?query=finanzkr …delivers all terms beginning with “finanzkr” http://zbw.eu/beta/stw-ws/stw-ws-wrapper.php?service=labels& concept=http://zbw.eu/stw/descriptor/19664-4&lang=en …delivers all english synonyms of the german “Finanzkrise” Seite 9
  • 10. Use case: (Self-)Indexing • One of the most prominent use cases especially for librarians, but also for scientists and active users not familiar with subject specific vocabularies • Main goals: • Support the process of indexing in order to achieve a classification of documents which is both coherent and flexible in the sense that it permits local idiosyncrasies related to authority terms • Align different vocabularies in the sense that indexing in one vocabulary is automatically linked to another vocabulary • Implementation: Extension of the submission interface of our repository by integrating the terminology web service as an autosuggest function Seite 10
  • 11. Use case: (Self-)Indexing Submission form https://econstor.eu Seite 11
  • 12. Use case: Retrieving • To be considered as the most important use case • Often leading into the classical dilemma of precision and recall • Main goal: • Support the process of retrieving, so users can find the relevant set of documents • Implementation: Automatic expansion of the original query with synonyms, narrower and related terms Seite 12
  • 13. Use case: Retrieving Expanded search for „financial crisis“ http://econstor.eu Seite 13
  • 14. Use case: Retrieving Expanded search for „financial crisis“ http://econstor.eu Seite 14
  • 15. Use case: Retrieving Expanded search for „financial crisis“ http://econstor.eu Seite 15
  • 18. “Lightweight” integration into existing repository systems and service providers Seite 18
  • 19. “Lightweight” integration into existing repository systems and service providers Benefits • „Lightweight“ extension of legacy systems • Strategy of „least intrusion“: No update or migration needed • No changes to the core system, only some changes to the data model may be required: • Additional column for storing the URI of the authority key • Export resp. harvesting of the authority as a resource must be able (->OAI-ORE) • Other types of library applications suitable for these adaptations: • catalogues • portals (e.g. to generate publication lists from an identified author or thematic issues) • Any collaborative system with annotation system Seite 19
  • 20. Zusammenfassung und Fazit • Bibliotheksanwendungen erzeugen und verwalten jeweils eigene idiosynkratische Datenbestände. • Dies erschwert die Pflege, den Austausch, die Aggregation und die Homogenisierung der (Meta-)Daten für erweiterte Dienste. • Vorgelagerte Webservices als Teil einer übergreifenden Normdaten- Infrastruktur können frühzeitig zur Homogenisierung der Metadaten beitragen (bei gleichzeitiger Lokalisierung). • Wenn diese Webservices verbreitet entstehen und genutzt werden, besteht die Chance zu einer weitergehenden Vernetzung lokal gepflegter Metadaten bei gleichzeitiger Verbesserung der datenbasierten Services. • Die Möglichkeit zur „leichtgewichtigen Integration“ ist ein Angebot an Betreiber von Bibliotheksanwendungen, diese Webservices mit möglichst minimalem Aufwand in ihre Anwendungen zu integrieren. Seite 20
  • 21. Vielen Dank! Dr. Timo Borst Deutsche Zentralbibliothek für Wirtschaftswissenschaften / Leibniz-Informationszentrum Wirtschaft (ZBW) t.borst@zbw.eu Seite 21
  • 22. Anwendungsfall_3: Erfassung von Autoren •Der Normalfall in Katalogen - in anderen Erfassungssystemen bisher der Ausnahmefall •Nutzergruppen: BibliothekarInnen + WissenschaftlerInnen (?) + BibliotheksnutzerInnen (?) •Vorgang: Eingabe von AutorInnen-Namen •Zielstellung: Den Vorgang der Autorenerfassung mit Hilfe von Normdaten zu verbessern, die durch Webservices bereit gestellt werden Seite 22
  • 23. Anwendungsfall_3: Erfassung von Autoren •Erfassungsmaske unter http://87.106.250.18/beta/econstor/ Seite 23
  • 24. Bisherige Lösungsansätze zur Aggregierung & Homogenisierung •Metadatensuche durch Aggregatoren • Parallele Abfrage entfernt-verteilter Systeme • Rückgabe und Aufbereitung des Suchergebnisses als zusammengesetzte Trefferliste •Harvesting • Regelmäßiges Einsammeln von entfernt-verteilten Metadaten • Homogenisierung ex ante oder ex post •Föderierte Suche •… Seite 24
  • 25. •[1] http://wiki.dspace.org/index.php/Authority_Control_of_Metadata_Values Literatur •[2] http://minds.wisconsin.edu/handle/1793/31735 •[3] http://dsug09.ub.gu.se/index.php/dsug/dsug09/paper/view/22/3 •[4] http://subjectobject.net/2006/11/09/the-dspace-digital-repository-a-project-analysis/ •[5] http://code.google.com/p/dspace-agrisap/wiki/ThesaurusAddOn •[6] http://edoc.hu-berlin.de/conferences/dc-2008/subirats-imma-199/PDF/subirats.pdf •[7] http://www.jisc.ac.uk/media/documents/programmes/sharedservices/na mes-phase-one-final-report,.pdf •[8] http://idea.library.drexel.edu/bitstream/1860/3173/1/20070051011.pdf •[9] http://ptsefton.com/blog/2006/06/06/the_affiliation_issue_in _institutional_repository_software/ •[10] http://library.ust.hk/info/nac/nac-technical.html •[11] http://www.seco.tkk.fi/publications/2009/kurki-hyvonen-onki-people-2009.pdf •[12] http://journals.sfu.ca/archivar/index.php/archivaria/article/download/11883/12836 •[13] http://www.dini.de/fileadmin/workshops/oa-netzwerk- juni2009/vernetzungstage_2009_malitz.pdf Seite 25