SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
1
Intra- and interdisciplinary cross-
concordances for information retrieval
Philipp Mayr
GESIS – Leibniz Institute for the Social Sciences, Bonn, Germany
8th NKOS Workshop at the 13th ECDL Conference
Corfu, Greece, 01. October 2009
2
KoMoHe Project (2004-2007)
KoMoHe (Competence Center Modeling and
Treatment of Semantic Heterogeneity)
Goals:
– Models for searching heterogeneous collections
– Development, organization & management of
cross-walks between controlled vocabularies
– IR evaluation of the mappings (effectiveness of
intellectual mapping)
3
Relations
• Equivalence
• Narrower Term
• Broader Term
• Related Term
• Null: no mapping
manually created, directed relations between controlled
terms of two knowledge organization systems (KOS)
KOS 1 Relation KOS 2
Library = Bibliothéque
Library > Special library
Thesaurus < KOS
Hacker ^
Computers +
Security
Virus 0
4
Cross-concordances
• 25 Vocabularies in 64 cross-concordances
– Thesauri (16)
– Descriptor lists (4)
– Classifications (3)
– Subject heading lists (2)
• 380,000 mapped terms
• 465,000 relations
• 205,000 equivalence relations
• 13 German, 8 English, 1 Russian, 3 multilingual
5
Disciplines
Social
Sciences (10)
Gerontology
(1)
Universal (3)
Psychology
(1)
Pedagogics
(1)
Sports
science (2)
Economics
(2)
Political
science (3)
Medicine (1)Agricultural
science (1)
Information
science (1)
6
Net of Cross-concordances
Each node represents a KOS
7
Objectives
• Translate search terms into other terminologies
• Increase diversity of documents from different
databases
• Improve search experience without effort for
searcher
• Test the effect for IR in different disciplines (social
science and others)
8
Main questions
• What is examined?
– the quality of the mappings
– or the quality of the associated search
• Can we enable distributed search with the subject
access tools over several information systems?
– In one discipline
– Between at least two disciplines
• Is the impact of terminology mapping on recall
and precision measurable?
• The mappings are helpful to whom?
9
Information Retrieval Test
Question: How effective are the mappings in an
actual search? Does the application of term
mappings improve search over a non-transformed
subject (i.e. controlled vocabulary) search?
10
Information Retrieval Tests
• Thesauri mappings only
• Only equivalence relations
• Real queries (~6 per tested cross-concordance)
• Databases: 80,000 – 16 mio. documents
• Test 1 (CT  TT): 13 Cross-concordances
• Test 2 (FT  FT+TT): 8 Cross-concordances
11
Mayr & Petras, 2008
12
Steps
• Requesting recent research topics from our
partners (social science and others)
• Intellectually translating the topics into controlled
term searches in a KOS A
• Automatically translating the controlled terms via
HTS into the controlled terms of a KOS B
• Retrieving documents from two runs
1. Controlled term (CT) search (KOS A) in database B
2. Translated term (TT) search (KOS B) in database B
13
Information Retrieval Test CT-TT
DB A
Term a
Term b
Term c
…
Term n
DB B
Term a
Term b
Term c
…
Term n
HTS
Terms Voc A Terms Voc B
DB A
Term a
Term b
Term c
…
Term n
DB B
Terms Voc A
Scenario CT
Scenario TT
HTS
(Heterogeneity
Service) ~
Web service
providing the
mappings
Run 1
Run 2
14
Information Retrieval Tests
Test 1
Intradisciplinary:
Social sc. – Social sc.
TheSoz – DZI
DZI – TheSoz
TheSoz – SWD
SWD – TheSoz
CSA – TheSoz
• 5 concordances
• 3 databases
• 35 topics
Test 3
Interdisciplinary:
Int. Relations – Economics
Medical sc. – Psychology
IBLK – STW
STW – IBLK
Mesh – Psyndex
Psyndex – Mesh
• 4 concordances
• 4 databases
• 28 topics
Test 2
Interdisciplinary:
Social sc. – Psychology
Social sc. – Economics
TheSoz – Psyndex
Psyndex – TheSoz
TheSoz – STW
STW – TheSoz
• 4 concordances
• 3 databases
• 19 topics
15
Methodology
• Downloading the documents for both runs (CT, TT),
cutt-off: 1,000 docs
• Pooling both runs (CT, TT) for each topic
• Importing the documents into a assessment tool
• Relevance assessment of the documents by experts
• Analysis of the assessment data
– Retrieved: average number of retrieved documents (across all search types)
– Relevant: average number of relevant retrieved documents (across all search types)
– Rel_ret: average number of relevant retrieved documents for a particular search type
– Recall: proportion of relevant retrieved documents out of all relevant documents
(averaged across all queries of one search type)
– Precision: proportion of relevant retrieved documents out of all retrieved documents
(averaged across all queries of one search type)
16
Assessment of the documents: by experts
17
Information Retrieval Tests - Results
• CT  TT (Improvements in %)
Recall
= Hitrate
Precision
= Accuracy
Intradisciplinary +39% +34%
Interdisciplinary +136% +68%
Recall
= Hitrate
Precision
= Accuracy
Intradisciplinary +20% -12%
Interdisciplinary +24% -24%
• FT  FT+TT (Improvements in %)
Detailled results can be found in Mayr & Petras, 2008
18
Discussion
• Overlap and more identical terms in intradisciplinary
mappings
– Mapping in one discipline is simpler: just one expert
– Lesser effect on search
– Automatic mapping may be more useful in
intradisciplinary sets: mainly syntactic matching
• Language plays a major role
– we had just one bilingual mapping in the test
• Restrictions of the study: no real users or
interactions, only thesauri, KOS in German
19
Summary
Why are cross-concordances in one discipline less
effective for IR?
• Amount of identical terms are significantly higher
in one discipline (one language)
• No effective transformation possible for IR, if you
have identical terms
Mapping projects should more often perform IR
tests to measure the effect of their mappings.
20
Conclusion
• Cross-concordances improve subject search with
controlled terms & free-text search: larger
measurable effects on interdisciplinary mappings
• Only 24% relations utilized (equivalence)
• Potential:
– Other relations
– STR  CT translation
• More mappings which are not evaluated
• Mappings are used e.g. in portals like sowiport,
vascoda, ireon, … and other projects
21
Next steps
• Visualization of the terminology network
• Combined evaluation with other value-
added services (search term
recommendation)
• Conversion to SKOS
• Evaluation of other disciplines
• Evaluation of indirect term transformation
(term – switching term – end term)
22
Publications
Mayr, Philipp; Petras, Vivien (2008): Cross-concordances:
terminology mapping and its effectiveness for information
retrieval. In: 74th IFLA World Library and Information
Congress. Québec, Canada-
http://www.ifla.org/IV/ifla74/papers/129-Mayr_Petras-
en.pdf
Mayr, Philipp; Mutschke, Peter; Petras, Vivien (2008):
Reducing semantic complexity in distributed Digital
Libraries: treatment of term vagueness and document re-
ranking. In: Library Review. 57 (2008) 3. pp. 213-224.
http://arxiv.org/abs/0712.2449
23
Indirect term transformations
Social sciences – gerontology – medicine
24
Sowiport Search
25
KoMoHe Project
http://www.gesis.org/en/research/
information_technology/komohe.htm
E-mail: philipp.mayr@gesis.org

Weitere ähnliche Inhalte

Was ist angesagt?

Proposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkProposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkAravind Sesagiri Raamkumar
 
Data Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and FutureData Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and Futurefeiwin
 
Improving data quality at Europeana (SWIB 2016)
Improving data quality at Europeana (SWIB 2016)Improving data quality at Europeana (SWIB 2016)
Improving data quality at Europeana (SWIB 2016)Péter Király
 
CORE Analytics Dashboard
CORE Analytics DashboardCORE Analytics Dashboard
CORE Analytics Dashboardpetrknoth
 
SelQA: A New Benchmark for Selection-based Question Answering
SelQA: A New Benchmark for Selection-based Question AnsweringSelQA: A New Benchmark for Selection-based Question Answering
SelQA: A New Benchmark for Selection-based Question AnsweringJinho Choi
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerFrancesco Osborne
 
The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?Frank van Harmelen
 
Early Detection and Forecasting of Research Trends
Early Detection and Forecasting of Research TrendsEarly Detection and Forecasting of Research Trends
Early Detection and Forecasting of Research TrendsAngelo Salatino
 
Detection of Embryonic Research Topics by Analysing Semantic Topic Networks
Detection of Embryonic Research Topics by Analysing Semantic Topic NetworksDetection of Embryonic Research Topics by Analysing Semantic Topic Networks
Detection of Embryonic Research Topics by Analysing Semantic Topic NetworksAngelo Salatino
 
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
8th TUC Meeting – Marcus Paradies (SAP) Social Network BenchmarkLDBC council
 
Metadata quality Assurance Framework at QQML2016 - short
Metadata quality Assurance Framework at QQML2016 - shortMetadata quality Assurance Framework at QQML2016 - short
Metadata quality Assurance Framework at QQML2016 - shortPéter Király
 
Supporting Springer Nature Editors by means of Semantic Technologies
Supporting Springer Nature Editors by means of Semantic TechnologiesSupporting Springer Nature Editors by means of Semantic Technologies
Supporting Springer Nature Editors by means of Semantic TechnologiesFrancesco Osborne
 
Metadata Quality Assurance Framework at QQML2016 conference - full version
Metadata Quality Assurance Framework at QQML2016 conference - full versionMetadata Quality Assurance Framework at QQML2016 conference - full version
Metadata Quality Assurance Framework at QQML2016 conference - full versionPéter Király
 
Newsjunkie
NewsjunkieNewsjunkie
NewsjunkieFan Jin
 
Combining IR with Relevance Feedback for Concept Location
Combining IR with Relevance Feedback for Concept LocationCombining IR with Relevance Feedback for Concept Location
Combining IR with Relevance Feedback for Concept LocationSonia Haiduc
 
Topic modeling of marketing scientific papers: An experimental survey
Topic modeling of marketing scientific papers: An experimental surveyTopic modeling of marketing scientific papers: An experimental survey
Topic modeling of marketing scientific papers: An experimental surveyICDEcCnferenece
 

Was ist angesagt? (20)

Proposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender FrameworkProposing a Scientific Paper Retrieval and Recommender Framework
Proposing a Scientific Paper Retrieval and Recommender Framework
 
Data Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and FutureData Mining and the Web_Past_Present and Future
Data Mining and the Web_Past_Present and Future
 
Improving data quality at Europeana (SWIB 2016)
Improving data quality at Europeana (SWIB 2016)Improving data quality at Europeana (SWIB 2016)
Improving data quality at Europeana (SWIB 2016)
 
CORE Analytics Dashboard
CORE Analytics DashboardCORE Analytics Dashboard
CORE Analytics Dashboard
 
SelQA: A New Benchmark for Selection-based Question Answering
SelQA: A New Benchmark for Selection-based Question AnsweringSelQA: A New Benchmark for Selection-based Question Answering
SelQA: A New Benchmark for Selection-based Question Answering
 
Data wrangling week2
Data wrangling week2Data wrangling week2
Data wrangling week2
 
Data wrangling week3
Data wrangling week3Data wrangling week3
Data wrangling week3
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
 
The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?The Web of Data: do we actually understand what we built?
The Web of Data: do we actually understand what we built?
 
Early Detection and Forecasting of Research Trends
Early Detection and Forecasting of Research TrendsEarly Detection and Forecasting of Research Trends
Early Detection and Forecasting of Research Trends
 
Detection of Embryonic Research Topics by Analysing Semantic Topic Networks
Detection of Embryonic Research Topics by Analysing Semantic Topic NetworksDetection of Embryonic Research Topics by Analysing Semantic Topic Networks
Detection of Embryonic Research Topics by Analysing Semantic Topic Networks
 
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
 
Metadata quality Assurance Framework at QQML2016 - short
Metadata quality Assurance Framework at QQML2016 - shortMetadata quality Assurance Framework at QQML2016 - short
Metadata quality Assurance Framework at QQML2016 - short
 
Supporting Springer Nature Editors by means of Semantic Technologies
Supporting Springer Nature Editors by means of Semantic TechnologiesSupporting Springer Nature Editors by means of Semantic Technologies
Supporting Springer Nature Editors by means of Semantic Technologies
 
Data wrangling week 5
Data wrangling week 5Data wrangling week 5
Data wrangling week 5
 
Metadata Quality Assurance Framework at QQML2016 conference - full version
Metadata Quality Assurance Framework at QQML2016 conference - full versionMetadata Quality Assurance Framework at QQML2016 conference - full version
Metadata Quality Assurance Framework at QQML2016 conference - full version
 
Newsjunkie
NewsjunkieNewsjunkie
Newsjunkie
 
Data wrangling week 11
Data wrangling week 11Data wrangling week 11
Data wrangling week 11
 
Combining IR with Relevance Feedback for Concept Location
Combining IR with Relevance Feedback for Concept LocationCombining IR with Relevance Feedback for Concept Location
Combining IR with Relevance Feedback for Concept Location
 
Topic modeling of marketing scientific papers: An experimental survey
Topic modeling of marketing scientific papers: An experimental surveyTopic modeling of marketing scientific papers: An experimental survey
Topic modeling of marketing scientific papers: An experimental survey
 

Andere mochten auch

Communicative language teaching
Communicative language teachingCommunicative language teaching
Communicative language teachingPatrmartin
 
Communicative Language Teaching
Communicative Language TeachingCommunicative Language Teaching
Communicative Language Teachinglilianamonserrat
 
Communicative language teaching
Communicative language teachingCommunicative language teaching
Communicative language teachingElvis Plaza
 
British Literature Project
British Literature ProjectBritish Literature Project
British Literature ProjectJavier Aguirre
 
The American Literature: A Throwback to the Rich History of Now the Most Powe...
The American Literature: A Throwback to the Rich History of Now the Most Powe...The American Literature: A Throwback to the Rich History of Now the Most Powe...
The American Literature: A Throwback to the Rich History of Now the Most Powe...Alphred Jann Naparan
 
Audience reception theory
Audience reception theoryAudience reception theory
Audience reception theoryabibatr
 
British Literature Introduction
British Literature IntroductionBritish Literature Introduction
British Literature IntroductionGrahme Smith
 
Reception theory
Reception theoryReception theory
Reception theorylillytomjoe
 
British Literature
British LiteratureBritish Literature
British LiteratureLe Demacré
 
Media Reception theory
Media Reception theoryMedia Reception theory
Media Reception theorypradanayar
 
Reader response
Reader responseReader response
Reader responsealexm1316
 
Reader response and reception theory
Reader response and reception theoryReader response and reception theory
Reader response and reception theoryMohammed Raiyah
 
2.philip larkin _the_trees
2.philip larkin _the_trees2.philip larkin _the_trees
2.philip larkin _the_treesCharter College
 
British literature through time
British literature through timeBritish literature through time
British literature through timeursulahd
 
Stuart Hall’s Reception Theory
Stuart Hall’s Reception TheoryStuart Hall’s Reception Theory
Stuart Hall’s Reception Theoryalexeglen
 

Andere mochten auch (20)

Communicative language teaching
Communicative language teachingCommunicative language teaching
Communicative language teaching
 
Communicative Language Teaching
Communicative Language TeachingCommunicative Language Teaching
Communicative Language Teaching
 
Communicative language teaching
Communicative language teachingCommunicative language teaching
Communicative language teaching
 
Reception theory
Reception theoryReception theory
Reception theory
 
British Literature Project
British Literature ProjectBritish Literature Project
British Literature Project
 
The American Literature: A Throwback to the Rich History of Now the Most Powe...
The American Literature: A Throwback to the Rich History of Now the Most Powe...The American Literature: A Throwback to the Rich History of Now the Most Powe...
The American Literature: A Throwback to the Rich History of Now the Most Powe...
 
Audience reception theory
Audience reception theoryAudience reception theory
Audience reception theory
 
British Literature Introduction
British Literature IntroductionBritish Literature Introduction
British Literature Introduction
 
Reception theory
Reception theoryReception theory
Reception theory
 
British Literature
British LiteratureBritish Literature
British Literature
 
Introduction to american literature
Introduction to american literatureIntroduction to american literature
Introduction to american literature
 
Media Reception theory
Media Reception theoryMedia Reception theory
Media Reception theory
 
Intro. of ob
Intro. of  obIntro. of  ob
Intro. of ob
 
Affective stylistics
Affective stylisticsAffective stylistics
Affective stylistics
 
Reader response
Reader responseReader response
Reader response
 
Reception theory
Reception theoryReception theory
Reception theory
 
Reader response and reception theory
Reader response and reception theoryReader response and reception theory
Reader response and reception theory
 
2.philip larkin _the_trees
2.philip larkin _the_trees2.philip larkin _the_trees
2.philip larkin _the_trees
 
British literature through time
British literature through timeBritish literature through time
British literature through time
 
Stuart Hall’s Reception Theory
Stuart Hall’s Reception TheoryStuart Hall’s Reception Theory
Stuart Hall’s Reception Theory
 

Ähnlich wie Intra- and interdisciplinary cross-concordances for information retrieval

Search term recommendation and non-textual ranking evaluated
 Search term recommendation and non-textual ranking evaluated Search term recommendation and non-textual ranking evaluated
Search term recommendation and non-textual ranking evaluatedGESIS
 
Philosophy of IR Evaluation Ellen Voorhees
Philosophy of IR Evaluation Ellen VoorheesPhilosophy of IR Evaluation Ellen Voorhees
Philosophy of IR Evaluation Ellen Voorheesk21jag
 
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...GESIS
 
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...Parang Saraf
 
Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...
Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...
Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...JohannWanja
 
Visualizing the Transcribe Bentham Corpus
Visualizing the Transcribe Bentham CorpusVisualizing the Transcribe Bentham Corpus
Visualizing the Transcribe Bentham CorpusUCLDH
 
Data and Software in Scientific Activities: a Literature Review
Data and Software in Scientific Activities: a Literature ReviewData and Software in Scientific Activities: a Literature Review
Data and Software in Scientific Activities: a Literature ReviewKai Li
 
Semantics-enhanced Cyberinfrastructure for ICMSE : Interoperability, Analyti...
Semantics-enhanced Cyberinfrastructure for ICMSE :  Interoperability, Analyti...Semantics-enhanced Cyberinfrastructure for ICMSE :  Interoperability, Analyti...
Semantics-enhanced Cyberinfrastructure for ICMSE : Interoperability, Analyti...Artificial Intelligence Institute at UofSC
 
TopicModels_BleiPaper_Summary.pptx
TopicModels_BleiPaper_Summary.pptxTopicModels_BleiPaper_Summary.pptx
TopicModels_BleiPaper_Summary.pptxKalpit Desai
 
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information SystemsBibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information SystemsGESIS
 
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...Sergey Sosnovsky
 
empirical-SLR.pptx
empirical-SLR.pptxempirical-SLR.pptx
empirical-SLR.pptxJitha Kannan
 
UKSG webinar - Introduction to Text-Mining Research Papers with Petr Knoth an...
UKSG webinar - Introduction to Text-Mining Research Papers with Petr Knoth an...UKSG webinar - Introduction to Text-Mining Research Papers with Petr Knoth an...
UKSG webinar - Introduction to Text-Mining Research Papers with Petr Knoth an...UKSG: connecting the knowledge community
 
Keynote Exploring and Exploiting Official Publications
Keynote Exploring and Exploiting Official PublicationsKeynote Exploring and Exploiting Official Publications
Keynote Exploring and Exploiting Official Publicationsmaartenmarx
 
2012.10 - Workshop on Semantic Statistics - 1
2012.10 - Workshop on Semantic Statistics - 12012.10 - Workshop on Semantic Statistics - 1
2012.10 - Workshop on Semantic Statistics - 1Dr.-Ing. Thomas Hartmann
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the WebRinke Hoekstra
 
Rec4LRW – Scientific Paper Recommender System for Literature Review and Writing
Rec4LRW – Scientific Paper Recommender System for Literature Review and WritingRec4LRW – Scientific Paper Recommender System for Literature Review and Writing
Rec4LRW – Scientific Paper Recommender System for Literature Review and WritingAravind Sesagiri Raamkumar
 

Ähnlich wie Intra- and interdisciplinary cross-concordances for information retrieval (20)

Search term recommendation and non-textual ranking evaluated
 Search term recommendation and non-textual ranking evaluated Search term recommendation and non-textual ranking evaluated
Search term recommendation and non-textual ranking evaluated
 
Philosophy of IR Evaluation Ellen Voorhees
Philosophy of IR Evaluation Ellen VoorheesPhilosophy of IR Evaluation Ellen Voorhees
Philosophy of IR Evaluation Ellen Voorhees
 
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
 
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
 
Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...
Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...
Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling ...
 
Öppen data och forskningens genomslag
Öppen data och forskningens genomslagÖppen data och forskningens genomslag
Öppen data och forskningens genomslag
 
Visualizing the Transcribe Bentham Corpus
Visualizing the Transcribe Bentham CorpusVisualizing the Transcribe Bentham Corpus
Visualizing the Transcribe Bentham Corpus
 
Data and Software in Scientific Activities: a Literature Review
Data and Software in Scientific Activities: a Literature ReviewData and Software in Scientific Activities: a Literature Review
Data and Software in Scientific Activities: a Literature Review
 
Semantics-enhanced Cyberinfrastructure for ICMSE : Interoperability, Analyti...
Semantics-enhanced Cyberinfrastructure for ICMSE :  Interoperability, Analyti...Semantics-enhanced Cyberinfrastructure for ICMSE :  Interoperability, Analyti...
Semantics-enhanced Cyberinfrastructure for ICMSE : Interoperability, Analyti...
 
TopicModels_BleiPaper_Summary.pptx
TopicModels_BleiPaper_Summary.pptxTopicModels_BleiPaper_Summary.pptx
TopicModels_BleiPaper_Summary.pptx
 
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information SystemsBibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
 
2012.10 - DDI Lifecycle - Moving Forward
2012.10 - DDI Lifecycle - Moving Forward2012.10 - DDI Lifecycle - Moving Forward
2012.10 - DDI Lifecycle - Moving Forward
 
Szomszor "Methods and Tools for Scholarly Data Analytics"
Szomszor "Methods and Tools for Scholarly Data Analytics"Szomszor "Methods and Tools for Scholarly Data Analytics"
Szomszor "Methods and Tools for Scholarly Data Analytics"
 
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
 
empirical-SLR.pptx
empirical-SLR.pptxempirical-SLR.pptx
empirical-SLR.pptx
 
UKSG webinar - Introduction to Text-Mining Research Papers with Petr Knoth an...
UKSG webinar - Introduction to Text-Mining Research Papers with Petr Knoth an...UKSG webinar - Introduction to Text-Mining Research Papers with Petr Knoth an...
UKSG webinar - Introduction to Text-Mining Research Papers with Petr Knoth an...
 
Keynote Exploring and Exploiting Official Publications
Keynote Exploring and Exploiting Official PublicationsKeynote Exploring and Exploiting Official Publications
Keynote Exploring and Exploiting Official Publications
 
2012.10 - Workshop on Semantic Statistics - 1
2012.10 - Workshop on Semantic Statistics - 12012.10 - Workshop on Semantic Statistics - 1
2012.10 - Workshop on Semantic Statistics - 1
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the Web
 
Rec4LRW – Scientific Paper Recommender System for Literature Review and Writing
Rec4LRW – Scientific Paper Recommender System for Literature Review and WritingRec4LRW – Scientific Paper Recommender System for Literature Review and Writing
Rec4LRW – Scientific Paper Recommender System for Literature Review and Writing
 

Mehr von GESIS

10th BIR Workshop @ECIR 2020: introduction
10th  BIR Workshop @ECIR 2020: introduction10th  BIR Workshop @ECIR 2020: introduction
10th BIR Workshop @ECIR 2020: introductionGESIS
 
From closed to open access: A case study of flipped journals
From closed to open access: A case study of flipped journalsFrom closed to open access: A case study of flipped journals
From closed to open access: A case study of flipped journalsGESIS
 
Highly cited references in PLOS ONE and their in-text usage over time
Highly cited references in PLOS ONE and their in-text usage over timeHighly cited references in PLOS ONE and their in-text usage over time
Highly cited references in PLOS ONE and their in-text usage over timeGESIS
 
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...GESIS
 
Bibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
Bibliometric-enhanced Information Retrieval: Connecting IR with BibliometricsBibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
Bibliometric-enhanced Information Retrieval: Connecting IR with BibliometricsGESIS
 
Analyzing the network structure and gender differences of the “NKOS community”
Analyzing the network structure and gender differences of the “NKOS community”Analyzing the network structure and gender differences of the “NKOS community”
Analyzing the network structure and gender differences of the “NKOS community”GESIS
 
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...GESIS
 
Searching beyond datasets in the Social Sciences
Searching beyond datasets in the Social SciencesSearching beyond datasets in the Social Sciences
Searching beyond datasets in the Social SciencesGESIS
 
Bedeutung von Text Mining am Beispiel der Sozialwissenschaften
Bedeutung von Text Mining am Beispiel der SozialwissenschaftenBedeutung von Text Mining am Beispiel der Sozialwissenschaften
Bedeutung von Text Mining am Beispiel der SozialwissenschaftenGESIS
 
Contextualised Browsing in a Digital Library’s Living Lab
Contextualised Browsing in a Digital Library’s Living LabContextualised Browsing in a Digital Library’s Living Lab
Contextualised Browsing in a Digital Library’s Living LabGESIS
 
41st European Conference on Information Retrieval (ECIR 2019)
41st European Conference on Information Retrieval (ECIR 2019)41st European Conference on Information Retrieval (ECIR 2019)
41st European Conference on Information Retrieval (ECIR 2019)GESIS
 
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...GESIS
 
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...GESIS
 
Challenges in Extracting and Managing References
Challenges in Extracting and Managing ReferencesChallenges in Extracting and Managing References
Challenges in Extracting and Managing ReferencesGESIS
 
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...GESIS
 
Recent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information RetrievalRecent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information RetrievalGESIS
 
Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...GESIS
 
Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016GESIS
 
Recent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization SystemsRecent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization SystemsGESIS
 
Using co-authorship networks for author name disambiguation
Using co-authorship networks for author name disambiguationUsing co-authorship networks for author name disambiguation
Using co-authorship networks for author name disambiguationGESIS
 

Mehr von GESIS (20)

10th BIR Workshop @ECIR 2020: introduction
10th  BIR Workshop @ECIR 2020: introduction10th  BIR Workshop @ECIR 2020: introduction
10th BIR Workshop @ECIR 2020: introduction
 
From closed to open access: A case study of flipped journals
From closed to open access: A case study of flipped journalsFrom closed to open access: A case study of flipped journals
From closed to open access: A case study of flipped journals
 
Highly cited references in PLOS ONE and their in-text usage over time
Highly cited references in PLOS ONE and their in-text usage over timeHighly cited references in PLOS ONE and their in-text usage over time
Highly cited references in PLOS ONE and their in-text usage over time
 
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
 
Bibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
Bibliometric-enhanced Information Retrieval: Connecting IR with BibliometricsBibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
Bibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
 
Analyzing the network structure and gender differences of the “NKOS community”
Analyzing the network structure and gender differences of the “NKOS community”Analyzing the network structure and gender differences of the “NKOS community”
Analyzing the network structure and gender differences of the “NKOS community”
 
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
 
Searching beyond datasets in the Social Sciences
Searching beyond datasets in the Social SciencesSearching beyond datasets in the Social Sciences
Searching beyond datasets in the Social Sciences
 
Bedeutung von Text Mining am Beispiel der Sozialwissenschaften
Bedeutung von Text Mining am Beispiel der SozialwissenschaftenBedeutung von Text Mining am Beispiel der Sozialwissenschaften
Bedeutung von Text Mining am Beispiel der Sozialwissenschaften
 
Contextualised Browsing in a Digital Library’s Living Lab
Contextualised Browsing in a Digital Library’s Living LabContextualised Browsing in a Digital Library’s Living Lab
Contextualised Browsing in a Digital Library’s Living Lab
 
41st European Conference on Information Retrieval (ECIR 2019)
41st European Conference on Information Retrieval (ECIR 2019)41st European Conference on Information Retrieval (ECIR 2019)
41st European Conference on Information Retrieval (ECIR 2019)
 
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
 
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
 
Challenges in Extracting and Managing References
Challenges in Extracting and Managing ReferencesChallenges in Extracting and Managing References
Challenges in Extracting and Managing References
 
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
 
Recent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information RetrievalRecent Advances in Bibliometric-Enhanced Information Retrieval
Recent Advances in Bibliometric-Enhanced Information Retrieval
 
Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...Analyzing the research output presented at European Networked Knowledge Organ...
Analyzing the research output presented at European Networked Knowledge Organ...
 
Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016Introduction to the 15th NKOS workshop @TPDL2016
Introduction to the 15th NKOS workshop @TPDL2016
 
Recent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization SystemsRecent applications of Knowledge Organization Systems
Recent applications of Knowledge Organization Systems
 
Using co-authorship networks for author name disambiguation
Using co-authorship networks for author name disambiguationUsing co-authorship networks for author name disambiguation
Using co-authorship networks for author name disambiguation
 

Kürzlich hochgeladen

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 

Kürzlich hochgeladen (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

Intra- and interdisciplinary cross-concordances for information retrieval

  • 1. 1 Intra- and interdisciplinary cross- concordances for information retrieval Philipp Mayr GESIS – Leibniz Institute for the Social Sciences, Bonn, Germany 8th NKOS Workshop at the 13th ECDL Conference Corfu, Greece, 01. October 2009
  • 2. 2 KoMoHe Project (2004-2007) KoMoHe (Competence Center Modeling and Treatment of Semantic Heterogeneity) Goals: – Models for searching heterogeneous collections – Development, organization & management of cross-walks between controlled vocabularies – IR evaluation of the mappings (effectiveness of intellectual mapping)
  • 3. 3 Relations • Equivalence • Narrower Term • Broader Term • Related Term • Null: no mapping manually created, directed relations between controlled terms of two knowledge organization systems (KOS) KOS 1 Relation KOS 2 Library = Bibliothéque Library > Special library Thesaurus < KOS Hacker ^ Computers + Security Virus 0
  • 4. 4 Cross-concordances • 25 Vocabularies in 64 cross-concordances – Thesauri (16) – Descriptor lists (4) – Classifications (3) – Subject heading lists (2) • 380,000 mapped terms • 465,000 relations • 205,000 equivalence relations • 13 German, 8 English, 1 Russian, 3 multilingual
  • 5. 5 Disciplines Social Sciences (10) Gerontology (1) Universal (3) Psychology (1) Pedagogics (1) Sports science (2) Economics (2) Political science (3) Medicine (1)Agricultural science (1) Information science (1)
  • 6. 6 Net of Cross-concordances Each node represents a KOS
  • 7. 7 Objectives • Translate search terms into other terminologies • Increase diversity of documents from different databases • Improve search experience without effort for searcher • Test the effect for IR in different disciplines (social science and others)
  • 8. 8 Main questions • What is examined? – the quality of the mappings – or the quality of the associated search • Can we enable distributed search with the subject access tools over several information systems? – In one discipline – Between at least two disciplines • Is the impact of terminology mapping on recall and precision measurable? • The mappings are helpful to whom?
  • 9. 9 Information Retrieval Test Question: How effective are the mappings in an actual search? Does the application of term mappings improve search over a non-transformed subject (i.e. controlled vocabulary) search?
  • 10. 10 Information Retrieval Tests • Thesauri mappings only • Only equivalence relations • Real queries (~6 per tested cross-concordance) • Databases: 80,000 – 16 mio. documents • Test 1 (CT  TT): 13 Cross-concordances • Test 2 (FT  FT+TT): 8 Cross-concordances
  • 12. 12 Steps • Requesting recent research topics from our partners (social science and others) • Intellectually translating the topics into controlled term searches in a KOS A • Automatically translating the controlled terms via HTS into the controlled terms of a KOS B • Retrieving documents from two runs 1. Controlled term (CT) search (KOS A) in database B 2. Translated term (TT) search (KOS B) in database B
  • 13. 13 Information Retrieval Test CT-TT DB A Term a Term b Term c … Term n DB B Term a Term b Term c … Term n HTS Terms Voc A Terms Voc B DB A Term a Term b Term c … Term n DB B Terms Voc A Scenario CT Scenario TT HTS (Heterogeneity Service) ~ Web service providing the mappings Run 1 Run 2
  • 14. 14 Information Retrieval Tests Test 1 Intradisciplinary: Social sc. – Social sc. TheSoz – DZI DZI – TheSoz TheSoz – SWD SWD – TheSoz CSA – TheSoz • 5 concordances • 3 databases • 35 topics Test 3 Interdisciplinary: Int. Relations – Economics Medical sc. – Psychology IBLK – STW STW – IBLK Mesh – Psyndex Psyndex – Mesh • 4 concordances • 4 databases • 28 topics Test 2 Interdisciplinary: Social sc. – Psychology Social sc. – Economics TheSoz – Psyndex Psyndex – TheSoz TheSoz – STW STW – TheSoz • 4 concordances • 3 databases • 19 topics
  • 15. 15 Methodology • Downloading the documents for both runs (CT, TT), cutt-off: 1,000 docs • Pooling both runs (CT, TT) for each topic • Importing the documents into a assessment tool • Relevance assessment of the documents by experts • Analysis of the assessment data – Retrieved: average number of retrieved documents (across all search types) – Relevant: average number of relevant retrieved documents (across all search types) – Rel_ret: average number of relevant retrieved documents for a particular search type – Recall: proportion of relevant retrieved documents out of all relevant documents (averaged across all queries of one search type) – Precision: proportion of relevant retrieved documents out of all retrieved documents (averaged across all queries of one search type)
  • 16. 16 Assessment of the documents: by experts
  • 17. 17 Information Retrieval Tests - Results • CT  TT (Improvements in %) Recall = Hitrate Precision = Accuracy Intradisciplinary +39% +34% Interdisciplinary +136% +68% Recall = Hitrate Precision = Accuracy Intradisciplinary +20% -12% Interdisciplinary +24% -24% • FT  FT+TT (Improvements in %) Detailled results can be found in Mayr & Petras, 2008
  • 18. 18 Discussion • Overlap and more identical terms in intradisciplinary mappings – Mapping in one discipline is simpler: just one expert – Lesser effect on search – Automatic mapping may be more useful in intradisciplinary sets: mainly syntactic matching • Language plays a major role – we had just one bilingual mapping in the test • Restrictions of the study: no real users or interactions, only thesauri, KOS in German
  • 19. 19 Summary Why are cross-concordances in one discipline less effective for IR? • Amount of identical terms are significantly higher in one discipline (one language) • No effective transformation possible for IR, if you have identical terms Mapping projects should more often perform IR tests to measure the effect of their mappings.
  • 20. 20 Conclusion • Cross-concordances improve subject search with controlled terms & free-text search: larger measurable effects on interdisciplinary mappings • Only 24% relations utilized (equivalence) • Potential: – Other relations – STR  CT translation • More mappings which are not evaluated • Mappings are used e.g. in portals like sowiport, vascoda, ireon, … and other projects
  • 21. 21 Next steps • Visualization of the terminology network • Combined evaluation with other value- added services (search term recommendation) • Conversion to SKOS • Evaluation of other disciplines • Evaluation of indirect term transformation (term – switching term – end term)
  • 22. 22 Publications Mayr, Philipp; Petras, Vivien (2008): Cross-concordances: terminology mapping and its effectiveness for information retrieval. In: 74th IFLA World Library and Information Congress. Québec, Canada- http://www.ifla.org/IV/ifla74/papers/129-Mayr_Petras- en.pdf Mayr, Philipp; Mutschke, Peter; Petras, Vivien (2008): Reducing semantic complexity in distributed Digital Libraries: treatment of term vagueness and document re- ranking. In: Library Review. 57 (2008) 3. pp. 213-224. http://arxiv.org/abs/0712.2449
  • 23. 23 Indirect term transformations Social sciences – gerontology – medicine