Web of Science, Scopus, Dimensions, and beyond:
The evolving landscape of bibliometric data sources
Ludo Waltman, Martijn Visser, Nees Jan van Eck
Centre for Science and Technology Studies (CWTS), Leiden University
ROI-AV Conference: Visuals and Analytics that Matter
Copenhagen, Denmark
October 3, 2018
Outline
• Bibliometric data sources
• New opportunities for bibliometric visualization
Bibliometric data sources
Introduction
• Increasing number of alternatives (Microsoft Academic, Dimensions,
Crossref, OpenCitations Corpus) to traditional bibliographic data sources
(Web of Science, Scopus, Google Scholar)
• Alternative data sources are more open than traditional ones
• How do the various data sources compare in terms of the completeness and
quality of their citation data?
Data sources
• Scopus
– May 2018
– Requires subscription
• Web of Science
– SCIE, SSCI, AHCI, CPCI
– June 2018
– Requires subscription
• Dimensions
– June 2018
– Openly available through web interface
• Crossref
– August 2018
– Openly available through API
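As an illustration of the API access mentioned above, here is a minimal Python sketch. It assumes the public Crossref REST API endpoint `https://api.crossref.org/works/{DOI}`, whose JSON response carries the work record under `message`, with a `reference` list when references are available. The function names and the sample record are hypothetical and only serve the illustration.

```python
import json
import urllib.parse
import urllib.request

CROSSREF_WORKS = "https://api.crossref.org/works/"  # public REST endpoint


def work_url(doi: str) -> str:
    """Build the request URL for one DOI (slashes are kept, other characters quoted)."""
    return CROSSREF_WORKS + urllib.parse.quote(doi, safe="/")


def fetch_work(doi: str) -> dict:
    """Download one work record; requires network access."""
    with urllib.request.urlopen(work_url(doi)) as response:
        return json.load(response)["message"]


def cited_dois(work: dict) -> list:
    """Return the DOIs of the matched references in a work record.

    References without a "DOI" field (unmatched) are skipped, and a record
    without a "reference" list (not deposited or not open) yields [].
    """
    return [ref["DOI"].lower() for ref in work.get("reference", []) if "DOI" in ref]


# Hypothetical record, shaped like the "message" part of a response.
sample = {
    "DOI": "10.1234/example.1",
    "reference": [
        {"key": "ref1", "DOI": "10.1234/EXAMPLE.2"},
        {"key": "ref2", "unstructured": "A reference that was not matched to a DOI"},
    ],
}
print(cited_dois(sample))
```

Only the first reference in the sample is returned, because the second one has no DOI; this mirrors the unmatched references discussed later in the deck.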
Coverage of publications
                 All publications   Publications with DOI   Publications with unique DOI
Web of Science   40.1   100.0%      18.8    46.9%           18.8    46.9%
Scopus           44.9   100.0%      31.1    69.2%           30.6    68.3%
Dimensions       57.5   100.0%      55.1    95.9%           55.0    95.6%
Crossref         57.3   100.0%      57.3   100.0%           57.3   100.0%
• Publication counts in millions
• Time period 1996-2017
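The three column types of the coverage table can be sketched in Python as follows. This is not the procedure used for the slide; it assumes that a DOI counts as "unique" when exactly one record in the source carries it, and the record layout (`id`, `doi`) is hypothetical.

```python
from collections import Counter


def doi_coverage(records):
    """Count all records, records with a DOI, and records with a unique DOI.

    Assumption: a DOI is "unique" when exactly one record carries it.
    DOIs are compared case-insensitively.
    """
    dois = [r["doi"].lower() for r in records if r.get("doi")]
    counts = Counter(dois)
    unique = sum(1 for d in dois if counts[d] == 1)
    total = len(records)
    return {
        "all": total,
        "with_doi": len(dois),
        "with_unique_doi": unique,
        "share_with_doi": round(100 * len(dois) / total, 1),
        "share_with_unique_doi": round(100 * unique / total, 1),
    }


# Hypothetical records: two share a DOI (differing only in case), one has none.
records = [
    {"id": 1, "doi": "10.1234/a"},
    {"id": 2, "doi": "10.1234/b"},
    {"id": 3, "doi": "10.1234/B"},
    {"id": 4, "doi": None},
]
print(doi_coverage(records))
```

Percentages recomputed from the rounded counts in the table may differ from the slide in the last digit, since the slide was presumably computed from unrounded counts.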
Coverage of publications: Dimensions vs. Scopus
Comparison of citation data
Scopus-WoS overlap: 460.0M
Only in Scopus: 24.9M
Only in WoS: 15.5M
Scopus-Dimensions overlap: 414.3M
Only in Scopus: 43.5M
Only in Dimensions: 17.9M
Scopus-Crossref overlap: 171.3M
Only in Scopus: 292.1M
Only in Crossref: 6.4M
In these pairwise comparisons of data sources, only
citation links between citing and cited publications
indexed in both data sources are considered
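The restriction described in this note can be sketched in Python with set operations. This is a minimal illustration, assuming each source is available as a set of publication identifiers plus a set of (citing, cited) pairs over the same identifiers; the function name and the toy data are hypothetical.

```python
def compare_citation_links(pubs_a, links_a, pubs_b, links_b):
    """Compare two sources on the publications that both of them index.

    pubs_*  : set of publication identifiers (e.g. DOIs)
    links_* : set of (citing, cited) identifier pairs
    """
    shared = pubs_a & pubs_b

    def restrict(links):
        # Keep a link only if both of its endpoints are indexed in both sources.
        return {(s, t) for s, t in links if s in shared and t in shared}

    a, b = restrict(links_a), restrict(links_b)
    return {"overlap": len(a & b), "only_a": len(a - b), "only_b": len(b - a)}


# Toy data: p4 is indexed only in source A, p5 only in source B.
pubs_a = {"p1", "p2", "p3", "p4"}
pubs_b = {"p1", "p2", "p3", "p5"}
links_a = {("p1", "p2"), ("p1", "p3"), ("p4", "p1")}
links_b = {("p1", "p2"), ("p2", "p3"), ("p5", "p1")}
print(compare_citation_links(pubs_a, links_a, pubs_b, links_b))
```

Links that involve p4 or p5 are dropped before the comparison, so differences in publication coverage do not show up as differences in citation data.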
Causes of discrepancies between data sources
• Inaccuracies in references
• Inaccuracies in reference data
• Inaccuracies in citation matching
• Multiple versions of a publication
• Multiple records for a publication
• References not having been deposited, being closed or not having been
matched
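Several of the causes above come down to how a cited reference is matched to a publication record. As a hedged sketch of the simplest case, matching on a normalized DOI, the Python below assumes references and records are plain dictionaries; real citation matching also relies on other fields (authors, year, volume, pages), which this sketch ignores.

```python
DOI_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def normalize_doi(raw):
    """Reduce a DOI string to a bare lowercase form, or None if it is empty."""
    if not raw:
        return None
    doi = raw.strip().lower()  # DOIs are case-insensitive
    for prefix in DOI_PREFIXES:
        if doi.startswith(prefix):
            doi = doi[len(prefix):]
            break
    return doi.strip() or None


def match_reference(reference, index):
    """Look up a cited reference in an index of normalized DOI -> record id."""
    doi = normalize_doi(reference.get("doi"))
    return index.get(doi) if doi else None


# Hypothetical index and references.
index = {"10.1234/abc": "rec-1"}
print(match_reference({"doi": "DOI:10.1234/ABC"}, index))   # matched via the DOI
print(match_reference({"title": "No DOI given"}, index))    # stays unmatched
```

A reference without a usable DOI stays unmatched here, which is one way the gaps listed above can arise.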
Example: Discrepancies between Scopus and
Dimensions
Example: Discrepancies between Scopus and
Dimensions
Example: Discrepancies between Scopus and Web of
Science
Group author and/or supplement
seem to cause problems in Web
of Science
Example: Discrepancies within Web of Science
September 20, 2017
November 1, 2017
November 8, 2017
Example: Unmatched references in Crossref
Conclusions
• Substantial discrepancies between data sources
• Reasonably complete citation data in Dimensions
• Large gaps in citation data in Crossref, due to references not having been
deposited, being closed or not having been matched
• Need for transparent high-quality citation matching algorithm
• Completeness and quality of other metadata?
New opportunities for bibliometric visualization
VOSviewer
Users of VOSviewer
• Researchers
• Research institutions
• Research funders
• Scientific publishers
• Industry
Data sources supported by VOSviewer
Bibliometric networks
Bibliographic data source: WoS, Scopus, Dimensions, PubMed, Crossref
• Citation network of pubs / journals / authors / orgs / countries
• Co-authorship network of authors / orgs / countries
• Co-citation network of pubs / journals / authors / orgs / countries
• Co-occurrence network of keywords / terms
• Bibliographic coupling network of pubs / journals / authors
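Two of the network types listed above, co-citation and bibliographic coupling, can be derived from the same set of citation links. The Python sketch below uses the standard definitions (two publications are co-cited when a third cites both; two publications are coupled when they share a reference). It is not VOSviewer's implementation, and the toy links are hypothetical.

```python
from collections import Counter
from itertools import combinations


def co_citation(links):
    """Count, for each pair of publications, how many publications cite both."""
    refs_of = {}
    for citing, cited in links:
        refs_of.setdefault(citing, set()).add(cited)
    counts = Counter()
    for refs in refs_of.values():
        counts.update(combinations(sorted(refs), 2))
    return counts


def bibliographic_coupling(links):
    """Count, for each pair of publications, how many references they share."""
    citers_of = {}
    for citing, cited in links:
        citers_of.setdefault(cited, set()).add(citing)
    counts = Counter()
    for citers in citers_of.values():
        counts.update(combinations(sorted(citers), 2))
    return counts


# Toy citation links as (citing, cited) pairs.
links = {("A", "X"), ("A", "Y"), ("B", "X"), ("B", "Y"), ("C", "Y")}
print(co_citation(links))
print(bibliographic_coupling(links))
```

In the toy data, X and Y are co-cited twice (by A and by B), while A and B are coupled with strength 2 because both cite X and Y.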
Types of networks supported by each data source
[Table: rows Web of Science, Scopus, Dimensions, PubMed, Crossref; columns Co-authorship, Co-occurrence, Citation, Bibliographic coupling, Co-citation]
Journal citation network based on Crossref data
Demonstration
• Journals:
– Journal of Informetrics (Elsevier)
– Scientometrics (Springer Nature)
• Time period: 2007-2016
• Data sources:
– Dimensions
– Crossref
– OCC (OpenCitations Corpus)
– COCI (OpenCitations Index of Crossref open DOI-to-DOI references)
Coverage of publications and citations for each data
source
Wish list for improving open data sources
• Expanding coverage of publications (OCC)
• Opening citations (Crossref)
• Opening other metadata, e.g. abstracts (Dimensions, Crossref, OCC)
• Improving completeness and standardization of metadata (Crossref)
• Speeding up APIs
Toward contextualized scientometrics
• Integrating data collection and visual analysis
• Interactive visual exploration (e.g., drilling down from high-level visual
overviews to underlying data)
• Large-scale visual analyses
• Moving from visualization as a tool for representation to visualization as a
tool for exploration, discussion and reflection
Supporting open citations
Supporting open citations
www.issi-society.org/open-citations-letter/
April 26, 2018:
• 324 signatories
• 46 countries
Supporting open citations
www.issi-society.org/blog/posts/2018/april/open-citations-to-open-science/
Thank you for your attention!

Editor's Notes

  1. It is not certain why so many citation links are missing in WoS. Some references that are very similar to the ones above are linked in WoS. The cause is probably related to the group author and/or the supplement.