SlideShare ist ein Scribd-Unternehmen logo
1 von 34
Downloaden Sie, um offline zu lesen
Accuracy of citation data in Web of
Science and Scopus
Nees Jan van Eck and Ludo Waltman
Centre for Science and Technology Studies, Leiden University, Leiden (The Netherlands)
16th International Conference on Scientometrics & Informetrics
Wuhan, China, October 19, 2017
Introduction
• Question: Can we trust citation counts in WoS and
Scopus?
• Aim: To determine accuracy of citation data in WoS
and Scopus
– Accuracy of reference data
– Accuracy of citation matching
• Approach: Comparison of references in full text of
Elsevier publications with references in WoS and
Scopus
1
…
References
[1] Hirsch, JE (2005)
PNAS, 102, p.16569
[2] Egghe, L (2006)
Scientist, 20, p.15
…
Approach
2
…
References
[1] Hirsch, JE (2005)
PNAS, 102, p.16569
[2] Egghe, L (2006)
Scientist, 20, p.15
…
Original
(Elsevier) publication
WoS
…
References
[1] Hirsch, JE (2005)
PNAS, 102, p.16569
[2] Egghe, L (2006)
Scientist, 20, p.15
…
Scopus
…
References
[1] Hirsch, JE (2005)
PNAS, 102, p.16569
[2] Egghe, L (2006)
Scientist, 20, p.15
…
Elsevier data
• Elsevier ScienceDirect Article Retrieval API
• Subscription-based journal publications in period
1987-2016
• Publication and reference data in XML format
3
Linking Elsevier data with WoS and
Scopus
4
WoS Scopus
Time period 1987–2016 1996–2015
Document types article, review article, review,
conference paper
No. of linked publications 6M 5M
No. of references in Elsevier data 207M 172M
No. of references in WoS/Scopus 203M 170M
No. of linked references 136M 84M
Number of linked publications
5
Analysis based
on number of
references in
publication
6
Linked publications classified based
on number of references
7
WoS Scopus
Equal no. of references 77.2% 96.4%
More references 2.7% 1.2%
Fewer references 19.3% 1.2%
No references 0.8% 1.2%
More references in WoS
8
Analysis based
on linked
references
9
Linked references without corresponding
citation relation
10
Validation of linked references without
corresponding citation relation
• Random sample from 2015 publications
• WoS (60 cases)
– Missing reference: 33 (55.0%)
– Incorrect reference: 10 (16.7%)
– Error in reference: 16 (26.7%)
– No problem: 1 (1.5%)
• Scopus (30 cases)
– Missing reference: 6 (20.0%)
– Duplicate publications: 9 (30.0%)
– Citation matching problem: 15 (50.0%)
11
Missing references in WoS (1)
12
Missing references in WoS (1)
13
???
Missing references in WoS (2)
14
Missing references in WoS (2)
15
???
???
Incorrect references in WoS (1)
16
Incorrect references in WoS (1)
17
Incorrect references in WoS (1)
18
Incorrect references in WoS (2)
19
Original reference in publication WoS reference
J. Wang, J.K. Carson, M.F. North, D.J. Cleland, Int.
J. Heat Mass Transfer 49 (17) (2006) 3075–3083.
WANG J, 2006, CHINESE
CHEM LETT, V17, P49
Kanber B, Hartshorne TC, Horsfield MA, Naylor AR,
Robinson TG, Ramnarine KV. Dynamic variations in the
ultrasound gray-scale median of carotid artery
plaques. Cardiovasc Ultrasound 2013a;11:21.
KANBER B, 2013,
CEREBROVASC DIS S2, V35,
P21
Evans PD, Chowdhury MJA. Photoprotection of wood
using polyester-type UVabsorbers derived from the
reaction of 2 hydroxy-4(2,3-epoxypropoxy)-
benzophenone with dicarboxylic acid anhydrides. J
Wood Chem Technol 2010;30:186e204.
EVANS P, 2010, TLS-TIMES
LIT S 0326, P30
X. Li, S. Wang, Y. Chen, G. Liu, X. Yang,
Overexpression of CD40 in sacral chordomas and its
correlation with low tumor recurrence, Onkologie 36
(10) (2013) 567–571
LI XY, 2013, NANJING
NONGYE DAXUE, V36, P36
K. Zhang, H. Chen, G. Wu, K. Chen, H. Yang, High
expression of SPHK1 in sacral chordoma and
association with patients’ poor prognosis, Med.
Oncol. 31 (11) (2014) 247.
ZHANG K, 2014, IEEE T
PATTERN ANAL, V1, P1
Missing references in Scopus
20
Missing references in Scopus
21
No references
Duplicate publications in Scopus
22
Duplicate publications in Scopus
23
Duplicate publications in Scopus
24
Duplicate publications in Scopus
25
Citation matching problem in Scopus
26
27
Citation matching problem in Scopus
28
Citation matching problem in Scopus
Large
inaccuracies in
citation counts
of individual
publications
29
Interesting case
• Citation count (October 16, 2017):
– Scopus: 5,204
– WoS: 172
30
Differences in citation counts
between two versions of WoS
31
WoS
Dec 2016
WoS
Jun 2017
Newman, M.E.J., & Girvan, M. (2004). Finding
and evaluating community structure in
networks. Physical Review E, 69(2), 026113.
2,073 139
Newman, M.E.J. (2004). Fast algorithm for
detecting community structure in networks.
Physical Review E, 69(6), 066133.
436 1,070
Clauset, A., Newman, M.E.J., & Moore, C.
(2004). Finding community structure in very
large networks. Physical Review E, 70(6),
066111.
1,156 2,627
Conclusions
• Citation data suffers from significant inaccuracies
both in WoS and in Scopus
• WoS
– Incorrect references
– Missing references
• Scopus
– Duplicate publications
– Citation matching problems
• Both WoS and Scopus have inaccuracies in about 1%
of references
32
Thank you for your attention!
33

Weitere ähnliche Inhalte

Was ist angesagt?

Scientometric Analysis
Scientometric AnalysisScientometric Analysis
Scientometric Analysis
sumitbanshal
 

Was ist angesagt? (20)

Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...
Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...
Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...
 
Visualizing science based on open data sources
Visualizing science based on open data sourcesVisualizing science based on open data sources
Visualizing science based on open data sources
 
Advanced citation matching and large-scale cited reference extraction
Advanced citation matching and large-scale cited reference extractionAdvanced citation matching and large-scale cited reference extraction
Advanced citation matching and large-scale cited reference extraction
 
Large-scale visualization of science: Methods, tools, and applications
Large-scale visualization of science: Methods, tools, and applicationsLarge-scale visualization of science: Methods, tools, and applications
Large-scale visualization of science: Methods, tools, and applications
 
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunities
 
Intermediacy of publications
Intermediacy of publicationsIntermediacy of publications
Intermediacy of publications
 
Scientometric Analysis
Scientometric AnalysisScientometric Analysis
Scientometric Analysis
 
Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...
 
VOSviewer: A software tool for analyzing and visualizing scientific literature
VOSviewer: A software tool for analyzing and visualizing scientific literatureVOSviewer: A software tool for analyzing and visualizing scientific literature
VOSviewer: A software tool for analyzing and visualizing scientific literature
 
Comparing bibliographic data sources
Comparing bibliographic data sourcesComparing bibliographic data sources
Comparing bibliographic data sources
 
Toward open citations: Why, how, and when?
Toward open citations: Why, how, and when?Toward open citations: Why, how, and when?
Toward open citations: Why, how, and when?
 
Citation analysis: State of the art, good practices, and future developments
Citation analysis: State of the art, good practices, and future developmentsCitation analysis: State of the art, good practices, and future developments
Citation analysis: State of the art, good practices, and future developments
 
Science Mapping and Research Positioning
Science Mapping and Research PositioningScience Mapping and Research Positioning
Science Mapping and Research Positioning
 
Au 2015
Au 2015Au 2015
Au 2015
 
A systematic empirical comparison of different approaches for normalizing cit...
A systematic empirical comparison of different approaches for normalizing cit...A systematic empirical comparison of different approaches for normalizing cit...
A systematic empirical comparison of different approaches for normalizing cit...
 
Scientometrics and semantic maps for development (Author: Iina Hellsten)
Scientometrics and semantic maps for development (Author: Iina Hellsten)Scientometrics and semantic maps for development (Author: Iina Hellsten)
Scientometrics and semantic maps for development (Author: Iina Hellsten)
 
information visualisation and its application in scientometrics
information visualisation and its application in scientometricsinformation visualisation and its application in scientometrics
information visualisation and its application in scientometrics
 
Twlyon2015 poster
Twlyon2015 posterTwlyon2015 poster
Twlyon2015 poster
 
Scientometrics
Scientometrics Scientometrics
Scientometrics
 

Ähnlich wie Accuracy of citation data in Web of Science and Scopus

Publishing for impact, VLAG phd week
Publishing for impact, VLAG phd weekPublishing for impact, VLAG phd week
Publishing for impact, VLAG phd week
Wouter Gerritsma
 
Publication strategy for LEI
Publication strategy for LEIPublication strategy for LEI
Publication strategy for LEI
Wouter Gerritsma
 
Publishing for impact by RIKILT
Publishing for impact by RIKILTPublishing for impact by RIKILT
Publishing for impact by RIKILT
Wouter Gerritsma
 
One Entry to Research: critical assessment of Web of Sceince, Scopus and Goog...
One Entry to Research: critical assessment of Web of Sceince, Scopus and Goog...One Entry to Research: critical assessment of Web of Sceince, Scopus and Goog...
One Entry to Research: critical assessment of Web of Sceince, Scopus and Goog...
nabot
 
Bibliometric analyses on repository contents for the evaluation of research a...
Bibliometric analyses on repository contents for the evaluation of research a...Bibliometric analyses on repository contents for the evaluation of research a...
Bibliometric analyses on repository contents for the evaluation of research a...
marco.vanveller
 
A new role for libraries in research assessments
A new role for libraries in research assessmentsA new role for libraries in research assessments
A new role for libraries in research assessments
Wouter Gerritsma
 

Ähnlich wie Accuracy of citation data in Web of Science and Scopus (20)

What is your h-index and other measures of impact
What is your h-index and other measures of impactWhat is your h-index and other measures of impact
What is your h-index and other measures of impact
 
Publication strategy WASS
Publication strategy WASSPublication strategy WASS
Publication strategy WASS
 
Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...
Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...
Metrics vs peer review: Why metrics can (and should?) be applied in the Socia...
 
Publishing for impact, VLAG phd week
Publishing for impact, VLAG phd weekPublishing for impact, VLAG phd week
Publishing for impact, VLAG phd week
 
Haustein, S. (2016). Analyzing, measuring and visualizing the success of inte...
Haustein, S. (2016). Analyzing, measuring and visualizing the success of inte...Haustein, S. (2016). Analyzing, measuring and visualizing the success of inte...
Haustein, S. (2016). Analyzing, measuring and visualizing the success of inte...
 
Sti 2014 sivertsen
Sti 2014 sivertsenSti 2014 sivertsen
Sti 2014 sivertsen
 
Citation analysis for research evaluation
Citation analysis for research evaluationCitation analysis for research evaluation
Citation analysis for research evaluation
 
Research assessment, strategic planning and research monitoring in St.Petersb...
Research assessment, strategic planning and research monitoring in St.Petersb...Research assessment, strategic planning and research monitoring in St.Petersb...
Research assessment, strategic planning and research monitoring in St.Petersb...
 
Publication strategy for LEI
Publication strategy for LEIPublication strategy for LEI
Publication strategy for LEI
 
Why do we need to model the science system?
Why do we need to model the science system?Why do we need to model the science system?
Why do we need to model the science system?
 
Publishing for impact by RIKILT
Publishing for impact by RIKILTPublishing for impact by RIKILT
Publishing for impact by RIKILT
 
Indicators of Innovative Research (Klavans, Boyack, Small, Sorensen, Ioannidis)
Indicators of Innovative Research (Klavans, Boyack, Small, Sorensen, Ioannidis)Indicators of Innovative Research (Klavans, Boyack, Small, Sorensen, Ioannidis)
Indicators of Innovative Research (Klavans, Boyack, Small, Sorensen, Ioannidis)
 
Presentation for GIRS
Presentation for GIRSPresentation for GIRS
Presentation for GIRS
 
Russell Group PVCs 3 Jun 09
Russell Group PVCs 3 Jun 09Russell Group PVCs 3 Jun 09
Russell Group PVCs 3 Jun 09
 
Dewis gender diversity and inclusion
Dewis gender diversity and inclusion Dewis gender diversity and inclusion
Dewis gender diversity and inclusion
 
One Entry to Research: critical assessment of Web of Sceince, Scopus and Goog...
One Entry to Research: critical assessment of Web of Sceince, Scopus and Goog...One Entry to Research: critical assessment of Web of Sceince, Scopus and Goog...
One Entry to Research: critical assessment of Web of Sceince, Scopus and Goog...
 
Bibliometric analyses on repository contents for the evaluation of research a...
Bibliometric analyses on repository contents for the evaluation of research a...Bibliometric analyses on repository contents for the evaluation of research a...
Bibliometric analyses on repository contents for the evaluation of research a...
 
citation analysis vlag
citation analysis vlagcitation analysis vlag
citation analysis vlag
 
Bib & ethics wildgaard
Bib & ethics wildgaardBib & ethics wildgaard
Bib & ethics wildgaard
 
A new role for libraries in research assessments
A new role for libraries in research assessmentsA new role for libraries in research assessments
A new role for libraries in research assessments
 

Mehr von Nees Jan van Eck

Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...
Nees Jan van Eck
 

Mehr von Nees Jan van Eck (17)

Crossref as a source of open bibliographic metadata
Crossref as a source of open bibliographic metadataCrossref as a source of open bibliographic metadata
Crossref as a source of open bibliographic metadata
 
Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...
 
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
 
A scientometric perspective on university ranking
A scientometric perspective on university rankingA scientometric perspective on university ranking
A scientometric perspective on university ranking
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewer
 
A scientometric perspective on university ranking
A scientometric perspective on university rankingA scientometric perspective on university ranking
A scientometric perspective on university ranking
 
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingCWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewer
 
Using full-text data to create improved term maps
Using full-text data to create improved term mapsUsing full-text data to create improved term maps
Using full-text data to create improved term maps
 
How to design a ranking system: Criteria and opportunities for a comparison
How to design a ranking system: Criteria and opportunities for a comparisonHow to design a ranking system: Criteria and opportunities for a comparison
How to design a ranking system: Criteria and opportunities for a comparison
 
Advanced bibliometric software tools for publishers and editors
Advanced bibliometric software tools for publishers and editorsAdvanced bibliometric software tools for publishers and editors
Advanced bibliometric software tools for publishers and editors
 
Large-scale analysis of bibliometric networks
Large-scale analysis of bibliometric networksLarge-scale analysis of bibliometric networks
Large-scale analysis of bibliometric networks
 
Large-scale analysis of bibliometric data sources
Large-scale analysis of bibliometric data sourcesLarge-scale analysis of bibliometric data sources
Large-scale analysis of bibliometric data sources
 
On cluster stability
On cluster stabilityOn cluster stability
On cluster stability
 
Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...
 
Cluster stability
Cluster stabilityCluster stability
Cluster stability
 
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingCWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 

Kürzlich hochgeladen (20)

Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Introduction to Viruses
Introduction to VirusesIntroduction to Viruses
Introduction to Viruses
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 

Accuracy of citation data in Web of Science and Scopus

  • 1. Accuracy of citation data in Web of Science and Scopus Nees Jan van Eck and Ludo Waltman Centre for Science and Technology Studies, Leiden University, Leiden (The Netherlands) 16th International Conference on Scientometrics & Informetrics Wuhan, China, October 19, 2017
  • 2. Introduction • Question: Can we trust citation counts in WoS and Scopus? • Aim: To determine accuracy of citation data in WoS and Scopus – Accuracy of reference data – Accuracy of citation matching • Approach: Comparison of references in full text of Elsevier publications with references in WoS and Scopus 1
  • 3. … References [1] Hirsch, JE (2005) PNAS, 102, p.16569 [2] Egghe, L (2006) Scientist, 20, p.15 … Approach 2 … References [1] Hirsch, JE (2005) PNAS, 102, p.16569 [2] Egghe, L (2006) Scientist, 20, p.15 … Original (Elsevier) publication WoS … References [1] Hirsch, JE (2005) PNAS, 102, p.16569 [2] Egghe, L (2006) Scientist, 20, p.15 … Scopus … References [1] Hirsch, JE (2005) PNAS, 102, p.16569 [2] Egghe, L (2006) Scientist, 20, p.15 …
  • 4. Elsevier data • Elsevier ScienceDirect Article Retrieval API • Subscription-based journal publications in period 1987-2016 • Publication and reference data in XML format 3
  • 5. Linking Elsevier data with WoS and Scopus 4 WoS Scopus Time period 1987–2016 1996–2015 Document types article, review article, review, conference paper No. of linked publications 6M 5M No. of references in Elsevier data 207M 172M No. of references in WoS/Scopus 203M 170M No. of linked references 136M 84M
  • 6. Number of linked publications 5
  • 7. Analysis based on number of references in publication 6
  • 8. Linked publications classified based on number of references 7 WoS Scopus Equal no. of references 77.2% 96.4% More references 2.7% 1.2% Fewer references 19.3% 1.2% No references 0.8% 1.2%
  • 11. Linked references without corresponding citation relation 10
  • 12. Validation of linked references without corresponding citation relation • Random sample from 2015 publications • WoS (60 cases) – Missing reference: 33 (55.0%) – Incorrect reference: 10 (16.7%) – Error in reference: 16 (26.7%) – No problem: 1 (1.5%) • Scopus (30 cases) – Missing reference: 6 (20.0%) – Duplicate publications: 9 (30.0%) – Citation matching problem: 15 (50.0%) 11
  • 13. Missing references in WoS (1) 12
  • 14. Missing references in WoS (1) 13 ???
  • 15. Missing references in WoS (2) 14
  • 16. Missing references in WoS (2) 15 ??? ???
  • 20. Incorrect references in WoS (2) 19 Original reference in publication WoS reference J. Wang, J.K. Carson, M.F. North, D.J. Cleland, Int. J. Heat Mass Transfer 49 (17) (2006) 3075–3083. WANG J, 2006, CHINESE CHEM LETT, V17, P49 Kanber B, Hartshorne TC, Horsfield MA, Naylor AR, Robinson TG, Ramnarine KV. Dynamic variations in the ultrasound gray-scale median of carotid artery plaques. Cardiovasc Ultrasound 2013a;11:21. KANBER B, 2013, CEREBROVASC DIS S2, V35, P21 Evans PD, Chowdhury MJA. Photoprotection of wood using polyester-type UVabsorbers derived from the reaction of 2 hydroxy-4(2,3-epoxypropoxy)- benzophenone with dicarboxylic acid anhydrides. J Wood Chem Technol 2010;30:186e204. EVANS P, 2010, TLS-TIMES LIT S 0326, P30 X. Li, S. Wang, Y. Chen, G. Liu, X. Yang, Overexpression of CD40 in sacral chordomas and its correlation with low tumor recurrence, Onkologie 36 (10) (2013) 567–571 LI XY, 2013, NANJING NONGYE DAXUE, V36, P36 K. Zhang, H. Chen, G. Wu, K. Chen, H. Yang, High expression of SPHK1 in sacral chordoma and association with patients’ poor prognosis, Med. Oncol. 31 (11) (2014) 247. ZHANG K, 2014, IEEE T PATTERN ANAL, V1, P1
  • 22. Missing references in Scopus 21 No references
  • 30. Large inaccuracies in citation counts of individual publications 29
  • 31. Interesting case • Citation count (October 16, 2017): – Scopus: 5,204 – WoS: 172 30
  • 32. Differences in citation counts between two versions of WoS 31 WoS Dec 2016 WoS Jun 2017 Newman, M.E.J., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical Review E, 69(2), 026113. 2,073 139 Newman, M.E.J. (2004). Fast algorithm for detecting community structure in networks. Physical Review E, 69(6), 066133. 436 1,070 Clauset, A., Newman, M.E.J., & Moore, C. (2004). Finding community structure in very large networks. Physical Review E, 70(6), 066111. 1,156 2,627
  • 33. Conclusions • Citation data suffers from significant inaccuracies both in WoS and in Scopus • WoS – Incorrect references – Missing references • Scopus – Duplicate publications – Citation matching problems • Both WoS and Scopus have inaccuracies in about 1% of references 32
  • 34. Thank you for your attention! 33