SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Downloaden Sie, um offline zu lesen
The Distribution of References
in Scientific Papers:
an Analysis of the IMRaD Structure
ISSI 2013
Vienna, 16 July 2013
Marc Bertin, Iana Atanassova, Vincent Lariviere, Yves Gingras
Problem
Scientific papers usually follow a specific
rhetorical structure: the IMRaD structure
(Introduction, Method, Result and Discussion).
Questions:Questions:
What relationships exist between cited
references and the structure of the text?
How does the IMRaD structure affect the
distribution of references in scientific
papers?
Method
Corpus: 7 peer-reviewed academic journals:
PLoS series (ONE, Biology, Computational Biology,
Genetics, Medicine, Neglected Tropical Diseases,
Pathogens)
XML using Journal Article Tag Suite (JATS)XML using Journal Article Tag Suite (JATS)
More than 47,000 scientific articles
Identify the section structure of the articles
Identify cited references in the text
Study the distribution of references according
to the text progression and structure.
Sections Identification
• Section titles can vary according to the
article.
• e.g. "Method", "Methods", "Method and
Model"Model"
• Section titles were analyzed in order to
match each section with one of the
section types in the IMRaD structure.
Sentence Level Processing
We use sentences as basic units to model
text progression
Sentence segmentation allows us to work
with text elements that are smaller than
paragraphsparagraphs
Analysis of the punctuation of the text
following a set of typographic rules
For each sentence, we count the number of
references it contains and obtain their
distribution along the text.
Corpus
Cited References
Cited references are present as separate
elements in the XML structure
Special cases needing specific processing:
reference ranges
ResultsResults
PLoS ONE &
PLoS Computational Biology
PloS Genetics, PLoS
Pathogens & PLoS Biology
PLoS Medicine & PLoS
Neglected Tropical Diseases
IMRaD Structure
Conclusion
We have obtained the distribution of
cited references in scientific papers.
We have shown that this distribution
seems quite stable and maybe evenseems quite stable and maybe even
invariant if we take into account the
changes that occur in some journals in
the positions of the different sections in
the text of the articles.
Thank you!Thank you!

Weitere ähnliche Inhalte

Ähnlich wie The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...Iana Atanassova
 
Analysing Author Name Mentions In Citation Contexts Of Highly Cited Publications
Analysing Author Name Mentions In Citation Contexts Of Highly Cited PublicationsAnalysing Author Name Mentions In Citation Contexts Of Highly Cited Publications
Analysing Author Name Mentions In Citation Contexts Of Highly Cited PublicationsTye Rausch
 
briefly describe the main biochemical actions of telomerase and discuss.docx
briefly describe the main biochemical actions of telomerase and discuss.docxbriefly describe the main biochemical actions of telomerase and discuss.docx
briefly describe the main biochemical actions of telomerase and discuss.docxsdfghj21
 
A Citation Centric Annotation Scheme For Scientific Articles
A Citation Centric Annotation Scheme For Scientific ArticlesA Citation Centric Annotation Scheme For Scientific Articles
A Citation Centric Annotation Scheme For Scientific ArticlesAndrea Porter
 
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...Susanna-Assunta Sansone
 
Literature Review Matrix
Literature Review MatrixLiterature Review Matrix
Literature Review MatrixUWS Library
 
CHEM281 2012
CHEM281 2012CHEM281 2012
CHEM281 2012jda90
 
Data Mining in Rediology reports
Data Mining in Rediology reportsData Mining in Rediology reports
Data Mining in Rediology reportsSaeed Mehrabi
 
A knowledge capture framework for domain specific search systems
A knowledge capture framework for domain specific search systemsA knowledge capture framework for domain specific search systems
A knowledge capture framework for domain specific search systemsramakanz
 
DirectionsLocate the annotated bibliography and outline you.docx
DirectionsLocate the annotated bibliography and outline you.docxDirectionsLocate the annotated bibliography and outline you.docx
DirectionsLocate the annotated bibliography and outline you.docxkimberly691
 
3. learning from other and reviewing the literature
3. learning from other and reviewing the literature3. learning from other and reviewing the literature
3. learning from other and reviewing the literatureToni Montellano
 
Semantic Integration for Heterogeneous Domain-specific Information: The NIF Case
Semantic Integration for Heterogeneous Domain-specific Information: The NIF CaseSemantic Integration for Heterogeneous Domain-specific Information: The NIF Case
Semantic Integration for Heterogeneous Domain-specific Information: The NIF CaseNeuroscience Information Framework
 
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...Artificial Intelligence Institute at UofSC
 

Ähnlich wie The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013 (20)

Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
 
2015.ESP
2015.ESP2015.ESP
2015.ESP
 
Analysing Author Name Mentions In Citation Contexts Of Highly Cited Publications
Analysing Author Name Mentions In Citation Contexts Of Highly Cited PublicationsAnalysing Author Name Mentions In Citation Contexts Of Highly Cited Publications
Analysing Author Name Mentions In Citation Contexts Of Highly Cited Publications
 
briefly describe the main biochemical actions of telomerase and discuss.docx
briefly describe the main biochemical actions of telomerase and discuss.docxbriefly describe the main biochemical actions of telomerase and discuss.docx
briefly describe the main biochemical actions of telomerase and discuss.docx
 
A Citation Centric Annotation Scheme For Scientific Articles
A Citation Centric Annotation Scheme For Scientific ArticlesA Citation Centric Annotation Scheme For Scientific Articles
A Citation Centric Annotation Scheme For Scientific Articles
 
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
BioSharing overview - NIH bioCADDIE workshop on Common Data Elements, 8th May...
 
APA manual PPT 2
APA manual PPT 2APA manual PPT 2
APA manual PPT 2
 
Cocitation Networks and Random Walk
Cocitation Networks and Random WalkCocitation Networks and Random Walk
Cocitation Networks and Random Walk
 
Literature Review Matrix
Literature Review MatrixLiterature Review Matrix
Literature Review Matrix
 
CHEM281 2012
CHEM281 2012CHEM281 2012
CHEM281 2012
 
Ibn Sina
Ibn SinaIbn Sina
Ibn Sina
 
Data Mining in Rediology reports
Data Mining in Rediology reportsData Mining in Rediology reports
Data Mining in Rediology reports
 
A knowledge capture framework for domain specific search systems
A knowledge capture framework for domain specific search systemsA knowledge capture framework for domain specific search systems
A knowledge capture framework for domain specific search systems
 
DirectionsLocate the annotated bibliography and outline you.docx
DirectionsLocate the annotated bibliography and outline you.docxDirectionsLocate the annotated bibliography and outline you.docx
DirectionsLocate the annotated bibliography and outline you.docx
 
3. learning from other and reviewing the literature
3. learning from other and reviewing the literature3. learning from other and reviewing the literature
3. learning from other and reviewing the literature
 
Semantic Integration for Heterogeneous Domain-specific Information: The NIF Case
Semantic Integration for Heterogeneous Domain-specific Information: The NIF CaseSemantic Integration for Heterogeneous Domain-specific Information: The NIF Case
Semantic Integration for Heterogeneous Domain-specific Information: The NIF Case
 
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
 
Marcu 2000 presentation
Marcu 2000 presentationMarcu 2000 presentation
Marcu 2000 presentation
 
7 calais
7 calais7 calais
7 calais
 
7 calais
7 calais7 calais
7 calais
 

Kürzlich hochgeladen

Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 

Kürzlich hochgeladen (20)

Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 

The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

  • 1. The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure ISSI 2013 Vienna, 16 July 2013 Marc Bertin, Iana Atanassova, Vincent Lariviere, Yves Gingras
  • 2. Problem Scientific papers usually follow a specific rhetorical structure: the IMRaD structure (Introduction, Method, Result and Discussion). Questions:Questions: What relationships exist between cited references and the structure of the text? How does the IMRaD structure affect the distribution of references in scientific papers?
  • 3. Method Corpus: 7 peer-reviewed academic journals: PLoS series (ONE, Biology, Computational Biology, Genetics, Medicine, Neglected Tropical Diseases, Pathogens) XML using Journal Article Tag Suite (JATS)XML using Journal Article Tag Suite (JATS) More than 47,000 scientific articles Identify the section structure of the articles Identify cited references in the text Study the distribution of references according to the text progression and structure.
  • 4. Sections Identification • Section titles can vary according to the article. • e.g. "Method", "Methods", "Method and Model"Model" • Section titles were analyzed in order to match each section with one of the section types in the IMRaD structure.
  • 5. Sentence Level Processing We use sentences as basic units to model text progression Sentence segmentation allows us to work with text elements that are smaller than paragraphsparagraphs Analysis of the punctuation of the text following a set of typographic rules For each sentence, we count the number of references it contains and obtain their distribution along the text.
  • 7. Cited References Cited references are present as separate elements in the XML structure Special cases needing specific processing: reference ranges
  • 9. PLoS ONE & PLoS Computational Biology
  • 11. PLoS Medicine & PLoS Neglected Tropical Diseases
  • 13. Conclusion We have obtained the distribution of cited references in scientific papers. We have shown that this distribution seems quite stable and maybe evenseems quite stable and maybe even invariant if we take into account the changes that occur in some journals in the positions of the different sections in the text of the articles.