SlideShare a Scribd company logo
1 of 57
The Path to Open Science with Illustrations from Computational Biology  Philip E. Bourne, University of California San Diego
My Message “Research is the insurance policy for the future”                                                Rick Rashid Open Research is the way to maximize  the return on that policy
My Message I come to you as a domain scientist compelled by the belief that open science is critical to maximizing the return on the research investment and the means to embrace the maximum number of scientists worldwide I commend the efforts of Microsoft to embrace open science within their business
My Hope Some of you will see the unmet needs in achieving open science today and will be motivated to contribute to meet those needs
Disclaimer This is one persons biased view… Computational biologist Maintainer of a well funded major biological resource – The Protein Data Bank Co-founder of an open access journal – PLoS Computational Biology Contributor to open source archives Firm believer there must be a business model Co-founder of a for-profit science dissemination company
What is the Protein Data Bank (PDB)? The single community owned worldwide repository containing structures of publically accessible biological macromolecules A resource used by ~ 200,000 individuals per month A resource distributing worldwide the equivalent to  ¼ the National Library of Congress each month A bicoastal resource 1TB
What Does the PDB Tell Me About Open Science? Is a biological database really different that a biological journal? User base is broadening – outreach more important Constant demand for better performance Increasing use of Web services (SOAP and now RESTful)  Uptake on the use of widgets has been slow Mobile use increasing Web 2.0 communications are in demand
So What Am I Thinking We Need Going Forward?  Science is increasingly digital – whether it be driven by observation or hypothesis Doing science is but a complicated workflow How we communicate science remains very much in the 16th century  There remain too many analog steps that make no sense these can be removed with human and machine consensus Our current workflow tools are inadequate We already have many of the components we just need to put it together in a 21st century printing press
The Game is Afoot – Pressure for Change is Growing
Its Chaos Out There – Not so Much the Information But the Filtering 1330 databases reported in NAR 2011 MetaBasehttp://biodatabase.org reports 2,651 entries edited 12,587 times PubMed contains ~21M entries (May 2011) ~100,000 papers indexed per month In Feb 2009: 67,406,898 interactive searches were done 92,216,786 entries were viewed PLoS Comp. Biol. 2005 1(3) e34
Drivers of Change: The Scientific Publishing Process is Too Slow to Respond to a Crisis – Either Global or Personal By the time the paper is published  we could all be dead http://knol.google.com/k/plos-currents-influenza#
Drivers of Change: In a time of crisis the need for fast access  to accurate data and any knowledge of that data are paramount Structure Summary page activity for H1N1 Influenza related structures Jan. 2008 Jan. 2009 Jan. 2010 Jul. 2009 Jul. 2008 Jul. 2010 3B7E: Neuraminidase of A/Brevig Mission/1/1918  H1N1 strain in complex with zanamivir 1RUZ: 1918 H1 Hemagglutinin * http://www.cdc.gov/h1n1flu/estimates/April_March_13.htm
For some people the scientific process may be too slow to save their life If that is not enough…
Josh Sommer – A Remarkable Young ManCo-founder & Executive Director the Chordoma Foundation http://sagecongress.org/Presentations/Sommer.pdf
Chordoma A rare form of brain cancer No known drugs Treatment – surgical resection followed by intense radiation therapy http://upload.wikimedia.org/wikipedia/commons/2/2b/Chordoma.JPG
Chordoma http://sagecongress.org/Presentations/Sommer.pdf
http://sagecongress.org/Presentations/Sommer.pdf
http://sagecongress.org/Presentations/Sommer.pdf
If I have seen further it is only by standing on the shoulders of giants Isaac Isaac Newton From Josh’s point of view the climb  up just takes too long > 15 years and > $850M to be  more precise Adapted: http://sagecongress.org/Presentations/Sommer.pdf
http://sagecongress.org/Presentations/Sommer.pdf
http://sagecongress.org/Presentations/Sommer.pdf
http://fora.tv/2010/04/23/Sage_Commons_Josh_Sommer_Chordoma_Foundation
Now we are all hopefully motivated let us break this down to what actually needs to be done in my opinion Here are a few big things ….. and a few very little things we are contributing by way            of example…
A Big Thing: The Academic Reward System Must Change The Right Thing To Do Reward Papers Grants Data availability Reviews Provision of metadata Open access Curation Alternative forms of dissemination New tools
The Reward System Must Change: Prerequisite
Prerequisite: Ability to Attribute ORCHID - It is DOIs for people Some scientists will resist
With Attribution and Licensing Accurate Tools Emerge http://pubnet.gersteinlab.org/ http://www.researcherid.com/ http://www.biomedexperts.com
From Accurate Tools Come New Metrics for Success
Accuracy Must Appeal to Scientists Surely? So why do we persist with the journal impact factor? Why do we not educate those that review us since only 1-2 of a committee of 6 or more actually know what we do and the rest fall back on false metrics P.E. Bourne 2011 Ten Simple Rules for Getting Ahead  as a Computational Biologist in Academia. PLoS Comp. Biol. 7(1) e1002001.
The Reward System Must Change The Right Thing To Do Reward Papers Grants Data availability Reviews Provision of metadata Open access Curation Alternative forms of dissemination New tools
Measure Data Contributions – A Step Towards Realizing the 4th Paradigm Data resources have an obligation to unify metadata availability to provide provenance information How can this happen? Scientists within one domain agree on a way We are committed to ORCID  (we already support DOIs) and  to adopting such a standard
The Reward System Must Change The Right Thing To Do Reward Papers Grants Reviews Provision of metadata Open access Curation Alternative forms of dissemination New tools Data availability
Scientists: Open Access (Biomedical Sciences Only) “I just submitted this paper yesterday, if anyone is interested in a copy email me afterwards” Hot-shot young assistant professor Why? ,[object Object]
 Stuff is happening quickly?
 Generational?
 Influence of other fields?
 Government pressure?PDB 10 years ago / today ,[object Object]
 Hold till pub 60/45
 Hold 1 year 30/5,[object Object]
We Need Data and Knowledge About That Data to Interoperate The Knowledge and Data Cycle 0. Full text of PLoS papers stored  in a database 4. The composite view has links to pertinent blocks  of literature text and back to the PDB 4. User clicks on content Metadata and webservices to data provide an interactiveview that can be annotated Selecting features provides a data/knowledge mashup Analysis leads to new content I can share 1. 3. A composite view of journal and database content results 1. A link brings up figures  from the paper 3. 2. 2. Clicking the paper figure retrieves data from the PDB which is analyzed PLoS Comp. Biol. 2005 1(3) e34
We Have a Long Way to Go, But … Crowd annotation works under selective circumstances – again related to reward We have some instances of data knowledge interoperability We have some instances of interactive papers We have a very active community refining the electronic printing press
The Protein Data Bank – A Best Case Scenario Paper not published unless data are deposited – strong data to literature correspondence Highly structured data conforming to extensive ontologies DOI’s assigned to every structure PLoS Comp. Biol. 2005 1(3) e34
Example Interoperability: The Database View BMC Bioinformatics 2010 11:220 www.rcsb.org/pdb/explore/literature.do?structureId=1TIM
Example Interoperability: The Literature View http://biolit.ucsd.edu Nucleic Acids Research 2008 36(S2) W385-389
Semantic Tagging & Widgets are Powerful Tools to Integrate Data and Knowledge of that Data, But as Yet Not Used Much Will Widgets and Semantic Tagging Change Computational Biology?  PLoS Comp. Biol. 6(2) e1000673
Semantic Tagging of Database Content in The Literature or Elsewhere http://www.rcsb.org/pdb/static.do?p=widgets/widgetShowcase.jsp PLoS Comp. Biol. 6(2) e1000673
The Publishers are Starting to Do It From Anita de Waard, Elsevier
Others are Doing It Very Successfully
This is Literature Post-processingBetter to Get the Authors Involved Authors are the absolute experts on the content More effective distribution of labor Add metadata before the article enters the publishing process
Word Add-in for authors Allows authors to add metadata as they write, before they submit the manuscript Authors are assisted by automated term recognition OBO ontologies Database IDs Metadata are embedded directly into the manuscript document via XML tags, OOXML format Open Machine-readable Open source, Microsoft Public License http://www.codeplex.com/ucsdbiolit
Challenges Authors  Carrot IF one or more publishers fast tracked a paper that had semantic markup it might catch on Publishers Carrot Competitive advantage
The Promise Cardiac Disease Literature Immunology Literature Shared Function
The Reward System Must Change The Right Thing To Do Reward Papers Grants Reviews Provision of metadata Curation Alternative forms of dissemination New tools Data availability Open access
Yes YouTube Can Increase the Rate of Discovery
The Lab Experiment Paper+Rich Media My students enjoyed the experience The shyest student was actually the most bold in front of the camera “We will become a generation of “science castors” They liked the exposure for the most part – rather than the PI it puts them out in front

More Related Content

What's hot

Digital library workshops in a nutshell
Digital library workshops in a nutshellDigital library workshops in a nutshell
Digital library workshops in a nutshell
HVCClibrary
 

What's hot (20)

Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscape
 
From Theory to Practice: Can Opennesss Improve the Quality of OER Research?
From Theory to Practice: Can Opennesss Improve the Quality of OER Research? From Theory to Practice: Can Opennesss Improve the Quality of OER Research?
From Theory to Practice: Can Opennesss Improve the Quality of OER Research?
 
RDA Scholarly Infrastructure 2015
RDA Scholarly Infrastructure 2015RDA Scholarly Infrastructure 2015
RDA Scholarly Infrastructure 2015
 
Some Early Thoughts
Some Early ThoughtsSome Early Thoughts
Some Early Thoughts
 
Finding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics DatasetsFinding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics Datasets
 
Digital library workshops in a nutshell
Digital library workshops in a nutshellDigital library workshops in a nutshell
Digital library workshops in a nutshell
 
Rapid biomedical search
Rapid biomedical search Rapid biomedical search
Rapid biomedical search
 
How to Execute A Research Paper
How to Execute A Research PaperHow to Execute A Research Paper
How to Execute A Research Paper
 
Scott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data era
Scott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data eraScott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data era
Scott Edmunds ICIS talk at UC Davis: Open Publishing for the Big Data era
 
Understanding the Big Data Enterprise
Understanding the Big Data EnterpriseUnderstanding the Big Data Enterprise
Understanding the Big Data Enterprise
 
Executing the Research Paper
Executing the Research PaperExecuting the Research Paper
Executing the Research Paper
 
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
 
Elsevier02012011
Elsevier02012011Elsevier02012011
Elsevier02012011
 
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
Nicole Nogoy's talk at eResearchNZ 2014: Improving data sharing, integration ...
 
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015
 
Searching Deeply for Data, Results and Tools- What is Stopping Us?
Searching Deeply for Data, Results and Tools- What is Stopping Us?Searching Deeply for Data, Results and Tools- What is Stopping Us?
Searching Deeply for Data, Results and Tools- What is Stopping Us?
 
Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...Democratising biodiversity and genomics research: open and citizen science to...
Democratising biodiversity and genomics research: open and citizen science to...
 
An introduction to social media for scientists
An introduction to social media for scientistsAn introduction to social media for scientists
An introduction to social media for scientists
 
The new alchemy: Online networking, data sharing and research activity distri...
The new alchemy: Online networking, data sharing and research activity distri...The new alchemy: Online networking, data sharing and research activity distri...
The new alchemy: Online networking, data sharing and research activity distri...
 

Viewers also liked (6)

UCSD Deans and Chairs Presentation - PDB & Drug Discovery
UCSD Deans and Chairs Presentation - PDB & Drug DiscoveryUCSD Deans and Chairs Presentation - PDB & Drug Discovery
UCSD Deans and Chairs Presentation - PDB & Drug Discovery
 
ISCB Youth Symposium
ISCB Youth SymposiumISCB Youth Symposium
ISCB Youth Symposium
 
Towards the Digital Research Enterprise
Towards the Digital Research EnterpriseTowards the Digital Research Enterprise
Towards the Digital Research Enterprise
 
Sparc Funders Publishers Workshop 071015
Sparc Funders Publishers Workshop 071015Sparc Funders Publishers Workshop 071015
Sparc Funders Publishers Workshop 071015
 
Big Data in Biomedicine: Where is the NIH Headed
Big Data in Biomedicine: Where is the NIH HeadedBig Data in Biomedicine: Where is the NIH Headed
Big Data in Biomedicine: Where is the NIH Headed
 
Big Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & InnovationBig Data as a Catalyst for Collaboration & Innovation
Big Data as a Catalyst for Collaboration & Innovation
 

Similar to Cartegena051811

Similar to Cartegena051811 (20)

Big Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH PerspectiveBig Data in Biomedicine – An NIH Perspective
Big Data in Biomedicine – An NIH Perspective
 
Biomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital EnterpriseBiomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital Enterprise
 
Scholarly Communication for Bioinformatics Students
Scholarly Communication for Bioinformatics StudentsScholarly Communication for Bioinformatics Students
Scholarly Communication for Bioinformatics Students
 
Murpha11
Murpha11Murpha11
Murpha11
 
Open Data in a Global Ecosystem
Open Data in a Global EcosystemOpen Data in a Global Ecosystem
Open Data in a Global Ecosystem
 
What Will Be The Impact of Future Changes in Digital Scholarship on Marine Bi...
What Will Be The Impact of Future Changes in Digital Scholarship on Marine Bi...What Will Be The Impact of Future Changes in Digital Scholarship on Marine Bi...
What Will Be The Impact of Future Changes in Digital Scholarship on Marine Bi...
 
Data at the NIH: Some Early Thoughts
Data at the NIH: Some Early ThoughtsData at the NIH: Some Early Thoughts
Data at the NIH: Some Early Thoughts
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Open Access NBIC Workshop April 19, 2011
Open Access NBIC Workshop April 19, 2011Open Access NBIC Workshop April 19, 2011
Open Access NBIC Workshop April 19, 2011
 
There is No Intelligent Life Down Here
There is No Intelligent Life Down HereThere is No Intelligent Life Down Here
There is No Intelligent Life Down Here
 
Jim Gray Award Lecture
Jim Gray Award LectureJim Gray Award Lecture
Jim Gray Award Lecture
 
UCSD Library Presentation 10182010
UCSD Library Presentation 10182010UCSD Library Presentation 10182010
UCSD Library Presentation 10182010
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
PhRMA Some Early Thoughts
PhRMA Some Early ThoughtsPhRMA Some Early Thoughts
PhRMA Some Early Thoughts
 
The PDB An Exemplar for Data Science To Date, But What About the Future?
The PDB An Exemplar for Data Science To Date, But What About the Future?The PDB An Exemplar for Data Science To Date, But What About the Future?
The PDB An Exemplar for Data Science To Date, But What About the Future?
 
If Data Are The New Oil, How Do We Prevent Global Warming?
If Data Are The New Oil, How Do We Prevent Global Warming?If Data Are The New Oil, How Do We Prevent Global Warming?
If Data Are The New Oil, How Do We Prevent Global Warming?
 
Slides for burroughs wellcome foundation ajw100611 sefinal
Slides for burroughs wellcome foundation ajw100611 sefinalSlides for burroughs wellcome foundation ajw100611 sefinal
Slides for burroughs wellcome foundation ajw100611 sefinal
 
Open Notebook Science and One Future for Scientific Research
Open Notebook Science and One Future for Scientific ResearchOpen Notebook Science and One Future for Scientific Research
Open Notebook Science and One Future for Scientific Research
 
Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research  Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research
 
What Bioinformaticians Need to Know About Digital Publishing Beyond the PDF
What Bioinformaticians Need to Know About Digital Publishing Beyond the PDFWhat Bioinformaticians Need to Know About Digital Publishing Beyond the PDF
What Bioinformaticians Need to Know About Digital Publishing Beyond the PDF
 

More from Philip Bourne

More from Philip Bourne (20)

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Data Science Meets Drug Discovery
Data Science Meets Drug DiscoveryData Science Meets Drug Discovery
Data Science Meets Drug Discovery
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
 

Recently uploaded

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
SanaAli374401
 

Recently uploaded (20)

Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 

Cartegena051811

  • 1. The Path to Open Science with Illustrations from Computational Biology Philip E. Bourne, University of California San Diego
  • 2. My Message “Research is the insurance policy for the future” Rick Rashid Open Research is the way to maximize the return on that policy
  • 3. My Message I come to you as a domain scientist compelled by the belief that open science is critical to maximizing the return on the research investment and the means to embrace the maximum number of scientists worldwide I commend the efforts of Microsoft to embrace open science within their business
  • 4. My Hope Some of you will see the unmet needs in achieving open science today and will be motivated to contribute to meet those needs
  • 5. Disclaimer This is one persons biased view… Computational biologist Maintainer of a well funded major biological resource – The Protein Data Bank Co-founder of an open access journal – PLoS Computational Biology Contributor to open source archives Firm believer there must be a business model Co-founder of a for-profit science dissemination company
  • 6. What is the Protein Data Bank (PDB)? The single community owned worldwide repository containing structures of publically accessible biological macromolecules A resource used by ~ 200,000 individuals per month A resource distributing worldwide the equivalent to ¼ the National Library of Congress each month A bicoastal resource 1TB
  • 7. What Does the PDB Tell Me About Open Science? Is a biological database really different that a biological journal? User base is broadening – outreach more important Constant demand for better performance Increasing use of Web services (SOAP and now RESTful) Uptake on the use of widgets has been slow Mobile use increasing Web 2.0 communications are in demand
  • 8. So What Am I Thinking We Need Going Forward? Science is increasingly digital – whether it be driven by observation or hypothesis Doing science is but a complicated workflow How we communicate science remains very much in the 16th century There remain too many analog steps that make no sense these can be removed with human and machine consensus Our current workflow tools are inadequate We already have many of the components we just need to put it together in a 21st century printing press
  • 9. The Game is Afoot – Pressure for Change is Growing
  • 10. Its Chaos Out There – Not so Much the Information But the Filtering 1330 databases reported in NAR 2011 MetaBasehttp://biodatabase.org reports 2,651 entries edited 12,587 times PubMed contains ~21M entries (May 2011) ~100,000 papers indexed per month In Feb 2009: 67,406,898 interactive searches were done 92,216,786 entries were viewed PLoS Comp. Biol. 2005 1(3) e34
  • 11. Drivers of Change: The Scientific Publishing Process is Too Slow to Respond to a Crisis – Either Global or Personal By the time the paper is published we could all be dead http://knol.google.com/k/plos-currents-influenza#
  • 12. Drivers of Change: In a time of crisis the need for fast access to accurate data and any knowledge of that data are paramount Structure Summary page activity for H1N1 Influenza related structures Jan. 2008 Jan. 2009 Jan. 2010 Jul. 2009 Jul. 2008 Jul. 2010 3B7E: Neuraminidase of A/Brevig Mission/1/1918 H1N1 strain in complex with zanamivir 1RUZ: 1918 H1 Hemagglutinin * http://www.cdc.gov/h1n1flu/estimates/April_March_13.htm
  • 13. For some people the scientific process may be too slow to save their life If that is not enough…
  • 14. Josh Sommer – A Remarkable Young ManCo-founder & Executive Director the Chordoma Foundation http://sagecongress.org/Presentations/Sommer.pdf
  • 15. Chordoma A rare form of brain cancer No known drugs Treatment – surgical resection followed by intense radiation therapy http://upload.wikimedia.org/wikipedia/commons/2/2b/Chordoma.JPG
  • 19. If I have seen further it is only by standing on the shoulders of giants Isaac Isaac Newton From Josh’s point of view the climb up just takes too long > 15 years and > $850M to be more precise Adapted: http://sagecongress.org/Presentations/Sommer.pdf
  • 23. Now we are all hopefully motivated let us break this down to what actually needs to be done in my opinion Here are a few big things ….. and a few very little things we are contributing by way of example…
  • 24. A Big Thing: The Academic Reward System Must Change The Right Thing To Do Reward Papers Grants Data availability Reviews Provision of metadata Open access Curation Alternative forms of dissemination New tools
  • 25. The Reward System Must Change: Prerequisite
  • 26. Prerequisite: Ability to Attribute ORCHID - It is DOIs for people Some scientists will resist
  • 27. With Attribution and Licensing Accurate Tools Emerge http://pubnet.gersteinlab.org/ http://www.researcherid.com/ http://www.biomedexperts.com
  • 28. From Accurate Tools Come New Metrics for Success
  • 29. Accuracy Must Appeal to Scientists Surely? So why do we persist with the journal impact factor? Why do we not educate those that review us since only 1-2 of a committee of 6 or more actually know what we do and the rest fall back on false metrics P.E. Bourne 2011 Ten Simple Rules for Getting Ahead as a Computational Biologist in Academia. PLoS Comp. Biol. 7(1) e1002001.
  • 30. The Reward System Must Change The Right Thing To Do Reward Papers Grants Data availability Reviews Provision of metadata Open access Curation Alternative forms of dissemination New tools
  • 31. Measure Data Contributions – A Step Towards Realizing the 4th Paradigm Data resources have an obligation to unify metadata availability to provide provenance information How can this happen? Scientists within one domain agree on a way We are committed to ORCID (we already support DOIs) and to adopting such a standard
  • 32. The Reward System Must Change The Right Thing To Do Reward Papers Grants Reviews Provision of metadata Open access Curation Alternative forms of dissemination New tools Data availability
  • 33.
  • 34. Stuff is happening quickly?
  • 36. Influence of other fields?
  • 37.
  • 38. Hold till pub 60/45
  • 39.
  • 40. We Need Data and Knowledge About That Data to Interoperate The Knowledge and Data Cycle 0. Full text of PLoS papers stored in a database 4. The composite view has links to pertinent blocks of literature text and back to the PDB 4. User clicks on content Metadata and webservices to data provide an interactiveview that can be annotated Selecting features provides a data/knowledge mashup Analysis leads to new content I can share 1. 3. A composite view of journal and database content results 1. A link brings up figures from the paper 3. 2. 2. Clicking the paper figure retrieves data from the PDB which is analyzed PLoS Comp. Biol. 2005 1(3) e34
  • 41. We Have a Long Way to Go, But … Crowd annotation works under selective circumstances – again related to reward We have some instances of data knowledge interoperability We have some instances of interactive papers We have a very active community refining the electronic printing press
  • 42. The Protein Data Bank – A Best Case Scenario Paper not published unless data are deposited – strong data to literature correspondence Highly structured data conforming to extensive ontologies DOI’s assigned to every structure PLoS Comp. Biol. 2005 1(3) e34
  • 43. Example Interoperability: The Database View BMC Bioinformatics 2010 11:220 www.rcsb.org/pdb/explore/literature.do?structureId=1TIM
  • 44. Example Interoperability: The Literature View http://biolit.ucsd.edu Nucleic Acids Research 2008 36(S2) W385-389
  • 45.
  • 46. Semantic Tagging & Widgets are Powerful Tools to Integrate Data and Knowledge of that Data, But as Yet Not Used Much Will Widgets and Semantic Tagging Change Computational Biology? PLoS Comp. Biol. 6(2) e1000673
  • 47. Semantic Tagging of Database Content in The Literature or Elsewhere http://www.rcsb.org/pdb/static.do?p=widgets/widgetShowcase.jsp PLoS Comp. Biol. 6(2) e1000673
  • 48.
  • 49. The Publishers are Starting to Do It From Anita de Waard, Elsevier
  • 50. Others are Doing It Very Successfully
  • 51. This is Literature Post-processingBetter to Get the Authors Involved Authors are the absolute experts on the content More effective distribution of labor Add metadata before the article enters the publishing process
  • 52. Word Add-in for authors Allows authors to add metadata as they write, before they submit the manuscript Authors are assisted by automated term recognition OBO ontologies Database IDs Metadata are embedded directly into the manuscript document via XML tags, OOXML format Open Machine-readable Open source, Microsoft Public License http://www.codeplex.com/ucsdbiolit
  • 53. Challenges Authors Carrot IF one or more publishers fast tracked a paper that had semantic markup it might catch on Publishers Carrot Competitive advantage
  • 54. The Promise Cardiac Disease Literature Immunology Literature Shared Function
  • 55. The Reward System Must Change The Right Thing To Do Reward Papers Grants Reviews Provision of metadata Curation Alternative forms of dissemination New tools Data availability Open access
  • 56. Yes YouTube Can Increase the Rate of Discovery
  • 57. The Lab Experiment Paper+Rich Media My students enjoyed the experience The shyest student was actually the most bold in front of the camera “We will become a generation of “science castors” They liked the exposure for the most part – rather than the PI it puts them out in front
  • 58. Three Years Later - Organic Growth Some of their work viewed 20,000+ times Global audience of researchers, educators and academic/research institutions 60,000 unique visitors & 2M pageviews/month 16,000 registered users & 600 communities 5,000 uploads of video content (about journal articles, conferences, research news and classes) Growing 4-5% monthly Sustainability - evolving a business model supporting journals and conferences
  • 59. Are We There Yet? The Right Thing To Do Reward Papers Grants Data availability Reviews Provision of metadata Open access Curation Alternative forms of dissemination New tools TOOLS
  • 60. Pipeline Assembly Scientist Tools for idea management just appearing in the cloud Idea LIMS system too limited Experiment Institutional repositories – Do they really know what they are doing? Metrics of success interoperability etc. Data Conclusions It would not be the first wall to come down Publish uzar.wordpress.com
  • 61. Acknowledgements BioLit Team Lynn Fink Parker Williams Marco Martinez RahulChandran Greg Quinn Microsoft Scholarly Communications Pablo Fernicola Lee Dirks SavasParastitidas Alex Wade Tony Hey wwPDB team Andreas Prilc DimitrisDimitropoulos SciVee Team Apryl Bailey Leo Chalupa Lynn Fink Marc Friedman (CEO) Ken Liu Alex Ramos Willy Suwanto Ben Yukich http://www.scivee.tv http://biolit.ucsd.edu http//www.pdb.org http://www.codeplex.com/ucsdbiolit