1. A Brief History of The Future
of Research Communication
(and e-Scholarship)
Anita de Waard
VP Research Data Management, Elsevier
RDM Services
FSCI July 31, 2017
Disclaimers:
- All history is personal, this one especially so. Therefore, not trying to make any grand
statement about completeness: this is meant as a first list, feel free to add, correct etc.
- I am very aware of my specific (privileged, white, scientist European/American) cultural
background which colors my views and experiences, hope others can correct this bias
- I am not a historian, so my interpretation or selection has not been vetted or justified
- I am not speaking on behalf of Elsevier, and am not trying to sell anything.
2. Web
Some key tech milestones
1987 20171997 2007
1987: Apple releases Hypercard
1991: Birth of Gopher
1992: Birth of PDF
1993: Birth of the Web
1994: Yahoo! was launched
1997: Netflix offers online movie rental
1998: Google founded
1998: Birth of XML
1999: Birth of RDF & OWL
2004: Facebook launched
2006: Twitter launched
2006: Google Docs launched
2006: Spotify launched
2007: iPhone 1 launched
2011: Snapchat launched
2015: Mobile > desktop
Social
Mobile
Semantic
3. But 20 years before all that…
(NLS = oNLine System, the tool Doug Engelbart is showing)
http://www.dougengelbart.org/firsts/1968-demo-interactive.html#3
4. Six Themes from the Force11 Manifesto:
A. Changing the unit and form of
the research paper
B. Better support for the
scholarly lifecycle
C. Treating data, software, and
workflows as first-class
objects
D. New models for evaluation
and peer review
E. New roles for libraries
F. New business models
5. A. Changing the unit and form of the research paper
1998: Modular papers:
2000: Argumentation networks:
1987 20171997 2007
2015: Lorena Barba, GWU, teaching in Python:
6. B. Better support for the scholarly lifecycle
2003: myGrid/Taverna: Workflow tool
1987 20171997 2007
2007: Vistrails: Provenance-driven visualisation tool
2015: Riffyn, Internet of Things in the Lab:
7. 1971: PDB (Protein databank), International in 2003
C. Treating data, software, workflows as first-class objects
1987 20171997 2007
1999+: Earth Systems Grid Federation
2015: Computer-Aided Discovery systems
2016: FAIR Data Principles
https://www.force11.org/node/6062
2017: Human Cell Atlas sponsored
8. D. New models for evaluation and peer review
1987 20171997 2007
1996: Journal of Interactive Media and Education:
2012: Altmetrics: Moving to Include Social Mentions
2015: Making Data Count: Towards Data Metrics
9. E. New roles for libraries
1987 20171997 2007
1992: SAO/NASA Astrophysics Data System (ADS)
1998: Fedora-Dspace and other Digital Repositories
2013: SHARE: CoS, AAU, APLU and ARL
10. F. New business models
1987 20171997 2007
1991: ArXIv
Preprints 2016: ASAPBio
Preprints
1997: OA: Vitek Tracz
talks to David Lippmann,
BMC/PMC ensue 2002: Post-Publication
Peer Review: F1000
2016: Publisher-in-a-box:
Wellcome Open Research
2016: Big Data
to Knowledge
11. In Summary: A Lot Has Happened!
1987 20171997 2007
1998: Modular papers:
2000: Argumentation networks:
2015: Lorena Barba, GWU, teaching in Python:
2003: myGrid/Taverna: Workflow tool
2007: Vistrails: Provenance-driven visualisation tool
2015: Riffyn, Internet of Things in the Lab:
1971: PDB (Protein databank), International in 2003
2016: FAIR Data Principles
https://www.force11.org/node/6062
2017: Human Cell Atlas sponsored
1996: Journal of Interactive Media and Education:
2012: Altmetrics: Moving to Include Social Mentions
2015: Making Data Count: Towards Data Metrics
1992: SAO/NASA Astrophysics Data System (ADS)
1998: Fedora-Dspace and other Digital Repositories
2013: SHARE: CoS, AAU, APLU and ARL
1991: ArXIv
Preprints 2016: ASAPBio
Preprints
1997: OA: Vitek Tracz
talks to David Lippmann,
BMC/PMC ensue 2002: Post-Publication
Peer Review: F1000
2016: Publisher-in-a-box:
Wellcome Open Research
2016: Big Data
to Knowledge
12. But also, a lot has been missed.
1987 20171997 2007
The “Global South”/other cultures of science?
Education?
Citizen Science?
The Humanities???
13. A. Changing the unit and form of the research
paper: (Yolanda Gil, ISI)
B. Better support for the scholarly lifecycle:
Jen Smith, CoS
C. Treating data/software as first-class objects:
Dan Katz, University of Illinois
D. New models for evaluation and peer review:
Cameron Neylon, Perth
E. New roles for libraries:
Danny Kingsley, Cambridge
F. New business models:
Nicky Agate, MLA
What’s Next?
2017 20472027 2037
And all of you, about these and other topics!
14. • Force11 Manifesto: https://www.force11.org/about/manifesto
• Birth of gopher: https://en.wikipedia.org/wiki/Gopher_(protocol)
• W3C Announcement: http://tenyears-www.web.cern.ch/tenyears-
www/Declaration/Page1.html
• History of OA movement:
https://cshl.libguides.com/c.php?g=474046&p=3243855
• Development of the Open Access Journal from 1993 – 2009:
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.002096
1
• History of ArXIv: https://en.wikipedia.org/wiki/ArXiv
• Imagining Tomorrow’s University, workshop inputs:
http://www.ncsa.illinois.edu/Conferences/ImagineU/participants.html
• History of Hypertext: http://cs.brown.edu/memex/home.html
References (1/3):
15. References (2/3):
A. New Formats:
• Modular papers: Kircz, J.G. (1998). Modularity: the next form of scientific information presentation? Journal of
Documentation. vol.54. No. 2. March 1998. pp. 210-235
• Argumentation networks: Buckingham Shum, S., Motta E. and Domingue, J. (2000). ScholOnto: An Ontology-Based
Digital Library Server for Research Documents and Discourse. International Journal on Digital Libraries, 3 (3), pp.
237-248
• Jupyter journals: e.g. http://lorenabarba.com/figshare/pygbe-on-localized-surface-plasmon-resonance-lspr/ and
http://nbviewer.jupyter.org/github/barbagroup/pygbe_lspr/blob/master/reports/02_Isolated_nanoparticle_verifica
tion.ipynb
B. Workflow tools:
• myGrid: Goble,C.A., Pettifer,S., Stevens,R. and Greenhalgh,C. (2003) Knowledge integration: in silico experiments in
bioinformatics. In Foster,I. and Kesselman,C. (eds), The Grid 2: Blueprint for a New Computing Infrastructure, 2nd
edn, Morgan Kaufmann.
• Taverna: Oinn, T.; Addis, M.; Ferris, J.; Marvin, D.; Senger, M.; Greenwood, M.; Carver, T.; Glover, K.; Pocock, M. R.;
Wipat, A.; Li, P. (2004). "Taverna: A tool for the composition and enactment of bioinformatics workflows".
Bioinformatics. 20 (17): 3045–3054. PMID 15201187. doi:10.1093/bioinformatics/bth361
• Vistrails: Using Provenance to Streamline Data Exploration through Visualization (by Steven P. Callahan, Juliana
Freire, Emanuele Santos, Carlos E. Scheidegger, Claudio T. Silva and Huy T. Vo) SCI Institute Technical Report, No.
UUSCI-2006-016, University of Utah, 2006.
• Riffyn: http://riffyn.com/product and Timothy Gardner, ‘A swan in the making’, Science 22 Aug 2014, Vol. 345,
Issue 6199, pp. 855, DOI: 10.1126/science.1259740
C. Data and Software:
• Protein data bank: Berman, Helen M., et al. "The Protein Data Bank, 1999–." International Tables for
Crystallography Volume F: Crystallography of biological macromolecules. Springer Netherlands, 2006. 675-684.
• Earth Systems Grid Federation: Cinquini, L., Crichton, D., Mattmann, C., Harney, J., Shipman, G., Wang, F.,
Ananthakrishnan, R., Miller, N., Denvil, S., Morgan, M. and Pobre, Z., 2014. The Earth System Grid Federation: An
open infrastructure for access to distributed geospatial data. Future Generation Computer Systems, 36, pp.400-417.
• Sense-making software: Li, J.D., Rude, C.M., Blair, D.M., Gowanlock, M.G., Herring, T.A. and Pankratius, V., 2016.
Computer aided detection of transient inflation events at Alaskan volcanoes using GPS measurements from 2005–
2015. Journal of Volcanology and Geothermal Research, 327, pp.634-642.
• Support for data: Chan-Zuckerberg announcement for support o human cell atlas:f
http://www.sanger.ac.uk/news/view/chan-zuckerberg-initiative-supports-human-cell-atlas-data-platform
16. D. Evaluation & Peer review:
• JIME: http://www.ariadne.ac.uk/issue5/jime
• Altmetrics:
– Priem, J., Piwowar, H.A., & Hemminger, B.M.. 2012. “Altmetrics in the Wild: Using Social Media to Explore Scholarly Impact”.
ArXiv: Digital Libraries (March 20, 2012). http://arxiv.org/abs/1203.4745
– Graphic comes from Simon Porte, August 2016 Counter-MDC: White Paper ‘A New ‘Research Data Mechanics’
• Making Data Count:
– Kratz, J. & Strasser, C. Making data count. Sci. Data 2:150039 doi: 10.1038/sdata.2015.39 (2015) and http://mdc.lagotto.io/
E. New roles for Libraries:
• ADS: Murray, S.S., Brugel, et al., 1992, September. The NASA Astrophysics Data System: A heterogeneous distributed processing
system application. In ESO Conference and Workshop Proceedings (Vol. 43):
http://articles.adsabs.harvard.edu/full/1992ESOC...43..387M
• Repository usage. Jose, S. Adoption of open source digital library software packages: A survey. in International Convention on
Automation of Libraries in Education and Research Institutions (CALIBER). 2007.
• Dspace: "DSpace: An Open Source Dynamic Digital Repository", D-Lib Magazine, January 2003,
http://www.dlib.org/dlib/january03/smith/01smith.html
• Fedora, 1998:
• SHARE: http://crln.acrl.org/index.php/crlnews/article/view/9586/10960
F. New business models:
• Preprints:
– History of ArXIv: https://en.wikipedia.org/wiki/ArXiv
– AsapBio: http://asapbio.org/
• OA/F1000:
– History of OA movement: https://cshl.libguides.com/c.php?g=474046&p=3243855
– Development of the Open Access Journal from 1993 – 2009:
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0020961
– About OA: https://arstechnica.com/science/2016/06/what-is-open-access-free-sharing-of-all-human-knowledge/4/
– Interview with Vitek Tracz: http://poynder.blogspot.com/2006/05/interview-with-vitek-tracz.html
– Wellcome Open Research: https://wellcomeopenresearch.org/about
References (3/3):
Hinweis der Redaktion
1987: Apple releases Hypercard: GUI’s, stacks, scripts, hypermedia
1991: Birth of Gopher, allowing ftp & hyperlinked menu’s
1992: Birth of PDF
1993: Birth of the Web, including http:// and Mosaic
1994: Yahoo! was launched as “Jerry and David's Guide to the World Wide Web"
1996: Pubmed was launched as search engine for Medline/Entrez
1997: Netflix offers online movie rental service
1998: Google was founded as student project at Stanford
1998: Birth of XML
1999: Birth of RDF
2002: Birth of OWL
2004: Facebook was launched as student project at Harvard
2006: Google Docs was launched, including docs & spreadsheets
2006: Twitter was launched as SMS service
2006: Spotify was launched
2007: iPhone 1 was launched
2011: Snapchat is launched
2015: Nr of mobile searches exceeds nr of searches on desktop
Also see work by Yolanda Gil, who will be here on Wednesday
Carole goble pioneered myGrid, then worked on Research Object concept as collective of paper/software/data
The file format initially used by the PDB was called the PDB file format. This original format was restricted by the width of computer punch cards to 80 characters per line
The CADS automatically generates algorithmic and parameter variants to help researchers explore multidimensional data processing search spaces efficiently.
2017 Chan-Zuckerberg announcement for support o human cell atlas:f http://www.sanger.ac.uk/news/view/chan-zuckerberg-initiative-supports-human-cell-atlas-data-platform
JIME: Cycles of closed and open peer review, D3E environment to allow discourse and threaded discussions
1992: Library becomes (part of) distributing knowledge, survey from 2007
1998: Libraries become content providers
2013: Libraries develop full cycle research environments