NISO Two Day Virtual Conference:
Using the Web as an E-Content Distribution Platform:
Challenges and Opportunities
Oct 21-22, 2014
Maryann Martone, Ph.D., Professor of Neuroscience, University of California, San Diego
The Future of Research Communications and e-Scholarship: Are we there yet?
1. Future of Research Communications and E-Scholarship
Are we there yet?
Maryann E. Martone, Ph. D.
Executive Director
Professor of Neuroscience, University of California, San Diego
2. What is FORCE11?
Future of Research Communications and E-Scholarship:
A grass roots effort to accelerate the pace and nature
of scholarly communications and e-scholarship through
technology, education and community
Why 11? We were born in 2011 in Dagstuhl,
Germany
Principles laid out in the FORCE11 Manifesto
FORCE11 launched in July 2012
3. Who is FORCE11?
Publishers
Social
Sciences
Science Humanities
Library and
Information
scientists
Scholars
Policy makers
Tool builders
Funders
Anyone who has a stake in moving scholarly communication into the 21st century
4. FORCE11 Vision
• Modern technologies enable vastly improve knowledge transfer and far wider
impact; freed from the restrictions of paper, numerous advantages appear
• We see a future in which scientific information and scholarly communication more
generally become part of a global, universal and explicit network of knowledge
• To enable this vision, we need to create and use new forms of scholarly
publication that work with reusable scholarly artifacts
• To obtain the benefits that networked knowledge promises, we have to put in
place reward systems that encourage scholars and researchers to participate and
contribute
• To ensure that this exciting future can develop and be sustained, we have to
support the rich, variegated, integrated and disparate knowledge offerings
that new technologies enable
Beyond the PDF Visual Notes by De Jongens van de Tekeningen is licensed under a Creative Commons Attribution 3.0 Unported License.
5. Old Model: Single type of content;
single mode of distribution
Scholar
Library
Scholar
Publisher
6. The future is now...
Scholar
Consumer
Data Repositories
Libraries
Code Repositories
Community databases/platforms
OA
Curators
Narrative
Nanopublications
Social
NetwoSrkoscial
Social
Networks
Networks
Peer Reviewers
Workflows
Data
Blogs/Wikis
Multimedia
Code
7. The duality of modern scholarship
Observation: Those who build information systems from the
machine side don’t understand the requirements of the
human very well
Those who build information systems from the human side,
don’t understand requirements of machines very well
Scholarship requires the ability to cite and track usage of
scholarly artifacts. In our current mode of working, there is no
way to easily track artifacts as they move through the
ecosystem; no way to incrementally add human expertise; no
way to alert everyone when things go wrong
8. Impetus for change: Is our current
method serving science?
47/50 major preclinical
published cancer studies
could not be replicated
“The scientific community
assumes that the claims in a
preclinical study can be taken at
face value-that although there
might be some errors in detail,
the main message of the paper
can be relied on and the data
will, for the most part, stand
the test of time. Unfortunately,
this is not always the case.”
Begley and Ellis, 29 MARCH 2012 | VOL 483 | NATURE | 531
9. A new platform for scholarly
communications
Components
• Authoring tools
– Optimized for mark up and linked content
• Containers
– Expand the objects that are considered “publications”
– Optimize the container for the content
• Processes
– Scholarship is code
• Mark up
– Data, claims, content suitable for the web
– Suitable identifier systems
• Reward systems
– Incentives to change
– Reward for new objects
Scholarship must move from a “single currency system”;
platforms must recognize diversity of output and representation
10. FORCE11.org
500 members from diverse stakeholder groups
• Community platform
– Meetings
– Discussions
– Tools and resources
– Blogs
– Event calendar
– Community projects
• Promote
interoperability
– Data Citation
– Resource identification
initiative
800
11. Promote community, cross-fertilization
and interoperability
• FORCE11 helps facilitate
communications across
disciplines and
communities
• Issues are not identical but
we can learn from each
other
– Enhanced publications
• Digital humanities +
– Dealing with data
• Science +
– Open Access
• Science +
“What is an ORCID id?”-computer scientist
12. Resource for scholarly communications:
People, organizations, publications, tools
ORCID
Research Data Alliance
Data journals
Workflows 4Ever
PeerJ, eLife
Data Verse
Impact Story, Rubriq
Sadie
Scalar
Hypothes.is
13. FORCE11 Working Groups
• FORCE11 provides a neutral convening place
for individuals to come together around issues
in scholarly communication
– FORCE11 provides web working space and
facilitation where possible
– 1K Challenge: Beyond the PDF
– Short term working groups with clear focus
• Deliverable specified
• Time line determined
14. Data: Who’s problem is it?
Scholar
Library
Computing
Scholar
Publisher
Domain-specific
Repository
Web
site/Personal
data
management
Scholars, Data Repositories, Institutional Repositories taking ownership of
data. Where should it go? Sometimes it can’t go anywhere.
15. A place to come together: Data
citation principles
•FORCE11 provides a neutral
space for bringing groups
together
•35 individuals
representing > 20
organizations concerned
with data citation
•Conducted a review of
current data citation
recommendations from 4
different organizations
•Arrived at a sense of
consensus principles
Data citation synthesis group:
http://www.force11.org/node/4
381
16. Joint Declaration of Data Citation
Principles
• Designed to be high
level and easy to
understand
• Supplemented with
a glossary,
references and
examples
1. Importance
2. Credit and attribution
3. Evidence
4. Unique Identification
5. Access
6. Persistence
7. Specificity and verifiability
8. Interoperability and
flexibility
http://www.force11.org/datacitation
17. Endorse the Principles!
• http://www.force11.org/datacitation/endorsements
185 individuals; 84 organizations
19. Unique ID’s for all! Resource
Identification Initiative
• It is currently impossible
to query the biomedical
literature to find out
what research resources
have been used to
produce the results of a
study
• Impossible to find all
studies that used a
resource
• Critical for
reproducibility and data
mining
• Critical for trouble-shooting
Faulty Antibodies Continue to Enter US and
European Markets, Warns Top Clinical
Chemistry Researcher-Genome Web Daily,
October 11, 2013
http://www.force11.org/resource_identification_initiative
20. Digital objects are a new beast
New modes of representation and verification
will be necessary
Trust: Not just
who produced it
but what
produced it
21. Resource Identification Initiative
• Have authors supply
appropriate identifiers for
key resources used within
a study such that they
are:
– Machine processible (i.e.,
unique identifier that
resolves to a single
resource)
– Outside of the paywall
– Uniform across journals
and publishers
Launched February 2014: > 30 journals
participating
22. Pilot Project
• Authors to identify 3 types of
research resources:
– Software/databases
– Antibodies
– Model organisms
• Include RRID in methods section
• Voluntary for authors
• Journals did not have to modify
their submission system
• Journals have flexibility in
implementation. Send request to
author at:
– Submission
– During review
– After acceptance
Launched February 2014: 3 month commitment and more…
23. Current Progress
• >160 articles have
appeared to date
• 29 journals
• >650 RRID’s
• 3 removed by
typesetting
•95% correct
•14% false negative rate
• thousands of antibodies
added from vendors, >200
added by individuals
• >90 software
tools/databases were
added to tool registry
Chemicon – out of
business, >8 yr
Millipore – just
joined Merck, URL
still works
Millipore / Chemicon
not a company
Authors cite
ID properly
Database available at: https://www.force11.org/node/5635
24. What can we do with an RRID?
• A resolver service
has been created
• 3rd party tools
are being created
to provide
linkage between
resources and
papers
– Utopia prototype
– ScienceDirect
Utopia Tools
http://scicrunch.com/resolver/RRID:nlx_144509
25. What have we learned?
• Authors are willing to adopt new types of
citations
– Meaningful to them
– Impact: Significant increase in identifiability
• Authors were fairly accurate at performing the
task
• RRID’s resolved by search engines without
requiring specialized citation services
• Citation drives registration
• Clear role for repositories as authorities
26. FORCE11 Vision
• Modern technologies enable vastly improve knowledge transfer and far wider
impact; freed from the restrictions of paper, numerous advantages appear
• We see a future in which scientific information and scholarly communication more
generally become part of a global, universal and explicit network of knowledge
• To enable this vision, we need to create and use new forms of scholarly
publication that work with reusable scholarly artifacts
• To obtain the benefits that networked knowledge promises, we have to put in
place reward systems that encourage scholars and researchers to participate and
contribute
• To ensure that this exciting future can develop and be sustained, we have to
support the rich, variegated, integrated and disparate knowledge offerings
that new technologies enable
Vibrant community working on the problems across many
dimensions; many more people and institutions care
27. Beyond the PDF
• Conference/unconferen
ce where all
stakeholders come
together as equals to
discuss issues
– Publishers
– Technologists
– Scholars
– Library scientists
• Incubator for change
• What would you do to
change scholarly
communication?
FORCE2015
San Diego, Jan 2011 ...... Amsterdam, March 2013........ Oxford, January 2015
https://www.force11.org/meetings/force2015
Editor's Notes
Current model: Scholars are producing multiple types of research objects; each goes to their own infrastructure with little coordination among them.
Consumer no longer exclusively a scholar: General public wants access to what they pay for; automated agents are accessing first and mining the content.
Libraries/journals think more along these lines than do the database providers