Research Data Access and Preservation Summit, 2016
Atlanta, GA
May 4-7, 2016
Presenters:
Sherry Lake, University of Virginia
Brianna Marshall, University of Wisconsin-Madison
Regina Raboin, University of Massachusetts Medical School
Andrew Johnson, University of Colorado
Brian Westra, University of Oregon
Panel lead:
Cynthia Hudson-Vitale, Washington University in St. Louis
2. Panelists
Andrew Johnson - DataQ
Sherry Lake - Virginia Collaborative Data Management Bootcamp
Brianna Marshall - Midwest Data Librarian Symposium
Regina Raboin - NECDMC/eSci portal
Brian Westra - DataCure
Moderator: Cynthia Hudson-Vitale, Washington University in St. Louis
3. This project was made possible in part by the Institute of Museum and
Library Services Sparks! Ignition Grant for Libraries SP-02-14-0020-14
Andrew Johnson
Research Data Librarian
University of Colorado Boulder
@PrezSeventeen
4. Academic Libraries and Research Data Services
(Tenopir, Birch, and Allard, 2012):
● “Reassigning existing library staff is the most common tactic for offering
RDS. This approach also needs to be supported with professional
development for staff so they can gain the required expertise to provide
the full range of RDS”
● “When libraries provide research data services related to reference,
consultation, or instruction, those services are most likely to be offered by
individual librarians or library staff members who are subject discipline
specialists.”
How do we provide support for individuals in these situations?
The Need
5. ● “Research data is intimidating!”
● “How can I take on research data support with so much else already on
my plate?!”
● “I need practical tools to use to help researchers with their data”
But Also:
● “Helping faculty and students with their data is an increasingly important
part of my liaison duties”
Librarian Feedback:
6. DataQ is a web-based collaborative tool to support librarians who are engaging
in research data services by providing:
● An online space for assistance and collaboration
● A community of research data experts and subject librarians
● A knowledge base of practical data information for librarians
7.
8.
9.
10. ● Andrew Johnson, University of Colorado Boulder, PI
● Megan Bresnahan, Tufts University, Co-PI
● Joni Blake, GWLA
● Greg Monaco, GPN
● Ann Riley, ACRL
DataQ Advisory Board
11. ● Kathleen Fear, University of Rochester
● Elena Feinstein, Duke University
● Cynthia Hudson-Vitale, Washington University in St. Louis
● Erica Johns, Cornell University
● Sherry Lake, University of Virginia
● Stanislav Pejša, Purdue University
● Sarah Pickle, The Claremont Colleges
● Amanda Rinehart, The Ohio State University
● Yasmeen Shorish, James Madison University
● Cecilia Smith, Texas A&M University
● Shea Swauger, University of Colorado Denver
● Steve Van Tuyl, Oregon State University
● Mary Whelan, Arizona State University
● Christie Wiley, University of Illinois at Urbana-Champaign
● Jackie Wirz, Oregon Health and Science University
DataQ Editorial Team
12. ● Thea Atwood, University of Massachusetts Amherst
● Carolyn Bishoff, University of Minnesota
● Rebekah Cummings, University of Utah
● Khue Dong, California State University, Long Beach
● Christopher Eaker, University of Tennessee
● Abigail Goben, University of Illinois at Chicago
● Renaine Julian, Florida State University
● Dessi Kirilova, Qualitative Data Repository
● Chris Kollen, University of Arizona
● Robert Olendorf, Los Alamos National Laboratory
● Laura Palumbo, Rutgers University
● Robert Sippel, Florida Institute of Technology
● Dan Valen, Figshare
DataQ Project Volunteers
13. ● Ask all of your questions related to research data and libraries at
http://ResearchDataQ.org!
● Many ways to participate:
○ Ask questions completely anonymously
○ Ask questions and provide your email address to just our team for
follow-up and clarification purposes
○ Register for a free DataQ account so that questions you ask will be
attributed to your username
○ Comment on all questions and answers (requires free account)
○ Browse or search answers to existing questions
○ Follow us on Twitter: @ResearchDataQ
DataQ Website
14. ● ~18,000 site visits from ~4,000 users
● 82 questions answered from these categories:
(Meta)DataQ
15. ● About research data and libraries?: http://ResearchDataQ.org/ask
● About the DataQ website?: http://ResearchDataQ.org/contact
● About the DataQ project?: andrew.m.johnson@colorado.edu
Questions?
16. Virginia Data Management
Bootcamp
Sherry Lake
Scholarly Repository Librarian
University of Virginia
May 4, 2016
Building the Research Data Community of Practice
RDAP 2016
22. Margaret E. Henderson, "Virginia Data Management Bootcamp: A Collaborative Initiative in Data Education" (April 9, 2015). University of Massachusetts and New England
Area Librarian e-Science Symposium. Paper 5. http://escholarship.umassmed.edu/escience_symposium/2015/posters/5
23. Planning
VA DM Bootcamp Team
• Meetings starting the summer before
• Debriefings after bootcamps
• Assessments – data after 2014 & 2015
Local (UVA Example)
• Marketing
• Registration
• Room scheduling/ Food (sponsors)
• Volunteers
• Video Release Forms
• Preparing copies of Handouts
24. Assessment
MOST USEFUL SESSION
Intellectual property, copyright: The speaker was very thorough
and used plain English rather than "lawyer-speak".
MOST USEFUL SESSION
Methods to scrub messy data with OpenRefine. I had no idea
such things were possible and they will be so useful!
LEAST USEFUL
Scholarly communication - nebulous subject
LEAST USEFUL
metadata - i didn't really follow what they were
talking about
ADDITIONAL COMMENTS SUGGESTIONS
I would say, keep doing this as it provides a great service to
researchers and librarians alike.
ADDITIONAL COMMENTS SUGGESTIONS
I felt that many of the topics did not go into depth
enough. I would have liked fewer topics, more in depth.
2014
MOST USEFUL SESSION
any sessions where there were hands-on activities were good
I personally learned something new from each session and found
them all to be very helpful.
Data Documentation and Metadata. My first time to learn the
Meatdata concept.
LEAST USEFUL
metadata - i didn't really follow what they were talking about
ADDITIONAL SUGGESTIONS
Better to have shorter more intense days. Maybe more days.
Real data sets to work with.
Overall, how likely are you to recommend this training to your
colleagues?
78% likely/very likely
Length of bootcamp (2 full days, optional .5)- 59% too long
2015
Met expectations:
92% agree/strongly agree
MOST USEFUL SESSION
Intellectual property, copyright: The speaker was very thorough and used plain English
rather than "lawyer-speak".
OpenRefine/Data Wrangling
Hearing multiple perspectives from other universities
Methods to scrub messy data with OpenRefine. I had no idea such things were possible
and they will be so useful!
LEAST USEFUL
Scholarly communication - nebulous subject
ADDITIONAL COMMENTS SUGGESTIONS
I felt that many of the topics did not go into depth enough. I would have liked fewer
topics, more in depth.
I would say, keep doing this as it provides a great service to researchers and librarians alike.
The technology needs to be more stable so that all sites see and hear everything at the same
time.
73% said length was just right
Overall, how likely are you to recommend this training to your colleagues?
77% likely
25. Assessment
2014
MOST USEFUL SESSION
any sessions where there were hands-on activities were good
I personally learned something new from each session and found
them all to be very helpful.
Data Documentation and Metadata. My first time to learn the
Meatdata concept.
LEAST USEFUL
metadata - i didn't really follow what they were talking about
ADDITIONAL SUGGESTIONS
Better to have shorter more intense days. Maybe more days.
Real data sets to work with.
Overall, how likely are you to recommend this training to your
colleagues?
78% likely/very likely
Length of bootcamp (2 full days, optional .5)- 59% too long
2015
Met expectations:
92% agree/strongly agree
MOST USEFUL SESSION
Intellectual property, copyright: The speaker was very thorough and used plain English
rather than "lawyer-speak".
OpenRefine/Data Wrangling
Hearing multiple perspectives from other universities
Methods to scrub messy data with OpenRefine. I had no idea such things were possible
and they will be so useful!
LEAST USEFUL
Scholarly communication - nebulous subject
ADDITIONAL COMMENTS SUGGESTIONS
I felt that many of the topics did not go into depth enough. I would have liked fewer
topics, more in depth.
I would say, keep doing this as it provides a great service to researchers and librarians alike.
The technology needs to be more stable so that all sites see and hear everything at the same
time.
73% said length was just right
Overall, how likely are you to recommend this training to your colleagues?
77% likely
26. What’s Next?
• Involving more Virginia institutions
• Offer smaller workshops/ sessions throughout
the year
• Larger collaboration (like DM Bootcamp)
every-other year
• Do more non-workshop collaborations
36. Building the Research Data
Community of Practice
University of Massachusetts Medical School
Regina Fisher Raboin
Associate Director, Library Education & Research
Lamar Soutter Library, University of Massachusetts Medical School
regina.raboin@umassmed.edu
RDAP Summit 2016
37. E-Science Portal for New England Librarians
A Librarian’s Link to e-Science Resources
Background
Goals
• Library roles in e-Science/Data
Science
• Fundamentals of domain
sciences
• Emerging trends in supporting
networked scientific research
Project
• Collaborative initiative
• Education, Application, Professional
Development
• Outreach, Connections,
Dissemination
• In-depth resource for data
management and literacy; research
environment
Results
• e-Science Symposium 2009 –
2016
• Science Boot Camp 2009 – 2016
• Portal 2010 - 2016
• Journal of eScience Librarianship
2012 - 2016
E-Science
Portal
38. New England e-Science Program
Community of
Practice
180+ Librarians
Contributors
NN/LM NER
IMLS
BLC
NAHSL
Portal and Advisory
Boards
Dissemination
Journal of eScience
Librarianship
e-Science Community
Blog
Twitter feeds
Tools/Resources
e-Science Portal
Content for RDM
class
e-Science Thesaurus
Professional
Development/Education
Professional Development Day
e-Science Symposium
Science Boot Camp
Simmons GSLIS class
Course: How You Teach RDM
New England Collaborative
Data Management Curriculum
(NECDMC)
39. New England Collaborative Data Management Curriculum
(NECDMC)
Background
Goals
• Built on Frameworks for Data
Management Curriculum
(IMLS/NNLM NER)
• Present and future researchers,
students, and librarians data
management learning needs
• Address federal compliance/ open
data requirements
Project
•Phase 1: Collaboration with other
libraries
• Modules, lesson plans,
instructional materials, research
cases; module evaluations
• Workshop resources and
community feedback created
Results
• Phase 2: Local piloting
• Phase 3: Piloting beyond
UMMS
• Evaluation, assessment,
feedback
NECDMC
(CC-BY)
40. Assessment
NECDMC Pilot Sites
(22 sites)
300+
Phase 1 - 20 questions, evaluations on
NECDMC site
Phase 2 – 12 questions, paper-based
Train-the-Trainers
(2 workshops, 2013 & 2014) 80
11-15 questions per module, paper-based
handout
Flexible Clinical Experience (FCE) 1
Two electronic evaluation instruments
provided by School of Medicine (SOM):
one for instructors (8 quest.), one for
student
(22 questions, including open-ended)
Type of Education Attendees Assessment
41. Lessons Learned
NECDMC
Modular format; cases; website
with ready-to-go content
Too much content; updating Creating MOOC
Train-the-Trainers
Demonstrations of teaching cases
and teaching methods for different
audiences;
customizability/flexibility
Time concerns
Pre-class surveys and encouraging
customization
Flexible Clinical Experience
Incorporation of a research
project; use of research data
management tools (e.g.
LabArchives, DMPTool);
collaboration with UMMS IT
and IRB
Time investment and prep for
a 1 week course; branding;
needed more sophisticated
content; one student
Incorporating local examples of data
management content and activities;
increase marketing for course
43. Selected Resources
E-Science Portal for New England Librarians; See especially ‘Data Management’, ‘Data Literacy’, and ‘Professional Education’, http:
//esciencelibrary.umassmed.edu/
Integrating Data Management Tools into Research Data Management Instruction, Rebecca Reznik-Zellen, Lisa A. Palmer and Julie
Goldman, Poster Session, University of Massachusetts and New England Area e-Science Symposium, April 6, 2016. http:
//escholarship.umassmed.edu/escience_symposium/2016/posters/8/
Ishida, Mayu. 2014. "The New England Collaborative Data Management Curriculum Pilot at the University of Manitoba: A Canadian
Experience." Journal of eScience Librarianship 3(1): e1061. http://dx.doi.org/10.7191/jeslib.2014.1061
Kafel, Donna, Myrna E. Morales, Robert J. Vander Hart, Sally A. Gore, Andrew Creamer, Javier Crespo, and Elaine R. Martin. 2012.
"Building an e-Science Portal for Librarians: A Model of Collaboration." Journal of eScience Librarianship 1(1): e1005. http://dx.doi.
org/10.7191/jeslib.2012.1005
New England Collaborative Data Management Curriculum (NECDMC); See especially ‘Resources from Workshops’, ‘Community
Bulletin Board’, and ‘Join the Collaboration’, http://library.umassmed.edu/necdmc/index
New England Region eScience Program (Data Management Curriculum and Data Tools Forum)
https://www.youtube.com/channel/UCmwetQc2pcwAwJYPDPucJIg
Piorun, Mary E., Donna Kafel, Tracey Leger-Hornby, Siamak Najafi, Elaine R. Martin, Paul Colombo, and Nancy R. LaPelle. 2012.
"Teaching Research Data Management: An Undergraduate/Graduate Curriculum." Journal of eScience Librarianship 1(1): e1003. http:
//dx.doi.org/10.7191/jeslib.2012.1003
Peters, Christie, and Porcia Vaughn. 2014. "Initiating Data Management Instruction to Graduate Students at the University of
Houston Using the New England Collaborative Data Management Curriculum." Journal of eScience Librarianship 3(1): e1064. http:
//dx.doi.org/10.7191/jeslib.2014.1064
Read, Kevin, Andrew T. Creamer, Donna Kafel, Robert J. Vander Hart, and Elaine R. Martin. 2013. "Building an eScience Thesaurus for
Librarians: A Collaboration Between the National Network of Libraries of Medicine, New England Region and an Associate Fellow at
the National Library of Medicine." Journal of eScience Librarianship 2(2): e1049. http://dx.doi.org/10.7191/jeslib.2013.1049
49. Origins
A ‘chance’ meeting: DigCCurr Professional Institute
(2009/2010)
● Dianne Dietrich - Cornell
● Susan Parham - Georgia Tech
● Jake Carlson - Michigan
● Brian Westra - Oregon
● Barrie Hayes - UNC Chapel Hill
52. Outcomes
Confluence wiki → Wordpress web site
Email list → Google Group
Informal practice → Code of Conduct
Events list Wiki page → Drupal events calendar
54. Opportunities and Challenges
80/20
Growing membership, and growing the membership
Volunteer leadership
Unaffiliated
Expansion in lists, interest and working groups, meetings
57. Discussion
● How others can do the same or be involved, new initiatives, future
training
● Thoughts on how we can create, or whether or not we need, a more
cohesive research data community of practice
● Where is there interest in a data services related topic - but little
opportunity for support/training? (such as: reaching out to faculty,
starting the conversations w/the research office, curating sensitive data,
de-identifying/anonymizing data)
58. Thank you!
Andrew Johnson
Andrew.M.Johnson@colorado.edu
Sherry Lake
sah@virginia.edu
Regina Fisher Raboin
regina.raboin@umassmed.edu
Brianna Marshall
brianna.marshall@wisc.edu
Brian Westra
bwestra@uoregon.edu
Cynthia Hudson-Vitale
chudson@wustl.edu