COMING TO AN UNDERSTANDING
A Cross-institutional Examination of Assessments of Data Curation Needs
Jake Carlson - Purdue University
Dianne Dietrich - Cornell University
Gail Steinhart - Cornell University
Alison Valk - Georgia Institute of Technology
Stephanie Wright - University of Washington
Dianne Dietrich
Planning & Data Management
Plans
May 2010: NSF press release indicating intent to require data management plans with grant proposals.
October 2010: NSF releases specifics for data management plan requirement.
December 2010: Cornell survey distributed to PIs and Co-PIs of NSF grants.
January 2011: NSF requirement goes into effect.
Planning and Data Management
Plans
• How prepared are researchers to address data management plan requirements?
• What is the potential impact of researcher plans on existing Cornell services?
Planning and Data Management
Plans
[Figure: Percentage of respondents who answered "I'm not sure" for questions where that was an option. Each bar represents a question where respondents were asked to select "Yes", "No", or "I'm not sure". Axis: 0% to 100%.]
Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Researcher Readiness to Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
Planning and Data Management
Plans
[Figure: Responses to the question: "Given the NSF expectation to share data ... how much data would you intend to share?" Categories: No data; Up to 1 GB; 1 GB - 100 GB; 100 GB - 1 TB; 1 TB - 100 TB; More than 100 TB. Axis: 0% to 50%.]
Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Researcher Readiness to Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
Planning and Data Management
Plans
Have you produced or do you anticipate producing metadata for this project?
• I do plan to create metadata: 42%
• I'm not sure if I plan to create metadata: 32%
• I do not plan to create metadata: 26%
If you plan on creating metadata, does it conform to known standards in your discipline?
• Yes: 30%
• I'm not sure: 61%
• No: 9%
Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Researcher Readiness to Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
Planning and Data Management
Plans
[Figure: Anticipated Backup Strategy by Size of Data. Axes: backup strategy (own infrastructure, campus solution, commercial solution) and number of responses (0 to 70). Series by data size: Up to 1 GB; 1 GB - 100 GB; 100 GB - 1 TB; 1 TB - 100 TB; More than 100 TB.]
Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Researcher Readiness to Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
Stephanie Wright
Management
Management: UW
• Background
• Services
• Survey & Interviews
Management: Organization
Survey
• Guidance on data organization (file structure, file naming, etc.) ranked 13th out of 14
• Tracking updates to data (versioning) ranked 8th
Image Credit: radrice “data cat finds no data”
http://blog.looxii.com/wp-content/uploads/2011/06/new-data-cat.jpg
Management: Organization
Interviews
• Whatever makes sense to organizer
• More planning, better organization
• Especially true of larger, well-funded projects
“But that really was sort of something we addressed after the fact, after we started to go, ‘Huh, I’m naming them this way, you’re naming them that way, and I have no idea what your naming conventions mean.’”
Management: Description
Survey
 1/3 didn‟t know of
metadata standard
 16% were able to
identify metadata
standard
 Metadata service
ranked 10th out of 14
Image & Quote Credit: NYU Health Sciences Libraries “Data Sharing and Management
Snafu in 3 Short Acts” http://www.youtube.com/watch?v=N2zK3sAtr-4
“Everything you need to know about the data is in the article.”
Management: Description
Interviews
• Documentation is biggest challenge in data management
• Recognize role of metadata
• Time consuming, no immediate benefit
• Data planning vs. data forensics
“If I was gonna make (the data) available to other people, I would feel some responsibility in documenting it a little bit better.” (Social Sciences)
Management: Summary
Services needed:
• Training on best practices or general strategies
• Tools that integrate description and organization of data into the workflow
“I kind of feel like we’re just making our way through the wilderness. And if there were somebody who could kind of hold our hands and say, ‘Look, data management is important and here are some strategies for going about it…’ That would be great.”
Jake Carlson
Sharing
Sharing: Purdue
Background on Purdue’s work:
Primarily Interview Driven
• Data Curation Profiles
• Data Management Plans
• Data Information Literacy
Sharing
• Willingness to Share
Generally, faculty are open to sharing their data with others.
There is an “underground economy” of data sharing.
Factors in deciding whether or not to share:
o What will this person do with my data?
o How much time & effort will it take me?
Image Credit: andrew_mc_d “Share” http://www.flickr.com/photos/andrew_mc_d/452728652/
Sharing
• Control
Issues in sharing data publicly:
o Timing - control over when to release data
o Use - if anyone can get the data, anyone can use it for whatever they want
o Misinterpretation - there’s no guarantee that someone won’t misconstrue the data
Sharing
• Attribution
Generally expressed as need for others to cite the data set (though not always)
“So for in my personal opinion, data citations won’t help me too much. Paper citations count for everything. It counts for impact of the paper, it counts for tenure, it counts for the profile of my work.”
- Professor of Biochemistry
Sharing
• Documentation and Description
"If you ask someone if you can see their raw data, you might as well be asking if you can look at their underwear. It's really problematic."
- Agronomy Professor
Sharing
• Services for Data Sharing at Purdue
Consultation & Collaboration with Data Producers
• Support "local" sharing
o Workflows
o Documentation
o Description
• Support "external" sharing
o Workflows
o Documentation
o Description
Alison Valk
Preservation
Background
“Develop campus partnerships to collect, manage, share, and preserve Georgia Tech digital research data.”
“Improve and develop new resources & services to assist researchers with data stewardship”
Preservation
IRB-approved research to determine
gaps in data curation services
provided to researchers.
Data assessment survey
Series of campus wide interviews
NSF DMP content analysis
Preservation
By combining information gathered via the survey and the interviews, we developed a clearer picture of the research data curation needs on campus.
Out of 77 who completed the survey:
o 44 agreed to be interviewed
o 26 interviews completed
Preservation
Interview Team
Chris Doty
Susan Parham
Elizabeth Rolando
Alison Valk
10 Interview questions
“How important is it for you to
archive / preserve your data?”
“How important is it for you or
others to have access to your data
over the long-term?”
Preservation
Transcribe interviews
Web application for Qualitative & Mixed Methods research
Visualize major discussion points or code correlations
Code
Correlation between the cost of working with data and how strongly participants feel data should be preserved…
Preservation
Storage prices are no longer cost-prohibitive
Preservation
Lack of metadata or curation = unusable data
Data is often “lost” when project participants such as grad students leave the institution
Computing professor: “I don’t want to micromanage my research assistants”
Preservation
Some researchers are using cloud-based tools, such as Dropbox, for archiving, with little concern for the associated security risks.
Preservation
Next Steps:
Select Case studies
o Researchers have volunteered to allow us to archive their research data.
Increased Outreach - New Services
o Customized DMPTool
o Departmental Data Management Workshops
o More robust web presence
o Proof-of-concept Library hosted Research Data Repository
Preservation
Questions?
• Jake Carlson, @jrcarlso, jakecarlson@purdue.edu
• Dianne Dietrich, @nemka, dd388@cornell.edu
• Gail Steinhart, @gailst, gss1@cornell.edu
• Alison Valk, @valkcano, alison.valk@library.gatech.edu
• Stephanie Wright, @shefw, swright@uw.edu

Weitere ähnliche Inhalte

Was ist angesagt?

Data management plans
Data management plansData management plans
Data management plansBrad Houston
 
Data Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesData Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesIUPUI
 
Data-Ed: Unlock Business Value through Data Quality Engineering
Data-Ed: Unlock Business Value through Data Quality EngineeringData-Ed: Unlock Business Value through Data Quality Engineering
Data-Ed: Unlock Business Value through Data Quality EngineeringDATAVERSITY
 
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
In Search of a Missing Link in the Data Deluge vs. Data Scarcity DebateIn Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
In Search of a Missing Link in the Data Deluge vs. Data Scarcity DebateNeuroscience Information Framework
 
Linking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual ArchivesLinking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual ArchivesMicah Altman
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ LibraryARDC
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsfBrad Houston
 
Publishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsPublishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsARDC
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data ManagementAmanda Whitmire
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycleMarieke Guy
 
It, Innovation, And Leadership
It, Innovation, And LeadershipIt, Innovation, And Leadership
It, Innovation, And Leadershipbbutler
 
"Reproducibility from the Informatics Perspective"
"Reproducibility from the Informatics Perspective""Reproducibility from the Informatics Perspective"
"Reproducibility from the Informatics Perspective"Micah Altman
 
data management Wb
data management Wbdata management Wb
data management WbSurojit Saha
 
To architect or engineer? Lessons from DataPool on building RDM repositories
To architect or engineer? Lessons from DataPool on building RDM repositoriesTo architect or engineer? Lessons from DataPool on building RDM repositories
To architect or engineer? Lessons from DataPool on building RDM repositoriesjiscdatapool
 
Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6ARDC
 
Data management plan template
Data management plan templateData management plan template
Data management plan template501 Commons
 
Winter school in research data science research data management - final
Winter school in research data science research data management - finalWinter school in research data science research data management - final
Winter school in research data science research data management - finalARDC
 

Was ist angesagt? (20)

Data management plans
Data management plansData management plans
Data management plans
 
Data Management Lab: Session 1 Slides
Data Management Lab: Session 1 SlidesData Management Lab: Session 1 Slides
Data Management Lab: Session 1 Slides
 
Data-Ed: Unlock Business Value through Data Quality Engineering
Data-Ed: Unlock Business Value through Data Quality EngineeringData-Ed: Unlock Business Value through Data Quality Engineering
Data-Ed: Unlock Business Value through Data Quality Engineering
 
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
In Search of a Missing Link in the Data Deluge vs. Data Scarcity DebateIn Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
In Search of a Missing Link in the Data Deluge vs. Data Scarcity Debate
 
Linking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual ArchivesLinking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual Archives
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ Library
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Publishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsPublishing perspectives on data management & future directions
Publishing perspectives on data management & future directions
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
It, Innovation, And Leadership
It, Innovation, And LeadershipIt, Innovation, And Leadership
It, Innovation, And Leadership
 
"Reproducibility from the Informatics Perspective"
"Reproducibility from the Informatics Perspective""Reproducibility from the Informatics Perspective"
"Reproducibility from the Informatics Perspective"
 
data management Wb
data management Wbdata management Wb
data management Wb
 
To architect or engineer? Lessons from DataPool on building RDM repositories
To architect or engineer? Lessons from DataPool on building RDM repositoriesTo architect or engineer? Lessons from DataPool on building RDM repositories
To architect or engineer? Lessons from DataPool on building RDM repositories
 
2015 04-18-wilson cg
2015 04-18-wilson cg2015 04-18-wilson cg
2015 04-18-wilson cg
 
Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6
 
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
 
Data management plan template
Data management plan templateData management plan template
Data management plan template
 
Winter school in research data science research data management - final
Winter school in research data science research data management - finalWinter school in research data science research data management - final
Winter school in research data science research data management - final
 

Andere mochten auch

Why should researchers care about data curation?
Why should researchers care about data curation?Why should researchers care about data curation?
Why should researchers care about data curation?Varsha Khodiyar
 
data curation issues
data curation issuesdata curation issues
data curation issuesMichelle Hudson
 
Kurator: Towards Data Curation for Mere Mortals
Kurator: Towards Data Curation for Mere MortalsKurator: Towards Data Curation for Mere Mortals
Kurator: Towards Data Curation for Mere MortalsBertram Ludäscher
 
Data Curation @ SpazioDati - NEXA Lunch Seminar
Data Curation @ SpazioDati - NEXA Lunch SeminarData Curation @ SpazioDati - NEXA Lunch Seminar
Data Curation @ SpazioDati - NEXA Lunch SeminarSpazioDati
 
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...Kamel Mansouri
 

Andere mochten auch (6)

Why should researchers care about data curation?
Why should researchers care about data curation?Why should researchers care about data curation?
Why should researchers care about data curation?
 
data curation issues
data curation issuesdata curation issues
data curation issues
 
Kurator: Towards Data Curation for Mere Mortals
Kurator: Towards Data Curation for Mere MortalsKurator: Towards Data Curation for Mere Mortals
Kurator: Towards Data Curation for Mere Mortals
 
Data Curation @ SpazioDati - NEXA Lunch Seminar
Data Curation @ SpazioDati - NEXA Lunch SeminarData Curation @ SpazioDati - NEXA Lunch Seminar
Data Curation @ SpazioDati - NEXA Lunch Seminar
 
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
 
Johnston - How to Curate Research Data
Johnston - How to Curate Research DataJohnston - How to Curate Research Data
Johnston - How to Curate Research Data
 

Ähnlich wie Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs

Research Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesResearch Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesRebekah Cummings
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017ARDC
 
NCME Big Data in Education
NCME Big Data  in EducationNCME Big Data  in Education
NCME Big Data in EducationPhilip Piety
 
Data management plans
Data management plansData management plans
Data management plansBrad Houston
 
Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachMegan O'Donnell
 
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...University of California Curation Center
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open ScienceMark Parsons
 
Presentation For Gene S Revision 3
Presentation For Gene S Revision 3Presentation For Gene S Revision 3
Presentation For Gene S Revision 3WSU Cougars
 
Practical Research Data Management: tools and approaches, pre- and post-award
Practical Research Data Management:  tools and approaches, pre- and post-awardPractical Research Data Management:  tools and approaches, pre- and post-award
Practical Research Data Management: tools and approaches, pre- and post-awardMartin Donnelly
 
Research process and research data management
Research  process and research data managementResearch  process and research data management
Research process and research data managementKen Chad Consulting Ltd
 
UKSG 2014 Breakout Session - Westminster Research Process and Research Data
UKSG 2014 Breakout Session - Westminster Research Process and Research DataUKSG 2014 Breakout Session - Westminster Research Process and Research Data
UKSG 2014 Breakout Session - Westminster Research Process and Research DataUKSG: connecting the knowledge community
 
Institutional Data Management Blueprint
Institutional Data Management BlueprintInstitutional Data Management Blueprint
Institutional Data Management BlueprintJisc
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012IUPUI
 
Predictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallPredictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallDATAVERSITY
 
Responsible conduct of research: Data Management
Responsible conduct of research: Data ManagementResponsible conduct of research: Data Management
Responsible conduct of research: Data ManagementC. Tobin Magle
 

Ähnlich wie Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs (20)

McGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and ScalingMcGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and Scaling
 
METRO RDM Webinar
METRO RDM WebinarMETRO RDM Webinar
METRO RDM Webinar
 
Research Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesResearch Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and Humanities
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
 
NCME Big Data in Education
NCME Big Data  in EducationNCME Big Data  in Education
NCME Big Data in Education
 
Data management plans
Data management plansData management plans
Data management plans
 
Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approach
 
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
 
Presentation For Gene S Revision 3
Presentation For Gene S Revision 3Presentation For Gene S Revision 3
Presentation For Gene S Revision 3
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Practical Research Data Management: tools and approaches, pre- and post-award
Practical Research Data Management:  tools and approaches, pre- and post-awardPractical Research Data Management:  tools and approaches, pre- and post-award
Practical Research Data Management: tools and approaches, pre- and post-award
 
Research process and research data management
Research  process and research data managementResearch  process and research data management
Research process and research data management
 
UKSG 2014 Breakout Session - Westminster Research Process and Research Data
UKSG 2014 Breakout Session - Westminster Research Process and Research DataUKSG 2014 Breakout Session - Westminster Research Process and Research Data
UKSG 2014 Breakout Session - Westminster Research Process and Research Data
 
Institutional Data Management Blueprint
Institutional Data Management BlueprintInstitutional Data Management Blueprint
Institutional Data Management Blueprint
 
Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012Meeting the NSF DMP Requirement June 13, 2012
Meeting the NSF DMP Requirement June 13, 2012
 
Predictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal BallPredictive Analytics - How to get stuff out of your Crystal Ball
Predictive Analytics - How to get stuff out of your Crystal Ball
 
Responsible conduct of research: Data Management
Responsible conduct of research: Data ManagementResponsible conduct of research: Data Management
Responsible conduct of research: Data Management
 

Mehr von Stephanie Wright

Open Curriculum For Open Data Training
Open Curriculum For Open Data TrainingOpen Curriculum For Open Data Training
Open Curriculum For Open Data TrainingStephanie Wright
 
University of Washington Research Commons
University of Washington Research CommonsUniversity of Washington Research Commons
University of Washington Research CommonsStephanie Wright
 
Riding the Wave: Learning to Surf the Data Deluge
Riding the Wave: Learning to Surf the Data DelugeRiding the Wave: Learning to Surf the Data Deluge
Riding the Wave: Learning to Surf the Data DelugeStephanie Wright
 
Building Your Data Management Toolbox
Building Your Data Management ToolboxBuilding Your Data Management Toolbox
Building Your Data Management ToolboxStephanie Wright
 
Trailblazing in the Wilderness of Data Management
Trailblazing in the Wilderness of Data ManagementTrailblazing in the Wilderness of Data Management
Trailblazing in the Wilderness of Data ManagementStephanie Wright
 
UW Libraries Data Services Forum
UW Libraries Data Services ForumUW Libraries Data Services Forum
UW Libraries Data Services ForumStephanie Wright
 
Data Management: Tips & Tools
Data Management: Tips & ToolsData Management: Tips & Tools
Data Management: Tips & ToolsStephanie Wright
 

Mehr von Stephanie Wright (7)

Open Curriculum For Open Data Training
Open Curriculum For Open Data TrainingOpen Curriculum For Open Data Training
Open Curriculum For Open Data Training
 
University of Washington Research Commons
University of Washington Research CommonsUniversity of Washington Research Commons
University of Washington Research Commons
 
Riding the Wave: Learning to Surf the Data Deluge
Riding the Wave: Learning to Surf the Data DelugeRiding the Wave: Learning to Surf the Data Deluge
Riding the Wave: Learning to Surf the Data Deluge
 
Building Your Data Management Toolbox
Building Your Data Management ToolboxBuilding Your Data Management Toolbox
Building Your Data Management Toolbox
 
Trailblazing in the Wilderness of Data Management
Trailblazing in the Wilderness of Data ManagementTrailblazing in the Wilderness of Data Management
Trailblazing in the Wilderness of Data Management
 
UW Libraries Data Services Forum
UW Libraries Data Services ForumUW Libraries Data Services Forum
UW Libraries Data Services Forum
 
Data Management: Tips & Tools
Data Management: Tips & ToolsData Management: Tips & Tools
Data Management: Tips & Tools
 

KĂźrzlich hochgeladen

Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Multi Domain Alias In the Odoo 17 ERP Module
Multi Domain Alias In the Odoo 17 ERP ModuleMulti Domain Alias In the Odoo 17 ERP Module
Multi Domain Alias In the Odoo 17 ERP ModuleCeline George
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...Nguyen Thanh Tu Collection
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
week 1 cookery 8 fourth - quarter .pptx
week 1 cookery 8  fourth  -  quarter .pptxweek 1 cookery 8  fourth  -  quarter .pptx
week 1 cookery 8 fourth - quarter .pptxJonalynLegaspi2
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs

  • 1. COMING TO AN UNDERSTANDING A Cross-institutional Examination of Assessments of Data Curation Needs Jake Carlson - Purdue University Dianne Dietrich - Cornell University Gail Steinhart - Cornell University Alison Valk - Georgia Institute of Technology Stephanie Wright - University of Washington
  • 2. Dianne Dietrich Planning & Data Management Plans
  • 3. Planning and Data Management Plans May 2010 October 2010 December 2010 January 2011 NSF press release indicating intent to require data management plans with grant proposals. NSF releases specifics for data management plan requirement. Cornell survey distributed to PIs and Co-PIs of NSF grants. NSF requirement goes into effect.
  • 4. Planning and Data Management Plans: How prepared are researchers to address data management plan requirements? What is the potential impact of researcher plans on existing Cornell services?
  • 5. Planning and Data Management Plans [Bar chart, 0% to 100%: percentage of respondents who answered "I'm not sure" for questions where that was an option. Each bar represents a question where respondents were asked to select "Yes", "No", or "I'm not sure".] Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Research Readiness to Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
  • 6. Planning and Data Management Plans [Bar chart: responses to the question "Given the NSF expectation to share data ... how much data would you intend to share?" Answer categories: No data; Up to 1 GB; 1 GB - 100 GB; 100 GB - 1 TB; 1 TB - 100 TB; More than 100 TB.] Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Research Readiness to Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
  • 7. Planning and Data Management Plans Have you produced or do you anticipate producing metadata for this project? I do plan to create metadata: 42%; I'm not sure if I plan to create metadata: 32%; I do not plan to create metadata: 26%. If you plan on creating metadata, does it conform to known standards in your discipline? Yes: 30%; I'm not sure: 61%; No: 9%. Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Research Readiness to Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
  • 8. Planning and Data Management Plans Anticipated Backup Strategy by Size of Data [Bar chart: number of responses for each backup strategy (own infrastructure, campus solution, commercial solution), broken down by data size (Up to 1 GB; 1 GB - 100 GB; 100 GB - 1 TB; 1 TB - 100 TB; More than 100 TB).] Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Research Readiness to Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
  • 10. Management: UW Background; Services; Survey & Interviews
  • 11. Management: Organization Survey Guidance on data organization (file structure, file naming, etc.) ranked 13th out of 14. Tracking updates to data (versioning) ranked 8th. Image Credit: radrice “data cat finds no data” http://blog.looxii.com/wp-content/uploads/2011/06/new-data-cat.jpg
  • 12. Management: Organization Interviews Whatever makes sense to organizer. More planning, better organization. Especially true of larger, well-funded projects. “But that really was sort of something we addressed after the fact, after we started to go, ‘Huh, I’m naming them this way, you’re naming them that way, and I have no idea what your naming conventions mean.’”
  • 13. Management: Description Survey 1/3 didn’t know of metadata standard. 16% were able to identify metadata standard. Metadata service ranked 10th out of 14. “Everything you need to know about the data is in the article.” Image & Quote Credit: NYU Health Sciences Libraries “Data Sharing and Management Snafu in 3 Short Acts” http://www.youtube.com/watch?v=N2zK3sAtr-4
  • 14. Management: Description Interviews Documentation is biggest challenge in data management. Recognize role of metadata. Time consuming, no immediate benefit. Data planning vs. data forensics. “If I was gonna make (the data) available to other people, I would feel some responsibility in documenting it a little bit better.” (Social Sciences)
  • 15. Management: Summary Services needed: Training on best practices or general strategies; Tools that integrate description and organization of data into the workflow. “I kind of feel like we’re just making our way through the wilderness. And if there were somebody who could kind of hold our hands and say, ‘Look, data management is important and here are some strategies for going about it…’ That would be great.”
  • 17. Sharing: Purdue Background on Purdue’s work: Primarily Interview Driven • Data Curation Profiles • Data Management Plans • Data Information Literacy
  • 18. Sharing: Willingness to Share Generally, faculty are open to sharing their data with others. There is an “underground economy” of data sharing. Factors in deciding whether or not to share: What will this person do with my data? How much time & effort will it take me? Image Credit: andrew_mc_d “Share” http://www.flickr.com/photos/andrew_mc_d/452728652/
  • 20. Sharing: Control Issues in sharing data publicly: Timing - control over when to release data. Use - if anyone can get the data, anyone can use it for whatever they want. Misinterpretation - there’s no guarantee that someone won’t misconstrue the data.
  • 21. Sharing: Attribution Generally expressed as need for others to cite the data set (though not always). “So for in my personal opinion, data citations won’t help me too much. Paper citations count for everything. It counts for impact of the paper, it counts for tenure, it counts for the profile of my work.” - Professor of Biochemistry
  • 22. Sharing: Documentation and Description "If you ask someone if you can see their raw data, you might as well be asking if you can look at their underwear. It's really problematic." - Agronomy Professor
  • 23. Sharing: Services for Data Sharing at Purdue Consultation & Collaboration with Data Producers. Support "local" sharing: Workflows, Documentation, Description. Support "external" sharing: Workflows, Documentation, Description.
  • 25. Background “Develop campus partnerships to collect, manage, share, and preserve Georgia Tech digital research data.” “Improve and develop new resources & services to assist researchers with data stewardship.” Preservation
  • 26. IRB-approved research to determine gaps in data curation services provided to researchers: Data assessment survey; Series of campus-wide interviews; NSF DMP content analysis. Preservation
  • 27. By combining information gathered via the survey and the interviews, we developed a clearer picture of the research data curation needs on campus. Out of 77 who completed the survey: 44 agreed to be interviewed; 26 interviews completed. Preservation
  • 28. Interview Team: Chris Doty, Susan Parham, Elizabeth Rolando, Alison Valk. 10 interview questions, including: “How important is it for you to archive / preserve your data?” “How important is it for you or others to have access to your data over the long-term?” Process: Transcribe interviews; Code (web application for Qualitative & Mixed Methods research); Visualize major discussion points or code correlations. Preservation
  • 29. Correlation between the cost of working with data and how strongly participants feel data should be preserved… Preservation
  • 30. Storage prices are no longer cost prohibitive. Preservation
  • 31. Lack of metadata or curation = unusable data. Data is often “lost” when project participants such as grad students leave the institution. Computing professor: “I don’t want to micromanage my research assistants.” Preservation
  • 32. Some researchers are using cloud-based tools, such as Dropbox, for archiving, with little concern for the associated security risks. Preservation
  • 33. Next Steps: Select Case Studies: researchers have volunteered to allow us to archive their research data. Increased Outreach, New Services: Customized DMPTool; Departmental Data Management Workshops; More robust web presence; Proof-of-concept Library-hosted Research Data Repository. Preservation
  • 34. Questions? Jake Carlson, @jrcarlso, jakecarlson@purdue.edu; Dianne Dietrich, @nemka, dd388@cornell.edu; Gail Steinhart, @gailst, gss1@cornell.edu; Alison Valk, @valkcano, alison.valk@library.gatech.edu; Stephanie Wright, @shefw, swright@uw.edu