Trailblazing in the Wilderness of
Data Management
Where are we going and how do we get
there from here?
Stephanie Wright
Data Services Coordinator
University of Washington Libraries
AGENDA
• Definitions
• Why venture out
• Paths already taken
–Assessments of needs
–Existing programs
–Tools & resources
• Blazing your own trail
Montana State University – 21 June 2013
Definitions
• Data
• Data Management
• Big Data
• Long Tail of Data
• Acronyms
www.lib.washington.edu
Definitions
DATA
By data, we do not mean a synonym for information. We
mean research data, that which is collected, observed,
or created, for purposes of analyzing to produce
original research results.
Research data may be created in tabular, textual,
statistical, numeric, geospatial, image, multimedia
or other formats.
(Adapted from DISC-UK DataShare Project, p. 16)
Definitions
DATA
Data can be produced from a variety of processes
(e.g., observation, experimentation, simulation,
derivation, compilation), represented in numerous
forms and stored in many digital formats (e.g.,
ASCII, PDF, SPSS, Excel, TIFF, Java, FITS, CIF, ZVI).
The scope of this definition includes data from
disciplines in the sciences, social sciences, and
humanities.
(Adapted from MIT Libraries, “What is Data?”, 2009)
Definitions
DATA MANAGEMENT
Pertains to the collection, cleaning, storage, sharing,
access, disposal, preservation and/or archiving of
research data.
(Adapted from University of North Carolina, Research Data Stewardship
Report, 2012)
Definitions
BIG DATA
• Volume
• Velocity
• Variety
25 Definitions of Big Data:
http://www.opentracker.net/article/25-definitions-big-data
– Now over 30 definitions
Definitions
LONG TAIL OF DATA
Image credit: disruptormonkey.typepad.com
Acronyms
• RDM – Research Data Management
• IR – Institutional Repository
• DR – Data Repository
• DMP – Data Management Plan
Why Venture Out
• Funding agencies
• Universities
• Researchers
• Libraries
Image credit: National Park Service, Yellowstone photo collection,
(http://www.nps.gov/features/yell/slidefile/mammals/wolf/Images/15314.jpg)
Funding Agencies
• 1998: NSF
• 2003: NIH
• 2011: NSF
• 2013: NSF, OSTP, OMB, NIH
Universities
• Competitiveness
• Reduce duplication of effort
• Preserve the research record of the
institution
• Encourage innovation & discovery
Researchers
• Verifiability & reproducibility
• Increased citation rates for publications
– (Piwowar et al., 2007)
• Preservation of individual scholarly record
• Save time by planning early
Libraries
• Digital Preservation Network (DPN)
“The Digital Preservation Network is being
created by research-intensive universities to
ensure long-term preservation of the complete
digital scholarly record.”
http://d-p-n.org/
Libraries
NSF Proposal & Award Policies &
Procedures Guide (Oct 2012)
“Instructions for preparation of the
Biographical Sketch have been revised to
rename the "Publications" section to
"Products" ....
(P)roducts may include, but are not limited
to, publications, data sets, software,
patents, and copyrights.”
Paths Already Taken
• Assessments
• Existing programs
• Tools & Resources
Image credit: John W. Ridge
(http://commons.wikimedia.org/wiki/File:Yellowstone_Trail_Map.jpg)
Assessments
• UNC (2012) “Research Data Stewardship
Report”
• University of Colorado Boulder (2012)
“Research Data Management @ UCB”
• Purdue “Data Curation Profiles Directory”
(http://docs.lib.purdue.edu/dcp/)
• More: Georgia Tech, Cornell, Houston,
Oregon….
Findings
• Researchers use a wide variety of data
types – across disciplines
• Most researchers rely on themselves for
data management
• Researchers want to maintain control of
their data
• Many are unaware of existing services
• They want tools that work in existing
workflows
What’s Needed
• Creating & maintaining DMPs
• Best practices guidance all along lifecycle
• Storage
– Short-term access
– Long-term access
– Backup
– Versioning
– Security
• Metadata creation
Existing Programs
• Cornell
– Research Data Management Service Group
• Sr VP for Research and University Librarian
• Faculty Advisory Board
– 9 faculty across disciplines
– OSP & Office of Research Integrity & Assurance
• Management Council
– 2 librarians, 2 faculty, 2 IT, 1 research institute
Existing Programs
• Purdue
– D2C2: Distributed Data Curation Center
• Executive Committee
– Dean of Libraries, VP of Research & VP of IT
• Library: consulting & metadata support
• IT: storage & research computing support
Existing Programs
• University of Washington
– Data Services Program (1.5 FTE)
• Data Services Coordinator
• Data Services Communications & Curriculum Librarian
– Data Services Team (10 members)
– Partnerships
• Research Centers (eSci, CSDE, IHME)
• Office of Research (OSP)
• Campus IT
• iSchool
Tools & Resources
• Data Mgmt Planning: DMPTool
• Metadata & Sharing: DataUP
• Sharing & Storage: DataBib
• Citation: EZID
• Best Practices: DMVitals
Blazing Your Own Trail
Image credit: Michigan State University Department of History,
HST 321: History of the American West
(http://history.msu.edu/hst321/files/2010/07/colter.jpg)
Where do you want to go?
• Identify needs
• Consider potential partners
• Scope
– Disciplines
– Specific areas of the data lifecycle
• Determine priorities
– New services? Enhance existing? Market existing?
MSU Strategic Plan
• Objective L1
– Assess and improve where needed, student
learning of critical knowledge & skills
• Objective D1
– Elevate the research excellence and
recognition of MSU faculty
• D1.2
• Objective D2
– Enhance infrastructure in support of research,
discovery and creative activities
Campus IT
• Support for active data storage
• Data security guidance
• Backup services
• Development of tools that can be
inserted into existing workflows
Office of Research
• Guidance on legal / ethical
considerations
• Incorporate DM planning into
grant submission process
• New faculty data management
orientations
Libraries
• Market and provide access to
existing RDM resources
• Provide learning opportunities on
RDM best practices
• DMP consultation
• Storage (final)
• Metadata consultation
University
• University policy on data
management
• Integrate RDM activities into T&P
process
• Consider campus policy on open
data
Questions
Stephanie Wright
Data Services Coordinator
swright@uw.edu
@shefw
http://guides.lib.washington.edu/swright
Data Management Guide
http://guides.lib.washington.edu/dmg
ResearchWorks Data Services
http://researchworks.lib.washington.edu/rw-data.html

Holdier Curriculum Vitae (April 2024).pdf
 

Trailblazing in the Wilderness of Data Management

  • 1. Trailblazing in the Wilderness of Data Management Where are we going and how do we get there from here. Stephanie Wright Data Services Coordinator University of Washington Libraries
  • 2. AGENDA • Definitions • Why venture out • Paths already taken –Assessments of needs –Existing programs –Tools & resources • Blazing your own trail Montana State University – 21 June 2013
  • 3. Definitions • Data • Data Management • Big Data • Long Tail of Data • Acronyms www.lib.washington.edu
  • 4. Definitions DATA By data, we do not mean a synonym for information. We mean research data, that which is collected, observed, or created, for purposes of analyzing to produce original research results. Research data may be created in tabular, textual, statistical, numeric, geospatial, image, multimedia or other formats. (Adapted from DISC-UK DataShare Project, p. 16)
  • 5. Definitions DATA Data can be produced from a variety of processes (e.g., observation, experimentation, simulation, derivation, compilation), represented in numerous forms and stored in many digital formats (e.g., ASCII, PDF, SPSS, Excel, TIFF, Java, FITS, CIF, ZVI). The scope of this definition includes data from disciplines in the sciences, social sciences, and humanities. (Adapted from MIT Libraries, “What is Data?”, 2009)
  • 6. Definitions DATA MANAGEMENT Pertains to the collection, cleaning, storage, sharing, access, disposal, preservation and/or archiving of research data. (Adapted from University of North Carolina, Research Data Stewardship Report, 2012)
  • 7. Definitions BIG DATA • Volume • Velocity • Variety 25 Definitions of Big Data: http://www.opentracker.net/article/25-definitions-big-data – Now over 30 definitions
  • 8. Definitions LONG TAIL OF DATA Image credit: disruptormonkey.typepad.com
  • 9. Acronyms • RDM – Research Data Management • IR – Institutional Repository • DR – Data Repository • DMP – Data Management Plan
  • 10. Why Venture Out • Funding agencies • Universities • Researchers • Libraries Image credit: National Park Service, Yellowstone photo collection, (http://www.nps.gov/features/yell/slidefile/mammals/wolf/Images/15314.jpg)
  • 11. Funding Agencies • 1998: NSF • 2003: NIH • 2011: NSF • 2013: NSF, OSTP, OMB, NIH
  • 12. Universities • Competitiveness • Reduce duplication of effort • Preserve the research record of the institution • Encourage innovation & discovery
  • 13. Researchers • Verifiability & reproducibility • Increased citation rates for publications – (Piwowar et al., 2007) • Preservation of individual scholarly record • Save time by planning early
  • 14. Libraries • Digital Preservation Network (DPN) “The Digital Preservation Network is being created by research-intensive universities to ensure long-term preservation of the complete digital scholarly record.” http://d-p-n.org/
  • 15. Libraries NSF Proposal & Award Policies & Procedures Guide (Oct 2012) “Instructions for preparation of the Biographical Sketch have been revised to rename the "Publications" section to "Products" .... (P)roducts may include, but are not limited to, publications, data sets, software, patents, and copyrights.”
  • 16. Paths Already Taken • Assessments • Existing programs • Tools & Resources Image credit: John W. Ridge (http://commons.wikimedia.org/wiki/File:Yellowstone_Trail_Map.jpg)
  • 17. Assessments • UNC (2012) “Research Data Stewardship Report” • University of Colorado Boulder (2012) “Research Data Management @ UCB” • Purdue “Data Curation Profiles Directory” (http://docs.lib.purdue.edu/dcp/) • More: Georgia Tech, Cornell, Houston, Oregon….
  • 18. Findings • Researchers use a wide variety of data types – across disciplines • Most researchers rely on themselves for data management • Researchers want to maintain control of their data • Many are unaware of existing services • They want tools that work in existing workflows
  • 19. What’s Needed • Creating & maintaining DMPs • Best practices guidance all along lifecycle • Storage – Short-term access – Long-term access – Backup – Versioning – Security • Metadata creation
  • 20. Existing Programs • Cornell – Research Data Management Service Group • Sr VP for Research and University Librarian • Faculty Advisory Board – 9 faculty across disciplines – OSP & Office of Research Integrity & Assurance • Management Council – 2 librarians, 2 faculty, 2 IT, 1 research institute
  • 21. Existing Programs • Purdue – D2C2: Distributed Data Curation Center • Executive Committee – Dean of Libraries, VP of Research & VP of IT • Library: consulting & metadata support • IT: storage & research computing support
  • 22. Existing Programs • University of Washington – Data Services Program (1.5 FTE) • Data Services Coordinator • Data Services Communications & Curriculum Libn – Data Services Team (10 members) – Partnerships • Research Centers (eSci, CSDE, IHME) • Office of Research (OSP) • Campus IT • iSchool
  • 23. Tools & Resources • Data Mgmt Planning: DMPTool • Metadata & Sharing: DataUp • Sharing & Storage: DataBib • Citation: EZID • Best Practices: DMVitals
  • 24. Blazing Your Own Trail Image credit: Michigan State University Department of History, HST 321: History of the American West (http://history.msu.edu/hst321/files/2010/07/colter.jpg)
  • 25. Where do you want to go? • Identify needs • Consider potential partners • Scope – Disciplines – Specific areas of the data lifecycle • Determine priorities – New services? Enhance existing? Market existing?
  • 26. MSU Strategic Plan • Objective L1 – Assess and improve where needed, student learning of critical knowledge & skills • Objective D1 – Elevate the research excellence and recognition of MSU faculty • D1.2 • Objective D2 – Enhance infrastructure in support of research, discovery and creative activities
  • 27. Campus IT • Support for active data storage • Data security guidance • Backup services • Development of tools that can be inserted into existing workflows
  • 28. Office of Research • Guidance on legal / ethical considerations • Incorporate DM planning into grant submission process • New faculty data management orientations
  • 29. Libraries • Market and provide access to existing RDM resources • Provide learning opportunities on RDM best practices • DMP consultation • Storage (final) • Metadata consultation
  • 30. University • University policy on data management • Integrate RDM activities into T&P process • Consider campus policy on open data
  • 32. Stephanie Wright Data Services Coordinator swright@uw.edu @shefw http://guides.lib.washington.edu/swright Data Management Guide http://guides.lib.washington.edu/dmg ResearchWorks Data Services http://researchworks.lib.washington.edu/rw-data.html

Editor's Notes

  1. Here is where I admit that perhaps my use of the terms trailblazing and wilderness of data mgmt might have been colored by the fact that y’all are so close to Yellowstone which has been one of my favorite places to visit since I was a child. But I defend my use of those words and hope to convince you over the next hour or so that I wasn’t really venturing too far into the realm of hyperbole when I came up with that title.
  2. Here is my map for this little journey. And here I want to take a moment to let you know that we have arranged for Q&A time at the end of my presentation portion but I also want you all to feel comfortable stopping me at any time and asking questions as I go along. Data management is a multi-faceted topic and I don’t want you to feel like you have to remember your questions until I’m done yakking and then say “Remember that slide you had up 20 minutes ago?” I also recognize that people are at varying levels of understanding of the issues surrounding data management. In reality, everyone is new to this. I understand not everyone reads data management needs assessments for fun. Please don’t be afraid to ask me to clarify anything.
  3. I don’t want to get bogged down in terminology & definitions, but I do want to make sure that I’m not speaking a different dialect or even a different language up here so I’ve outlined a few terms where I thought it might be useful to have some clarification.
  4. First, there’s “data”. You would not believe how many definitions there are for such a tiny word. This one used to be my favorite definition and was the one we used in the research data management needs survey we conducted last fall. It’s adapted from the DISC-UK DataShare Report and I like it because 1) it’s short and 2) it doesn’t overtly align itself to a particular discipline or data format. It can be textual, images, videos, computer models… it’s all data. And when we’re talking data services, at least at UW, we’re mostly looking at supporting digital data services. Even with this definition some folks (usually in the Humanities) don’t see what they do as “data”. So I’ve added another piece to my favorite definition.
  5. This is adapted from an MIT Libraries definition and I like it because it adds the variety of processes that can be used in the collection of data, as well as specifically stating that it is discipline agnostic. I don’t know if I would have gotten more responses from our Humanities colleagues on our survey if I had added this to the definition but when we get around to doing our focus groups with those researchers, I will ask them.
  6. Now there are many processes that data goes through. I already mentioned collection, but just as there is a lifecycle associated with research, there is also a lifecycle associated with data. There are a multitude of data lifecycle models out there. In essence, data management pertains to the various processes involved in managing data through the entire data lifecycle – from planning and collection, all the way through to preservation and archiving.
  7. This is not my favorite term but one hears it so much these days, I feel I need to talk about it. Many people refer to big data as high-volume, high-velocity, and/or high-variety information assets that require new forms of processing for decision making and insight. But definitions vary: large amounts of data (gigabytes, petabytes, yottabytes); highly complex sets of data / flat schemas, few complex interrelationships; loosely structured data… or highly structured; technology that handles large and complex data sets; a process for analyzing large and complex data sets; data sets that can generate insights previously impossible; the availability of massive amounts of data. In short, “big data” can mean any number of things, which is why I don’t use the term. So moving on.
  8. This is the term that probably requires the most explanation. You will probably most frequently hear it used in conjunction with the previous term, because this is usually what “big data” is not, and this graph actually explains it pretty well. The vertical axis (the up and down line) is Frequency of Use. The horizontal axis (side to side) is the total inventory of data: everything, all collected data. The green part represents datasets that are popular, widely used and well managed (think of all the climate data collected and maintained over a hundred years by the National Climatic Data Center). The yellow part represents datasets that are less frequently used and are managed in some informal manner (maybe on departmental shared network folders). The red part is the long tail. It’s data that is rarely used and not managed in any kind of organized fashion. And it’s estimated that it’s 80-85% of all data collected. It’s that red part where many organizations tend to focus with data services because that’s where the needs are greatest. It’s the data that a researcher collected 10 years, 5 years, 1 year ago that’s sitting on a floppy, a CD or a thumb drive in, or worse, under a researcher’s desk. You may notice that the size of the dataset is not represented on this graph. Size is not a factor in determining if a data set falls into the long tail.
  9. I try to avoid acronyms but after the first ten times, even I get tired of saying research data management over and over. I don’t think I need to define RDM any further since I already specified research data in my definition of data and defined data management. IR: a central location for storing an institution’s digital assets and intellectual outputs (e.g., MSU’s ScholarWorks). DR: a repository specifically designed for storage of and access to data sets; it can be part of an IR. DMP: a document outlining how a researcher plans to manage data during and after a research project, including how it will be organized, maintained and shared.
  10. Alright, definitions done. On with the meat of it. So why is data management such a hot topic? Why do we even need to do anything differently than we’ve been doing in the past? I’m going to break things down by the different players involved.
  11. As long as empirical research has existed, researchers have been doing “data management” in one form or another. However, funding agency mandates for doing formal data management are relatively recent. 1998 – NSF instituted DMP requirement. 2003 – NIH implemented data sharing policy. 2011 – NSF more strongly enforced DMP requirement. 2013 – NSF changed merit review criteria for grant proposals to allow inclusion of datasets (Jan?); OSTP mandate for public access to federally funded research (Feb); OMB mandate for government Open Data (May); NIH enforcing public-access policy. http://grants.nih.gov/grants/policy/data_sharing/ http://www.nsf.gov/pubs/policydocs/pappguide/nsf13001/gpg_sigchanges.jsp http://scholarlykitchen.sspnet.org/2013/02/25/expanding-public-access-to-the-results-of-federally-funded-research-first-impressions-on-the-us-governments-policy/ http://www.federalnewsradio.com/513/3316130/White-House-mandates-open-data-releases-new-tools http://www.nature.com/nm/journal/v19/n1/full/nm0113-3.html
  12. By providing support for data management, universities increase the competitiveness of their researchers for obtaining grants. They maximize the potential of researchers, who can reuse data already collected by others. I don’t think I need to say anything extra about that third point. Data management support also encourages innovation & discovery by allowing researchers to think of research questions in new ways using existing data.
  13. As it gets harder and harder to obtain dollars for research, researchers are under increasing scrutiny to be able to verify their research. By following data management best practices, you can produce your data and the associated documentation if needed to verify and reproduce your research. Heather Piwowar and friends published research in 2007 showing that publications with publicly available associated data had a 69% increase in citations. Let me tell you a story about a Nursing faculty member I interviewed as part of our RDM survey and follow-up interviews project this year. The week I went to interview her, she had just been told that the IT folks could not recover the over 30 years of research she had been saving on the departmental server when the hard drive failed. And when I say 30 years of research, I mean all her papers, her data, her codebooks, her scripts. Everything. Gone. She did not have this research anywhere else because she was under the assumption that it was being backed up. On that last point, planning for data management tasks at the beginning of the research process is a lot less time consuming than doing the data forensics after the research project is over, 5, 10, 15 years down the road.
  14. Alright, so why am I including libraries in this? Earlier this year the Digital Preservation Network (or DPN) was formed and the UW became a member. While not focused solely on data, its mission is to “ensure long-term preservation of the complete digital scholarly record”. Not just ejournal articles and ebooks, the COMPLETE digital scholarly record, and data certainly is part of that. And if you’re wondering what this has to do with libraries, look at that last part of the mission statement again. Take out “digital” and isn’t that why academic libraries were born?
  15. Data is recognized as a valuable scholarly output. The NSF made that even more explicit in October of last year when it made this change to its Proposal & Award Policies & Procedures Guide. If libraries don’t step up to the plate and provide data management support, everybody is going to try to figure out a way to do it themselves that meets their individual needs. To put it in perspective, imagine if every department on campus was maintaining the books and journals in their own subject areas. This is what Libraries are supposed to do… it’s why we’re here. And now it’s time to take those skills librarians have always used to do those same things we’ve always done for traditional scholarly outputs and adapt them to meet scholarly data needs. Skills like how to organize information, metadata creation, providing access to information. Reference skills in particular are key: ability to liaise, to communicate across disciplines, to refer, consult, to teach. Off my librarian soapbox… for now.
  16. There has been a lot of work done in this area over the last several years. In order to get to the next section, about blazing your own trail, I thought it might be helpful to look at what’s already been done. There have been several data management needs assessments, there are some existing programs to look at, and a lot of useful tools have been developed to help support data mgmt needs.
  17. In my former life, I was an assessment librarian so it does my heart good to see so many folks out there that have been doing needs assessment for data management. I’ve listed a few of my favorites here. I’m extremely impressed with what Purdue has done with their Data Curation Profiles and they have now created a directory of profiles from not only Purdue, but profiles submitted by other institutions, as well. I’ve mentioned that we did a survey & interviews recently, though we haven’t yet published our results. I did get to present our preliminary findings at a conference recently with Georgia Tech, Cornell & Purdue and though there were differences in our methodologies, populations and findings, there are some needs that keep coming up across multiple assessments.
  18. Wide variety of data types, wide variety of file sizes
  19. Wide variety of data types, wide variety of file sizes
  20. It is centered in the Research Department of the Purdue University Libraries. D2C2 comprises four core researchers who work closely with subject specialist liaisons in discipline areas throughout the Libraries.
  21. 3 FTE who work with subject librarians
  22. DataUp: an open source tool helping researchers document, manage, and archive their tabular data; it operates within the scientist’s workflow and integrates with Microsoft® Excel. DataBib: a tool for helping people identify and locate online repositories of research data. DMVitals: rates the current state of the researcher’s data management practices; the system compares the information collected during the data interview process with data management best practice statements, giving a framework for comparing and improving departmental data management practices.
  23. Alright, so we’ve talked about why data management is important and what’s been done in the area so far. Now let’s walk forward on how to provide support here at MSU. Let’s start with your strategic plan, because you already have objectives listed there where parts of a data services program would fit in nicely.
  24. Develop a separate RDM strategic plan. I won’t go into the whole strategic planning process… there are several ways to go about it. UW Libraries uses the Balanced Scorecard system for its strategic plan. The Data Services Team and I have been working on a logic model to help us develop our programmatic strategic plan. Here are a few things to consider.
  25. You already have some starting points. Look at the MSU strategic plan. Data management isn’t just important for current researchers, but for future researchers as well. At UW, we are developing data management learning opportunities for librarians, faculty & students. Consider the integration of data literacy into grad research methods courses. D1.2 specifically mentions measuring achievement in this objective through peer-reviewed publications and journal citations. I would suggest including in here alternative metrics such as data set downloads and citations, and reuse of existing datasets for new research. D2: It sounds like you are already on your way with your recent release of the IR ScholarWorks. If so desired, you can also use your IR to support data management by allowing for the deposit of data sets.
  26. Now I’m spending the rest of my day after this presentation talking with different groups on campus so I can’t even begin to make any specific recommendations, but here are a few ideas. Some possible roles for campus IT. When I say active data storage, I’m talking about storage during the phase of research where data is being actively collected, accessed, manipulated and shared among collaborators, as opposed to the final version of a data set that is preserved for future reuse.
  27. Here are a few ideas for Office of Research. At UW we’re working with our Office of Sponsored Programs on that last bullet point. In a recent meeting, we talked about looking into the feasibility of coordination between my shop and OSP when a researcher is submitting a grant proposal to a funding agency that requires a DMP.
  28. I’ve already mentioned how librarians have certain skillsets that are conducive to data management support. Here is just a smattering of possible services they can provide. At the UW we provide all of these, though not at as high a level as I would like, but we’re working on that. And that’s something to keep in mind, as well. You don’t have to come out of the gate with everything polished. We sure didn’t. When the NSF announced it was enforcing the DMP mandate, I threw up a quick and dirty LibGuide on DMPs. The next year I received a Friends of the Libraries grant to develop a more robust data management guide. It’s better, but it’s still not the site I want it to be.
  29. Consider what can be done at a broad university level, as well, not just by individual groups on campus. Here are a few suggestions on that front. In short, research data management services work well with the saying “It takes a village.” There are lots of parts to be played and there are some units more suited for fulfilling certain roles than others. There are many things that can be done to support data management. Some are low hanging fruit, some you might need a stepladder for. The key is to do something. Because doing nothing really isn’t an option.