SlideShare ist ein Scribd-Unternehmen logo
1 von 12
Downloaden Sie, um offline zu lesen
Towards A Data Driven Understanding of Research Data
September 3, 2015
Montana State University, Research Council
Jerry Sheehan
Montana State University
Chief Information Officer
jsheehan@montana.edu
The “Consumerization” of Research Data
Trend 1 Costs and Capacity
• A “Consumer Effect Has” Pushed
Prices Down While Increasing
Performance.
• Users Can Easily Buy More Storage
Than They Need.
• There are No Enterprise Strategies
for Research Data Discovery.
• No explicit way to inventory
• Instruments have “bursty” behavior
when the move data on the
network
Montana State University-Information Technology Center
“Commodity” Data Laboratory Equipment @ Montana State
Device Data Generation Per Run
Illumina Genomic Sequence .5Tb to 1Tb per run
Confocal Microscope 50-100Gb per run
Transmission Electron Microcope 10-20Gb per run
Montana State University-Information Technology Center
Research Data Census was a Three Way Institutional Partnership
Information Technology Center
University Library Vice President for Research &
Economic Development
Montana State University-Information Technology Center
Response Rates and Demographics
Montana State University-Information Technology Center
What Types of Research Data Do You Have?
Montana State University-Information Technology Center
How Do You Store Your Data?
Montana State University-Information Technology Center
How Large is Your Research Data?
Montana State University-Information Technology Center
Who Do You Share Your Data With and When?
Montana State University-Information Technology Center
Statistically Significant Findings
Montana State University-Information Technology Center
•Researchers who share their data, regardless of who they share it with (colleagues, students, or non-MSU
researchers) also tend to download data from other sources or repositories (78 percent of people sharing their
data also download data, versus 37 percent of people not sharing their data; p-value: 1.67x10-7
).
•Researchers with large research data tend to download data from other sources or repositories (90 percent of
people with data sets above one terabyte also download data, versus 42 percent for people with data sets
below 10 Gb; p-value: 1.58x10-5
).
•Researchers who back up their data also tend to annotate it (55 percent of people who back up their data
also annotate it, versus 22 percent of people who don't back up their data; p-value: 5x10-3
).
•Researchers with large research data tend to annotate it (62 percent of people with data sets above one
terabyte also annotate their data, versus 39 percent of people with data sets below 10 Gb; p-value: 0.024).
•Researchers interested in learning more about data infrastructure and services who do not back up their data
cite technical barriers as their main reason for not doing so (p-value: 0.014).
Qualitative Interview Findings
Montana State University-Information Technology Center
•Researchers don’t usually describe their data by size, although many know the exact size of their data. Instead,
their standard practice is to describe how they transfer the file (via email, placed on hard drives, put in cloud
services, etc.
•Researchers' sense of when and how data is disseminated and shared varied widely.
•There is no common definition of “big data”. Definitions change between disciplines, researchers build “bigger
data” by aggregating many small research results.
•Without exception, interviewees described their research practices as involving collaboration with others, both
inside and outside the institution.
•All researchers responded positively when asked if they would engage MSU Library services that focus on data
set annotation and metadata markup, assistance with deposit in relevant data repositories, and educational
programs and training on campus IT resources.
Impacts of the the RDC
Montana State University-Information Technology Center
• Creation of a multi-stakeholder proposal ($500K) to the National Science Foundation for investment
in a science network for the Bozeman campus. PI: Jerry Sheehan, Co-PIs: Kenning Arlitsch, Ben
Poulter, Phil Stewart, and Mark Young.
• Input from the Research Data Census and the NSF Proposal is Driving FY16 Capital Investments for
Campus.
• New Collaboration between ITC and the Library to Bundle A Set of Data Services and Infrastructure
for the Montana State University Research Community.
• Formal Publication of Survey Results in On-Line Educause Review (Sept/Oct 2015).
• Modification of the Survey Instrument, Adoption of Instrument by Other MSU Campuses, and
Sharing of Instrument with Higher Education Community.

Weitere ähnliche Inhalte

Was ist angesagt?

Research Metadata Mechanics - Simon Porter
Research Metadata Mechanics - Simon PorterResearch Metadata Mechanics - Simon Porter
Research Metadata Mechanics - Simon PorterCASRAI
 
Helping Faculty Help Themselves: Open Access and Data Management Consulting A...
Helping Faculty Help Themselves: Open Access and Data Management Consulting A...Helping Faculty Help Themselves: Open Access and Data Management Consulting A...
Helping Faculty Help Themselves: Open Access and Data Management Consulting A...Spencer Keralis
 
Meeting Federal Research Requirements
Meeting Federal Research RequirementsMeeting Federal Research Requirements
Meeting Federal Research RequirementsICPSR
 
From Data Sharing to Data Stewardship
From Data Sharing to Data StewardshipFrom Data Sharing to Data Stewardship
From Data Sharing to Data StewardshipICPSR
 
What to do about data? An overview of guidelines and policies for dataset co...
What to do about data?  An overview of guidelines and policies for dataset co...What to do about data?  An overview of guidelines and policies for dataset co...
What to do about data? An overview of guidelines and policies for dataset co...Sarah Young
 
Library resources and services for grant development
Library resources and services for grant developmentLibrary resources and services for grant development
Library resources and services for grant developmentrds-wayne-edu
 
RDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service ExperiencesRDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service ExperiencesASIS&T
 
Building and providing data management services a framework for everyone!
Building and providing data management services  a framework for everyone!Building and providing data management services  a framework for everyone!
Building and providing data management services a framework for everyone!Renaine Julian
 
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...ASIS&T
 
RDAP14: Emerging role of UC Libraries in research data management education
RDAP14: Emerging role of UC Libraries in research data management educationRDAP14: Emerging role of UC Libraries in research data management education
RDAP14: Emerging role of UC Libraries in research data management educationASIS&T
 
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)ASIS&T
 
Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...
Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...
Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...ICPSR
 

Was ist angesagt? (20)

RDAP 033111
RDAP 033111RDAP 033111
RDAP 033111
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 
Research Metadata Mechanics - Simon Porter
Research Metadata Mechanics - Simon PorterResearch Metadata Mechanics - Simon Porter
Research Metadata Mechanics - Simon Porter
 
Open Access as a Means to Produce High Quality Data
Open Access as a Means to Produce High Quality DataOpen Access as a Means to Produce High Quality Data
Open Access as a Means to Produce High Quality Data
 
Helping Faculty Help Themselves: Open Access and Data Management Consulting A...
Helping Faculty Help Themselves: Open Access and Data Management Consulting A...Helping Faculty Help Themselves: Open Access and Data Management Consulting A...
Helping Faculty Help Themselves: Open Access and Data Management Consulting A...
 
Meeting Federal Research Requirements
Meeting Federal Research RequirementsMeeting Federal Research Requirements
Meeting Federal Research Requirements
 
From Data Sharing to Data Stewardship
From Data Sharing to Data StewardshipFrom Data Sharing to Data Stewardship
From Data Sharing to Data Stewardship
 
La ricerca scientifica nell'era dei Big Data - Sabina Leonelli
La ricerca scientifica nell'era dei Big Data - Sabina LeonelliLa ricerca scientifica nell'era dei Big Data - Sabina Leonelli
La ricerca scientifica nell'era dei Big Data - Sabina Leonelli
 
What to do about data? An overview of guidelines and policies for dataset co...
What to do about data?  An overview of guidelines and policies for dataset co...What to do about data?  An overview of guidelines and policies for dataset co...
What to do about data? An overview of guidelines and policies for dataset co...
 
Library resources and services for grant development
Library resources and services for grant developmentLibrary resources and services for grant development
Library resources and services for grant development
 
RDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service ExperiencesRDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service Experiences
 
Building and providing data management services a framework for everyone!
Building and providing data management services  a framework for everyone!Building and providing data management services  a framework for everyone!
Building and providing data management services a framework for everyone!
 
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDAP14: Emerging role of UC Libraries in research data management education
RDAP14: Emerging role of UC Libraries in research data management educationRDAP14: Emerging role of UC Libraries in research data management education
RDAP14: Emerging role of UC Libraries in research data management education
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
 
Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"
 
Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...
Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...
Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...
 
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
 

Andere mochten auch

Mte 533 differentiating instruction
Mte 533 differentiating instructionMte 533 differentiating instruction
Mte 533 differentiating instructionarbaker12
 
Occupational safety
Occupational safetyOccupational safety
Occupational safetyschulaes
 
презентация Ver.1.4
презентация Ver.1.4презентация Ver.1.4
презентация Ver.1.4chekina1990oksana
 
High Performance Cyberinfrastructure and Data Services
High Performance Cyberinfrastructure and Data ServicesHigh Performance Cyberinfrastructure and Data Services
High Performance Cyberinfrastructure and Data ServicesJerry Sheehan
 
All Copy Products Grand Junction
All Copy Products Grand JunctionAll Copy Products Grand Junction
All Copy Products Grand JunctionChris Williams
 
2015 TCS New York City Marathon Media Guide
2015 TCS New York City Marathon Media Guide2015 TCS New York City Marathon Media Guide
2015 TCS New York City Marathon Media Guidermolke
 
презентация сп4
презентация сп4презентация сп4
презентация сп4chekina1990oksana
 
Herramientas web 2
Herramientas web 2Herramientas web 2
Herramientas web 2Danny Pineda
 
Generalidades del turismo
Generalidades del turismoGeneralidades del turismo
Generalidades del turismoCielo Gris
 
Voici le printemps ! Conseils pour le nettoyage de printemps
Voici le printemps ! Conseils pour le nettoyage de printempsVoici le printemps ! Conseils pour le nettoyage de printemps
Voici le printemps ! Conseils pour le nettoyage de printempsWagenverkopen
 
Les voitures les moins fiables – quelles voitures éviter vendredi le 13 ?
Les voitures les moins fiables – quelles voitures éviter vendredi le 13 ?Les voitures les moins fiables – quelles voitures éviter vendredi le 13 ?
Les voitures les moins fiables – quelles voitures éviter vendredi le 13 ?Wagenverkopen
 

Andere mochten auch (16)

Mte 533 differentiating instruction
Mte 533 differentiating instructionMte 533 differentiating instruction
Mte 533 differentiating instruction
 
Occupational safety
Occupational safetyOccupational safety
Occupational safety
 
Welcome to the dsc
Welcome to the dscWelcome to the dsc
Welcome to the dsc
 
Munawla Profile
Munawla ProfileMunawla Profile
Munawla Profile
 
Draft review process
Draft review processDraft review process
Draft review process
 
презентация Ver.1.4
презентация Ver.1.4презентация Ver.1.4
презентация Ver.1.4
 
High Performance Cyberinfrastructure and Data Services
High Performance Cyberinfrastructure and Data ServicesHigh Performance Cyberinfrastructure and Data Services
High Performance Cyberinfrastructure and Data Services
 
JWEF
JWEFJWEF
JWEF
 
All Copy Products Grand Junction
All Copy Products Grand JunctionAll Copy Products Grand Junction
All Copy Products Grand Junction
 
2015 TCS New York City Marathon Media Guide
2015 TCS New York City Marathon Media Guide2015 TCS New York City Marathon Media Guide
2015 TCS New York City Marathon Media Guide
 
презентация сп4
презентация сп4презентация сп4
презентация сп4
 
Herramientas web 2
Herramientas web 2Herramientas web 2
Herramientas web 2
 
Generalidades del turismo
Generalidades del turismoGeneralidades del turismo
Generalidades del turismo
 
Voici le printemps ! Conseils pour le nettoyage de printemps
Voici le printemps ! Conseils pour le nettoyage de printempsVoici le printemps ! Conseils pour le nettoyage de printemps
Voici le printemps ! Conseils pour le nettoyage de printemps
 
Les voitures les moins fiables – quelles voitures éviter vendredi le 13 ?
Les voitures les moins fiables – quelles voitures éviter vendredi le 13 ?Les voitures les moins fiables – quelles voitures éviter vendredi le 13 ?
Les voitures les moins fiables – quelles voitures éviter vendredi le 13 ?
 
Wireless Communication Generations
Wireless Communication GenerationsWireless Communication Generations
Wireless Communication Generations
 

Ähnlich wie Research Data Census

Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data SciencePhilip Bourne
 
Magle data curation in libraries
Magle data curation in librariesMagle data curation in libraries
Magle data curation in librariesC. Tobin Magle
 
Library Analytics and Metrics Project
Library Analytics and Metrics Project Library Analytics and Metrics Project
Library Analytics and Metrics Project Ben Showers
 
Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachMegan O'Donnell
 
Supporting the NRP with a Lean CI Staff
Supporting the NRP with a Lean CI StaffSupporting the NRP with a Lean CI Staff
Supporting the NRP with a Lean CI StaffJerry Sheehan
 
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...Jerry Sheehan
 
PSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical ResearchPSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical ResearchPhilip Bourne
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?Philip Bourne
 
Research Data Management Guidance overview
Research Data Management Guidance overviewResearch Data Management Guidance overview
Research Data Management Guidance overviewAaron Collie
 
RDMG Service Overview
RDMG Service OverviewRDMG Service Overview
RDMG Service OverviewAaron Collie
 
Research Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the ChallengeResearch Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the ChallengeSpencer Keralis
 
The UVA School of Data Science
The UVA School of Data ScienceThe UVA School of Data Science
The UVA School of Data SciencePhilip Bourne
 
Data Sharing & Data Citation
Data Sharing & Data CitationData Sharing & Data Citation
Data Sharing & Data CitationMicah Altman
 
Digital Resources for Open Science
Digital Resources for Open ScienceDigital Resources for Open Science
Digital Resources for Open ScienceMartin Donnelly
 
Publishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsPublishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsARDC
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott LibraryRebekah Cummings
 
Data Management for Postgraduate students by Lynn Woolfrey
Data Management for Postgraduate students by Lynn WoolfreyData Management for Postgraduate students by Lynn Woolfrey
Data Management for Postgraduate students by Lynn Woolfreypvhead123
 

Ähnlich wie Research Data Census (20)

Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
Magle data curation in libraries
Magle data curation in librariesMagle data curation in libraries
Magle data curation in libraries
 
McGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and ScalingMcGeary Data Curation Network: Developing and Scaling
McGeary Data Curation Network: Developing and Scaling
 
Library Analytics and Metrics Project
Library Analytics and Metrics Project Library Analytics and Metrics Project
Library Analytics and Metrics Project
 
Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approach
 
Ratan "Are we there yet? Keeping the promise of open science"
Ratan "Are we there yet?  Keeping the promise of open science"Ratan "Are we there yet?  Keeping the promise of open science"
Ratan "Are we there yet? Keeping the promise of open science"
 
Supporting the NRP with a Lean CI Staff
Supporting the NRP with a Lean CI StaffSupporting the NRP with a Lean CI Staff
Supporting the NRP with a Lean CI Staff
 
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
 
PSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical ResearchPSB2014 A Vision for Biomedical Research
PSB2014 A Vision for Biomedical Research
 
What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?What Can Happen when Genome Sciences Meets Data Sciences?
What Can Happen when Genome Sciences Meets Data Sciences?
 
Research Data Management Guidance overview
Research Data Management Guidance overviewResearch Data Management Guidance overview
Research Data Management Guidance overview
 
RDMG Service Overview
RDMG Service OverviewRDMG Service Overview
RDMG Service Overview
 
Research Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the ChallengeResearch Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the Challenge
 
The UVA School of Data Science
The UVA School of Data ScienceThe UVA School of Data Science
The UVA School of Data Science
 
Data Sharing & Data Citation
Data Sharing & Data CitationData Sharing & Data Citation
Data Sharing & Data Citation
 
Digital Resources for Open Science
Digital Resources for Open ScienceDigital Resources for Open Science
Digital Resources for Open Science
 
Publishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsPublishing perspectives on data management & future directions
Publishing perspectives on data management & future directions
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott Library
 
Data Management for Postgraduate students by Lynn Woolfrey
Data Management for Postgraduate students by Lynn WoolfreyData Management for Postgraduate students by Lynn Woolfrey
Data Management for Postgraduate students by Lynn Woolfrey
 

Mehr von Jerry Sheehan

IT Town Hall Montana State
IT Town Hall Montana StateIT Town Hall Montana State
IT Town Hall Montana StateJerry Sheehan
 
Scaling Approaches to the National Research Platform
Scaling Approaches to the National Research PlatformScaling Approaches to the National Research Platform
Scaling Approaches to the National Research PlatformJerry Sheehan
 
Performance Evaluations for UIT
Performance Evaluations for UITPerformance Evaluations for UIT
Performance Evaluations for UITJerry Sheehan
 
Montana State University's Bridger: A Science Driven Network Cyberinfrastruc...
Montana State University's Bridger:  A Science Driven Network Cyberinfrastruc...Montana State University's Bridger:  A Science Driven Network Cyberinfrastruc...
Montana State University's Bridger: A Science Driven Network Cyberinfrastruc...Jerry Sheehan
 
Technology, Complexity & Change: Creative Frictions of the Present
Technology, Complexity & Change:  Creative Frictions of the PresentTechnology, Complexity & Change:  Creative Frictions of the Present
Technology, Complexity & Change: Creative Frictions of the PresentJerry Sheehan
 
Research CI @ Montana State
Research CI @ Montana StateResearch CI @ Montana State
Research CI @ Montana StateJerry Sheehan
 

Mehr von Jerry Sheehan (7)

IT Town Hall Montana State
IT Town Hall Montana StateIT Town Hall Montana State
IT Town Hall Montana State
 
Scaling Approaches to the National Research Platform
Scaling Approaches to the National Research PlatformScaling Approaches to the National Research Platform
Scaling Approaches to the National Research Platform
 
Performance Evaluations for UIT
Performance Evaluations for UITPerformance Evaluations for UIT
Performance Evaluations for UIT
 
Montana State University's Bridger: A Science Driven Network Cyberinfrastruc...
Montana State University's Bridger:  A Science Driven Network Cyberinfrastruc...Montana State University's Bridger:  A Science Driven Network Cyberinfrastruc...
Montana State University's Bridger: A Science Driven Network Cyberinfrastruc...
 
Technology, Complexity & Change: Creative Frictions of the Present
Technology, Complexity & Change:  Creative Frictions of the PresentTechnology, Complexity & Change:  Creative Frictions of the Present
Technology, Complexity & Change: Creative Frictions of the Present
 
Research CI @ Montana State
Research CI @ Montana StateResearch CI @ Montana State
Research CI @ Montana State
 
Townhalloct2015
Townhalloct2015Townhalloct2015
Townhalloct2015
 

Kürzlich hochgeladen

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Kürzlich hochgeladen (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Research Data Census

  • 1. Towards A Data Driven Understanding of Research Data September 3, 2015 Montana State University, Research Council Jerry Sheehan Montana State University Chief Information Officer jsheehan@montana.edu
  • 2. The “Consumerization” of Research Data Trend 1 Costs and Capacity • A “Consumer Effect Has” Pushed Prices Down While Increasing Performance. • Users Can Easily Buy More Storage Than They Need. • There are No Enterprise Strategies for Research Data Discovery. • No explicit way to inventory • Instruments have “bursty” behavior when the move data on the network Montana State University-Information Technology Center
  • 3. “Commodity” Data Laboratory Equipment @ Montana State Device Data Generation Per Run Illumina Genomic Sequence .5Tb to 1Tb per run Confocal Microscope 50-100Gb per run Transmission Electron Microcope 10-20Gb per run Montana State University-Information Technology Center
  • 4. Research Data Census was a Three Way Institutional Partnership Information Technology Center University Library Vice President for Research & Economic Development Montana State University-Information Technology Center
  • 5. Response Rates and Demographics Montana State University-Information Technology Center
  • 6. What Types of Research Data Do You Have? Montana State University-Information Technology Center
  • 7. How Do You Store Your Data? Montana State University-Information Technology Center
  • 8. How Large is Your Research Data? Montana State University-Information Technology Center
  • 9. Who Do You Share Your Data With and When? Montana State University-Information Technology Center
  • 10. Statistically Significant Findings Montana State University-Information Technology Center •Researchers who share their data, regardless of who they share it with (colleagues, students, or non-MSU researchers) also tend to download data from other sources or repositories (78 percent of people sharing their data also download data, versus 37 percent of people not sharing their data; p-value: 1.67x10-7 ). •Researchers with large research data tend to download data from other sources or repositories (90 percent of people with data sets above one terabyte also download data, versus 42 percent for people with data sets below 10 Gb; p-value: 1.58x10-5 ). •Researchers who back up their data also tend to annotate it (55 percent of people who back up their data also annotate it, versus 22 percent of people who don't back up their data; p-value: 5x10-3 ). •Researchers with large research data tend to annotate it (62 percent of people with data sets above one terabyte also annotate their data, versus 39 percent of people with data sets below 10 Gb; p-value: 0.024). •Researchers interested in learning more about data infrastructure and services who do not back up their data cite technical barriers as their main reason for not doing so (p-value: 0.014).
  • 11. Qualitative Interview Findings Montana State University-Information Technology Center •Researchers don’t usually describe their data by size, although many know the exact size of their data. Instead, their standard practice is to describe how they transfer the file (via email, placed on hard drives, put in cloud services, etc. •Researchers' sense of when and how data is disseminated and shared varied widely. •There is no common definition of “big data”. Definitions change between disciplines, researchers build “bigger data” by aggregating many small research results. •Without exception, interviewees described their research practices as involving collaboration with others, both inside and outside the institution. •All researchers responded positively when asked if they would engage MSU Library services that focus on data set annotation and metadata markup, assistance with deposit in relevant data repositories, and educational programs and training on campus IT resources.
  • 12. Impacts of the the RDC Montana State University-Information Technology Center • Creation of a multi-stakeholder proposal ($500K) to the National Science Foundation for investment in a science network for the Bozeman campus. PI: Jerry Sheehan, Co-PIs: Kenning Arlitsch, Ben Poulter, Phil Stewart, and Mark Young. • Input from the Research Data Census and the NSF Proposal is Driving FY16 Capital Investments for Campus. • New Collaboration between ITC and the Library to Bundle A Set of Data Services and Infrastructure for the Montana State University Research Community. • Formal Publication of Survey Results in On-Line Educause Review (Sept/Oct 2015). • Modification of the Survey Instrument, Adoption of Instrument by Other MSU Campuses, and Sharing of Instrument with Higher Education Community.