Sharing Data-Rich Research
Through Repository Layering
Stephen Abrams
California Digital Library
Angela Rizk-Jackson
Julia Kochi
University of California, San Francisco
Noah Wittman
University of California, Berkeley
Why is data curation important?
• Accelerating scientific progress
• Enabling appropriate scrutiny and verification of results
• Promoting integrity and debate
• Facilitating new collaborations
• Avoiding needless duplication of effort
• Increasingly, complying with institutional policies, publication requirements, and funder mandates
Cf. Whyte and Tedds (2011), “Making the case for research data management,” DCC briefing paper, www.dcc.ac.uk/resources/briefing-papers/making-case-rdm
The library’s role
• A continuation of its long-standing mission and practice to connect patrons with content of interest in meaningful ways across barriers of space and time
Cf. Tenopir et al. (2012), “Academic librarians and research data services: Preparation and attitudes,” 78th IFLA General Conference and Assembly, Helsinki, conference.ifla.org/past/ifla78/116-tenopir-en.pdf
• Offering solutions that enhance the natural points of alignment between the scholarly research and information lifecycles
[Figure: the scholarly lifecycle (Research) set alongside the information lifecycle (Curation). Stage labels shown: Publish, Reuse, Share, Create, Discover, Collect, Preserve, Access.]
Merritt
• Curation repository available to the UC community and external partners
• Preservation and access
• Content agnostic, model free
• Highly decentralized micro-services architecture
Cf. Abrams, Cruse, Kunze, and Minor (2011), “Curation micro-services: A pipeline metaphor for repositories,” Journal of Digital Information 12(2), journals.tdl.org/jodi/article/view/1605
• 26 curatorial units
• 271 collections
• 325,000 objects
• 450,000 versions
• 4,500,000 files
• 13 TB
www.cdlib.org/uc3/merritt
merritt.cdlib.org
Merritt
[Figure: Merritt architecture diagram. Labeled components: UI/API, LDAP, load balancers, Ingest, Storage broker, storage nodes, ONEShare UNM storage node, Inventory, Fixity, message queue, RDBMS, No-SQL, user agent, SAN, SDSC cloud, EZID, DataCite, IDF, DataONE member node, DataONE coordinating node, Web of Knowledge, Primo.]
(Some) issues to address
• Scale
  • Individual objects ranging from 0 to 47,000 files
  • Individual files ranging from 0 to 14 GB
• Maintaining control
  • Concern over potential loss of control over dissemination and use of data
• User experience
  • Switch from organizational to individual interaction
Augment repository function by composition (when possible) and addition (when necessary)
• Loosely coupled integration with external community-supported systems and services
Image credits: www.flickr.com/photos/vixon/116447718, www.flickr.com/photos/traftery/4319529821, www.flickr.com/photos/32195273@N05/51076852642
Scale
• Avoiding client timeout
  • ≤ 2 GB: File-based → stream-based AIP-to-DIP processing
  • > 2 GB: Asynchronous delivery
    • Email notification with personalized, time-limited URL
• Streamlined storage provisioning
  • SDSC cloud, cloud.sdsc.edu
Image credits: www.kevatron.co.uk/converting-8-24-bit-samples-in-coreaudio-on-ios, www.flickr.com/photos/paulbhartzog/680749585
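The delivery rule on this slide can be sketched as follows. This is a minimal illustration, not Merritt's implementation: the function names, the signing key, the URL layout, and the choice of an HMAC signature are all assumptions made for the example. The slide only states the 2 GB threshold and that large objects are delivered through a personalized, time-limited URL sent by email.

```python
import hashlib
import hmac
from urllib.parse import parse_qs, quote, unquote, urlencode, urlparse

SYNC_LIMIT_BYTES = 2 * 1024**3   # 2 GB threshold from the slide; binary units are an assumption
SECRET_KEY = b"change-me"        # hypothetical server-side signing key


def choose_delivery(size_bytes: int) -> str:
    """Stream small objects in the request; deliver large ones asynchronously."""
    return "stream" if size_bytes <= SYNC_LIMIT_BYTES else "async"


def _sign(object_id: str, user: str, expires_at: int) -> str:
    """HMAC over the fields that personalize and time-limit the URL."""
    message = f"{object_id}|{user}|{expires_at}".encode("utf-8")
    return hmac.new(SECRET_KEY, message, hashlib.sha256).hexdigest()


def signed_url(base_url: str, object_id: str, user: str, expires_at: int) -> str:
    """Build the URL that would be sent in the notification email."""
    query = urlencode({
        "user": user,
        "expires": expires_at,
        "sig": _sign(object_id, user, expires_at),
    })
    return f"{base_url}/{quote(object_id, safe='')}?{query}"


def verify_url(url: str, now: int) -> bool:
    """Accept the URL only if the signature matches and it has not expired."""
    parsed = urlparse(url)
    object_id = unquote(parsed.path.rsplit("/", 1)[-1])
    query = parse_qs(parsed.query)
    try:
        user = query["user"][0]
        expires_at = int(query["expires"][0])
        signature = query["sig"][0]
    except (KeyError, ValueError):
        return False
    expected = _sign(object_id, user, expires_at)
    same = hmac.compare_digest(expected.encode("utf-8"), signature.encode("utf-8"))
    return same and now <= expires_at
```

Because the user and expiry time are part of the signed message, changing either value in the URL invalidates the signature, which is one simple way to make a link both personalized and time-limited.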
Control
• Data use agreements (DUAs)
  • Explicit assertion of license requirements and terms of use
  • Curatorial and consumer notification of acceptance
Cf. Brazhnik and Jones (2007), “Anatomy of data integration,” Journal of Biomedical Informatics 40(3): 252-69, doi:10.1016/j.jbi.2006.09.001
From: no-reply-merritt@ucop.edu
Subject: Merritt DUA acceptance
Name: Stephen Abrams
Affiliation: California Digital Library
Collection: UCSF DataShare
Object: Frontotemporal Lobar Degeneration (FTLD)
Date: 2013-05-31 09:50:34 PDT
Terms of use: As part of this agreement, Consumer submits to the following
statements:
(1) I will receive access to de-identified data and will not attempt to establish the
identity of any of the study subjects.
(2) I will share these data only with my immediate co-workers, and I will not transfer
these data to other research groups. I understand that these data are available to
other research groups through the process by which I obtain them.
(3) I will require anyone in my group who utilizes these data, or anyone with whom I
share these data to comply with this data use agreement
...
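A notification like the sample above could be assembled along these lines. This is a sketch under stated assumptions: the `DuaAcceptance` class, the `build_notification` function, and the recipient address are hypothetical names introduced for illustration. Only the sender, subject, and field labels come from the sample message.

```python
from dataclasses import dataclass
from datetime import datetime
from email.message import EmailMessage


@dataclass
class DuaAcceptance:
    """Fields shown in the sample notification; the class itself is hypothetical."""
    name: str
    affiliation: str
    collection: str
    object_title: str
    accepted_at: datetime
    terms_of_use: str


def build_notification(acceptance: DuaAcceptance, recipient: str) -> EmailMessage:
    """Compose a plain-text acceptance notice for a curator or consumer."""
    message = EmailMessage()
    message["From"] = "no-reply-merritt@ucop.edu"   # sender shown in the sample
    message["To"] = recipient
    message["Subject"] = "Merritt DUA acceptance"
    message.set_content("\n".join([
        f"Name: {acceptance.name}",
        f"Affiliation: {acceptance.affiliation}",
        f"Collection: {acceptance.collection}",
        f"Object: {acceptance.object_title}",
        f"Date: {acceptance.accepted_at.isoformat(sep=' ')}",
        f"Terms of use: {acceptance.terms_of_use}",
    ]))
    return message
```

Building the same message for both the curator and the consumer would give each party a matching record of what was accepted and when.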
User experience
• Due to its open eligibility policy, Merritt will always provide a more generic UX than special-purpose or disciplinary systems
• Shifting user roles, shifting expectations
  • Institutional → individual researcher
  • Behavioral expectations set by the commercial/mobile web
• Integration with extant services that better provide the desired UX
  • DataShare
  • Research Hub
DataShare
• “The goal of the DataShare project is to catalyze widespread sharing of scientific research data” (datashare.ucsf.edu)
  • UCSF Clinical and Translational Science Institute, ctsi.ucsf.edu
  • UCSF Library, www.library.ucsf.edu
  • UCSF Center for Imaging of Neurodegenerative Disease, www.radiology.ucsf.edu/cind
• Architecture
  • DataShare submission client (Ruby/Rails)
  • Merritt curation repository
  • DataShare discovery portal (XTF/Java)
DataShare
• Prepare
  • Best practice advice
• Describe
  • Schema-directed metadata editor
  • DataCite schema, schema.datacite.org
• Upload
  • File browse or drag-n-drop
• Curate
  • Manage datasets
• Discover
  • Faceted search and browse
• Share
  • DataONE
  • DataCite
  • (soon) Primo, Web of Knowledge
  • SEO
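For the Describe step, a schema-directed editor needs to know which properties a record still lacks. The sketch below assumes a record held as a plain dictionary with hypothetical key names. It checks only the five properties that the DataCite Metadata Schema has long treated as mandatory (Identifier, Creator, Title, Publisher, PublicationYear). Later schema versions require more, so the list should be checked against the version in use.

```python
# Hypothetical dictionary keys standing in for the DataCite properties
# Identifier, Creator, Title, Publisher, and PublicationYear.
REQUIRED_PROPERTIES = ("identifier", "creators", "titles", "publisher", "publicationYear")


def missing_properties(record: dict) -> list:
    """Return the required properties that are absent or empty in the record."""
    return [name for name in REQUIRED_PROPERTIES if not record.get(name)]


if __name__ == "__main__":
    draft = {
        "identifier": "doi:10.5072/example",   # 10.5072 is a test prefix, used here as a placeholder
        "titles": ["Example dataset"],
        "creators": [],
    }
    print(missing_properties(draft))
```

An editor could use the returned list to highlight the fields that must be completed before a dataset is submitted.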
Merritt + DataShare
[Figure: the Merritt architecture diagram extended with DataShare. Added labels: DataShare upload, Collection Atom feed, XTF (xtf.cdlib.org), DataShare portal, Lucene.]
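The diagram labels a collection Atom feed alongside the DataShare portal. One plausible use of such a feed is to find entries updated since the last indexing run. The sketch below shows that step under stated assumptions: the sample feed, its identifiers and URLs, and the function name are invented for illustration, and the deck does not describe the actual feed contents or how the portal consumes them.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone

ATOM = "{http://www.w3.org/2005/Atom}"   # Atom namespace (RFC 4287)

# Invented sample feed; identifiers and URLs are placeholders.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Example collection</title>
  <entry>
    <id>urn:example:object-1</id>
    <title>First dataset</title>
    <updated>2013-05-31T09:50:34-07:00</updated>
    <link rel="alternate" href="https://repository.example.org/object-1"/>
  </entry>
  <entry>
    <id>urn:example:object-2</id>
    <title>Second dataset</title>
    <updated>2013-06-02T12:00:00-07:00</updated>
    <link rel="alternate" href="https://repository.example.org/object-2"/>
  </entry>
</feed>"""


def entries_updated_since(feed_xml: str, since: datetime) -> list:
    """Return id, title, and link of entries updated after the given time."""
    root = ET.fromstring(feed_xml)
    results = []
    for entry in root.findall(f"{ATOM}entry"):
        updated_text = entry.findtext(f"{ATOM}updated")
        if updated_text is None:
            continue  # skip entries without a timestamp
        if datetime.fromisoformat(updated_text) > since:
            link = entry.find(f"{ATOM}link")
            results.append({
                "id": entry.findtext(f"{ATOM}id"),
                "title": entry.findtext(f"{ATOM}title"),
                "href": link.get("href") if link is not None else None,
            })
    return results


if __name__ == "__main__":
    last_run = datetime(2013, 6, 1, tzinfo=timezone.utc)
    for item in entries_updated_since(SAMPLE_FEED, last_run):
        print(item["id"], item["href"])
```

Polling a feed in this way keeps the portal loosely coupled to the repository: the portal needs only the feed URL and the Atom format, not the repository's internal interfaces.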
Research Hub
• “Research Hub provides powerful tools for content management and collaboration” (hub.berkeley.edu)
• Alfresco CMS, www.alfresco.com
• 770 projects, 3,900 users
• Personal file management
• Project collaboration
• Departmental resource pooling
• Research data management
• Desktop sync, mobile app, Adobe Creative Suite
• UC Berkeley Information Services and Technology, ist.berkeley.edu
Research Hub
• Prepare
  • Acquire and arrange
• Describe
  • Schema-directed metadata editors
• Upload
  • Direct action
  • Policy-based workflow rules
  • Drag-and-drop
  • Confirmation
• Curate
  • Manage datasets
• Discover
• Share
Merritt + DataShare + Research Hub
[Figure: the Merritt architecture diagram with the DataShare components (DataShare upload, Collection Atom feed, XTF at xtf.cdlib.org, DataShare portal, Lucene) and Research Hub added.]
Next steps
• Self-service account registration
  • UCTrust and InCommon Shibboleth federations
• Additional cloud-based replication
• Outreach
• Integration with Open Context archaeological portal, opencontext.org
  • Atom-based submission
• Integration with Nuxeo, www.nuxeo.com
  • UC system-wide DAMS solution
• Integration with Islandora, islandora.ca
  • Collaboration with UCLA Library
  • Tuque API
• Integration with DPN, www.dpn.org
Sharing research through repositories
• Conform to institutional policy, publication requirements, and funder mandates
• Proactive curation of valuable research outputs
• Stable citation and access
• High-visibility publication and discovery
• Use metrics
• Repository layering as an appropriate division of labor
  • Exploiting existing capabilities already in local use
For more information
• Merritt
  www.cdlib.org/uc3/merritt
  uc3@ucop.edu
  Stephen Abrams, Patricia Cruse, Shirin Faenza, Scott Fisher, Erik Hetzner, Joshua Hubbard, Greg Janée, John Kunze, Rosalie Lack, David Loy, Mark Reyes, Joan Starr, Carly Strasser, Marisa Strong, Bhavitavya Vedula, Kenneth Weiss, Perry Willett
• DataShare
  datashare.ucsf.edu
  Geoffrey Boushey, Anirvan Chatterjee, Maninder Kahlon, Julia Kochi, Angela Rizk-Jackson, Michael Weiner
• Research Hub
  hub.berkeley.edu
  Ian Crew, Noah Wittman, Michael McCarthy (Tribloom), Patrick McGrath
www.slideshare.net/UC3/or-2013abramssharingdatarichresearch

Weitere ähnliche Inhalte

Was ist angesagt?

D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013ECNOfficer
 
A Social Content Delivery Network for Scientific Cooperation: Vision, Design...
A Social Content Delivery Network for Scientific Cooperation: Vision,  Design...A Social Content Delivery Network for Scientific Cooperation: Vision,  Design...
A Social Content Delivery Network for Scientific Cooperation: Vision, Design...Simon Caton
 
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Jenn Riley
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive trackGeorge Komatsoulis
 
Data Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumData Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumMerce Crosas
 
ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsSEAD
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)SEAD
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD
 
Dataverse, Cloud Dataverse, and DataTags
Dataverse, Cloud Dataverse, and DataTagsDataverse, Cloud Dataverse, and DataTags
Dataverse, Cloud Dataverse, and DataTagsMerce Crosas
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...Robert Grossman
 
On Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a ServiceOn Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a ServiceHong-Linh Truong
 
Open Data is not Enough (final version)
Open Data is not Enough (final version)Open Data is not Enough (final version)
Open Data is not Enough (final version)Research Data Alliance
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?Robert Grossman
 
Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015George Komatsoulis
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsSome Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsRobert Grossman
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataRobert Grossman
 

Was ist angesagt? (20)

D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013
 
A Social Content Delivery Network for Scientific Cooperation: Vision, Design...
A Social Content Delivery Network for Scientific Cooperation: Vision,  Design...A Social Content Delivery Network for Scientific Cooperation: Vision,  Design...
A Social Content Delivery Network for Scientific Cooperation: Vision, Design...
 
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive track
 
Data Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access SymposiumData Publishing at Harvard's Research Data Access Symposium
Data Publishing at Harvard's Research Data Access Symposium
 
ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and Tools
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)
 
Dataverse, Cloud Dataverse, and DataTags
Dataverse, Cloud Dataverse, and DataTagsDataverse, Cloud Dataverse, and DataTags
Dataverse, Cloud Dataverse, and DataTags
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
Digital Curation 101 - Taster
Digital Curation 101 - TasterDigital Curation 101 - Taster
Digital Curation 101 - Taster
 
Linked data in pharma R&D
Linked data in pharma R&DLinked data in pharma R&D
Linked data in pharma R&D
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
On Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a ServiceOn Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a Service
 
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
 
Open Data is not Enough (final version)
Open Data is not Enough (final version)Open Data is not Enough (final version)
Open Data is not Enough (final version)
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?
 
Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015Komatsoulis internet2 global forum 2015
Komatsoulis internet2 global forum 2015
 
Some Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data PlatformsSome Proposed Principles for Interoperating Cloud Based Data Platforms
Some Proposed Principles for Interoperating Cloud Based Data Platforms
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 

Andere mochten auch

Overcoming Obstacles to Sharing Research Data
Overcoming Obstacles to Sharing Research DataOvercoming Obstacles to Sharing Research Data
Overcoming Obstacles to Sharing Research DataBrian Hole
 
Supporting UC Research Data Management
Supporting UC Research Data ManagementSupporting UC Research Data Management
Supporting UC Research Data Managementslabrams
 
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...University of California Curation Center
 
NISO Webinar on data curation services at the CDL
NISO Webinar on data curation services at the CDLNISO Webinar on data curation services at the CDL
NISO Webinar on data curation services at the CDLCarly Strasser
 

Andere mochten auch (7)

Supporting UC Research Data Management
Supporting UC Research Data ManagementSupporting UC Research Data Management
Supporting UC Research Data Management
 
Overcoming Obstacles to Sharing Research Data
Overcoming Obstacles to Sharing Research DataOvercoming Obstacles to Sharing Research Data
Overcoming Obstacles to Sharing Research Data
 
Supporting UC Research Data Management
Supporting UC Research Data ManagementSupporting UC Research Data Management
Supporting UC Research Data Management
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
 
NISO Webinar on data curation services at the CDL
NISO Webinar on data curation services at the CDLNISO Webinar on data curation services at the CDL
NISO Webinar on data curation services at the CDL
 
What does "data publication" mean to researchers?
What does "data publication" mean to researchers?What does "data publication" mean to researchers?
What does "data publication" mean to researchers?
 

Ähnlich wie Or 2013-abrams-sharing-data-rich-research

Policy-based Data Management
Policy-based Data Management Policy-based Data Management
Policy-based Data Management Gary Wilhelm
 
How Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-useHow Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-useMatthew Vaughn
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Laurent Alquier
 
Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster LEARN Project
 
GlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening KeynoteGlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening KeynoteGlobus
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourKNOWeSCAPE2014
 
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...Kathleen Jagodnik
 
UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceLizLyon
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVEUDAT
 
Grid Computing July 2009
Grid Computing July 2009Grid Computing July 2009
Grid Computing July 2009Ian Foster
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals FederationManjulaPatel
 
Impact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationImpact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationMANENDRASINGH30
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 

Ähnlich wie Or 2013-abrams-sharing-data-rich-research (20)

Policy-based Data Management
Policy-based Data Management Policy-based Data Management
Policy-based Data Management
 
Cyberistructure
CyberistructureCyberistructure
Cyberistructure
 
How Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-useHow Cyverse.org enables scalable data discoverability and re-use
How Cyverse.org enables scalable data discoverability and re-use
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
 
Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster Research Data Management, Challenges and Tools - Per Öster
Research Data Management, Challenges and Tools - Per Öster
 
DataShare for UC Campuses
DataShare for UC CampusesDataShare for UC Campuses
DataShare for UC Campuses
 
GlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening KeynoteGlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening Keynote
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
 
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
FAIRness Assessment of the Library of Integrated Network-based Cellular Signa...
 
UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalface
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROV
 
Grid Computing July 2009
Grid Computing July 2009Grid Computing July 2009
Grid Computing July 2009
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
 
Impact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationImpact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and Education
 
SomeSlides
SomeSlidesSomeSlides
SomeSlides
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 

Mehr von University of California Curation Center

ETDs: Electronic Thesis and Dissertation Service at the University of California
ETDs: Electronic Thesis and Dissertation Service at the University of CaliforniaETDs: Electronic Thesis and Dissertation Service at the University of California
ETDs: Electronic Thesis and Dissertation Service at the University of CaliforniaUniversity of California Curation Center
 
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchThe UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchUniversity of California Curation Center
 

Mehr von University of California Curation Center (20)

ETDs: Electronic Thesis and Dissertation Service at the University of California
ETDs: Electronic Thesis and Dissertation Service at the University of CaliforniaETDs: Electronic Thesis and Dissertation Service at the University of California
ETDs: Electronic Thesis and Dissertation Service at the University of California
 
Dash UCCSC 2016
Dash UCCSC 2016Dash UCCSC 2016
Dash UCCSC 2016
 
Uc3 ucacc-2015-11-16
Uc3 ucacc-2015-11-16Uc3 ucacc-2015-11-16
Uc3 ucacc-2015-11-16
 
Dash: data sharing made easy
Dash: data sharing made easyDash: data sharing made easy
Dash: data sharing made easy
 
CDL research lifecycle
CDL research lifecycleCDL research lifecycle
CDL research lifecycle
 
Ucmp 20150407
Ucmp 20150407Ucmp 20150407
Ucmp 20150407
 
Researcher perspectives on publication and peer review of data.
Researcher perspectives on publication and peer review of data.Researcher perspectives on publication and peer review of data.
Researcher perspectives on publication and peer review of data.
 
Enhancing DMPTool: Further Streamlineing Data Mangement Planning Process
Enhancing DMPTool: Further Streamlineing Data Mangement Planning ProcessEnhancing DMPTool: Further Streamlineing Data Mangement Planning Process
Enhancing DMPTool: Further Streamlineing Data Mangement Planning Process
 
DataShare: Empowering Researcher Data Curation
DataShare: Empowering Researcher Data CurationDataShare: Empowering Researcher Data Curation
Or 2013-abrams-sharing-data-rich-research

  • 1. Sharing Data-Rich Research Through Repository Layering. Stephen Abrams (California Digital Library); Angela Rizk-Jackson and Julia Kochi (University of California, San Francisco); Noah Wittman (University of California, Berkeley)
  • 2. Why is data curation important? Accelerating scientific progress; enabling appropriate scrutiny and verification of results; promoting integrity and debate; facilitating new collaborations; avoiding needless duplication of effort; increasingly, complying with institutional policies, publication requirements, and funder mandates. Cf. Whyte and Tedds (2011), “Making the case for research data management,” DCC briefing paper, www.dcc.ac.uk/resources/briefing-papers/making-case-rdm
  • 3. The library’s role: A continuation of its long-standing mission and practice to connect patrons with content of interest in meaningful ways across barriers of space and time; offering solutions that enhance the natural points of alignment between the scholarly research and information lifecycles. Cf. Tenopir et al. (2012), “Academic librarians and research data services: Preparation and attitudes,” 78th IFLA General Conference and Assembly, Helsinki, conference.ifla.org/past/ifla78/116-tenopir-en.pdf [Figure: scholarly lifecycle (research) and information lifecycle (curation); stage labels: Create, Collect, Publish, Share, Discover, Reuse, Preserve, Access]
  • 4. Merritt: Curation repository available to the UC community and external partners; preservation and access; content agnostic, model free; highly decentralized micro-services architecture. Cf. Abrams, Cruse, Kunze, and Minor (2011), “Curation micro-services: A pipeline metaphor for repositories,” Journal of Digital Information 12(2), journals.tdl.org/jodi/article/view/1605. 26 curatorial units; 271 collections; 325,000 objects; 450,000 versions; 4,500,000 files; 13 TB. www.cdlib.org/uc3/merritt, merritt.cdlib.org
  • 5. Merritt [Architecture diagram; component labels: Storage node, Storage broker, Inventory, ONEShare UNM storage node, UI/API, LDAP, RDBMS, Fixity, User agent, Message queue, Load balancer, Ingest, EZID, No-SQL, DataCite, DataONE member node, DataONE coord’ing node, IDF, Web of Knowledge, Primo, SAN, SDSC cloud]
  • 6. (Some) issues to address: Scale (individual objects ranging from 0 to 47,000 files; individual files ranging from 0 to 14 GB); Maintaining control (concern over potential loss of control over dissemination and use of data); User experience (switch from organizational to individual interaction). Image credits: www.flickr.com/photos/vixon/116447718, www.flickr.com/photos/traftery/4319529821, www.flickr.com/photos/32195273@N05/5107685264
  • 7. (Some) issues to address: Scale (individual objects ranging from 0 to 47,000 files; individual files ranging from 0 to 14 GB); Maintaining control (concern over potential loss of control over dissemination and use of data); User experience (switch from organizational to individual interaction). Augment repository function by composition (when possible) and addition (when necessary): loosely coupled integration with external community-supported systems and services
  • 8. Scale: Avoiding client timeout (≤ 2 GB: file-based → stream-based AIP-to-DIP processing; > 2 GB: asynchronous delivery, with email notification carrying a personalized, time-limited URL); Streamlined storage provisioning (SDSC cloud, cloud.sdsc.edu). Image credits: www.kevatron.co.uk/converting-8-24-bit-samples-in-coreaudio-on-ios, www.flickr.com/photos/paulbhartzog/680749585
  • 9. Control: Data use agreements (DUAs); explicit assertion of license requirements and terms of use; curatorial and consumer notification of acceptance. Cf. Brazhnik and Jones (2007), “Anatomy of data integration,” Journal of Biomedical Informatics 40(3): 252-69, doi:10.1016/j.jbi.2006.09.001. Sample notification:
      From: no-reply-merritt@ucop.edu
      Subject: Merritt DUA acceptance
      Name: Stephen Abrams
      Affiliation: California Digital Library
      Collection: UCSF DataShare
      Object: Frontotemporal Lobar Degeneration (FTLD)
      Date: 2013-05-31 09:50:34 PDT
      Terms of use: As part of this agreement, Consumer submits to the following statements: (1) I will receive access to de-identified data and will not attempt to establish the identity of any of the study subjects. (2) I will share these data only with my immediate co-workers, and I will not transfer these data to other research groups. I understand that these data are available to other research groups through the process by which I obtain them. (3) I will require anyone in my group who utilizes these data, or anyone with whom I share these data to comply with this data use agreement ...
  • 10. User experience: Due to its open eligibility policy, Merritt will always provide a more generic UX than special-purpose or disciplinary systems; shifting user roles, shifting expectations (institutional → individual researcher; behavioral expectations set by the commercial/mobile web)
  • 11. User experience: Due to its open eligibility policy, Merritt will always provide a more generic UX than special-purpose or disciplinary systems; shifting user roles, shifting expectations (institutional → individual researcher; behavioral expectations set by the commercial web); integration with extant services that better provide the desired UX (DataShare; Research Hub)
  • 12. DataShare: “The goal of the DataShare project is to catalyze widespread sharing of scientific research data” (datashare.ucsf.edu); UCSF Clinical and Translational Science Institute (ctsi.ucsf.edu); UCSF Library (www.library.ucsf.edu); UCSF Center for Imaging of Neurodegenerative Disease (www.radiology.ucsf.edu/cind); Architecture: DataShare submission client (Ruby/Rails), Merritt curation repository, DataShare discovery portal (XTF/Java)
  • 13. DataShare: Prepare; Describe; Upload; Curate; Discover; Share
  • 14. DataShare: Prepare (best practice advice); Describe; Upload; Curate; Discover; Share
  • 15. DataShare: Prepare; Describe (schema-directed metadata editor; DataCite schema, schema.datacite.org); Upload; Curate; Discover; Share
  • 16. DataShare: Prepare; Describe; Upload (file browse or drag-n-drop); Curate; Discover; Share
  • 17. DataShare: Prepare; Describe; Upload (file browse or drag-n-drop); Curate; Discover; Share
  • 18. DataShare: Prepare; Describe; Upload; Curate (manage datasets); Discover; Share
  • 19. DataShare: Prepare; Describe; Upload; Curate; Discover (faceted search and browse); Share
  • 20. DataShare: Prepare; Describe; Upload; Curate; Discover; Share (DataONE; DataCite; (soon) Primo, Web of Knowledge; SEO)
  • 21. Merritt + DataShare [Architecture diagram as on slide 5, with added components: DataShare upload, Collection Atom feed, XTF (xtf.cdlib.org), DataShare portal, Lucene]
  • 22. Research Hub: “Research Hub provides powerful tools for content management and collaboration” (hub.berkeley.edu); Alfresco CMS (www.alfresco.com); 770 projects, 3,900 users; personal file management; project collaboration; departmental resource pooling; research data management; desktop sync, mobile app, Adobe Creative Suite; UC Berkeley Information Services and Technology (ist.berkeley.edu)
  • 23. Research Hub: Prepare (acquire and arrange); Describe; Upload; Curate; Discover; Share
  • 24. Research Hub: Prepare; Describe (schema-directed metadata editors); Upload; Curate; Discover; Share
  • 25. Research Hub: Prepare; Describe; Upload (direct action); Curate; Discover; Share
  • 26. Research Hub: Prepare; Describe; Upload (direct action); Curate; Discover; Share
  • 27. Research Hub: Prepare; Describe; Upload (policy-based workflow rules); Curate; Discover; Share
  • 28. Research Hub: Prepare; Describe; Upload (drag-and-drop); Curate; Discover; Share
  • 29. Research Hub: Prepare; Describe; Upload (confirmation); Curate; Discover; Share
  • 30. Research Hub: Prepare; Describe; Upload; Curate (manage datasets); Discover; Share
  • 31. Research Hub: Prepare; Describe; Upload; Curate; Discover; Share
  • 32. Research Hub: Prepare; Describe; Upload; Curate; Discover; Share
  • 33. Merritt + DataShare + Research Hub [Architecture diagram as on slide 21, with Research Hub added]
  • 34. Next steps: Self-service account registration (UCTrust and InCommon Shibboleth federations); additional cloud-based replication; outreach; integration with Open Context archaeological portal, opencontext.org (Atom-based submission); integration with Nuxeo, www.nuxeo.com (UC system-wide DAMS solution); integration with Islandora, islandora.ca (collaboration with UCLA Library; Tuque API); integration with DPN, www.dpn.org
  • 35. Sharing research through repositories: Conform to institutional policy, publication requirements, and funder mandates; pro-active curation of valuable research outputs; stable citation and access; high visibility publication and discovery; use metrics
  • 36. Sharing research through repositories: Conform to institutional policy, publication requirements, and funder mandates; pro-active curation of valuable research outputs; stable citation and access; high visibility publication and discovery; use metrics. Repository layering as an appropriate division of labor: exploiting existing capabilities already in local use
  • 37. For more information: Merritt (www.cdlib.org/uc3/merritt, uc3@ucop.edu): Stephen Abrams, David Loy, Patricia Cruse, Mark Reyes, Shirin Faenza, Joan Starr, Scott Fisher, Carly Strasser, Erik Hetzner, Marisa Strong, Joshua Hubbard, Bhavitavya Vedula, Greg Janée, Kenneth Weiss, John Kunze, Perry Willet, Rosalie Lack. DataShare (datashare.ucsf.edu): Geoffrey Boushey, Julia Kochi, Anirvan Chatterjee, Angela Rizk-Jackson, Maninder Kahlon, Michael Weiner. Research Hub (hub.berkeley.edu): Ian Crew, Michael McCarthy (Tribloom), Noah Wittman, Patrick McGrath. www.slideshare.net/UC3/or-2013abramssharingdatarichresearch
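
The delivery rule on slide 8 (synchronous streaming up to 2 GB, asynchronous delivery with an emailed, personalized, time-limited URL above that) can be sketched roughly as follows. This is only an illustration under stated assumptions, not Merritt's actual implementation: the function names, the base URL, the secret key, the seven-day lifetime, the reading of "2 GB" as 2 GiB, and the use of an HMAC signature to make the link personalized and time-limited are all hypothetical choices that the slides do not specify.

```python
import hashlib
import hmac
import time
import urllib.parse

# Threshold from slide 8; treating "2 GB" as 2 GiB is an assumption.
SYNC_LIMIT_BYTES = 2 * 1024**3

# Hypothetical values for illustration only; none of these come from the slides.
SECRET_KEY = b"replace-with-a-real-secret"
BASE_URL = "https://repository.example.org/download"
DEFAULT_TTL_SECONDS = 7 * 24 * 3600


def choose_delivery(size_bytes: int) -> str:
    """Return "stream" for objects within the limit, "async" for larger ones."""
    return "stream" if size_bytes <= SYNC_LIMIT_BYTES else "async"


def _sign(object_id: str, user: str, expires: int) -> str:
    """Sign the object, the recipient, and the expiry time with HMAC-SHA256."""
    message = f"{object_id}|{user}|{expires}".encode("utf-8")
    return hmac.new(SECRET_KEY, message, hashlib.sha256).hexdigest()


def make_timed_url(object_id: str, user: str, now: float,
                   ttl_seconds: int = DEFAULT_TTL_SECONDS) -> str:
    """Build a download link tied to one user that expires after ttl_seconds."""
    expires = int(now) + ttl_seconds
    query = urllib.parse.urlencode(
        {"user": user, "expires": expires, "sig": _sign(object_id, user, expires)}
    )
    return f"{BASE_URL}/{urllib.parse.quote(object_id, safe='')}?{query}"


def is_url_valid(object_id: str, user: str, expires: int,
                 signature: str, now: float) -> bool:
    """Accept the link only before expiry and only with a matching signature."""
    if now > expires:
        return False
    expected = _sign(object_id, user, expires)
    return hmac.compare_digest(expected.encode("utf-8"), signature.encode("utf-8"))


if __name__ == "__main__":
    size = 14 * 1024**3  # a 14 GiB file, the upper end of the range on slide 6
    if choose_delivery(size) == "async":
        # In a real service this link would go out in the notification email.
        print(make_timed_url("example-object", "researcher@example.org", time.time()))
```

Signing the expiry time together with the recipient means the link cannot be extended or reassigned by editing the query string, which is one plausible way to get both properties named on the slide.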

Editor's notes

  1. Copyright © 2013 by The Regents of the University of California. This work is made available under the terms of the Creative Commons Attribution-ShareAlike 3.0 license.
  2. Matija Grguric, Scale up! – Curved elements!, http://www.flickr.com/photos/32195273@N05/5107685264; Tom Rafferty, Remote control!, http://www.flickr.com/photos/67945918@N00/4319529821; Barry Egan, File rio 2006, http://www.flickr.com/photos/vixon/116447718
  3. Kevin Smith, Converting 8.24 bit samples in CoreAudio on iOS, http://www.kevatron.co.uk/converting-8-24-bit-samples-in-coreaudio-on-ios/; Paul B. Hartzog, Server room, http://www.flickr.com/photos/paulbhartzog/680749585