An update on public access activities at the National Agricultural Library and next steps, presented 11 January 2017 at the Earth Science Information Partners (ESIP) meeting in Bethesda, Maryland.
Top Rated Pune Call Girls Bhosari โ 6297143586 โ Call Me For Genuine Sex Ser...
ย
Public access to research results at USDA
1. Cynthia Parr @cydparr
US Department of Agriculture
National Agricultural Library
11 January 2017
Public access to research results
at USDA
Credit: Phenocam Swan Lake Research Farm, MN
2. The Story
โข committed to public access
โข PubAg is well along
โข Ag Data Commons in the process
of making USDA-funded data
accessible
We
are
3. The Story part 2
public access isnโt good enoughBut
Therefore
we are
1. enhancing our platform
2. establishing sound curation policies
and processes
3. promoting machine-readability and
data stories
4. seeking a sustainable business
model
5. PubAg https://pubag.nal.usda.gov
โข Launched 2014
โข Almost 50K full-text peer-reviewed articles
โข More than 1 million citations for other papers
โข Will collaborate with US Forest Service Treesearch
โข Will expand beyond Agricultural Research Service full text
โข Will cooperate with the CHORUS publisher consortium
โข Will soon launch a redesign
6.
7. Transform agriculture to deliver a 20%
increase in quality production with 20%
lower environmental impact by 2025
-- USDA Agricultural Research Service
Public access is not enough
8. Goals for USDA digital scientific data
Who What
Researchers and funders Compliance with public access
Agencies Compliance with open data
Research (data submitters) Safe, citable place for data
Research (data users) Find and use awesome data
10. DKAN http://nucivic.com/dkan/
PRO
โข Open source community
โข Drupal modules for basic
CMS functions
โข Can feed Data.gov
โข Basic metadata already
supported
CON
โข Not designed for scientific
data or scientists
โข No links to literature
โข No Digital Object
Identifiers
โข Doesnโt handle dataset
relationships
โข Metadata inadequate for
compliance checking &
re-use
11. Use all this for some data intensive research
Ag Data Commons Pilot FY 2016
โข Self-submission accounts (almost 100 now)
โข More than 240 datasets (104 harvested)
โข Distributed curation
โข Links to PubAg, tagged with NAL thesaurus terms
โข DataCite Digital Object Identifiers, ORCIDs, FundRef
โข Methods metadata, data dictionaries for re-use
โข Designed to feed Data.gov
12. 2. Sound curation policies and processes
โข Who can submit?
โข What do we accept?
โข When do we assign DOIs?
โข What embargo periods are okay?
โข How much review of metadata
and data do we do?
โข Who reviews metadata and data?
โข How should data be organized?
โข When do we offer a group a
โcollectionโ?
โข Must we host all the data?
โข What can we automate?
โข How do we make things more
machine-readable?
โข When should datasets be versioned?
โข How do we handle preservation?
โข How much and what kind of data
storage do we need?
โข How do we avoid licensing and
โownershipโ confusion?
13. Research products
Include in the Ag Data Commons (or provide links)
โข Raw data files and/or Processed data files
โข Data dictionary or Readme
Do not submit with the data
โข Manuscript
โข Figures/tables from manuscript
14. Research products
Include as resources (resource can be URL pointer)
โข Web database
โข Software
โข Source code/Scripts/Workflows
โข User manuals
Do not submit with the data
โข Presentations associated with the study
โข News articles or press releases
โข Related or cited data
19. To sum up
committed to public accessUSDA
is public access isnโt good enoughBut
Therefore
20. Acknowledgements
Cynthia.Parr@ars.usda.gov
Susan McCarthy, Ursula Pieper, Erin Antognoli, Jon
Sears, Qing Qu, Jeff Campbell, Jocelyn
McNamara, Melissa Lohrey, Don Gourley,
GovDelivery, Angry Cactus team
The PubAg team, especially Melanie Gardner
UMD: Kerry Huller, Adam Kriesberg, Meghna
Sarin, Candice Ho
Other students: Jaylen Nathwani
Editor's Notes
In the Moving Beyond Mandates: Progress Towards Public Access and What the Future Holds session
, presentations should focus onย providing an overview of progress we've made in our respective organizations in the area of public/open access to data/information and what we see in terms of next steps and/or a vision for the future of open/public data.ย
https://phenocam.sr.unh.edu/webcam/sites/arsmnswanlake1/
Is anybody familiar with this book? โHouston we have a narrativeโ by Randy Olson.
ARS grand challenge
How does data help you do this?
It will help to provide people access to publications and data BUT
It doesnโt speed up the process of helping scientists discover raw data that could be re-used in large scale analysis, or metaanalyses, or models,
it doesnโt help assess data quality or fitness for use
it doesnโt speed up integration with environmental data
Help researchers and funders demonstrate compliance with public access directives
Ensure that federal scientific data is in data.gov in compliance with open data directives
Provide researchers with a safe, citable place for their data
Help researchers find and use awesome data for future research
How are we making USDA-funded data open, discoverable, safe
to serve the ARS researchers โ place to park there data safely, or point to it where it lives in some other trusted repository
Covers things like environmental measurements from the Long Term Agroecosystem Research initiative, genomics of livestock pests, and datasets related to modeling soil erosion, etc.
Drupal
Knowledge
Archive
Network
To date
Plan to become a trusted repository and feed data to Data.gov, the US governmentโs primary inventory of open government data
We have a human readable page with some text descriptions, attached files, structured metadata
To
Systems are not yet mature, populated, linked, or sustainable. They need to be all of these things in order to support the revolutionary advances in food security science research that are needed
Planning to scale ADC up to hold huge numbers of well documented machine-readable datasets โ but so far thatโs a lot of human effort and we don't yet have the automated process we need.
For example, there are dozens if not hundreds of publicly available PDFs that have the results of performance trials of different varieties of corn and soybeans โ but almost all of that old data is not machine readable. It may or may not be linked yet to Vouchers in the National Plant Germpasm repository
We are increasing our use of identifiers both in ADC and USFSC so that people could do this sort of automated work.
We also need to work to start with the itemization of the DTMS ontology terms in the data.gov datasets, and get those captured on the Ag Data Commons
DTMS is a research effort tied to a synthesis cetner that may not have a lot of longevity โ want to capture the results of their semantic work
But furthermore, we need to make sure that those ontology terms link to the existing agricultural thesauri and ontologies so that we can better link the data