SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Peter Fox (RPI) @taswegian
NFDP 2013
May 22, 2013, Oxford, UK
The Now and Now for Data: Metaphors for
Making Data Publically Available
Am not going to …
http://mp-datamatters.blogspot.com/ Is Data Publication the Right Metaphor? http://dx.doi.org/10.2481/dsj.WDS-042
Just to get us going…
The latest (U.S. example)
International Council for Science – Strategic Coordinating
Committee on Information and Data - recommendation
http://eloquentscience.com/wp-
content/uploads/2011/04/open_access.jpg
http://www.icsu.org/publications/reports-and-reviews/strategic-coordinating-committee-on-information-and-data-report
OECD guidelines
= data access and
sharing policies
http://bernews.com/wp-content/uploads/2011/02/oecd-logo.jpg
ICSU SCCID recommendation
•Engage actively
– publishers of all kinds together
– library community
– scientific researchers
•To
– Document and promote community
best practice in the handling of
supplemental material, publication of
data and appropriate data citation.
http://www.leebullen.com/Fini
shed%20Pics/Scientists.jpg
?
Goal?
• Data as a first class object
• As a subject of
conversation (v. discourse)
• Metaphors to achieve this
abound and indicate a
particular stakeholder
perspective (worldview,
bias, edict, etc…)
It seems we are not quite there yet
• We*
are having
conversations (like the
one today) about data+x
(x=citation, publication,
integration, integrity,
ownership, trust, …)
• *
= ./ ../ // and / (unixtm
)
What if we had a conversation about this data?
20080602 Fox VSTO et al.
11
Metaphor!
12
Data Information Knowledge
Producers Consumers
Context
Presentation
Organization
Integration
Conversation
Creation
Gathering
Experience
• Ecosystem
• A framework
for talking
about data,
and …
Data perspective under some metaphors
13
Producers Consumers
Quality Control
Fitness for Purpose Fitness for Use
Quality Assessment
Trustee Trustor
For others: Is this separation good or not?
14
Producers Consumers
Quality Control
Fitness for Purpose Fitness for Use
Quality Assessment
Trustee Trustor
Publisher “Reader”
This may be us, or others
Technical advances
From: C. Borgman, 2008, NSF Cyberlearning Report
Global Change Information System (GCIS)
16
Vision:
A unified web based source of
authoritative, accessible, usable, and
timely information about climate and
global change for use by scientists,
decision makers, and the public.
Prototype Use Case
Name Discover and visit data center website of dataset used to generate report figure.
Goal The NCA Report reader sees a figure and wants to know where the data came from.
Summary A reader of the NCA is browsing the content via the website. He/she sees a figure and wants to know where the data came from. A reference
to the publication in which the figure originated appears in the figure caption. Selecting the link to the source publication displays a page of
information about the publication including, if available, the publication DOI. The page also includes references to the datasets cited in the
publication. Following each of dataset reference links presents a page of information about the dataset, including links back to the agency/data
center webpage describing the dataset in more detail and making the actual data available for order or download.
Actors Primary Actor - reader of the NCA
Preconditions Reader is viewing the NCA online report
Post Conditions Reader visits the data center dataset website
Normal Flow 1) System is presenting the NCA report to the reader in a web site. Presentation includes report figure with caption that includes reference to
source publication.
2) Reader selects publication reference in figure caption
3) System displays information about publication, including DOI (if available).
4) Publication information includes publication dataset citations.
5) Reader selects a dataset cited by the publication.
6) System displays information about dataset including links to agency / data center webpages where more information and (potentially) data
download links are available.
7) Reader selects the data center link and is redirected to data center dataset webpage.
Discover and visit data center website of dataset used to generate report figure.
Assessment links to information
18
Non-specialist Use Case
Name Find Latest Datasets by Keyword
Goal Search for datasets associated with the keyword “snow”, list search results by recentness of publication.
Summary User story:
I want to look for information concerning “snow.” I don’t know if it is a CLEAN word or a GCMD word or don’t even know what GCMD
or CLEAN is. How would I do it, and what would I see on my monitor during the process?
Assumptions The reader is not assumed to have knowledge regarding the GCMD Keywords (or other) vocabulary.
Actors Primary Actor - reader of the NCA
Preconditions TBD
Post Conditions Reader is presented with a list of datasets associated with the keyword “snow” sorted by dataset publication date.
Normal Flow TBD
Notes We are looking into two user interface options for dataset selection by keyword
1)As a free-text search where the user inputs “snow”.
2)Present the user a faceted browse interface with a vocabulary faceted which presents the user with terms from a structured vocabulary. The
user can manually select the term(s) which match or contain “snow”.
We intend to implement prototypes of both.
Search for datasets with the keyword “snow”, ….
Parsons
& Fox
Setting of the roles and relations
• Yes it is about contracts… of all sorts…
– An agency example, they are exploring a
number of metaphors
An un-named US govt. agency
Data Review!
From my Research Data Alliance talk; #5
• Please all SNAP your fingers (1, 2, 3,
NOW)
• <snap> the culture around data has to
change, as well as how we think about
paradigms (metaphors)
Call to discussion
• Multiple metaphors, many considerations
• An ecosystem approach allows multiple solutions in a
complex socio-technical system – transactions among
providers and consumers
– Significant opportunities for under-served data generators to get
their data ‘out there’ perhaps publication (still a metaphor!)
• Data Review !== Peer Review and more role disconnects
• <discuss>
• Please read our Data Science Journal essay and respond!
• Thanks for your attention - pfox@cs.rpi.edu , http://tw.rpi.edu
Back shed
Pros/Cons - Data Centres (‘big iron’)
• Volume
• Streamlined
• Automation
• Auditable
• Reprocessing capability
• Central authority
• Funded
• Over-reliance on automation
• Weak documentation
• Use is assumed
• Roles ill-defined, reputation?
• Does not handle heterogeneity
• Preservation ?
• Overly focused on generation
• …
Pros/Cons - Publishers
• Simple
• Tested
• Disseminated
• Shifted burden
• Imprimatur
• De-facto preservation
• Citable
• Based on science norms
• Locked
• Static/
• Not machine
accessible
• Cost?
• Not scalable
• Cannot verify use
Pros/Cons - Release (software)
• Many stages (alpha, beta,
release candidate, release)
• Versioned
• Documented and change
notified
• Intends to couple user
feedback to developers
• Packaged
• Licensing well thought out
• …
• Provenance implicit
• Preservation poorly dealt with
• Quality may be difficult to
determine
• Attribution not part of the mind-
set
• Derivative or embedded use
not always well defined
• …
Pros/Cons - Linked data
• Scales
• Built on web
• Simple model design
• Tested
• Disseminated
• Machine processable
• No central authority
• Heterogeneous
• Use not assumed
• Flexible evolution
• Supports encapsulation
• Poor versioning
• Poor auditing
• No imprimatur
• No preservation/ stewardship
• Not human friendly
• Heterogeneous vocab.
• Changes data model
• Unknown evolution
• …
33
.. Data has Lots of Audiences
From “Why EPO?”, a NASA internal
report on science education, 2005
More Strategic
Less Strategic
Science too!

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
Publishing the Full Research Data Lifecycle
Publishing the Full Research Data LifecyclePublishing the Full Research Data Lifecycle
Publishing the Full Research Data Lifecycle
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest Group
 
data citation
data citationdata citation
data citation
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
DataONE Education Module 08: Data Citation
DataONE Education Module 08: Data CitationDataONE Education Module 08: Data Citation
DataONE Education Module 08: Data Citation
 
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
 
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinaiDataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
 
Keystone summer school_2015_miguel_antonio_ldcompression_4-joined
Keystone summer school_2015_miguel_antonio_ldcompression_4-joinedKeystone summer school_2015_miguel_antonio_ldcompression_4-joined
Keystone summer school_2015_miguel_antonio_ldcompression_4-joined
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published record
 
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017
dkNET-NURSA Challenge Kick-Off Webinar 04/27/2017
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
 
dkNET Introductory Webinar 05/10/2017
dkNET Introductory Webinar 05/10/2017dkNET Introductory Webinar 05/10/2017
dkNET Introductory Webinar 05/10/2017
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
Exploration, visualization and querying of linked open data sources
Exploration, visualization and querying of linked open data sourcesExploration, visualization and querying of linked open data sources
Exploration, visualization and querying of linked open data sources
 
Perception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringPerception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document Clustering
 
UKSG 2018 Breakout - Trouble(shooting) with a capital T: how categorising and...
UKSG 2018 Breakout - Trouble(shooting) with a capital T: how categorising and...UKSG 2018 Breakout - Trouble(shooting) with a capital T: how categorising and...
UKSG 2018 Breakout - Trouble(shooting) with a capital T: how categorising and...
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
 
Shareable Metadata for Visual Resources
Shareable Metadata for Visual ResourcesShareable Metadata for Visual Resources
Shareable Metadata for Visual Resources
 

Ähnlich wie Fox-Keynote-Now and Now of Data Publishing-nfdp13

Getting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring SuccessGetting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring Success
kramsey
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
Lucy McKenna
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
Kristi Holmes
 

Ähnlich wie Fox-Keynote-Now and Now of Data Publishing-nfdp13 (20)

Webinar@AIMS: LODE-BD
Webinar@AIMS: LODE-BDWebinar@AIMS: LODE-BD
Webinar@AIMS: LODE-BD
 
In search of lost knowledge: joining the dots with Linked Data
In search of lost knowledge: joining the dots with Linked DataIn search of lost knowledge: joining the dots with Linked Data
In search of lost knowledge: joining the dots with Linked Data
 
Getting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring SuccessGetting the Most Out of Your E-Resources: Measuring Success
Getting the Most Out of Your E-Resources: Measuring Success
 
Ordering the chaos: Creating websites with imperfect data
Ordering the chaos: Creating websites with imperfect dataOrdering the chaos: Creating websites with imperfect data
Ordering the chaos: Creating websites with imperfect data
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data Generation
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
 
Research Data Publishing
Research Data PublishingResearch Data Publishing
Research Data Publishing
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Tutorial Data Management and workflows
Tutorial Data Management and workflowsTutorial Data Management and workflows
Tutorial Data Management and workflows
 
Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharing
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
 
Crediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teams
 
data.ac.uk briefing paper
data.ac.uk briefing paperdata.ac.uk briefing paper
data.ac.uk briefing paper
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
Establishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNBEstablishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNB
 
Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...
 
Fsci 2018 friday3_august_am6
Fsci 2018 friday3_august_am6Fsci 2018 friday3_august_am6
Fsci 2018 friday3_august_am6
 
Data Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim ClarkData Citation Implementation Guidelines By Tim Clark
Data Citation Implementation Guidelines By Tim Clark
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
 

Mehr von DataDryad

Mehr von DataDryad (20)

Wood-RDA and-data publishing-nfdp13
Wood-RDA and-data publishing-nfdp13Wood-RDA and-data publishing-nfdp13
Wood-RDA and-data publishing-nfdp13
 
Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13Smit-Scrap supplementary material-nfdp13
Smit-Scrap supplementary material-nfdp13
 
Michener-institutional and subject-specific data repositories-nfdp13
Michener-institutional and subject-specific data repositories-nfdp13Michener-institutional and subject-specific data repositories-nfdp13
Michener-institutional and subject-specific data repositories-nfdp13
 
Hole-data journal-nfdp13
Hole-data journal-nfdp13Hole-data journal-nfdp13
Hole-data journal-nfdp13
 
Shotton force11-nfdp13
Shotton force11-nfdp13Shotton force11-nfdp13
Shotton force11-nfdp13
 
Coles partnerships quality and trust-nfdp13
Coles partnerships quality and trust-nfdp13Coles partnerships quality and trust-nfdp13
Coles partnerships quality and trust-nfdp13
 
Irving-TeraData: data and science driven big industry-nfdp13
Irving-TeraData: data and science driven big industry-nfdp13Irving-TeraData: data and science driven big industry-nfdp13
Irving-TeraData: data and science driven big industry-nfdp13
 
Mounce-Herding Cats
Mounce-Herding CatsMounce-Herding Cats
Mounce-Herding Cats
 
Pfeiffenberger-Data Policies and Sustainability-NFDP13
Pfeiffenberger-Data Policies and Sustainability-NFDP13Pfeiffenberger-Data Policies and Sustainability-NFDP13
Pfeiffenberger-Data Policies and Sustainability-NFDP13
 
Lyon-data metrics panel introduction-nfdp13
Lyon-data metrics panel introduction-nfdp13Lyon-data metrics panel introduction-nfdp13
Lyon-data metrics panel introduction-nfdp13
 
Lyon-data publishing challenges-nfdp13
Lyon-data publishing challenges-nfdp13Lyon-data publishing challenges-nfdp13
Lyon-data publishing challenges-nfdp13
 
Costas-data metrics-nfdp13
Costas-data metrics-nfdp13Costas-data metrics-nfdp13
Costas-data metrics-nfdp13
 
Mowlam-semantic publishing-up-nfdp13
Mowlam-semantic publishing-up-nfdp13Mowlam-semantic publishing-up-nfdp13
Mowlam-semantic publishing-up-nfdp13
 
Manola-open aire and data publishing-nfdp13
Manola-open aire and data publishing-nfdp13Manola-open aire and data publishing-nfdp13
Manola-open aire and data publishing-nfdp13
 
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
Zudilova-Seinstra-Elsevier-data and the article of the future-nfdp13
 
Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13Wilson-npg-scientific data-nfdp13
Wilson-npg-scientific data-nfdp13
 
Pulverer-embo-source data-nfdp13
Pulverer-embo-source data-nfdp13Pulverer-embo-source data-nfdp13
Pulverer-embo-source data-nfdp13
 
Green-oecd and data publishing-nfdp13
Green-oecd and data publishing-nfdp13Green-oecd and data publishing-nfdp13
Green-oecd and data publishing-nfdp13
 
Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13Lawrence-f1000-publishing with data-nfdp13
Lawrence-f1000-publishing with data-nfdp13
 
Karunkara-Keynote-msf and open data-nfdp2013
Karunkara-Keynote-msf and open data-nfdp2013Karunkara-Keynote-msf and open data-nfdp2013
Karunkara-Keynote-msf and open data-nfdp2013
 

Kürzlich hochgeladen

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
SoniaTolstoy
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Krashi Coaching
 

Kürzlich hochgeladen (20)

A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 

Fox-Keynote-Now and Now of Data Publishing-nfdp13

  • 1. Peter Fox (RPI) @taswegian NFDP 2013 May 22, 2013, Oxford, UK The Now and Now for Data: Metaphors for Making Data Publically Available
  • 2. Am not going to … http://mp-datamatters.blogspot.com/ Is Data Publication the Right Metaphor? http://dx.doi.org/10.2481/dsj.WDS-042
  • 3. Just to get us going…
  • 4. The latest (U.S. example)
  • 5. International Council for Science – Strategic Coordinating Committee on Information and Data - recommendation http://eloquentscience.com/wp- content/uploads/2011/04/open_access.jpg http://www.icsu.org/publications/reports-and-reviews/strategic-coordinating-committee-on-information-and-data-report OECD guidelines = data access and sharing policies http://bernews.com/wp-content/uploads/2011/02/oecd-logo.jpg
  • 6. ICSU SCCID recommendation •Engage actively – publishers of all kinds together – library community – scientific researchers •To – Document and promote community best practice in the handling of supplemental material, publication of data and appropriate data citation. http://www.leebullen.com/Fini shed%20Pics/Scientists.jpg ?
  • 7. Goal? • Data as a first class object • As a subject of conversation (v. discourse) • Metaphors to achieve this abound and indicate a particular stakeholder perspective (worldview, bias, edict, etc…)
  • 8. It seems we are not quite there yet • We* are having conversations (like the one today) about data+x (x=citation, publication, integration, integrity, ownership, trust, …) • * = ./ ../ // and / (unixtm )
  • 9. What if we had a conversation about this data?
  • 10.
  • 11. 20080602 Fox VSTO et al. 11
  • 12. Metaphor! 12 Data Information Knowledge Producers Consumers Context Presentation Organization Integration Conversation Creation Gathering Experience • Ecosystem • A framework for talking about data, and …
  • 13. Data perspective under some metaphors 13 Producers Consumers Quality Control Fitness for Purpose Fitness for Use Quality Assessment Trustee Trustor
  • 14. For others: Is this separation good or not? 14 Producers Consumers Quality Control Fitness for Purpose Fitness for Use Quality Assessment Trustee Trustor Publisher “Reader” This may be us, or others
  • 15. Technical advances From: C. Borgman, 2008, NSF Cyberlearning Report
  • 16. Global Change Information System (GCIS) 16 Vision: A unified web based source of authoritative, accessible, usable, and timely information about climate and global change for use by scientists, decision makers, and the public.
  • 17. Prototype Use Case Name Discover and visit data center website of dataset used to generate report figure. Goal The NCA Report reader sees a figure and wants to know where the data came from. Summary A reader of the NCA is browsing the content via the website. He/she sees a figure and wants to know where the data came from. A reference to the publication in which the figure originated appears in the figure caption. Selecting the link to the source publication displays a page of information about the publication including, if available, the publication DOI. The page also includes references to the datasets cited in the publication. Following each of dataset reference links presents a page of information about the dataset, including links back to the agency/data center webpage describing the dataset in more detail and making the actual data available for order or download. Actors Primary Actor - reader of the NCA Preconditions Reader is viewing the NCA online report Post Conditions Reader visits the data center dataset website Normal Flow 1) System is presenting the NCA report to the reader in a web site. Presentation includes report figure with caption that includes reference to source publication. 2) Reader selects publication reference in figure caption 3) System displays information about publication, including DOI (if available). 4) Publication information includes publication dataset citations. 5) Reader selects a dataset cited by the publication. 6) System displays information about dataset including links to agency / data center webpages where more information and (potentially) data download links are available. 7) Reader selects the data center link and is redirected to data center dataset webpage. Discover and visit data center website of dataset used to generate report figure.
  • 18. Assessment links to information 18
  • 19. Non-specialist Use Case Name Find Latest Datasets by Keyword Goal Search for datasets associated with the keyword “snow”, list search results by recentness of publication. Summary User story: I want to look for information concerning “snow.” I don’t know if it is a CLEAN word or a GCMD word or don’t even know what GCMD or CLEAN is. How would I do it, and what would I see on my monitor during the process? Assumptions The reader is not assumed to have knowledge regarding the GCMD Keywords (or other) vocabulary. Actors Primary Actor - reader of the NCA Preconditions TBD Post Conditions Reader is presented with a list of datasets associated with the keyword “snow” sorted by dataset publication date. Normal Flow TBD Notes We are looking into two user interface options for dataset selection by keyword 1)As a free-text search where the user inputs “snow”. 2)Present the user a faceted browse interface with a vocabulary faceted which presents the user with terms from a structured vocabulary. The user can manually select the term(s) which match or contain “snow”. We intend to implement prototypes of both. Search for datasets with the keyword “snow”, ….
  • 21.
  • 22. Setting of the roles and relations • Yes it is about contracts… of all sorts… – An agency example, they are exploring a number of metaphors
  • 23. An un-named US govt. agency
  • 25. From my Research Data Alliance talk; #5 • Please all SNAP your fingers (1, 2, 3, NOW) • <snap> the culture around data has to change, as well as how we think about paradigms (metaphors)
  • 26. Call to discussion • Multiple metaphors, many considerations • An ecosystem approach allows multiple solutions in a complex socio-technical system – transactions among providers and consumers – Significant opportunities for under-served data generators to get their data ‘out there’ perhaps publication (still a metaphor!) • Data Review !== Peer Review and more role disconnects • <discuss> • Please read our Data Science Journal essay and respond! • Thanks for your attention - pfox@cs.rpi.edu , http://tw.rpi.edu
  • 28. Pros/Cons - Data Centres (‘big iron’) • Volume • Streamlined • Automation • Auditable • Reprocessing capability • Central authority • Funded • Over-reliance on automation • Weak documentation • Use is assumed • Roles ill-defined, reputation? • Does not handle heterogeneity • Preservation ? • Overly focused on generation • …
  • 29. Pros/Cons - Publishers • Simple • Tested • Disseminated • Shifted burden • Imprimatur • De-facto preservation • Citable • Based on science norms • Locked • Static/ • Not machine accessible • Cost? • Not scalable • Cannot verify use
  • 30. Pros/Cons - Release (software) • Many stages (alpha, beta, release candidate, release) • Versioned • Documented and change notified • Intends to couple user feedback to developers • Packaged • Licensing well thought out • … • Provenance implicit • Preservation poorly dealt with • Quality may be difficult to determine • Attribution not part of the mind- set • Derivative or embedded use not always well defined • …
  • 31. Pros/Cons - Linked data • Scales • Built on web • Simple model design • Tested • Disseminated • Machine processable • No central authority • Heterogeneous • Use not assumed • Flexible evolution • Supports encapsulation • Poor versioning • Poor auditing • No imprimatur • No preservation/ stewardship • Not human friendly • Heterogeneous vocab. • Changes data model • Unknown evolution • …
  • 32.
  • 33. 33 .. Data has Lots of Audiences From “Why EPO?”, a NASA internal report on science education, 2005 More Strategic Less Strategic Science too!

Hinweis der Redaktion

  1. http://brianwhitworth.com/STS/WordleCover.png Conversations about data. 1 st class v discourse
  2. http://gbeaubouef.files.wordpress.com/2012/01/slide1.jpg http://eksouth.weebly.com/uploads/6/5/2/3/6523461/8933221.jpg Or lecture you, or cheer lead.
  3. ICSU should establish a forum for the exploration and eventual agreement in relation to science of all the terms used under the broad umbrella of Open Access .
  4. ICSU should engage actively with publishers of all kinds together with the library community and with scientific researchers to document and promote community best practice in the handling of supplemental material, publication of data and appropriate data citation. http://www.einstruction.com/files/default/files/publishers.jpg
  5. http://www.aascend.org/wp-content/uploads/2012/03/argument-cartoon2.jpg
  6. Frohlich et al.
  7. The NCA report reader sees a figure and he/she wants to know where the data came from
  8. http://www.instantdisplay.co.uk/metaphors.jpg Table 1. Parsons and Fox, DSJ 2013.
  9. http://andreasgal.files.wordpress.com/2011/04/safety.jpg
  10. There are lots of different kinds of audiences interested in data. While we are talking about using data in the classroom today, several other audiences of are importance to Virtual observatories. In particular, on the more strategic end are groups that, while smaller, have great impact on the public ’s and the government’s perception of the value of the data and its providers. In this category, I would place both science policy specialists and the media. Policy specialists and decision makers have a tremendous impact on budgets, but also feel, at least at some level, beholden to the tax payers. They want to see the impact that data has on people’s lives. They are also looking for information that will help them made an informed decision. In addition, the media plays a critical role, providing about 85% of the science content to the general public. A third group that is worth considering is the educated general public (the science-attentive public). They take science very seriously and can be a vocal advocate for a scinetific resource -- look at the Hubble scenario as an example.