1. Software Impact, Metrics, and Citation
Daniel S. Katz
Program Director, Office of Cyberinfrastructure
2. Measuring Impact – Scenarios
1. Developer of open source physics simulation
– Possible metrics
• How many downloads? (easiest to measure, least value)
• How many contributors?
• How many uses?
• How many papers cite it?
• How many papers that cite it are cited? (hardest to measure, most value)
2. Developer of open source math library
– Possible metrics are similar, but citations are less likely
– What if users don’t download it?
• It’s part of a distro
• It’s pre-installed (and optimized) on an HPC system
• It’s part of a cloud image
• It’s a service
3. Vision for Metrics & Citation, part 1
• Products (software, paper, data set) are registered
– Credit map (weighted list of contributors: people, products, etc.) is an input
– DOI is an output
– Leads to transitive credit
• E.g., paper 1 provides 25% credit to software A, and software A provides 10% credit to library X -> library X gets 2.5% credit for paper 1 (see the sketch after this slide)
• Helps developer – “my tools are widely used, give me tenure” or “NSF should fund my tool maintenance”
– Issues:
• Social: Trust in person who registers a product
– This seems to work for papers today (without weights) for both author lists and citations
– Do weights require more than human memory?
• Technological: Registration system
– Where is it (or are they), what are the interfaces, and how do they work together?
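To make the transitive-credit arithmetic above concrete, here is a minimal Python sketch. The product names, credit maps, and weights are all hypothetical; the only point it illustrates is that credit multiplies along every path from a product to its contributors.

```python
# Hypothetical credit maps: each registered product lists its
# contributors (people or other products) with fractional weights.
credit_maps = {
    "paper-1":    {"author-A": 0.75, "software-A": 0.25},
    "software-A": {"developer-B": 0.90, "library-X": 0.10},
    "library-X":  {"developer-C": 1.00},
}

def transitive_credit(product, weight=1.0, totals=None):
    """Propagate credit: a contributor's share of a product is
    multiplied along every path from the top-level product."""
    if totals is None:
        totals = {}
    for contributor, share in credit_maps.get(product, {}).items():
        totals[contributor] = totals.get(contributor, 0.0) + weight * share
        if contributor in credit_maps:
            # Contributor is itself a registered product: recurse.
            transitive_credit(contributor, weight * share, totals)
    return totals

print(transitive_credit("paper-1"))
# library-X ends up with 0.25 * 0.10 = 0.025,
# i.e. 2.5% credit for paper 1, as in the example above
```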
4. Vision for Metrics & Citation, part 2
• Product usage is recorded
– Where?
• Both the developer and user want to track usage
• Privacy issues? (legal, competitive, ...)
• Via a phone home mechanism?
– What does “using” a data set mean, and what would trigger a usage record?
– Can general code be developed for this, to be incorporated in software packages? (a sketch follows this slide)
• With user input, tie later products to usage
– User may not know science outcome when using tool
– After science outcome is known, may be hard to determine which product usages were involved
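As a minimal sketch of what a general, embeddable “phone home” usage hook might look like: the registry endpoint, payload fields, and opt-out environment variable below are all assumptions, not an existing standard.

```python
import json
import os
import urllib.request

USAGE_ENDPOINT = "https://example.org/usage"  # hypothetical registry

def record_usage(product_doi, version):
    """Send an anonymous usage record unless the user has opted out."""
    if os.environ.get("PRODUCT_USAGE_OPTOUT"):  # respect privacy choice
        return
    payload = json.dumps({"doi": product_doi, "version": version}).encode()
    req = urllib.request.Request(
        USAGE_ENDPOINT,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    try:
        urllib.request.urlopen(req, timeout=2)  # best effort only
    except OSError:
        pass  # recording usage must never break the host package
```

Making the call opt-out and best-effort only partially addresses the privacy question above; legal and competitive concerns would still need policy, not just code.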
5. Vision for Metrics & Citation, thoughts
• Can this be done incrementally?
• Lack of credit is a larger problem than often perceived
– Lack of credit is a disincentive for sharing software and data
– Providing credit would both remove the disincentive and add an incentive
– See Lewin’s principle of force field analysis (1943)
• For commercial tools, credit is tracked by $
– But this doesn’t help us understand what tools were used for what outcomes
– Does this encourage collaboration?
• Could a more economic model be used?
– NSF gives tokens as part of science grants; users distribute tokens while/after using tools (a toy sketch follows)
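A toy sketch of the token idea, assuming a single ledger in which a funder (e.g., NSF) attaches tokens to a grant and grantees assign them to the tools they used; the class, names, and amounts are invented for illustration.

```python
from collections import defaultdict

class TokenLedger:
    """Hypothetical ledger: grants hold tokens, tools accumulate them."""
    def __init__(self):
        self.balances = defaultdict(int)     # grant -> unspent tokens
        self.tool_credit = defaultdict(int)  # tool -> tokens received

    def allocate(self, grant, tokens):
        """Funder attaches tokens to a science grant."""
        self.balances[grant] += tokens

    def spend(self, grant, tool, tokens):
        """Grantee distributes tokens to a tool used in the work."""
        if tokens > self.balances[grant]:
            raise ValueError("not enough tokens left on this grant")
        self.balances[grant] -= tokens
        self.tool_credit[tool] += tokens

ledger = TokenLedger()
ledger.allocate("NSF-grant-123", 100)
ledger.spend("NSF-grant-123", "software-A", 30)
ledger.spend("NSF-grant-123", "library-X", 10)
print(dict(ledger.tool_credit))  # {'software-A': 30, 'library-X': 10}
```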