SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
1
Taxonomy Development Revisited
Lessons Learned*
Marcie Zaharee, PhD
Data Harmony User Conference
February 2014
mzaharee@mitre.org
1
*Fahsi, A., Zaharee, M. (2013). Framework for Developing an Intelligence
Reconnaissance and Surveillance (ISR) Operations Taxonomy.
MITRE Technical ReportApproved for Public Release
2
Overview
• Recap from last year
– ISR Operations Taxonomy Development effort and framework
• Lessons Learned
– Working in teams
– Working with SMEs
– Developing, Maintaining, Exporting, and Posting
• Summary
Approved for Public Release
3
ISR Operations Taxonomy*
Research Questions:
• Can an adequate unclassified ISR Operations taxonomy be
built from open source material?
• Can an unclassified ISR Operations taxonomy be designed
in a way that:
– Is repeatable?
– Is easily accessible and understandable to the end user?
• How can an ISR Operations taxonomy…
– Provide terms for population of metadata?
– Be effectively exported to a machine-readable language and used to
facilitate searches?
*Classification scheme to categorize ISR Operations data assets (i.e.,
platforms and sensors)
Approved for Public Release
4
Taxonomy Development Framework
Approved for public release 13-140
5
Approved for Public Release
6
Educate team members
Approved for Public Release
7
Build consensus, not necessarily unanimity
Approved for Public Release
8
Document decisions
Approved for Public Release
9
A network of experts is essential
Approved for Public Release
10
Establish guidelines when working with
SMEs
Approved for Public Release
11
Graphical Representation is helpful when
working with SMEs
Approved for Public Release
12
Establish file naming conventions
Approved for Public Release
13
Aggregation can present a challenge
Approved for Public Release
14
Authoritative Sources May Not Exist
Approved for Public Release
15
Maintaining terms is complex and
resource intensive
Approved for Public Release
16
Export terms in a machine readable
language
Approved for Public Release
17
Understand posting requirements when
working with repositories
Approved for Public Release
18
User Feedback
• Human Readable
– Download metrics on Data Services Environment reflect interest in the
ISR taxonomies
– Differences in opinion on categorization of terms and overall hierarchy
• Machine Ingestible
– Longer PT and NPT names may result in unreadable User Interface (UI)
list labels as well as impacting search
Approved for Public Release
19
Summary
• Successfully answered our research questions
– Can an adequate unclassified ISR Operations taxonomy be built from
open source material?
– Can an unclassified ISR Operations taxonomy be designed in a way
that it is repeatable and understandable?
– How can an ISR Operations taxonomy be exported to a machine-
readable language and used to facilitate searches and provide terms for
population of metadata
• Nine step framework essential to our proof of concept,
but not without its limitations
Approved for Public Release

Weitere ähnliche Inhalte

Ähnlich wie Case Study: Taxonomies as a Tool to Increase Discovery of Intelligence Community Data Assets

Requirementv4
Requirementv4Requirementv4
Requirementv4stat
 
Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineSalford Systems
 
Requirements on No Requirements - When using agile is justified?
Requirements on No Requirements - When using agile is justified?Requirements on No Requirements - When using agile is justified?
Requirements on No Requirements - When using agile is justified?Ilia Bider
 
Technical competency dictionary for it
Technical competency dictionary for itTechnical competency dictionary for it
Technical competency dictionary for itConfidential
 
Building Data Science Teams, Abbreviated
Building Data Science Teams, AbbreviatedBuilding Data Science Teams, Abbreviated
Building Data Science Teams, AbbreviatedAllen Day, PhD
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologiesenterprisesearchmeetup
 
11 Strategic Considerations for SharePoint Migrations
11 Strategic Considerations for SharePoint Migrations11 Strategic Considerations for SharePoint Migrations
11 Strategic Considerations for SharePoint MigrationsChristian Buckley
 
Transforming knowledge management for climate action
Transforming knowledge management for climate action  Transforming knowledge management for climate action
Transforming knowledge management for climate action weADAPT
 
Smart cities no ai without ia
Smart cities   no ai without iaSmart cities   no ai without ia
Smart cities no ai without iaFredric Landqvist
 
11 Towards a Research Agenda for Recommendation Systems in Requirements Engin...
11 Towards a Research Agenda for Recommendation Systems in Requirements Engin...11 Towards a Research Agenda for Recommendation Systems in Requirements Engin...
11 Towards a Research Agenda for Recommendation Systems in Requirements Engin...Walid Maalej
 
What is DITA? And Is It Right for Your Team or Project?
What is DITA? And Is It Right for Your Team or Project?What is DITA? And Is It Right for Your Team or Project?
What is DITA? And Is It Right for Your Team or Project?Toni Mantych, MA, PMP
 
Research software identification - Catherine Jones
Research software identification - Catherine JonesResearch software identification - Catherine Jones
Research software identification - Catherine JonesJisc RDM
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madnesssemanticsconference
 
Forrester Wave Report about Collaboration Platforms
Forrester Wave Report about Collaboration PlatformsForrester Wave Report about Collaboration Platforms
Forrester Wave Report about Collaboration PlatformsAndré Schmid
 
Digital Preservation Policies - SCAPE
Digital Preservation Policies - SCAPEDigital Preservation Policies - SCAPE
Digital Preservation Policies - SCAPESCAPE Project
 
Yet LXi — Learning Experience Interface Overview
Yet LXi — Learning Experience Interface Overview Yet LXi — Learning Experience Interface Overview
Yet LXi — Learning Experience Interface Overview Margaret Roth
 
Project On-Science
Project On-ScienceProject On-Science
Project On-ScienceAmrit Ravi
 
Selecting FOSS Softwares
Selecting FOSS SoftwaresSelecting FOSS Softwares
Selecting FOSS SoftwaresDong Calmada
 

Ähnlich wie Case Study: Taxonomies as a Tool to Increase Discovery of Intelligence Community Data Assets (20)

Requirementv4
Requirementv4Requirementv4
Requirementv4
 
Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search Engine
 
Requirements on No Requirements - When using agile is justified?
Requirements on No Requirements - When using agile is justified?Requirements on No Requirements - When using agile is justified?
Requirements on No Requirements - When using agile is justified?
 
Technical competency dictionary for it
Technical competency dictionary for itTechnical competency dictionary for it
Technical competency dictionary for it
 
Maruti gollapudi cv
Maruti gollapudi cvMaruti gollapudi cv
Maruti gollapudi cv
 
Building Data Science Teams, Abbreviated
Building Data Science Teams, AbbreviatedBuilding Data Science Teams, Abbreviated
Building Data Science Teams, Abbreviated
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
 
11 Strategic Considerations for SharePoint Migrations
11 Strategic Considerations for SharePoint Migrations11 Strategic Considerations for SharePoint Migrations
11 Strategic Considerations for SharePoint Migrations
 
Transforming knowledge management for climate action
Transforming knowledge management for climate action  Transforming knowledge management for climate action
Transforming knowledge management for climate action
 
Smart cities no ai without ia
Smart cities   no ai without iaSmart cities   no ai without ia
Smart cities no ai without ia
 
11 Towards a Research Agenda for Recommendation Systems in Requirements Engin...
11 Towards a Research Agenda for Recommendation Systems in Requirements Engin...11 Towards a Research Agenda for Recommendation Systems in Requirements Engin...
11 Towards a Research Agenda for Recommendation Systems in Requirements Engin...
 
What is DITA? And Is It Right for Your Team or Project?
What is DITA? And Is It Right for Your Team or Project?What is DITA? And Is It Right for Your Team or Project?
What is DITA? And Is It Right for Your Team or Project?
 
Research software identification - Catherine Jones
Research software identification - Catherine JonesResearch software identification - Catherine Jones
Research software identification - Catherine Jones
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madness
 
Forrester Wave Report about Collaboration Platforms
Forrester Wave Report about Collaboration PlatformsForrester Wave Report about Collaboration Platforms
Forrester Wave Report about Collaboration Platforms
 
Digital Preservation Policies - SCAPE
Digital Preservation Policies - SCAPEDigital Preservation Policies - SCAPE
Digital Preservation Policies - SCAPE
 
Apache Solr vs Oracle Endeca
Apache Solr vs Oracle EndecaApache Solr vs Oracle Endeca
Apache Solr vs Oracle Endeca
 
Yet LXi — Learning Experience Interface Overview
Yet LXi — Learning Experience Interface Overview Yet LXi — Learning Experience Interface Overview
Yet LXi — Learning Experience Interface Overview
 
Project On-Science
Project On-ScienceProject On-Science
Project On-Science
 
Selecting FOSS Softwares
Selecting FOSS SoftwaresSelecting FOSS Softwares
Selecting FOSS Softwares
 

Mehr von Access Innovations, Inc.

Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsMaking AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsAccess Innovations, Inc.
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8Access Innovations, Inc.
 
Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Access Innovations, Inc.
 
Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021Access Innovations, Inc.
 
Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Access Innovations, Inc.
 
Tagging overview - Why Keywords Don't Cut It
Tagging overview  - Why Keywords Don't Cut ItTagging overview  - Why Keywords Don't Cut It
Tagging overview - Why Keywords Don't Cut ItAccess Innovations, Inc.
 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityAccess Innovations, Inc.
 
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedDHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedAccess Innovations, Inc.
 

Mehr von Access Innovations, Inc. (20)

Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy ResultsMaking AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
Making AI Behave: Using Knowledge Domains to Produce Useful, Trustworthy Results
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
 
Smart submit
Smart submitSmart submit
Smart submit
 
Plos taxonomy beyond search dhug 2021
Plos taxonomy beyond search   dhug 2021Plos taxonomy beyond search   dhug 2021
Plos taxonomy beyond search dhug 2021
 
Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)Hindawi taxonomy and personalization 27.10 (1)
Hindawi taxonomy and personalization 27.10 (1)
 
Data harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacingData harmonycloudpowerpointclientfacing
Data harmonycloudpowerpointclientfacing
 
Data harmony update 2021
Data harmony update 2021 Data harmony update 2021
Data harmony update 2021
 
Atypon dhug2021
Atypon dhug2021Atypon dhug2021
Atypon dhug2021
 
Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021Asco using ai-taxos-for meta-titles-february-2021
Asco using ai-taxos-for meta-titles-february-2021
 
Asce more than just topic taxonomies
Asce more than just topic taxonomiesAsce more than just topic taxonomies
Asce more than just topic taxonomies
 
Acs discoverability-dhug2021
Acs discoverability-dhug2021Acs discoverability-dhug2021
Acs discoverability-dhug2021
 
Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)Ai webinar 2 -what's in a name (consolidated pdf)
Ai webinar 2 -what's in a name (consolidated pdf)
 
Tagging overview - Why Keywords Don't Cut It
Tagging overview  - Why Keywords Don't Cut ItTagging overview  - Why Keywords Don't Cut It
Tagging overview - Why Keywords Don't Cut It
 
Health Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut ItHealth Affairs - Why Keywords Don't Cut It
Health Affairs - Why Keywords Don't Cut It
 
Why Keywords Don't Cut It
Why Keywords Don't Cut ItWhy Keywords Don't Cut It
Why Keywords Don't Cut It
 
Data Harmony update 2020 final
Data Harmony update 2020 finalData Harmony update 2020 final
Data Harmony update 2020 final
 
Data Harmony Update 2020 final
Data Harmony Update 2020 finalData Harmony Update 2020 final
Data Harmony Update 2020 final
 
DHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository InteroperabilityDHUG 2018: Towards Web-Centric Repository Interoperability
DHUG 2018: Towards Web-Centric Repository Interoperability
 
DHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCRDHUG 2018 - Florida Thesis OCR
DHUG 2018 - Florida Thesis OCR
 
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project FundedDHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
DHUG 2017 - Understanding ROI Just Enough to Get Your Project Funded
 

Case Study: Taxonomies as a Tool to Increase Discovery of Intelligence Community Data Assets

  • 1. 1 Taxonomy Development Revisited Lessons Learned* Marcie Zaharee, PhD Data Harmony User Conference February 2014 mzaharee@mitre.org 1 *Fahsi, A., Zaharee, M. (2013). Framework for Developing an Intelligence Reconnaissance and Surveillance (ISR) Operations Taxonomy. MITRE Technical ReportApproved for Public Release
  • 2. 2 Overview • Recap from last year – ISR Operations Taxonomy Development effort and framework • Lessons Learned – Working in teams – Working with SMEs – Developing, Maintaining, Exporting, and Posting • Summary Approved for Public Release
  • 3. 3 ISR Operations Taxonomy* Research Questions: • Can an adequate unclassified ISR Operations taxonomy be built from open source material? • Can an unclassified ISR Operations taxonomy be designed in a way that: – Is repeatable? – Is easily accessible and understandable to the end user? • How can an ISR Operations taxonomy… – Provide terms for population of metadata? – Be effectively exported to a machine-readable language and used to facilitate searches? *Classification scheme to categorize ISR Operations data assets (i.e., platforms and sensors) Approved for Public Release
  • 4. 4 Taxonomy Development Framework Approved for public release 13-140
  • 6. 6 Educate team members Approved for Public Release
  • 7. 7 Build consensus, not necessarily unanimity Approved for Public Release
  • 9. 9 A network of experts is essential Approved for Public Release
  • 10. 10 Establish guidelines when working with SMEs Approved for Public Release
  • 11. 11 Graphical Representation is helpful when working with SMEs Approved for Public Release
  • 12. 12 Establish file naming conventions Approved for Public Release
  • 13. 13 Aggregation can present a challenge Approved for Public Release
  • 14. 14 Authoritative Sources May Not Exist Approved for Public Release
  • 15. 15 Maintaining terms is complex and resource intensive Approved for Public Release
  • 16. 16 Export terms in a machine readable language Approved for Public Release
  • 17. 17 Understand posting requirements when working with repositories Approved for Public Release
  • 18. 18 User Feedback • Human Readable – Download metrics on Data Services Environment reflect interest in the ISR taxonomies – Differences in opinion on categorization of terms and overall hierarchy • Machine Ingestible – Longer PT and NPT names may result in unreadable User Interface (UI) list labels as well as impacting search Approved for Public Release
  • 19. 19 Summary • Successfully answered our research questions – Can an adequate unclassified ISR Operations taxonomy be built from open source material? – Can an unclassified ISR Operations taxonomy be designed in a way that it is repeatable and understandable? – How can an ISR Operations taxonomy be exported to a machine- readable language and used to facilitate searches and provide terms for population of metadata • Nine step framework essential to our proof of concept, but not without its limitations Approved for Public Release