SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Downloaden Sie, um offline zu lesen
Metadata Primer

                     Selvakumar T.S




1   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Source: Siderean Software, Inc.



    All of the answers are here. Now, what was the question?



2    August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Issues with Information Access Today

    • Tons of content from disparate sources.
    • Cumbersome navigation.
    • Keyword search assumes you know what you are
      looking for.
    • L
      Large number of search results -- most of them
                  b   f      h     lt      t f th
      irrelevant.
    • Lack of context in search results.
    • Search engines rely on mathematical algorithms to
      determine relevance and ranking of search results.

           Fortune 500 companies lost $12 billion due to
                inability to find information in 2003
                                                 2003.
                                  -IDC
3   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
A Quick Demo on Information
    Access Issues and Possibilities

     Source:




4
Agenda

    •   Understanding Metadata
    •   Metadata Applications
    •   Metadata Standards
    •   Working with Metadata
    •   Future of Metadata




5
Understanding Metadata




6
INVENT VE
     N E TI

                What is Metadata?
                  at s etadata

                Data that provides
                information about other
                data.
                   – Merriam Webster’s
                     Online Dictionary
                Data about data. For
                example, the title, subject,
                author, and size of a file
                constitute metadata about
                the file.
                    file
                   – Microsoft Computer
7
                     Dictionary, Fifth Edition
Metadata Example: File > Properties in Microsoft Office




8
Metadata Example: Album Information in Media
    Players




9
Metadata in HTML
     <META name=<property> content=“<value>” />




10   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Reflects Content and User Needs




11   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Types of Metadata

         Intrinsic: metadata that an object holds about itself
                     File name, file size …
          Descriptive: metadata that describes th object
          D    i ti      t d t th t d     ib the bj t
          Subject, title, audience, keywords …
        Metadata describes the who, what, when, where and
        Administrative and Rights: metadata used to manage
                    how about every facet of data
                                             data.
                             the object
12
             Create date, modify date, expiry
Metadata Applications




13   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Improved Search with Metadata
     • Filter search by metadata.




14   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Improved Navigation with Metadata
     • Aggregate topics with same metadata to create
       browseable indexes or categories.




15   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Display Context and Relationships with Metadata




                                               Cross-marketing on amazon.com.
                                                             g




16   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Personalization and customization
     • Display content according to role or audience
                                            audience.




17   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Other Metadata Applications

     • Discovery and compliance
            – Identify the need to update, retain, protect, and dispose content
              for i t
              f internal or regulatory requirements.
                         l       l t        i       t
     • Interoperability …
            – Content tagged with same metatags (
                         gg                      g (META name) from
                                                             )
              different sources can be easily integrated.


             Metadata allows unstructured content to be managed
                           like structured content.




18   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Standards




19   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Need for Metadata Standards

     • Different information providers using different metadata
       schemas.
     • Even metadata schemas of groups within organizations
       are different or out of sync.
     • The result:
            –     Inconsistent search results.
            –     Lack of interoperability.
            –     Information silos.
            –     …
      An US$ 2B Oil & Gas project suffered a loss of US$120M due to inability to
     locate a document or a misunderstanding about which document is needed.
                                -SchemaLogic, Inc.



20   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Some Metadata Standards

     • Dublin Core
     • Metadata support in DocBook and
                   pp
       DITA
     • IMS Global Learning Consortium
     • LOM (IEEE’s L
            (IEEE’ Learning Obj
                          i Object
       Metadata)
     • SCORM (ADL) - Learning Objects
     • EAD (Encoded Archival Description)


     Standard formats and approaches enable interoperability and
                      the sharing of metadata.
                                g

21   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Dublin Core

     • http://dublincore.org
     • General purpose metadata standard for use across
       domains.
     • 15 core elements.
     • El
       Element qualifiers t narrow th meaning of elements.
                t    lifi  to      the    i    f l      t
            – Example: A Date Created versus a Date Modified.
     • Encoding schemes: Controlled vocabularies or parsing
       rules to refine the interpretation of an element.
            – Example: A term from a controlled vocabulary such as the
              Library of Congress Subject Headings
                                          Headings.
     • Can be represented in HTML and in XML (RDF).



22   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Dublin Core Metadata Elements
     •    Title
     •    Creator
     •    Subject
               j
     •    Description
     •    Publisher
     •    Contributor
     •    Date
     •    Type
     •    Format
     •    Identifier
          Id tifi
     •    Source
     •    Language
     •    Relation
     •    Coverage
     •    Rights



23   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Dublin Core Metadata Example




                                                                        Source: http://www.sics.se/~preben/DC/DC_guide.html


24   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Support in DocBook

     • Metadata at different levels
            – title, info and bookinfo at book level
            – title, info and chapterinfo at chapter level
            – title, info and chapterinfo at section level


     • DocBook supports Dublin Core schema




25   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Support in DITA
     •      DITA supports a variety of standard and custom
            metadata:
            –         Author information
            –         Copyright information
            –         Product information
            –         Resource ID f h l systems
                      R          IDs for help t
            –         Document tracking information
            –         Audience information
            –         Keywords
                      K       d
            –         Custom metadata (otherprops)
     •      <prolog> element defines metadata at the topic level.
     •      <topicmeta> element defines metadata that applies to a
            topic when it appears in a map.
     •      Metadata at every level

26   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Sample of Metadata elements within <prolog> element

     <prolog>
            <author> (name of topic’s author)
            <copyright>
            <critdates> (document tracking information)
            <permissions>
            <publisher>
            <source>
            <metadata>
                      <audience> (intended audience)
                           type=“user | purchaser | administrator | … | other”
                           othertype=
                           j
                           job=“installing | customizing | administering | … | other”
                                         g             g               g
                           otherjob=
                           experiencelevel=“novice | general | expert”
                      <category> (content category used for grouping topics)
                      <keywords> (keywords for search engines)
                      <prodinfo>
                      <othermeta>
            …
                  …




27   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Working with Metadata




28   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Creating Metadata

     • Create it from scratch.
     • Reuse existing metadata and build on it.
     • Start with a standard.




29   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Process
              Create                                                Add
              Content                                                                   Publish
                                                                  Metadata

                                     Review &                                Review &
                                     Improve                                 Improve

     •    Identify content that will benefit from metadata using the 80/20 rule.
     •    Build a controlled vocabulary or use a vocabulary from a
          commercial source such as www.taxonomywarehouse.com.
                                                      y
            – Example: The Getty Thesaurus of Geographic Names (TGN)
     •    Apply metadata to content using templates or using indexing tools.
     •    Get it reviewed.
                 reviewed
     •    Evaluate search logs and user surveys to improve metadata.
     •    Continuously review metadata.


30   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Template: A Manual Approach




31   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Indexing and Discovery Tools

     •    Data Harmony http://www.dataharmony.com
     •    Interwoven MetaTagger http://www.interwoven.com
     •    Mondeca http://www.mondeca.com
     •    MultiTes http://www.multites.com/
     •    Synaptica http://www.synaptica.com
     •    SchemaLogic http://www.schemalogic.com
     •    WebChoir http //
                    http://www.webchoir.com
                                 ebchoir com
     •    WordMap http://www.wordmap.com/




32   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Future of Metadata




33   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Future of Metadata

     • Automated metadata generation.
     • Social tagging – tagging by users.
     • Geo tagging.




34   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Social Tagging Example:                                            tagging




35   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Social Tagging Example:




36   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Geo Tagging Example




37   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Q&A

38   August 9, 2009   Cadence Confidential: Cadence Internal Use Only

Weitere ähnliche Inhalte

Was ist angesagt?

Data Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices, Assessments, and RoadmapsData Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices, Assessments, and RoadmapsDATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...Amazon Web Services
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...DATAVERSITY
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data MeshLibbySchulze
 
The ABCs of Treating Data as Product
The ABCs of Treating Data as ProductThe ABCs of Treating Data as Product
The ABCs of Treating Data as ProductDATAVERSITY
 
Master Data Management - Aligning Data, Process, and Governance
Master Data Management - Aligning Data, Process, and GovernanceMaster Data Management - Aligning Data, Process, and Governance
Master Data Management - Aligning Data, Process, and GovernanceDATAVERSITY
 
Row or Columnar Database
Row or Columnar DatabaseRow or Columnar Database
Row or Columnar DatabaseBiju Nair
 
DAMA Feb2015 Mastering Master Data
DAMA Feb2015 Mastering Master DataDAMA Feb2015 Mastering Master Data
DAMA Feb2015 Mastering Master DataMary Levins, PMP
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseDatabricks
 
Glossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceGlossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceDATAVERSITY
 
The Data Driven University - Automating Data Governance and Stewardship in Au...
The Data Driven University - Automating Data Governance and Stewardship in Au...The Data Driven University - Automating Data Governance and Stewardship in Au...
The Data Driven University - Automating Data Governance and Stewardship in Au...Pieter De Leenheer
 
Considerations for Data Access in the Lakehouse
Considerations for Data Access in the LakehouseConsiderations for Data Access in the Lakehouse
Considerations for Data Access in the LakehouseDatabricks
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...DataScienceConferenc1
 

Was ist angesagt? (20)

Data Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices, Assessments, and RoadmapsData Governance Best Practices, Assessments, and Roadmaps
Data Governance Best Practices, Assessments, and Roadmaps
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...
 
Data mesh
Data meshData mesh
Data mesh
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data Mesh
 
The ABCs of Treating Data as Product
The ABCs of Treating Data as ProductThe ABCs of Treating Data as Product
The ABCs of Treating Data as Product
 
Master Data Management - Aligning Data, Process, and Governance
Master Data Management - Aligning Data, Process, and GovernanceMaster Data Management - Aligning Data, Process, and Governance
Master Data Management - Aligning Data, Process, and Governance
 
Row or Columnar Database
Row or Columnar DatabaseRow or Columnar Database
Row or Columnar Database
 
DAMA Feb2015 Mastering Master Data
DAMA Feb2015 Mastering Master DataDAMA Feb2015 Mastering Master Data
DAMA Feb2015 Mastering Master Data
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
 
Glossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceGlossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data Governance
 
Data Vault Introduction
Data Vault IntroductionData Vault Introduction
Data Vault Introduction
 
The Data Driven University - Automating Data Governance and Stewardship in Au...
The Data Driven University - Automating Data Governance and Stewardship in Au...The Data Driven University - Automating Data Governance and Stewardship in Au...
The Data Driven University - Automating Data Governance and Stewardship in Au...
 
Considerations for Data Access in the Lakehouse
Considerations for Data Access in the LakehouseConsiderations for Data Access in the Lakehouse
Considerations for Data Access in the Lakehouse
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
 
Data Sharing with Snowflake
Data Sharing with SnowflakeData Sharing with Snowflake
Data Sharing with Snowflake
 
Vue d'ensemble Dremio
Vue d'ensemble DremioVue d'ensemble Dremio
Vue d'ensemble Dremio
 
Metadata Workshop
Metadata WorkshopMetadata Workshop
Metadata Workshop
 

Andere mochten auch

The Birthday Cards I have received
The Birthday Cards I have receivedThe Birthday Cards I have received
The Birthday Cards I have receivedBharti Athray
 
UX and Semantic web UXCamp London 2014
UX and Semantic web UXCamp London 2014UX and Semantic web UXCamp London 2014
UX and Semantic web UXCamp London 2014Nur Karadeniz
 
InsideOut Development 2013 Holiday Card
InsideOut Development 2013 Holiday CardInsideOut Development 2013 Holiday Card
InsideOut Development 2013 Holiday CardInsideOut Development
 
Social Business Design: Web 2.0 NYC
Social Business Design: Web 2.0 NYCSocial Business Design: Web 2.0 NYC
Social Business Design: Web 2.0 NYCDachis Group
 
Great Tips to Help You File Your Taxes (And Get a Refund)
Great Tips to Help You File Your Taxes (And Get a Refund) Great Tips to Help You File Your Taxes (And Get a Refund)
Great Tips to Help You File Your Taxes (And Get a Refund) Experian_US
 
Unsung Heroes of PHP
Unsung Heroes of PHPUnsung Heroes of PHP
Unsung Heroes of PHPjsmith92
 
Condom Fashion
Condom FashionCondom Fashion
Condom FashionPeety G
 
Sports Illustrated Models 2006
Sports Illustrated Models 2006Sports Illustrated Models 2006
Sports Illustrated Models 2006Peety G
 
The Business of Business Cards
The Business of Business CardsThe Business of Business Cards
The Business of Business CardsFastUpFront
 
Infographic: Happy Employees
Infographic: Happy EmployeesInfographic: Happy Employees
Infographic: Happy EmployeesBrian Junyor
 
Linked In Recruiting Solutions
Linked In Recruiting SolutionsLinked In Recruiting Solutions
Linked In Recruiting SolutionsFrank Sherfey
 
eMarketer Webinar: Key Digital Trends for 2012
eMarketer Webinar: Key Digital Trends for 2012eMarketer Webinar: Key Digital Trends for 2012
eMarketer Webinar: Key Digital Trends for 2012eMarketer
 
Mockingjays in the workplace
Mockingjays in the workplaceMockingjays in the workplace
Mockingjays in the workplaceDavid Bradford
 
The Bull
The BullThe Bull
The BullPeety G
 

Andere mochten auch (17)

The Birthday Cards I have received
The Birthday Cards I have receivedThe Birthday Cards I have received
The Birthday Cards I have received
 
UX and Semantic web UXCamp London 2014
UX and Semantic web UXCamp London 2014UX and Semantic web UXCamp London 2014
UX and Semantic web UXCamp London 2014
 
InsideOut Development 2013 Holiday Card
InsideOut Development 2013 Holiday CardInsideOut Development 2013 Holiday Card
InsideOut Development 2013 Holiday Card
 
Highest paying-jobs-in-america
Highest paying-jobs-in-americaHighest paying-jobs-in-america
Highest paying-jobs-in-america
 
2012 GFPR Launch at IFPRI March 14 2013
2012 GFPR Launch at IFPRI March 14 20132012 GFPR Launch at IFPRI March 14 2013
2012 GFPR Launch at IFPRI March 14 2013
 
Social Business Design: Web 2.0 NYC
Social Business Design: Web 2.0 NYCSocial Business Design: Web 2.0 NYC
Social Business Design: Web 2.0 NYC
 
Great Tips to Help You File Your Taxes (And Get a Refund)
Great Tips to Help You File Your Taxes (And Get a Refund) Great Tips to Help You File Your Taxes (And Get a Refund)
Great Tips to Help You File Your Taxes (And Get a Refund)
 
Unsung Heroes of PHP
Unsung Heroes of PHPUnsung Heroes of PHP
Unsung Heroes of PHP
 
Condom Fashion
Condom FashionCondom Fashion
Condom Fashion
 
Sports Illustrated Models 2006
Sports Illustrated Models 2006Sports Illustrated Models 2006
Sports Illustrated Models 2006
 
The Business of Business Cards
The Business of Business CardsThe Business of Business Cards
The Business of Business Cards
 
Generation We
Generation WeGeneration We
Generation We
 
Infographic: Happy Employees
Infographic: Happy EmployeesInfographic: Happy Employees
Infographic: Happy Employees
 
Linked In Recruiting Solutions
Linked In Recruiting SolutionsLinked In Recruiting Solutions
Linked In Recruiting Solutions
 
eMarketer Webinar: Key Digital Trends for 2012
eMarketer Webinar: Key Digital Trends for 2012eMarketer Webinar: Key Digital Trends for 2012
eMarketer Webinar: Key Digital Trends for 2012
 
Mockingjays in the workplace
Mockingjays in the workplaceMockingjays in the workplace
Mockingjays in the workplace
 
The Bull
The BullThe Bull
The Bull
 

Ähnlich wie Metadata Primer

Introduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsIntroduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsJenn Riley
 
Metadata Strategies - Data Squared
Metadata Strategies - Data SquaredMetadata Strategies - Data Squared
Metadata Strategies - Data SquaredDATAVERSITY
 
A JCR View of the World - adaptTo() 2012 Berlin
A JCR View of the World - adaptTo() 2012 BerlinA JCR View of the World - adaptTo() 2012 Berlin
A JCR View of the World - adaptTo() 2012 BerlinAlexander Klimetschek
 
Intelligent Cloud Enablement
Intelligent Cloud EnablementIntelligent Cloud Enablement
Intelligent Cloud EnablementDocuLynx
 
Metadata-powered dissemination of content
Metadata-powered dissemination of contentMetadata-powered dissemination of content
Metadata-powered dissemination of contentNikos Manouselis
 
Cni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesCni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesBDLSS
 
DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013Frauke Ziedorn
 
Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...ASIS&T
 
DataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsAnita de Waard
 
Documentum Data Models.ppt
Documentum Data Models.pptDocumentum Data Models.ppt
Documentum Data Models.pptHarysCv
 
Metadata and Tagging
Metadata and TaggingMetadata and Tagging
Metadata and Taggingpauloshea
 
Metadata: Towards Machine-Enabled Intelligence
Metadata: Towards Machine-Enabled Intelligence               Metadata: Towards Machine-Enabled Intelligence
Metadata: Towards Machine-Enabled Intelligence dannyijwest
 

Ähnlich wie Metadata Primer (20)

DITA Quick Start
DITA Quick StartDITA Quick Start
DITA Quick Start
 
Second Thoughts about Metadata Standards for Data
Second Thoughts about Metadata Standards for DataSecond Thoughts about Metadata Standards for Data
Second Thoughts about Metadata Standards for Data
 
Introduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsIntroduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH Fellows
 
Metadata Strategies - Data Squared
Metadata Strategies - Data SquaredMetadata Strategies - Data Squared
Metadata Strategies - Data Squared
 
A JCR View of the World - adaptTo() 2012 Berlin
A JCR View of the World - adaptTo() 2012 BerlinA JCR View of the World - adaptTo() 2012 Berlin
A JCR View of the World - adaptTo() 2012 Berlin
 
Intelligent Cloud Enablement
Intelligent Cloud EnablementIntelligent Cloud Enablement
Intelligent Cloud Enablement
 
Metadata-powered dissemination of content
Metadata-powered dissemination of contentMetadata-powered dissemination of content
Metadata-powered dissemination of content
 
Metadata lecture 5 part 2
Metadata lecture 5 part 2Metadata lecture 5 part 2
Metadata lecture 5 part 2
 
Cni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesCni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferies
 
Semtech2006
Semtech2006Semtech2006
Semtech2006
 
DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013
 
Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...
 
DataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE Education Module 07: Metadata
DataONE Education Module 07: Metadata
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Metadata 101
Metadata 101Metadata 101
Metadata 101
 
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
 
L07 metadata
L07 metadataL07 metadata
L07 metadata
 
Documentum Data Models.ppt
Documentum Data Models.pptDocumentum Data Models.ppt
Documentum Data Models.ppt
 
Metadata and Tagging
Metadata and TaggingMetadata and Tagging
Metadata and Tagging
 
Metadata: Towards Machine-Enabled Intelligence
Metadata: Towards Machine-Enabled Intelligence               Metadata: Towards Machine-Enabled Intelligence
Metadata: Towards Machine-Enabled Intelligence
 

Kürzlich hochgeladen

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 

Kürzlich hochgeladen (20)

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 

Metadata Primer

  • 1. Metadata Primer Selvakumar T.S 1 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 2. Source: Siderean Software, Inc. All of the answers are here. Now, what was the question? 2 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 3. Issues with Information Access Today • Tons of content from disparate sources. • Cumbersome navigation. • Keyword search assumes you know what you are looking for. • L Large number of search results -- most of them b f h lt t f th irrelevant. • Lack of context in search results. • Search engines rely on mathematical algorithms to determine relevance and ranking of search results. Fortune 500 companies lost $12 billion due to inability to find information in 2003 2003. -IDC 3 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 4. A Quick Demo on Information Access Issues and Possibilities Source: 4
  • 5. Agenda • Understanding Metadata • Metadata Applications • Metadata Standards • Working with Metadata • Future of Metadata 5
  • 7. INVENT VE N E TI What is Metadata? at s etadata Data that provides information about other data. – Merriam Webster’s Online Dictionary Data about data. For example, the title, subject, author, and size of a file constitute metadata about the file. file – Microsoft Computer 7 Dictionary, Fifth Edition
  • 8. Metadata Example: File > Properties in Microsoft Office 8
  • 9. Metadata Example: Album Information in Media Players 9
  • 10. Metadata in HTML <META name=<property> content=“<value>” /> 10 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 11. Metadata Reflects Content and User Needs 11 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 12. Types of Metadata Intrinsic: metadata that an object holds about itself File name, file size … Descriptive: metadata that describes th object D i ti t d t th t d ib the bj t Subject, title, audience, keywords … Metadata describes the who, what, when, where and Administrative and Rights: metadata used to manage how about every facet of data data. the object 12 Create date, modify date, expiry
  • 13. Metadata Applications 13 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 14. Improved Search with Metadata • Filter search by metadata. 14 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 15. Improved Navigation with Metadata • Aggregate topics with same metadata to create browseable indexes or categories. 15 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 16. Display Context and Relationships with Metadata Cross-marketing on amazon.com. g 16 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 17. Personalization and customization • Display content according to role or audience audience. 17 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 18. Other Metadata Applications • Discovery and compliance – Identify the need to update, retain, protect, and dispose content for i t f internal or regulatory requirements. l l t i t • Interoperability … – Content tagged with same metatags ( gg g (META name) from ) different sources can be easily integrated. Metadata allows unstructured content to be managed like structured content. 18 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 19. Metadata Standards 19 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 20. Need for Metadata Standards • Different information providers using different metadata schemas. • Even metadata schemas of groups within organizations are different or out of sync. • The result: – Inconsistent search results. – Lack of interoperability. – Information silos. – … An US$ 2B Oil & Gas project suffered a loss of US$120M due to inability to locate a document or a misunderstanding about which document is needed. -SchemaLogic, Inc. 20 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 21. Some Metadata Standards • Dublin Core • Metadata support in DocBook and pp DITA • IMS Global Learning Consortium • LOM (IEEE’s L (IEEE’ Learning Obj i Object Metadata) • SCORM (ADL) - Learning Objects • EAD (Encoded Archival Description) Standard formats and approaches enable interoperability and the sharing of metadata. g 21 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 22. Dublin Core • http://dublincore.org • General purpose metadata standard for use across domains. • 15 core elements. • El Element qualifiers t narrow th meaning of elements. t lifi to the i f l t – Example: A Date Created versus a Date Modified. • Encoding schemes: Controlled vocabularies or parsing rules to refine the interpretation of an element. – Example: A term from a controlled vocabulary such as the Library of Congress Subject Headings Headings. • Can be represented in HTML and in XML (RDF). 22 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 23. Dublin Core Metadata Elements • Title • Creator • Subject j • Description • Publisher • Contributor • Date • Type • Format • Identifier Id tifi • Source • Language • Relation • Coverage • Rights 23 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 24. Dublin Core Metadata Example Source: http://www.sics.se/~preben/DC/DC_guide.html 24 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 25. Metadata Support in DocBook • Metadata at different levels – title, info and bookinfo at book level – title, info and chapterinfo at chapter level – title, info and chapterinfo at section level • DocBook supports Dublin Core schema 25 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 26. Metadata Support in DITA • DITA supports a variety of standard and custom metadata: – Author information – Copyright information – Product information – Resource ID f h l systems R IDs for help t – Document tracking information – Audience information – Keywords K d – Custom metadata (otherprops) • <prolog> element defines metadata at the topic level. • <topicmeta> element defines metadata that applies to a topic when it appears in a map. • Metadata at every level 26 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 27. Sample of Metadata elements within <prolog> element <prolog> <author> (name of topic’s author) <copyright> <critdates> (document tracking information) <permissions> <publisher> <source> <metadata> <audience> (intended audience) type=“user | purchaser | administrator | … | other” othertype= j job=“installing | customizing | administering | … | other” g g g otherjob= experiencelevel=“novice | general | expert” <category> (content category used for grouping topics) <keywords> (keywords for search engines) <prodinfo> <othermeta> … … 27 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 28. Working with Metadata 28 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 29. Creating Metadata • Create it from scratch. • Reuse existing metadata and build on it. • Start with a standard. 29 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 30. Metadata Process Create Add Content Publish Metadata Review & Review & Improve Improve • Identify content that will benefit from metadata using the 80/20 rule. • Build a controlled vocabulary or use a vocabulary from a commercial source such as www.taxonomywarehouse.com. y – Example: The Getty Thesaurus of Geographic Names (TGN) • Apply metadata to content using templates or using indexing tools. • Get it reviewed. reviewed • Evaluate search logs and user surveys to improve metadata. • Continuously review metadata. 30 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 31. Metadata Template: A Manual Approach 31 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 32. Metadata Indexing and Discovery Tools • Data Harmony http://www.dataharmony.com • Interwoven MetaTagger http://www.interwoven.com • Mondeca http://www.mondeca.com • MultiTes http://www.multites.com/ • Synaptica http://www.synaptica.com • SchemaLogic http://www.schemalogic.com • WebChoir http // http://www.webchoir.com ebchoir com • WordMap http://www.wordmap.com/ 32 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 33. Future of Metadata 33 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 34. Future of Metadata • Automated metadata generation. • Social tagging – tagging by users. • Geo tagging. 34 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 35. Social Tagging Example: tagging 35 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 36. Social Tagging Example: 36 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 37. Geo Tagging Example 37 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 38. Q&A 38 August 9, 2009 Cadence Confidential: Cadence Internal Use Only