SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Downloaden Sie, um offline zu lesen
Metadata Primer

                     Selvakumar T.S




1   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Source: Siderean Software, Inc.



    All of the answers are here. Now, what was the question?



2    August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Issues with Information Access Today

    • Tons of content from disparate sources.
    • Cumbersome navigation.
    • Keyword search assumes you know what you are
      looking for.
    • L
      Large number of search results -- most of them
                  b   f      h     lt      t f th
      irrelevant.
    • Lack of context in search results.
    • Search engines rely on mathematical algorithms to
      determine relevance and ranking of search results.

           Fortune 500 companies lost $12 billion due to
                inability to find information in 2003
                                                 2003.
                                  -IDC
3   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
A Quick Demo on Information
    Access Issues and Possibilities

     Source:




4
Agenda

    •   Understanding Metadata
    •   Metadata Applications
    •   Metadata Standards
    •   Working with Metadata
    •   Future of Metadata




5
Understanding Metadata




6
INVENT VE
     N E TI

                What is Metadata?
                  at s etadata

                Data that provides
                information about other
                data.
                   – Merriam Webster’s
                     Online Dictionary
                Data about data. For
                example, the title, subject,
                author, and size of a file
                constitute metadata about
                the file.
                    file
                   – Microsoft Computer
7
                     Dictionary, Fifth Edition
Metadata Example: File > Properties in Microsoft Office




8
Metadata Example: Album Information in Media
    Players




9
Metadata in HTML
     <META name=<property> content=“<value>” />




10   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Reflects Content and User Needs




11   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Types of Metadata

         Intrinsic: metadata that an object holds about itself
                     File name, file size …
          Descriptive: metadata that describes th object
          D    i ti      t d t th t d     ib the bj t
          Subject, title, audience, keywords …
        Metadata describes the who, what, when, where and
        Administrative and Rights: metadata used to manage
                    how about every facet of data
                                             data.
                             the object
12
             Create date, modify date, expiry
Metadata Applications




13   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Improved Search with Metadata
     • Filter search by metadata.




14   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Improved Navigation with Metadata
     • Aggregate topics with same metadata to create
       browseable indexes or categories.




15   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Display Context and Relationships with Metadata




                                               Cross-marketing on amazon.com.
                                                             g




16   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Personalization and customization
     • Display content according to role or audience
                                            audience.




17   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Other Metadata Applications

     • Discovery and compliance
            – Identify the need to update, retain, protect, and dispose content
              for i t
              f internal or regulatory requirements.
                         l       l t        i       t
     • Interoperability …
            – Content tagged with same metatags (
                         gg                      g (META name) from
                                                             )
              different sources can be easily integrated.


             Metadata allows unstructured content to be managed
                           like structured content.




18   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Standards




19   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Need for Metadata Standards

     • Different information providers using different metadata
       schemas.
     • Even metadata schemas of groups within organizations
       are different or out of sync.
     • The result:
            –     Inconsistent search results.
            –     Lack of interoperability.
            –     Information silos.
            –     …
      An US$ 2B Oil & Gas project suffered a loss of US$120M due to inability to
     locate a document or a misunderstanding about which document is needed.
                                -SchemaLogic, Inc.



20   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Some Metadata Standards

     • Dublin Core
     • Metadata support in DocBook and
                   pp
       DITA
     • IMS Global Learning Consortium
     • LOM (IEEE’s L
            (IEEE’ Learning Obj
                          i Object
       Metadata)
     • SCORM (ADL) - Learning Objects
     • EAD (Encoded Archival Description)


     Standard formats and approaches enable interoperability and
                      the sharing of metadata.
                                g

21   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Dublin Core

     • http://dublincore.org
     • General purpose metadata standard for use across
       domains.
     • 15 core elements.
     • El
       Element qualifiers t narrow th meaning of elements.
                t    lifi  to      the    i    f l      t
            – Example: A Date Created versus a Date Modified.
     • Encoding schemes: Controlled vocabularies or parsing
       rules to refine the interpretation of an element.
            – Example: A term from a controlled vocabulary such as the
              Library of Congress Subject Headings
                                          Headings.
     • Can be represented in HTML and in XML (RDF).



22   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Dublin Core Metadata Elements
     •    Title
     •    Creator
     •    Subject
               j
     •    Description
     •    Publisher
     •    Contributor
     •    Date
     •    Type
     •    Format
     •    Identifier
          Id tifi
     •    Source
     •    Language
     •    Relation
     •    Coverage
     •    Rights



23   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Dublin Core Metadata Example




                                                                        Source: http://www.sics.se/~preben/DC/DC_guide.html


24   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Support in DocBook

     • Metadata at different levels
            – title, info and bookinfo at book level
            – title, info and chapterinfo at chapter level
            – title, info and chapterinfo at section level


     • DocBook supports Dublin Core schema




25   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Support in DITA
     •      DITA supports a variety of standard and custom
            metadata:
            –         Author information
            –         Copyright information
            –         Product information
            –         Resource ID f h l systems
                      R          IDs for help t
            –         Document tracking information
            –         Audience information
            –         Keywords
                      K       d
            –         Custom metadata (otherprops)
     •      <prolog> element defines metadata at the topic level.
     •      <topicmeta> element defines metadata that applies to a
            topic when it appears in a map.
     •      Metadata at every level

26   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Sample of Metadata elements within <prolog> element

     <prolog>
            <author> (name of topic’s author)
            <copyright>
            <critdates> (document tracking information)
            <permissions>
            <publisher>
            <source>
            <metadata>
                      <audience> (intended audience)
                           type=“user | purchaser | administrator | … | other”
                           othertype=
                           j
                           job=“installing | customizing | administering | … | other”
                                         g             g               g
                           otherjob=
                           experiencelevel=“novice | general | expert”
                      <category> (content category used for grouping topics)
                      <keywords> (keywords for search engines)
                      <prodinfo>
                      <othermeta>
            …
                  …




27   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Working with Metadata




28   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Creating Metadata

     • Create it from scratch.
     • Reuse existing metadata and build on it.
     • Start with a standard.




29   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Process
              Create                                                Add
              Content                                                                   Publish
                                                                  Metadata

                                     Review &                                Review &
                                     Improve                                 Improve

     •    Identify content that will benefit from metadata using the 80/20 rule.
     •    Build a controlled vocabulary or use a vocabulary from a
          commercial source such as www.taxonomywarehouse.com.
                                                      y
            – Example: The Getty Thesaurus of Geographic Names (TGN)
     •    Apply metadata to content using templates or using indexing tools.
     •    Get it reviewed.
                 reviewed
     •    Evaluate search logs and user surveys to improve metadata.
     •    Continuously review metadata.


30   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Template: A Manual Approach




31   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Metadata Indexing and Discovery Tools

     •    Data Harmony http://www.dataharmony.com
     •    Interwoven MetaTagger http://www.interwoven.com
     •    Mondeca http://www.mondeca.com
     •    MultiTes http://www.multites.com/
     •    Synaptica http://www.synaptica.com
     •    SchemaLogic http://www.schemalogic.com
     •    WebChoir http //
                    http://www.webchoir.com
                                 ebchoir com
     •    WordMap http://www.wordmap.com/




32   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Future of Metadata




33   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Future of Metadata

     • Automated metadata generation.
     • Social tagging – tagging by users.
     • Geo tagging.




34   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Social Tagging Example:                                            tagging




35   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Social Tagging Example:




36   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Geo Tagging Example




37   August 9, 2009   Cadence Confidential: Cadence Internal Use Only
Q&A

38   August 9, 2009   Cadence Confidential: Cadence Internal Use Only

Weitere ähnliche Inhalte

Was ist angesagt?

Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
DataWorks Summit
 
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
Seldon
 

Was ist angesagt? (20)

Master Data Management's Place in the Data Governance Landscape
Master Data Management's Place in the Data Governance Landscape Master Data Management's Place in the Data Governance Landscape
Master Data Management's Place in the Data Governance Landscape
 
Introduction to DCAM, the Data Management Capability Assessment Model - Editi...
Introduction to DCAM, the Data Management Capability Assessment Model - Editi...Introduction to DCAM, the Data Management Capability Assessment Model - Editi...
Introduction to DCAM, the Data Management Capability Assessment Model - Editi...
 
Master Data Management (MDM) for Mid-Market
Master Data Management (MDM) for Mid-MarketMaster Data Management (MDM) for Mid-Market
Master Data Management (MDM) for Mid-Market
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management
 
RWDG Webinar: Data Steward Definition and Other Data Governance Roles
RWDG Webinar: Data Steward Definition and Other Data Governance RolesRWDG Webinar: Data Steward Definition and Other Data Governance Roles
RWDG Webinar: Data Steward Definition and Other Data Governance Roles
 
Gartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner: Master Data Management Functionality
Gartner: Master Data Management Functionality
 
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
 
A Connections-first Approach to Supply Chain Optimization
A Connections-first Approach to Supply Chain OptimizationA Connections-first Approach to Supply Chain Optimization
A Connections-first Approach to Supply Chain Optimization
 
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
 
Transport for London - London's Operations Digital Twin
Transport for London - London's Operations Digital TwinTransport for London - London's Operations Digital Twin
Transport for London - London's Operations Digital Twin
 
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
TensorFlow London 18: Dr Alastair Moore, Towards the use of Graphical Models ...
 
Ebook - The Guide to Master Data Management
Ebook - The Guide to Master Data Management Ebook - The Guide to Master Data Management
Ebook - The Guide to Master Data Management
 
Reference master data management
Reference master data managementReference master data management
Reference master data management
 
RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?
 
Big data in malaysia
Big data in malaysiaBig data in malaysia
Big data in malaysia
 
Reference Data Management
Reference Data ManagementReference Data Management
Reference Data Management
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality Engineering
 
MDM and Reference Data
MDM and Reference DataMDM and Reference Data
MDM and Reference Data
 
MDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large EnterprisesMDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large Enterprises
 
Requirements for a Master Data Management (MDM) Solution - Presentation
Requirements for a Master Data Management (MDM) Solution - PresentationRequirements for a Master Data Management (MDM) Solution - Presentation
Requirements for a Master Data Management (MDM) Solution - Presentation
 

Andere mochten auch

InsideOut Development 2013 Holiday Card
InsideOut Development 2013 Holiday CardInsideOut Development 2013 Holiday Card
InsideOut Development 2013 Holiday Card
InsideOut Development
 
Linked In Recruiting Solutions
Linked In Recruiting SolutionsLinked In Recruiting Solutions
Linked In Recruiting Solutions
Frank Sherfey
 

Andere mochten auch (17)

The Birthday Cards I have received
The Birthday Cards I have receivedThe Birthday Cards I have received
The Birthday Cards I have received
 
UX and Semantic web UXCamp London 2014
UX and Semantic web UXCamp London 2014UX and Semantic web UXCamp London 2014
UX and Semantic web UXCamp London 2014
 
InsideOut Development 2013 Holiday Card
InsideOut Development 2013 Holiday CardInsideOut Development 2013 Holiday Card
InsideOut Development 2013 Holiday Card
 
Highest paying-jobs-in-america
Highest paying-jobs-in-americaHighest paying-jobs-in-america
Highest paying-jobs-in-america
 
2012 GFPR Launch at IFPRI March 14 2013
2012 GFPR Launch at IFPRI March 14 20132012 GFPR Launch at IFPRI March 14 2013
2012 GFPR Launch at IFPRI March 14 2013
 
Social Business Design: Web 2.0 NYC
Social Business Design: Web 2.0 NYCSocial Business Design: Web 2.0 NYC
Social Business Design: Web 2.0 NYC
 
Great Tips to Help You File Your Taxes (And Get a Refund)
Great Tips to Help You File Your Taxes (And Get a Refund) Great Tips to Help You File Your Taxes (And Get a Refund)
Great Tips to Help You File Your Taxes (And Get a Refund)
 
Unsung Heroes of PHP
Unsung Heroes of PHPUnsung Heroes of PHP
Unsung Heroes of PHP
 
Condom Fashion
Condom FashionCondom Fashion
Condom Fashion
 
Sports Illustrated Models 2006
Sports Illustrated Models 2006Sports Illustrated Models 2006
Sports Illustrated Models 2006
 
The Business of Business Cards
The Business of Business CardsThe Business of Business Cards
The Business of Business Cards
 
Generation We
Generation WeGeneration We
Generation We
 
Infographic: Happy Employees
Infographic: Happy EmployeesInfographic: Happy Employees
Infographic: Happy Employees
 
Linked In Recruiting Solutions
Linked In Recruiting SolutionsLinked In Recruiting Solutions
Linked In Recruiting Solutions
 
eMarketer Webinar: Key Digital Trends for 2012
eMarketer Webinar: Key Digital Trends for 2012eMarketer Webinar: Key Digital Trends for 2012
eMarketer Webinar: Key Digital Trends for 2012
 
Mockingjays in the workplace
Mockingjays in the workplaceMockingjays in the workplace
Mockingjays in the workplace
 
The Bull
The BullThe Bull
The Bull
 

Ähnlich wie Metadata Primer

A JCR View of the World - adaptTo() 2012 Berlin
A JCR View of the World - adaptTo() 2012 BerlinA JCR View of the World - adaptTo() 2012 Berlin
A JCR View of the World - adaptTo() 2012 Berlin
Alexander Klimetschek
 
Cni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesCni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferies
BDLSS
 
DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013
Frauke Ziedorn
 

Ähnlich wie Metadata Primer (20)

DITA Quick Start
DITA Quick StartDITA Quick Start
DITA Quick Start
 
Second Thoughts about Metadata Standards for Data
Second Thoughts about Metadata Standards for DataSecond Thoughts about Metadata Standards for Data
Second Thoughts about Metadata Standards for Data
 
Introduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsIntroduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH Fellows
 
Metadata Strategies - Data Squared
Metadata Strategies - Data SquaredMetadata Strategies - Data Squared
Metadata Strategies - Data Squared
 
A JCR View of the World - adaptTo() 2012 Berlin
A JCR View of the World - adaptTo() 2012 BerlinA JCR View of the World - adaptTo() 2012 Berlin
A JCR View of the World - adaptTo() 2012 Berlin
 
Intelligent Cloud Enablement
Intelligent Cloud EnablementIntelligent Cloud Enablement
Intelligent Cloud Enablement
 
Metadata-powered dissemination of content
Metadata-powered dissemination of contentMetadata-powered dissemination of content
Metadata-powered dissemination of content
 
Metadata lecture 5 part 2
Metadata lecture 5 part 2Metadata lecture 5 part 2
Metadata lecture 5 part 2
 
Cni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesCni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferies
 
Semtech2006
Semtech2006Semtech2006
Semtech2006
 
DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013
 
Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...
 
DataONE Education Module 07: Metadata
DataONE Education Module 07: MetadataDataONE Education Module 07: Metadata
DataONE Education Module 07: Metadata
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Metadata 101
Metadata 101Metadata 101
Metadata 101
 
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
 
L07 metadata
L07 metadataL07 metadata
L07 metadata
 
Documentum Data Models.ppt
Documentum Data Models.pptDocumentum Data Models.ppt
Documentum Data Models.ppt
 
Metadata and Tagging
Metadata and TaggingMetadata and Tagging
Metadata and Tagging
 
Metadata: Towards Machine-Enabled Intelligence
Metadata: Towards Machine-Enabled Intelligence               Metadata: Towards Machine-Enabled Intelligence
Metadata: Towards Machine-Enabled Intelligence
 

Kürzlich hochgeladen

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Kürzlich hochgeladen (20)

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Metadata Primer

  • 1. Metadata Primer Selvakumar T.S 1 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 2. Source: Siderean Software, Inc. All of the answers are here. Now, what was the question? 2 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 3. Issues with Information Access Today • Tons of content from disparate sources. • Cumbersome navigation. • Keyword search assumes you know what you are looking for. • L Large number of search results -- most of them b f h lt t f th irrelevant. • Lack of context in search results. • Search engines rely on mathematical algorithms to determine relevance and ranking of search results. Fortune 500 companies lost $12 billion due to inability to find information in 2003 2003. -IDC 3 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 4. A Quick Demo on Information Access Issues and Possibilities Source: 4
  • 5. Agenda • Understanding Metadata • Metadata Applications • Metadata Standards • Working with Metadata • Future of Metadata 5
  • 7. INVENT VE N E TI What is Metadata? at s etadata Data that provides information about other data. – Merriam Webster’s Online Dictionary Data about data. For example, the title, subject, author, and size of a file constitute metadata about the file. file – Microsoft Computer 7 Dictionary, Fifth Edition
  • 8. Metadata Example: File > Properties in Microsoft Office 8
  • 9. Metadata Example: Album Information in Media Players 9
  • 10. Metadata in HTML <META name=<property> content=“<value>” /> 10 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 11. Metadata Reflects Content and User Needs 11 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 12. Types of Metadata Intrinsic: metadata that an object holds about itself File name, file size … Descriptive: metadata that describes th object D i ti t d t th t d ib the bj t Subject, title, audience, keywords … Metadata describes the who, what, when, where and Administrative and Rights: metadata used to manage how about every facet of data data. the object 12 Create date, modify date, expiry
  • 13. Metadata Applications 13 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 14. Improved Search with Metadata • Filter search by metadata. 14 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 15. Improved Navigation with Metadata • Aggregate topics with same metadata to create browseable indexes or categories. 15 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 16. Display Context and Relationships with Metadata Cross-marketing on amazon.com. g 16 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 17. Personalization and customization • Display content according to role or audience audience. 17 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 18. Other Metadata Applications • Discovery and compliance – Identify the need to update, retain, protect, and dispose content for i t f internal or regulatory requirements. l l t i t • Interoperability … – Content tagged with same metatags ( gg g (META name) from ) different sources can be easily integrated. Metadata allows unstructured content to be managed like structured content. 18 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 19. Metadata Standards 19 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 20. Need for Metadata Standards • Different information providers using different metadata schemas. • Even metadata schemas of groups within organizations are different or out of sync. • The result: – Inconsistent search results. – Lack of interoperability. – Information silos. – … An US$ 2B Oil & Gas project suffered a loss of US$120M due to inability to locate a document or a misunderstanding about which document is needed. -SchemaLogic, Inc. 20 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 21. Some Metadata Standards • Dublin Core • Metadata support in DocBook and pp DITA • IMS Global Learning Consortium • LOM (IEEE’s L (IEEE’ Learning Obj i Object Metadata) • SCORM (ADL) - Learning Objects • EAD (Encoded Archival Description) Standard formats and approaches enable interoperability and the sharing of metadata. g 21 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 22. Dublin Core • http://dublincore.org • General purpose metadata standard for use across domains. • 15 core elements. • El Element qualifiers t narrow th meaning of elements. t lifi to the i f l t – Example: A Date Created versus a Date Modified. • Encoding schemes: Controlled vocabularies or parsing rules to refine the interpretation of an element. – Example: A term from a controlled vocabulary such as the Library of Congress Subject Headings Headings. • Can be represented in HTML and in XML (RDF). 22 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 23. Dublin Core Metadata Elements • Title • Creator • Subject j • Description • Publisher • Contributor • Date • Type • Format • Identifier Id tifi • Source • Language • Relation • Coverage • Rights 23 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 24. Dublin Core Metadata Example Source: http://www.sics.se/~preben/DC/DC_guide.html 24 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 25. Metadata Support in DocBook • Metadata at different levels – title, info and bookinfo at book level – title, info and chapterinfo at chapter level – title, info and chapterinfo at section level • DocBook supports Dublin Core schema 25 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 26. Metadata Support in DITA • DITA supports a variety of standard and custom metadata: – Author information – Copyright information – Product information – Resource ID f h l systems R IDs for help t – Document tracking information – Audience information – Keywords K d – Custom metadata (otherprops) • <prolog> element defines metadata at the topic level. • <topicmeta> element defines metadata that applies to a topic when it appears in a map. • Metadata at every level 26 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 27. Sample of Metadata elements within <prolog> element <prolog> <author> (name of topic’s author) <copyright> <critdates> (document tracking information) <permissions> <publisher> <source> <metadata> <audience> (intended audience) type=“user | purchaser | administrator | … | other” othertype= j job=“installing | customizing | administering | … | other” g g g otherjob= experiencelevel=“novice | general | expert” <category> (content category used for grouping topics) <keywords> (keywords for search engines) <prodinfo> <othermeta> … … 27 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 28. Working with Metadata 28 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 29. Creating Metadata • Create it from scratch. • Reuse existing metadata and build on it. • Start with a standard. 29 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 30. Metadata Process Create Add Content Publish Metadata Review & Review & Improve Improve • Identify content that will benefit from metadata using the 80/20 rule. • Build a controlled vocabulary or use a vocabulary from a commercial source such as www.taxonomywarehouse.com. y – Example: The Getty Thesaurus of Geographic Names (TGN) • Apply metadata to content using templates or using indexing tools. • Get it reviewed. reviewed • Evaluate search logs and user surveys to improve metadata. • Continuously review metadata. 30 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 31. Metadata Template: A Manual Approach 31 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 32. Metadata Indexing and Discovery Tools • Data Harmony http://www.dataharmony.com • Interwoven MetaTagger http://www.interwoven.com • Mondeca http://www.mondeca.com • MultiTes http://www.multites.com/ • Synaptica http://www.synaptica.com • SchemaLogic http://www.schemalogic.com • WebChoir http // http://www.webchoir.com ebchoir com • WordMap http://www.wordmap.com/ 32 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 33. Future of Metadata 33 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 34. Future of Metadata • Automated metadata generation. • Social tagging – tagging by users. • Geo tagging. 34 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 35. Social Tagging Example: tagging 35 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 36. Social Tagging Example: 36 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 37. Geo Tagging Example 37 August 9, 2009 Cadence Confidential: Cadence Internal Use Only
  • 38. Q&A 38 August 9, 2009 Cadence Confidential: Cadence Internal Use Only