SlideShare ist ein Scribd-Unternehmen logo
1 von 29
Downloaden Sie, um offline zu lesen
Knowledge Engineering for
         TELDAP
                     Keh-Jiann Chen
                   Principal Investigator
  Core Platforms for Digital Contents Project, TELDAP
                     Research Fellow
Research Center for Information Technology Innovation &
   Institute of Information Science, Academia Sinica
Outline
   Introduction
   Union catalog
   Databases and metadata for digital
   contents and websites
   Knowledge engineering
   Future perspective
Introduction
 The integration and management of digital
 contents has become an important issue as
 the amount of digital contents produced from
 different projects and institutions increases
 rapidly.
 Our project goal is to achieve optimized
 preservation, retrieval, and presentation of
 digital collections.
1. Union Catalog
What is the union catalog¡H
  It is a catalog and portal for all digital collections of
  TELDAP.

  It is an integrated platform for browsing and searching
  entire digital contents of TELDAP.

  Metadata provides core descriptions and licensing
  information of each digital collection.
Browsing by topics
Search by keywords




                     Home Page of Union Catalogs
2. Databases and metadata for
 digital contents and websites
Metadata models for different types of
objects
   Archived digital items
     Union catalog metadata model- Dublin core+
   Web sites
     DCCAP (Dublin Core Collections Application Profile)
     Fields for internal used only
       Unique Identifier, Format, Evaluation, Cataloging History
   Documents
     Document metadata-Dublin core
Metadata for             Element                                  Definition
                                      Title   A name given to the resource


digital items¡G                    Creator    An entity primarily responsible for making the
                                              content of the resource

                    Subject and Keywords      The topic of the content of the resource

Over 2 million                Description     An account of the content of the resource

                                Publisher     An entity responsible for making the resource

digital items and                             available

                                              An entity responsible for making contributions to the
                              Contributor
                                              content of the resource

still increasing                     Date     A date associated with an event in the life cycle of
                                              the resource

                          Resource Type       The nature or genre of the content of the resource

                                   Format     The physical or digital manifestation of the resource


                       Resource Identifier    An unambiguous reference to the resource within a
                                              given context

                                   Source     A Reference to a resource from which the present
                                              resource is derived

                               Language       A language of the intellectual content of the
                                              resource

                                 Relation     A reference to a related resource

                                Coverage      The extent or scope of the content of the resource

                      Rights Management       Information about rights held in and over the
                                              resource
                                                                                               9
10
Metadata for websites
 Over 200 websites and still increasing
 Metadata
  DCCAP (Dublin Core Collections Application
  Profile)
  To Combine the standard with our requirements:
  19 data fields
Metadata for websites

      The Website Homepage Picture


      URL, Project Information




      Type, Name, Author, Subject,
      Description, Language,
      Item Type, Target

      Archived Information:
      URL, time, authorization


      Copyright, Purpose, Other Information


Figure: http://digitalarchives.tw
Dynamic categorization
User-oriented categorization
 General, elementary school students, high school
 students, researchers, …etc.
Topical-based categorization
 Archaeology, painting, animal, plant, document,
 …etc.
Functional-based categorization
 Research, education, business, technology,…
Categorization based on institutions
 Academia Sinica, Taiwan U., Palace museum,…
Figure: http://digitalarchives.tw



      Purpose: Education
      Target: Elementary school student,
              Junior high school student,
              Teacher…
      Select Items:
      According to 40 evaluation
      indicators, select top 5 websites

      Purpose: Creative applications
      Select Items:
      According to 40 evaluation
      indicators, select top 5 websites

      Purpose: Academic research
      Subject: Animal, Archaeology,
      Anthropology…
      Select Items:
      According to 40 evaluation
      indicators, select top 3 websites
Metadata for project documents
 Over 5000 documents and still increasing
 Metadata- Dublin core
 Construct Teldapwiki- A Wikipedia for Teldap
 http://wiki.teldap.tw/
3. Knowledge Engineering
Plans of making knowledge structures
for TELDAP
  Construct metadata models for different objects.
  Establish hyperlinks between contexts and
  objects.
    Develop keyword extraction tools.
    Design automatic tagging tools.
  Construct Teldap ontology and thesaurus
    Art & Architecture Thesaurus by Getty
    Chinese WordNet
(1) Metadata models for different objects
   Digital collections
     Union catalog metadata model- Dublin core+
   Web sites
     DCCAP (Dublin Core Collections Application Profile)
     Public fields
     Private fields
       Unique Identifier, Format, Evaluation, Cataloging History
   Documents
     Document metadata-Dublin core
(2) Establish hyperlinks between contents and
    objects
    Identify keywords in contents
    Tag keywords with related object hyperlinks
Develop hyperlink tagging tools
  Word segmentation tools
    Resolve word segmentation ambiguities and
    identify keywords.
    CKIP word segmentation system:
    http://ckipsvr.iis.sinica.edu.tw/
Develop hyperlink tagging tools
  TELDAP keyword dictionary
     Extract keywords from metadata and establish
     object-keyword relations.
        Extract text from XML data for each object
        The text are classified by topics, titles,
        descriptions, authors, locations, eras etc.
        From each class of text file extract keywords
        by automatic word segmentation and keyword
        extraction techniques.
Prototype system for hyperlink tagger
  Identify and select keywords from the input text
Prototype system for hyperlink tagger
  Produce text with hyperlinks
Prototype system for hyperlink tagger
  Hyperlinks point to the related digital collections
(3) Construct Teldap ontology and thesaurus
     Topical relation               Hypernym/hyponym
     Synonym relation                                [¹¾¡²B³       ]/[ªM
          =ÄFY©
           ª¬             = Sushi     ¡B½L¡B¸J¡BÂ|             ]
              =©µ¥-°p¤ý             Establish implicit links
                                    between objects by
                                    author, material,
                                    object type, …etc..
(3) Construct Teldap ontology and thesaurus
     Establish association links between
     Chinese keywords and Getty AAT.
     Merging Chinese WordNet with English
     WordNet
Future Perspectives
Technology development
 Construct multi-lingua thesauri – Getty AAT
 Maintain the TELDAP keyword and object relation
 database
 Construct name authority files, gazetteers, and
 universal calendars
 Design hyperlink taggers and keyword extension tools
 Designing authoring tool which provides hyperlinks of
 keyword related digital contents automatically
 Design knowledge-based content retrieval system
Future Perspectives
Content enrichment
 Within TELDAP¡G
   Standardize object metadata model and data format
   All TELDAP objects should have their metadata
   Writing scripts and stories for different topics with
   Wiki-like knowledge structure
   Enrich the digital collections
   Establish hyperlinks between text books and
   TELDAP collections
 Extend the knowledge sources¡G e.g. Wikipedia
Thank you for your attention!
        ·q½Ð«ü±

Weitere ähnliche Inhalte

Was ist angesagt?

Semantic Web Technologies For Digital Libraries
Semantic Web Technologies For Digital LibrariesSemantic Web Technologies For Digital Libraries
Semantic Web Technologies For Digital LibrariesNikesh Narayanan
 
Webinar slides: Interoperability between resources involved in TDM at the lev...
Webinar slides: Interoperability between resources involved in TDM at the lev...Webinar slides: Interoperability between resources involved in TDM at the lev...
Webinar slides: Interoperability between resources involved in TDM at the lev...openminted_eu
 
JeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital LibraryJeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital LibrarySebastian Ryszard Kruk
 
Hw09 Terapot Email Archiving With Hadoop
Hw09   Terapot  Email Archiving With HadoopHw09   Terapot  Email Archiving With Hadoop
Hw09 Terapot Email Archiving With HadoopCloudera, Inc.
 
Role of Ontologies in Semantic Digital Libraries
Role of Ontologies in Semantic Digital LibrariesRole of Ontologies in Semantic Digital Libraries
Role of Ontologies in Semantic Digital LibrariesSebastian Ryszard Kruk
 
Semantic Web and web of commerce - Disruptive technology
Semantic Web and web of commerce - Disruptive technologySemantic Web and web of commerce - Disruptive technology
Semantic Web and web of commerce - Disruptive technologySemantic Web San Diego
 
Inform: Targeting the Interest Graph
Inform: Targeting the Interest GraphInform: Targeting the Interest Graph
Inform: Targeting the Interest GraphVital.AI
 

Was ist angesagt? (9)

JeromeDL Tutorial
JeromeDL TutorialJeromeDL Tutorial
JeromeDL Tutorial
 
Semantic Web Technologies For Digital Libraries
Semantic Web Technologies For Digital LibrariesSemantic Web Technologies For Digital Libraries
Semantic Web Technologies For Digital Libraries
 
Saadallah vtls
Saadallah vtlsSaadallah vtls
Saadallah vtls
 
Webinar slides: Interoperability between resources involved in TDM at the lev...
Webinar slides: Interoperability between resources involved in TDM at the lev...Webinar slides: Interoperability between resources involved in TDM at the lev...
Webinar slides: Interoperability between resources involved in TDM at the lev...
 
JeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital LibraryJeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital Library
 
Hw09 Terapot Email Archiving With Hadoop
Hw09   Terapot  Email Archiving With HadoopHw09   Terapot  Email Archiving With Hadoop
Hw09 Terapot Email Archiving With Hadoop
 
Role of Ontologies in Semantic Digital Libraries
Role of Ontologies in Semantic Digital LibrariesRole of Ontologies in Semantic Digital Libraries
Role of Ontologies in Semantic Digital Libraries
 
Semantic Web and web of commerce - Disruptive technology
Semantic Web and web of commerce - Disruptive technologySemantic Web and web of commerce - Disruptive technology
Semantic Web and web of commerce - Disruptive technology
 
Inform: Targeting the Interest Graph
Inform: Targeting the Interest GraphInform: Targeting the Interest Graph
Inform: Targeting the Interest Graph
 

Andere mochten auch

Project Sales Corp - Hand Safety Range
Project Sales Corp - Hand Safety RangeProject Sales Corp - Hand Safety Range
Project Sales Corp - Hand Safety RangeProject Sales Corp
 
(Final) contribution and creation of new concepts in the bilingual thesaurus ...
(Final) contribution and creation of new concepts in the bilingual thesaurus ...(Final) contribution and creation of new concepts in the bilingual thesaurus ...
(Final) contribution and creation of new concepts in the bilingual thesaurus ...AAT Taiwan
 
PSC Hands Free Lifting and Hands-off Tools
PSC Hands Free Lifting and Hands-off ToolsPSC Hands Free Lifting and Hands-off Tools
PSC Hands Free Lifting and Hands-off ToolsProject Sales Corp
 
PSC Impact Glove Selection Guide 2016
PSC Impact Glove Selection Guide 2016PSC Impact Glove Selection Guide 2016
PSC Impact Glove Selection Guide 2016Project Sales Corp
 
How Art Is Relevant: An Introduction to the Online Exhibition
How Art Is Relevant: An Introduction to the Online ExhibitionHow Art Is Relevant: An Introduction to the Online Exhibition
How Art Is Relevant: An Introduction to the Online ExhibitionAAT Taiwan
 

Andere mochten auch (7)

Project Sales Corp - Hand Safety Range
Project Sales Corp - Hand Safety RangeProject Sales Corp - Hand Safety Range
Project Sales Corp - Hand Safety Range
 
(Final) contribution and creation of new concepts in the bilingual thesaurus ...
(Final) contribution and creation of new concepts in the bilingual thesaurus ...(Final) contribution and creation of new concepts in the bilingual thesaurus ...
(Final) contribution and creation of new concepts in the bilingual thesaurus ...
 
Finger saver user guide
Finger saver user guideFinger saver user guide
Finger saver user guide
 
PSC Hands Free Lifting and Hands-off Tools
PSC Hands Free Lifting and Hands-off ToolsPSC Hands Free Lifting and Hands-off Tools
PSC Hands Free Lifting and Hands-off Tools
 
PSC Impact Glove Selection Guide 2016
PSC Impact Glove Selection Guide 2016PSC Impact Glove Selection Guide 2016
PSC Impact Glove Selection Guide 2016
 
How Art Is Relevant: An Introduction to the Online Exhibition
How Art Is Relevant: An Introduction to the Online ExhibitionHow Art Is Relevant: An Introduction to the Online Exhibition
How Art Is Relevant: An Introduction to the Online Exhibition
 
Poster welcome back 1
Poster   welcome back 1Poster   welcome back 1
Poster welcome back 1
 

Ähnlich wie Knowledge Engineering for TELDAP

Union catalogandknowledge engineering for teldap
Union catalogandknowledge engineering for teldapUnion catalogandknowledge engineering for teldap
Union catalogandknowledge engineering for teldapAAT Taiwan
 
DSpace Training Presentation
DSpace Training PresentationDSpace Training Presentation
DSpace Training PresentationThomas King
 
Collision course presentation (corrrect)
Collision course presentation (corrrect)Collision course presentation (corrrect)
Collision course presentation (corrrect)William Worford
 
Annotating Digital Texts in the Brown University Library
Annotating Digital Texts in the Brown University LibraryAnnotating Digital Texts in the Brown University Library
Annotating Digital Texts in the Brown University LibraryTimothy Cole
 
Semantic Search using RDF Metadata (SemTech 2005)
Semantic Search using RDF Metadata (SemTech 2005)Semantic Search using RDF Metadata (SemTech 2005)
Semantic Search using RDF Metadata (SemTech 2005)Bradley Allen
 
Mapping cross-­domain metadata to the Europeana Data Model (EDM) - EDM introd...
Mapping cross-­domain metadata to the Europeana Data Model (EDM) - EDM introd...Mapping cross-­domain metadata to the Europeana Data Model (EDM) - EDM introd...
Mapping cross-­domain metadata to the Europeana Data Model (EDM) - EDM introd...Valentine Charles
 
The JISC Information Environment and VLEs
The JISC Information Environment and VLEsThe JISC Information Environment and VLEs
The JISC Information Environment and VLEsAndy Powell
 
Repositories thru the looking glass
Repositories thru the looking glassRepositories thru the looking glass
Repositories thru the looking glassEduserv Foundation
 
Digital library and metadata
Digital library and metadataDigital library and metadata
Digital library and metadataramncsi
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital LibrariesJack Eapen
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital LibrariesJack Eapen
 
Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Peter Mika
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Cory Lampert
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsCarole Goble
 
Intro to Digitization Projects
Intro to Digitization ProjectsIntro to Digitization Projects
Intro to Digitization Projectszsrlibrary
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research ObjectsCarole Goble
 

Ähnlich wie Knowledge Engineering for TELDAP (20)

Union catalogandknowledge engineering for teldap
Union catalogandknowledge engineering for teldapUnion catalogandknowledge engineering for teldap
Union catalogandknowledge engineering for teldap
 
DSpace Training Presentation
DSpace Training PresentationDSpace Training Presentation
DSpace Training Presentation
 
Ontology based metadata schema for digital library projects in China
Ontology based metadata schema for digital library projects in ChinaOntology based metadata schema for digital library projects in China
Ontology based metadata schema for digital library projects in China
 
Collision course presentation (corrrect)
Collision course presentation (corrrect)Collision course presentation (corrrect)
Collision course presentation (corrrect)
 
Annotating Digital Texts in the Brown University Library
Annotating Digital Texts in the Brown University LibraryAnnotating Digital Texts in the Brown University Library
Annotating Digital Texts in the Brown University Library
 
B08 A3pc 90 Diapo Damy En
B08 A3pc 90 Diapo Damy EnB08 A3pc 90 Diapo Damy En
B08 A3pc 90 Diapo Damy En
 
Semantic Search using RDF Metadata (SemTech 2005)
Semantic Search using RDF Metadata (SemTech 2005)Semantic Search using RDF Metadata (SemTech 2005)
Semantic Search using RDF Metadata (SemTech 2005)
 
Mapping cross-­domain metadata to the Europeana Data Model (EDM) - EDM introd...
Mapping cross-­domain metadata to the Europeana Data Model (EDM) - EDM introd...Mapping cross-­domain metadata to the Europeana Data Model (EDM) - EDM introd...
Mapping cross-­domain metadata to the Europeana Data Model (EDM) - EDM introd...
 
The JISC Information Environment and VLEs
The JISC Information Environment and VLEsThe JISC Information Environment and VLEs
The JISC Information Environment and VLEs
 
Repositories thru the looking glass
Repositories thru the looking glassRepositories thru the looking glass
Repositories thru the looking glass
 
Digital library and metadata
Digital library and metadataDigital library and metadata
Digital library and metadata
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
 
Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital Objects
 
Sailing on the ocean of 1s and 0s
Sailing on the ocean of 1s and 0sSailing on the ocean of 1s and 0s
Sailing on the ocean of 1s and 0s
 
Intro to Digitization Projects
Intro to Digitization ProjectsIntro to Digitization Projects
Intro to Digitization Projects
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 
Digital Libraries of the Future
Digital Libraries of the Future
Digital Libraries of the Future
Digital Libraries of the Future
 

Mehr von AAT Taiwan

German AAT 2013
German AAT 2013German AAT 2013
German AAT 2013AAT Taiwan
 
Chile AAT 2013
Chile AAT 2013Chile AAT 2013
Chile AAT 2013AAT Taiwan
 
The Dutch AAT 2013
The Dutch AAT 2013The Dutch AAT 2013
The Dutch AAT 2013AAT Taiwan
 
Challenges of Developing Terminology in Two Different Cultures
Challenges of Developing Terminology in Two Different CulturesChallenges of Developing Terminology in Two Different Cultures
Challenges of Developing Terminology in Two Different CulturesAAT Taiwan
 
2013 Sep Getty 刊物報導
2013 Sep Getty 刊物報導2013 Sep Getty 刊物報導
2013 Sep Getty 刊物報導AAT Taiwan
 
Generating Narratives through Timespace Data 台大數位典藏研究發展中心蔡炯民博士演講_20130605
Generating Narratives through Timespace Data 台大數位典藏研究發展中心蔡炯民博士演講_20130605Generating Narratives through Timespace Data 台大數位典藏研究發展中心蔡炯民博士演講_20130605
Generating Narratives through Timespace Data 台大數位典藏研究發展中心蔡炯民博士演講_20130605AAT Taiwan
 
2013 PNC: A Semantic Approach to Digital Art History- Sophy Shu-Jiun Chen
2013 PNC: A Semantic Approach to Digital Art History- Sophy Shu-Jiun Chen2013 PNC: A Semantic Approach to Digital Art History- Sophy Shu-Jiun Chen
2013 PNC: A Semantic Approach to Digital Art History- Sophy Shu-Jiun ChenAAT Taiwan
 
Making Chinese Art Accessible to Western Users- A Brief Report from AAT Taiwa...
Making Chinese Art Accessible to Western Users- A Brief Report from AAT Taiwa...Making Chinese Art Accessible to Western Users- A Brief Report from AAT Taiwa...
Making Chinese Art Accessible to Western Users- A Brief Report from AAT Taiwa...AAT Taiwan
 
2011 chinese aat update
2011 chinese aat update2011 chinese aat update
2011 chinese aat updateAAT Taiwan
 
Metadata for architectural contents in europe
Metadata for architectural contents in europeMetadata for architectural contents in europe
Metadata for architectural contents in europeAAT Taiwan
 
Te papa, collections online & thesauri
Te papa, collections online & thesauriTe papa, collections online & thesauri
Te papa, collections online & thesauriAAT Taiwan
 
An introduction to the name authority files in iran
An introduction to the name authority files in iranAn introduction to the name authority files in iran
An introduction to the name authority files in iranAAT Taiwan
 
Teldap4 getty multilingual vocab workshop2010
Teldap4 getty multilingual vocab workshop2010Teldap4 getty multilingual vocab workshop2010
Teldap4 getty multilingual vocab workshop2010AAT Taiwan
 
The spanish language version of the aat
The spanish language version of the  aatThe spanish language version of the  aat
The spanish language version of the aatAAT Taiwan
 
Illuminating Chaos Using Semantics to Harness the Web
Illuminating Chaos Using Semantics to Harness the WebIlluminating Chaos Using Semantics to Harness the Web
Illuminating Chaos Using Semantics to Harness the WebAAT Taiwan
 
Introduction and discussion about the AAT-Taiwan Management & Retrieval System
Introduction and discussion about the AAT-Taiwan Management & Retrieval SystemIntroduction and discussion about the AAT-Taiwan Management & Retrieval System
Introduction and discussion about the AAT-Taiwan Management & Retrieval SystemAAT Taiwan
 
Introduction about AAT-Taiwan Project
Introduction about AAT-Taiwan ProjectIntroduction about AAT-Taiwan Project
Introduction about AAT-Taiwan ProjectAAT Taiwan
 
(Final) cidoc 2009 chinese lang translation of the aat
(Final) cidoc 2009 chinese lang translation of the aat(Final) cidoc 2009 chinese lang translation of the aat
(Final) cidoc 2009 chinese lang translation of the aatAAT Taiwan
 

Mehr von AAT Taiwan (20)

German AAT 2013
German AAT 2013German AAT 2013
German AAT 2013
 
Chile AAT 2013
Chile AAT 2013Chile AAT 2013
Chile AAT 2013
 
The Dutch AAT 2013
The Dutch AAT 2013The Dutch AAT 2013
The Dutch AAT 2013
 
Challenges of Developing Terminology in Two Different Cultures
Challenges of Developing Terminology in Two Different CulturesChallenges of Developing Terminology in Two Different Cultures
Challenges of Developing Terminology in Two Different Cultures
 
2013 Sep Getty 刊物報導
2013 Sep Getty 刊物報導2013 Sep Getty 刊物報導
2013 Sep Getty 刊物報導
 
Generating Narratives through Timespace Data 台大數位典藏研究發展中心蔡炯民博士演講_20130605
Generating Narratives through Timespace Data 台大數位典藏研究發展中心蔡炯民博士演講_20130605Generating Narratives through Timespace Data 台大數位典藏研究發展中心蔡炯民博士演講_20130605
Generating Narratives through Timespace Data 台大數位典藏研究發展中心蔡炯民博士演講_20130605
 
2013 PNC: A Semantic Approach to Digital Art History- Sophy Shu-Jiun Chen
2013 PNC: A Semantic Approach to Digital Art History- Sophy Shu-Jiun Chen2013 PNC: A Semantic Approach to Digital Art History- Sophy Shu-Jiun Chen
2013 PNC: A Semantic Approach to Digital Art History- Sophy Shu-Jiun Chen
 
Making Chinese Art Accessible to Western Users- A Brief Report from AAT Taiwa...
Making Chinese Art Accessible to Western Users- A Brief Report from AAT Taiwa...Making Chinese Art Accessible to Western Users- A Brief Report from AAT Taiwa...
Making Chinese Art Accessible to Western Users- A Brief Report from AAT Taiwa...
 
2011 chinese aat update
2011 chinese aat update2011 chinese aat update
2011 chinese aat update
 
Metadata for architectural contents in europe
Metadata for architectural contents in europeMetadata for architectural contents in europe
Metadata for architectural contents in europe
 
Te papa, collections online & thesauri
Te papa, collections online & thesauriTe papa, collections online & thesauri
Te papa, collections online & thesauri
 
An introduction to the name authority files in iran
An introduction to the name authority files in iranAn introduction to the name authority files in iran
An introduction to the name authority files in iran
 
Teldap4 getty multilingual vocab workshop2010
Teldap4 getty multilingual vocab workshop2010Teldap4 getty multilingual vocab workshop2010
Teldap4 getty multilingual vocab workshop2010
 
The spanish language version of the aat
The spanish language version of the  aatThe spanish language version of the  aat
The spanish language version of the aat
 
The dutch aat
The dutch aatThe dutch aat
The dutch aat
 
Aat in german
Aat in germanAat in german
Aat in german
 
Illuminating Chaos Using Semantics to Harness the Web
Illuminating Chaos Using Semantics to Harness the WebIlluminating Chaos Using Semantics to Harness the Web
Illuminating Chaos Using Semantics to Harness the Web
 
Introduction and discussion about the AAT-Taiwan Management & Retrieval System
Introduction and discussion about the AAT-Taiwan Management & Retrieval SystemIntroduction and discussion about the AAT-Taiwan Management & Retrieval System
Introduction and discussion about the AAT-Taiwan Management & Retrieval System
 
Introduction about AAT-Taiwan Project
Introduction about AAT-Taiwan ProjectIntroduction about AAT-Taiwan Project
Introduction about AAT-Taiwan Project
 
(Final) cidoc 2009 chinese lang translation of the aat
(Final) cidoc 2009 chinese lang translation of the aat(Final) cidoc 2009 chinese lang translation of the aat
(Final) cidoc 2009 chinese lang translation of the aat
 

Kürzlich hochgeladen

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterMateoGardella
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 

Kürzlich hochgeladen (20)

Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 

Knowledge Engineering for TELDAP

  • 1. Knowledge Engineering for TELDAP Keh-Jiann Chen Principal Investigator Core Platforms for Digital Contents Project, TELDAP Research Fellow Research Center for Information Technology Innovation & Institute of Information Science, Academia Sinica
  • 2. Outline Introduction Union catalog Databases and metadata for digital contents and websites Knowledge engineering Future perspective
  • 3. Introduction The integration and management of digital contents has become an important issue as the amount of digital contents produced from different projects and institutions increases rapidly. Our project goal is to achieve optimized preservation, retrieval, and presentation of digital collections.
  • 5. What is the union catalog¡H It is a catalog and portal for all digital collections of TELDAP. It is an integrated platform for browsing and searching entire digital contents of TELDAP. Metadata provides core descriptions and licensing information of each digital collection.
  • 6. Browsing by topics Search by keywords Home Page of Union Catalogs
  • 7. 2. Databases and metadata for digital contents and websites
  • 8. Metadata models for different types of objects Archived digital items Union catalog metadata model- Dublin core+ Web sites DCCAP (Dublin Core Collections Application Profile) Fields for internal used only Unique Identifier, Format, Evaluation, Cataloging History Documents Document metadata-Dublin core
  • 9. Metadata for Element Definition Title A name given to the resource digital items¡G Creator An entity primarily responsible for making the content of the resource Subject and Keywords The topic of the content of the resource Over 2 million Description An account of the content of the resource Publisher An entity responsible for making the resource digital items and available An entity responsible for making contributions to the Contributor content of the resource still increasing Date A date associated with an event in the life cycle of the resource Resource Type The nature or genre of the content of the resource Format The physical or digital manifestation of the resource Resource Identifier An unambiguous reference to the resource within a given context Source A Reference to a resource from which the present resource is derived Language A language of the intellectual content of the resource Relation A reference to a related resource Coverage The extent or scope of the content of the resource Rights Management Information about rights held in and over the resource 9
  • 10. 10
  • 11. Metadata for websites Over 200 websites and still increasing Metadata DCCAP (Dublin Core Collections Application Profile) To Combine the standard with our requirements: 19 data fields
  • 12. Metadata for websites The Website Homepage Picture URL, Project Information Type, Name, Author, Subject, Description, Language, Item Type, Target Archived Information: URL, time, authorization Copyright, Purpose, Other Information Figure: http://digitalarchives.tw
  • 13. Dynamic categorization User-oriented categorization General, elementary school students, high school students, researchers, …etc. Topical-based categorization Archaeology, painting, animal, plant, document, …etc. Functional-based categorization Research, education, business, technology,… Categorization based on institutions Academia Sinica, Taiwan U., Palace museum,…
  • 14. Figure: http://digitalarchives.tw Purpose: Education Target: Elementary school student, Junior high school student, Teacher… Select Items: According to 40 evaluation indicators, select top 5 websites Purpose: Creative applications Select Items: According to 40 evaluation indicators, select top 5 websites Purpose: Academic research Subject: Animal, Archaeology, Anthropology… Select Items: According to 40 evaluation indicators, select top 3 websites
  • 15. Metadata for project documents Over 5000 documents and still increasing Metadata- Dublin core Construct Teldapwiki- A Wikipedia for Teldap http://wiki.teldap.tw/
  • 17. Plans of making knowledge structures for TELDAP Construct metadata models for different objects. Establish hyperlinks between contexts and objects. Develop keyword extraction tools. Design automatic tagging tools. Construct Teldap ontology and thesaurus Art & Architecture Thesaurus by Getty Chinese WordNet
  • 18. (1) Metadata models for different objects Digital collections Union catalog metadata model- Dublin core+ Web sites DCCAP (Dublin Core Collections Application Profile) Public fields Private fields Unique Identifier, Format, Evaluation, Cataloging History Documents Document metadata-Dublin core
  • 19. (2) Establish hyperlinks between contents and objects Identify keywords in contents Tag keywords with related object hyperlinks
  • 20. Develop hyperlink tagging tools Word segmentation tools Resolve word segmentation ambiguities and identify keywords. CKIP word segmentation system: http://ckipsvr.iis.sinica.edu.tw/
  • 21. Develop hyperlink tagging tools TELDAP keyword dictionary Extract keywords from metadata and establish object-keyword relations. Extract text from XML data for each object The text are classified by topics, titles, descriptions, authors, locations, eras etc. From each class of text file extract keywords by automatic word segmentation and keyword extraction techniques.
  • 22. Prototype system for hyperlink tagger Identify and select keywords from the input text
  • 23. Prototype system for hyperlink tagger Produce text with hyperlinks
  • 24. Prototype system for hyperlink tagger Hyperlinks point to the related digital collections
  • 25. (3) Construct Teldap ontology and thesaurus Topical relation Hypernym/hyponym Synonym relation [¹¾¡²B³ ]/[ªM =ÄFY© ª¬ = Sushi ¡B½L¡B¸J¡BÂ| ] =©µ¥-°p¤ý Establish implicit links between objects by author, material, object type, …etc..
  • 26. (3) Construct Teldap ontology and thesaurus Establish association links between Chinese keywords and Getty AAT. Merging Chinese WordNet with English WordNet
  • 27. Future Perspectives Technology development Construct multi-lingua thesauri – Getty AAT Maintain the TELDAP keyword and object relation database Construct name authority files, gazetteers, and universal calendars Design hyperlink taggers and keyword extension tools Designing authoring tool which provides hyperlinks of keyword related digital contents automatically Design knowledge-based content retrieval system
  • 28. Future Perspectives Content enrichment Within TELDAP¡G Standardize object metadata model and data format All TELDAP objects should have their metadata Writing scripts and stories for different topics with Wiki-like knowledge structure Enrich the digital collections Establish hyperlinks between text books and TELDAP collections Extend the knowledge sources¡G e.g. Wikipedia
  • 29. Thank you for your attention! ·q½Ð«ü±