현재의 정보이용추세에 맞춰 각종 도서관 및 정보센터가 진화해서 궁극적으로 Social Semantic Digital Library가 될 필요성이 있다. 그러나 아직 대부분의 도서관이 그냥 자료를 전산화시킨 수준에 머물러 있다. 이미 디지털도서관이라는 것들도 Web에 Publish를 못할 뿐만 아니라, 혹시 연결이 되었다 하더라도 자료들 간의 관계 부족으로 Semantic level이 너무 떨어지고 있다. 우리 인간이 이해하는 대로 기계가 이해하게 하면서 Semantic level을 올릴 수는 없을까? 바로 이 점에서 Semantic web이 출발한다. 이 문제를 극복하고자 DCMI를 위시한 ‘다양한’ 메타데이터가 등장했으며 지금은 Linked Data까지 등장하고 있다. 메타데이터가 다양하게 존재하고 있기 때문에 다양하게 섞어서 사용할 수 있다는 장점이 있다. 다양한 상호운용성의 방법 (하나의 메타데이터로 통합 다양한 메타데이터 인정, Metadata Registry, Open Source의 활용 및 Linked data)이 있다. 그중에서 Linked Data를 중심으로 도서관이 어떻게 구체적으로 대응해 나가고 있는지를 미의회도서관 및 스웨덴의 LIBRIS를 중심으로 살펴보고, 이를 바탕으로 아직도 카탈로그의 캡슐에 묻혀 있는 우리나라 도서관의 목록의 실정을 언급하고 Linked data를 중심으로 적극적인 상호운용성의 방법을 제안하고 하고자 한다. 웹에 있는 다양한 Resource사이에 관계를 부여하면서 뜻을 더 명확하게 표현하자는 시도가 바로 Linked Data이다. 이 글에서는 맹자가 말하는 明志, 즉 ‘뜻을 분명히 밝히라’는 가르침을 Tim Berners-Lee가 어떻게 답하는 가를 밝힌다. 이에 그가 제안하는 dereferenceable uri, rdf 및 http를 통한 출판을 간단히 살피고, skos를 위시한 rdf vocabularies를 언급한다. 궁극적으로는 rdf 링크를 통한, 심지어는 다른 데이터 셋을 활용하여 끝없는 탐색을 하게 될 모델을 제공하고자 한다.
3. Intro: 정보검색의 영원한 숙제 False Positive / authority control ( 전거 통제 ) / … Disambiguation 및 Collocation + interoperability 식별자 특히 http uri
4. 1.1 Brief Overview: from MARC to Linked Data Card 목록 MARC: electronic version of the old card file . Internet 1 세대 : 데이터의 의미 가 아닌 구문 의 형태를 인식 Bush (1945) Tim Berners-Lee (1989) Global hypertext system 시맨틱웹 : 뜻을 살리자 the meaning ( semantics ) of information and services on the web is defined 차세대 웹
12. Meaningful Decompose 논리적으로 잘게 나눌 수는 없을까 ? 새롭게 배열 (collocate) 해주자 “ 의미 있게 묶어준다”고 하여 “ Collocation” 이라고 한다 이 큰 덩치를 어떻게 할까 ? 이젠 , 쓸모있는 부분에 작은 표식 (tag, metadata) 을 하고 필요한 만큼 연결시켜보자 -Link and Tag!!!
27. FRBR 의 1:1 principle: Starting of linked data in library community
28. Works, expressions, manifestations and relations between the two works by Morten Traavik and Joseph Conrad. Boxes are entities, relations are arrows.
29. The work "Heart of darkness" and some of its expressions and manifestations, and the persons responsible.
30. FRBR 의 활용 : unified database of information Music Ontology 문화 예술영역 / 저작권 등
31.
32.
33.
34.
35. 도서관 진화의 필요성 Social Semantic Digital Library Involves the community into sharing knowledge Semantic Digital Library Accessible by machines, not only with machines Digital Library Online, easy searching with a full-text index coherent, organized collection of resources / links to other libraries or information /united view Library Organized collection
45. Dbpedia 검색 ( 헤밍웨이 저작 ) "Ernest hemingway" 의 이름을 갖는 ?who 를 object 로 갖는 ?books(subject) 를 찾아라 " 헤밍웨이가 쓴 작품을 나열하라 "
46. Dbpedia 검색 ( 헤밍웨이 저작 ) 동명이인이 있을경우 , 모호성이 생길 수 있다 . 그래서 아래와 같이 명시적으로 헤밍웨이를 지정할 수 있다 . SELECT ?books WHERE { ?books dbpedia2:author < http://dbpedia.org/resource/Ernest_Hemingway >. } 즉 , 헤밍웨이의 uri 를 지정하는 것이다 . 바로 이것이 전거이다 .
74. Thank you " 책은 무생물이지만 , 책을 읽는 순간 생물이 되고 친구가 된다 . 책을 생물이 되게 해주고 보관하는 곳이 바로 도서관이다 "
Hinweis der Redaktion
http://docs.google.com/present/view?id=0ARP4fDAaJOdXZGNxajNqcGRfNDgyZjg3Y3FmNQ&hl=en The Future of Digital Librarianship in Academic Libraries An overview of nine areas in which the digital librarian of the future will be engaged Preservation Enable digital curators to mitigate risks to their data First and foremost,preserve the bits The line between access systems and archiving systems will continue to blur Scholarly Communication Advocate Open Access(OA) Who benefits from OA? university faculty / university libraries / university admin. / university students / local communities / scholars all over Educate scholars about copyright / Advocate for negotiation of copyright retention iii. Repositories The evolution of repositories collection-building tools (Greenstone, DLXS) display applications with harvestable metadata (DSpace, CONTENTdm) OAIS-inspired monolithic frameworks(Fedora, aDORe) decoupled micro-services + web architecture (California Digital Library, SWORD, OAI-ORE) Build trust in repositories How? By ensuring: content will not change except, e.g., for planned format migration derivatives and migrations of content are faithful to the original masters content is authentic and has documented provenance content is accessible via known identifiers Technologies for trust digital signatures persistent identifiers audit trails / provenance metadata content versioning authorization / authentication rights / access control Continue growing the repository Broad growth electronic theses & dissertations research data learning objects pre-prints electronic records public domain documents Deep growth Incorporate the repository into institutional programs and workflows Resolve the tension between preservation and growth If these sound like Herculean tasks, it's only because they are Fortunately, the future digital librarian will be endowed with Herculean strength Herculean strength in numbers The digital library is increasingly acknowledged as an institutional rather than individual endeavor Digital librarians will be successful to the extent that their colleagues and administration are involved iv. Outreach Situate the digital library within the institution by working with deans, provosts, and scholars The values of scholars and administration are different from those of the library Outreach will help frame the message Conversations institution-wide data census an institutional bibliography collaborative design of scholarly communication tools v. Digitization Guide the selection of content to be digitized Align digitization resources with scholarly communication needs vi. Metadata Serve as a conduit between library metadata and the &quot;outside world's&quot; best practices and 2.o-style uncontrolled metadata Outward Contribute authority metadata and schemas to efforts to build a more descriptive web Inward Add value to services by augmenting them with crowdsourced metadata The centrality of identifiers (the topic that launched a million flame-wars) vii. Discovery Make information easier to discover by surfacing relationships between objects in addition to traditional collection browse interfaces viii. Technology & Interoperability Lead the library‘ stechnological vanguard Much of the recentconvergence between library technology and the web has occurred in the digital library context Libraries andcyberinfrastructure Integrate library systems and processes into campus cyberinfrastructure How? Standards and best practices help, as does alignment withtechnologies from outside libraries ix. Sustainability Sustain the digital library through a framework of partnerships, institutional commitments, and technology strategies
Authority control fulfills two important functions. First, it enables catalogers to disambiguate items with similar or identical headings. For example, two authors who happen to have published under the same name can be distinguished from each other by adding middle initials, birth and/or death (or flourished, if these are unknown) dates, or a descriptive epithet to the heading of one (or both) authors. Second, authority control is used by catalogers to collocate materials that logically belong together, although they present themselves differently. For example, authority control enables a catalog to index all manifestations of a given book (including translations) together.
RECOMMENDATIONS TOP Warmer digital libraries cannot be achieved by software tools alone. The digital libraries themselves can take simple steps to make data and metadata more amenable to human and automated use, making their content more useful and useable. Only with proper and better access to linked data and metadata can the tools that computational biologists require be built. We make the following recommendations to achieve this goal. Simple URIs. URIs for human use should be as simple as possible, to allow easy linking to individual publications and their authors. Short URIs are much more likely to be used and cited [192] than longer, more complicated URIs. Persistent URIs. It has been noted many times before [193] , [194] , but it is worth repeatedly restating: persistent URIs make digital libraries a much more useful and usable place. Although URIs will inevitably decay [195] , [196] , many (but not all) will be preserved by the Internet Archive [197] , [198] , and every effort should be made to keep them persistent where possible. Exposing metadata. Publication metadata, in whatever style (EndNote, BibTeX, XML, RDF, etc.), should be transparently exposed and readily available, programmatically and manually, from URIs, HTML [199] , and PDF files of publications. Identifying publications. URNs (such as Digital Object Identifiers) should be used to identify publications wherever possible. Most large publishers already do this, although there are still many confounding exceptions. Identifying people. This problem is twofold: people need to be identified as users of a system and as authors of publications. To tackle the first issue, tools and libraries should use Single Sign On (SSO) schemes, such as OpenID [187] to provide access to personalized features where possible, as this prevents the endless and frustrating proliferation of username/passwords to identify users in Web applications. The second requires unique author identification, an ongoing and as yet unsolved issue for digital libraries. By following these recommendations, publishers, scientists, and libraries of all kinds can add significant value to the information they manage for the digital library. The Semantic Web is an evolving development of the World Wide Web in which the meaning ( semantics ) of information and services on the web is defined, making it possible for the web to &quot;understand&quot; and satisfy the requests of people and machines to use the web content .
[22] Linked Data may not yet be on the radar of the manufacturers of household goods, but it is definitely knocking on the door of libraries and library applications.
Hyperlink => typed link A revolution is happening in the structure of the web itself. It’s gentle and it’s powerful, Tim Berners-Lee invented the Web about 20 years ago, and 5 years later he started working about what the next generation of the web was going to be. It’s about moving from a Web of documents to a Web of data, where the data carries with it the meaning of each item of data. It doesn’t rely on understanding the documents. The data carries its own meaning and expresses that meaning.
1 개의 테이블에 해당 자료를 다 넣어도 별 무리가 없는데 , 좀 더 ' 명확 ' 하게 구분 짓기 위해서 테이블을 나누는 경우이다 . 심각하지도 어렵지도 않는 단순 명쾌한 관계로 연결 지을 수 있다 . 한 레코드 안에 모든 것을 다 가지고 있는 큰 덩치를 좀 더 훌륭한 기능을 수행할 수 있게 잘게 하나 하나의 개체로 나누어서 서로 관계를 맺어주자 중앙집권정부가 대내 대외업무를 다 맡지 말고 , 잘게 나누어진 지방정부들이 전권을 가지고 대외협력 등의 관계를 맺게 해주자 H umans will always stay one step ahead of the machines.
그러나 현재의 목록기술규칙은 물리적 형태를 기준으로 하고 있어 같은 콘텐츠이지만 형태가 다른 자료라는 것을 기술하기가 어렵다는 데 있다 .
기계는 아무 것도 몰라 !!! 단지 구문으로 형태가 같은 것을 찾아낼 뿐 !!!
Contents and Carriers 지금까지는 Carrier 위주로 목록을 해왔다 . 이용자들이 MARC 로 만들어진 목록을 본다고 해서 어떤 장점이 있던가 ? 오히려 그들은 복잡해진다 .
The term &quot;linked data&quot; describes a method of exposing and connecting data on the Web from different sources using URIs ( define ) (Uniform Resource Identifiers). URI is the generic term for names and addresses on the Web, including Web page URLs. documents on the web often have data within their texts—data that can be treated as information in its own right. Berners-Lee and colleagues recognized this when they envisioned that we could enhance the web of documents with a web of data that exists sometimes within and sometimes alongside the documents themselves. documents on the web often have data within their texts—data that can be treated as information in its own right. Berners-Lee and colleagues recognized this when they envisioned that we could enhance the web of documents with a web of data that exists sometimes within and sometimes alongside the documents themselves. The basic requirements that data has to fulfill in order to be part of the Semantic Web are: All entities of interest, such as information resources, real-world objects, and vocabulary terms should be identified by URI references. URI references should be dereference-able , meaning that an application can look up a URI over the HTTP protocol and retrieve RDF data about the identified resource. Data should be provided using the RDF/XML syntax. Data should be interlinked with other data. Thus resource descriptions should contain links to related information in the form of dereference-able URIs within RDF statements or as rdfs:seeAlso links.
All of the possible uses of library bibliographic data by others are unimaginable today, but we have already seen the popularity of nonlibrary use of bibliographic data in systems like Wikipedia and LibraryThing. The data provided in OCLC's WorldCat Identities research project gives us a hint of the richness of library data once it is freed from the capsules of the catalog and of the bibliographic record. Opening library data to the web in a linkable form should allow that richness to be explored on a global scale.
Full text 검색을 해주면서 쉽게 검색을 할 수 있게 해주는 현재의 Online Digital Library 개념에서 탈피해서 “통합메타데이터를 만들 수 없을까 ?” 라는 물음에서 Social Digital Library 또는 Social Semantic Digital Library 의 필요성이 대두될 것이다 . 서로 다른 메타데이터를 사용하고 있는 , 이를 테면 , 자원 (resources), 이용자 프로파일 (user profile), 북마크 (bookmarks) 또는 taxonomies 를 통합시켜서 “ 기계와의 (with machines)” 뿐만 아니라 “기계에 의해서 (by machines) 상황 속에서의 의미화 (make sense) 를 가능하게 하여 뜻있게 정보를 연결하게 해주자 . What is a Social Semantic Digital Library? Social Semantic digital libraries integrate information from various sources provide interoperability with other systems (not only libraries) deliver more robust, user friendly and adaptable search and browsing interfaces empowered by semantics and social networking
This allows the user to start with one data source and then move through a potentially endless Web of data sources connected by RDF links.
*** This page is a placeholder. Its main purpose is to serve links to the RDF representation and not yet to be visually pleasing or useful to a human. *** Triple store
The Metadata Registry provides services to developers and consumers of controlled vocabularies and is one of the first production deployments of the RDF-based Semantic Web Community's Simple Knowledge Organization System (SKOS)
The Web of Data can be accessed using Linked Data browsers, just as the traditional Web of documents is accessed using HTML browsers. However, instead of following links between HTML pages, Linked Data browsers enable users to navigate between different data sources by following RDF links. This allows the user to start with one data source and then move through a potentially endless Web of data sources connected by RDF links. For instance, while looking at data about a person from one source, a user might be interested in information about the person's home town. By following an RDF link, the user can navigate to information about that town contained in another dataset.