The document discusses the CIARD (Coherence in Information for Agricultural Research for Development) initiative and how it aims to create a global infrastructure for linked open data. It describes how FAO has worked for decades to make agricultural information more accessible, including through programs like AGRIS and AIMS. The CIARD initiative now involves over 100 partners working to coordinate their efforts and promote common data formats and systems. It outlines FAO's work on vocabularies like AGROVOC and how linked open data can help link distributed data sources in agriculture through applying standards.
1. The CIARD (Coherence in Information for Agricultural Research for development) initiative and a global infrastructure for linked open data Dr. Johannes Keizer Office ofKnowledge Exchange, Research and Extension Food andAgricultureOrganizationofthe UN Talk atWorldbank, 2011, May 17
2. We will promote research for food and agriculture, including research to adapt to, and mitigate climate change, and access to research results and technologies at national, regional and international levels. We will reinvigorate national research systems and will share information and best practices. We will improve access to knowledge. worldfoodsummit 2009
3. FAO has been engaged for decades in making agricultural development information more easily accessible and sharable among it's stakeholders. These efforts reach back to the early 70s when FAO set up the AGRIS program. Since the advent of the Internet the AIMS team at FAO HQ is working to make distributed data and information repositories interoperable. This work has been backed up on the institutional level by the CIARD (Coherence in Information for Agricultural Research for Development) initiative, in which FAO, GFAR, the CGIAR and many national partners collaborate. Technically FAO has underpinned this with the further development of the Agricultural Thesaurus AGROVOC and with initiatives on shared metadata sets (AGRIS AP) and ontologies. The paradigm and technology of linked open data, proposed by Tim Berners Lee some years ago, now provides a practical possibility to apply standard vocabularies and semantics to link distributed data that is published in a non proprietary format. The presentation will show the CIARD RING, ("routemap to information nodes and gateways"), demonstrate the AGROVOC LOD, will talk about the use of LOD in federating document repositories and will outline an Infrastructure for Information interoperability in Agricultural research and innovation
7. CIARD partners will (a) coordinate their efforts, (b) promote common formats, (c) adopt open systems
8.
9. Contribution and Participation in Science Territory size shows proportion of scientific papers published in 2001 by authors living there. Copyright SASI Group (University of Sheffield) and Mark Newman (University of Michigan)
11. RING – Numbers http://ring.ciard.net/totals Number of documents potentially reachable through the services registered in the RING. Types of service considered: document repositories and bibliographic databases.
26. Humboldt Squid page, pulled together from a diversity of Linked Data sources BBC TV Documentary BBC News item Wikipedia Animal Diversity Web:Nocturnal way of life
38. (quite easy to do, bibData map well to RDFThen Everyone who knows to write SparqlQeries could get all these publications with one shot for a new website on toxic wastes
39. Vocabularies and LOD Simply publishing your data as RDF does not link them to other data sets Creating this links by humans is interesting in detail, but unrealistic as mass processing Linking 2 standard vocabularies can link 200 datasets which use these standard vocabularies
40. RING routemapto information nodes and gateways VocBench concepts and entitiesreferencetriples Cloud storagefor RDF data triples Tools LOD enabled software LOD Generator triplifier, concept and entityidentifier Data Services Webservices + APIsto triple stores agINFRA - the elements
42. ….views into the construction site VocBench AGROVOC LOD on VocBench 1.1 LOD Generator Do you know openCalais? AgroTagger Testing Site LODE-BD The RING: http://ring.ciard.net Tools AgriDrupal AgriOceanDspace : http://193.190.8.15/agri3/
44. AGROVOC A multilingual agricultural vocabulary organized as concept scheme in 20 languages Covers agriculture, forestry, fisheries and related themes (food security, land use, environment, etc.) Organized in sub-vocabularies, e.g. chemicals, fisheries terms, scientific/common names of organisms Maintained by a global community (e.g. librarians, terminologists, information managers) using VocBench
54. AGROVOC Links after 3 weeks LOD Outlinks: GEMET-AGROVOC 1,198 RAMEAU-AGROVOC :700 Total Outlinks: 1898 Inlinks: AGROVOC-EUROVOC:1,297 AGROVOC-GEMET:1,198 AGROVOC-LCSH :1,093 AGROVOC-NAL: 13,390 AGROVOC-STW:1136 AGROVOC-RAMEAU:700 Total Inlinks:18,814
55. Europe:(It is better to use this example during the presentation)http://aims.fao.org/aos/agrovoc/c_2724From the Top concept:Ref: http://aims.fao.org/aos/agrovoc/c_7644Vocbench (Production)Ref: http://agrovoc.mimos.my/vocbenchv1.1i/VocBench(Sandbox)Ref:http://agrovoc.mimos.my/vocbenchv1.1i/
the chart on the homepage representing the distribution of services across "service types" (http://ring.ciard.net) (implemented with support from John Fereira); the geographic map on the homepage representing the geographic distribution of services;
a first attempt to provide some aggregated data on the number of contents / resources potentially reachable through the services registered in the RING: http://ring.ciard.net/totals
Whatdoesthismean in practice? I will show thiswithanexamplefrom the BBC. The biggestconsumers (and producers) of LOD are as I know the BBC and the New York times (Butnowalso the US government)
During the Web 1.0 phase, Webpageswerecomposedbyhumans. Todaymostwebpages are drivenbydatabasesthat can bedynamicallyqueried. Theycontainthrough RSS feedsalso data fromotherwebsitesThis BBC webpageis a big jumpfurther. I hasnotbeencomposedbyhumans and itisnotfromone database generated. Itisgeneratedfromdifferentdatasourcesthatwerepresentaslinked open data, linkedonlythrough common URIs
Ifresources are marked up withsemanticallydefined and machinereadableconcepts, they can belinked and mashed up preciselyaswehaveseen in the examplefrom the BBC.In thisexamplewe start withan AGRIS record on Hazardouswaste, whichisindexedwith AGROVOC. Alreadynowwe can easily link to material indexedwithEurovoc, hereanexamplefromEuroLex. If the UNBIS thesaurus wouldberestructuredto a conceptscheme and publishedas LOD, related UN documentscouldbeattachedautomaticallyby the machine.
Ifresources are marked up withsemanticallydefined and machinereadableconcepts, they can belinked and mashed up preciselyaswehaveseen in the examplefrom the BBC.In thisexamplewe start withan AGRIS record on Hazardouswaste, whichisindexedwith AGROVOC. Alreadynowwe can easily link to material indexedwithEurovoc, hereanexamplefromEuroLex. If the UNBIS thesaurus wouldberestructuredto a conceptscheme and publishedas LOD, related UN documentscouldbeattachedautomaticallyby the machine.
Ifresources are marked up withsemanticallydefined and machinereadableconcepts, they can belinked and mashed up preciselyaswehaveseen in the examplefrom the BBC.In thisexamplewe start withan AGRIS record on Hazardouswaste, whichisindexedwith AGROVOC. Alreadynowwe can easily link to material indexedwithEurovoc, hereanexamplefromEuroLex. If the UNBIS thesaurus wouldberestructuredto a conceptscheme and publishedas LOD, related UN documentscouldbeattachedautomaticallyby the machine.
Ifresources are marked up withsemanticallydefined and machinereadableconcepts, they can belinked and mashed up preciselyaswehaveseen in the examplefrom the BBC.In thisexamplewe start withan AGRIS record on Hazardouswaste, whichisindexedwith AGROVOC. Alreadynowwe can easily link to material indexedwithEurovoc, hereanexamplefromEuroLex. If the UNBIS thesaurus wouldberestructuredto a conceptscheme and publishedas LOD, related UN documentscouldbeattachedautomaticallyby the machine.
How does this work: A resource is connected with each concept URI in the web. The concepts between three vocabularies are having same literal which is connected with owl:sameAS/exactMatch relationship. As we are speakingaboutthesauri and notontologieswekept the relation tobechosenpurposelyvague. The conceptscouldbematchedwithowl:sameAS or the termscouldbematcheswith SKOS:exactMatch. A lotofdiscussion on thisisongoing
The mainintegrationworksthroughcommonsemanticsCore ofagINFRAtechnologyisaLODstoreofsharedencodedknowledgeorganizationsystemsan automaticmarkupto link structuredandunstructureddatasourcesthroughthissharedKnowledgeOrganizationsystemsSharing withinthe R.I.N.G.Partner registertheirservices, notechnicallimitationLOD – Wrapper for all participatingInstitutionsFor all registered services a „triplificationwrapper“ will besetupThe triplifierworkswith „agConceptsandagIdentities“ tocreatelinkeddataSteadilygrowing LOD ecosystemThe agINFRA LOD ecosystemoffers Webservices forthewww
Note: we identified outlinks to RAMAEU and GEMET, and they have taken them as inlinks to their own thesaurus.
- All links are checked by a domain expert.
Oneof the groundbreakingenterprises in this area isThomsonReuters “Open Calais”. Thisis a webservicethatprovidessemanticmark up foranyunstructured text thatyoufeedintotheir service The service is free ofCharge. Why? I will show youlater.
My team in collaborationwith the IndianInstituteofTechnology in Kanpur isdeveloping a similar service foroursubject area.
Wehavehere a text from 1964 without a bibliographic record at handabout a plantprotectionissue
Open Calais isverygood in thoseareas, in whichtheyhavetheirownelaboratedconceptschemeagainstwhich the texts are analyzed: “Places”, “Persons”, “Business Processes” , “IndustryTerms”, butitisweak in the specifictopicanalysis, whattheycall “social tags”
AgroTaggerstilllacksmanyof the sophisticated featuresof “Open Calais” ,butismuch, muchbetter in the subjectanalysisof the text