Open For Development
                             EADI IMWG Conference 2012



                            Open @ FAO


Stephen.Katz@fao.org (Twitter: @SteveK1958)
Chief, Knowledge Management and Library Services
Food and Agriculture Organization of the United Nations
Agenda
    Open @ FAO


1     Context and History of Open @ FAO

2     Ongoing Practical Initiatives
      • FAO Open Archive
      • Open Data (data.fao.org)
      • Data Governance and Standards

3     Issues, Challenges and Lessons Learned

4     Group Discussion
Open @ FAO : Food For Thought?


Food for Thought
Food and Agriculture Organization of the United Nations
                         (FAO)


                         •   FAO is a specialized
                             agency of the United
                             Nations with its own
                             independent governance

                         •   190+ Member Countries

                         •   Headquarters in Rome, offices in
                             over 80 countries, and
                             over 5,000 staff
Food and Agriculture Organization of the United Nations
                         (FAO)


                   •   Collects, analyses, interprets and
                       disseminates information on
                       nutrition, food and agriculture

                   •   Policy Advice

                   •   Furnishes Technical Assistance

                   •   A Neutral Forum for International
                       Cooperation
FAO has been in the
“knowledge” business
since 1946!

Our mandate....
Ensure that the world’s
knowledge of food and
agriculture is available to
those who need it when
they need it and in a form
which they can access
and use.
Open @ FAO : A Bit of History




   1995 – Central Publishing Unit Abolished
   1996 – SGML Repository Proposal; FAOSTAT on-line
   1997 – Document Repository (XML Compatible)
   2003 – Document Repository (PDF)
   2007 – Open Archive Proposal (Fedora Commons)
   2010 – Open Data Repository Proposal (data.fao.org)
   2012 – OpenArchive.Fao.Org; Data.Fao.Org
FAO Open Archive
    Goals/Objectives

• To make FAO’s Global Public Goods openly accessible from a single access point
• To be able to exchange data in an open and standardized way
• To have a smooth, efficient workflow to manage FAO’s institutional memory
• To integrate e-publishing and library workflows
FAO Open Archive
    Architecture

• Based on open source tools (Fedora Commons and Java)
• Based on modern standards for data management (MODS and FRBR)
• Allowing for easier management and sharing of multilingual content

And this is what it looks like:
Open Archive Resources
Available at Start-up Time
     Resource Type          Number of Records
     Full Text Documents    40,100
     Photos and Videos      17,100
     Audio Files            1,200
Open Data (data.fao.org)
Goals/Objectives

• To address fragmentation and duplication of information systems and data presently distributed across many organizational units
• http://data.fao.org: a one-stop shop that aggregates, integrates, and catalogues data from multiple sources across FAO. Topics are related to nutrition, food and agriculture and include statistics, maps, pictures, documents and more.
Open Data (data.fao.org)
Guiding Principles

• Uniting FAO data with one brand: http://data.fao.org
• Engaging a community: #FAOdata
• Mobile first
• Serve the data in the most convenient format
• Integrate, don't reimplement
data.fao.org - The Big Picture

Website   |   Services and Widgets   |   Specialised application(s) (consume/provide)

Orchestration and Integration

Search:          Full text, Structured
Catalogue:       Identity, Metadata, Linked Data, ...
Statistics:      Statistical Data Warehouse, Time Series, Indicators, Observations
Maps:            Geospatial, Raster, Vector, Point
Content:         Documents, Pictures, Video, Multimedia, Pages
Infrastructure:  Logging, Caching, Security, Audit, ...
Data Flow Architecture

Data Sources (several) → Ingest → Harmonise → Integrate → Enrich → Publish
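The five stages in this flow can be illustrated with a minimal Python sketch. It is not the data.fao.org implementation: the record shape, the unit table, the function names and the "mean" enrichment are all assumptions chosen only to make each stage concrete.

```python
# Hypothetical record shape: {"country", "item", "value", "unit"}.
# Divisors that turn a value in the given unit into tonnes (assumed target unit).
PER_TONNE = {"t": 1, "kg": 1000}


def ingest(sources):
    """Flatten the records of every source and tag each with its origin."""
    return [{**rec, "source": name} for name, recs in sources.items() for rec in recs]


def harmonise(records):
    """Express every value in tonnes so that sources become comparable."""
    return [{**r, "value": r["value"] / PER_TONNE[r["unit"]], "unit": "t"} for r in records]


def integrate(records):
    """Group harmonised records that describe the same (country, item)."""
    merged = {}
    for r in records:
        merged.setdefault((r["country"], r["item"]), []).append(r)
    return merged


def enrich(merged):
    """Add derived metadata: contributing sources and the mean value."""
    enriched = {}
    for key, recs in merged.items():
        values = [r["value"] for r in recs]
        enriched[key] = {
            "sources": sorted(r["source"] for r in recs),
            "mean_tonnes": sum(values) / len(values),
        }
    return enriched


def publish(enriched):
    """Emit one flat row per key, sorted, ready for serialisation."""
    return [{"country": c, "item": i, **info} for (c, i), info in sorted(enriched.items())]


def run_pipeline(sources):
    """Chain the stages in the order shown on the slide."""
    return publish(enrich(integrate(harmonise(ingest(sources)))))
```

Calling `run_pipeline` with a dictionary that maps source names to lists of records returns one published row per (country, item) pair. Keeping each stage a separate function mirrors the slide: a source can be added or a harmonisation rule changed without touching the other stages.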
Landing Page
Some Numbers
356,000,000 Statistical values
  2 Terabytes

1,500,000 Statistical Maps

734,000 Geo Layers
 30 Terabytes

435 Documents

90 Pictures

25 Information Systems
Simple and few technologies :-)

• Opscode – Chef, Red Hat – RHQ, Nagios Enterprises – Nagios, Jenkins-CI (open source) – Jenkins, Apache – Maven, Apache – SVN, Atlassian – JIRA, Apache – JMeter, SourceForge (open source) – JUnit, Oracle – Java, Liferay – Liferay Portal, PostgreSQL – PostgreSQL PostGIS, Pgbouncer – pgbouncer, Pentaho – Pentaho BI, Analytical Labs – Saiku, Talend – Talend Open Studio, RedHat/JBoss – Application Server, VMware – Server Virtualization
• Exceliance – HAProxy, Red Hat – Enterprise Linux Server, GeoNetwork OpenSource – GeoNetwork, OpenGeo, GeoSolutions, Fedora Commons, Refractions Research – GeoServer, Ontotext – OWLIM, RedHat/JBoss – JBoss AS / JEE, Alfresco – Activiti BPM Platform, Jasig – CAS Client, Batchwork Software – Doc2Doc, Google Code (open source) – ZXing ("Zebra Crossing"), Highsoft Solutions AS – Highcharts, Twitter – Bootstrap, Liferay – Alloy-UI, jQuery – jQuery UI, Geert de Deckere – Graph Up, and then there are all the SaaS products ...

    Further questions on data.fao.org to the Project Manager: Karl.Morteo@fao.org
http://www.ciard.net
CIARD – a global movement
To make agricultural research information and knowledge truly accessible to all:

•   All organizations that create and possess public agricultural research information disseminate and share it more widely
•   CIARD partners create coherence by a) coordinating their efforts, b) promoting common formats, c) adopting open systems and standards
•   Create a global network of public collections of data and information
http://aims.fao.org
Distributed Data Sets
 •   stats
 •   gene banks
 •   GIS data
 •   blogs
 •   journals
 •   open archives
 •   raw data
 •   technologies
 •   learning objects
 •   ...

How to make value added services?
How to infer new knowledge?
How to organize collaboration?
Maybe we really need this?...
…to
•   stats
•   gene banks
•   GIS data
•   blogs
•   journals
•   open archives
•   raw data
•   technologies
•   learning objects
•   ...
Creating Linked Applications
OpenAgris

• Aggregates different data sources to expand knowledge about a topic
• Is a “linked-data” environment that mashes up interlinked datasets to create an integrated knowledge base
• Uses the Agrovoc thesaurus as a backbone to interlink with other existing datasets (DBPedia, WorldBank, Geopolitical Ontology…)
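The "thesaurus as backbone" idea can be sketched in a few lines of Python: records from otherwise unrelated datasets are tagged with shared concept identifiers, and the identifier becomes the join key. This is only an illustration, not how OpenAgris is built; the dataset names, record fields and the `concept:` identifiers below are hypothetical placeholders, not real Agrovoc URIs.

```python
from collections import defaultdict


def interlink(datasets):
    """Index record titles by shared concept identifier, then by dataset.

    `datasets` maps a dataset name to a list of records; each record is a
    dict with a "title" and a list of "concepts" (hypothetical fields).
    """
    index = defaultdict(lambda: defaultdict(list))
    for dataset_name, records in datasets.items():
        for record in records:
            for concept in record["concepts"]:
                index[concept][dataset_name].append(record["title"])
    # Convert nested defaultdicts to plain dicts for a predictable result.
    return {concept: dict(by_dataset) for concept, by_dataset in index.items()}


# Usage: two unrelated collections meet through the shared concept.
datasets = {
    "bibliography": [{"title": "Maize yields in East Africa", "concepts": ["concept:maize"]}],
    "statistics": [{"title": "Maize production series", "concepts": ["concept:maize"]},
                   {"title": "Rice production series", "concepts": ["concept:rice"]}],
}
linked = interlink(datasets)
```

Looking up `linked["concept:maize"]` then returns the matching titles from both collections, which is the "expand knowledge about a topic" effect described above.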
Open Archive : Issues, Challenges, Lessons

• Unclear policy framework
• Unclear collection selection policy
• Variable quality standards (content, legal, editorial, accountability)
• Licensing policy/conditions for re-use
• Working with partners and scientific journals
• Freely available but needs attribution
• Supply vs demand (personal interest vs impact)
• Tension with Sales and Marketing needs

May Lead To Negative Consequences such as:
• Low credibility/trust, reputational risk, legal exposure?
Open Data : Issues, Challenges, Lessons

Well, the same stuff as before, really :-)
• Unclear policy framework
• Unclear collection selection policy
• Variable quality standards
• Licensing policy/conditions for re-use
• Working with partners
• Freely available but needs attribution
• Supply vs demand (personal interest vs impact)
• Tension with Sales and Marketing needs

May Lead To Negative Consequences such as:
• Low credibility/trust, reputational risk, legal exposure?
Open Data : Issues, Challenges, Lessons

But also:
• Every data type has its own standards (e.g. OGC for GIS, SDMX for stats, MODS for documents, IPTC for photos)
• Aggregate data quality is set by the lowest common denominator
• Poor data governance leads to:
      - Conflicting/contradictory data values from different sources
      - Lack of agreement on definitions and concepts
      - Insufficient metadata
      - Comparing apples, pears and oranges (different units, different assumptions, different contexts)

May Lead To Negative Consequences such as:
• Low credibility/trust, reputational risk, legal exposure?
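One way to picture the governance problems listed above is a small consistency check in Python. It is a sketch under stated assumptions, not an FAO tool: the observation fields, the unit table and the 5% tolerance are all hypothetical. Observations with missing or unknown units are rejected (insufficient metadata), values are converted to a common unit, and keys whose sources disagree beyond the tolerance are reported.

```python
# Divisors that turn a value in the given unit into tonnes (assumed common unit).
PER_TONNE = {"t": 1, "kg": 1000}


def find_conflicts(observations, tolerance=0.05):
    """Return {(country, item, year): {source: tonnes}} where sources disagree.

    A key is reported when the relative spread between the largest and the
    smallest harmonised value exceeds `tolerance` (hypothetical threshold).
    """
    grouped = {}
    for obs in observations:
        unit = obs.get("unit")
        if unit not in PER_TONNE:
            # Insufficient metadata: refuse to compare apples and oranges.
            raise ValueError(f"missing or unknown unit in {obs!r}")
        key = (obs["country"], obs["item"], obs["year"])
        grouped.setdefault(key, {})[obs["source"]] = obs["value"] / PER_TONNE[unit]

    conflicts = {}
    for key, by_source in grouped.items():
        low, high = min(by_source.values()), max(by_source.values())
        if high > 0 and (high - low) / high > tolerance:
            conflicts[key] = by_source
    return conflicts
```

The check only surfaces disagreement; deciding which source is authoritative remains a governance question, which is the point the slide makes.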
Thank you!

Time for Discussion

and soon for Lunch!

Open@Fao presentation at the EADI Open For Development Project, 2012

  • 1. Open For Development EADI IMWG Conference 2012 Open @ FAO Stephen.Katz@ fao.org (Twitter: @SteveK1958) Chief, Knowledge Management and Library Services Food and Agriculture Organization of the United Nations
  • 2. Agenda Open @ FAO 1 Context and History of Open @ FAO Ongoing Practical Initiatives 2 • FAO Open Archive • Open Data (data.fao.org) • Data Governance and Standards 3 Issues, Challenges and Lessons Learned 4 Group Discussion
  • 3. Open @ FAO : Food For Thought? Food for Thought
  • 4. Food and Agriculture Organization of the United Nations (FAO) • FAO is a specialized agency of the United Nations with its own independent governance • 190+ Member Countries • HQs in Rome, Offices in over 80 countries with over 5000 staff.
  • 5. Food and Agriculture Organization of the United Nations (FAO) • Collects, analyses, interprets and disseminates information on nutrition, food and agriculture • Policy Advice • Furnishes Technical Assistance • A Neutral Forum for International Cooperation
  • 6. FAO has been in the “knowledge” business since 1946! Our mandate.... Ensure that the world’s knowledge of food and agriculture is available to those who need it when they need it and in a form which they can access and use.
  • 7. Open @ FAO: A Bit of History • 1995 – Central Publishing Unit Abolished • 1996 – SGML Repository Proposal; FAOSTAT on-line • 1997 – Document Repository (XML Compatible) • 2003 – Document Repository (PDF) • 2007 – Open Archive Proposal (Fedora Commons) • 2010 – Open Data Repository Proposal (data.fao.org) • 2012 – OpenArchive.Fao.Org; Data.Fao.Org
  • 8. FAO Open Archive Goals/Objectives • To make FAO’s Global Public Goods openly accessible from a single access point • To be able to exchange data in an open and standardized way • To have a smooth/efficient workflow to manage FAO’s Institutional memory • To integrate e-publishing and library workflows
  • 9. FAO Open Archive Architecture • Based on Open Source tools (Fedora Commons and Java) • Based on modern standards for data management (MODS and FRBR) • Allowing for easier management and sharing of multilingual content. And this is what it looks like:
  • 10–15. [Image-only slides following "this is what it looks like"; no text extracted]
  • 16. Open Archive Resources Available at Start-up Time (resource type: number of records) • Full Text Documents: 40,100 • Photos and Videos: 17,100 • Audio Files: 1,200
  • 17. Open Data (data.fao.org) Goals/Objectives • To address fragmentation and duplication of information systems and data presently distributed across many organizational units • http://data.fao.org: one-stop shop that aggregates, integrates, and catalogues data from multiple sources across FAO. Topics are related to nutrition, food and agriculture and include statistics, maps, pictures, documents and more.
  • 18. Open Data (data.fao.org) Guiding Principles • Uniting FAO data with one brand: http://data.fao.org • Engaging a Community: #FAOdata • Mobile First • Serve the data in the most convenient format • Integrate, don't reimplement
  • 19. data.fao.org - The Big Picture • Specialised application(s), Website, Services and Widgets consume/provide • Orchestration and Integration • Search: Full text, Structured, Linked Data, ... • Catalogue: Metadata • Statistics: Statistical Data Warehouse, Time Series, Indicators, Observations • Maps: Geospatial, Raster, Vector, Point • Content: Documents, Pictures, Video, Multimedia, Pages • Infrastructure: Identity, Logging, Caching, Security, Audit, ...
  • 20. Data Flow Architecture: multiple Data Sources → Ingest → Harmonise → Integrate → Enrich → Publish
  • 23. Some Numbers • 356,000,000 Statistical values • 2 Terabytes • 1,500,000 Statistical Maps • 734,000 Geo Layers • 30 Terabytes • 435 Documents • 90 Pictures • 25 Information Systems
  • 24. Simple and few technologies: Opscode – Chef, Red Hat – RHQ, Nagios Enterprises – Nagios, Jenkins-CI Opensource – Jenkins, Apache – Maven, Apache – SVN, Atlassian – JIRA, Apache – JMeter, Sourceforge Opensource – JUnit, Oracle – Java, Liferay – Liferay Portal, PostgreSQL – PostgreSQL/PostGIS, Pgbouncer – pgbouncer, Pentaho – Pentaho BI, Analytical Labs – Saiku, Talend – Talend Open Studio, RedHat/JBoss – Application Server, VMware – Server Virtualization, Exceliance – HAProxy, RedHat – Enterprise Linux Server, GeoNetwork OpenSource – GeoNetwork, OpenGeo, GeoSolutions, Fedora Commons, Refractions Research – GeoServer, Ontotext – OWLIM, RedHat/JBoss – JBoss AS / JEE, Alfresco – Activiti BPM Platform, jasig – CAS Client, Batchwork Software – Doc2Doc, Google Code Opensource – ZXing ("Zebra Crossing"), Highsoft Solutions AS – Highcharts, Twitter – Bootstrap, Liferay – Alloy-UI, jQuery – jQuery UI, Geert de Deckere – Graph Up, and then there are all the SaaS products... Further questions on data.fao.org to the Project Manager: Karl.Morteo@fao.org
  • 26. CIARD – a global movement to make agricultural research information and knowledge truly accessible to all • All organizations that create and possess public agricultural research information disseminate and share it more widely • CIARD partners create coherence by a) coordinating their efforts, b) promoting common formats, c) adopting open systems and standards • Create a global network of public collections of data and information
  • 28. Distributed Data Sets • stats • gene banks • gis data • blogs, • journals • open archives • raw data • technologies • learning objects • ……….. How to make value added services? How to infer new knowledge? How to organize collaboration? Maybe we really need this?...
  • 29. …to • stats • gene banks • gis data • blogs, • journals • open archives • raw data • technologies • learning objects • ………..
  • 31. OpenAgris • Aggregates different data sources to expand knowledge about a topic • Is a “linked-data” environment mashing-up interlinked datasets to create an integrated knowledge base • OpenAgris uses the Agrovoc thesaurus as backbone to interlink to other existing datasets (DBPedia, WorldBank, Geopolitical Ontology…)
  • 33. Open Archive: Issues, Challenges, Lessons • Unclear Policy Framework • Unclear collection selection policy • Variable quality standards (content, legal, editorial, accountability) • Licensing policy/conditions for re-use • Working with partners and scientific journals • Freely available but need attribution • Supply vs demand (personal interest vs impact) • Tension with Sales and Marketing needs. May lead to negative consequences such as: low credibility/trust, reputational risk, legal exposure?
  • 34. Open Data: Issues, Challenges, Lessons. Well, the same stuff as before really: • Unclear Policy Framework • Unclear collection selection policy • Variable quality standards • Licensing policy/conditions for re-use • Working with partners • Freely available but need attribution • Supply vs demand (personal interest vs impact) • Tension with Sales and Marketing needs. May lead to negative consequences such as: low credibility/trust, reputational risk, legal exposure?
  • 35. Open Data: Issues, Challenges, Lessons. But also: • Every data type has its own standards (e.g. OGC for GIS, SDMX for stats, MODS for documents, IPTC for photos) • Aggregate data quality set by lowest common denominator • Poor data governance leads to conflicting/contradictory data values from different sources, lack of agreement on definitions and concepts, and insufficient metadata • Comparing apples, pears and oranges (different units, different assumptions, different contexts). May lead to negative consequences such as: low credibility/trust, reputational risk, legal exposure?
  • 36. Thank you! Time for Discussion and soon for Lunch!
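
The five stages named on slide 20 (ingest, harmonise, integrate, enrich, publish), together with the unit mismatches and conflicting values raised on slide 35, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Observation` record, the stage functions, the unit table and the 1% tolerance are hypothetical and are not taken from data.fao.org.

```python
from dataclasses import dataclass

# Hypothetical unit table: every value is converted to tonnes.
TO_TONNES = {"tonnes": 1.0, "kg": 0.001}


@dataclass(frozen=True)
class Observation:
    source: str
    country: str
    item: str
    year: int
    value: float
    unit: str


def ingest(raw_rows, source):
    """Turn raw dict rows from one source into Observation records."""
    return [Observation(source=source, **row) for row in raw_rows]


def harmonise(observations):
    """Convert every value to a common unit (tonnes)."""
    return [
        Observation(o.source, o.country, o.item, o.year,
                    o.value * TO_TONNES[o.unit], "tonnes")
        for o in observations
    ]


def integrate(observations):
    """Group observations describing the same country, item and year."""
    merged = {}
    for o in observations:
        merged.setdefault((o.country, o.item, o.year), []).append(o)
    return merged


def enrich(merged, tolerance=0.01):
    """Attach per-source provenance and flag keys whose sources disagree."""
    records = []
    for (country, item, year), group in sorted(merged.items()):
        values = [o.value for o in group]
        spread = max(values) - min(values)
        records.append({
            "country": country, "item": item, "year": year,
            "unit": "tonnes",
            "values_by_source": {o.source: o.value for o in group},
            "conflict": spread > tolerance * max(abs(v) for v in values),
        })
    return records


def publish(records):
    """Stand-in for publication: hand back records ready to serialise."""
    return records


def run_pipeline(sources):
    """Run all five stages over a mapping of source name to raw rows."""
    observations = []
    for name, rows in sources.items():
        observations.extend(harmonise(ingest(rows, name)))
    return publish(enrich(integrate(observations)))
```

Harmonising units before integration is what lets the conflict flag mean something: two sources reporting 1,200 tonnes and 1,200,000 kg agree, while a real disagreement stays visible with its provenance instead of being silently averaged away.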

Editor's notes

  1. Metadata Object Description Schema (MODS); Functional Requirements for Bibliographic Records (FRBR)
  2. QR Code, Quick Response Code
  3. International System for Agricultural Science and Technology