SlideShare ist ein Scribd-Unternehmen logo
1 von 36
Open For Development
                             EADI IMWG Conference 2012



                            Open @ FAO


Stephen.Katz@ fao.org (Twitter: @SteveK1958)
Chief, Knowledge Management and Library Services
Food and Agriculture Organization of the United Nations
Agenda
    Open @ FAO


1     Context and History of Open @ FAO


      Ongoing Practical Initiatives
2     • FAO Open Archive
      • Open Data (data.fao.org)
      • Data Governance and Standards

3
      Issues, Challenges and Lessons Learned



4     Group Discussion
Open @ FAO : Food For Thought?


Food for Thought
Food and Agriculture Organization of the United Nations
                         (FAO)


                         •   FAO is a specialized
                             agency of the United
                             Nations with its own
                             independent governance

                         •   190+ Member Countries

                         •   HQs in Rome, Offices in
                             over 80 countries with
                             over 5000 staff.
Food and Agriculture Organization of the United Nations
                         (FAO)


                   •   Collects, analyses, interprets and
                       disseminates information on
                       nutrition, food and agriculture

                   •   Policy Advice

                   •   Furnishes Technical Assistance

                   •   A Neutral Forum for International
                       Cooperation
FAO has been in the
“knowledge” business
since 1946!

Our mandate....
Ensure that the world’s
knowledge of food and
agriculture is available to
those who need it when
they need it and in a form
which they can access
and use.
Open @ FAO : A Bit of History




   1995 – Central Publishing Unit Abolished
   1996 – SGML Repository Proposal; FAOSTAT on-line
   1997 – Document Repository (XML Compatible)
   2003 – Document Repository (PDF)
   2007 – Open Archive Proposal (Fedora Commons)
   2010 – Open Data Repository Proposal (data.fao.org)
   2012 – OpenArchive.Fao.Org; Data.Fao.Org
FAO Open Archive
    Goals/Objectives

 To make FAO’s Global Public Goods openly
  accessible from a single access point
 To be able to exchange data in an open and
  standardized way
 To have a smooth/efficient workflow to
  manage FAO’s Institutional memory
 To integrate e-publishing and library workflows
FAO Open Archive
    Architecture
 Based on Open Source tools (Fedora
  Commons and Java)
 Based on modern standards for data
  management (MODS and FRBR)
 Allowing for easier management and sharing
  of multilingual content

And this is what it looks like:
Open Archive Resources
Available at Start-up Time
           Resource Type       Number of Records
     Full Text Documents   40,100



     Photos and Videos     17,100



     Audio Files           1,200
Open Data (data.fao.org)
Goals/Objectives
   To address fragmentation and duplication of
    information systems and data presently distributed
    across many organizational units
   http://data.fao.org: one-stop shop that aggregates,
    integrates, and catalogues data from multiple sources
    across FAO. Topics are related to nutrition, food and
    agriculture and include statistics, maps, pictures,
    documents and more.
Open Data (data.fao.org)
Guiding Principles
   Uniting FAO data with one brand : http://data.fao.org
   Engaging a Community : #FAOdata
   Mobile First
   Serve the data in the most convenient format
   Integrate, don't reimplement
data.fao.org - The Big Picture

                                                                           Specialised
             Website                 Services and Widgets                 application(s)
                                                                           consume/provide



                                Orchestration and Integration

Search            Catalogue        Statistics         Maps         Content          Infrastructure
                                   Statistical Data
Full text         Identity         Warehouse          Geospatial   Documents        Logging
Structured        Metadata                            Raster       Pictures         Caching
                  Linked Data      Time Series        Vector       Video            Security
                  ...              Indicators         Point        Multimedia       Audit
                                   Observations                    Pages            ...
Data Flow Architecture
                          Data
              Data       Source
             Source




              Ingest
                           Harmonise
 Data
                            Integrate
Source
                             Enrich

                                        Publish
     Data
    Source    Data
             Source
Landing Page
Some Numbers
356,000,000 Statistical values
  2 Terabytes

1,500,000 Statistical Maps

734,000 Geo Layers
 30 Terabytes

435 Documents

90 Pictures

25 Information Systems
Simple and few technologies 
   Opscode –Chef, Red Hat- RHQ, Nagios- Enterprises Nagios,
    Jenkins-CI – Opensource – Jenkins, Apache              Maven, Apache –
    SVN, Atlassian – JIRA, Apache – Jmeter, Sourceforge Opensource –
    Junit, Oracle- Java, Liferay - Liferay Portal, PostgreSQL – PostgreSQL
    PostGIS, Pgbouncer – pgbouncer, Pentaho - Pentaho BI, Analytical
    Labs – Saiku, Talend - Talend Open Studio, RedHat/Jboss -
    Application Server, Vmware - Server Virtualization,
   Exceliance – HAProxy, RedHat- Enterprise Linux Server, GeoNetwork
    OpenSource – GeoNetwork, OpenGeo, GeoSolutions, Fedora
    Commons, Refractions Research – GeoServer, Ontotext - OWLIM,
    RedHat/Jboss - JBoss AS / JEE, Alfresco - Activiti BPM Platform, jasig -
    CAS Client, Batchwork Software - Doc2Doc, Google Code Opensource -
    ZXing ("Zebra Crossing"), Highsoft Solutions AS – highcharts, Twitter –
    Bootstrap, Liferay - Alloy-UI, JQuery - JQuery-ui, Geert de Deckere -
    Graph Up and the there are all the SaaS products ….

    Further questions on data.fao.org to the Project Manager: Karl.Morteo@fao.org
http://www.ciard.net
CIARD – a global movement
                    •   All organizations that create and
   To make
                        possess public agricultural
    agricultural        research information disseminate
    research            and share it more widely
    information     •   CIARD partners create coherence
    and                 by a) coordinating their efforts, b)
    knowledge           promoting common formats, c)
    truly acessible     adopting open systems and
    to all              standards
                    •   Create a global network of public
                        collections of data and information
http://aims.fao.org
Distributed Data Sets
 •   stats
 •   gene banks
 •   gis data
 •   blogs,
 •   journals
 •   open archives
 •   raw data
 •   technologies
 •   learning objects
 •   ………..

How to make value added services?
How to infer new knowledge?
How to organize collaboration?
Maybe we really need this?...
…to
•   stats
•   gene banks
•   gis data
•   blogs,
•   journals
•   open archives
•   raw data
•   technologies
•   learning objects
•   ………..
Creating Linked Applications
OpenAgris

 Aggregates different data sources to expand
  knowledge about a topic
 Is a “linked-data” environment mashing-up
  interlinked datasets to create an integrated
  knowledge base
 OpenAgris uses the Agrovoc thesaurus as
  backbone to interlink to other existing
  datasets (DBPedia, WorldBank, Geopolitical
  Ontology…)
Open Archive : Issues, Challenges, Lessons

   Unclear Policy Framework
   Unclear collection selection policy
   Variable quality standards (content, legal, editorial,
    accountability)
   Licensing policy/conditions for re-use
   Working with partners and scientific journals
   Freely available but need attribution
   Supply vs demand (personal interest vs impact)
   Tension with Sales and Marketing needs

May Lead To Negative Consequences such as:
 Low credibility/trust, reputational risk, legal exposure?
Open Data : Issues, Challenges, Lessons

Well the same stuff as before really 
 Unclear Policy Framework
 Unclear collection selection policy
 Variable quality standards
 Licensing policy/conditions for re-use
 Working with partners
 Freely available but need attribution
 Supply vs demand (personal interest vs impact)
 Tension with Sales and Marketing needs


May Lead To Negative Consequences such as:
 Low credibility/trust, reputational risk, legal exposure?
Open Data : Issues, Challenges, Lessons

But also:
 Every data-type has it’s own standards (e.g. OGC for GIS,
  SDMX for stats, MODS for documents, IPTC for Photos)
 Aggregate data quality set by lowest common denominator
 Poor data governance leads to:
      Conflicting/contradictory data values from different sources
      Lack of agreement of definitions and concepts, and
      Insufficient metadata
      Comparing apples, pears and oranges (different units, different
       assumptions, different contexts)
May Lead To Negative Consequences such as:
 Low credibility/trust, reputational risk, legal exposure?
Thank you!

Time for Discussion

and soon for Lunch!

Weitere ähnliche Inhalte

Was ist angesagt?

WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedStefan Dietze
 
Open Access: Open Access Looking for ways to increase the reach and impact of...
Open Access: Open Access Looking for ways to increase the reach and impact of...Open Access: Open Access Looking for ways to increase the reach and impact of...
Open Access: Open Access Looking for ways to increase the reach and impact of...librarianrafia
 
WWW2013 Tutorial: Linked Data & Education
WWW2013 Tutorial: Linked Data & EducationWWW2013 Tutorial: Linked Data & Education
WWW2013 Tutorial: Linked Data & EducationStefan Dietze
 
DataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRefDataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRefCrossref
 
Geospatial Metadata and Spatial Data: It's all Greek to me!
Geospatial Metadata and Spatial Data: It's all Greek to me!Geospatial Metadata and Spatial Data: It's all Greek to me!
Geospatial Metadata and Spatial Data: It's all Greek to me!EDINA, University of Edinburgh
 
Data Management Planning at the DCC
Data Management Planning at the DCCData Management Planning at the DCC
Data Management Planning at the DCCMartin Donnelly
 

Was ist angesagt? (7)

WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
 
Open Access: Open Access Looking for ways to increase the reach and impact of...
Open Access: Open Access Looking for ways to increase the reach and impact of...Open Access: Open Access Looking for ways to increase the reach and impact of...
Open Access: Open Access Looking for ways to increase the reach and impact of...
 
WWW2013 Tutorial: Linked Data & Education
WWW2013 Tutorial: Linked Data & EducationWWW2013 Tutorial: Linked Data & Education
WWW2013 Tutorial: Linked Data & Education
 
DataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRefDataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRef
 
Geospatial Metadata and Spatial Data: It's all Greek to me!
Geospatial Metadata and Spatial Data: It's all Greek to me!Geospatial Metadata and Spatial Data: It's all Greek to me!
Geospatial Metadata and Spatial Data: It's all Greek to me!
 
RDA in a Nutshell - March 2020
RDA in a Nutshell - March 2020RDA in a Nutshell - March 2020
RDA in a Nutshell - March 2020
 
Data Management Planning at the DCC
Data Management Planning at the DCCData Management Planning at the DCC
Data Management Planning at the DCC
 

Ähnlich wie Open@Fao presentation at the EADI Open For Development Project, 2012

FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData ManagementUlrike Wittig
 
2012 05 usain-johanneskeizer
2012 05 usain-johanneskeizer2012 05 usain-johanneskeizer
2012 05 usain-johanneskeizerJohannes Keizer
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Introduction to knowledge sharing systems: considerations for the conceptual ...
Introduction to knowledge sharing systems: considerations for the conceptual ...Introduction to knowledge sharing systems: considerations for the conceptual ...
Introduction to knowledge sharing systems: considerations for the conceptual ...Nikos Manouselis
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 
EIA Biodiversity Data Mobilisation
EIA Biodiversity Data MobilisationEIA Biodiversity Data Mobilisation
EIA Biodiversity Data MobilisationVishwas Chavan
 
Scaling up food safety information transparency
Scaling up food safety information transparencyScaling up food safety information transparency
Scaling up food safety information transparencyNikos Manouselis
 
Conceptual Design of TAPipedia
Conceptual Design of TAPipediaConceptual Design of TAPipedia
Conceptual Design of TAPipediaNikos Manouselis
 
Making agricultural knowledge globally discoverable: are we there yet?
Making agricultural knowledge globally discoverable: are we there yet?Making agricultural knowledge globally discoverable: are we there yet?
Making agricultural knowledge globally discoverable: are we there yet?Nikos Manouselis
 
Agro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystemAgro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystemNikos Manouselis
 
ERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarFAIRDOM
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
 
NFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIRNFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIRSusanna-Assunta Sansone
 
Research Data Alliance .. The Why, How, What ...
Research Data Alliance .. The Why, How, What ... Research Data Alliance .. The Why, How, What ...
Research Data Alliance .. The Why, How, What ... Research Data Alliance
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
Why are e-Infrastructures useful from a small business perspective?
Why are e-Infrastructures useful from a small business perspective?Why are e-Infrastructures useful from a small business perspective?
Why are e-Infrastructures useful from a small business perspective?Nikos Manouselis
 
Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collectionsabedejesus
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeEdward Baker
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeVince Smith
 

Ähnlich wie Open@Fao presentation at the EADI Open For Development Project, 2012 (20)

FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData Management
 
2012 05 usain-johanneskeizer
2012 05 usain-johanneskeizer2012 05 usain-johanneskeizer
2012 05 usain-johanneskeizer
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Introduction to knowledge sharing systems: considerations for the conceptual ...
Introduction to knowledge sharing systems: considerations for the conceptual ...Introduction to knowledge sharing systems: considerations for the conceptual ...
Introduction to knowledge sharing systems: considerations for the conceptual ...
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
EIA Biodiversity Data Mobilisation
EIA Biodiversity Data MobilisationEIA Biodiversity Data Mobilisation
EIA Biodiversity Data Mobilisation
 
Scaling up food safety information transparency
Scaling up food safety information transparencyScaling up food safety information transparency
Scaling up food safety information transparency
 
Conceptual Design of TAPipedia
Conceptual Design of TAPipediaConceptual Design of TAPipedia
Conceptual Design of TAPipedia
 
Making agricultural knowledge globally discoverable: are we there yet?
Making agricultural knowledge globally discoverable: are we there yet?Making agricultural knowledge globally discoverable: are we there yet?
Making agricultural knowledge globally discoverable: are we there yet?
 
Agro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystemAgro-Know & the European agricultural research information ecosystem
Agro-Know & the European agricultural research information ecosystem
 
ERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management Webinar
 
FAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech ProposalsFAIRDOM data management support for ERACoBioTech Proposals
FAIRDOM data management support for ERACoBioTech Proposals
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
 
NFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIRNFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIR
 
Research Data Alliance .. The Why, How, What ...
Research Data Alliance .. The Why, How, What ... Research Data Alliance .. The Why, How, What ...
Research Data Alliance .. The Why, How, What ...
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Why are e-Infrastructures useful from a small business perspective?
Why are e-Infrastructures useful from a small business perspective?Why are e-Infrastructures useful from a small business perspective?
Why are e-Infrastructures useful from a small business perspective?
 
Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collections
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-Life
 
NHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-LifeNHM Data Portal: first steps toward the Graph-of-Life
NHM Data Portal: first steps toward the Graph-of-Life
 

Kürzlich hochgeladen

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 

Kürzlich hochgeladen (20)

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 

Open@Fao presentation at the EADI Open For Development Project, 2012

  • 1. Open For Development EADI IMWG Conference 2012 Open @ FAO Stephen.Katz@ fao.org (Twitter: @SteveK1958) Chief, Knowledge Management and Library Services Food and Agriculture Organization of the United Nations
  • 2. Agenda Open @ FAO 1 Context and History of Open @ FAO Ongoing Practical Initiatives 2 • FAO Open Archive • Open Data (data.fao.org) • Data Governance and Standards 3 Issues, Challenges and Lessons Learned 4 Group Discussion
  • 3. Open @ FAO : Food For Thought? Food for Thought
  • 4. Food and Agriculture Organization of the United Nations (FAO) • FAO is a specialized agency of the United Nations with its own independent governance • 190+ Member Countries • HQs in Rome, Offices in over 80 countries with over 5000 staff.
  • 5. Food and Agriculture Organization of the United Nations (FAO) • Collects, analyses, interprets and disseminates information on nutrition, food and agriculture • Policy Advice • Furnishes Technical Assistance • A Neutral Forum for International Cooperation
  • 6. FAO has been in the “knowledge” business since 1946! Our mandate.... Ensure that the world’s knowledge of food and agriculture is available to those who need it when they need it and in a form which they can access and use.
  • 7. Open @ FAO : A Bit of History  1995 – Central Publishing Unit Abolished  1996 – SGML Repository Proposal; FAOSTAT on-line  1997 – Document Repository (XML Compatible)  2003 – Document Repository (PDF)  2007 – Open Archive Proposal (Fedora Commons)  2010 – Open Data Repository Proposal (data.fao.org)  2012 – OpenArchive.Fao.Org; Data.Fao.Org
  • 8. FAO Open Archive Goals/Objectives  To make FAO’s Global Public Goods openly accessible from a single access point  To be able to exchange data in an open and standardized way  To have a smooth/efficient workflow to manage FAO’s Institutional memory  To integrate e-publishing and library workflows
  • 9. FAO Open Archive Architecture  Based on Open Source tools (Fedora Commons and Java)  Based on modern standards for data management (MODS and FRBR)  Allowing for easier management and sharing of multilingual content And this is what it looks like:
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. Open Archive Resources Available at Start-up Time Resource Type Number of Records Full Text Documents 40,100 Photos and Videos 17,100 Audio Files 1,200
  • 17. Open Data (data.fao.org) Goals/Objectives  To address fragmentation and duplication of information systems and data presently distributed across many organizational units  http://data.fao.org: one-stop shop that aggregates, integrates, and catalogues data from multiple sources across FAO. Topics are related to nutrition, food and agriculture and include statistics, maps, pictures, documents and more.
  • 18. Open Data (data.fao.org) Guiding Principles  Uniting FAO data with one brand : http://data.fao.org  Engaging a Community : #FAOdata  Mobile First  Serve the data in the most convenient format  Integrate, don't reimplement
  • 19. data.fao.org - The Big Picture Specialised Website Services and Widgets application(s) consume/provide Orchestration and Integration Search Catalogue Statistics Maps Content Infrastructure Statistical Data Full text Identity Warehouse Geospatial Documents Logging Structured Metadata Raster Pictures Caching Linked Data Time Series Vector Video Security ... Indicators Point Multimedia Audit Observations Pages ...
  • 20. Data Flow Architecture Data Data Source Source Ingest Harmonise Data Integrate Source Enrich Publish Data Source Data Source
  • 21.
  • 23. Some Numbers 356,000,000 Statistical values 2 Terabytes 1,500,000 Statistical Maps 734,000 Geo Layers 30 Terabytes 435 Documents 90 Pictures 25 Information Systems
  • 24. Simple and few technologies   Opscode –Chef, Red Hat- RHQ, Nagios- Enterprises Nagios, Jenkins-CI – Opensource – Jenkins, Apache Maven, Apache – SVN, Atlassian – JIRA, Apache – Jmeter, Sourceforge Opensource – Junit, Oracle- Java, Liferay - Liferay Portal, PostgreSQL – PostgreSQL PostGIS, Pgbouncer – pgbouncer, Pentaho - Pentaho BI, Analytical Labs – Saiku, Talend - Talend Open Studio, RedHat/Jboss - Application Server, Vmware - Server Virtualization,  Exceliance – HAProxy, RedHat- Enterprise Linux Server, GeoNetwork OpenSource – GeoNetwork, OpenGeo, GeoSolutions, Fedora Commons, Refractions Research – GeoServer, Ontotext - OWLIM, RedHat/Jboss - JBoss AS / JEE, Alfresco - Activiti BPM Platform, jasig - CAS Client, Batchwork Software - Doc2Doc, Google Code Opensource - ZXing ("Zebra Crossing"), Highsoft Solutions AS – highcharts, Twitter – Bootstrap, Liferay - Alloy-UI, JQuery - JQuery-ui, Geert de Deckere - Graph Up and the there are all the SaaS products …. Further questions on data.fao.org to the Project Manager: Karl.Morteo@fao.org
  • 26. CIARD – a global movement • All organizations that create and  To make possess public agricultural agricultural research information disseminate research and share it more widely information • CIARD partners create coherence and by a) coordinating their efforts, b) knowledge promoting common formats, c) truly acessible adopting open systems and to all standards • Create a global network of public collections of data and information
  • 28. Distributed Data Sets • stats • gene banks • gis data • blogs, • journals • open archives • raw data • technologies • learning objects • ……….. How to make value added services? How to infer new knowledge? How to organize collaboration? Maybe we really need this?...
  • 29. …to • stats • gene banks • gis data • blogs, • journals • open archives • raw data • technologies • learning objects • ………..
  • 31. OpenAgris  Aggregates different data sources to expand knowledge about a topic  Is a “linked-data” environment mashing-up interlinked datasets to create an integrated knowledge base  OpenAgris uses the Agrovoc thesaurus as backbone to interlink to other existing datasets (DBPedia, WorldBank, Geopolitical Ontology…)
  • 32.
  • 33. Open Archive : Issues, Challenges, Lessons  Unclear Policy Framework  Unclear collection selection policy  Variable quality standards (content, legal, editorial, accountability)  Licensing policy/conditions for re-use  Working with partners and scientific journals  Freely available but need attribution  Supply vs demand (personal interest vs impact)  Tension with Sales and Marketing needs May Lead To Negative Consequences such as:  Low credibility/trust, reputational risk, legal exposure?
  • 34. Open Data : Issues, Challenges, Lessons Well the same stuff as before really   Unclear Policy Framework  Unclear collection selection policy  Variable quality standards  Licensing policy/conditions for re-use  Working with partners  Freely available but need attribution  Supply vs demand (personal interest vs impact)  Tension with Sales and Marketing needs May Lead To Negative Consequences such as:  Low credibility/trust, reputational risk, legal exposure?
  • 35. Open Data : Issues, Challenges, Lessons But also:  Every data-type has it’s own standards (e.g. OGC for GIS, SDMX for stats, MODS for documents, IPTC for Photos)  Aggregate data quality set by lowest common denominator  Poor data governance leads to:  Conflicting/contradictory data values from different sources  Lack of agreement of definitions and concepts, and  Insufficient metadata  Comparing apples, pears and oranges (different units, different assumptions, different contexts) May Lead To Negative Consequences such as:  Low credibility/trust, reputational risk, legal exposure?
  • 36. Thank you! Time for Discussion and soon for Lunch!

Hinweis der Redaktion

  1. Metadata Object Description SchemaFunctional Requirements for Bibliographic Records
  2. QR Code, Quick Response Code
  3. International System for Agricultural Science and Technology