At the 2009 Seminar on Innovative Approaches to Turn Statistics into Knowledge (http://www.oecd.org/progress/ict/statknowledge), jointly organized by the OECD, US Census Bureau and World Bank, we proposed and demoed a proof of concept for data sharing between international organizations. We demonstrated how open source tools could sit on top of existing infrastructure, and we reused visualization tools to show how data could be pulled and combined from the various organizations on the fly.
11. History
• WHO: Openhealth prototype - global disease incidence reporting platform
• OECD: QWIDS - Query Wizards for International Development Statistics
• Bill & Melinda Gates Foundation
• International Aid Transparency Initiative
69. Take Homes
• build shared software for Int'l Orgs
• sits on existing infrastructure
• end users can answer the harder questions
• query and combine across organizations
• accessibility, usability
70. Where are we going?
• Funding from foundations
• Expressions of Interest
• WHO, OECD, IMF, WB, UNESCO, UNCTAD, FAO
71. Come talk to us!
www.2paths.com/conf/tsik2009
aaron@2paths.com
michal@2paths.com
Editor's Notes
my name is Aaron Gladders, that’s me on the bottom left
my colleague, Michal Urbanski and his recent addition
we work @ 2Paths
we’re from Canada, yo
specifically Vancouver, where we’re known for our rain
but it’s also a beautiful city
http://www.flickr.com/photos/tommyauphoto/2579810410/sizes/o/
Our Focus: making it easier for people (and machines!) to find data
At 2Paths we’re not about world domination. We want to feed our kids....
and ideally make the world a better place
So what is the problem
There are lots of UIs out there, some great ones that we’ve seen here, and we’ve also heard from some great storytellers. But they need access to the data in an accessible form to tell their stories. As we heard yesterday, much of the North American focus is on just getting access to the data.
So a little history. This began for us with the WHO Openhealth prototype. It was a global disease incidence reporting platform, meant to assemble data from all over the world on the diseases occurring there. Ultimately this was to show up on portals, with the associated grids, graphs and maps. Later we worked on making it easier for people to dig for the data they wanted, with the OECD Query Wizard for Int’l Development Statistics. This was built on top of .STAT, which you saw yesterday in Trevor Fletcher’s gangster flick. More recently we’ve been helping the BMGF to classify their information and ultimately to share it. And our latest effort has been with IATI, helping to define which tech should be used for donor governments to share their aid activities, to improve aid transparency and effectiveness.
Really though, we’re trying to answer questions. Such as...
Where are flu pandemics erupting? Generally you would go to the WHO for this.
How much is being spent on HIV/AIDS by Japan? Here you would go to the OECD.
What if we want to get a little more complicated? You’d have to go to both the WHO and the OECD.
And even more complicated: when data comes from places all over the world, as with the Millennium Development Goals.
So where is this data?
You’d go to NSOs (national statistical offices)
Or the websites of Int’l Orgs. Note that some of these orgs are providing data feeds - the OECD’s aren’t advertised, but they are there; we used them, and we even built a nice RESTful one. It’s very exciting that the World Bank has a public API.
More recently you can go to data.un.org
And now with the United States, data.gov
Really though, you go to as many sources as you can find (or even just one)
download it
combine it, chart it
and maybe map it (and with tools like Google Fusion Tables, it’s a snap)
Our goal was to make it easy to answer these questions. That required cross-org data mashups. But we wanted to leverage the existing tools out there, and more importantly, keep it simple.
We quickly zeroed in on semantic web tech - flexibility for each org to define their information as they needed, while allowing for mappings between them, in a queryable way.
I’m here to present a quick primer on Linked Data for the uninitiated. We’re going to quickly go over what it is, and why we should be paying attention to it.
The main technology behind linked data is one we should already be familiar with: the semantic web. It is, essentially, an extension of our current web. It grafts some new standards onto existing ones, in order to give meaning to content.
The technology driving the semantic web has been around for a while. While it may have started out as a highly academic exercise, it has evolved into a very compelling platform for the sharing of data. Semantic technologies have also benefitted from a lot of different areas, from better XML support in languages to the emergence of a new class of semantic-specific vendors. It’s a technology that has “escaped the lab”, so to speak, and is being used to solve actual problems, today.
If you had to summarize semantic technology in one sentence, it would be “Anyone can say anything, about anything”. However, what’s novel about it is that the way you say things is standardized, because ...
... the “meaning” of a statement isn’t intended only for us; it’s also for machines, for tools and agents that can act on _behalf_ of people. Here we see a statement, “Japan is a DAC country”, represented in an example form that a machine would understand.
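As a rough sketch of what that machine-readable form looks like, here is the statement modeled as a single subject-predicate-object triple. The URIs and the predicate name are invented for illustration; they are not terms from any real vocabulary.

```python
# Hypothetical sketch: "Japan is a DAC country" as an RDF-style
# (subject, predicate, object) triple. All URIs here are invented
# examples, not real vocabulary terms.
statement = (
    "http://example.org/country/Japan",   # subject: the thing being described
    "http://example.org/terms/memberOf",  # predicate: the relationship
    "http://example.org/group/DAC",       # object: what it relates to
)

subject, predicate, obj = statement
print(subject, "--[", predicate, "]-->", obj)
```

Because subject, predicate and object are all identified by URIs, a machine can look each one up and connect this statement to others that use the same identifiers.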
If we represent all our information in these semantic formats, we can leverage a significant number of tools that already understand them. We still have to do some work, just like we used to rolling out own XML formats for moving data around, except now we’re more likely to use other people’s vocabularies or produce vocabularies that others can use. These vocabularies are the “link” of linked data.
Because the semantic web has been in development for some time, there are already a number of existing vocabularies you can use to describe your data. Using them is the normal and natural way of participating in the world of linked data. In the case where you need to define your vocabulary because a suitable one doesn’t already exist, you will want others to use it as well, making the entire semantic ecosystem naturally extensible. Once you release and describe your data, others can easily say things about it, or use your vocabulary to describe *their* data, or link their data to yours.
A large amount of linked data is of limited utility if we have no way to find what we’re looking for. Relatively recently, the semantic web gained a nice, shiny query language called SPARQL. It became an official W3C recommendation at the beginning of 2008, so it’s pretty new, but it’s already become quite a popular tool for making complex queries into distributed stores of linked data.
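A real SPARQL query runs against a triple store, which is more setup than fits on a slide. But the heart of it, matching a graph pattern with variables against a set of triples, can be mimicked in a few lines of plain Python. The data and names below are invented toy examples, not a real endpoint or engine.

```python
# Toy triple store plus a matcher that mimics a SPARQL basic graph
# pattern. Variables start with "?"; everything else must match exactly.
# All data here is invented for illustration.
triples = [
    ("Japan",  "memberOf", "DAC"),
    ("Japan",  "spentOn",  "HIV/AIDS"),
    ("France", "memberOf", "DAC"),
]

def match(pattern, store):
    """Return one bindings dict per triple that fits the pattern."""
    results = []
    for triple in store:
        bindings = {}
        for pat, val in zip(pattern, triple):
            if pat.startswith("?"):
                bindings[pat] = val   # variable: bind it
            elif pat != val:
                break                 # constant mismatch: reject triple
        else:
            results.append(bindings)
    return results

# Rough analogue of: SELECT ?country WHERE { ?country memberOf DAC }
print(match(("?country", "memberOf", "DAC"), triples))
# → [{'?country': 'Japan'}, {'?country': 'France'}]
```

A real SPARQL engine does much more (joins across patterns, filters, federation across endpoints), but the variable-binding idea is the same.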
There’s another aspect of the semantic web that we should find very interesting as we examine the world of linked data. The idea of “provenance”, which lets you trace where, and from whom, a piece of data comes, will be highly useful to organizations that are concerned about data accuracy.
Suppose you are looking at a chart of data you’ve found online. With a proper provenance system in place, you would be able to tell where that chart’s data came from, and more importantly, you could determine whether or not you can ...
... trust it. By enumerating your trusted data sources, a system can automatically determine whether the data you’re looking at is trustworthy by examining its provenance.
Suppose that chart had used data from Wikipedia in addition to officially published figures from the OECD and the WHO. In this case, you might want to be a bit more careful using the data from that chart.
Of course, if Wikipedia also has a provenance system in place, ideally your software will follow that chain and perhaps you can trust that data after all.
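The trust-by-provenance idea above can be sketched in a few lines: enumerate your trusted publishers, then walk each dataset's provenance chain. Every dataset name and publisher below is invented for illustration; a real system would read this from provenance metadata attached to the data.

```python
# Toy provenance check. A dataset is trustworthy if its publisher is on
# our trusted list, or if everything it derives from is itself
# trustworthy. All names are invented examples.
provenance = {                       # dataset -> list of source datasets
    "chart_data":      ["oecd_figures", "wikipedia_table"],
    "oecd_figures":    [],           # published directly
    "wikipedia_table": ["who_report"],
    "who_report":      [],
    "blog_post":       [],           # no provenance, unknown author
}
publishers = {
    "chart_data":      "unknown",
    "oecd_figures":    "OECD",
    "wikipedia_table": "Wikipedia",
    "who_report":      "WHO",
    "blog_post":       "unknown",
}
trusted_publishers = {"OECD", "WHO"}

def trustworthy(dataset):
    if publishers.get(dataset) in trusted_publishers:
        return True
    sources = provenance.get(dataset, [])
    # Follow the chain: trusted only if it has sources and all check out.
    return bool(sources) and all(trustworthy(s) for s in sources)

print(trustworthy("chart_data"))  # Wikipedia data traces back to the WHO
print(trustworthy("blog_post"))   # no trusted publisher, no chain to follow
```

This is exactly the Wikipedia case: the table itself comes from an untrusted publisher, but because its own provenance points back to the WHO, the chain checks out.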
The last thing that I’d like to touch on with respect to semantic technology is the idea of inference. This basically means that a semantic system is able to derive “new knowledge” based on things it already knows.
A basic example would be, given a system that knows “Japan is a DAC country” and “DAC countries are donors”, it would be able to infer that Japan is a donor country. This is one of the keys for semantic technology... it’s arguably what’s driving widespread semantic adoption. Once the data and metadata are there, this is the tool that will really drive innovation.
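That inference step can also be sketched in plain Python: repeatedly apply a rule to known facts until nothing new appears (a crude form of forward chaining). The predicate names here are invented; a real reasoner would work over OWL/RDFS semantics rather than this one hard-coded rule.

```python
# Toy forward-chaining inference over invented predicates. Given
# "Japan memberOf DAC" and "DAC membersAre Donor", derive
# "Japan isA Donor".
facts = {
    ("Japan", "memberOf", "DAC"),
    ("DAC", "membersAre", "Donor"),
}

def infer(known):
    """Apply the membership rule until no new facts are produced."""
    derived = set(known)
    while True:
        new = set()
        for s, p, o in derived:
            if p != "memberOf":
                continue
            # Rule: X memberOf G, and G membersAre C  =>  X isA C
            for s2, p2, o2 in derived:
                if s2 == o and p2 == "membersAre":
                    new.add((s, "isA", o2))
        new -= derived
        if not new:
            return derived
        derived |= new

print(("Japan", "isA", "Donor") in infer(facts))  # → True
```

The point is that the derived fact was never stated anywhere; it falls out of the data plus the rule, which is what lets a semantic system answer questions nobody explicitly encoded.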
And that’s it. Hopefully that gives you a reasonable picture as to what linked data and the semantic web are, and why we ought to be interested in them.
And now we’re going to walk through a little prototype we’ve built, in order to demonstrate a working system that can make use of distributed, linked data. I’m presenting screenshots because we’re paranoid about the demo curse, but we do have a running system available here today, which we’re willing to show you under less stressful circumstances.