SlideShare ist ein Scribd-Unternehmen logo
1 von 30
In the Age of Open InformationDo-It-Yourself Analytical Mashups on Schema-optional DataKatrin BraunschweigJulian EberiusMaik ThieleWolfgang Lehner OUTPUT 2011
The Roots ofOpen Data The open society is a concept originally developed by philosopher Karl Popper In open societies, government is responsive and tolerant, and political mechanisms are transparent and flexible The state keeps no secrets from itself in the    public sense It is a non-authoritarian society in which    all are trusted with the knowledge of all
1984 - Freedom of Information Campaign starts up
Why Data Shouldbe Open Many scientific data can be deemed to belong to the commons (“the human race”), e.g. the human genome, medical science, environmental data They have an infrastructural role essential for scientific endeavour (e.g. in Geographic Information Systems and maps) Data published in scientific articles are factual and therefore not copyrightable Public money was used to fund    the work and so it should be    universally available It was created by or at a    government institution
Open Data – Examples
data.gov
data.gov.uk
data.worldbank.org
unData
OpenStreetMap
CivicApplicationsbased on Open Data
Explore How U.S. Budget Proposal
Mapnificient
Schooloscope
FluglÀrmkarte (taz.de) Database Journalism
Open Data – Challenges an   Challenges Lots ofcontributors / maintainers Small informationpieces distributed, decentralised     and verylooselycoupled Different degreeofschemainformationand  metadata Innovation / unexpectedreuse Nostandardizeddevelopmentprocess Contributions Schema-optional datastore, collaborativeschemaaugmentation (basicoperators) Measuredegreeofschemainformation Non-destructiveschemachanges Capture dataprovenance Visualizationsand interactionpatterns Iterative and guideddevelopment Data and visualizationrecommendation
The Big Picture
Do-It-Yourself Schema Augmentation Application ReferenceNode AttributeTypes EntityTypes NoType NoType ET1 AT1 ET4 AT2 AT4 ET2 ET3 AT3 Schema  Augmentation Automated Schema Extraction AT1 : value AT2 : value AT3 : value AT4 : value E V E V AT TT AT TT ET Relational Table CSV File
Do-It-Yourself Analytical Mashups Process Query „number of cafes vs. age distribution per district of Dresden“ Look upfittingdatasets Computesuitablevisualization Computeinteraction / explorationfeatures
Do-It-Yourself Analytical Mashups (2) Process Query „number of cafes vs. age distribution per district of Dresden“ Look upfittingdatasets Computesuitablevisualization Computeinteraction / explorationfeatures NLP techniques  + Lookup services (e.g. GeoNames) number of cafes vs.age distribution perdistrict of Dresden natural geographic entity value dimensions relations/operations
Do-It-Yourself Analytical Mashups (3) Process Query „number of cafes vs. age distribution per district of Dresden“ Look upfittingdatasets Computesuitablevisualization Computeinteraction / explorationfeatures Ambiguity userfeedback OR
Do-It-Yourself Analytical Mashups (4) Process Query „number of cafes vs. age distribution per district of Dresden“ Look upfittingdatasets Computesuitablevisualization Computeinteraction / explorationfeatures
Do-It-Yourself Analytical Mashups (5) Process Query „number of cafes vs. age distribution per district of Dresden“ Look upfittingdatasets Computesuitablevisualization Computeinteraction / explorationfeatures number of cafes  age distribution  Too much information for one visualization enableexploration, e.g., clicking a district in themapopenshistogram
Demo
Map-centricweb application Mobile  application 3rd-party  applications # # # REST Interface PersistenceLayer Open CivicPlatformforDresden Mobile Application  Add new requests by guiding the user through a wizard-style input form Show (own) reports and there current rating and processing actualstate Visualize all reports on a map Subscribe to a set of urban district and notify the user about news Web Application Filter the requests by their category, their creation time (last 24 hours, last week, last month, all) Change the requests state (open, closed,  closed) for authorized users Zoom in/out and adapt the type of visualization if the issue density gets very sparse
Open CivicPlatformfor Dresden (2)
Open CivicPlatformforDresden (3)
New York – Example
New York – Example (2)
In the Age of Open InformationDo-It-Yourself Analytical Mashups on Schema-optional DataKatrin BraunschweigJulian EberiusMaik ThieleWolfgang Lehner OUTPUT 2011

Weitere Àhnliche Inhalte

Ähnlich wie In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-optional Data

open-data-presentation.pptx
open-data-presentation.pptxopen-data-presentation.pptx
open-data-presentation.pptx
DennicaRivera
 

Ähnlich wie In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-optional Data (20)

What Data Can Do: A Typology of Mechanisms . AngĂšle Christin
What Data Can Do: A Typology of Mechanisms . AngĂšle Christin What Data Can Do: A Typology of Mechanisms . AngĂšle Christin
What Data Can Do: A Typology of Mechanisms . AngĂšle Christin
 
Nordic health data metadata
Nordic health data   metadataNordic health data   metadata
Nordic health data metadata
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods
 
Ongoing Research in Data Studies
Ongoing Research in Data StudiesOngoing Research in Data Studies
Ongoing Research in Data Studies
 
Digital Humanities and “Digital” Social Sciences
Digital Humanities and “Digital” Social SciencesDigital Humanities and “Digital” Social Sciences
Digital Humanities and “Digital” Social Sciences
 
Scraping the Social? Issues in real-time social research (Departmental Semina...
Scraping the Social? Issues in real-time social research (Departmental Semina...Scraping the Social? Issues in real-time social research (Departmental Semina...
Scraping the Social? Issues in real-time social research (Departmental Semina...
 
(Re-)configuring data ethnography. How to engage with and make sense of data ...
(Re-)configuring data ethnography. How to engage with and make sense of data ...(Re-)configuring data ethnography. How to engage with and make sense of data ...
(Re-)configuring data ethnography. How to engage with and make sense of data ...
 
Digital Methods by Richard Rogers
Digital Methods by Richard RogersDigital Methods by Richard Rogers
Digital Methods by Richard Rogers
 
APLIC 2014 - Social Observatories Coordinating Network
APLIC 2014 - Social Observatories Coordinating NetworkAPLIC 2014 - Social Observatories Coordinating Network
APLIC 2014 - Social Observatories Coordinating Network
 
Zeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadhZeng marcia ifla-subjectaccesssmartdatadh
Zeng marcia ifla-subjectaccesssmartdatadh
 
Crowdsourcing Approaches for Smart City Open Data Management
Crowdsourcing Approaches for Smart City Open Data ManagementCrowdsourcing Approaches for Smart City Open Data Management
Crowdsourcing Approaches for Smart City Open Data Management
 
1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx
 
open-data-presentation.pptx
open-data-presentation.pptxopen-data-presentation.pptx
open-data-presentation.pptx
 
Engines of Order. Social Media and the Rise of Algorithmic Knowing.
Engines of Order. Social Media and the Rise of Algorithmic Knowing.Engines of Order. Social Media and the Rise of Algorithmic Knowing.
Engines of Order. Social Media and the Rise of Algorithmic Knowing.
 
Open Data Journalism
Open Data JournalismOpen Data Journalism
Open Data Journalism
 
A politics of counting - putting people back into big data
A politics of counting - putting people back into big dataA politics of counting - putting people back into big data
A politics of counting - putting people back into big data
 
ECSA, the ECSA principles, and the ECSA Characteristics of Citizen Science
ECSA, the ECSA principles, and the ECSA Characteristics of Citizen ScienceECSA, the ECSA principles, and the ECSA Characteristics of Citizen Science
ECSA, the ECSA principles, and the ECSA Characteristics of Citizen Science
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science Knowledge
 
Data Science-1 (1).ppt
Data Science-1 (1).pptData Science-1 (1).ppt
Data Science-1 (1).ppt
 
Artificial Intelligence For Investigative Reporting
Artificial Intelligence For Investigative ReportingArtificial Intelligence For Investigative Reporting
Artificial Intelligence For Investigative Reporting
 

KĂŒrzlich hochgeladen

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

KĂŒrzlich hochgeladen (20)

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

In the Age of Open Information - Do-It-Yourself Analytical Mashups on Schema-optional Data

  • 1. In the Age of Open InformationDo-It-Yourself Analytical Mashups on Schema-optional DataKatrin BraunschweigJulian EberiusMaik ThieleWolfgang Lehner OUTPUT 2011
  • 2. The Roots ofOpen Data The open society is a concept originally developed by philosopher Karl Popper In open societies, government is responsive and tolerant, and political mechanisms are transparent and flexible The state keeps no secrets from itself in the public sense It is a non-authoritarian society in which all are trusted with the knowledge of all
  • 3. 1984 - Freedom of Information Campaign starts up
  • 4. Why Data Shouldbe Open Many scientific data can be deemed to belong to the commons (“the human race”), e.g. the human genome, medical science, environmental data They have an infrastructural role essential for scientific endeavour (e.g. in Geographic Information Systems and maps) Data published in scientific articles are factual and therefore not copyrightable Public money was used to fund the work and so it should be universally available It was created by or at a government institution
  • 5. Open Data – Examples
  • 12. Explore How U.S. Budget Proposal
  • 16. Open Data – Challenges an Challenges Lots ofcontributors / maintainers Small informationpieces distributed, decentralised and verylooselycoupled Different degreeofschemainformationand metadata Innovation / unexpectedreuse Nostandardizeddevelopmentprocess Contributions Schema-optional datastore, collaborativeschemaaugmentation (basicoperators) Measuredegreeofschemainformation Non-destructiveschemachanges Capture dataprovenance Visualizationsand interactionpatterns Iterative and guideddevelopment Data and visualizationrecommendation
  • 18. Do-It-Yourself Schema Augmentation Application ReferenceNode AttributeTypes EntityTypes NoType NoType ET1 AT1 ET4 AT2 AT4 ET2 ET3 AT3 Schema Augmentation Automated Schema Extraction AT1 : value AT2 : value AT3 : value AT4 : value E V E V AT TT AT TT ET Relational Table CSV File
  • 19. Do-It-Yourself Analytical Mashups Process Query „number of cafes vs. age distribution per district of Dresden“ Look upfittingdatasets Computesuitablevisualization Computeinteraction / explorationfeatures
  • 20. Do-It-Yourself Analytical Mashups (2) Process Query „number of cafes vs. age distribution per district of Dresden“ Look upfittingdatasets Computesuitablevisualization Computeinteraction / explorationfeatures NLP techniques + Lookup services (e.g. GeoNames) number of cafes vs.age distribution perdistrict of Dresden natural geographic entity value dimensions relations/operations
  • 21. Do-It-Yourself Analytical Mashups (3) Process Query „number of cafes vs. age distribution per district of Dresden“ Look upfittingdatasets Computesuitablevisualization Computeinteraction / explorationfeatures Ambiguity userfeedback OR
  • 22. Do-It-Yourself Analytical Mashups (4) Process Query „number of cafes vs. age distribution per district of Dresden“ Look upfittingdatasets Computesuitablevisualization Computeinteraction / explorationfeatures
  • 23. Do-It-Yourself Analytical Mashups (5) Process Query „number of cafes vs. age distribution per district of Dresden“ Look upfittingdatasets Computesuitablevisualization Computeinteraction / explorationfeatures number of cafes age distribution Too much information for one visualization enableexploration, e.g., clicking a district in themapopenshistogram
  • 24. Demo
  • 25. Map-centricweb application Mobile application 3rd-party applications # # # REST Interface PersistenceLayer Open CivicPlatformforDresden Mobile Application Add new requests by guiding the user through a wizard-style input form Show (own) reports and there current rating and processing actualstate Visualize all reports on a map Subscribe to a set of urban district and notify the user about news Web Application Filter the requests by their category, their creation time (last 24 hours, last week, last month, all) Change the requests state (open, closed, closed) for authorized users Zoom in/out and adapt the type of visualization if the issue density gets very sparse
  • 28. New York – Example
  • 29. New York – Example (2)
  • 30. In the Age of Open InformationDo-It-Yourself Analytical Mashups on Schema-optional DataKatrin BraunschweigJulian EberiusMaik ThieleWolfgang Lehner OUTPUT 2011