SlideShare ist ein Scribd-Unternehmen logo
1 von 33
knowerce|co




                 Datacamp
                 Fair-Play Data




Stefan Urbanek                    November 2009
stefan@knowerce.sk
knowerce|co




Web and Data
knowerce|co

     government         register of
       web site        organisations




data sitting in isolated data silos
                                              3
knowerce|co




                             huh?




we do not have data on web as data
                                     4
knowerce|co




           ≠

document       data

                             5
knowerce|co




The Datacamp
knowerce|co




=   +



               7
knowerce|co




                   +

         manage data
with the quality process of an enterprise


                                                   8
knowerce|co


For Visitors




data catalogue   searching   sharing




                                              9
knowerce|co


For Owners




data descriptions   importing   quality management




                                                  10
knowerce|co


For Remixers



             application
        programming interface




                                      11
knowerce|co




                  Fair-play       Data*




*open, social, shareable and with quality assurance
                                                      12
knowerce|co




The Application
14
15
16
17
Quality Conflicts
           browsing




           editing

                      18
19
Searching

■ Google-style keywords: michal kovac
■ exclusion with minus sign: michal -kovac
■ wildcards: michal kovac*
■ refinements:
  ■ field:name field:city
  ■ dataset:eurodonations
  ■ category:donations

                                             20
knowerce|co




Under the Hood*


            *slightly technical
files with data

      web




staging        ETL             datasets       application
              extraction
            transformation
                loading



                                                            22
company register extraction

                   public procurement extraction

                   ...




      extraction          staging                loading   datasets
web     manager


                  temporary or downloaded files


                                                                23
knowerce|co




     Application
Programming Interface
25
ugly <html>   neat CSV
                         26
<?xml version="1.0" encoding="UTF-8"?>
<dataset-description>
  <category-id type="integer">1</category-id>
  <collection-mode></collection-mode>
  <created-at type="datetime">2009-09-13T09:59:51Z</created-at>
  <data-provider></data-provider>
  <data-source-type></data-source-type>
  <database nil="true"></database>
  <format-rule-id type="integer" nil="true"></format-rule-id>
  <granularity></granularity>
  <id type="integer">5</id>
  <identifier>ds_eurodonations</identifier>




               data description
                                                                  27
API




      28
knowerce|co




open and reusable data   29
knowerce|co




new applications         30
knowerce|co




“raw data now!”


         Tim Berners-Lee @ TED
knowerce|co




Thank you
knowerce|co


Copyrights and Credits
■   Silos by Noodle Snacks: http://commons.wikimedia.org/wiki/File:Maria_Cement_Silos.jpg, CC Attribution, Share Alike 3.0 Unported
■   Icons by Oxygen Team: http://www.iconfinder.net/search/1/?q=iconset:oxygen, GPL
■   Icons by Alessandro Rei, KDE, GPL
■   Angel Wings by *Spyrogs, Deviant Art: http://spirogs.deviantart.com/art/Angel-Wings-Tatoo-87089782
■   Coins by Mnemo, Wikimedia Commons, http://commons.wikimedia.org/wiki/File:Swedish_coins_20050924.jpg, CC Attribution, Share Alike 3.0
■   Folder icon: Benji Garner, Icon set: Rise, Free for commercial use
■   Application icon by Sergio Sanchez Lopez, GPL
■   Network icon by Everaldo Coelho, Icon set: Crystal Clear, LGPL




                                                                                                                                                  33

Weitere ähnliche Inhalte

Ähnlich wie Datacamp @ Bar Camp Bratislava

Tech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataTech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big Data
Steve Watt
 
Public private-cloud
Public private-cloudPublic private-cloud
Public private-cloud
Jamie Taylor
 

Ähnlich wie Datacamp @ Bar Camp Bratislava (20)

Infochimps + CloudCon: Infinite Monkey Theorem
Infochimps + CloudCon: Infinite Monkey TheoremInfochimps + CloudCon: Infinite Monkey Theorem
Infochimps + CloudCon: Infinite Monkey Theorem
 
From open data to API-driven business
From open data to API-driven businessFrom open data to API-driven business
From open data to API-driven business
 
Tech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big DataTech4Africa - Opportunities around Big Data
Tech4Africa - Opportunities around Big Data
 
Steve Watt Presentation
Steve Watt PresentationSteve Watt Presentation
Steve Watt Presentation
 
From a hack to Data Mesh (Devoxx 2022)
From a hack to Data Mesh (Devoxx 2022)From a hack to Data Mesh (Devoxx 2022)
From a hack to Data Mesh (Devoxx 2022)
 
Datenstrategie der Zukunft - Technologietrends, die Sie kennen müssen
Datenstrategie der Zukunft - Technologietrends, die Sie kennen müssenDatenstrategie der Zukunft - Technologietrends, die Sie kennen müssen
Datenstrategie der Zukunft - Technologietrends, die Sie kennen müssen
 
What is the future of data strategy?
What is the future of data strategy?What is the future of data strategy?
What is the future of data strategy?
 
DevOpsDaysRiga 2018: Eric Skoglund, Lars Albertsson - Kubernetes as data plat...
DevOpsDaysRiga 2018: Eric Skoglund, Lars Albertsson - Kubernetes as data plat...DevOpsDaysRiga 2018: Eric Skoglund, Lars Albertsson - Kubernetes as data plat...
DevOpsDaysRiga 2018: Eric Skoglund, Lars Albertsson - Kubernetes as data plat...
 
Kubernetes as data platform
Kubernetes as data platformKubernetes as data platform
Kubernetes as data platform
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data Architecture
 
Data Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps ApproachData Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps Approach
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 
[Europe merge world tour] Perforce Europe Merge World Tour Keynote
[Europe   merge world tour] Perforce Europe Merge World Tour Keynote[Europe   merge world tour] Perforce Europe Merge World Tour Keynote
[Europe merge world tour] Perforce Europe Merge World Tour Keynote
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Libera la potenza del Machine Learning
Libera la potenza del Machine LearningLibera la potenza del Machine Learning
Libera la potenza del Machine Learning
 
Showcase Your Data w/ RedMonk Analytics
Showcase Your Data w/ RedMonk AnalyticsShowcase Your Data w/ RedMonk Analytics
Showcase Your Data w/ RedMonk Analytics
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)
 
A view of graph data usage by Cerved
A view of graph data usage by CervedA view of graph data usage by Cerved
A view of graph data usage by Cerved
 
Public private-cloud
Public private-cloudPublic private-cloud
Public private-cloud
 
Open Data Open Innovation and The Cloud gayler berlin nov12
Open Data Open Innovation and The Cloud   gayler berlin nov12Open Data Open Innovation and The Cloud   gayler berlin nov12
Open Data Open Innovation and The Cloud gayler berlin nov12
 

Kürzlich hochgeladen

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Kürzlich hochgeladen (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

Datacamp @ Bar Camp Bratislava