SlideShare ist ein Scribd-Unternehmen logo
1 von 19
StatMine – prototype
0.2
Edwin de Jonge, Jan van der Laan & Jessica Solcer
Statistics Netherlands (CBS)
NTTS 2013, March 6 2013
StatMine
Goal: Improve use figures Statistics Netherlands
How: Add Analysis layer to OutputDB (StatLine)
Working approach:
•
•
•
•

Formulate improvement
Develop software prototype
Test prototype on (real) users
Evaluate

But why?
StatMine

2
Mission SN

“The mission of Statistics Netherlands is to publish
reliable and coherent statistical information that
meets the needs of society” (source: www.cbs.nl)

StatMine 0.2

3
Mission SN

“The mission of Statistics Netherlands is to publish
reliable and coherent statistical information that
meets the needs of society” (source: www.cbs.nl)

StatMine 0.2

4
Evidence-based
policy

5
What is the state of the Netherlands?

StatLine contains over
1.000.000.000 figures!

StatMine

6
Problem 1
Figures ≠ Information

StatMine

7
1. Figures ≠ Information
We know (from user study):
• Some important user don’t get the most out of
StatLine:
• Data journalists
• Policy makers

• They don’t find and see interesting
information, because of tabular presention (data =
table)

StatMine 0.2

8
Solution 1
Visualize
data!

StatMine

9
Problem 2.
Fragmented information

StatMine

10
2. Fragmented information
For policy makers and journalist most information in
OutputDB is fragmented:
• Users need to combine fragments from different
statistics
• Diabetes (insuline usage, hospital admissions,
mortality, visits to doctor, obesity)
• Energy consumption vs economic growth
• Income vs economic growth
• (Perceived) public safety vs registered crimes
StatMine 0.2

11
2. Solution:
Let users
combine
tables

(even if we
wouldn’t …)

StatMine

12
Prototype StatMine 0.2
Implements:
• Visual interactive data browsing
• Combining fragments of different tables

Tested on:
• 40 SN employees (++)
• 40 policy makers (++)

StatMine 0.2

13
Line chart

Bar chart

- Show development

- Compare

Bubble/scatter chart

Mosaic chart

- Show correlation

- Show structure

StatMine 0.2

14
Small multiples

StatMine 0.2

15
StatMine

16
Technical
HTML5
JSON

R

JavaScript

CSS
SVG

• Runs on desktop
• makkelijk over te zetten naar webserver

StatMine 0.2

17
Currently (2013)
• All Official Statistics have confidence interval.
• StatMine 0.3 will test if showing uncertainty
improves/changes understanding of (quality of)
figures.
• May lead to publishing interval estimates (in stead
of point estimates).

StatMine

18
Conclusion
• Visual data browsing is promising for
• Our own statisticians (quality control)
• External policy makers and journalists

• Using real end users for testing is very helpful:
• Lots of suggestions for improvement from users
• Users feel involved in innovation process of NSI

StatMine

19

Weitere ähnliche Inhalte

Was ist angesagt?

When Big Data and Predictive Analytics Collide: Visual Magic Happens
When Big Data and Predictive Analytics Collide: Visual Magic HappensWhen Big Data and Predictive Analytics Collide: Visual Magic Happens
When Big Data and Predictive Analytics Collide: Visual Magic HappensInfini Graph
 
Scary Reporting Projects. Fighting the Data Demon.
Scary Reporting Projects. Fighting the Data Demon. Scary Reporting Projects. Fighting the Data Demon.
Scary Reporting Projects. Fighting the Data Demon. LiveStories
 
Case of success: Visualization as an example for exercising democratic transp...
Case of success: Visualization as an example for exercising democratic transp...Case of success: Visualization as an example for exercising democratic transp...
Case of success: Visualization as an example for exercising democratic transp...Big Data Spain
 
The true meaning of data by Maciej Dabrowski
The true meaning of data by Maciej Dabrowski   The true meaning of data by Maciej Dabrowski
The true meaning of data by Maciej Dabrowski Altocloud
 
The true meaning of data
The true meaning of dataThe true meaning of data
The true meaning of datamdabrowski
 
Statistics vs machine learning: which is more powerful
Statistics vs machine learning: which is more powerfulStatistics vs machine learning: which is more powerful
Statistics vs machine learning: which is more powerfulStat Analytica
 
What's new with analytics in academia?
What's new with analytics in academia?What's new with analytics in academia?
What's new with analytics in academia?InfoTrust LLC
 

Was ist angesagt? (8)

Carrying out analysis
Carrying out analysisCarrying out analysis
Carrying out analysis
 
When Big Data and Predictive Analytics Collide: Visual Magic Happens
When Big Data and Predictive Analytics Collide: Visual Magic HappensWhen Big Data and Predictive Analytics Collide: Visual Magic Happens
When Big Data and Predictive Analytics Collide: Visual Magic Happens
 
Scary Reporting Projects. Fighting the Data Demon.
Scary Reporting Projects. Fighting the Data Demon. Scary Reporting Projects. Fighting the Data Demon.
Scary Reporting Projects. Fighting the Data Demon.
 
Case of success: Visualization as an example for exercising democratic transp...
Case of success: Visualization as an example for exercising democratic transp...Case of success: Visualization as an example for exercising democratic transp...
Case of success: Visualization as an example for exercising democratic transp...
 
The true meaning of data by Maciej Dabrowski
The true meaning of data by Maciej Dabrowski   The true meaning of data by Maciej Dabrowski
The true meaning of data by Maciej Dabrowski
 
The true meaning of data
The true meaning of dataThe true meaning of data
The true meaning of data
 
Statistics vs machine learning: which is more powerful
Statistics vs machine learning: which is more powerfulStatistics vs machine learning: which is more powerful
Statistics vs machine learning: which is more powerful
 
What's new with analytics in academia?
What's new with analytics in academia?What's new with analytics in academia?
What's new with analytics in academia?
 

Andere mochten auch

Grieco - input2012
Grieco -  input2012Grieco -  input2012
Grieco - input2012INPUT 2012
 
Advance statistics 2
Advance statistics 2Advance statistics 2
Advance statistics 2Tim Arroyo
 
Using Technology To Achieve Total Worker Health
Using Technology To Achieve Total Worker HealthUsing Technology To Achieve Total Worker Health
Using Technology To Achieve Total Worker HealthMedgate Inc.
 
Social Media Statistics - 2010 update
Social Media Statistics - 2010 updateSocial Media Statistics - 2010 update
Social Media Statistics - 2010 updateSocial Media MC
 
Alarming Social Media Statistics for Real Estate Professionals
Alarming Social Media Statistics for Real Estate ProfessionalsAlarming Social Media Statistics for Real Estate Professionals
Alarming Social Media Statistics for Real Estate ProfessionalsDoug Devitre
 
Technology and open knowledge in sports statistics
Technology and open knowledge in sports statisticsTechnology and open knowledge in sports statistics
Technology and open knowledge in sports statisticsdwiederman
 
22538598 introduction-to-research-methodology-acccording-to-jntu-hyd-mba-syll...
22538598 introduction-to-research-methodology-acccording-to-jntu-hyd-mba-syll...22538598 introduction-to-research-methodology-acccording-to-jntu-hyd-mba-syll...
22538598 introduction-to-research-methodology-acccording-to-jntu-hyd-mba-syll...lavanya758
 
PPT for report-Cambodai
PPT for report-CambodaiPPT for report-Cambodai
PPT for report-Cambodaijayan_sri
 
Ta2.09 5 bersales.philippines digitization ppp bersales jan 14 2017
Ta2.09 5 bersales.philippines digitization ppp bersales jan 14 2017Ta2.09 5 bersales.philippines digitization ppp bersales jan 14 2017
Ta2.09 5 bersales.philippines digitization ppp bersales jan 14 2017Statistics South Africa
 
Best Computer Jobs for the Future | High Pay & Fast Growth
Best Computer Jobs for the Future | High Pay & Fast GrowthBest Computer Jobs for the Future | High Pay & Fast Growth
Best Computer Jobs for the Future | High Pay & Fast GrowthITCareerFinder
 
Turning Numbers into Knowledge: A Statistics Dashboard
Turning Numbers into Knowledge: A Statistics DashboardTurning Numbers into Knowledge: A Statistics Dashboard
Turning Numbers into Knowledge: A Statistics DashboardWiLS
 
Teaching High School Statistics and use of Technology
Teaching High School Statistics and use of TechnologyTeaching High School Statistics and use of Technology
Teaching High School Statistics and use of Technologysimoninamerica
 
Marketing Music Education: Recent facts, quotes and statistics that YOU can u...
Marketing Music Education: Recent facts, quotes and statistics that YOU can u...Marketing Music Education: Recent facts, quotes and statistics that YOU can u...
Marketing Music Education: Recent facts, quotes and statistics that YOU can u...Kathleen Heuer
 
Using assessment data
Using assessment dataUsing assessment data
Using assessment datafcaristo
 
Maddaloni, daniela, descriptive statistics
Maddaloni, daniela, descriptive statisticsMaddaloni, daniela, descriptive statistics
Maddaloni, daniela, descriptive statisticsdvmaddaloni
 
WSC 2011, advanced tutorial on simulation in Statistics
WSC 2011, advanced tutorial on simulation in StatisticsWSC 2011, advanced tutorial on simulation in Statistics
WSC 2011, advanced tutorial on simulation in StatisticsChristian Robert
 
Introduction to Twitter in Higher Education workshop for SIGMA 2014
Introduction to Twitter in Higher Education workshop  for SIGMA 2014Introduction to Twitter in Higher Education workshop  for SIGMA 2014
Introduction to Twitter in Higher Education workshop for SIGMA 2014Alex Spiers
 
Advance Statistics - Wilcoxon Signed Rank Test
Advance Statistics - Wilcoxon Signed Rank TestAdvance Statistics - Wilcoxon Signed Rank Test
Advance Statistics - Wilcoxon Signed Rank TestJoshua Batalla
 

Andere mochten auch (20)

Grieco - input2012
Grieco -  input2012Grieco -  input2012
Grieco - input2012
 
Advance statistics 2
Advance statistics 2Advance statistics 2
Advance statistics 2
 
Using Technology To Achieve Total Worker Health
Using Technology To Achieve Total Worker HealthUsing Technology To Achieve Total Worker Health
Using Technology To Achieve Total Worker Health
 
Social Media Statistics - 2010 update
Social Media Statistics - 2010 updateSocial Media Statistics - 2010 update
Social Media Statistics - 2010 update
 
Bus and coach
Bus and coachBus and coach
Bus and coach
 
Alarming Social Media Statistics for Real Estate Professionals
Alarming Social Media Statistics for Real Estate ProfessionalsAlarming Social Media Statistics for Real Estate Professionals
Alarming Social Media Statistics for Real Estate Professionals
 
Technology and open knowledge in sports statistics
Technology and open knowledge in sports statisticsTechnology and open knowledge in sports statistics
Technology and open knowledge in sports statistics
 
22538598 introduction-to-research-methodology-acccording-to-jntu-hyd-mba-syll...
22538598 introduction-to-research-methodology-acccording-to-jntu-hyd-mba-syll...22538598 introduction-to-research-methodology-acccording-to-jntu-hyd-mba-syll...
22538598 introduction-to-research-methodology-acccording-to-jntu-hyd-mba-syll...
 
PPT for report-Cambodai
PPT for report-CambodaiPPT for report-Cambodai
PPT for report-Cambodai
 
Ta2.09 5 bersales.philippines digitization ppp bersales jan 14 2017
Ta2.09 5 bersales.philippines digitization ppp bersales jan 14 2017Ta2.09 5 bersales.philippines digitization ppp bersales jan 14 2017
Ta2.09 5 bersales.philippines digitization ppp bersales jan 14 2017
 
Best Computer Jobs for the Future | High Pay & Fast Growth
Best Computer Jobs for the Future | High Pay & Fast GrowthBest Computer Jobs for the Future | High Pay & Fast Growth
Best Computer Jobs for the Future | High Pay & Fast Growth
 
Turning Numbers into Knowledge: A Statistics Dashboard
Turning Numbers into Knowledge: A Statistics DashboardTurning Numbers into Knowledge: A Statistics Dashboard
Turning Numbers into Knowledge: A Statistics Dashboard
 
Chapter 01
Chapter 01Chapter 01
Chapter 01
 
Teaching High School Statistics and use of Technology
Teaching High School Statistics and use of TechnologyTeaching High School Statistics and use of Technology
Teaching High School Statistics and use of Technology
 
Marketing Music Education: Recent facts, quotes and statistics that YOU can u...
Marketing Music Education: Recent facts, quotes and statistics that YOU can u...Marketing Music Education: Recent facts, quotes and statistics that YOU can u...
Marketing Music Education: Recent facts, quotes and statistics that YOU can u...
 
Using assessment data
Using assessment dataUsing assessment data
Using assessment data
 
Maddaloni, daniela, descriptive statistics
Maddaloni, daniela, descriptive statisticsMaddaloni, daniela, descriptive statistics
Maddaloni, daniela, descriptive statistics
 
WSC 2011, advanced tutorial on simulation in Statistics
WSC 2011, advanced tutorial on simulation in StatisticsWSC 2011, advanced tutorial on simulation in Statistics
WSC 2011, advanced tutorial on simulation in Statistics
 
Introduction to Twitter in Higher Education workshop for SIGMA 2014
Introduction to Twitter in Higher Education workshop  for SIGMA 2014Introduction to Twitter in Higher Education workshop  for SIGMA 2014
Introduction to Twitter in Higher Education workshop for SIGMA 2014
 
Advance Statistics - Wilcoxon Signed Rank Test
Advance Statistics - Wilcoxon Signed Rank TestAdvance Statistics - Wilcoxon Signed Rank Test
Advance Statistics - Wilcoxon Signed Rank Test
 

Ähnlich wie StatMine (New Technologies and Techniques for Statistics)

StatMine, visual exploration of output data
StatMine, visual exploration of output dataStatMine, visual exploration of output data
StatMine, visual exploration of output dataEdwin de Jonge
 
Views you can use: data visualization | LSC Technology Initiative Grant Confe...
Views you can use: data visualization | LSC Technology Initiative Grant Confe...Views you can use: data visualization | LSC Technology Initiative Grant Confe...
Views you can use: data visualization | LSC Technology Initiative Grant Confe...Legal Services Corporation
 
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...BigData_Europe
 
WWV2015: Jibes Paul van der Hulst big data
WWV2015: Jibes Paul van der Hulst big dataWWV2015: Jibes Paul van der Hulst big data
WWV2015: Jibes Paul van der Hulst big datawebwinkelvakdag
 
Responsible Data Science at Statistics Netherlands
Responsible Data Science at Statistics NetherlandsResponsible Data Science at Statistics Netherlands
Responsible Data Science at Statistics NetherlandsPiet J.H. Daas
 
Statista Corporate Account Features
Statista Corporate Account FeaturesStatista Corporate Account Features
Statista Corporate Account FeaturesStatista
 
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Michael Lew
 
big data analytics pgpmx2015
big data analytics pgpmx2015big data analytics pgpmx2015
big data analytics pgpmx2015Sanmeet Dhokay
 
Equals Seed Funding Presentation
Equals Seed Funding PresentationEquals Seed Funding Presentation
Equals Seed Funding PresentationDevLoadco
 
Economics & Statistics Insights in Data Science by DataPerts Technologies
Economics & Statistics Insights in Data Science by DataPerts TechnologiesEconomics & Statistics Insights in Data Science by DataPerts Technologies
Economics & Statistics Insights in Data Science by DataPerts TechnologiesRavindra Panwar
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
Tableau 2018 - Introduction to Visual analytics
Tableau 2018 - Introduction to Visual analyticsTableau 2018 - Introduction to Visual analytics
Tableau 2018 - Introduction to Visual analyticsArun K
 
Big Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation SlidesBig Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation SlidesSlideTeam
 
Scaling up your Analytics & Insights
Scaling up your Analytics & InsightsScaling up your Analytics & Insights
Scaling up your Analytics & InsightsLoQutus
 
Big Data Analytics for BI, BA and QA
Big Data Analytics for BI, BA and QABig Data Analytics for BI, BA and QA
Big Data Analytics for BI, BA and QADmitry Tolpeko
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBala Iyer
 
Introduction to einstein analytics
Introduction to einstein analyticsIntroduction to einstein analytics
Introduction to einstein analyticsJan Vandevelde
 
Introduction to einstein analytics
Introduction to einstein analyticsIntroduction to einstein analytics
Introduction to einstein analyticsSteven Hugo
 
How to Achieve Self-Service Analytics with a Governed Data Services Layer (UK)
How to Achieve Self-Service Analytics with a Governed Data Services Layer (UK)How to Achieve Self-Service Analytics with a Governed Data Services Layer (UK)
How to Achieve Self-Service Analytics with a Governed Data Services Layer (UK)Denodo
 

Ähnlich wie StatMine (New Technologies and Techniques for Statistics) (20)

StatMine
StatMineStatMine
StatMine
 
StatMine, visual exploration of output data
StatMine, visual exploration of output dataStatMine, visual exploration of output data
StatMine, visual exploration of output data
 
Views you can use: data visualization | LSC Technology Initiative Grant Confe...
Views you can use: data visualization | LSC Technology Initiative Grant Confe...Views you can use: data visualization | LSC Technology Initiative Grant Confe...
Views you can use: data visualization | LSC Technology Initiative Grant Confe...
 
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
 
WWV2015: Jibes Paul van der Hulst big data
WWV2015: Jibes Paul van der Hulst big dataWWV2015: Jibes Paul van der Hulst big data
WWV2015: Jibes Paul van der Hulst big data
 
Responsible Data Science at Statistics Netherlands
Responsible Data Science at Statistics NetherlandsResponsible Data Science at Statistics Netherlands
Responsible Data Science at Statistics Netherlands
 
Statista Corporate Account Features
Statista Corporate Account FeaturesStatista Corporate Account Features
Statista Corporate Account Features
 
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
Data Mining & Predictive Analytics - Lesson 14 - Concepts Recapitulation and ...
 
big data analytics pgpmx2015
big data analytics pgpmx2015big data analytics pgpmx2015
big data analytics pgpmx2015
 
Equals Seed Funding Presentation
Equals Seed Funding PresentationEquals Seed Funding Presentation
Equals Seed Funding Presentation
 
Economics & Statistics Insights in Data Science by DataPerts Technologies
Economics & Statistics Insights in Data Science by DataPerts TechnologiesEconomics & Statistics Insights in Data Science by DataPerts Technologies
Economics & Statistics Insights in Data Science by DataPerts Technologies
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
Tableau 2018 - Introduction to Visual analytics
Tableau 2018 - Introduction to Visual analyticsTableau 2018 - Introduction to Visual analytics
Tableau 2018 - Introduction to Visual analytics
 
Big Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation SlidesBig Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation Slides
 
Scaling up your Analytics & Insights
Scaling up your Analytics & InsightsScaling up your Analytics & Insights
Scaling up your Analytics & Insights
 
Big Data Analytics for BI, BA and QA
Big Data Analytics for BI, BA and QABig Data Analytics for BI, BA and QA
Big Data Analytics for BI, BA and QA
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
 
Introduction to einstein analytics
Introduction to einstein analyticsIntroduction to einstein analytics
Introduction to einstein analytics
 
Introduction to einstein analytics
Introduction to einstein analyticsIntroduction to einstein analytics
Introduction to einstein analytics
 
How to Achieve Self-Service Analytics with a Governed Data Services Layer (UK)
How to Achieve Self-Service Analytics with a Governed Data Services Layer (UK)How to Achieve Self-Service Analytics with a Governed Data Services Layer (UK)
How to Achieve Self-Service Analytics with a Governed Data Services Layer (UK)
 

Mehr von Edwin de Jonge

Validatetools, resolve and simplify contradictive or data validation rules
Validatetools, resolve and simplify contradictive or data validation rulesValidatetools, resolve and simplify contradictive or data validation rules
Validatetools, resolve and simplify contradictive or data validation rulesEdwin de Jonge
 
Data error! But where?
Data error! But where?Data error! But where?
Data error! But where?Edwin de Jonge
 
Daff: diff, patch and merge for data.frame
Daff: diff, patch and merge for data.frameDaff: diff, patch and merge for data.frame
Daff: diff, patch and merge for data.frameEdwin de Jonge
 
Chunked, dplyr for large text files
Chunked, dplyr for large text filesChunked, dplyr for large text files
Chunked, dplyr for large text filesEdwin de Jonge
 
Heatmaps best practices Strata Hadoop
Heatmaps best practices Strata HadoopHeatmaps best practices Strata Hadoop
Heatmaps best practices Strata HadoopEdwin de Jonge
 
Docopt, beautiful command-line options for R, user2014
Docopt, beautiful command-line options for R,  user2014Docopt, beautiful command-line options for R,  user2014
Docopt, beautiful command-line options for R, user2014Edwin de Jonge
 
Big Data Visualization
Big Data VisualizationBig Data Visualization
Big Data VisualizationEdwin de Jonge
 
ffbase, statistical functions for large datasets
ffbase, statistical functions for large datasetsffbase, statistical functions for large datasets
ffbase, statistical functions for large datasetsEdwin de Jonge
 
Tabplotd3, interactive inspection of large data
Tabplotd3, interactive inspection of large dataTabplotd3, interactive inspection of large data
Tabplotd3, interactive inspection of large dataEdwin de Jonge
 
Big data as a source for official statistics
Big data as a source for official statisticsBig data as a source for official statistics
Big data as a source for official statisticsEdwin de Jonge
 
Statmine, Visuele dataexploratie
Statmine, Visuele dataexploratieStatmine, Visuele dataexploratie
Statmine, Visuele dataexploratieEdwin de Jonge
 

Mehr von Edwin de Jonge (13)

sdcSpatial user!2019
sdcSpatial user!2019sdcSpatial user!2019
sdcSpatial user!2019
 
Validatetools, resolve and simplify contradictive or data validation rules
Validatetools, resolve and simplify contradictive or data validation rulesValidatetools, resolve and simplify contradictive or data validation rules
Validatetools, resolve and simplify contradictive or data validation rules
 
Data error! But where?
Data error! But where?Data error! But where?
Data error! But where?
 
Daff: diff, patch and merge for data.frame
Daff: diff, patch and merge for data.frameDaff: diff, patch and merge for data.frame
Daff: diff, patch and merge for data.frame
 
Chunked, dplyr for large text files
Chunked, dplyr for large text filesChunked, dplyr for large text files
Chunked, dplyr for large text files
 
Heatmaps best practices Strata Hadoop
Heatmaps best practices Strata HadoopHeatmaps best practices Strata Hadoop
Heatmaps best practices Strata Hadoop
 
Docopt, beautiful command-line options for R, user2014
Docopt, beautiful command-line options for R,  user2014Docopt, beautiful command-line options for R,  user2014
Docopt, beautiful command-line options for R, user2014
 
Big data experiments
Big data experimentsBig data experiments
Big data experiments
 
Big Data Visualization
Big Data VisualizationBig Data Visualization
Big Data Visualization
 
ffbase, statistical functions for large datasets
ffbase, statistical functions for large datasetsffbase, statistical functions for large datasets
ffbase, statistical functions for large datasets
 
Tabplotd3, interactive inspection of large data
Tabplotd3, interactive inspection of large dataTabplotd3, interactive inspection of large data
Tabplotd3, interactive inspection of large data
 
Big data as a source for official statistics
Big data as a source for official statisticsBig data as a source for official statistics
Big data as a source for official statistics
 
Statmine, Visuele dataexploratie
Statmine, Visuele dataexploratieStatmine, Visuele dataexploratie
Statmine, Visuele dataexploratie
 

Kürzlich hochgeladen

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 

Kürzlich hochgeladen (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

StatMine (New Technologies and Techniques for Statistics)

  • 1. StatMine – prototype 0.2 Edwin de Jonge, Jan van der Laan & Jessica Solcer Statistics Netherlands (CBS) NTTS 2013, March 6 2013
  • 2. StatMine Goal: Improve use figures Statistics Netherlands How: Add Analysis layer to OutputDB (StatLine) Working approach: • • • • Formulate improvement Develop software prototype Test prototype on (real) users Evaluate But why? StatMine 2
  • 3. Mission SN “The mission of Statistics Netherlands is to publish reliable and coherent statistical information that meets the needs of society” (source: www.cbs.nl) StatMine 0.2 3
  • 4. Mission SN “The mission of Statistics Netherlands is to publish reliable and coherent statistical information that meets the needs of society” (source: www.cbs.nl) StatMine 0.2 4
  • 6. What is the state of the Netherlands? StatLine contains over 1.000.000.000 figures! StatMine 6
  • 7. Problem 1 Figures ≠ Information StatMine 7
  • 8. 1. Figures ≠ Information We know (from user study): • Some important user don’t get the most out of StatLine: • Data journalists • Policy makers • They don’t find and see interesting information, because of tabular presention (data = table) StatMine 0.2 8
  • 11. 2. Fragmented information For policy makers and journalist most information in OutputDB is fragmented: • Users need to combine fragments from different statistics • Diabetes (insuline usage, hospital admissions, mortality, visits to doctor, obesity) • Energy consumption vs economic growth • Income vs economic growth • (Perceived) public safety vs registered crimes StatMine 0.2 11
  • 12. 2. Solution: Let users combine tables (even if we wouldn’t …) StatMine 12
  • 13. Prototype StatMine 0.2 Implements: • Visual interactive data browsing • Combining fragments of different tables Tested on: • 40 SN employees (++) • 40 policy makers (++) StatMine 0.2 13
  • 14. Line chart Bar chart - Show development - Compare Bubble/scatter chart Mosaic chart - Show correlation - Show structure StatMine 0.2 14
  • 17. Technical HTML5 JSON R JavaScript CSS SVG • Runs on desktop • makkelijk over te zetten naar webserver StatMine 0.2 17
  • 18. Currently (2013) • All Official Statistics have confidence interval. • StatMine 0.3 will test if showing uncertainty improves/changes understanding of (quality of) figures. • May lead to publishing interval estimates (in stead of point estimates). StatMine 18
  • 19. Conclusion • Visual data browsing is promising for • Our own statisticians (quality control) • External policy makers and journalists • Using real end users for testing is very helpful: • Lots of suggestions for improvement from users • Users feel involved in innovation process of NSI StatMine 19