SlideShare ist ein Scribd-Unternehmen logo
1 von 44
Datavis Barista: How to choose what
dataviz tool, and when
Jen Stirrup
Founder, Data Relish
Level: Intermediate
Who am I? Jen Stirrup
What dataviz tool to choose, and
when?
• SSRS
• Excel
• Tableau
• Power BI
• Datazen
• Kibana
What makes a good Visualisation?
• Effective
• Accurate
• Efficient
• Aesthetics
• Adaptable
https://www.quora.com/What-are-the-worst-infographics-youve-ever-seen
http://www.designyourway.net/blog/inspiration/when-infographics-go-
bad-or-how-not-to-design-data-visualization/
What makes a good Visualisation?
• Effective
• Accurate
• Efficient
• Aesthetics
• Adaptable
Excel, Datazen, SSRS, Power BI
SQL Server 2016
SQL Server 2016
SQL Server 2016
Datazen
Treemaps in Power BI
Waterfall Charts in Power BI
Example in Power BI, SSRS
• Demo
Why R?
• most widely used data analysis software - used by 2M + data scientist,
statisticians and analysts
• Most powerful statistical programming language
• flexible, extensible and comprehensive for productivity
• Create beautiful and unique data visualisations - as seen in New York Times,
Twitter and Flowing Data
• Thriving open-source community - leading edge of analytics research
• Fills the talent gap - new graduates prefer R.
Growth in Demand
• Rexer Data Mining survey, 2007 - 2013
• R is the highest paid IT skill Dice.com, Jan 2014
• R most used-data science language after SQL -
O'Reilly, Jan 2014
• R is used by 70% of data miners. Rexer, Sept 2013
Growth in Demand
• R is #15 of all programming languages. REdMonk, Jan
2014
• R growing faster than any other data science language.
KDNuggs.
• R is in-memory and limited in the size of data that you
can process.
What do I need to install?
• Install R – www.r-project.org
• Install Rstudio – www.rstudio.com
• Handy Shortcuts
• Tab – autocomplete of available functions
• Control and Up Arrow – History
• Control and enter – executes the line of code
What tools do we have in R?
• 80% of your time will be spent preparing and wrangling data
• The remainder of your time will be spent complaining about it.
• dplyr: the essential data manipulation toolset
• In data wrangling, what are the main tasks?
• – Filtering rows
– Selecting columns of data
– Adding new variables
– Sorting
– Aggregating
Example in R
• Demo
The Big Data problem
• Reaction Time
• Enrichment
• Insights
• Optimize for query, not for storage.
Can you check the errors between 12.02 and 12.04
yesterday?
Can you check the errors between 12.02 and 12.04 last
Friday?
…. Are you kidding me?
What is Kibana?
Kibana
• It is highly customizable dashboarding
• It is constituted of panels:
– Time picker / Query / Filtering
– Charts / Table / Text
Flexible analytics and visualization platform
Real-time summary and charting of streaming
data
Intuitive interface for a variety of users
Instant sharing and embedding of dashboards
To better understand large volumes of data..
• easily create bar charts
• line and scatter plots
• Histograms
• pie charts
• maps.
To better understand large volumes of data..
• easily create bar charts
• line and scatter plots
• Histograms
• pie charts
• maps.
Kibana DataViz Types
Default Chart Types
Chart Type Basis Values Types Purpose
Histogram Timestamp based Count, Mean, Total Barlines, stacks,
percentages
Queries
Table Paging Fields list Highlighting,
sorting
Fine grained
analysis
Pie Charts Terms Missing terms,
other
Doughnut, legends,
tables
Proportion
Summary
• SSRS
• Excel
• Tableau
• Power BI
• Datazen
• Kibana

Weitere ähnliche Inhalte

Was ist angesagt?

Optier presentation for open analytics event
Optier presentation for open analytics eventOptier presentation for open analytics event
Optier presentation for open analytics event
Open Analytics
 
Big data-science-oanyc
Big data-science-oanycBig data-science-oanyc
Big data-science-oanyc
Open Analytics
 
Big data bi-mature-oanyc summit
Big data bi-mature-oanyc summitBig data bi-mature-oanyc summit
Big data bi-mature-oanyc summit
Open Analytics
 

Was ist angesagt? (20)

Optier presentation for open analytics event
Optier presentation for open analytics eventOptier presentation for open analytics event
Optier presentation for open analytics event
 
Big data-science-oanyc
Big data-science-oanycBig data-science-oanyc
Big data-science-oanyc
 
Big data bi-mature-oanyc summit
Big data bi-mature-oanyc summitBig data bi-mature-oanyc summit
Big data bi-mature-oanyc summit
 
Synapse NanoApps
Synapse NanoAppsSynapse NanoApps
Synapse NanoApps
 
Rijuta Wagh Resume
Rijuta Wagh ResumeRijuta Wagh Resume
Rijuta Wagh Resume
 
Kyvos Insights
Kyvos Insights Kyvos Insights
Kyvos Insights
 
Office 360 and Spark
Office 360 and Spark Office 360 and Spark
Office 360 and Spark
 
Counting is easy, Measuring is Hard! Dashboard Design
Counting is easy, Measuring is Hard! Dashboard DesignCounting is easy, Measuring is Hard! Dashboard Design
Counting is easy, Measuring is Hard! Dashboard Design
 
Exploring Big Data Analytics Tools
Exploring Big Data Analytics ToolsExploring Big Data Analytics Tools
Exploring Big Data Analytics Tools
 
Big data from the trenches
Big data from the trenchesBig data from the trenches
Big data from the trenches
 
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
 
Building data "Py-pelines"
Building data "Py-pelines"Building data "Py-pelines"
Building data "Py-pelines"
 
Big Data Visualisation with Hadoop and PowerPivot
Big Data Visualisation with Hadoop and PowerPivotBig Data Visualisation with Hadoop and PowerPivot
Big Data Visualisation with Hadoop and PowerPivot
 
Bi 2.0 hadoop everywhere
Bi 2.0   hadoop everywhereBi 2.0   hadoop everywhere
Bi 2.0 hadoop everywhere
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AI
 
Top Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesTop Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practices
 
Data Con LA 2019 - Big Data Modeling with Spark SQL: Make data valuable by Ja...
Data Con LA 2019 - Big Data Modeling with Spark SQL: Make data valuable by Ja...Data Con LA 2019 - Big Data Modeling with Spark SQL: Make data valuable by Ja...
Data Con LA 2019 - Big Data Modeling with Spark SQL: Make data valuable by Ja...
 
The Rise of the DataOps - Dataiku - J On the Beach 2016
The Rise of the DataOps - Dataiku - J On the Beach 2016 The Rise of the DataOps - Dataiku - J On the Beach 2016
The Rise of the DataOps - Dataiku - J On the Beach 2016
 
Kyvos insights
Kyvos insightsKyvos insights
Kyvos insights
 
Here are some of the things our Data Analytics team can do
Here are some of the things our Data Analytics team can doHere are some of the things our Data Analytics team can do
Here are some of the things our Data Analytics team can do
 

Andere mochten auch

Visualising the tabular model for power view upload
Visualising the tabular model for power view uploadVisualising the tabular model for power view upload
Visualising the tabular model for power view upload
Jen Stirrup
 

Andere mochten auch (20)

Restructuring Technical Debt - A Software and System Quality Approach
Restructuring Technical Debt - A Software and System Quality ApproachRestructuring Technical Debt - A Software and System Quality Approach
Restructuring Technical Debt - A Software and System Quality Approach
 
Realtime analytics with_hadoop
Realtime analytics with_hadoopRealtime analytics with_hadoop
Realtime analytics with_hadoop
 
Cloud computing by Bhavesh
Cloud computing by BhaveshCloud computing by Bhavesh
Cloud computing by Bhavesh
 
Visualising the tabular model for power view upload
Visualising the tabular model for power view uploadVisualising the tabular model for power view upload
Visualising the tabular model for power view upload
 
Digital Pragmatism with Business Intelligence, Big Data and Data Visualisation
Digital Pragmatism with Business Intelligence, Big Data and Data VisualisationDigital Pragmatism with Business Intelligence, Big Data and Data Visualisation
Digital Pragmatism with Business Intelligence, Big Data and Data Visualisation
 
Cloud Computing Architecture Primer
Cloud Computing Architecture PrimerCloud Computing Architecture Primer
Cloud Computing Architecture Primer
 
System Quality Attributes for Software Architecture
System Quality Attributes for Software ArchitectureSystem Quality Attributes for Software Architecture
System Quality Attributes for Software Architecture
 
Windows Azure HDInsight Service
Windows Azure HDInsight ServiceWindows Azure HDInsight Service
Windows Azure HDInsight Service
 
How Universities Use Big Data to Transform Education
How Universities Use Big Data to Transform EducationHow Universities Use Big Data to Transform Education
How Universities Use Big Data to Transform Education
 
Intorducing Big Data and Microsoft Azure
Intorducing Big Data and Microsoft AzureIntorducing Big Data and Microsoft Azure
Intorducing Big Data and Microsoft Azure
 
Hive - 1455: Cloud Storage
Hive - 1455: Cloud StorageHive - 1455: Cloud Storage
Hive - 1455: Cloud Storage
 
How to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDBHow to Use Apache Zeppelin with HWX HDB
How to Use Apache Zeppelin with HWX HDB
 
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake AnalyticsBuilding the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake Analytics
 
Dynamic Column Masking and Row-Level Filtering in HDP
Dynamic Column Masking and Row-Level Filtering in HDPDynamic Column Masking and Row-Level Filtering in HDP
Dynamic Column Masking and Row-Level Filtering in HDP
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
 
Big Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft AzureBig Data Analytics in the Cloud with Microsoft Azure
Big Data Analytics in the Cloud with Microsoft Azure
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
 
Data science with Windows Azure - A Brief Introduction
Data science with Windows Azure - A Brief IntroductionData science with Windows Azure - A Brief Introduction
Data science with Windows Azure - A Brief Introduction
 
Spark with Azure HDInsight - Tampa Bay Data Science - Adnan Masood, PhD
Spark with Azure HDInsight  - Tampa Bay Data Science - Adnan Masood, PhDSpark with Azure HDInsight  - Tampa Bay Data Science - Adnan Masood, PhD
Spark with Azure HDInsight - Tampa Bay Data Science - Adnan Masood, PhD
 

Ähnlich wie Business Intelligence Barista: What DataViz Tool to Use, and When?

Power BI - 2016 - Public
Power BI - 2016 - PublicPower BI - 2016 - Public
Power BI - 2016 - Public
Julian Payne
 
The New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data ExplorationThe New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data Exploration
Inside Analysis
 

Ähnlich wie Business Intelligence Barista: What DataViz Tool to Use, and When? (20)

Power BI - 2016 - Public
Power BI - 2016 - PublicPower BI - 2016 - Public
Power BI - 2016 - Public
 
Visualising montioring and evaluation data
Visualising montioring and evaluation dataVisualising montioring and evaluation data
Visualising montioring and evaluation data
 
Tableau Seattle BI Event How Tableau Changed My Life
Tableau Seattle BI Event How Tableau Changed My LifeTableau Seattle BI Event How Tableau Changed My Life
Tableau Seattle BI Event How Tableau Changed My Life
 
The New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data ExplorationThe New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data Exploration
 
Tableau
TableauTableau
Tableau
 
Text Mining & Sentiment Analysis with Power BI & Azure
Text Mining & Sentiment Analysis with Power BI & AzureText Mining & Sentiment Analysis with Power BI & Azure
Text Mining & Sentiment Analysis with Power BI & Azure
 
Education Analytics
Education AnalyticsEducation Analytics
Education Analytics
 
Analytical tools
Analytical toolsAnalytical tools
Analytical tools
 
Data Visualization - UC Analytics Conference 2018
Data Visualization - UC Analytics Conference 2018Data Visualization - UC Analytics Conference 2018
Data Visualization - UC Analytics Conference 2018
 
Market research of the analytics tools
Market research of the analytics toolsMarket research of the analytics tools
Market research of the analytics tools
 
Tableau
TableauTableau
Tableau
 
Riga dev day 2016 adding a data reservoir and oracle bdd to extend your ora...
Riga dev day 2016   adding a data reservoir and oracle bdd to extend your ora...Riga dev day 2016   adding a data reservoir and oracle bdd to extend your ora...
Riga dev day 2016 adding a data reservoir and oracle bdd to extend your ora...
 
Public Administration Analytics
Public Administration AnalyticsPublic Administration Analytics
Public Administration Analytics
 
IT and Telco Analytics
IT and Telco AnalyticsIT and Telco Analytics
IT and Telco Analytics
 
Spark Summit Keynote by Suren Nathan
Spark Summit Keynote by Suren NathanSpark Summit Keynote by Suren Nathan
Spark Summit Keynote by Suren Nathan
 
Pharma Analytics
Pharma AnalyticsPharma Analytics
Pharma Analytics
 
Human Resources Analytics
Human Resources AnalyticsHuman Resources Analytics
Human Resources Analytics
 
Career in Data Using Tableau
Career in Data Using TableauCareer in Data Using Tableau
Career in Data Using Tableau
 
Lean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science teamLean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science team
 
Design Systems at Scale
Design Systems at ScaleDesign Systems at Scale
Design Systems at Scale
 

Mehr von Jen Stirrup

Mehr von Jen Stirrup (20)

AI Applications in Healthcare and Medicine.pdf
AI Applications in Healthcare and Medicine.pdfAI Applications in Healthcare and Medicine.pdf
AI Applications in Healthcare and Medicine.pdf
 
BUILDING A STRONG FOUNDATION FOR SUCCESS WITH BI AND DIGITAL TRANSFORMATION
BUILDING A STRONG FOUNDATION FOR SUCCESS WITH BI AND DIGITAL TRANSFORMATIONBUILDING A STRONG FOUNDATION FOR SUCCESS WITH BI AND DIGITAL TRANSFORMATION
BUILDING A STRONG FOUNDATION FOR SUCCESS WITH BI AND DIGITAL TRANSFORMATION
 
CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners
 
Artificial Intelligence Ethics keynote: With Great Power, comes Great Respons...
Artificial Intelligence Ethics keynote: With Great Power, comes Great Respons...Artificial Intelligence Ethics keynote: With Great Power, comes Great Respons...
Artificial Intelligence Ethics keynote: With Great Power, comes Great Respons...
 
1 Introduction to Microsoft data platform analytics for release
1 Introduction to Microsoft data platform analytics for release1 Introduction to Microsoft data platform analytics for release
1 Introduction to Microsoft data platform analytics for release
 
5 Comparing Microsoft Big Data Technologies for Analytics
5 Comparing Microsoft Big Data Technologies for Analytics5 Comparing Microsoft Big Data Technologies for Analytics
5 Comparing Microsoft Big Data Technologies for Analytics
 
Comparing Microsoft Big Data Platform Technologies
Comparing Microsoft Big Data Platform TechnologiesComparing Microsoft Big Data Platform Technologies
Comparing Microsoft Big Data Platform Technologies
 
Introduction to Analytics with Azure Notebooks and Python
Introduction to Analytics with Azure Notebooks and PythonIntroduction to Analytics with Azure Notebooks and Python
Introduction to Analytics with Azure Notebooks and Python
 
Sales Analytics in Power BI
Sales Analytics in Power BISales Analytics in Power BI
Sales Analytics in Power BI
 
Analytics for Marketing
Analytics for MarketingAnalytics for Marketing
Analytics for Marketing
 
Diversity and inclusion for the newbies and doers
Diversity and inclusion for the newbies and doersDiversity and inclusion for the newbies and doers
Diversity and inclusion for the newbies and doers
 
Artificial Intelligence from the Business perspective
Artificial Intelligence from the Business perspectiveArtificial Intelligence from the Business perspective
Artificial Intelligence from the Business perspective
 
How to be successful with Artificial Intelligence - from small to success
How to be successful with Artificial Intelligence - from small to successHow to be successful with Artificial Intelligence - from small to success
How to be successful with Artificial Intelligence - from small to success
 
Artificial Intelligence: Winning the Red Queen’s Race Keynote at ESPC with Je...
Artificial Intelligence: Winning the Red Queen’s Race Keynote at ESPC with Je...Artificial Intelligence: Winning the Red Queen’s Race Keynote at ESPC with Je...
Artificial Intelligence: Winning the Red Queen’s Race Keynote at ESPC with Je...
 
Data Visualization dataviz superpower
Data Visualization dataviz superpowerData Visualization dataviz superpower
Data Visualization dataviz superpower
 
R - what do the numbers mean? #RStats
R - what do the numbers mean? #RStatsR - what do the numbers mean? #RStats
R - what do the numbers mean? #RStats
 
Artificial Intelligence and Deep Learning in Azure, CNTK and Tensorflow
Artificial Intelligence and Deep Learning in Azure, CNTK and TensorflowArtificial Intelligence and Deep Learning in Azure, CNTK and Tensorflow
Artificial Intelligence and Deep Learning in Azure, CNTK and Tensorflow
 
Blockchain Demystified for Business Intelligence Professionals
Blockchain Demystified for Business Intelligence ProfessionalsBlockchain Demystified for Business Intelligence Professionals
Blockchain Demystified for Business Intelligence Professionals
 
Examples of the worst data visualization ever
Examples of the worst data visualization everExamples of the worst data visualization ever
Examples of the worst data visualization ever
 
Lighting up Big Data Analytics with Apache Spark in Azure
Lighting up Big Data Analytics with Apache Spark in AzureLighting up Big Data Analytics with Apache Spark in Azure
Lighting up Big Data Analytics with Apache Spark in Azure
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

Business Intelligence Barista: What DataViz Tool to Use, and When?

  • 1. Datavis Barista: How to choose what dataviz tool, and when Jen Stirrup Founder, Data Relish Level: Intermediate
  • 2. Who am I? Jen Stirrup
  • 3. What dataviz tool to choose, and when? • SSRS • Excel • Tableau • Power BI • Datazen • Kibana
  • 4. What makes a good Visualisation? • Effective • Accurate • Efficient • Aesthetics • Adaptable
  • 5.
  • 6.
  • 7.
  • 8.
  • 10.
  • 11.
  • 13.
  • 14. What makes a good Visualisation? • Effective • Accurate • Efficient • Aesthetics • Adaptable
  • 22. Example in Power BI, SSRS • Demo
  • 23. Why R? • most widely used data analysis software - used by 2M + data scientist, statisticians and analysts • Most powerful statistical programming language • flexible, extensible and comprehensive for productivity • Create beautiful and unique data visualisations - as seen in New York Times, Twitter and Flowing Data • Thriving open-source community - leading edge of analytics research • Fills the talent gap - new graduates prefer R.
  • 24. Growth in Demand • Rexer Data Mining survey, 2007 - 2013 • R is the highest paid IT skill Dice.com, Jan 2014 • R most used-data science language after SQL - O'Reilly, Jan 2014 • R is used by 70% of data miners. Rexer, Sept 2013
  • 25. Growth in Demand • R is #15 of all programming languages. REdMonk, Jan 2014 • R growing faster than any other data science language. KDNuggs. • R is in-memory and limited in the size of data that you can process.
  • 26. What do I need to install? • Install R – www.r-project.org • Install Rstudio – www.rstudio.com • Handy Shortcuts • Tab – autocomplete of available functions • Control and Up Arrow – History • Control and enter – executes the line of code
  • 27. What tools do we have in R? • 80% of your time will be spent preparing and wrangling data • The remainder of your time will be spent complaining about it. • dplyr: the essential data manipulation toolset • In data wrangling, what are the main tasks? • – Filtering rows – Selecting columns of data – Adding new variables – Sorting – Aggregating
  • 29. The Big Data problem • Reaction Time • Enrichment • Insights • Optimize for query, not for storage.
  • 30. Can you check the errors between 12.02 and 12.04 yesterday?
  • 31. Can you check the errors between 12.02 and 12.04 last Friday? …. Are you kidding me?
  • 32.
  • 33.
  • 35.
  • 36. Kibana • It is highly customizable dashboarding • It is constituted of panels: – Time picker / Query / Filtering – Charts / Table / Text
  • 37. Flexible analytics and visualization platform Real-time summary and charting of streaming data Intuitive interface for a variety of users Instant sharing and embedding of dashboards
  • 38. To better understand large volumes of data.. • easily create bar charts • line and scatter plots • Histograms • pie charts • maps.
  • 39. To better understand large volumes of data.. • easily create bar charts • line and scatter plots • Histograms • pie charts • maps.
  • 41. Default Chart Types Chart Type Basis Values Types Purpose Histogram Timestamp based Count, Mean, Total Barlines, stacks, percentages Queries Table Paging Fields list Highlighting, sorting Fine grained analysis Pie Charts Terms Missing terms, other Doughnut, legends, tables Proportion
  • 42.
  • 43.
  • 44. Summary • SSRS • Excel • Tableau • Power BI • Datazen • Kibana

Hinweis der Redaktion

  1. We will look at: introductory R and why it's useful, and where to go for more information. We will learn statistics and R by looking at: independent events, dependent probability, combinatorics, hypothesis testing, descriptive statistics, random variables, probability distributions, regression, and inferential statistics. We will loosely base the curriculum on the Khan Academy statistics course, but we aim to help the curious, the scared, and the rookie.
  2. Effective: the viewer gets it (ease of interpretation) Accurate: sufficient for correct quantitative evaluation. Lie factor = size of visual effect/size of data effect Efficient: minimize data-ink ratio and chart-junk, show data, maximize data-ink ratio, brase non-data-ink, brase redundant data-ink Aesthetics: must not offend viewer's senses (e.g. moire patterns) Adaptable: can adjust to serve multiple needs
  3. Effective: the viewer gets it (ease of interpretation) Accurate: sufficient for correct quantitative evaluation. Lie factor = size of visual effect/size of data effect Efficient: minimize data-ink ratio and chart-junk, show data, maximize data-ink ratio, brase non-data-ink, brase redundant data-ink Aesthetics: must not offend viewer's senses (e.g. moire patterns) Adaptable: can adjust to serve multiple needs
  4. https://powerbi.uservoice.com/assets/82213473/treemapNoOwl.gif
  5. Whether measured by more than 6,100 add-on packages, the 41,000+ members of LinkedIn’s R group or the 170+ R Meetup groups
  6. Filtering rows creates a subset Columns are regarded as variables
  7. Frustrating, huh?
  8. Sad
  9. Web front end to search / graph and more Advantages: A better UI Better than the old frontend Ruby / framework sinatra