SlideShare ist ein Scribd-Unternehmen logo
1 von 13
© 2015 EXASOL AG
Big Data Analytics
Dave Shuttleworth – Principal Consultant, Exasol UK
email: dave.shuttleworth@exasol.com
Twitter: @EXA_DaveS
© 2015 EXASOL AG
 2014-2015 – EXASOL UK – Principal Consultant
 Introducing EXASOL DBMS technology into UK
 2003 - 2014 – Intelligent Edge Group – Principal Consultant
 Data Warehouse design and migration from older technologies to new MPP DBMS
 Business Intelligence infrastructure architect
 New DBMS technology assessment
 1992 - 2003 – WhiteCross Systems (now Kognitio) – Principal Consultant
 Pre-sales and post-sales technical support
 1989 -1992 – Teradata – Consultant
 Pre-sales and post-sales technical support
 1980 -1989 – Data General (now part of EMC) – Systems engineer
 Pre-sales and post-sales technical support
 1975 -1980 – UK retailer – Analyst programmer
 Applications design and implementation, system management and tuning
My background
© 2015 EXASOL AG
 a column store, in-memory, massively parallel processing (MPP)
database
 modern software designed for analytics
 runs on standard x86 hardware
 Uses standard SQL language (with optional extensions)
 suitable for any scale of data & any number of users
 mature, proven & very cost effective
 quick to implement & easy to operate
The World’s Fastest Analytic Database
What is Exasol?
© 2015 EXASOL AG
QphH@1000 GB 1,000,000 2,000,000 3,000,000 4.000,000
Sept ´14
April ´14
June ´12
Feb ´14
Dec ´13
Aug ´11
Sept ´11
Oct ´11
Dec ´11
Source: www.tpc.org / Sept 22, 2015
We are the benchmark leader
5,246,338
Microsoft 134,117
Oracle 201,487
Oracle 209,533
Microsoft 219,887
Sybase IQ 258,474
Oracle 326,454
Vectorwise 445,529
Microsoft 519,976
On 1 Terabyte of data - an order of magnitude faster than its closest rival
Queries per hour
© 2015 EXASOL AG
Unrivalled price/performance at any scale
4th Position
3rd Position
2nd Position
EXASolution 5.0
0
1,000,000
2,000,000
3,000,000
4,000,000
5,000,000
6,000,000
7,000,000
8,000,000
9,000,000
10,000,000
11,000,000
100GB
300GB
1TB
3TB
10TB
100TB
Performance(QphH)
TPC-H Scale Factor
Source: www.tpc.org / Sept 22, 2014
The larger the data the
greater the EXASolution
advantage - 66% less cost
on average than the
nearest competitor
© 2015 EXASOL AG
• There are many examples of architecture diagrams available which try
to encompass every potentially relevant technology – probably
including (but not limited to) the following:
Requirements for Big Data Analytics (?)
•Streaming data
•Social media data
•Internet of Things
•SQL/NoSQL/NewSQL
databases
•Hadoop and
associated stuff
•Data in memory
•Cloud computing
•Unstructured data
•ETL/ELT/ETLT
•Data Mining
•Predictive Analytics
•Data Quality
•MDM
•Graph databases
•MPP
•etc, etc
© 2015 EXASOL AG
An Architecture for Big Data Analytics (?)
© 2015 EXASOL AG
• Some observations based on work with Exasol, Netezza, Teradata etc.
• It’s extremely rare to find a new ‘greenfield’ environment – there’s
generally some ‘legacy’ stuff which has to be accommodated
• Many clients are only just starting to experiment with newer
technologies such as cloud and Hadoop
• More advanced users are realising that new technologies don’t
necessarily have all the answers
• Some of the newer technologies need new skills – which might not
be easy/cheap to find
• Some metrics – setting up Cloud and Hadoop
• > 150 total steps
• > 30 decision points
• 2-6 months
In the real world..
© 2015 EXASOL AG
The good news…
 Newer technologies are driving implementation prices down
 New technologies support agile development approaches
 As newer technologies mature they become easier to integrate
 with each other
 with legacy systems
 New vendors are addressing the ‘integration complexity’ issues (e.g.
Cazena)
 It is possible and practical to choose the ‘right tool for the job’ – a
single vendor is no longer expected to provide everything
 There are existing examples in production already
© 2015 EXASOL AG
King – getting to know 500 million players..
© 2015 EXASOL AG
Evolution not Revolution
End Users
Mobile Devices
BI & Reporting
Applications
Data
Integration,
ETL and
Replication
• Ported
• Bundled
• Custom
• Prototyping
• Ad Hoc
• Dashboard
• Statistics
• Data Mining
• Analytics
ERP
CRM
SCM
Legacy
OLTP
Enterprise Data
Warehouse
External Data
EXASOL
NoSQL, Graph,
etc
© 2015 EXASOL AG
• Data and database technology isn’t going away!
• New database approaches are being developed to address the
requirements of flexibility, scalability etc
• These technologies drive an increasing need for more analysts,
database designers, data scientists
• Hybrid systems are becoming the norm, with companies mixing ‘best
of breed’ technologies (possibly open source) to get the best and
most cost-effective results – use ‘the right tool for the job’
• New vendors are addressing the complexity problems
• SQL databases will continue to be widely utilised – but alongside
other technologies and integration will become tighter
Summary
© 2015 EXASOL AG
Dave Shuttleworth
Twitter: @EXA_Daves
Email: dave.shuttleworth@exasol.com
Any questions?

Weitere ähnliche Inhalte

Was ist angesagt?

Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not YearsReplatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
VMware Tanzu
 

Was ist angesagt? (20)

Downsizing Data Centers by NetApp IT
Downsizing Data Centers by NetApp ITDownsizing Data Centers by NetApp IT
Downsizing Data Centers by NetApp IT
 
SnapLogic Technology Open House – January 2018
SnapLogic Technology Open House – January 2018SnapLogic Technology Open House – January 2018
SnapLogic Technology Open House – January 2018
 
Transform Your Enterprise Faster with Seamless Hybrid Cloud from Netapp
Transform Your Enterprise Faster with Seamless Hybrid Cloud from NetappTransform Your Enterprise Faster with Seamless Hybrid Cloud from Netapp
Transform Your Enterprise Faster with Seamless Hybrid Cloud from Netapp
 
Better Business in a Flash
Better Business in a FlashBetter Business in a Flash
Better Business in a Flash
 
2014 Predictions: Jay Kidd
2014 Predictions: Jay Kidd2014 Predictions: Jay Kidd
2014 Predictions: Jay Kidd
 
Revolutionising Storage for your Future Business Requirements
Revolutionising Storage for your Future Business RequirementsRevolutionising Storage for your Future Business Requirements
Revolutionising Storage for your Future Business Requirements
 
TWNW_VistagePPT
TWNW_VistagePPTTWNW_VistagePPT
TWNW_VistagePPT
 
NetApp’s Video Surveillance Storage Solution Infographic
NetApp’s Video Surveillance Storage Solution InfographicNetApp’s Video Surveillance Storage Solution Infographic
NetApp’s Video Surveillance Storage Solution Infographic
 
Keith Prabhu - Big Data Cloud Computing
Keith Prabhu - Big Data Cloud ComputingKeith Prabhu - Big Data Cloud Computing
Keith Prabhu - Big Data Cloud Computing
 
Webinar: Which Storage Architecture is Best for Splunk Analytics?
Webinar: Which Storage Architecture is Best for Splunk Analytics?Webinar: Which Storage Architecture is Best for Splunk Analytics?
Webinar: Which Storage Architecture is Best for Splunk Analytics?
 
Pick a Winner: How to Choose a Data Warehouse
Pick a Winner: How to Choose a Data WarehousePick a Winner: How to Choose a Data Warehouse
Pick a Winner: How to Choose a Data Warehouse
 
Denodo Cloud Survey Results 2017
Denodo Cloud Survey Results 2017Denodo Cloud Survey Results 2017
Denodo Cloud Survey Results 2017
 
NetApp Epic Story: Orange Business Services
NetApp Epic Story: Orange Business ServicesNetApp Epic Story: Orange Business Services
NetApp Epic Story: Orange Business Services
 
Accelerate, Simplify, and Be Future-Ready with NetApp for SAP
Accelerate, Simplify, and Be Future-Ready with NetApp for SAPAccelerate, Simplify, and Be Future-Ready with NetApp for SAP
Accelerate, Simplify, and Be Future-Ready with NetApp for SAP
 
10 Good Reasons: NetApp HCI
10 Good Reasons: NetApp HCI10 Good Reasons: NetApp HCI
10 Good Reasons: NetApp HCI
 
How to accelerate Splunk analytics
How to accelerate Splunk analyticsHow to accelerate Splunk analytics
How to accelerate Splunk analytics
 
Get Savvy with Snowflake
Get Savvy with SnowflakeGet Savvy with Snowflake
Get Savvy with Snowflake
 
4 Ways FlexPod Forms the Foundation for Cisco and NetApp Success
4 Ways FlexPod Forms the Foundation for Cisco and NetApp Success4 Ways FlexPod Forms the Foundation for Cisco and NetApp Success
4 Ways FlexPod Forms the Foundation for Cisco and NetApp Success
 
Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not YearsReplatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
 
Dataiku data science studio
Dataiku data science studioDataiku data science studio
Dataiku data science studio
 

Andere mochten auch

Andere mochten auch (9)

Beyond SQL: Managing Events and Relationships in Social Care
Beyond SQL: Managing Events and Relationships in Social CareBeyond SQL: Managing Events and Relationships in Social Care
Beyond SQL: Managing Events and Relationships in Social Care
 
Nick Keen Data governance in the Environment Agency
Nick Keen   Data governance in the Environment AgencyNick Keen   Data governance in the Environment Agency
Nick Keen Data governance in the Environment Agency
 
The CDO Challenge 24-11-16
The CDO Challenge 24-11-16The CDO Challenge 24-11-16
The CDO Challenge 24-11-16
 
Nicola Askham Key concepts in data governance
Nicola Askham   Key concepts in data governanceNicola Askham   Key concepts in data governance
Nicola Askham Key concepts in data governance
 
John Stuart-Clarke - beginning the data governance journey - 8th june 2016
John Stuart-Clarke - beginning the data governance journey - 8th june 2016John Stuart-Clarke - beginning the data governance journey - 8th june 2016
John Stuart-Clarke - beginning the data governance journey - 8th june 2016
 
Moving from 3rd Normal Form to a web enabled world 22-9-15
Moving from 3rd Normal Form to a web enabled world   22-9-15Moving from 3rd Normal Form to a web enabled world   22-9-15
Moving from 3rd Normal Form to a web enabled world 22-9-15
 
Michael Bironneau Data governance and the IoT
Michael Bironneau   Data governance and the IoTMichael Bironneau   Data governance and the IoT
Michael Bironneau Data governance and the IoT
 
The CDO Agenda: how data architecture can help?
The CDO Agenda: how data architecture can help?The CDO Agenda: how data architecture can help?
The CDO Agenda: how data architecture can help?
 
Nigel Turner data governance is not boring
Nigel Turner   data governance is not boringNigel Turner   data governance is not boring
Nigel Turner data governance is not boring
 

Ähnlich wie Big Data Analytics, Dave Shuttleworth - 22-9-15

Big Data LDN 2016: All data is equal – but some data is more equal than others
Big Data LDN 2016: All data is equal – but some data is more equal than othersBig Data LDN 2016: All data is equal – but some data is more equal than others
Big Data LDN 2016: All data is equal – but some data is more equal than others
Matt Stubbs
 
Oracle Sistemas Convergentes
Oracle Sistemas ConvergentesOracle Sistemas Convergentes
Oracle Sistemas Convergentes
Fran Navarro
 
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve SibleyFuture of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
IBM Danmark
 
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
Amazon Web Services Korea
 

Ähnlich wie Big Data Analytics, Dave Shuttleworth - 22-9-15 (20)

Big Data LDN 2016: All data is equal – but some data is more equal than others
Big Data LDN 2016: All data is equal – but some data is more equal than othersBig Data LDN 2016: All data is equal – but some data is more equal than others
Big Data LDN 2016: All data is equal – but some data is more equal than others
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
 
Oracle Sistemas Convergentes
Oracle Sistemas ConvergentesOracle Sistemas Convergentes
Oracle Sistemas Convergentes
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
 
SQL vs NoSQL: Why you’ll never dump your relations - Dave Shuttleworth, EXASOL
SQL vs NoSQL: Why you’ll never dump your relations - Dave Shuttleworth, EXASOLSQL vs NoSQL: Why you’ll never dump your relations - Dave Shuttleworth, EXASOL
SQL vs NoSQL: Why you’ll never dump your relations - Dave Shuttleworth, EXASOL
 
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve SibleyFuture of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
 
Modern Data Warehouse Overview
Modern Data Warehouse OverviewModern Data Warehouse Overview
Modern Data Warehouse Overview
 
Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...
Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...
Real-world Cloud HPC at Scale, for Production Workloads (BDT212) | AWS re:Inv...
 
Trivadis Azure Data Lake
Trivadis Azure Data LakeTrivadis Azure Data Lake
Trivadis Azure Data Lake
 
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
클라우드에서의 데이터 웨어하우징 & 비즈니스 인텔리전스
 
Give Your Organization Better, Faster Insights & Answers with High Performanc...
Give Your Organization Better, Faster Insights & Answers with High Performanc...Give Your Organization Better, Faster Insights & Answers with High Performanc...
Give Your Organization Better, Faster Insights & Answers with High Performanc...
 
Microsoft Fabric Introduction
Microsoft Fabric IntroductionMicrosoft Fabric Introduction
Microsoft Fabric Introduction
 
Dell Digital Transformation Through AI and Data Analytics Webinar
Dell Digital Transformation Through AI and  Data Analytics WebinarDell Digital Transformation Through AI and  Data Analytics Webinar
Dell Digital Transformation Through AI and Data Analytics Webinar
 
Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020
 
Customer migration to Azure SQL database, December 2019
Customer migration to Azure SQL database, December 2019Customer migration to Azure SQL database, December 2019
Customer migration to Azure SQL database, December 2019
 
Is it sensible to use Data Vault at all? Conclusions from a project.
Is it sensible to use Data Vault at all? Conclusions from a project.Is it sensible to use Data Vault at all? Conclusions from a project.
Is it sensible to use Data Vault at all? Conclusions from a project.
 
Optimizing workload deployments to accelerate business outcomes
Optimizing workload deployments to accelerate business outcomes Optimizing workload deployments to accelerate business outcomes
Optimizing workload deployments to accelerate business outcomes
 
Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
 
How Yellowbrick Data Integrates to Existing Environments Webcast
How Yellowbrick Data Integrates to Existing Environments WebcastHow Yellowbrick Data Integrates to Existing Environments Webcast
How Yellowbrick Data Integrates to Existing Environments Webcast
 

Kürzlich hochgeladen

Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Valters Lauzums
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
pyhepag
 
如何办理滑铁卢大学毕业证(Waterloo毕业证)成绩单本科学位证原版一比一
如何办理滑铁卢大学毕业证(Waterloo毕业证)成绩单本科学位证原版一比一如何办理滑铁卢大学毕业证(Waterloo毕业证)成绩单本科学位证原版一比一
如何办理滑铁卢大学毕业证(Waterloo毕业证)成绩单本科学位证原版一比一
0uyfyq0q4
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
cyebo
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
Amil baba
 
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
dq9vz1isj
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
cyebo
 
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
ju0dztxtn
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
ppy8zfkfm
 

Kürzlich hochgeladen (20)

123.docx. .
123.docx.                                 .123.docx.                                 .
123.docx. .
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 
Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdf
 
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdfGenerative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
 
如何办理滑铁卢大学毕业证(Waterloo毕业证)成绩单本科学位证原版一比一
如何办理滑铁卢大学毕业证(Waterloo毕业证)成绩单本科学位证原版一比一如何办理滑铁卢大学毕业证(Waterloo毕业证)成绩单本科学位证原版一比一
如何办理滑铁卢大学毕业证(Waterloo毕业证)成绩单本科学位证原版一比一
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prison
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
 
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
 
Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
 
The Significance of Transliteration Enhancing
The Significance of Transliteration EnhancingThe Significance of Transliteration Enhancing
The Significance of Transliteration Enhancing
 
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
 

Big Data Analytics, Dave Shuttleworth - 22-9-15

  • 1. © 2015 EXASOL AG Big Data Analytics Dave Shuttleworth – Principal Consultant, Exasol UK email: dave.shuttleworth@exasol.com Twitter: @EXA_DaveS
  • 2. © 2015 EXASOL AG  2014-2015 – EXASOL UK – Principal Consultant  Introducing EXASOL DBMS technology into UK  2003 - 2014 – Intelligent Edge Group – Principal Consultant  Data Warehouse design and migration from older technologies to new MPP DBMS  Business Intelligence infrastructure architect  New DBMS technology assessment  1992 - 2003 – WhiteCross Systems (now Kognitio) – Principal Consultant  Pre-sales and post-sales technical support  1989 -1992 – Teradata – Consultant  Pre-sales and post-sales technical support  1980 -1989 – Data General (now part of EMC) – Systems engineer  Pre-sales and post-sales technical support  1975 -1980 – UK retailer – Analyst programmer  Applications design and implementation, system management and tuning My background
  • 3. © 2015 EXASOL AG  a column store, in-memory, massively parallel processing (MPP) database  modern software designed for analytics  runs on standard x86 hardware  Uses standard SQL language (with optional extensions)  suitable for any scale of data & any number of users  mature, proven & very cost effective  quick to implement & easy to operate The World’s Fastest Analytic Database What is Exasol?
  • 4. © 2015 EXASOL AG QphH@1000 GB 1,000,000 2,000,000 3,000,000 4.000,000 Sept ´14 April ´14 June ´12 Feb ´14 Dec ´13 Aug ´11 Sept ´11 Oct ´11 Dec ´11 Source: www.tpc.org / Sept 22, 2015 We are the benchmark leader 5,246,338 Microsoft 134,117 Oracle 201,487 Oracle 209,533 Microsoft 219,887 Sybase IQ 258,474 Oracle 326,454 Vectorwise 445,529 Microsoft 519,976 On 1 Terabyte of data - an order of magnitude faster than its closest rival Queries per hour
  • 5. © 2015 EXASOL AG Unrivalled price/performance at any scale 4th Position 3rd Position 2nd Position EXASolution 5.0 0 1,000,000 2,000,000 3,000,000 4,000,000 5,000,000 6,000,000 7,000,000 8,000,000 9,000,000 10,000,000 11,000,000 100GB 300GB 1TB 3TB 10TB 100TB Performance(QphH) TPC-H Scale Factor Source: www.tpc.org / Sept 22, 2014 The larger the data the greater the EXASolution advantage - 66% less cost on average than the nearest competitor
  • 6. © 2015 EXASOL AG • There are many examples of architecture diagrams available which try to encompass every potentially relevant technology – probably including (but not limited to) the following: Requirements for Big Data Analytics (?) •Streaming data •Social media data •Internet of Things •SQL/NoSQL/NewSQL databases •Hadoop and associated stuff •Data in memory •Cloud computing •Unstructured data •ETL/ELT/ETLT •Data Mining •Predictive Analytics •Data Quality •MDM •Graph databases •MPP •etc, etc
  • 7. © 2015 EXASOL AG An Architecture for Big Data Analytics (?)
  • 8. © 2015 EXASOL AG • Some observations based on work with Exasol, Netezza, Teradata etc. • It’s extremely rare to find a new ‘greenfield’ environment – there’s generally some ‘legacy’ stuff which has to be accommodated • Many clients are only just starting to experiment with newer technologies such as cloud and Hadoop • More advanced users are realising that new technologies don’t necessarily have all the answers • Some of the newer technologies need new skills – which might not be easy/cheap to find • Some metrics – setting up Cloud and Hadoop • > 150 total steps • > 30 decision points • 2-6 months In the real world..
  • 9. © 2015 EXASOL AG The good news…  Newer technologies are driving implementation prices down  New technologies support agile development approaches  As newer technologies mature they become easier to integrate  with each other  with legacy systems  New vendors are addressing the ‘integration complexity’ issues (e.g. Cazena)  It is possible and practical to choose the ‘right tool for the job’ – a single vendor is no longer expected to provide everything  There are existing examples in production already
  • 10. © 2015 EXASOL AG King – getting to know 500 million players..
  • 11. © 2015 EXASOL AG Evolution not Revolution End Users Mobile Devices BI & Reporting Applications Data Integration, ETL and Replication • Ported • Bundled • Custom • Prototyping • Ad Hoc • Dashboard • Statistics • Data Mining • Analytics ERP CRM SCM Legacy OLTP Enterprise Data Warehouse External Data EXASOL NoSQL, Graph, etc
  • 12. © 2015 EXASOL AG • Data and database technology isn’t going away! • New database approaches are being developed to address the requirements of flexibility, scalability etc • These technologies drive an increasing need for more analysts, database designers, data scientists • Hybrid systems are becoming the norm, with companies mixing ‘best of breed’ technologies (possibly open source) to get the best and most cost-effective results – use ‘the right tool for the job’ • New vendors are addressing the complexity problems • SQL databases will continue to be widely utilised – but alongside other technologies and integration will become tighter Summary
  • 13. © 2015 EXASOL AG Dave Shuttleworth Twitter: @EXA_Daves Email: dave.shuttleworth@exasol.com Any questions?