SlideShare ist ein Scribd-Unternehmen logo
1 von 26
IT STRATEGY FOR SCALABLE ANALYTICS, 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
MODERN DATA ARCHITECTURES
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
MODERN ARCHITECTURES
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
STUNNING FACT 
Making the Modern World: Materials and Dematerialization - Vaclav Smil
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d .
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
Shift in Mindset 
Scarcity 
‱ Technology 
constrained 
‱ Process-centric 
‱ Focus on cost 
control 
Everything is forbidden 
unless 
it is permitted 
Abundance 
‱ Focus on value 
‱ Discovery-centric 
‱ Technology empowered 
Everything is permitted 
unless 
it is forbidden
Trends Big Data, Storage, Hadoop & In-memory Technology 
THE PERFECT STORM: STORAGE TECHNOLOGY COSTS AND CPU SPEED 
Cost of Storage, Memory, Computing 
‱ In 2000 a GB of Disk $17 today < $0.07 
‱ In 2000 a GB of Ram $1800 today < $1 
‱ In 2009 a TB of RDBMS was $70K today < $ 20K 
Cost per Terabyte 
$- $20 $40 $60 $80 $100 
Hadoop 
Microsoft PDW 
Oracle 
Greenplum 
Teradata 
Vertica 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
Thousands 
Today 2009
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
MODERN REALITY 
‱ Commoditization 
‱ Architectures 
‱ Scale Infrastructure 
‱New Complex Streams 
‱ Perishable Considerations 
‱ Cost Data 
‱New Category of Business Problems 
‱ Analytical Algorithms 
‱Operationalization Analytics
Finding treasures in unstructured data 
like social media or survey tools 
that could uncover insights 
about consumer sentiment 
Copyright © 2011, SAS Inst itute Inc. Al l rights reserved. 
8 
Leveraging historical data 
to drive better insight into 
decision-making 
for the future 
Mine transaction databases 
for data of spending patterns 
that indicate a stolen card.. 
Analyze massive 
amounts of data in 
order to accurately 
identify areas likely to 
produce the most 
profitable results 
FORECASTING 
DATA MINING 
TEXT ANALYTICS 
OPTIMIZATION 
STATISTICS 
ADVANCED ANALYTICS 
INFORMATION 
MANAGEMENT
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
CURRENT TRENDS IN ANALYTICS 
Complex Business Problems Are Driving Analytics Innovation 
Speed Will Be Of Essence 
Leverage Analytics To Unlock The Information Contained In 
Unstructured Data 
Operationalizing Analytics
CURRENT AND FUTURE ARCHITECTURES 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d .
WHERE WE ARE 
TODAY? SETTING THE SCENE 
Operational 
Data Sources 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
EDW 
Data Mart 
Data Mart 
Analytic 
Mart 
Analytic 
Mart 
BI and 
Analytics 
Unstructured, Semi-structured and Streaming 
data (i.e. sensor data) handled often outside the 
Warehouse flow
WHERE DOES 
HADOOP FIT? HADOOP AS A “NEW DATA” STORE 
Operational 
Data Sources 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
EDW 
Data Mart 
Data Mart 
Analytic 
Mart 
Analytic 
Mart 
BI and 
Analytics
WHERE DOES 
HADOOP FIT? HADOOP AS AN ADDITIONAL INPUT TO THE EDW 
Operational 
Data Sources 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
EDW 
Data Mart 
Data Mart 
Analytic 
Mart 
Analytic 
Mart 
Analytic 
Mart 
Data Mart 
BI and 
Analytics
WHERE DOES 
HADOOP FIT? 
Operational 
Data Sources EDW 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
HADOOP DATA PLATFORM AS A “STAGING LAYER” AS 
PART OF A “DATA LAKE” – Downstream stores could be 
Hadoop, data appliances or an RDBMS 
Data Mart 
Data Mart 
Analytic 
Mart 
Analytic 
Mart 
BI and 
Analytics
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
SAS BIG DATA STRATEGY – SAS AREAS 
15
SAS & HADOOP SASÂź WITHIN THE HADOOP ECOSYSTEM 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
SAS Metadata 
Base SAS & SAS/ACCESS¼ to Hadoopℱ 
Impala 
Next-Gen 
SASÂź User 
User 
Interface 
Metadata 
Data 
Access 
Data 
Processing 
File 
System 
SASÂź User 
SAS¼ LASRℱ Analytic 
Server 
SASÂź High- 
Performance 
Analytic Procedures 
MPI Based 
HDFS 
Pig 
Map Reduce 
SASÂź Visual 
Analytics 
In-Memory 
Data Access 
SASÂź 
Enterprise 
Minerℱ 
SASÂź Data 
Integration 
SASÂź 
Enterprise 
GuideÂź 
Hive 
SAS Embedded 
Process 
Accelerators 
SASÂź In-Memory 
Statistics for 
Haodop
IN SUMMARY SAS ENABLES THE ENTIRE LIFECYCLE AROUND HADOOP 
SAS Visual Analytics 
Decision Manager 
SAS Scoring Accelerator for Hadoop 
SAS Code Accelerator for Hadoop 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
IDENTIFY / 
FORMULATE 
PROBLEM 
DATA 
PREPARATION 
DATA 
EXPLORATION 
TRANSFORM 
& SELECT 
BUILD 
MODEL 
VALIDATE 
MODEL 
DEPLOY 
MODEL 
EVALUATE / 
MONITOR 
RESULTS 
SAS Visual Analytics 
SAS Visual Statistics 
SAS In-Memory Statistics for Hadoop 
Done using either the Data 
Preparation, Data Exploration 
or Build Model Tools 
SAS High Performance Analytics Offerings 
supported by relevant clients like SAS 
Enterprise Miner, SAS/STAT etc. 
Decision Manager 
Done using either the Data Preparation, 
Data Exploration or Build Model Tools
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
SASÂź VISUAL ANALYTICS 
A SINGLE SOLUTION FOR DATA DISCOVERY, 
VISUALIZATION, ANALYTICS AND REPORTING
SASÂź VISUAL 
ANALYTICS 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
EXAMPLE: TEXT ANALYSIS GIVES YOU INSIGHT TO 
CUSTOMER EXPERIENCE AND OPINION 
VISUALIZATION POWERED BY 
SAS ANALYTICS Analytics applied 
to text provides 
real MEANING
VISUALIZATION EXAMPLES 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d .
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
SASÂź VISUAL STATISTICS
DATA TO DECISION LIFECYCLE 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
SASÂź Visual Statistics 
MANAGE 
DATA 
COMPETITIVE 
ADVANTAGE 
TEXT 
EXPLORE 
DATA 
DEVELOP 
MODELS 
DEPLOY & 
MONITOR
APPLICATION 
AREAS 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
Segmentation 
Classification 
Prediction 
Ad-hoc Discovery 
Data Preparation
SAS IN-MEMORY STATISTICS FOR HADOOP 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d .
SASÂź IN-MEMORY 
STATISTICS FOR 
HADOOP 
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . 
WHY IT IS IMPORTANT? 
SPEED 
Multi-user interactive analytics 
environment for increased productivity 
Proven state-of-the-art 
statistical algorithms and 
machine learning techniques 
Highly scalable, in-memory 
environment grows easily as needed 
Memory and data efficient for 
a significant reduction of data 
latency to rapidly analyze 
large and complex data in 
Hadoop 
PRECISION 
SCALABLE 
INTERACTIVE
Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . sas.com

Weitere Àhnliche Inhalte

Was ist angesagt?

Hadoop World 2011: Big Data Architecture: Integrating Hadoop with Other Enter...
Hadoop World 2011: Big Data Architecture: Integrating Hadoop with Other Enter...Hadoop World 2011: Big Data Architecture: Integrating Hadoop with Other Enter...
Hadoop World 2011: Big Data Architecture: Integrating Hadoop with Other Enter...Cloudera, Inc.
 
Machine learning overview (with SAS software)
Machine learning overview (with SAS software)Machine learning overview (with SAS software)
Machine learning overview (with SAS software)Longhow Lam
 
Teradata Aster: Big Data Discovery Made Easy
Teradata Aster: Big Data Discovery Made EasyTeradata Aster: Big Data Discovery Made Easy
Teradata Aster: Big Data Discovery Made EasyTIBCO Spotfire
 
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataExclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataPentaho
 
Data Management for High Performance Analytics
Data Management for High Performance AnalyticsData Management for High Performance Analytics
Data Management for High Performance AnalyticsMary Snyder
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Barijaxconf
 
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data AdvantageWebinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data AdvantageCloudera, Inc.
 
101 ab 1345-1415
101 ab 1345-1415101 ab 1345-1415
101 ab 1345-1415Chiou-Nan Chen
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata Hortonworks
 
Big Data Integration Webinar: Getting Started With Hadoop Big Data
Big Data Integration Webinar: Getting Started With Hadoop Big DataBig Data Integration Webinar: Getting Started With Hadoop Big Data
Big Data Integration Webinar: Getting Started With Hadoop Big DataPentaho
 
Big Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped OpportunitiesBig Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped OpportunitiesSAP Technology
 
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...Cloudera, Inc.
 
Big data and its impact on SOA
Big data and its impact on SOABig data and its impact on SOA
Big data and its impact on SOADemed L'Her
 
Bay Area Hadoop User Group
Bay Area Hadoop User GroupBay Area Hadoop User Group
Bay Area Hadoop User GroupPentaho
 
Hadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data ProcessingHadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data ProcessingHortonworks
 
The Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationThe Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationEric Kavanagh
 
Combining SAS Office Analytics, SAS Visual Analytics, and SAS Studio.
Combining SAS Office Analytics, SAS Visual Analytics, and SAS Studio.Combining SAS Office Analytics, SAS Visual Analytics, and SAS Studio.
Combining SAS Office Analytics, SAS Visual Analytics, and SAS Studio.SAS Canada
 

Was ist angesagt? (20)

Hadoop World 2011: Big Data Architecture: Integrating Hadoop with Other Enter...
Hadoop World 2011: Big Data Architecture: Integrating Hadoop with Other Enter...Hadoop World 2011: Big Data Architecture: Integrating Hadoop with Other Enter...
Hadoop World 2011: Big Data Architecture: Integrating Hadoop with Other Enter...
 
Machine learning overview (with SAS software)
Machine learning overview (with SAS software)Machine learning overview (with SAS software)
Machine learning overview (with SAS software)
 
Teradata Aster: Big Data Discovery Made Easy
Teradata Aster: Big Data Discovery Made EasyTeradata Aster: Big Data Discovery Made Easy
Teradata Aster: Big Data Discovery Made Easy
 
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataExclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
 
Data Management for High Performance Analytics
Data Management for High Performance AnalyticsData Management for High Performance Analytics
Data Management for High Performance Analytics
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu BariApache Hadoop and its role in Big Data architecture - Himanshu Bari
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
 
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data AdvantageWebinar | Using Hadoop Analytics to Gain a Big Data Advantage
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
 
101 ab 1345-1415
101 ab 1345-1415101 ab 1345-1415
101 ab 1345-1415
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 
Big Data Integration Webinar: Getting Started With Hadoop Big Data
Big Data Integration Webinar: Getting Started With Hadoop Big DataBig Data Integration Webinar: Getting Started With Hadoop Big Data
Big Data Integration Webinar: Getting Started With Hadoop Big Data
 
Big Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped OpportunitiesBig Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped Opportunities
 
Oracle's BigData solutions
Oracle's BigData solutionsOracle's BigData solutions
Oracle's BigData solutions
 
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...
Hadoop World 2011: The Blind Men and the Elephant - Matthew Aslett - The 451 ...
 
Big data and its impact on SOA
Big data and its impact on SOABig data and its impact on SOA
Big data and its impact on SOA
 
Technical presentation
Technical presentationTechnical presentation
Technical presentation
 
Bay Area Hadoop User Group
Bay Area Hadoop User GroupBay Area Hadoop User Group
Bay Area Hadoop User Group
 
Ask bigger questions
Ask bigger questionsAsk bigger questions
Ask bigger questions
 
Hadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data ProcessingHadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data Processing
 
The Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationThe Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data Integration
 
Combining SAS Office Analytics, SAS Visual Analytics, and SAS Studio.
Combining SAS Office Analytics, SAS Visual Analytics, and SAS Studio.Combining SAS Office Analytics, SAS Visual Analytics, and SAS Studio.
Combining SAS Office Analytics, SAS Visual Analytics, and SAS Studio.
 

Andere mochten auch

SAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at ScaleSAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at ScaleCloudera, Inc.
 
Administrative Reporting of SAS Visual Analytics 7.1 and Integration with E...
Administrative Reporting of SAS Visual Analytics 7.1  and Integration with  E...Administrative Reporting of SAS Visual Analytics 7.1  and Integration with  E...
Administrative Reporting of SAS Visual Analytics 7.1 and Integration with E...Francesco Marelli
 
Visual Analytics
Visual AnalyticsVisual Analytics
Visual Analyticsmeganfulton3
 
Hadoop and Spark for the SAS Developer
Hadoop and Spark for the SAS DeveloperHadoop and Spark for the SAS Developer
Hadoop and Spark for the SAS DeveloperDataWorks Summit
 
SAS on Your (Apache) Cluster, Serving your Data (Analysts)
SAS on Your (Apache) Cluster, Serving your Data (Analysts)SAS on Your (Apache) Cluster, Serving your Data (Analysts)
SAS on Your (Apache) Cluster, Serving your Data (Analysts)DataWorks Summit
 
Predictive Analytics: It's The Intervention That Matters
Predictive Analytics: It's The Intervention That MattersPredictive Analytics: It's The Intervention That Matters
Predictive Analytics: It's The Intervention That MattersHealth Catalyst
 
The art of startup pitching
The art of startup pitching The art of startup pitching
The art of startup pitching Lior Vaknin
 
2011.02.18 marco parenzan - case study. conversione di una applicazione for...
2011.02.18   marco parenzan - case study. conversione di una applicazione for...2011.02.18   marco parenzan - case study. conversione di una applicazione for...
2011.02.18 marco parenzan - case study. conversione di una applicazione for...Marco Parenzan
 
Virtual Call and Contact Center
Virtual Call and Contact CenterVirtual Call and Contact Center
Virtual Call and Contact CenterVCC Live
 
2011-06-19 ASIP Santé RIR "DMP en région et perspectives de déploiement du DMP"
2011-06-19 ASIP Santé RIR "DMP en région et perspectives de déploiement du DMP"2011-06-19 ASIP Santé RIR "DMP en région et perspectives de déploiement du DMP"
2011-06-19 ASIP Santé RIR "DMP en région et perspectives de déploiement du DMP"ASIP Santé
 
J G B resume teaching 2014 dr
J G B resume teaching 2014 drJ G B resume teaching 2014 dr
J G B resume teaching 2014 drJose Gerena
 
OOW16 - Ready or Not: Applying Secure Configuration to Oracle E-Business Suit...
OOW16 - Ready or Not: Applying Secure Configuration to Oracle E-Business Suit...OOW16 - Ready or Not: Applying Secure Configuration to Oracle E-Business Suit...
OOW16 - Ready or Not: Applying Secure Configuration to Oracle E-Business Suit...vasuballa
 
SAS Training session - By Pratima
SAS Training session  -  By Pratima SAS Training session  -  By Pratima
SAS Training session - By Pratima Pratima Pandey
 
Hadoop and Big Data
Hadoop and Big DataHadoop and Big Data
Hadoop and Big DataHarshdeep Kaur
 
Ukuran statistik bag 2
Ukuran statistik bag 2Ukuran statistik bag 2
Ukuran statistik bag 2intanbuhatii
 
SAS Visual Analytics
SAS Visual AnalyticsSAS Visual Analytics
SAS Visual AnalyticsEvan Greenberg
 
Big Data Analytics in Government
Big Data Analytics in GovernmentBig Data Analytics in Government
Big Data Analytics in GovernmentDeepak Ramanathan
 
SAS for Claims Fraud
SAS for Claims FraudSAS for Claims Fraud
SAS for Claims Fraudstuartdrose
 

Andere mochten auch (20)

SAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at ScaleSAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at Scale
 
Administrative Reporting of SAS Visual Analytics 7.1 and Integration with E...
Administrative Reporting of SAS Visual Analytics 7.1  and Integration with  E...Administrative Reporting of SAS Visual Analytics 7.1  and Integration with  E...
Administrative Reporting of SAS Visual Analytics 7.1 and Integration with E...
 
Visual Analytics
Visual AnalyticsVisual Analytics
Visual Analytics
 
Hadoop and Spark for the SAS Developer
Hadoop and Spark for the SAS DeveloperHadoop and Spark for the SAS Developer
Hadoop and Spark for the SAS Developer
 
SAS on Your (Apache) Cluster, Serving your Data (Analysts)
SAS on Your (Apache) Cluster, Serving your Data (Analysts)SAS on Your (Apache) Cluster, Serving your Data (Analysts)
SAS on Your (Apache) Cluster, Serving your Data (Analysts)
 
Predictive Analytics: It's The Intervention That Matters
Predictive Analytics: It's The Intervention That MattersPredictive Analytics: It's The Intervention That Matters
Predictive Analytics: It's The Intervention That Matters
 
Technix-Pro Sas certified base programmer
Technix-Pro Sas certified base programmerTechnix-Pro Sas certified base programmer
Technix-Pro Sas certified base programmer
 
Technix-Pro PMP + FinTech Bundle Package
Technix-Pro PMP + FinTech Bundle PackageTechnix-Pro PMP + FinTech Bundle Package
Technix-Pro PMP + FinTech Bundle Package
 
The art of startup pitching
The art of startup pitching The art of startup pitching
The art of startup pitching
 
2011.02.18 marco parenzan - case study. conversione di una applicazione for...
2011.02.18   marco parenzan - case study. conversione di una applicazione for...2011.02.18   marco parenzan - case study. conversione di una applicazione for...
2011.02.18 marco parenzan - case study. conversione di una applicazione for...
 
Virtual Call and Contact Center
Virtual Call and Contact CenterVirtual Call and Contact Center
Virtual Call and Contact Center
 
2011-06-19 ASIP Santé RIR "DMP en région et perspectives de déploiement du DMP"
2011-06-19 ASIP Santé RIR "DMP en région et perspectives de déploiement du DMP"2011-06-19 ASIP Santé RIR "DMP en région et perspectives de déploiement du DMP"
2011-06-19 ASIP Santé RIR "DMP en région et perspectives de déploiement du DMP"
 
J G B resume teaching 2014 dr
J G B resume teaching 2014 drJ G B resume teaching 2014 dr
J G B resume teaching 2014 dr
 
OOW16 - Ready or Not: Applying Secure Configuration to Oracle E-Business Suit...
OOW16 - Ready or Not: Applying Secure Configuration to Oracle E-Business Suit...OOW16 - Ready or Not: Applying Secure Configuration to Oracle E-Business Suit...
OOW16 - Ready or Not: Applying Secure Configuration to Oracle E-Business Suit...
 
SAS Training session - By Pratima
SAS Training session  -  By Pratima SAS Training session  -  By Pratima
SAS Training session - By Pratima
 
Hadoop and Big Data
Hadoop and Big DataHadoop and Big Data
Hadoop and Big Data
 
Ukuran statistik bag 2
Ukuran statistik bag 2Ukuran statistik bag 2
Ukuran statistik bag 2
 
SAS Visual Analytics
SAS Visual AnalyticsSAS Visual Analytics
SAS Visual Analytics
 
Big Data Analytics in Government
Big Data Analytics in GovernmentBig Data Analytics in Government
Big Data Analytics in Government
 
SAS for Claims Fraud
SAS for Claims FraudSAS for Claims Fraud
SAS for Claims Fraud
 

Ähnlich wie SAS Modernization architectures - Big Data Analytics

SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...
SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...
SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...SAS Italy
 
2015 HortonWorks MDA Roadshow Presentation
2015 HortonWorks MDA Roadshow Presentation2015 HortonWorks MDA Roadshow Presentation
2015 HortonWorks MDA Roadshow PresentationFelix Liao
 
Accelerate Your Big Data Analytics Efforts with SAS and Hadoop
Accelerate Your Big Data Analytics Efforts with SAS and HadoopAccelerate Your Big Data Analytics Efforts with SAS and Hadoop
Accelerate Your Big Data Analytics Efforts with SAS and HadoopDataWorks Summit
 
What is the Value of SAS Analytics?
What is the Value of SAS Analytics?What is the Value of SAS Analytics?
What is the Value of SAS Analytics?SAS Canada
 
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...Enterprise Management Associates
 
GITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP PresentationGITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP PresentationPedro Pereira
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHortonworks
 
Leveraging Cloud Analytics to Support Data-Driven Decisions
Leveraging Cloud Analytics to Support Data-Driven DecisionsLeveraging Cloud Analytics to Support Data-Driven Decisions
Leveraging Cloud Analytics to Support Data-Driven DecisionsAmazon Web Services
 
In-Memory Database Platform for Big Data
In-Memory Database Platform for Big DataIn-Memory Database Platform for Big Data
In-Memory Database Platform for Big DataSAP Technology
 
Big data tim
Big data timBig data tim
Big data timT Weir
 
Cwin16 tls-partner-sas new-open_analytics_platform
Cwin16 tls-partner-sas new-open_analytics_platformCwin16 tls-partner-sas new-open_analytics_platform
Cwin16 tls-partner-sas new-open_analytics_platformCapgemini
 
Shop.org 2017 Tech talk do more that just analytics open platforms and sas dr...
Shop.org 2017 Tech talk do more that just analytics open platforms and sas dr...Shop.org 2017 Tech talk do more that just analytics open platforms and sas dr...
Shop.org 2017 Tech talk do more that just analytics open platforms and sas dr...National Retail Federation
 
Big Idea For Big Data
Big Idea For Big DataBig Idea For Big Data
Big Idea For Big DataDexlab Analytics
 
Big Data for Product Managers
Big Data for Product ManagersBig Data for Product Managers
Big Data for Product ManagersPentaho
 
SAS - Visual Analytics a ĆŸivĂĄ ukĂĄzka
SAS - Visual Analytics a ĆŸivĂĄ ukĂĄzkaSAS - Visual Analytics a ĆŸivĂĄ ukĂĄzka
SAS - Visual Analytics a ĆŸivĂĄ ukĂĄzkaMarketingArrowECS_CZ
 
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTBig Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTKiththi Perera
 
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardBig data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardKiththi Perera
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise DataWorks Summit
 
Data & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft PlatformsData & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft PlatformsSonata Software
 

Ähnlich wie SAS Modernization architectures - Big Data Analytics (20)

SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...
SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...
SAS Data Management for Analytics: potenzia le tue analisi e sostieni l’innov...
 
Business Visualization: Dashboard & Storyboarding
Business Visualization: Dashboard & StoryboardingBusiness Visualization: Dashboard & Storyboarding
Business Visualization: Dashboard & Storyboarding
 
2015 HortonWorks MDA Roadshow Presentation
2015 HortonWorks MDA Roadshow Presentation2015 HortonWorks MDA Roadshow Presentation
2015 HortonWorks MDA Roadshow Presentation
 
Accelerate Your Big Data Analytics Efforts with SAS and Hadoop
Accelerate Your Big Data Analytics Efforts with SAS and HadoopAccelerate Your Big Data Analytics Efforts with SAS and Hadoop
Accelerate Your Big Data Analytics Efforts with SAS and Hadoop
 
What is the Value of SAS Analytics?
What is the Value of SAS Analytics?What is the Value of SAS Analytics?
What is the Value of SAS Analytics?
 
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Ana...
 
GITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP PresentationGITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP Presentation
 
Hadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - JaspersoftHadoop Reporting and Analysis - Jaspersoft
Hadoop Reporting and Analysis - Jaspersoft
 
Leveraging Cloud Analytics to Support Data-Driven Decisions
Leveraging Cloud Analytics to Support Data-Driven DecisionsLeveraging Cloud Analytics to Support Data-Driven Decisions
Leveraging Cloud Analytics to Support Data-Driven Decisions
 
In-Memory Database Platform for Big Data
In-Memory Database Platform for Big DataIn-Memory Database Platform for Big Data
In-Memory Database Platform for Big Data
 
Big data tim
Big data timBig data tim
Big data tim
 
Cwin16 tls-partner-sas new-open_analytics_platform
Cwin16 tls-partner-sas new-open_analytics_platformCwin16 tls-partner-sas new-open_analytics_platform
Cwin16 tls-partner-sas new-open_analytics_platform
 
Shop.org 2017 Tech talk do more that just analytics open platforms and sas dr...
Shop.org 2017 Tech talk do more that just analytics open platforms and sas dr...Shop.org 2017 Tech talk do more that just analytics open platforms and sas dr...
Shop.org 2017 Tech talk do more that just analytics open platforms and sas dr...
 
Big Idea For Big Data
Big Idea For Big DataBig Idea For Big Data
Big Idea For Big Data
 
Big Data for Product Managers
Big Data for Product ManagersBig Data for Product Managers
Big Data for Product Managers
 
SAS - Visual Analytics a ĆŸivĂĄ ukĂĄzka
SAS - Visual Analytics a ĆŸivĂĄ ukĂĄzkaSAS - Visual Analytics a ĆŸivĂĄ ukĂĄzka
SAS - Visual Analytics a ĆŸivĂĄ ukĂĄzka
 
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTBig Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
 
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardBig data solutions on cloud – the way forward
Big data solutions on cloud – the way forward
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise
 
Data & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft PlatformsData & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft Platforms
 

KĂŒrzlich hochgeladen

Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night Standamitlee9823
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 đŸ„” Book Your One night Standamitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
BDSM⚡Call Girls in Mandawali Delhi >àŒ’8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >àŒ’8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >àŒ’8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >àŒ’8448380779 Escort ServiceDelhi Call girls
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptDr. Soumendra Kumar Patra
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 

KĂŒrzlich hochgeladen (20)

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Call Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 đŸ„” Book Your One night Stand
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls In Doddaballapur Road ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 đŸ„” Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 đŸ„” Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 đŸ„” Book Your One night Stand
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
BDSM⚡Call Girls in Mandawali Delhi >àŒ’8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >àŒ’8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >àŒ’8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >àŒ’8448380779 Escort Service
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 

SAS Modernization architectures - Big Data Analytics

  • 1. IT STRATEGY FOR SCALABLE ANALYTICS, Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . MODERN DATA ARCHITECTURES
  • 2. Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . MODERN ARCHITECTURES
  • 3. Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . STUNNING FACT Making the Modern World: Materials and Dematerialization - Vaclav Smil
  • 4. Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d .
  • 5. Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . Shift in Mindset Scarcity ‱ Technology constrained ‱ Process-centric ‱ Focus on cost control Everything is forbidden unless it is permitted Abundance ‱ Focus on value ‱ Discovery-centric ‱ Technology empowered Everything is permitted unless it is forbidden
  • 6. Trends Big Data, Storage, Hadoop & In-memory Technology THE PERFECT STORM: STORAGE TECHNOLOGY COSTS AND CPU SPEED Cost of Storage, Memory, Computing ‱ In 2000 a GB of Disk $17 today < $0.07 ‱ In 2000 a GB of Ram $1800 today < $1 ‱ In 2009 a TB of RDBMS was $70K today < $ 20K Cost per Terabyte $- $20 $40 $60 $80 $100 Hadoop Microsoft PDW Oracle Greenplum Teradata Vertica Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . Thousands Today 2009
  • 7. Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . MODERN REALITY ‱ Commoditization ‱ Architectures ‱ Scale Infrastructure ‱New Complex Streams ‱ Perishable Considerations ‱ Cost Data ‱New Category of Business Problems ‱ Analytical Algorithms ‱Operationalization Analytics
  • 8. Finding treasures in unstructured data like social media or survey tools that could uncover insights about consumer sentiment Copyright © 2011, SAS Inst itute Inc. Al l rights reserved. 8 Leveraging historical data to drive better insight into decision-making for the future Mine transaction databases for data of spending patterns that indicate a stolen card.. Analyze massive amounts of data in order to accurately identify areas likely to produce the most profitable results FORECASTING DATA MINING TEXT ANALYTICS OPTIMIZATION STATISTICS ADVANCED ANALYTICS INFORMATION MANAGEMENT
  • 9. Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . CURRENT TRENDS IN ANALYTICS Complex Business Problems Are Driving Analytics Innovation Speed Will Be Of Essence Leverage Analytics To Unlock The Information Contained In Unstructured Data Operationalizing Analytics
  • 10. CURRENT AND FUTURE ARCHITECTURES Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d .
  • 11. WHERE WE ARE TODAY? SETTING THE SCENE Operational Data Sources Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . EDW Data Mart Data Mart Analytic Mart Analytic Mart BI and Analytics Unstructured, Semi-structured and Streaming data (i.e. sensor data) handled often outside the Warehouse flow
  • 12. WHERE DOES HADOOP FIT? HADOOP AS A “NEW DATA” STORE Operational Data Sources Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . EDW Data Mart Data Mart Analytic Mart Analytic Mart BI and Analytics
  • 13. WHERE DOES HADOOP FIT? HADOOP AS AN ADDITIONAL INPUT TO THE EDW Operational Data Sources Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . EDW Data Mart Data Mart Analytic Mart Analytic Mart Analytic Mart Data Mart BI and Analytics
  • 14. WHERE DOES HADOOP FIT? Operational Data Sources EDW Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . HADOOP DATA PLATFORM AS A “STAGING LAYER” AS PART OF A “DATA LAKE” – Downstream stores could be Hadoop, data appliances or an RDBMS Data Mart Data Mart Analytic Mart Analytic Mart BI and Analytics
  • 15. Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . SAS BIG DATA STRATEGY – SAS AREAS 15
  • 16. SAS & HADOOP SASÂź WITHIN THE HADOOP ECOSYSTEM Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . SAS Metadata Base SAS & SAS/ACCESSÂź to Hadoopℱ Impala Next-Gen SASÂź User User Interface Metadata Data Access Data Processing File System SASÂź User SASÂź LASRℱ Analytic Server SASÂź High- Performance Analytic Procedures MPI Based HDFS Pig Map Reduce SASÂź Visual Analytics In-Memory Data Access SASÂź Enterprise Minerℱ SASÂź Data Integration SASÂź Enterprise GuideÂź Hive SAS Embedded Process Accelerators SASÂź In-Memory Statistics for Haodop
  • 17. IN SUMMARY SAS ENABLES THE ENTIRE LIFECYCLE AROUND HADOOP SAS Visual Analytics Decision Manager SAS Scoring Accelerator for Hadoop SAS Code Accelerator for Hadoop Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . IDENTIFY / FORMULATE PROBLEM DATA PREPARATION DATA EXPLORATION TRANSFORM & SELECT BUILD MODEL VALIDATE MODEL DEPLOY MODEL EVALUATE / MONITOR RESULTS SAS Visual Analytics SAS Visual Statistics SAS In-Memory Statistics for Hadoop Done using either the Data Preparation, Data Exploration or Build Model Tools SAS High Performance Analytics Offerings supported by relevant clients like SAS Enterprise Miner, SAS/STAT etc. Decision Manager Done using either the Data Preparation, Data Exploration or Build Model Tools
  • 18. Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . SASÂź VISUAL ANALYTICS A SINGLE SOLUTION FOR DATA DISCOVERY, VISUALIZATION, ANALYTICS AND REPORTING
  • 19. SASÂź VISUAL ANALYTICS Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . EXAMPLE: TEXT ANALYSIS GIVES YOU INSIGHT TO CUSTOMER EXPERIENCE AND OPINION VISUALIZATION POWERED BY SAS ANALYTICS Analytics applied to text provides real MEANING
  • 20. VISUALIZATION EXAMPLES Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d .
  • 21. Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . SASÂź VISUAL STATISTICS
  • 22. DATA TO DECISION LIFECYCLE Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . SASÂź Visual Statistics MANAGE DATA COMPETITIVE ADVANTAGE TEXT EXPLORE DATA DEVELOP MODELS DEPLOY & MONITOR
  • 23. APPLICATION AREAS Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . Segmentation Classification Prediction Ad-hoc Discovery Data Preparation
  • 24. SAS IN-MEMORY STATISTICS FOR HADOOP Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d .
  • 25. SASÂź IN-MEMORY STATISTICS FOR HADOOP Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . WHY IT IS IMPORTANT? SPEED Multi-user interactive analytics environment for increased productivity Proven state-of-the-art statistical algorithms and machine learning techniques Highly scalable, in-memory environment grows easily as needed Memory and data efficient for a significant reduction of data latency to rapidly analyze large and complex data in Hadoop PRECISION SCALABLE INTERACTIVE
  • 26. Copyr i g ht © 2013, SAS Ins t i tut e Inc . Al l r ights reser ve d . sas.com

Hinweis der Redaktion

  1. http://www.theatlantic.com/infocus/2013/08/26-years-of-growth-shanghai-then-and-now/100569/?_ga=1.189576738.867316293.1402968281
  2. We need to change the way we think.
  3. Currently, most organizations use one or more of these techniques to solve a departmental specific business problem. As such, even though valuable, from an enterprise standpoint, the value is somewhat marginalized or not optimized. What we see is that departments, such as Marketing, Finance or Risk are using one technique to solve a problem but not multiple techniques. So, for instance, in a bank, the risk department will use econometric forecasting to manage the treasury portfolio and overall risk
so they think. But they are not proactively looking at text (emails and chat), to see where the bank could be exposed. Or a retailer uses forecasting to predict what will sell, but they are not looking at on line sentiment and customer segmentation to optimize what product they should offer what customer and how to optimize distribution all as part of the same process. These are examples of why these techniques should be pulled together for the greater good.
  4. When we plug them all together this is a simplified view of the classical situation today at many customers. They have data flowing left to right normally in a batch job. Some advanced customers try to move data more or less in real-time from left to right or minimize the time it takes to go from left to right. The data and analytic marts are updated when the EDW is refreshed .. Most of that deals with the structured data. The other sources are normally handled differently if at all outside of that flow. As you can see there are multiple places where data is being stored, in different formats and when you add it all together this is why the data landscape of organizations is normally very expensive. What is clear is that with growing data volumes this is a space where more and more cost is going to be incurred. It therefore makes a lot of sense that Hadoop is being looked at as it promises to get that cost back under control and bring more of this data into one common managed lower cost data architecture!
  5. In the first scenario you see organizations are looking at Hadoop to handle new types of data that are not yet currently under the control of the EDW. This includes unstructured Data, Semi-Structured Data or Data that is “not yet known to be useful”. Companies like this as it does not impact existing warehouse or mart efforts but it allows them to try to extract value from data that they have which may not be being utilized today. In this setup all the “new data” can be brought together in one place irrelevant of format and then experimentation to extract value can start. Mostly, in this scenario, customers are leveraging the HDFS component of Hadoop together with side projects such as Apache Tika which lets you tag unstructured data, much like we do with SAS Content Categorization although not as advanced, so that you can search for documents with key words etc. In general people see Hadoop in this world as a way to support innovative business strategies requiring new data and/or as a way to get existing unstructured and semi-structured data into one governed location at the lowest cost. Given the small number of users and experimental nature of this space it is often where you will find the most coders working on Hadoop.
  6. In the second scenario we find organizations looking to use Hadoop to handle the new types of data as in the last scenario but then to feed new insights into the EDW for mass consumption. Essentially in this model Hadoop is used for unstructured, semi-structured and not yet known to be valuable data. In this scenario again the existing EDW process is not impacted but extra data flows from Hadoop might be added when there is something valuable that needs exposing to the masses. Essentially Hadoop complements the rest of the data strategy and the EDW remains the single source for most users even if they access via a downstream mart. Earlier I mentioned that when something valuable was found in Hadoop it may be added to the EDW. In this scenario things that are useful are often found through the use of BI and Analytics on the data held in Hadoop. Sometimes that will be direct against Hadoop and sometimes there will be some data transformation back out of Hadoop into some other RDBMS or other valid store where people will then work hence why I have drawn some marts there. The number of users against Hadoop in this case will increase over the first scenario but it may still not be for the masses. The real idea here is to contain the cost of a burgeoning EDW by not simply throwing all the data directly there as we would have done in the past until it is found to be useful for the masses thus slowing the warehouse growth. Secondly the hope is that this new environment will provide the organization a very low cost way to incubate innovative business strategies that often require massive volumes and varieties of data which once proven might be supported in a more robust and costly EDW. At the end of the day the name of the game is to move only what is valuable to the expensive EDW store going forwards but at the same time not disrupting what is in place today. Something to remember here is that this is the scenario being encouraged by people like Teradata, IBM and Oracle who now sell Hadoop appliances so do not be surprised to hear this as a way forwards from some customers in the future since it does guarantee continued EDW growth which is something they are very worried about because of the advent of Hadoop.
  7. The last scenario is perhaps the most explosive and the one that requires changes on all sides of the IT landscape. Interestingly I have seen this strategy at a large financial institution as well as at some smaller customers. The idea here is to start to put ALL data into Hadoop first. From there a number of things can happen: One The Hadoop platform might just be used as a place to land, transform and cleanse data before building the more traditional EDW and Data or Analytic Marts. If you like a powerful staging layer. Effectively what this does is allows you to offload all the data transformations from EDW and Mart process to Hadoop so that you are essentially able to leverage all the power of the set of computers to prepare data faster than ever and then just copy it to where it needs to go for downstream applications to use. HSBC, who were spoken about in a previous session, gave an example of this usage scenario having got an existing batch job to go from 3 hours to 10 mins on Hadoop. Just think about Hadoop becoming the ETL engine for all data in an organization! Two In some cases the idea of downstream marts goes away and all historical and detailed data is kept forever accessible in Hadoop making it the EDW. This is the ultimate aim of the Hadoop vendors and why the EDW and data appliance guys are so worried. The truth is getting rid of some sort of EDW is not going to happen anytime soon because of the relative immaturity of parts of the Hadoop ecosystem which make it not really suitable for a number of regulatory type workloads. So in my opinion it is very likely we will EDW’s continue for some time and that we will see marts built off Hadoop to keep things running today and continue to leverage previous investments. At the end of the day you will hear the phrase “data lake”. In some companies this means moving all their data to Hadoop and in other companies it is the term used to describe their next generation data landscape where Hadoop plays a role.
  8. SAS enables the entire lifecycle around Hadoop. In a Bi/Reporting context, this can mean a traditional, Business as Usual approach, using SAS/Access to access Hadoop as a data store, just like we do with an RDBMS and accessing with a SAS client such as Enterprise Guide. This can also mean a transformational approach, leveraging in-memory Hadoop architectures for unprecedented performance and interactive visualization capabilities with Visual Analytics. In an analytics context, this can also mean a traditional, Business as usual approach, using SAS/Access to access Hadoop as a data store with SAS/STAT or Enterprise Miner. This can also mean a transformational approach, leveraging in-memory Hadoop architectures for unprecendeted performance (via LASR server), advanced analytic exploration capabilities (via SAS In-Memory Statistics for Hadoop) and advanced analytic prototyping and visualization (via SAS Visual Statistics). Finally, this also means exploiting Hadoop for operational analytics by leveraging in-database technologies to score inside of Hadoop (using the SAS Scoring Accelerator for Hadoop and the SAS Code Accelerator for Hadoop). We should also continue to emphasize how the SAS High-Performance Analytics Server Products, in tandem with SAS Enterprise Miner and SAS Decision Manager, allow organizations to take analytically-derived strategic insights and push them to decision points throughout the organization. This integrated ability to centralize and operationalize analytics, remains one of SAS’ key differentiators against different types of key competitors.
  9. Updated: MARCH 2014 for Visual Analytics release of v6.4
  10. Example: Applying text analysis to twitter streams, or to customer comments in call logs, can give quick insight into the “hot topics” discussed. It’s more than a simple world=cloud that shows which topics are being discussed, but the analytics applied behind the scenes determine which words are used most frequently –so you can determine which topics are the most “important” and warrant further understanding/exploration.
  11. SAS Visual Statistics 6.4 Release Date/Month: July 2014 Contact: Tapan Patel
  12. Irrespective of big data or large data, every analytics project should go through the iterative analytics (data to decision) lifecycle. Typically four steps involved are: manage/prepare data, explore/visualize, model and deploy & monitor. The role of SAS Visual Statistics is to (primarily) address the data exploration/visualization and model development stages of the analytics lifecycle. It allows customers to understand on why certain events, outcomes happen and what are the key relationships. Users ask for more interactions from the data, demand drill-down, etc. to identify the root cause and use the information to build predictive models. It allows customers to build and refine predictive models to assess a future outcome and explain what will happen? For example, is the transaction fraudulent or not or to assess future risk of repayment or how risky is the portfolio given certain conditions in future? Users can dynamically see the impact of changing model properties/parameters and fine tune the model to arrive at the desired results. Of course SAS Visual Statistics also provides the capability to generate score code for deployment purposes.
  13. Clustering is the task of segmenting a heterogeneous population into a number of more homogenous subgroups or clusters. Segmentation does not rely on predefined classes or examples. The records are grouped together on the basis of self-similarity. Clustering is often done as a prelude to some other form of data mining. For example, market segmentation – cluster of customers with similar buying habits and find out which promotion would work best. Classification deals with prediction of discrete outcomes: Yes/No, Churn/No churn, Fraud/No Fraud, credit applicants as low, medium or high risk. Estimation is another form of classification task wherein it deals with continuously valued outcomes (i.e. individual records are rank ordered). Examples – estimating credit card balance, propensity to purchase, probability that someone will respond to balance transfer notification, etc. Prediction deals with classifying records according to some future behavior or estimated future value
for example, predicting the size of the balance that will be transferred if a credit card prospects accepts a balance transfer offer, predicting box office receipts SAS Visual Analytics provides the core capability for ad-hoc data discovery, data exploration and basis data preparation required for model building.
  14. Precision - You can leverage the most proven and state-of-the-art analytical algorithms, text analytics, forecasting, recommendation engine, and machine learning techniques to get the best business results Scalable – As you data, users and problems get more complex, we can scale. Speed – Memory and data efficient for a significant reduction of data latency to rapidly analyze large and complex data in Hadoop Interactive - Multi-user interactive analytics environment for increased productivity Â