SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Building Enterprise
Advance Analytics Platform
SoCal Data Science Conference 09.25.2016
Raymond Fu
Practice Architect
Trace3
T3
22
Raymond Fu
Practice Architect, Trace3
16 years of IT experience specializing in big data, business intelligence, and
enterprise architecture. 10 year corporate career with Bank of America
highlighted by leading many data integrations and warehousing initiatives from
mergers and acquisitions.
Founded his own technology company Xceed Consulting Group in 2012 enabling
data driven solutions.
Joined California based consulting company Trace3 in 2016 as a practice architect
for the Data Intelligence team.
Blog: Everything About Data
Twitter: @RaymondxFu
• Typically, organizations got a firm grasp on required People, Process, and
Technology to deliver capabilities, articulate end-to-end roadmap, identify
platforms and resources.
• Big Data disrupts the traditional architecture paradigm. Organizations may have
an idea or interest, but they don’t necessarily know what will come out of it.
• The answer or outcome for an initial question will trigger the next set of
questions. It requires a unique combination of skill sets, the likes of which are new
and not in abundance.
• The pursuit of the answer is advanced analytics.
Big Data Disruption
3
Advanced Analytics Definition
• The process, tools, technology, and collaboration to create predictive
models that enable/drive strategic and operational decisions. The
predictive models (1) generate insights and hypotheses and (2) test/score
them through experiments, so organizations KNOW what works better.
• Predictive models are created using machine learning, deep learning,
advanced data management tools and visualization tools
• An integral part of Advanced Analytics includes the operationalization of the
predictive models so they can be rapidly scored and decisioned at scale
Advanced Analytics Relevancy
5
Organizations’ goals
Advanced Analytics’ goals
What’s different today
Obstacles to the goals
Advanced Analytics Process
6
• Domain
knowledge
• Hypothesis
development
• Model architecture
• Algorithm selection and development
• Feature engineering
• Visualization
Collaboration
Reproducibility
• Data mining
• Statistical data shaping
• Training
• Cross-validation testing
• Environment and libraries
Production
feature
generation,
modeling, testing
Deployment
Parallel
experiments
• Performance
assessment
• Connectivity
• Landing
• Ingestion
• Knowledge
• Preparation
Business metric
assessment
Data
management
Analytics creation
(business modeling)
Analytics operationalization
(model production and deployment)
Organization
and business
impact
• Continuous
integration
and
deployment
• Model iteration
and redeployment
IT/DE, DS LoB, DS DS, IT/DE, LoB LoB, DS, IT/DE
• R-T and batch
scoring
• Decisioning
Enterprise Big Data Strategy
• Information management
• Data architecture, data governance and meta data management.
• Address key issues such as data integration and data quality.
• Data platform modernization
• Enterprise data warehouse offload.
• Data lake platform assessment.
• Advanced Analytics
• Methodology
• Tools recommendation
• Operationalization
• Step 1 – Establish Business Context and Scope (incubate ideas)
• Step 2 – Establish an Architecture Vision
• Step 3 – Assess the Current State
• Step 4 – Establish Future State and Economic Model
• Step 5 – Develop a Strategic Roadmap
• Step 6 – Establish Governance over the Architecture
Enterprise Architecture Approach
Establishing an Architecture Vision
9
The architecture development process needs to be more fluid and different from SDLC-like
architecture process. It must allow organizations to continuously assess progress, correct
course where needed, balance cost, and gain acceptance.
Advanced Analytics Capabilities
10
Category Capability Items
Organization and
business impact
Fast, informed
decisions
• Time from question to hypothesis to model implementation to informed decision
Strategic and
operational
role
• Degree of input into business/policy decisions
• Perceived and quantified value of analytics
Analytics
operationalization
Model
performance
• Execution of experiments in parallel
• Model performance for scoring and decisioning
Model
deployment
• Continuous integration and deployment
Analytics creation
Efficient model
creation
• Use of data mining and visualization tools
• Rapidly spun-up environment customized to individual data scientists that enables execution of large data sets and highly
mathematical algorithms
• Collaboration among data scientists and between data scientist and lines of business; reuse of data sets and models
• Model reproducibility (including versions, algorithms, data sets, parameters, notes, environment)
Appropriate
model selection
• Understanding, and appropriate use, of model architecture and algorithms, feature engineering, hyper parameterization,
statistical and mathematical concepts, training and validation, scoring, and decisioning
• Use of ML and DL concepts, tools, and libraries
• Use of graph systems
Data
management
Data capability • Infrastructure and tools to access and cleanse data
Data
knowledge and
confidence
• Understanding of, and confidence in, data (e.g. what is available, their relationships)
Data access • Access to internal and external data through infrastructure, logical associations, and tools
Enterprise Information Management Capabilities
11
Advanced Analytics Reference Architect
12
13
Structured data source Unstructured data source
RDBM
S
Big
Data
Business Intelligence / Data Visualization Advanced
Analytics
HDFS NoSQL Cloud Storage
ETLETL
Teradata
Operation
CRM ERP Accounting Clickstream Sensor Info Images/Video Event Logs Social Media
Tools
Real-time
Streaming
Library (ML and DL) Online ML
AWS
Azure
torch
Machine Learning API
Google Prediction
AWS
Azure
BigML
IBM Watson
Advanced Analytics Services
14
Service Type Services
Overall
Assessment
• Advanced Analytics assessment
Architecture
• Architecture for data science
• Architecture for cloud analytics
ETL/ELT
• Data source identification and
integration
• Data virtualization
• Data preparation
Data analysis
and modeling
(data science)
• Statistical / quantitative analysis
• Descriptive analysis
• Predictive modeling
• Machine learning
• Deep learning
• Graph systems
• Simulation and optimization
Service Type Services
Visualization and
insight
presentation and
recommendations
• Data exploration / mining / advanced
visualization to understand the data
• Insight presentation and recommendations
Tools
recommendation
• Infrastructure
• Software tools
• Software environment, programming, libraries
Process
improvement
• Analytics process improvement
• Data governance
• Model governance
• Continuous integration and deployment of
models
Organizational
capabilities
• Advanced analytics organization structure and
roles
• Advanced analytics training
• Advanced analytics staff augmentation
Best Practice
15
• Align Analytics with Specific Business Goals
• Ease Skills Shortage with Standards and Governance
• Optimize Knowledge Transfer with a Center of Excellence
• Top Payoff is Aligning Unstructured with Structured Data
• Plan Your Discovery Lab for Performance
• Align with the Cloud Operating Model
Example 1: Oracle
16
Example 2: Google Cloud Platform – Building
Blocks
17
Example 2: Google Cloud Platform – Stepping Stone
18
Thank you! 19

Weitere ähnliche Inhalte

Was ist angesagt?

Microsoft Business Intelligence - Practical Approach & Overview
Microsoft Business Intelligence - Practical Approach & OverviewMicrosoft Business Intelligence - Practical Approach & Overview
Microsoft Business Intelligence - Practical Approach & Overview
Li Ken Chong
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
Caserta
 
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse StrategiesData-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
DATAVERSITY
 

Was ist angesagt? (20)

The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data Lake
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
 
Microsoft Business Intelligence - Practical Approach & Overview
Microsoft Business Intelligence - Practical Approach & OverviewMicrosoft Business Intelligence - Practical Approach & Overview
Microsoft Business Intelligence - Practical Approach & Overview
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
 
Spring 2017 Sage 300 (Accpac) Users Group
Spring 2017 Sage 300 (Accpac) Users GroupSpring 2017 Sage 300 (Accpac) Users Group
Spring 2017 Sage 300 (Accpac) Users Group
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
 
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
 
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data EstateEnable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
 
A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...
 
Predictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupPredictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing Meetup
 
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
 
The Business Value of Big Data
The Business Value of Big DataThe Business Value of Big Data
The Business Value of Big Data
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data Lake
 
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
 
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse StrategiesData-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefits
 

Andere mochten auch

Andere mochten auch (7)

Building A Self Service Analytics Platform on Hadoop
Building A Self Service Analytics Platform on HadoopBuilding A Self Service Analytics Platform on Hadoop
Building A Self Service Analytics Platform on Hadoop
 
Accelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success StoriesAccelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success Stories
 
Big data it’s impact on the finance function
Big data it’s impact on the finance functionBig data it’s impact on the finance function
Big data it’s impact on the finance function
 
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop ProfessionalsBest Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
 
Data Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesData Warehouse Design and Best Practices
Data Warehouse Design and Best Practices
 
Webinar | Using Big Data and Predictive Analytics to Empower Distribution and...
Webinar | Using Big Data and Predictive Analytics to Empower Distribution and...Webinar | Using Big Data and Predictive Analytics to Empower Distribution and...
Webinar | Using Big Data and Predictive Analytics to Empower Distribution and...
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 

Ähnlich wie Building enterprise advance analytics platform

Ähnlich wie Building enterprise advance analytics platform (20)

ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
Gse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedGse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-shared
 
Data-Ed Slides: Data Modeling Strategies - Getting Your Data Ready for the Ca...
Data-Ed Slides: Data Modeling Strategies - Getting Your Data Ready for the Ca...Data-Ed Slides: Data Modeling Strategies - Getting Your Data Ready for the Ca...
Data-Ed Slides: Data Modeling Strategies - Getting Your Data Ready for the Ca...
 
Geek Sync I Does Data Modeling Have Business Value?
Geek Sync I Does Data Modeling Have Business Value?Geek Sync I Does Data Modeling Have Business Value?
Geek Sync I Does Data Modeling Have Business Value?
 
MDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large EnterprisesMDM & BI Strategy For Large Enterprises
MDM & BI Strategy For Large Enterprises
 
DevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-OracleDevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-Oracle
 
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav MisraFrom Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
 
Lean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science teamLean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science team
 
Trends in Enterprise Advanced Analytics
Trends in Enterprise Advanced AnalyticsTrends in Enterprise Advanced Analytics
Trends in Enterprise Advanced Analytics
 
Team Data Science Process Presentation (TDSP), Aug 29, 2017
Team Data Science Process Presentation (TDSP), Aug 29, 2017Team Data Science Process Presentation (TDSP), Aug 29, 2017
Team Data Science Process Presentation (TDSP), Aug 29, 2017
 
SPSChicagoBurbs 2019 - What is CDM and CDS?
SPSChicagoBurbs 2019 - What is CDM and CDS?SPSChicagoBurbs 2019 - What is CDM and CDS?
SPSChicagoBurbs 2019 - What is CDM and CDS?
 
ODSC East 2018
ODSC East 2018ODSC East 2018
ODSC East 2018
 
Why an AI-Powered Data Catalog Tool is Critical to Business Success
Why an AI-Powered Data Catalog Tool is Critical to Business SuccessWhy an AI-Powered Data Catalog Tool is Critical to Business Success
Why an AI-Powered Data Catalog Tool is Critical to Business Success
 
Empowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog RequirementsEmpowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog Requirements
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of ML
 
What Data Do You Have and Where is It?
What Data Do You Have and Where is It? What Data Do You Have and Where is It?
What Data Do You Have and Where is It?
 
Data-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling FundamentalsData-Ed Webinar: Data Modeling Fundamentals
Data-Ed Webinar: Data Modeling Fundamentals
 
Adding Hadoop to Your Analytics Mix?
Adding Hadoop to Your Analytics Mix?Adding Hadoop to Your Analytics Mix?
Adding Hadoop to Your Analytics Mix?
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data Scientist
 

Kürzlich hochgeladen

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Building enterprise advance analytics platform

  • 1. Building Enterprise Advance Analytics Platform SoCal Data Science Conference 09.25.2016 Raymond Fu Practice Architect Trace3 T3
  • 2. 22 Raymond Fu Practice Architect, Trace3 16 years of IT experience specializing in big data, business intelligence, and enterprise architecture. 10 year corporate career with Bank of America highlighted by leading many data integrations and warehousing initiatives from mergers and acquisitions. Founded his own technology company Xceed Consulting Group in 2012 enabling data driven solutions. Joined California based consulting company Trace3 in 2016 as a practice architect for the Data Intelligence team. Blog: Everything About Data Twitter: @RaymondxFu
  • 3. • Typically, organizations got a firm grasp on required People, Process, and Technology to deliver capabilities, articulate end-to-end roadmap, identify platforms and resources. • Big Data disrupts the traditional architecture paradigm. Organizations may have an idea or interest, but they don’t necessarily know what will come out of it. • The answer or outcome for an initial question will trigger the next set of questions. It requires a unique combination of skill sets, the likes of which are new and not in abundance. • The pursuit of the answer is advanced analytics. Big Data Disruption 3
  • 4. Advanced Analytics Definition • The process, tools, technology, and collaboration to create predictive models that enable/drive strategic and operational decisions. The predictive models (1) generate insights and hypotheses and (2) test/score them through experiments, so organizations KNOW what works better. • Predictive models are created using machine learning, deep learning, advanced data management tools and visualization tools • An integral part of Advanced Analytics includes the operationalization of the predictive models so they can be rapidly scored and decisioned at scale
  • 5. Advanced Analytics Relevancy 5 Organizations’ goals Advanced Analytics’ goals What’s different today Obstacles to the goals
  • 6. Advanced Analytics Process 6 • Domain knowledge • Hypothesis development • Model architecture • Algorithm selection and development • Feature engineering • Visualization Collaboration Reproducibility • Data mining • Statistical data shaping • Training • Cross-validation testing • Environment and libraries Production feature generation, modeling, testing Deployment Parallel experiments • Performance assessment • Connectivity • Landing • Ingestion • Knowledge • Preparation Business metric assessment Data management Analytics creation (business modeling) Analytics operationalization (model production and deployment) Organization and business impact • Continuous integration and deployment • Model iteration and redeployment IT/DE, DS LoB, DS DS, IT/DE, LoB LoB, DS, IT/DE • R-T and batch scoring • Decisioning
  • 7. Enterprise Big Data Strategy • Information management • Data architecture, data governance and meta data management. • Address key issues such as data integration and data quality. • Data platform modernization • Enterprise data warehouse offload. • Data lake platform assessment. • Advanced Analytics • Methodology • Tools recommendation • Operationalization
  • 8. • Step 1 – Establish Business Context and Scope (incubate ideas) • Step 2 – Establish an Architecture Vision • Step 3 – Assess the Current State • Step 4 – Establish Future State and Economic Model • Step 5 – Develop a Strategic Roadmap • Step 6 – Establish Governance over the Architecture Enterprise Architecture Approach
  • 9. Establishing an Architecture Vision 9 The architecture development process needs to be more fluid and different from SDLC-like architecture process. It must allow organizations to continuously assess progress, correct course where needed, balance cost, and gain acceptance.
  • 10. Advanced Analytics Capabilities 10 Category Capability Items Organization and business impact Fast, informed decisions • Time from question to hypothesis to model implementation to informed decision Strategic and operational role • Degree of input into business/policy decisions • Perceived and quantified value of analytics Analytics operationalization Model performance • Execution of experiments in parallel • Model performance for scoring and decisioning Model deployment • Continuous integration and deployment Analytics creation Efficient model creation • Use of data mining and visualization tools • Rapidly spun-up environment customized to individual data scientists that enables execution of large data sets and highly mathematical algorithms • Collaboration among data scientists and between data scientist and lines of business; reuse of data sets and models • Model reproducibility (including versions, algorithms, data sets, parameters, notes, environment) Appropriate model selection • Understanding, and appropriate use, of model architecture and algorithms, feature engineering, hyper parameterization, statistical and mathematical concepts, training and validation, scoring, and decisioning • Use of ML and DL concepts, tools, and libraries • Use of graph systems Data management Data capability • Infrastructure and tools to access and cleanse data Data knowledge and confidence • Understanding of, and confidence in, data (e.g. what is available, their relationships) Data access • Access to internal and external data through infrastructure, logical associations, and tools
  • 13. 13 Structured data source Unstructured data source RDBM S Big Data Business Intelligence / Data Visualization Advanced Analytics HDFS NoSQL Cloud Storage ETLETL Teradata Operation CRM ERP Accounting Clickstream Sensor Info Images/Video Event Logs Social Media Tools Real-time Streaming Library (ML and DL) Online ML AWS Azure torch Machine Learning API Google Prediction AWS Azure BigML IBM Watson
  • 14. Advanced Analytics Services 14 Service Type Services Overall Assessment • Advanced Analytics assessment Architecture • Architecture for data science • Architecture for cloud analytics ETL/ELT • Data source identification and integration • Data virtualization • Data preparation Data analysis and modeling (data science) • Statistical / quantitative analysis • Descriptive analysis • Predictive modeling • Machine learning • Deep learning • Graph systems • Simulation and optimization Service Type Services Visualization and insight presentation and recommendations • Data exploration / mining / advanced visualization to understand the data • Insight presentation and recommendations Tools recommendation • Infrastructure • Software tools • Software environment, programming, libraries Process improvement • Analytics process improvement • Data governance • Model governance • Continuous integration and deployment of models Organizational capabilities • Advanced analytics organization structure and roles • Advanced analytics training • Advanced analytics staff augmentation
  • 15. Best Practice 15 • Align Analytics with Specific Business Goals • Ease Skills Shortage with Standards and Governance • Optimize Knowledge Transfer with a Center of Excellence • Top Payoff is Aligning Unstructured with Structured Data • Plan Your Discovery Lab for Performance • Align with the Cloud Operating Model
  • 17. Example 2: Google Cloud Platform – Building Blocks 17
  • 18. Example 2: Google Cloud Platform – Stepping Stone 18