SlideShare ist ein Scribd-Unternehmen logo
1 von 20
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Comprehensive Big Data Architecture Made Easy:
The AWS Marketplace Intelligent Analytical System
Luis Daniel Soto, AWS.
@luisdans
AWS Re:INVENT HANDS-ON WORKSHOP
Kim Schmidt, President & CIO/Dataleader.io
@dataleader
GPSWKS301
November 28, 2017
AWS re:INVENT
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Big Data Workshop Audience and Goals
Workshop Audience:
• APN Consulting Partners
• Organizations of all sizes
Workshop Goals:
• Show you the benefits of integrating AWS services and solutions from AWS
Marketplace
• Exercise 1: Build a data pipeline that will stream live data into Amazon
Redshift
• Exercise 2: Leverage machine learning to generate predictive analytics
• Learn how to avoid common errors when building a Big Data Architecture
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
What we’ll be doing today
1. Introduction (5 minutes)
2. Collect and Store Data (60 minutes)
• DEMO #1: Extending on-premise data load to AWS
• COLLABORATIVE TEAM EXERCISE: Building a data pipeline
3. Transform & Analyze (55 minutes)
• DEMO #2: Orchestrate, transform and aggregate data on Amazon Redshift
• COLLABORATIVE TEAM EXERCISE: Predictive Analytics with Machine Learning
• DEMO #3: Visualization of prediction output for real-time
4. Operations Management (25 minutes)
• PRESENTATION: AWS Marketplace Intelligent Analytical System
• DISCUSSION: Other challenges on building a end-to-end Big Data architecture
5. Workshop Wrap-up (5 minutes)
Collect & Store Transform
& Analyze
Operations
Management
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
GENERATE COLLECT STORE ANALYZE/PROCESS CONSUME
Increasing variety of
sources
Little visibility into
new data
Myopic view of
existing data
Difficulty
consolidating data
across different
sources and locations
Challenges
normalizing/
transforming/
aggregating data into
a standardized
format
Unable to capture
and/or process data
as quickly as it is
being generated
Unfamiliarity with
modern data
management
techniques
Lack of necessary skills
to implement and
maintain new
technologies
Scaling IT
infrastructure
Inability to process
data in a timely
manner once its
needed
Limited resources
and capabilities to
experiment and
iterate
Processing all data in
various formats
Predicting future
required capacity
Make more intelligent
business decisions
Limited adoption due
to rigidity and
inflexibility of legacy
BI tools
Be able to run queries
quickly
Get to data-driven
results faster
What we hear from our customers
Big Data Architecture
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Immediate availability
Broad & deep capabilities
Trusted & secure
Large partner ecosystem
http://aws.amazon.com/mp
Big Data on AWS iAS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
DEMO 1: EXTEND YOUR ON-PREMISES
DATA MANAGEMENT TO AWS
Transform
& Analyze
Operations
Management
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
Collect & Store
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
SoftNAS Cloud High-Performance Cloud NAS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
• Software-only NAS virtual appliance
built for the cloud
• Full protocol support: CIFS, NFS, AFP,
iSCSI
• High Availability
• Snapshots / Rollbacks
• Replication
• Deduplication
• Compression
• Price and Performance Tunable
• Active Directory / LDAP integration
• Scales from Gigabytes to Petabytes
SoftNAS Cloud Virtual NAS Overview
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Transform
& Analyze
Operations
Management
Collect & Store
WORKSHOP ACTIVITY 1: CREATING A
REAL-TIME STREAMING DATA PIPELINE
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
1. See how to use serverless AWS Lambda to transform streaming data from a
sample weblog generator
2. Define an Amazon Kinesis Firehose Delivery Stream
3. Set up Amazon Kinesis Streams
4. Monitor the live stream using Amazon CloudWatch
5. Query the data in Amazon Redshift via a customized command line
Collaborative Team Exercise #1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operations
Management
Collect & Store
DEMO 2: MPP DATA TRANSFORMATION
& ORCHESTRATION
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
Transform
& Analyze
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operations
Management
Collect & Store
WORKSHOP ACTIVITY 2:
PREDICTIVE ANALYTICS
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
Transform
& Analyze
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
1. Query data in Amazon S3
2. Design a predictive scoring formula using PredicSis.ai
3. Compute predictive models to apply to real-time customer data that can
be used to discover outliers who are ready right now to buy a Tesla
Collaborative Team Exercise #2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operations
Management
Collect & Store
DEMO 3: REAL-TIME DATA
VISUALIZATIONS AND ALERTS
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
Transform
& Analyze
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Collect & Store
OPERATIONS MANAGEMENT
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
Transform
& Analyze
Operations
Management
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
iAS: System Architecture and Design
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
GROUP DISCUSSION
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
• What challenges is my customer/organization facing
in building a Big Data Architecture?
• What capabilities is my organization missing?
• What parts would I want to further customize?
Collect & Store Transform
& Analyze
Operations
Management
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Key Takeaways
There is an increase of importance on new Big Data capabilities including:
• Securely combining data residing on-premises with Cloud applications
• Abandon historical-only business analysis to include predictive and other modern analytics to
immediately act upon key business insights
• Break down data silos and harness data in real time from all over the globe
• The importance of building a modular Big Data architecture
AWS Marketplace enables agility and experimentation
• Combining AWS services with solutions in AWS Marketplace are “pieces of the puzzle” that
can be replaced whenever newer products and services are released
• Easily evaluate innovative software solutions, pay only for what you use
Download the step-by-step guide for the Intelligent Analytical System (iAS)
• https://aws.amazon.com/mp/mp_solution
• https://aws-kimsshmidt.com
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thank you!
@ l u i s d a n s
@ d a t a l e a d e r
http://aws.amazon.com/mp

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Big Data on AWS - Toronto FSI Symposium - October 2016
Big Data on AWS - Toronto FSI Symposium - October 2016Big Data on AWS - Toronto FSI Symposium - October 2016
Big Data on AWS - Toronto FSI Symposium - October 2016
 
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018
 
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
Social Media Analytics with Amazon QuickSight (ANT370) - AWS re:Invent 2018
 
Managed NoSQL databases
Managed NoSQL databasesManaged NoSQL databases
Managed NoSQL databases
 
Best Practices to Secure Data Lake on AWS (ANT327) - AWS re:Invent 2018
Best Practices to Secure Data Lake on AWS (ANT327) - AWS re:Invent 2018Best Practices to Secure Data Lake on AWS (ANT327) - AWS re:Invent 2018
Best Practices to Secure Data Lake on AWS (ANT327) - AWS re:Invent 2018
 
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
 
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...
Business Intelligence in Minutes with Amazon Athena and Amazon QuickSight - A...
 
Security Challenges and Use Cases in the Modern Application Build-and-Deploy ...
Security Challenges and Use Cases in the Modern Application Build-and-Deploy ...Security Challenges and Use Cases in the Modern Application Build-and-Deploy ...
Security Challenges and Use Cases in the Modern Application Build-and-Deploy ...
 
Which Database is Right for Your Serverless Application (ARC215) - AWS re:Inv...
Which Database is Right for Your Serverless Application (ARC215) - AWS re:Inv...Which Database is Right for Your Serverless Application (ARC215) - AWS re:Inv...
Which Database is Right for Your Serverless Application (ARC215) - AWS re:Inv...
 
Welcome & AWS Big Data Solution Overview
Welcome & AWS Big Data Solution OverviewWelcome & AWS Big Data Solution Overview
Welcome & AWS Big Data Solution Overview
 
Visualization with Amazon QuickSight
Visualization with Amazon QuickSightVisualization with Amazon QuickSight
Visualization with Amazon QuickSight
 
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
 
Architecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the EnterpriseArchitecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the Enterprise
 
One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...
One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...
One Data Lake, Many Uses: Enabling Multi-Tenant Analytics with Amazon EMR (AN...
 
12 Steps to Cloud
12 Steps to Cloud12 Steps to Cloud
12 Steps to Cloud
 
Visualization with Amazon QuickSight
Visualization with Amazon QuickSightVisualization with Amazon QuickSight
Visualization with Amazon QuickSight
 
Data Privacy & Governance in the Age of Big Data: Deploy a De-Identified Data...
Data Privacy & Governance in the Age of Big Data: Deploy a De-Identified Data...Data Privacy & Governance in the Age of Big Data: Deploy a De-Identified Data...
Data Privacy & Governance in the Age of Big Data: Deploy a De-Identified Data...
 
Building Your Data Lake on AWS - Level 200
Building Your Data Lake on AWS - Level 200Building Your Data Lake on AWS - Level 200
Building Your Data Lake on AWS - Level 200
 
Enabling a Digital Platform with Microservices Architecture (ARC218-S) - AWS ...
Enabling a Digital Platform with Microservices Architecture (ARC218-S) - AWS ...Enabling a Digital Platform with Microservices Architecture (ARC218-S) - AWS ...
Enabling a Digital Platform with Microservices Architecture (ARC218-S) - AWS ...
 

Ähnlich wie Comprehensive Big Data Analytics Architecture Made Easy - The AWS Marketplace Intelligent Analytical System - GPSWKS301 - re:Invent 2017

Ähnlich wie Comprehensive Big Data Analytics Architecture Made Easy - The AWS Marketplace Intelligent Analytical System - GPSWKS301 - re:Invent 2017 (20)

How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...
How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...
How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...
 
MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...
MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...
MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...
 
Automating Big Data Technologies for Faster Time-to-Value
 Automating Big Data Technologies for Faster Time-to-Value Automating Big Data Technologies for Faster Time-to-Value
Automating Big Data Technologies for Faster Time-to-Value
 
Architecting an Open Data Lake for the Enterprise
 Architecting an Open Data Lake for the Enterprise  Architecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the Enterprise
 
TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
 TiVo: How to Scale New Products with a Data Lake on AWS and Qubole TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
 
TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
 TiVo: How to Scale New Products with a Data Lake on AWS and Qubole TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
 
Leveraging Data Analytics in the Cloud to Support Data-Driven Decisions
Leveraging Data Analytics in the Cloud to Support Data-Driven DecisionsLeveraging Data Analytics in the Cloud to Support Data-Driven Decisions
Leveraging Data Analytics in the Cloud to Support Data-Driven Decisions
 
Fanatics Ingests Streaming Data to a Data Lake on AWS
Fanatics Ingests Streaming Data to a Data Lake on AWSFanatics Ingests Streaming Data to a Data Lake on AWS
Fanatics Ingests Streaming Data to a Data Lake on AWS
 
McGraw-Hill Optimizes Analytics Workloads with Databricks
 McGraw-Hill Optimizes Analytics Workloads with Databricks McGraw-Hill Optimizes Analytics Workloads with Databricks
McGraw-Hill Optimizes Analytics Workloads with Databricks
 
利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料
 
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of ManufacturingGPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
 
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
 
Amazon Web Services
Amazon Web ServicesAmazon Web Services
Amazon Web Services
 
Citrix Moves Data to Amazon Redshift Fast with Matillion ETL
 Citrix Moves Data to Amazon Redshift Fast with Matillion ETL Citrix Moves Data to Amazon Redshift Fast with Matillion ETL
Citrix Moves Data to Amazon Redshift Fast with Matillion ETL
 
How Amazon.com Uses AWS Analytics: Data Analytics Week SF
How Amazon.com Uses AWS Analytics: Data Analytics Week SFHow Amazon.com Uses AWS Analytics: Data Analytics Week SF
How Amazon.com Uses AWS Analytics: Data Analytics Week SF
 
How Amazon uses AWS Analytics
How Amazon uses AWS AnalyticsHow Amazon uses AWS Analytics
How Amazon uses AWS Analytics
 
規劃大規模遷移到 AWS 的最佳實踐
規劃大規模遷移到 AWS 的最佳實踐規劃大規模遷移到 AWS 的最佳實踐
規劃大規模遷移到 AWS 的最佳實踐
 
How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...
How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...
How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...
 
Visualizing Big Data Insights with Amazon QuickSight
Visualizing Big Data Insights with Amazon QuickSightVisualizing Big Data Insights with Amazon QuickSight
Visualizing Big Data Insights with Amazon QuickSight
 
How Amazon.com Uses AWS Analytics
How Amazon.com Uses AWS AnalyticsHow Amazon.com Uses AWS Analytics
How Amazon.com Uses AWS Analytics
 

Mehr von Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Mehr von Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Comprehensive Big Data Analytics Architecture Made Easy - The AWS Marketplace Intelligent Analytical System - GPSWKS301 - re:Invent 2017

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Comprehensive Big Data Architecture Made Easy: The AWS Marketplace Intelligent Analytical System Luis Daniel Soto, AWS. @luisdans AWS Re:INVENT HANDS-ON WORKSHOP Kim Schmidt, President & CIO/Dataleader.io @dataleader GPSWKS301 November 28, 2017 AWS re:INVENT
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Big Data Workshop Audience and Goals Workshop Audience: • APN Consulting Partners • Organizations of all sizes Workshop Goals: • Show you the benefits of integrating AWS services and solutions from AWS Marketplace • Exercise 1: Build a data pipeline that will stream live data into Amazon Redshift • Exercise 2: Leverage machine learning to generate predictive analytics • Learn how to avoid common errors when building a Big Data Architecture
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. What we’ll be doing today 1. Introduction (5 minutes) 2. Collect and Store Data (60 minutes) • DEMO #1: Extending on-premise data load to AWS • COLLABORATIVE TEAM EXERCISE: Building a data pipeline 3. Transform & Analyze (55 minutes) • DEMO #2: Orchestrate, transform and aggregate data on Amazon Redshift • COLLABORATIVE TEAM EXERCISE: Predictive Analytics with Machine Learning • DEMO #3: Visualization of prediction output for real-time 4. Operations Management (25 minutes) • PRESENTATION: AWS Marketplace Intelligent Analytical System • DISCUSSION: Other challenges on building a end-to-end Big Data architecture 5. Workshop Wrap-up (5 minutes) Collect & Store Transform & Analyze Operations Management
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. GENERATE COLLECT STORE ANALYZE/PROCESS CONSUME Increasing variety of sources Little visibility into new data Myopic view of existing data Difficulty consolidating data across different sources and locations Challenges normalizing/ transforming/ aggregating data into a standardized format Unable to capture and/or process data as quickly as it is being generated Unfamiliarity with modern data management techniques Lack of necessary skills to implement and maintain new technologies Scaling IT infrastructure Inability to process data in a timely manner once its needed Limited resources and capabilities to experiment and iterate Processing all data in various formats Predicting future required capacity Make more intelligent business decisions Limited adoption due to rigidity and inflexibility of legacy BI tools Be able to run queries quickly Get to data-driven results faster What we hear from our customers Big Data Architecture
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Immediate availability Broad & deep capabilities Trusted & secure Large partner ecosystem http://aws.amazon.com/mp Big Data on AWS iAS
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. DEMO 1: EXTEND YOUR ON-PREMISES DATA MANAGEMENT TO AWS Transform & Analyze Operations Management AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM Collect & Store
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. SoftNAS Cloud High-Performance Cloud NAS
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. • Software-only NAS virtual appliance built for the cloud • Full protocol support: CIFS, NFS, AFP, iSCSI • High Availability • Snapshots / Rollbacks • Replication • Deduplication • Compression • Price and Performance Tunable • Active Directory / LDAP integration • Scales from Gigabytes to Petabytes SoftNAS Cloud Virtual NAS Overview
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Transform & Analyze Operations Management Collect & Store WORKSHOP ACTIVITY 1: CREATING A REAL-TIME STREAMING DATA PIPELINE AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 1. See how to use serverless AWS Lambda to transform streaming data from a sample weblog generator 2. Define an Amazon Kinesis Firehose Delivery Stream 3. Set up Amazon Kinesis Streams 4. Monitor the live stream using Amazon CloudWatch 5. Query the data in Amazon Redshift via a customized command line Collaborative Team Exercise #1
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operations Management Collect & Store DEMO 2: MPP DATA TRANSFORMATION & ORCHESTRATION AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM Transform & Analyze
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operations Management Collect & Store WORKSHOP ACTIVITY 2: PREDICTIVE ANALYTICS AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM Transform & Analyze
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 1. Query data in Amazon S3 2. Design a predictive scoring formula using PredicSis.ai 3. Compute predictive models to apply to real-time customer data that can be used to discover outliers who are ready right now to buy a Tesla Collaborative Team Exercise #2
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operations Management Collect & Store DEMO 3: REAL-TIME DATA VISUALIZATIONS AND ALERTS AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM Transform & Analyze
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Collect & Store OPERATIONS MANAGEMENT AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM Transform & Analyze Operations Management
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. iAS: System Architecture and Design
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. GROUP DISCUSSION AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM • What challenges is my customer/organization facing in building a Big Data Architecture? • What capabilities is my organization missing? • What parts would I want to further customize? Collect & Store Transform & Analyze Operations Management
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Key Takeaways There is an increase of importance on new Big Data capabilities including: • Securely combining data residing on-premises with Cloud applications • Abandon historical-only business analysis to include predictive and other modern analytics to immediately act upon key business insights • Break down data silos and harness data in real time from all over the globe • The importance of building a modular Big Data architecture AWS Marketplace enables agility and experimentation • Combining AWS services with solutions in AWS Marketplace are “pieces of the puzzle” that can be replaced whenever newer products and services are released • Easily evaluate innovative software solutions, pay only for what you use Download the step-by-step guide for the Intelligent Analytical System (iAS) • https://aws.amazon.com/mp/mp_solution • https://aws-kimsshmidt.com
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Thank you! @ l u i s d a n s @ d a t a l e a d e r http://aws.amazon.com/mp