SlideShare a Scribd company logo
1 of 35
Download to read offline
Cloudwick© 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Data Ingestion, Transformation, and Visualization
with Amazon Web Services
Sponsored	by	Amazon	Web	Services	and	Intel	(Venue	)
Presented	by	Arun	Kumar	Palathumpattu	(Cloudwick)
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Cloudwick
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Agenda Cloudwick
• Introduction
• Common	Challenges	in	Data	Analytics	Environments
• Creating	Data	Lake	– Single	source	of	truth
• Build	Effective	Data	work	flow
• Demo
• Q&A
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Top 5 Challenges in Building Data Analysis
Environments
Cloudwick
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
# 1 Market Landscape
Cloudwick
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Cloudwick
Image	:	http://mwicorp.com/lake-okeechobee-water-transfer/
Structured
# 2 Issues of Storing All the Data
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
# 3 Time it Takes to Find the Useful Information
Cloudwick
Image	:	https://img.clipartfest.com/0ffb2c38607970437e20a6fdd2872eb9_4-benefits-from-taking-guitar-time-value-of-money-clipart_500-500.png
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
# 4 Security
Cloudwick
Image	:	https://insights.ubuntu.com/2017/03/20/three-flaws-at-the-heart-of-iot-security/
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
# 5 Skill Gap
Cloudwick
https://www.linkedin.com/pulse/wonder-three-years-analytics-big-data-skills-most-demand-hosseini
Evolution	of	Data	Architectures
1985:	Data	Warehouse	Appliances Benefits
• Consolidated	multiple	decision	support	
environments	(i.e.	databases)	into	a	single	
architecture
• Best	performance	available	at	time	of	conception,	
hence	the	expensive	licenses
• Worked	well	with	structured,	columnar	data
• Could	build	customized	data	marts	on	top
Shared	Storage	Tier
(NAS	Appliance)
Compute	
Node
Compute	
Node
Compute	
Node
Compute	
Node
• Proprietary	software	license	paid	per	node	per	
year
• Gold-plated	hardware	available	only	from	the	
vendor	with	per	node	per	year	cost
Constraints
• Proprietary	software	license	paid	per	node	per	year
• Gold-plated	hardware	available	only	from	the	
vendor	with	per	node	per	year	cost
• Could	not	handle	unstructured	data	sets
• Heavy	ETL	&	data	cleansing
Evolution	of	Data	Architectures
2006:	Hadoop	Clusters
CPU
Memory
HDFS	Storage
Hadoop	Master	Node
CPU
Memory
HDFS	Storage
CPU
Memory
HDFS	Storage
Improvements
• Open	source	based	software	license!!!
• Commodity	white	box	servers!!!!
• Could	handle	structured	&	unstructured	data	sets
• Many	different	applications	within	the	framework	
(MapReduce,	Spark,	Hive,	Pig,	HBase,	Presto,	etc.)
Constraints
• HDFS	3X	replication	to	protect	against	node	failure	
gets	expensive	at	scale
• 500	TB	data	set	=	1.5	PB	cluster
• Local	storage	means	you	must	scale	and	pay	for	
CPU	&	memory	resources	when	adding	data	
capacity
• General	purpose,	monolithic cluster	with	many	
different	apps	on	same	hardware
• Still	a	data	silo
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Legacy Data Architectures Exist as Isolated Data Silos
Hadoop
Cluster
SQL
Database
Data
Warehouse
Appliance
Evolution	of	Data	Architectures
2009:	Decoupled	EMR	Architecture
CPU
Memory
Hadoop	Master	Node
CPU
Memory
CPU
Memory
Improvements
• Decoupled	storage	&	compute
• Scale	CPU	and	memory	resources	independently	
and	up	&	down
• Only	pay	for	the	500	TB	data	set	(not	3X)
• Multi-physical	facility	replication	via	S3
• Multiple	clusters	can	run	in	parallel	against	shared	
data	in	S3
• Each	job	gets	its	own	optimized	cluster.	i.e.	Spark	
on	memory	intensive,	Hive	on	CPU	intensive,	HBase
on	I/O	intensive,	etc.
Constraints
• Still	have	a	cluster	to	provision	and	manage
• Must	expose	EMR	cluster	to	SQL	users	via	Hive,	
Presto,	etc.
S3	as	HDFS
Evolution	of	Data	Architectures
2012:	Amazon	Redshift	– Cloud	DW
Improvements
Constraints
• Still	have	to	load	data	into	a	schema	
Leader node
Compute node
10 GigE
(HPC)
Ingestion
Backup
Restore
Customer VPC
Internal VPC
BI tools SQL clientsAnalytics tools
Compute node Compute node
JDBC/ODBC
• Automated	installation,	patching,	backups
• No	servers	to	manage	and	maintain
• MPP	Columnar	relational	database
• $1,000	/	TB	/	Year
• Accessible	to	any	ODBC	or	JDBC	BI	Tool
Evolution	of	Data	Architectures
Today:	Clusterless Improvements
• No	cluster/infrastructure	to	manage
• Business	users	and	analysts	can	write	SQL	without	
having	to	provision	a	cluster	or	touch	infrastructure
• Pay	by	the	query
• Zero	Administration	
• Process	data	where	it	lives
Constraints
• Limited	to	SQL,	Hive	and	Spark	jobs	today.	More	
frameworks	to	come!
SQL	Interface	in	web	
browser
Athena	for	SQL
S3	Data	Lake
Glue	for	ETL
S3	Data	Lake
Spark	&	Hive	Interface	in	
web	browser
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
©2015, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Building a Flexible Data Lake
Architecture on AWS
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Enter Data Lake Architectures
Data Lake is a new and increasingly
popular architecture to store and analyze
massive volumes and heterogeneous
types of data.
Separating your storage and compute
allows you to scale each component as
required
“How can I scale up with the
volume of data being generated?”
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Benefits of a Data Lake – All Data in One Place
Store and analyze all of your data,
from all of your sources, in one
centralized location.
“Why is the data distributed in
many locations? Where is the
single source of truth ?”
Quickly ingest data
without needing to force it into a
pre-defined schema.
“How can I collect data quickly
from various sources and store it
efficiently?”
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Benefits of a Data Lake – Schema on Read
“Is there a way I can apply multiple
analytics and processing frameworks
to the same data?”
A Data Lake enables ad-hoc
analysis by applying schemas
on read, not write.
Benefits	of	an	AWS	S3	Data	Lake
Fixed	Cluster	Data	Lake AWS	S3	Data	Lake
• Limited	to	only	the	single	tool	contained	on	
the	cluster	(i.e.	Hadoop	or	data	warehouse	
or	Cassandra,	etc.).	Use	cases	&	ecosystem	
tools	change	rapidly
• Expensive	to	add	nodes	to	add	storage	
capacity
• Expensive	to	replicate	data	against	node	
loss
• Complexity	in	scaling	local	storage	capacity
• Long	refresh	cycles	to	add	additional	
storage	equipment
• Decouple	storage	and	compute	by	making	
S3	object	based	storage,	not	a	fixed	tool	
cluster	the	data	lake
• Flexibility	to	use	any	and	all	tools	in	the	
ecosystem.	The	right	tool	for	the	job
• Future	proof	your	architecture.	As	new	use	
cases	and	new	tools	emerge	you	can	plug	
and	play	current	best	of	breed.
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Designed for 11 9s
of durability
Designed for
99.99% availability
Durable Available High performance
Multiple upload
Range GET
Store as much as you need
Scale storage and compute
independently
No minimum usage commitments
Scalable
Amazon EMR
Amazon Redshift
Amazon DynamoDB
Integrated
Simple REST API
AWS SDKs
Read-after-create consistency
Event notification
Lifecycle policies
Easy to use
S3 for data lake
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Building a Data Lake on AWS
Kinesis Firehose
Athena
Query Service
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Processing & Analytics
Real-time Batch
AI & Predictive
BI & Data Visualization
Transactional &
RDBMS
AWS Lambda
Apache Storm
on EMR
Apache Flink
on EMR
Spark Streaming
on EMR
Elasticsearch
Service
Kinesis Analytics,
Kinesis Streams
DynamoDB
NoSQL DB Relational Database
Aurora
EMR
Hadoop, Spark,
Presto
Redshift
Data Warehouse
Athena
Query Service
Amazon Lex
Speech
recognition
Amazon
Rekognition
Amazon Polly
Text to speech
Machine Learning
Predictive analytics
Kinesis Streams
& Firehose
Summary	of	AWS	Analytics,	Database	&	AI	Tools
Amazon	Redshift
Enterprise	Data	Warehouse
Amazon	EMR
Hadoop/Spark
Amazon	Athena
Clusterless SQL
Amazon	Glue
Clusterless ETL
Amazon	Aurora
Managed	Relational	Database
Amazon	Machine	Learning
Predictive	Analytics
Amazon	Quicksight
Business	Intelligence/Visualization
Amazon	ElasticSearch Service
ElasticSearch
Amazon	ElastiCache
Redis In-memory	Datastore
Amazon	DynamoDB
Managed	NoSQL	Database
Amazon	Rekognition
Deep	Learning-based	Image	Recognition
Amazon	Lex
Voice	or	Text	Chatbots
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Encryption ComplianceSecurity
Identity and Access
Management (IAM) policies
Bucket policies
Access Control Lists (ACLs)
Private VPC endpoints to
Amazon S3
SSL endpoints
Server Side Encryption
(SSE-S3)
S3 Server Side
Encryption with
provided keys (SSE-C,
SSE-KMS)
Client-side Encryption
Buckets access logs
Lifecycle Management
Policies
Access Control Lists
(ACLs)
Versioning & MFA
deletes
Certifications – HIPAA,
PCI, SOC 1/2/3 etc.
Implement the right cloud security controls
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
More Efficient Data Lake Architectures
Cloudwick
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
What is a Modern Enterprise Data
Warehouse?
A Modern Enterprise Data warehouse
(EDW), is designed to support rapid data
growth, quick analytics over relational,
non-relational as well as streaming data,
with an easy and single interface to
consume all these types of data.
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Data
Consumption
Building a Modern EDW on AWS
Amazon
Kinesis
Firehose
Amazon S3
Amazon S3
Amazon
EMR
Amaz
on
Redsh
ift
Streamin
g
Un-
structured
Relational
Amaz
on
RDS
Amaz
on
Athen
a
Amazon
Quicksig
ht
Amazo
n
Machin
e
Learnin
g
Analyze
Predict
Data
Mart
EDW
Ad-hoc
query
Data
Ingestion
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
What is Real-time analytics platform?
Real-time Analytics platform provides an
ability to load and analyze streaming
data, and build custom streaming data
applications for specialized needs. This
platform can handle staggering amounts
of streaming data – sometimes TBs per
hour – that need to be collected, stored,
and processed continuously.
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Building a Real-time analytics platform (Streaming
vs batching)
Stream Delivery
Stream Analytics
Stream processing
Event Driven
Kinesis
Streami
ng
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Simplify Big Data Processing
ingest /
collect
store process /
analyze
consume /
visualize
data answers
Time to Answer (Latency)
Throughput
Cost
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Amazon QuickSight is a Business Analytics Service that lets business users
quickly and easily visualize, explore, and share insights from their data.
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Amazon S3 Data Lake
Visualizing the Data Lake
Amazon Athena
Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Cloudwick
DEMO
Cloudwick© 2015 Cloudwick. All rights reserved. Confidential and Proprietary.
Credits
Event	Sponsored	by	Amazon	Web	Services	
Venue	by	Intel
Presented	by	Arun	Kumar	Palathumpattu	(Cloudwick)
Slides	credit	:	AWS	+	Cloudwick

More Related Content

What's hot

Windows Workloads on AWS - AWS Innovate Toronto
Windows Workloads on AWS - AWS Innovate TorontoWindows Workloads on AWS - AWS Innovate Toronto
Windows Workloads on AWS - AWS Innovate TorontoAmazon Web Services
 
Scaling Ideas: Accelerating Research with AWS - Technical 301
Scaling Ideas: Accelerating Research with AWS - Technical 301Scaling Ideas: Accelerating Research with AWS - Technical 301
Scaling Ideas: Accelerating Research with AWS - Technical 301Amazon Web Services
 
Hybrid Cloud Storage: Why HUSCO International Left Traditional Storage Behind
 Hybrid Cloud Storage: Why HUSCO International Left Traditional Storage Behind Hybrid Cloud Storage: Why HUSCO International Left Traditional Storage Behind
Hybrid Cloud Storage: Why HUSCO International Left Traditional Storage BehindAmazon Web Services
 
Get Started Today with Cloud-Ready Contracts | AWS Public Sector Summit 2017
Get Started Today with Cloud-Ready Contracts | AWS Public Sector Summit 2017Get Started Today with Cloud-Ready Contracts | AWS Public Sector Summit 2017
Get Started Today with Cloud-Ready Contracts | AWS Public Sector Summit 2017Amazon Web Services
 
2016 summits - future of enterprise it
2016 summits - future of enterprise it2016 summits - future of enterprise it
2016 summits - future of enterprise itAmazon Web Services
 
Modernizing upstream workflows with aws storage - john mallory
Modernizing upstream workflows with aws storage -  john malloryModernizing upstream workflows with aws storage -  john mallory
Modernizing upstream workflows with aws storage - john malloryAmazon Web Services
 
Best Practices in Planning a Large-Scale Migration to AWS - May 2017 AWS Onli...
Best Practices in Planning a Large-Scale Migration to AWS - May 2017 AWS Onli...Best Practices in Planning a Large-Scale Migration to AWS - May 2017 AWS Onli...
Best Practices in Planning a Large-Scale Migration to AWS - May 2017 AWS Onli...Amazon Web Services
 
Immersion Day - Como a AWS apoia a estratégia analítica de sua empresa
Immersion Day - Como a AWS apoia a estratégia analítica de sua empresaImmersion Day - Como a AWS apoia a estratégia analítica de sua empresa
Immersion Day - Como a AWS apoia a estratégia analítica de sua empresaAmazon Web Services LATAM
 
Following Well Architected Frameworks - Lunch and Learn.pdf
Following Well Architected Frameworks - Lunch and Learn.pdfFollowing Well Architected Frameworks - Lunch and Learn.pdf
Following Well Architected Frameworks - Lunch and Learn.pdfAmazon Web Services
 
Introduction to Hybrid Cloud on AWS - AWS Online Tech Talks
Introduction to Hybrid Cloud on AWS - AWS Online Tech TalksIntroduction to Hybrid Cloud on AWS - AWS Online Tech Talks
Introduction to Hybrid Cloud on AWS - AWS Online Tech TalksAmazon Web Services
 
Analytics on the Cloud with Tableau on AWS
Analytics on the Cloud with Tableau on AWSAnalytics on the Cloud with Tableau on AWS
Analytics on the Cloud with Tableau on AWSAmazon Web Services
 
Bridge OLTP and Stream Processing with Amazon Kinesis, AWS Lambda, & MongoDB ...
Bridge OLTP and Stream Processing with Amazon Kinesis, AWS Lambda, & MongoDB ...Bridge OLTP and Stream Processing with Amazon Kinesis, AWS Lambda, & MongoDB ...
Bridge OLTP and Stream Processing with Amazon Kinesis, AWS Lambda, & MongoDB ...Amazon Web Services
 
Back Up and Manage On-Premises and Cloud-Native Workloads with Rubrik on AWS ...
Back Up and Manage On-Premises and Cloud-Native Workloads with Rubrik on AWS ...Back Up and Manage On-Premises and Cloud-Native Workloads with Rubrik on AWS ...
Back Up and Manage On-Premises and Cloud-Native Workloads with Rubrik on AWS ...Amazon Web Services
 
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...Amazon Web Services
 
Aberdeen Oil & Gas Event - AWS Partner Eurotech
Aberdeen Oil & Gas Event - AWS Partner EurotechAberdeen Oil & Gas Event - AWS Partner Eurotech
Aberdeen Oil & Gas Event - AWS Partner EurotechAmazon Web Services
 
AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...
AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...
AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...Amazon Web Services
 
What Organizational and Governance Changes Do I Need to Make Prior to Migrati...
What Organizational and Governance Changes Do I Need to Make Prior to Migrati...What Organizational and Governance Changes Do I Need to Make Prior to Migrati...
What Organizational and Governance Changes Do I Need to Make Prior to Migrati...Amazon Web Services
 
Adoption of VMware Cloud on AWS is Accelerating in the Enterprise
Adoption of VMware Cloud on AWS is Accelerating in the Enterprise Adoption of VMware Cloud on AWS is Accelerating in the Enterprise
Adoption of VMware Cloud on AWS is Accelerating in the Enterprise Amazon Web Services
 
How to Accelerate the Adoption of AWS and Reduce Cost and Risk with a Data F...
 How to Accelerate the Adoption of AWS and Reduce Cost and Risk with a Data F... How to Accelerate the Adoption of AWS and Reduce Cost and Risk with a Data F...
How to Accelerate the Adoption of AWS and Reduce Cost and Risk with a Data F...Amazon Web Services
 

What's hot (20)

Windows Workloads on AWS - AWS Innovate Toronto
Windows Workloads on AWS - AWS Innovate TorontoWindows Workloads on AWS - AWS Innovate Toronto
Windows Workloads on AWS - AWS Innovate Toronto
 
Scaling Ideas: Accelerating Research with AWS - Technical 301
Scaling Ideas: Accelerating Research with AWS - Technical 301Scaling Ideas: Accelerating Research with AWS - Technical 301
Scaling Ideas: Accelerating Research with AWS - Technical 301
 
Hybrid Cloud Storage: Why HUSCO International Left Traditional Storage Behind
 Hybrid Cloud Storage: Why HUSCO International Left Traditional Storage Behind Hybrid Cloud Storage: Why HUSCO International Left Traditional Storage Behind
Hybrid Cloud Storage: Why HUSCO International Left Traditional Storage Behind
 
Get Started Today with Cloud-Ready Contracts | AWS Public Sector Summit 2017
Get Started Today with Cloud-Ready Contracts | AWS Public Sector Summit 2017Get Started Today with Cloud-Ready Contracts | AWS Public Sector Summit 2017
Get Started Today with Cloud-Ready Contracts | AWS Public Sector Summit 2017
 
2016 summits - future of enterprise it
2016 summits - future of enterprise it2016 summits - future of enterprise it
2016 summits - future of enterprise it
 
Modernizing upstream workflows with aws storage - john mallory
Modernizing upstream workflows with aws storage -  john malloryModernizing upstream workflows with aws storage -  john mallory
Modernizing upstream workflows with aws storage - john mallory
 
AWS glue technical enablement training
AWS glue technical enablement trainingAWS glue technical enablement training
AWS glue technical enablement training
 
Best Practices in Planning a Large-Scale Migration to AWS - May 2017 AWS Onli...
Best Practices in Planning a Large-Scale Migration to AWS - May 2017 AWS Onli...Best Practices in Planning a Large-Scale Migration to AWS - May 2017 AWS Onli...
Best Practices in Planning a Large-Scale Migration to AWS - May 2017 AWS Onli...
 
Immersion Day - Como a AWS apoia a estratégia analítica de sua empresa
Immersion Day - Como a AWS apoia a estratégia analítica de sua empresaImmersion Day - Como a AWS apoia a estratégia analítica de sua empresa
Immersion Day - Como a AWS apoia a estratégia analítica de sua empresa
 
Following Well Architected Frameworks - Lunch and Learn.pdf
Following Well Architected Frameworks - Lunch and Learn.pdfFollowing Well Architected Frameworks - Lunch and Learn.pdf
Following Well Architected Frameworks - Lunch and Learn.pdf
 
Introduction to Hybrid Cloud on AWS - AWS Online Tech Talks
Introduction to Hybrid Cloud on AWS - AWS Online Tech TalksIntroduction to Hybrid Cloud on AWS - AWS Online Tech Talks
Introduction to Hybrid Cloud on AWS - AWS Online Tech Talks
 
Analytics on the Cloud with Tableau on AWS
Analytics on the Cloud with Tableau on AWSAnalytics on the Cloud with Tableau on AWS
Analytics on the Cloud with Tableau on AWS
 
Bridge OLTP and Stream Processing with Amazon Kinesis, AWS Lambda, & MongoDB ...
Bridge OLTP and Stream Processing with Amazon Kinesis, AWS Lambda, & MongoDB ...Bridge OLTP and Stream Processing with Amazon Kinesis, AWS Lambda, & MongoDB ...
Bridge OLTP and Stream Processing with Amazon Kinesis, AWS Lambda, & MongoDB ...
 
Back Up and Manage On-Premises and Cloud-Native Workloads with Rubrik on AWS ...
Back Up and Manage On-Premises and Cloud-Native Workloads with Rubrik on AWS ...Back Up and Manage On-Premises and Cloud-Native Workloads with Rubrik on AWS ...
Back Up and Manage On-Premises and Cloud-Native Workloads with Rubrik on AWS ...
 
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 
Aberdeen Oil & Gas Event - AWS Partner Eurotech
Aberdeen Oil & Gas Event - AWS Partner EurotechAberdeen Oil & Gas Event - AWS Partner Eurotech
Aberdeen Oil & Gas Event - AWS Partner Eurotech
 
AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...
AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...
AWS re:Invent 2016: How to move 1,000 VMs and Biz Critical Apps to AWS in 6 m...
 
What Organizational and Governance Changes Do I Need to Make Prior to Migrati...
What Organizational and Governance Changes Do I Need to Make Prior to Migrati...What Organizational and Governance Changes Do I Need to Make Prior to Migrati...
What Organizational and Governance Changes Do I Need to Make Prior to Migrati...
 
Adoption of VMware Cloud on AWS is Accelerating in the Enterprise
Adoption of VMware Cloud on AWS is Accelerating in the Enterprise Adoption of VMware Cloud on AWS is Accelerating in the Enterprise
Adoption of VMware Cloud on AWS is Accelerating in the Enterprise
 
How to Accelerate the Adoption of AWS and Reduce Cost and Risk with a Data F...
 How to Accelerate the Adoption of AWS and Reduce Cost and Risk with a Data F... How to Accelerate the Adoption of AWS and Reduce Cost and Risk with a Data F...
How to Accelerate the Adoption of AWS and Reduce Cost and Risk with a Data F...
 

Similar to Ingest, Transform & Visualize w Amazon Web Services

Dynamic Infrastructure and The Cloud
Dynamic Infrastructure and The CloudDynamic Infrastructure and The Cloud
Dynamic Infrastructure and The CloudNew Relic
 
2017 04-05 aws summit - sydney
2017 04-05 aws summit - sydney2017 04-05 aws summit - sydney
2017 04-05 aws summit - sydneyLee Atchison
 
Microservices and serverless for MegaStartups - DLD TLV 2017
Microservices and serverless for MegaStartups - DLD TLV 2017Microservices and serverless for MegaStartups - DLD TLV 2017
Microservices and serverless for MegaStartups - DLD TLV 2017Boaz Ziniman
 
Microservices? Dynamic Infrastructure? - Adventures in Keeping Your Applicati...
Microservices? Dynamic Infrastructure? - Adventures in Keeping Your Applicati...Microservices? Dynamic Infrastructure? - Adventures in Keeping Your Applicati...
Microservices? Dynamic Infrastructure? - Adventures in Keeping Your Applicati...Amazon Web Services
 
Database Freedom - ADB304 - Santa Clara AWS Summit
Database Freedom - ADB304 - Santa Clara AWS SummitDatabase Freedom - ADB304 - Santa Clara AWS Summit
Database Freedom - ADB304 - Santa Clara AWS SummitAmazon Web Services
 
Introduction to AWS Lake Formation.pptx
Introduction to AWS Lake Formation.pptxIntroduction to AWS Lake Formation.pptx
Introduction to AWS Lake Formation.pptxSwathiPonugumati
 
vBrownBag AWS Certified SysOps : Associate Domain 4
vBrownBag AWS Certified SysOps : Associate Domain 4vBrownBag AWS Certified SysOps : Associate Domain 4
vBrownBag AWS Certified SysOps : Associate Domain 4Eric Santelices
 
Introduction to Cloud Computing with Amazon Web Services and Customer Case Study
Introduction to Cloud Computing with Amazon Web Services and Customer Case StudyIntroduction to Cloud Computing with Amazon Web Services and Customer Case Study
Introduction to Cloud Computing with Amazon Web Services and Customer Case StudyAmazon Web Services
 
AWS business essentials - Toronto
AWS   business essentials - TorontoAWS   business essentials - Toronto
AWS business essentials - TorontoAmazon Web Services
 
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)Amazon Web Services
 
Best of reI:nvent Tel Aviv 2015 - Keynote
Best of reI:nvent Tel Aviv 2015 - KeynoteBest of reI:nvent Tel Aviv 2015 - Keynote
Best of reI:nvent Tel Aviv 2015 - KeynoteAmazon Web Services
 
Cloudreach Voices - The Cloud: What Does it Mean for your Current Applications
Cloudreach Voices - The Cloud: What Does it Mean for your Current ApplicationsCloudreach Voices - The Cloud: What Does it Mean for your Current Applications
Cloudreach Voices - The Cloud: What Does it Mean for your Current ApplicationsCloudreach
 
Building Your Data Lake on AWS - Level 200
Building Your Data Lake on AWS - Level 200Building Your Data Lake on AWS - Level 200
Building Your Data Lake on AWS - Level 200Amazon Web Services
 
AWS Webcast - Splunk and Autodesk
AWS Webcast - Splunk and AutodeskAWS Webcast - Splunk and Autodesk
AWS Webcast - Splunk and AutodeskAmazon Web Services
 
Expanding Your Data Center with Hybrid Cloud Infrastructure
Expanding Your Data Center with Hybrid Cloud InfrastructureExpanding Your Data Center with Hybrid Cloud Infrastructure
Expanding Your Data Center with Hybrid Cloud InfrastructureAmazon Web Services
 
PaaS or Fail: Rule the Cloud with Altus
PaaS or Fail: Rule the Cloud with AltusPaaS or Fail: Rule the Cloud with Altus
PaaS or Fail: Rule the Cloud with AltusCloudera, Inc.
 
ABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data ApplicationsABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data ApplicationsAmazon Web Services
 

Similar to Ingest, Transform & Visualize w Amazon Web Services (20)

Dynamic Infrastructure and The Cloud
Dynamic Infrastructure and The CloudDynamic Infrastructure and The Cloud
Dynamic Infrastructure and The Cloud
 
2017 04-05 aws summit - sydney
2017 04-05 aws summit - sydney2017 04-05 aws summit - sydney
2017 04-05 aws summit - sydney
 
Microservices and serverless for MegaStartups - DLD TLV 2017
Microservices and serverless for MegaStartups - DLD TLV 2017Microservices and serverless for MegaStartups - DLD TLV 2017
Microservices and serverless for MegaStartups - DLD TLV 2017
 
Microservices? Dynamic Infrastructure? - Adventures in Keeping Your Applicati...
Microservices? Dynamic Infrastructure? - Adventures in Keeping Your Applicati...Microservices? Dynamic Infrastructure? - Adventures in Keeping Your Applicati...
Microservices? Dynamic Infrastructure? - Adventures in Keeping Your Applicati...
 
Database Freedom - ADB304 - Santa Clara AWS Summit
Database Freedom - ADB304 - Santa Clara AWS SummitDatabase Freedom - ADB304 - Santa Clara AWS Summit
Database Freedom - ADB304 - Santa Clara AWS Summit
 
Big Data Building Blocks with AWS Cloud
Big Data Building Blocks with AWS CloudBig Data Building Blocks with AWS Cloud
Big Data Building Blocks with AWS Cloud
 
Introduction to AWS Lake Formation.pptx
Introduction to AWS Lake Formation.pptxIntroduction to AWS Lake Formation.pptx
Introduction to AWS Lake Formation.pptx
 
vBrownBag AWS Certified SysOps : Associate Domain 4
vBrownBag AWS Certified SysOps : Associate Domain 4vBrownBag AWS Certified SysOps : Associate Domain 4
vBrownBag AWS Certified SysOps : Associate Domain 4
 
Migrating your IT - Final
Migrating your IT - FinalMigrating your IT - Final
Migrating your IT - Final
 
Introduction to Cloud Computing with Amazon Web Services and Customer Case Study
Introduction to Cloud Computing with Amazon Web Services and Customer Case StudyIntroduction to Cloud Computing with Amazon Web Services and Customer Case Study
Introduction to Cloud Computing with Amazon Web Services and Customer Case Study
 
AWS business essentials - Toronto
AWS   business essentials - TorontoAWS   business essentials - Toronto
AWS business essentials - Toronto
 
AWS business essentials
AWS business essentials AWS business essentials
AWS business essentials
 
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
 
Best of reI:nvent Tel Aviv 2015 - Keynote
Best of reI:nvent Tel Aviv 2015 - KeynoteBest of reI:nvent Tel Aviv 2015 - Keynote
Best of reI:nvent Tel Aviv 2015 - Keynote
 
Cloudreach Voices - The Cloud: What Does it Mean for your Current Applications
Cloudreach Voices - The Cloud: What Does it Mean for your Current ApplicationsCloudreach Voices - The Cloud: What Does it Mean for your Current Applications
Cloudreach Voices - The Cloud: What Does it Mean for your Current Applications
 
Building Your Data Lake on AWS - Level 200
Building Your Data Lake on AWS - Level 200Building Your Data Lake on AWS - Level 200
Building Your Data Lake on AWS - Level 200
 
AWS Webcast - Splunk and Autodesk
AWS Webcast - Splunk and AutodeskAWS Webcast - Splunk and Autodesk
AWS Webcast - Splunk and Autodesk
 
Expanding Your Data Center with Hybrid Cloud Infrastructure
Expanding Your Data Center with Hybrid Cloud InfrastructureExpanding Your Data Center with Hybrid Cloud Infrastructure
Expanding Your Data Center with Hybrid Cloud Infrastructure
 
PaaS or Fail: Rule the Cloud with Altus
PaaS or Fail: Rule the Cloud with AltusPaaS or Fail: Rule the Cloud with Altus
PaaS or Fail: Rule the Cloud with Altus
 
ABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data ApplicationsABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data Applications
 

More from BigDataCamp

BigDataCamp LA 2014 Schedule
BigDataCamp LA 2014 ScheduleBigDataCamp LA 2014 Schedule
BigDataCamp LA 2014 ScheduleBigDataCamp
 
5 kinesis lightning
5 kinesis lightning5 kinesis lightning
5 kinesis lightningBigDataCamp
 
4 hadoop for-the-disillusioned
4 hadoop for-the-disillusioned4 hadoop for-the-disillusioned
4 hadoop for-the-disillusionedBigDataCamp
 
3 analytic strategies shree dandekar dell 12-10-13
3 analytic strategies shree dandekar dell 12-10-133 analytic strategies shree dandekar dell 12-10-13
3 analytic strategies shree dandekar dell 12-10-13BigDataCamp
 
2 one spot redshift bigdatacamp 1.02
2 one spot redshift bigdatacamp 1.022 one spot redshift bigdatacamp 1.02
2 one spot redshift bigdatacamp 1.02BigDataCamp
 
1 big datacampdell2013
1 big datacampdell20131 big datacampdell2013
1 big datacampdell2013BigDataCamp
 
Stefan Groschupf of Datameer Gives Lightning Talk at BigDataCamp
Stefan Groschupf of Datameer Gives Lightning Talk at BigDataCampStefan Groschupf of Datameer Gives Lightning Talk at BigDataCamp
Stefan Groschupf of Datameer Gives Lightning Talk at BigDataCampBigDataCamp
 
Richard Cole of Amazon Gives Lightning Tallk at BigDataCamp
Richard Cole of Amazon Gives Lightning Tallk at BigDataCampRichard Cole of Amazon Gives Lightning Tallk at BigDataCamp
Richard Cole of Amazon Gives Lightning Tallk at BigDataCampBigDataCamp
 
Stefan Groschupf of Datameer Gives Lightning Tallk at BigDataCamp
Stefan Groschupf of Datameer Gives Lightning Tallk at BigDataCampStefan Groschupf of Datameer Gives Lightning Tallk at BigDataCamp
Stefan Groschupf of Datameer Gives Lightning Tallk at BigDataCampBigDataCamp
 
Sam Charrington Of Appistry Gives Lighting Talk
Sam Charrington Of Appistry Gives Lighting TalkSam Charrington Of Appistry Gives Lighting Talk
Sam Charrington Of Appistry Gives Lighting TalkBigDataCamp
 
Steve Woolege Of Aster Data Gives Lightning Talk At BigDataCamp
Steve Woolege Of Aster Data Gives Lightning Talk At BigDataCampSteve Woolege Of Aster Data Gives Lightning Talk At BigDataCamp
Steve Woolege Of Aster Data Gives Lightning Talk At BigDataCampBigDataCamp
 

More from BigDataCamp (11)

BigDataCamp LA 2014 Schedule
BigDataCamp LA 2014 ScheduleBigDataCamp LA 2014 Schedule
BigDataCamp LA 2014 Schedule
 
5 kinesis lightning
5 kinesis lightning5 kinesis lightning
5 kinesis lightning
 
4 hadoop for-the-disillusioned
4 hadoop for-the-disillusioned4 hadoop for-the-disillusioned
4 hadoop for-the-disillusioned
 
3 analytic strategies shree dandekar dell 12-10-13
3 analytic strategies shree dandekar dell 12-10-133 analytic strategies shree dandekar dell 12-10-13
3 analytic strategies shree dandekar dell 12-10-13
 
2 one spot redshift bigdatacamp 1.02
2 one spot redshift bigdatacamp 1.022 one spot redshift bigdatacamp 1.02
2 one spot redshift bigdatacamp 1.02
 
1 big datacampdell2013
1 big datacampdell20131 big datacampdell2013
1 big datacampdell2013
 
Stefan Groschupf of Datameer Gives Lightning Talk at BigDataCamp
Stefan Groschupf of Datameer Gives Lightning Talk at BigDataCampStefan Groschupf of Datameer Gives Lightning Talk at BigDataCamp
Stefan Groschupf of Datameer Gives Lightning Talk at BigDataCamp
 
Richard Cole of Amazon Gives Lightning Tallk at BigDataCamp
Richard Cole of Amazon Gives Lightning Tallk at BigDataCampRichard Cole of Amazon Gives Lightning Tallk at BigDataCamp
Richard Cole of Amazon Gives Lightning Tallk at BigDataCamp
 
Stefan Groschupf of Datameer Gives Lightning Tallk at BigDataCamp
Stefan Groschupf of Datameer Gives Lightning Tallk at BigDataCampStefan Groschupf of Datameer Gives Lightning Tallk at BigDataCamp
Stefan Groschupf of Datameer Gives Lightning Tallk at BigDataCamp
 
Sam Charrington Of Appistry Gives Lighting Talk
Sam Charrington Of Appistry Gives Lighting TalkSam Charrington Of Appistry Gives Lighting Talk
Sam Charrington Of Appistry Gives Lighting Talk
 
Steve Woolege Of Aster Data Gives Lightning Talk At BigDataCamp
Steve Woolege Of Aster Data Gives Lightning Talk At BigDataCampSteve Woolege Of Aster Data Gives Lightning Talk At BigDataCamp
Steve Woolege Of Aster Data Gives Lightning Talk At BigDataCamp
 

Recently uploaded

10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024Mind IT Systems
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...Health
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionOnePlan Solutions
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesVictorSzoltysek
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...kalichargn70th171
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfVishalKumarJha10
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...software pro Development
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 

Recently uploaded (20)

10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 

Ingest, Transform & Visualize w Amazon Web Services

  • 1. Cloudwick© 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Data Ingestion, Transformation, and Visualization with Amazon Web Services Sponsored by Amazon Web Services and Intel (Venue ) Presented by Arun Kumar Palathumpattu (Cloudwick)
  • 2. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Cloudwick
  • 3. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Agenda Cloudwick • Introduction • Common Challenges in Data Analytics Environments • Creating Data Lake – Single source of truth • Build Effective Data work flow • Demo • Q&A
  • 4. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Top 5 Challenges in Building Data Analysis Environments Cloudwick
  • 5. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. # 1 Market Landscape Cloudwick
  • 6. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Cloudwick Image : http://mwicorp.com/lake-okeechobee-water-transfer/ Structured # 2 Issues of Storing All the Data
  • 7. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. # 3 Time it Takes to Find the Useful Information Cloudwick Image : https://img.clipartfest.com/0ffb2c38607970437e20a6fdd2872eb9_4-benefits-from-taking-guitar-time-value-of-money-clipart_500-500.png
  • 8. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. # 4 Security Cloudwick Image : https://insights.ubuntu.com/2017/03/20/three-flaws-at-the-heart-of-iot-security/
  • 9. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. # 5 Skill Gap Cloudwick https://www.linkedin.com/pulse/wonder-three-years-analytics-big-data-skills-most-demand-hosseini
  • 10. Evolution of Data Architectures 1985: Data Warehouse Appliances Benefits • Consolidated multiple decision support environments (i.e. databases) into a single architecture • Best performance available at time of conception, hence the expensive licenses • Worked well with structured, columnar data • Could build customized data marts on top Shared Storage Tier (NAS Appliance) Compute Node Compute Node Compute Node Compute Node • Proprietary software license paid per node per year • Gold-plated hardware available only from the vendor with per node per year cost Constraints • Proprietary software license paid per node per year • Gold-plated hardware available only from the vendor with per node per year cost • Could not handle unstructured data sets • Heavy ETL & data cleansing
  • 11. Evolution of Data Architectures 2006: Hadoop Clusters CPU Memory HDFS Storage Hadoop Master Node CPU Memory HDFS Storage CPU Memory HDFS Storage Improvements • Open source based software license!!! • Commodity white box servers!!!! • Could handle structured & unstructured data sets • Many different applications within the framework (MapReduce, Spark, Hive, Pig, HBase, Presto, etc.) Constraints • HDFS 3X replication to protect against node failure gets expensive at scale • 500 TB data set = 1.5 PB cluster • Local storage means you must scale and pay for CPU & memory resources when adding data capacity • General purpose, monolithic cluster with many different apps on same hardware • Still a data silo
  • 12. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Legacy Data Architectures Exist as Isolated Data Silos Hadoop Cluster SQL Database Data Warehouse Appliance
  • 13. Evolution of Data Architectures 2009: Decoupled EMR Architecture CPU Memory Hadoop Master Node CPU Memory CPU Memory Improvements • Decoupled storage & compute • Scale CPU and memory resources independently and up & down • Only pay for the 500 TB data set (not 3X) • Multi-physical facility replication via S3 • Multiple clusters can run in parallel against shared data in S3 • Each job gets its own optimized cluster. i.e. Spark on memory intensive, Hive on CPU intensive, HBase on I/O intensive, etc. Constraints • Still have a cluster to provision and manage • Must expose EMR cluster to SQL users via Hive, Presto, etc. S3 as HDFS
  • 14. Evolution of Data Architectures 2012: Amazon Redshift – Cloud DW Improvements Constraints • Still have to load data into a schema Leader node Compute node 10 GigE (HPC) Ingestion Backup Restore Customer VPC Internal VPC BI tools SQL clientsAnalytics tools Compute node Compute node JDBC/ODBC • Automated installation, patching, backups • No servers to manage and maintain • MPP Columnar relational database • $1,000 / TB / Year • Accessible to any ODBC or JDBC BI Tool
  • 15. Evolution of Data Architectures Today: Clusterless Improvements • No cluster/infrastructure to manage • Business users and analysts can write SQL without having to provision a cluster or touch infrastructure • Pay by the query • Zero Administration • Process data where it lives Constraints • Limited to SQL, Hive and Spark jobs today. More frameworks to come! SQL Interface in web browser Athena for SQL S3 Data Lake Glue for ETL S3 Data Lake Spark & Hive Interface in web browser
  • 16. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. ©2015, Amazon Web Services, Inc. or its affiliates. All rights reserved. Building a Flexible Data Lake Architecture on AWS
  • 17. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Enter Data Lake Architectures Data Lake is a new and increasingly popular architecture to store and analyze massive volumes and heterogeneous types of data. Separating your storage and compute allows you to scale each component as required “How can I scale up with the volume of data being generated?”
  • 18. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Benefits of a Data Lake – All Data in One Place Store and analyze all of your data, from all of your sources, in one centralized location. “Why is the data distributed in many locations? Where is the single source of truth ?” Quickly ingest data without needing to force it into a pre-defined schema. “How can I collect data quickly from various sources and store it efficiently?”
  • 19. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Benefits of a Data Lake – Schema on Read “Is there a way I can apply multiple analytics and processing frameworks to the same data?” A Data Lake enables ad-hoc analysis by applying schemas on read, not write.
  • 20. Benefits of an AWS S3 Data Lake Fixed Cluster Data Lake AWS S3 Data Lake • Limited to only the single tool contained on the cluster (i.e. Hadoop or data warehouse or Cassandra, etc.). Use cases & ecosystem tools change rapidly • Expensive to add nodes to add storage capacity • Expensive to replicate data against node loss • Complexity in scaling local storage capacity • Long refresh cycles to add additional storage equipment • Decouple storage and compute by making S3 object based storage, not a fixed tool cluster the data lake • Flexibility to use any and all tools in the ecosystem. The right tool for the job • Future proof your architecture. As new use cases and new tools emerge you can plug and play current best of breed.
  • 21. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Designed for 11 9s of durability Designed for 99.99% availability Durable Available High performance Multiple upload Range GET Store as much as you need Scale storage and compute independently No minimum usage commitments Scalable Amazon EMR Amazon Redshift Amazon DynamoDB Integrated Simple REST API AWS SDKs Read-after-create consistency Event notification Lifecycle policies Easy to use S3 for data lake
  • 22. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Building a Data Lake on AWS Kinesis Firehose Athena Query Service
  • 23. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Processing & Analytics Real-time Batch AI & Predictive BI & Data Visualization Transactional & RDBMS AWS Lambda Apache Storm on EMR Apache Flink on EMR Spark Streaming on EMR Elasticsearch Service Kinesis Analytics, Kinesis Streams DynamoDB NoSQL DB Relational Database Aurora EMR Hadoop, Spark, Presto Redshift Data Warehouse Athena Query Service Amazon Lex Speech recognition Amazon Rekognition Amazon Polly Text to speech Machine Learning Predictive analytics Kinesis Streams & Firehose
  • 25. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Encryption ComplianceSecurity Identity and Access Management (IAM) policies Bucket policies Access Control Lists (ACLs) Private VPC endpoints to Amazon S3 SSL endpoints Server Side Encryption (SSE-S3) S3 Server Side Encryption with provided keys (SSE-C, SSE-KMS) Client-side Encryption Buckets access logs Lifecycle Management Policies Access Control Lists (ACLs) Versioning & MFA deletes Certifications – HIPAA, PCI, SOC 1/2/3 etc. Implement the right cloud security controls
  • 26. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. More Efficient Data Lake Architectures Cloudwick
  • 27. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. What is a Modern Enterprise Data Warehouse? A Modern Enterprise Data warehouse (EDW), is designed to support rapid data growth, quick analytics over relational, non-relational as well as streaming data, with an easy and single interface to consume all these types of data.
  • 28. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Data Consumption Building a Modern EDW on AWS Amazon Kinesis Firehose Amazon S3 Amazon S3 Amazon EMR Amaz on Redsh ift Streamin g Un- structured Relational Amaz on RDS Amaz on Athen a Amazon Quicksig ht Amazo n Machin e Learnin g Analyze Predict Data Mart EDW Ad-hoc query Data Ingestion
  • 29. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. What is Real-time analytics platform? Real-time Analytics platform provides an ability to load and analyze streaming data, and build custom streaming data applications for specialized needs. This platform can handle staggering amounts of streaming data – sometimes TBs per hour – that need to be collected, stored, and processed continuously.
  • 30. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Building a Real-time analytics platform (Streaming vs batching) Stream Delivery Stream Analytics Stream processing Event Driven Kinesis Streami ng
  • 31. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Simplify Big Data Processing ingest / collect store process / analyze consume / visualize data answers Time to Answer (Latency) Throughput Cost
  • 32. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Amazon QuickSight is a Business Analytics Service that lets business users quickly and easily visualize, explore, and share insights from their data.
  • 33. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Amazon S3 Data Lake Visualizing the Data Lake Amazon Athena
  • 34. Cloudwick © 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Cloudwick DEMO
  • 35. Cloudwick© 2015 Cloudwick. All rights reserved. Confidential and Proprietary. Credits Event Sponsored by Amazon Web Services Venue by Intel Presented by Arun Kumar Palathumpattu (Cloudwick) Slides credit : AWS + Cloudwick