SlideShare ist ein Scribd-Unternehmen logo
1 von 62
HBX: Harvard Business School’s Digital
Education Goes Data-Centric with
Amazon Redshift + Informatica
Today’s Presenters
David Potes, Manager, Solutions Architecture, Amazon Web Services
Andrew McIntyre, Solutions Architect, Informatica
Ryan Frazier, Director, Systems Engineering & Operations, HBX
Today’s Agenda
• An overview of AWS and AWS Marketplace, with an emphasis on
AWS Big Data solutions
• Informatica solution overview
• The HBX success story with AWS and Informatica
• Q&A/Discussion
Learning Objectives
1. Becoming a data-centric organization for best business results
2. The benefits of agile analytics with cloud data warehouse
3. Rapidly integrating high volume, disparate data into a trusted source
Amazon Redshift
shift
Fast, simple, petabyte-scale data warehousing for $1,000/TB/Year
140+ features
AWS Big Data Portfolio
Collect Store Analyze
Amazon Kinesis
Firehose
AWS Direct
Connect
Amazon
Snowball
Amazon Kinesis
Analytics
Amazon Kinesis
Streams
Amazon S3 Amazon Glacier
Amazon
CloudSearch
Amazon RDS,
Amazon Aurora
Amazon
Dynamo DB
Amazon
Elasticsearch
Amazon EMR Amazon EC2
Amazon
Redshift
Amazon Machine
Learning
Amazon
QuickSight
AWS Data
Pipeline
AWS Database Migration Service AWS Glue
Amazon
Athena
Legacy architectural models lead to dark data
0
200
400
600
800
1000
1200
Enterprise Data Data in Warehouse
Very Expensive
Lock-In
Proprietary
Inflexible licensing
Traditional Data Warehousing
Business
Reporting
Complex pipelines
and queries
Secure and
Compliant
Easy Migration – Point & Click using AWS Database Migration Service
Secure & Compliant – End-to-End Encryption. SOC 1/2/3, PCI-DSS, HIPAA and FedRAMP compliant
Large Ecosystem – Variety of cloud and on-premises BI and ETL tools
Japanese Mobile
Phone Provider
Powering 100 marketplaces
in 50 countries
World’s Largest Children’s
Book Publisher
Bulk Loads
and Updates
Log Analysis
Log & Machine
IOT Data
Clickstream
Events Data
Time-Series
Data
Cheap – Analyze large volumes of data cost-effectively
Fast – Massively Parallel Processing (MPP) and columnar architecture for fast queries and parallel loads
Near real-time – Micro-batch loading and Amazon Kinesis Firehose for near-real time analytics
Interactive data analysis and
recommendation engine
Ride analytics for pricing
and product development
Ad prediction and
on-demand analytics
Business Applications
Multi-Tenant BI
Applications
Back-end
services
Analytics as a
Service
Fully Managed – Provisioning, backups, upgrades, security, compression all come built-in so you can
focus on your business applications
Ease of Chargeback – Pay as you go, add clusters as needed. A few big common clusters, several
data marts
Service Oriented Architecture – Integrated with other AWS services. Easy to plug into your pipeline
Infosys Information
Platform (IIP)
Analytics-as-a-
Service
Product and Consumer
Analytics
Redshift is used for mission-critical workloads
Financial and
management reporting
Payments to suppliers
and billing workflows
Web/Mobile clickstream
and event analysis
Recommendation and
predictive analytics
Amazon Redshift is available everywhere AWS is
Dublin
Frankfurt
London
Seoul
Sydney
Tokyo
Singapore
Beijing
Mumbai
Sao Paulo
US East - Virginia
US West - Oregon
US West – Northern California
GovCloud
Columbus Ohio
Montreal
Currently Available
Coming soon
Amazon Redshift is fast
“Did I mention that it’s ridiculously fast? We’re using
it to provide our analysts with an alternative to Hadoop”
“After investigating Redshift, Snowflake, and
BigQuery, we found that Redshift offers top-of-the-
line performance at best-in-market price points”
“…[Redshift] performance has blown away everyone
here. We generally see 50-100X speedup over Hive”
“We regularly process multibillion row datasets
and we do that in a matter of hours. We are heading
to up to 10 times more data volumes in the next couple
of years, easily”
“We saw a 2X performance improvement on a wide
variety of workloads. The more complex the queries,
the higher the performance improvement”
“On our previous big data warehouse system, it took
around 45 minutes to run a query against a year of
data, but that number went down to just 25 seconds
using Amazon Redshift”
And has gotten faster...
5X Query throughput improvement this year
 Memory allocation (launched)
 Improved commit and I/O logic (launched)
 Queue hopping (launched)
 Query monitoring rules (coming soon)
 Power start (coming soon)
 Short query bias (coming soon)
10X Vacuuming performance improvement
 Ensures data is sorted for efficient and fast I/O
 Reclaims space from deleted rows
 Enhanced vacuum performance leads to better system throughput
Fast
Efficient
Amazon Redshift is easy to use
“With Amazon Redshift and Tableau, anyone in the
company can set up any queries they like - from how
users are reacting to a feature, to growth by demographic or
geography, to the impact sales efforts had in different areas”
“The doors were blown wide open to create custom
dashboards for anyone to instantly go in and see and
assess what is going in our ad delivery landscape,
something we have never been able to do until now.”
Provides an easy-to-use mechanism for querying data with
quick and uniform response times that analysts can use to
run research projects and perform in-depth analysis…We don’t
have to pre-allocate resources and can easily scale up to meet
demand and then scale down for efficiency”
Amazon Redshift is easy to use
Provisioning in
minutes
Automatic patching SQL - Data loading
Backups are built-in Security is built-in Compression is built-in
Amazon Redshift is cheap
“450,000 online queries 98 percent faster than previous
traditional data center, while reducing infrastructure costs by
80 percent.”
“Annual costs of Redshift are equivalent to just the annual
maintenance of some of the cheaper on-premises options
for data warehouses..”
“Most competing data warehousing solutions would have cost
us up to $1 million a year. By contrast, Amazon Redshift costs
us just $100,000 all-in, representing a total cost savings of
around 90%”
Durability and availability
Automated backups Cross-region backups Cluster-level mirroring
Streaming restore Monitoring Ticketing
Amazon Redshift is secure
End-to-End
data encryption
Alerts & Notifications Virtual private cloud
AWS KMS & HSM Audit logging Certifications & Compliance
Amazon Redshift
shift
Fast, simple, petabyte-scale data warehousing for $1,000/TB/Year
Available now
 Queue hopping
 10X VACUUM performance improvement
 Node fault tolerance
 Enhanced VPC routing
 IAM support for LOAD/UNLOAD
 Auto compression for CTAS
 TimestampTZ datatype
 Query Monitoring rules
Coming soon
 Automatic and incremental background
VACUUM
 Short query bias
 Power start
 IAM Authentication for DB users
 Auto compression for new tables
 Enhanced JSON & AVRO ingestion performance
`
Informatica for AWS
22 © Informatica. Proprietary and Confidential.
Digital Transformation is happening Now !
NewOld
23 © Informatica. Proprietary and Confidential.
Paths to Cloud Data Warehousing and Analytics
Extend Migrate Born in the Cloud
• Quickly meet new
business demands
• Variety & Volumes
of data for analysis
• Current warehouse not
performing & need to
scale
• Reduce costs (platform &
maintenance)
• Agile Self-Service
Analytics
• Highly Scalable
• Elasticity
25 May 2017
BUILDING A CLOUD BASED
DATA WAREHOUSE
hbx.hbs.ed
RYAN FRAZIER
Director, Systems Engineering & Operations
Harvard Business School/HBX
@rrfrazier
Agenda
25
• About HBX
• HBX Data Management Initiative
• Architecture & Implementation
• Challenges
• Reflections
Reimagining Digital Education @ HBS
 Newest division at HBS, tasked with reimagining business education
for the digital age
 First course June 2014 to deliver HBS experience online
 Multiple Course Offerings
 CORe (Business Analytics, Financial Accounting, and
Economics for Managers)
 Disruptive Strategy with Clayton Christensen
 Leading with Finance
 Negotiation Mastery
 Managing Your Career Development
The teaching model sets HBX apart
from many online learning options
and is reflective of the HBS in-
person classroom approach
HBX Overview
27
 Mainly
asynchronous online
business education
 Engagement
through student
interaction in
cohorts of ~400
 Case-based
learning with highly
interactive teaching
elements and peer
help
HBX Course
Platform
(AWS)
28
29
 Studio-based
virtual classroom
 Synchronous
audio/video with
chat, polls, boards
 Up to 60 global
students on studio
wall, hundreds or
more observers
HBX Live
Platform
Developing a Data-Driven Culture
Why Create a Data-Driven Culture?
31
Improve Effectiveness
• Scale data intensive
activities like marketing,
admissions, & grading
• Use data to test ideas
and improve quality of
decisions
Enhance Outcomes
• Identify challenging
content
• Evaluate and improve
interactive content, social
engagement & retention
• Proactively support
struggling students
Refine Pedagogy
• Evaluate new
pedagogical
approaches
• Optimize evaluation
approaches
• Support pedagogical
research activities and
innovation
Foster Innovation & Continuous Improvement
• Identify and evaluate innovation opportunities
• Drive continuous improvement
Students Staff Faculty
Data Management Program Objectives
32
Integrate Data Sources into
Comprehensive Data Warehouse
Build Reports and Dashboards
Enable Self Service Ensure Data Quality and Integrity
Enablers for Building Data Driven Culture at HBX
• Use off-shore partner
Mindtree to accelerate
• Active engagement of
vendors on technology
challenges
• Short internal presentations
• Data Analysis Exercise at
all-staff team meeting
• Active interest & involvement
from Business Areas
• Alignment to organizational
priorities
• HBX willingness to try new
things
• Helps drive engagement
with vendors
Education
Strong Partners Program Governance
Experimentation
34Building a Data Management Practice
Core Tool Selection
35
Data Warehouse: Amazon Redshift + Snowflake
• Chose Redshift for scalability, performance, ease of management
• Aligns with AWS platform/ecosystem focus at HBX
• Easy integration with other AWS services
• Simplified vendor, contract, and cost management
• Leverage existing operations tools for monitoring, alerting, etc.
• Snowflake for JSON data lake and intermediate processing
ETL: Informatica Cloud
• Myriad of pluggable connectors
• S3, Redshift, Salesforce, ServiceNow, ODBC, REST
• Cloud-based architecture w/minimal infrastructure management
• Productivity-focused development tools for rapid implementation
• Extensible offering for future initiatives (MDM, Customer 360)
• Aligned to larger University vendor relationship & investments
Reporting/Analytics: Tableau
HBX Data Management by the Numbers
36
Source Systems
• 4 Major Systems
• 46 databases
• 598 Tables
• 98 Mongo Collections
Data Warehouse
• 3 Redshift Nodes
• 1 Snowflake warehouse
• 18 Schemas
• 434 tables
• 7479 fields
• 1,184,804,787 rows
• 278 GB
Daily ETL Process
• 770 jobs
• 35 MM+ rows
* Updated 5/2017
Course
Platform
Ver. B
HBX Core Data Ecosystem (at Launch)
37
Course
Platform
Ver. A
Historical Data
MongoDB
MySQL
Admin System
MySQL
Amazon
Redshift
About HBX
HBX Data
Management Initiative
Architecture &
Implementation
Challenges
Reflections
Tableau
Server
Secure
Agent
Informatica
Cloud
Services
MongoDB
MySQL
MongoDB
MySQL
Reporting
Copy
Metadata
HBX Data Ecosystem—Spring 2017
Reporting
Copy
Course Platform
MongoDB MySQL
Historical Data
MongoDB MySQL
Admin
System
MySQL
Salesforce
sync
ServiceNo
w
Google
Analytics
HBX AWS Acct
Snowflake
Virtual DWUpdated 3/31/2017
Snowflake
Metadata
Services
Snowflake Data
Persistence
Interactive
Reports
External Data Sources
Ad Hoc Query/Reporting
Course Platform
MongoDB data (json)
HBX Live
Hubspot
Ad Hoc Query/Reporting
Local Data Center
Firehose
Tableau
Secure
Agent
Existing
Data Flow
Proposed
Future
Data Flow
Informatica
Cloud
Services
Amazon
Redshift
39
Reflections on HBX Data Program
Achieving Technical Objectives
40
Integrate Data Sources into
Comprehensive Data Warehouse
Build Reports and Dashboards
Enable Self Service Ensure Data Quality and
Integrity
Daily load from 4 core systems
(Legacy Business System,
Salesforce, Course Platform,
HBX Live)
33 production reports, serving all
HBX business units
>30 monthly active Tableau users
(~40% of staff)
0.5 FTE technical QA staff and
robust development/test/release
process
Reflections on HBX Implementation
41
Cloud Services Significantly Increase Agility
• Rapid provisioning of services
• Easy, on-demand scaling as business grows
• Cost-effective to support multiple environments
• Out-of-the box flexibility and feature enhancements
Semi-structured data stores can present challenges for reporting
• Think about data structures during development
• Consider your data pipeline and how data will be used
• Rapidly evolving and maturing space
QA & Testing are Critical to Success
• Reports and results only as good as the input
• Plead with your users to report problems and concerns!
HBX Business Outcomes
42
Automation of Manual Business Processes
• Moving from complex, time-consuming, error-prone
spreadsheet processes to purpose-built data products
Deeper Insight Into Prospects and Participants
• Ability to bridge top (marketing, leads) and bottom
(applicants, registrants, and completers) of funnel: who are
our prospects, how do we improve our yield, and how does
this relate to outcomes?
Growth of a “Data Driven” Culture
• Increasing use of data to understand our users, identify
challenges and opportunities, and drive decisions
• More focus on hypothesis driven experiments and A/B
testing
What’s Next and Why?
43
Analytics
• Greater understanding of
users
• Research and pedagogy
Additional Data Sources
• Greater understanding of users &
systems
• Automate additional processes
Streaming & Machine Data
• Real-time data reporting
• Access additional data
Machine Learning
• Automate additional
processes
`
Informatica for AWS
45 © Informatica. Proprietary and Confidential.
RedShift Upsert – The Coding Way !
Extract the data from source
Put into flat files and compress
Transfer Compressed Files To S3
Wait for S3 Consistency
Copy Data From S3 Into Staging
TableInner Join With Target Table To
Delete Rows To Be Updated
Insert Updated Rows From Staging Table
Delete Staging Table
Delete Files From S3
Create Staging Table in Redshift
1
2
3
4
5
6
7
8
9
10
Or, Do It In 3 Simple Steps…
46 © Informatica. Proprietary and Confidential.
RedShift Upsert – The Informatica Way !
Choose Upsert Operation1
Map Your Fields2
Run Or Schedule!3
47 © Informatica. Proprietary and Confidential.
Agile Integration and Scalability
Unlock your data
1 2
Scalability &
Operation Confidence
UI maximizes productivity
for developers
& citizen integrators
Codeless development
Out of box templates & wizards
Easy to use & highly reusable
3
Hundreds of pre-built
connectors for cloud and
on-premise data sources
Optimized for
largest data volumes
Monitoring & Administration
CONNECT DEVELOP DEPLOY
48 © Informatica. Proprietary and Confidential.
Cloud Apps (SaaS)Data Stores
DBs, DWs, Big Data, Cloud
Enterprise Systems B2B
Middleware and Tech
Analytics
Social Apps
100’s of Connectors, For Any Data Source
49 © Informatica. Proprietary and Confidential.
Informatica for RedShift Overview
ETL (1, 2, 3)
1. Bulk Source Data
Ingestion
2. Multi-part load into S3
of compressed files
3. Copy S3 data into
RedShift Staging
ELT (4, 5, 6)
SQL Pushdown for RedShift
to RedShift Table
Integrations within same
RedShift Cluster
Redshift
Staging
Amazon
S3
1
2
3
4 5
Redshift
Intermediate
Redshift
Analytics
6
4 5
Same RedShift Cluster
50 © Informatica. Proprietary and Confidential.
Optimized Data Ingestion into RedShift
1. Source Bulk Data Loader
2. Partitions - parallel data pipelines
3. Local staging files
4. S3 Parallel Upload
5. Copy Command to RedShift
51 © Informatica. Proprietary and Confidential.
Batching of Reads
• Reduce I/O Trips
• Default 2 Million rows
• 3x improvement by increasing batch
size to 16 Million rows for TPC-H
Schema
Key Range Partitioning
• 4x Performance Improvement with 16
partitions for TPC-H Schema
• Optimal when partition keys are same
as RedShift Distribution Keys
Bulk APIs and
Batch Rows for
Fast Ingestion
Key Range Partitioning
Optimized Data Ingestion – Batching &
Partitioning
52 © Informatica. Proprietary and Confidential.
Informatica for Amazon Redshift: We Have Your Back!
Robust Comprehensive
 Partitioning
 SQL Pushdown
 Optimized Lookups
 Multi-part Upload & Download
 Compression before S3
Upload
Flexible
 AWS KMS Support
 IAM Roles
 Client & Server Side Encryption
 S3 VPC Endpoint
Secure
 Error management, Notifications, & Alerts
 Error Tables or S3 Buckets for repair
 Auto-handle special characters
 Dynamically create targets
 PRIMARY, FOREIGN, & UNIQUE
Keys
 SORT & DISTRIBUTION keys
 ANALYZE & VACUUM options
 Configurable S3 Copy Options
 Pre and Post SQL
 SQL Overrides
 S3 data retention policies
 AWS Multi-Region support
 Secure Agent on premise
 Informatica Hosted agent
 Agent on AWS
 Configurable S3 Copy
Options
 Dynamic S3 Buckets
High Performance
53 © Informatica. Proprietary and Confidential.
Informatica Products on AWS
Power Center
Informatica
Cloud
Big Data
Management
Enterprise
Informatica
Catalog
Informatica
Cloud
Intelligent Data
Lake
Informatica
Data Quality
Enterprise
Informatica
Catalog
Power Center
Master Data
Management
Big Data
Management
Informatica
Data Quality
Certified
Available
54
Try & Buy Informatica Cloud on AWS Marketplace
http://infa.media/AWSRedshift
Easy Deployment from
AWS Marketplace
Flexible pricing options
• Hourly Pricing (PAYG)
• Contract Pricing (Annual
Subscription)
55
PowerCenter on AWS Marketplace
PowerCenter Quick Start on AWS PowerCenter on AWS Marketplace
56
Informatica Cloud/iPaaS Proven Success
1 T+
Records/month
120% growth YoY
3M+
Integrations/day
200%+ growth YoY
7000+
Customers
Connecting 100,000 applications, DBs and other
end points
100+
OEMS
150+
iPaaS Connectors
>300%
Growth of API volume in
CY16
57
Sample Joint Customers
Informatica’s Comprehensive Solution
Intelligent
Data Platform
ACLOUD
REAL TIME/
STREAMING
BIG
DATA
TRADITIONAL
DATA
INTEGRATION
BIG DATA
MANAGEMENT
MASTER DATA
MANAGEMENT
DATA
QUALITY
DATA
SECURITY
CLOUD DATA
MANAGEMENT
Products
Solutions
MONITOR AND MANAGE
CONNECTIVITY
COMPUTE
Enterprise Cloud
Data Management
CUSTOMER
360
DATA
GOVERNANCE
REFERENCE
360
INTELLIGEN
T
DATA LAKE
SECURE@SOURC
E
PRODUCT
360
ENTERPRISE
INFORMATION
CATALOG
SUPPLIER
360
(ENTERPRISE UNIFIED METADATA
INTELLIGENCE)
Look No Further
60 © Informatica. Proprietary and Confidential.
Learn more…..
Learn & Prepare
• Cloud Analytics with
Informatica Cloud &
Amazon Redshift
• PowerCenter on AWS
• Data Lakes on AWS
Get Started on AWS MarketplaceDeep-Dive
`
Thank You
Q&A

Weitere ähnliche Inhalte

Was ist angesagt?

Getting started on your AWS migration journey
Getting started on your AWS migration journeyGetting started on your AWS migration journey
Getting started on your AWS migration journeyAmazon Web Services
 
IDC and AWS Joint Webinar: Getting the most bang for your buck with EC2 Spot -
IDC and AWS Joint Webinar: Getting the most bang for your buck with EC2 Spot - IDC and AWS Joint Webinar: Getting the most bang for your buck with EC2 Spot -
IDC and AWS Joint Webinar: Getting the most bang for your buck with EC2 Spot - Amazon Web Services
 
Building an Investment Case for Mass Migrations to AWS - AWS Summit SG 2017
Building an Investment Case for Mass Migrations to AWS - AWS Summit SG 2017Building an Investment Case for Mass Migrations to AWS - AWS Summit SG 2017
Building an Investment Case for Mass Migrations to AWS - AWS Summit SG 2017Amazon Web Services
 
AWS Technical Due Diligence Workshop Session Two
AWS Technical Due Diligence Workshop Session TwoAWS Technical Due Diligence Workshop Session Two
AWS Technical Due Diligence Workshop Session TwoTom Laszewski
 
Hybrid Cloud Customer Use Cases on AWS
Hybrid Cloud Customer Use Cases on AWSHybrid Cloud Customer Use Cases on AWS
Hybrid Cloud Customer Use Cases on AWSTom Laszewski
 
Enterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - BusinessEnterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - BusinessAmazon Web Services
 
Simplify Your Migration to AWS and Cut Costs by 30% with TSO Logic
 Simplify Your Migration to AWS and Cut Costs by 30% with TSO Logic Simplify Your Migration to AWS and Cut Costs by 30% with TSO Logic
Simplify Your Migration to AWS and Cut Costs by 30% with TSO LogicAmazon Web Services
 
(ISM315) How to Quantify TCO & Increase Business Value Gains Using AWS
(ISM315) How to Quantify TCO & Increase Business Value Gains Using AWS(ISM315) How to Quantify TCO & Increase Business Value Gains Using AWS
(ISM315) How to Quantify TCO & Increase Business Value Gains Using AWSAmazon Web Services
 
AWS re:Invent 2016: Building Enterprise Cloud Operations As a Service with T-...
AWS re:Invent 2016: Building Enterprise Cloud Operations As a Service with T-...AWS re:Invent 2016: Building Enterprise Cloud Operations As a Service with T-...
AWS re:Invent 2016: Building Enterprise Cloud Operations As a Service with T-...Amazon Web Services
 
AWS Public Sector Symposium 2014 Canberra | Keynote
AWS Public Sector Symposium 2014 Canberra | KeynoteAWS Public Sector Symposium 2014 Canberra | Keynote
AWS Public Sector Symposium 2014 Canberra | KeynoteAmazon Web Services
 
Migrating into a cloud
Migrating into a cloudMigrating into a cloud
Migrating into a cloudANUSUYA T K
 
AWSome Day Hong Kong and Taipei - Opening Keynote
AWSome Day Hong Kong and Taipei - Opening KeynoteAWSome Day Hong Kong and Taipei - Opening Keynote
AWSome Day Hong Kong and Taipei - Opening KeynoteAmazon Web Services
 
Cloud Adoption Framework - AWS Innovate Ottawa:
 Cloud Adoption Framework - AWS Innovate Ottawa: Cloud Adoption Framework - AWS Innovate Ottawa:
Cloud Adoption Framework - AWS Innovate Ottawa:Amazon Web Services
 
Keynote Roberto Delamora - AWS Cloud Experience Argentina
Keynote Roberto Delamora - AWS Cloud Experience ArgentinaKeynote Roberto Delamora - AWS Cloud Experience Argentina
Keynote Roberto Delamora - AWS Cloud Experience ArgentinaAmazon Web Services LATAM
 
AWSome Day 2014 Kuala Lumpur - Keynote
AWSome Day 2014 Kuala Lumpur - KeynoteAWSome Day 2014 Kuala Lumpur - Keynote
AWSome Day 2014 Kuala Lumpur - KeynoteAmazon Web Services
 
Demystifying Cloud Economics - How to Build an Investment Case for Scale Migr...
Demystifying Cloud Economics - How to Build an Investment Case for Scale Migr...Demystifying Cloud Economics - How to Build an Investment Case for Scale Migr...
Demystifying Cloud Economics - How to Build an Investment Case for Scale Migr...Amazon Web Services
 

Was ist angesagt? (20)

Future of Enterprise IT
Future of Enterprise IT Future of Enterprise IT
Future of Enterprise IT
 
Getting started on your AWS migration journey
Getting started on your AWS migration journeyGetting started on your AWS migration journey
Getting started on your AWS migration journey
 
IDC and AWS Joint Webinar: Getting the most bang for your buck with EC2 Spot -
IDC and AWS Joint Webinar: Getting the most bang for your buck with EC2 Spot - IDC and AWS Joint Webinar: Getting the most bang for your buck with EC2 Spot -
IDC and AWS Joint Webinar: Getting the most bang for your buck with EC2 Spot -
 
Building an Investment Case for Mass Migrations to AWS - AWS Summit SG 2017
Building an Investment Case for Mass Migrations to AWS - AWS Summit SG 2017Building an Investment Case for Mass Migrations to AWS - AWS Summit SG 2017
Building an Investment Case for Mass Migrations to AWS - AWS Summit SG 2017
 
AWS Technical Due Diligence Workshop Session Two
AWS Technical Due Diligence Workshop Session TwoAWS Technical Due Diligence Workshop Session Two
AWS Technical Due Diligence Workshop Session Two
 
Hybrid Cloud Customer Use Cases on AWS
Hybrid Cloud Customer Use Cases on AWSHybrid Cloud Customer Use Cases on AWS
Hybrid Cloud Customer Use Cases on AWS
 
Defining Your Cloud Strategy
Defining Your Cloud StrategyDefining Your Cloud Strategy
Defining Your Cloud Strategy
 
Enterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - BusinessEnterprise Adoption – Patterns for Success with AWS - Business
Enterprise Adoption – Patterns for Success with AWS - Business
 
Simplify Your Migration to AWS and Cut Costs by 30% with TSO Logic
 Simplify Your Migration to AWS and Cut Costs by 30% with TSO Logic Simplify Your Migration to AWS and Cut Costs by 30% with TSO Logic
Simplify Your Migration to AWS and Cut Costs by 30% with TSO Logic
 
(ISM315) How to Quantify TCO & Increase Business Value Gains Using AWS
(ISM315) How to Quantify TCO & Increase Business Value Gains Using AWS(ISM315) How to Quantify TCO & Increase Business Value Gains Using AWS
(ISM315) How to Quantify TCO & Increase Business Value Gains Using AWS
 
Cloud Foundations
Cloud FoundationsCloud Foundations
Cloud Foundations
 
AWS re:Invent 2016: Building Enterprise Cloud Operations As a Service with T-...
AWS re:Invent 2016: Building Enterprise Cloud Operations As a Service with T-...AWS re:Invent 2016: Building Enterprise Cloud Operations As a Service with T-...
AWS re:Invent 2016: Building Enterprise Cloud Operations As a Service with T-...
 
AWS Public Sector Symposium 2014 Canberra | Keynote
AWS Public Sector Symposium 2014 Canberra | KeynoteAWS Public Sector Symposium 2014 Canberra | Keynote
AWS Public Sector Symposium 2014 Canberra | Keynote
 
Migrating into a cloud
Migrating into a cloudMigrating into a cloud
Migrating into a cloud
 
Cloud Migration: A How-To Guide
Cloud Migration: A How-To GuideCloud Migration: A How-To Guide
Cloud Migration: A How-To Guide
 
AWSome Day Hong Kong and Taipei - Opening Keynote
AWSome Day Hong Kong and Taipei - Opening KeynoteAWSome Day Hong Kong and Taipei - Opening Keynote
AWSome Day Hong Kong and Taipei - Opening Keynote
 
Cloud Adoption Framework - AWS Innovate Ottawa:
 Cloud Adoption Framework - AWS Innovate Ottawa: Cloud Adoption Framework - AWS Innovate Ottawa:
Cloud Adoption Framework - AWS Innovate Ottawa:
 
Keynote Roberto Delamora - AWS Cloud Experience Argentina
Keynote Roberto Delamora - AWS Cloud Experience ArgentinaKeynote Roberto Delamora - AWS Cloud Experience Argentina
Keynote Roberto Delamora - AWS Cloud Experience Argentina
 
AWSome Day 2014 Kuala Lumpur - Keynote
AWSome Day 2014 Kuala Lumpur - KeynoteAWSome Day 2014 Kuala Lumpur - Keynote
AWSome Day 2014 Kuala Lumpur - Keynote
 
Demystifying Cloud Economics - How to Build an Investment Case for Scale Migr...
Demystifying Cloud Economics - How to Build an Investment Case for Scale Migr...Demystifying Cloud Economics - How to Build an Investment Case for Scale Migr...
Demystifying Cloud Economics - How to Build an Investment Case for Scale Migr...
 

Ähnlich wie HBX: Harvard Business School's Digital Education Goes Data-Centric with Amazon Redshift + Informatica

AWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions ShowcaseAWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions ShowcaseAmazon Web Services
 
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)Amazon Web Services
 
Track 3 Session 4_企業工作負載遷移至 AWS 的最佳實踐
Track 3 Session 4_企業工作負載遷移至 AWS 的最佳實踐Track 3 Session 4_企業工作負載遷移至 AWS 的最佳實踐
Track 3 Session 4_企業工作負載遷移至 AWS 的最佳實踐Amazon Web Services
 
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014Amazon Web Services
 
Introduction to the AWS Cloud from Digital Tuesday Meetup
Introduction to the AWS Cloud from Digital Tuesday MeetupIntroduction to the AWS Cloud from Digital Tuesday Meetup
Introduction to the AWS Cloud from Digital Tuesday MeetupIan Massingham
 
Innovation, Cloud Powered - Dr Werner Vogels
Innovation, Cloud Powered - Dr Werner VogelsInnovation, Cloud Powered - Dr Werner Vogels
Innovation, Cloud Powered - Dr Werner VogelsAmazon Web Services
 
Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...
Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...
Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...Amazon Web Services
 
Migrate and Manage Workloads with Apps Associates
Migrate and Manage Workloads with Apps AssociatesMigrate and Manage Workloads with Apps Associates
Migrate and Manage Workloads with Apps AssociatesAmazon Web Services
 
Vn introduction to cloud computing with amazon web services
Vn   introduction to cloud computing with amazon web servicesVn   introduction to cloud computing with amazon web services
Vn introduction to cloud computing with amazon web servicesAWS Vietnam Community
 
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...Amazon Web Services
 
Amazon Web Services
Amazon Web ServicesAmazon Web Services
Amazon Web ServicesJisc
 
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹Amazon Web Services
 
Vancouver keynote - AWS Innovate - Sam Elmalak
Vancouver keynote - AWS Innovate - Sam ElmalakVancouver keynote - AWS Innovate - Sam Elmalak
Vancouver keynote - AWS Innovate - Sam ElmalakAmazon Web Services
 
Choosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudChoosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudJames Serra
 
Jeff Kratz - Cloud Computing
Jeff Kratz - Cloud ComputingJeff Kratz - Cloud Computing
Jeff Kratz - Cloud ComputingLuz Fiumara
 

Ähnlich wie HBX: Harvard Business School's Digital Education Goes Data-Centric with Amazon Redshift + Informatica (20)

AWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions ShowcaseAWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions Showcase
 
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
AWS re:Invent 2016: What’s New with Amazon Redshift (BDA304)
 
Track 3 Session 4_企業工作負載遷移至 AWS 的最佳實踐
Track 3 Session 4_企業工作負載遷移至 AWS 的最佳實踐Track 3 Session 4_企業工作負載遷移至 AWS 的最佳實踐
Track 3 Session 4_企業工作負載遷移至 AWS 的最佳實踐
 
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
(BDT201) Big Data and HPC State of the Union | AWS re:Invent 2014
 
Introduction to the AWS Cloud from Digital Tuesday Meetup
Introduction to the AWS Cloud from Digital Tuesday MeetupIntroduction to the AWS Cloud from Digital Tuesday Meetup
Introduction to the AWS Cloud from Digital Tuesday Meetup
 
Innovation, Cloud Powered - Dr Werner Vogels
Innovation, Cloud Powered - Dr Werner VogelsInnovation, Cloud Powered - Dr Werner Vogels
Innovation, Cloud Powered - Dr Werner Vogels
 
Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...
Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...
Cloud Native, Cloud First and Hybrid: How Different Organizations are Approac...
 
Migrate and Manage Workloads with Apps Associates
Migrate and Manage Workloads with Apps AssociatesMigrate and Manage Workloads with Apps Associates
Migrate and Manage Workloads with Apps Associates
 
Vn introduction to cloud computing with amazon web services
Vn   introduction to cloud computing with amazon web servicesVn   introduction to cloud computing with amazon web services
Vn introduction to cloud computing with amazon web services
 
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...
 
Amazon Web Services
Amazon Web ServicesAmazon Web Services
Amazon Web Services
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution Days
 
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
透過 Amazon Redshift 打造數據分析服務及 Amazon Redshift 新功能案例介紹
 
Building a Data Lake on AWS
Building a Data Lake on AWSBuilding a Data Lake on AWS
Building a Data Lake on AWS
 
Application Migrations at Scale
Application Migrations at ScaleApplication Migrations at Scale
Application Migrations at Scale
 
Vancouver keynote - AWS Innovate - Sam Elmalak
Vancouver keynote - AWS Innovate - Sam ElmalakVancouver keynote - AWS Innovate - Sam Elmalak
Vancouver keynote - AWS Innovate - Sam Elmalak
 
Transforming Your IT with AWS
Transforming Your IT with AWSTransforming Your IT with AWS
Transforming Your IT with AWS
 
2016 AWS Big Data Solution Days
2016 AWS Big Data Solution Days2016 AWS Big Data Solution Days
2016 AWS Big Data Solution Days
 
Choosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudChoosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloud
 
Jeff Kratz - Cloud Computing
Jeff Kratz - Cloud ComputingJeff Kratz - Cloud Computing
Jeff Kratz - Cloud Computing
 

Mehr von Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Mehr von Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Kürzlich hochgeladen

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 

Kürzlich hochgeladen (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

HBX: Harvard Business School's Digital Education Goes Data-Centric with Amazon Redshift + Informatica

  • 1. HBX: Harvard Business School’s Digital Education Goes Data-Centric with Amazon Redshift + Informatica
  • 2. Today’s Presenters David Potes, Manager, Solutions Architecture, Amazon Web Services Andrew McIntyre, Solutions Architect, Informatica Ryan Frazier, Director, Systems Engineering & Operations, HBX
  • 3. Today’s Agenda • An overview of AWS and AWS Marketplace, with an emphasis on AWS Big Data solutions • Informatica solution overview • The HBX success story with AWS and Informatica • Q&A/Discussion
  • 4. Learning Objectives 1. Becoming a data-centric organization for best business results 2. The benefits of agile analytics with cloud data warehouse 3. Rapidly integrating high volume, disparate data into a trusted source
  • 5. Amazon Redshift shift Fast, simple, petabyte-scale data warehousing for $1,000/TB/Year 140+ features
  • 6. AWS Big Data Portfolio Collect Store Analyze Amazon Kinesis Firehose AWS Direct Connect Amazon Snowball Amazon Kinesis Analytics Amazon Kinesis Streams Amazon S3 Amazon Glacier Amazon CloudSearch Amazon RDS, Amazon Aurora Amazon Dynamo DB Amazon Elasticsearch Amazon EMR Amazon EC2 Amazon Redshift Amazon Machine Learning Amazon QuickSight AWS Data Pipeline AWS Database Migration Service AWS Glue Amazon Athena
  • 7. Legacy architectural models lead to dark data 0 200 400 600 800 1000 1200 Enterprise Data Data in Warehouse Very Expensive Lock-In Proprietary Inflexible licensing
  • 8. Traditional Data Warehousing Business Reporting Complex pipelines and queries Secure and Compliant Easy Migration – Point & Click using AWS Database Migration Service Secure & Compliant – End-to-End Encryption. SOC 1/2/3, PCI-DSS, HIPAA and FedRAMP compliant Large Ecosystem – Variety of cloud and on-premises BI and ETL tools Japanese Mobile Phone Provider Powering 100 marketplaces in 50 countries World’s Largest Children’s Book Publisher Bulk Loads and Updates
  • 9. Log Analysis Log & Machine IOT Data Clickstream Events Data Time-Series Data Cheap – Analyze large volumes of data cost-effectively Fast – Massively Parallel Processing (MPP) and columnar architecture for fast queries and parallel loads Near real-time – Micro-batch loading and Amazon Kinesis Firehose for near-real time analytics Interactive data analysis and recommendation engine Ride analytics for pricing and product development Ad prediction and on-demand analytics
  • 10. Business Applications Multi-Tenant BI Applications Back-end services Analytics as a Service Fully Managed – Provisioning, backups, upgrades, security, compression all come built-in so you can focus on your business applications Ease of Chargeback – Pay as you go, add clusters as needed. A few big common clusters, several data marts Service Oriented Architecture – Integrated with other AWS services. Easy to plug into your pipeline Infosys Information Platform (IIP) Analytics-as-a- Service Product and Consumer Analytics
  • 11. Redshift is used for mission-critical workloads Financial and management reporting Payments to suppliers and billing workflows Web/Mobile clickstream and event analysis Recommendation and predictive analytics
  • 12. Amazon Redshift is available everywhere AWS is Dublin Frankfurt London Seoul Sydney Tokyo Singapore Beijing Mumbai Sao Paulo US East - Virginia US West - Oregon US West – Northern California GovCloud Columbus Ohio Montreal Currently Available Coming soon
  • 13. Amazon Redshift is fast “Did I mention that it’s ridiculously fast? We’re using it to provide our analysts with an alternative to Hadoop” “After investigating Redshift, Snowflake, and BigQuery, we found that Redshift offers top-of-the- line performance at best-in-market price points” “…[Redshift] performance has blown away everyone here. We generally see 50-100X speedup over Hive” “We regularly process multibillion row datasets and we do that in a matter of hours. We are heading to up to 10 times more data volumes in the next couple of years, easily” “We saw a 2X performance improvement on a wide variety of workloads. The more complex the queries, the higher the performance improvement” “On our previous big data warehouse system, it took around 45 minutes to run a query against a year of data, but that number went down to just 25 seconds using Amazon Redshift”
  • 14. And has gotten faster... 5X Query throughput improvement this year  Memory allocation (launched)  Improved commit and I/O logic (launched)  Queue hopping (launched)  Query monitoring rules (coming soon)  Power start (coming soon)  Short query bias (coming soon) 10X Vacuuming performance improvement  Ensures data is sorted for efficient and fast I/O  Reclaims space from deleted rows  Enhanced vacuum performance leads to better system throughput Fast Efficient
  • 15. Amazon Redshift is easy to use “With Amazon Redshift and Tableau, anyone in the company can set up any queries they like - from how users are reacting to a feature, to growth by demographic or geography, to the impact sales efforts had in different areas” “The doors were blown wide open to create custom dashboards for anyone to instantly go in and see and assess what is going in our ad delivery landscape, something we have never been able to do until now.” Provides an easy-to-use mechanism for querying data with quick and uniform response times that analysts can use to run research projects and perform in-depth analysis…We don’t have to pre-allocate resources and can easily scale up to meet demand and then scale down for efficiency”
  • 16. Amazon Redshift is easy to use Provisioning in minutes Automatic patching SQL - Data loading Backups are built-in Security is built-in Compression is built-in
  • 17. Amazon Redshift is cheap “450,000 online queries 98 percent faster than previous traditional data center, while reducing infrastructure costs by 80 percent.” “Annual costs of Redshift are equivalent to just the annual maintenance of some of the cheaper on-premises options for data warehouses..” “Most competing data warehousing solutions would have cost us up to $1 million a year. By contrast, Amazon Redshift costs us just $100,000 all-in, representing a total cost savings of around 90%”
  • 18. Durability and availability Automated backups Cross-region backups Cluster-level mirroring Streaming restore Monitoring Ticketing
  • 19. Amazon Redshift is secure End-to-End data encryption Alerts & Notifications Virtual private cloud AWS KMS & HSM Audit logging Certifications & Compliance
  • 20. Amazon Redshift shift Fast, simple, petabyte-scale data warehousing for $1,000/TB/Year Available now  Queue hopping  10X VACUUM performance improvement  Node fault tolerance  Enhanced VPC routing  IAM support for LOAD/UNLOAD  Auto compression for CTAS  TimestampTZ datatype  Query Monitoring rules Coming soon  Automatic and incremental background VACUUM  Short query bias  Power start  IAM Authentication for DB users  Auto compression for new tables  Enhanced JSON & AVRO ingestion performance
  • 22. 22 © Informatica. Proprietary and Confidential. Digital Transformation is happening Now ! NewOld
  • 23. 23 © Informatica. Proprietary and Confidential. Paths to Cloud Data Warehousing and Analytics Extend Migrate Born in the Cloud • Quickly meet new business demands • Variety & Volumes of data for analysis • Current warehouse not performing & need to scale • Reduce costs (platform & maintenance) • Agile Self-Service Analytics • Highly Scalable • Elasticity
  • 24. 25 May 2017 BUILDING A CLOUD BASED DATA WAREHOUSE hbx.hbs.ed RYAN FRAZIER Director, Systems Engineering & Operations Harvard Business School/HBX @rrfrazier
  • 25. Agenda 25 • About HBX • HBX Data Management Initiative • Architecture & Implementation • Challenges • Reflections
  • 27.  Newest division at HBS, tasked with reimagining business education for the digital age  First course June 2014 to deliver HBS experience online  Multiple Course Offerings  CORe (Business Analytics, Financial Accounting, and Economics for Managers)  Disruptive Strategy with Clayton Christensen  Leading with Finance  Negotiation Mastery  Managing Your Career Development The teaching model sets HBX apart from many online learning options and is reflective of the HBS in- person classroom approach HBX Overview 27
  • 28.  Mainly asynchronous online business education  Engagement through student interaction in cohorts of ~400  Case-based learning with highly interactive teaching elements and peer help HBX Course Platform (AWS) 28
  • 29. 29  Studio-based virtual classroom  Synchronous audio/video with chat, polls, boards  Up to 60 global students on studio wall, hundreds or more observers HBX Live Platform
  • 31. Why Create a Data-Driven Culture? 31 Improve Effectiveness • Scale data intensive activities like marketing, admissions, & grading • Use data to test ideas and improve quality of decisions Enhance Outcomes • Identify challenging content • Evaluate and improve interactive content, social engagement & retention • Proactively support struggling students Refine Pedagogy • Evaluate new pedagogical approaches • Optimize evaluation approaches • Support pedagogical research activities and innovation Foster Innovation & Continuous Improvement • Identify and evaluate innovation opportunities • Drive continuous improvement Students Staff Faculty
  • 32. Data Management Program Objectives 32 Integrate Data Sources into Comprehensive Data Warehouse Build Reports and Dashboards Enable Self Service Ensure Data Quality and Integrity
  • 33. Enablers for Building Data Driven Culture at HBX • Use off-shore partner Mindtree to accelerate • Active engagement of vendors on technology challenges • Short internal presentations • Data Analysis Exercise at all-staff team meeting • Active interest & involvement from Business Areas • Alignment to organizational priorities • HBX willingness to try new things • Helps drive engagement with vendors Education Strong Partners Program Governance Experimentation
  • 34. 34Building a Data Management Practice
  • 35. Core Tool Selection 35 Data Warehouse: Amazon Redshift + Snowflake • Chose Redshift for scalability, performance, ease of management • Aligns with AWS platform/ecosystem focus at HBX • Easy integration with other AWS services • Simplified vendor, contract, and cost management • Leverage existing operations tools for monitoring, alerting, etc. • Snowflake for JSON data lake and intermediate processing ETL: Informatica Cloud • Myriad of pluggable connectors • S3, Redshift, Salesforce, ServiceNow, ODBC, REST • Cloud-based architecture w/minimal infrastructure management • Productivity-focused development tools for rapid implementation • Extensible offering for future initiatives (MDM, Customer 360) • Aligned to larger University vendor relationship & investments Reporting/Analytics: Tableau
  • 36. HBX Data Management by the Numbers 36 Source Systems • 4 Major Systems • 46 databases • 598 Tables • 98 Mongo Collections Data Warehouse • 3 Redshift Nodes • 1 Snowflake warehouse • 18 Schemas • 434 tables • 7479 fields • 1,184,804,787 rows • 278 GB Daily ETL Process • 770 jobs • 35 MM+ rows * Updated 5/2017
  • 37. Course Platform Ver. B HBX Core Data Ecosystem (at Launch) 37 Course Platform Ver. A Historical Data MongoDB MySQL Admin System MySQL Amazon Redshift About HBX HBX Data Management Initiative Architecture & Implementation Challenges Reflections Tableau Server Secure Agent Informatica Cloud Services MongoDB MySQL MongoDB MySQL Reporting Copy Metadata
  • 38. HBX Data Ecosystem—Spring 2017 Reporting Copy Course Platform MongoDB MySQL Historical Data MongoDB MySQL Admin System MySQL Salesforce sync ServiceNo w Google Analytics HBX AWS Acct Snowflake Virtual DWUpdated 3/31/2017 Snowflake Metadata Services Snowflake Data Persistence Interactive Reports External Data Sources Ad Hoc Query/Reporting Course Platform MongoDB data (json) HBX Live Hubspot Ad Hoc Query/Reporting Local Data Center Firehose Tableau Secure Agent Existing Data Flow Proposed Future Data Flow Informatica Cloud Services Amazon Redshift
  • 39. 39 Reflections on HBX Data Program
  • 40. Achieving Technical Objectives 40 Integrate Data Sources into Comprehensive Data Warehouse Build Reports and Dashboards Enable Self Service Ensure Data Quality and Integrity Daily load from 4 core systems (Legacy Business System, Salesforce, Course Platform, HBX Live) 33 production reports, serving all HBX business units >30 monthly active Tableau users (~40% of staff) 0.5 FTE technical QA staff and robust development/test/release process
  • 41. Reflections on HBX Implementation 41 Cloud Services Significantly Increase Agility • Rapid provisioning of services • Easy, on-demand scaling as business grows • Cost-effective to support multiple environments • Out-of-the box flexibility and feature enhancements Semi-structured data stores can present challenges for reporting • Think about data structures during development • Consider your data pipeline and how data will be used • Rapidly evolving and maturing space QA & Testing are Critical to Success • Reports and results only as good as the input • Plead with your users to report problems and concerns!
  • 42. HBX Business Outcomes 42 Automation of Manual Business Processes • Moving from complex, time-consuming, error-prone spreadsheet processes to purpose-built data products Deeper Insight Into Prospects and Participants • Ability to bridge top (marketing, leads) and bottom (applicants, registrants, and completers) of funnel: who are our prospects, how do we improve our yield, and how does this relate to outcomes? Growth of a “Data Driven” Culture • Increasing use of data to understand our users, identify challenges and opportunities, and drive decisions • More focus on hypothesis driven experiments and A/B testing
  • 43. What’s Next and Why? 43 Analytics • Greater understanding of users • Research and pedagogy Additional Data Sources • Greater understanding of users & systems • Automate additional processes Streaming & Machine Data • Real-time data reporting • Access additional data Machine Learning • Automate additional processes
  • 45. 45 © Informatica. Proprietary and Confidential. RedShift Upsert – The Coding Way ! Extract the data from source Put into flat files and compress Transfer Compressed Files To S3 Wait for S3 Consistency Copy Data From S3 Into Staging TableInner Join With Target Table To Delete Rows To Be Updated Insert Updated Rows From Staging Table Delete Staging Table Delete Files From S3 Create Staging Table in Redshift 1 2 3 4 5 6 7 8 9 10 Or, Do It In 3 Simple Steps…
  • 46. 46 © Informatica. Proprietary and Confidential. RedShift Upsert – The Informatica Way ! Choose Upsert Operation1 Map Your Fields2 Run Or Schedule!3
  • 47. 47 © Informatica. Proprietary and Confidential. Agile Integration and Scalability Unlock your data 1 2 Scalability & Operation Confidence UI maximizes productivity for developers & citizen integrators Codeless development Out of box templates & wizards Easy to use & highly reusable 3 Hundreds of pre-built connectors for cloud and on-premise data sources Optimized for largest data volumes Monitoring & Administration CONNECT DEVELOP DEPLOY
  • 48. 48 © Informatica. Proprietary and Confidential. Cloud Apps (SaaS)Data Stores DBs, DWs, Big Data, Cloud Enterprise Systems B2B Middleware and Tech Analytics Social Apps 100’s of Connectors, For Any Data Source
  • 49. 49 © Informatica. Proprietary and Confidential. Informatica for RedShift Overview ETL (1, 2, 3) 1. Bulk Source Data Ingestion 2. Multi-part load into S3 of compressed files 3. Copy S3 data into RedShift Staging ELT (4, 5, 6) SQL Pushdown for RedShift to RedShift Table Integrations within same RedShift Cluster Redshift Staging Amazon S3 1 2 3 4 5 Redshift Intermediate Redshift Analytics 6 4 5 Same RedShift Cluster
  • 50. 50 © Informatica. Proprietary and Confidential. Optimized Data Ingestion into RedShift 1. Source Bulk Data Loader 2. Partitions - parallel data pipelines 3. Local staging files 4. S3 Parallel Upload 5. Copy Command to RedShift
  • 51. 51 © Informatica. Proprietary and Confidential. Batching of Reads • Reduce I/O Trips • Default 2 Million rows • 3x improvement by increasing batch size to 16 Million rows for TPC-H Schema Key Range Partitioning • 4x Performance Improvement with 16 partitions for TPC-H Schema • Optimal when partition keys are same as RedShift Distribution Keys Bulk APIs and Batch Rows for Fast Ingestion Key Range Partitioning Optimized Data Ingestion – Batching & Partitioning
  • 52. 52 © Informatica. Proprietary and Confidential. Informatica for Amazon Redshift: We Have Your Back! Robust Comprehensive  Partitioning  SQL Pushdown  Optimized Lookups  Multi-part Upload & Download  Compression before S3 Upload Flexible  AWS KMS Support  IAM Roles  Client & Server Side Encryption  S3 VPC Endpoint Secure  Error management, Notifications, & Alerts  Error Tables or S3 Buckets for repair  Auto-handle special characters  Dynamically create targets  PRIMARY, FOREIGN, & UNIQUE Keys  SORT & DISTRIBUTION keys  ANALYZE & VACUUM options  Configurable S3 Copy Options  Pre and Post SQL  SQL Overrides  S3 data retention policies  AWS Multi-Region support  Secure Agent on premise  Informatica Hosted agent  Agent on AWS  Configurable S3 Copy Options  Dynamic S3 Buckets High Performance
  • 53. 53 © Informatica. Proprietary and Confidential. Informatica Products on AWS Power Center Informatica Cloud Big Data Management Enterprise Informatica Catalog Informatica Cloud Intelligent Data Lake Informatica Data Quality Enterprise Informatica Catalog Power Center Master Data Management Big Data Management Informatica Data Quality Certified Available
  • 54. 54 Try & Buy Informatica Cloud on AWS Marketplace http://infa.media/AWSRedshift Easy Deployment from AWS Marketplace Flexible pricing options • Hourly Pricing (PAYG) • Contract Pricing (Annual Subscription)
  • 55. 55 PowerCenter on AWS Marketplace PowerCenter Quick Start on AWS PowerCenter on AWS Marketplace
  • 56. 56 Informatica Cloud/iPaaS Proven Success 1 T+ Records/month 120% growth YoY 3M+ Integrations/day 200%+ growth YoY 7000+ Customers Connecting 100,000 applications, DBs and other end points 100+ OEMS 150+ iPaaS Connectors >300% Growth of API volume in CY16
  • 58. Informatica’s Comprehensive Solution Intelligent Data Platform ACLOUD REAL TIME/ STREAMING BIG DATA TRADITIONAL DATA INTEGRATION BIG DATA MANAGEMENT MASTER DATA MANAGEMENT DATA QUALITY DATA SECURITY CLOUD DATA MANAGEMENT Products Solutions MONITOR AND MANAGE CONNECTIVITY COMPUTE Enterprise Cloud Data Management CUSTOMER 360 DATA GOVERNANCE REFERENCE 360 INTELLIGEN T DATA LAKE SECURE@SOURC E PRODUCT 360 ENTERPRISE INFORMATION CATALOG SUPPLIER 360 (ENTERPRISE UNIFIED METADATA INTELLIGENCE)
  • 60. 60 © Informatica. Proprietary and Confidential. Learn more….. Learn & Prepare • Cloud Analytics with Informatica Cloud & Amazon Redshift • PowerCenter on AWS • Data Lakes on AWS Get Started on AWS MarketplaceDeep-Dive
  • 62. Q&A

Hinweis der Redaktion

  1. The main goal of this slide is to show platform completeness Key talking points: 1/ Any big data application has a data acquisition phase, a storage need, and an analytics need 2/ Quick Service overviews. Go fast; especially on ones we talk about later. Collect Direct Connect – private, low latency connections between your data centers and ours. Most customers use a pair for redundancy and availability Import/Export – for moving large volumes of data, fedex is your highest bandwidth option. With Snowball, we’ll ship you a ruggedized case, load it up, send it back Kinesis – real time streaming data; has streams for custom apps, firehose for easy Redshift/s3 integration, Analytics for real time SQL AWS IoT platform – complete suite for IoT devices to make it easy to manage them and get telemetry data into AWS Store S3 is the foundation of any big data app on AWS. Scalable, low cost, default landing zone for data ($0.023/GB-Mo and drops from there with scale. That’s $23/TB for a month, less than $300 for a year) Glacier is the sister service for cold storage like data you need for compliance. Age data into it using lifecycle. $0.004/GB-Mo, or $48 per TB per year! DynamoDB – NoSQL store; zero admin; JSON + Key Value with single digit millisecond latency. Great for high concurrency reads and writes Elasticsearch – managed elasticsearch clusters for operational intelligence and search Analyze EMR for fully managed dynamic clusters for running Hadoop/Spark/Presto/HBase Athena for interactive queries on S3 Data using Standard SQL with no infrastructure to manage Redshift – fully managed, petabyte scale DW for $1,000/TB/year ML – fully managed machine learning EC2 – run anything you want that runs on Linux or windows QuickSight for fast, cost-effective BI Lambda – serverless compute for event driven computing And rounding all this out, we have DMS for migrating databases and replicating OLTP to Redshift and Data Pipeline for scheduling and orchestration
  2. Fully Managed – We provision, backup, patch and monitor so you can focus on your data Fast – Massively Parallel Processing and columnar architecture for fast queries and parallel loads Nasdaq security – Ingests 5B rows/trading day, analyzes orders, trades and quotes to protect traders, report activity and develop their marketplaces NTT Docomo - Redshift is NTT Docomo's primary analytics platform for data science and marketing and logistic analytics. Data is pre-processed on premises and loaded into a massive, multi-petabyte data warehouse on Amazon Redshift, which data scientists use as their primary analytics platform.
  3. Pinterest uses Redshift for interactive data analysis. Redshift is used to store all web event data and uses for KPIs, recommendations and A/B experimentation. Lyft uses Redshiftfor ride analytics across the world (rides / location data ) - Through analysis, company engineers estimated that up to 90% of rides during peak times had similar routes. This led to the introduction of Lyft Line – a service that allows customers to save up to 60% by carpooling with others who are going in the same direction. Yelp has multiple deployments of RedShift with different data sets in use by product management, sales analytics, ads, SeatMe (Point of sale analytics) and many other teams. Analyzes 0s of millions of ads/day, 250M mobile events/day, ad campaign performance and new feature usage
  4. Accenture Insights Platform (AIP) is a scalable, on-demand, globally available analytics solution running on Amazon Redshift. AIP is Accenture's foundation for its big data offering to deliver analytics applications for healthcare and financial services.
  5. Mission critical customers stories – Grabtaxi and reiterate FINRA, Nasdaq
  6. Summary – in 14 regions, adding 3 more
  7. Optimum was formerly called Cablevision
  8. http://hq.vevo.com/vevo-data-science/
  9. Mention that customers can leverage AWS KMS in addition to their HSMs
  10. Customers moving from using traditional databases like Oracle or MSSQL and MPP Data Warehouses to RedShift Extend: Easy to meet business demands; Easy to provision, manage, and maintain Variety of data formats – AVRO, PARQUET, JSON that are not handled well by traditional data warehouses Migrate Cost is prohibitive – cloud data warehouse like RedShift is cheaper, easy to scale as you need Analyze data without storage constraints SQL on Hadoop SQL – it is all SQL RedShift Spectrum – keep the data on S3 and analyze in real time with RedShift
  11. AWS for core infrastructure Adminsitrative system (migrating to Salesforce over next year) Multiple Prod Environments—each new release, ensure stability Reporting Copy + Archive for Historical Data ETL Reviewed several options Picked Informatica to align with HU, HBS data mgmt architecture Use Cloud version Redshift for EDW Aligns to cloud, AWS Tableau for Reporting
  12. AWS for core infrastructure Adminsitrative system (migrating to Salesforce over next year) Multiple Prod Environments—each new release, ensure stability Reporting Copy + Archive for Historical Data ETL Reviewed several options Picked Informatica to align with HU, HBS data mgmt architecture Use Cloud version Redshift for EDW Aligns to cloud, AWS Tableau for Reporting
  13. Here are some sample wins in the AWS ecosystem. Asurion Competition: Engine and data systems specific hand coding and 35 points solutions. Why we won: Most comprehensive end to end hybrid data management solution. Extensive connectivity: shielded team from maintaining deep expertise for fast changing technologies and data systems. Reuse visual mappings across multiple engines and systems, instead of hand-coding for each. Intuitive and consistent UI for improved productivity. Automation to enable thousands of data pipelines. Expected benefits: Single & unified view of authorized, cleansed & standardized business data. Data available anytime, anywhere in any format. Increase ROI and save $150K/year in development cost by using Infa instead of manual coding such as SSIS. Predictive analytics to improve mobile customer engagement and loyalty.