Weitere ähnliche Inhalte
Ähnlich wie Sujal and scott fina lb (20)
Kürzlich hochgeladen (20)
Sujal and scott fina lb
- 1. EMC
Las Vegas 2011
WORLD
© Copyright 2010 EMC Corporation. All rights reserved. 1
- 2. Big Data,
Big Opportunity
Sujal Patel
President, Isilon Storage Division
May 9, 2011
© Copyright 2011 EMC Corporation. All rights reserved. 2
- 3. !!! !!!
“Big Data Is Less
About Size, And
More About
Freedom” ―Techcrunch !!!
THE ERA OF !!!
“Findings: „Big
BIG DATA !!! Data‟ Is More
Extreme Than
Volume” ― Gartner
“Big Data! It‟s
Real, It‟s Real-
time, and It‟s
IS HERE…
“Total data: „bigger‟ Already Changing
than big data” ―IDB
!!! Your World”
!!! ― 451 Group !!!
© Copyright 2011 EMC Corporation. All rights reserved. 3
- 4. The EMC Big Data “Stack”
4 Collaborative Act
EMC Documentum xCP
?
3 Real Time Analyze
EMC Greenplum + Hadoop
2 Structured &
Unstructured
1 Petabyte
Scale Store
EMC Isilon + Atmos
© Copyright 2011 EMC Corporation. All rights reserved. 4
- 5. Big Data Is Changing Enterprise Storage
90
80
70
60
50
Big
40
Data 30
Sources 20
10
0
2009 2010 2011 2012 2013 2014
File Based: 60.7% CAGR Block Based: 21.8% CAGR
By 2012, 80% of all storage capacity sold will be for file-based data
Source: IDC
© Copyright 2011 EMC Corporation. All rights reserved. 5
- 6. Scale-UP Architectures NOT Ideal for Big Data
Server
Scalability
Network
Performance
Management
Availability
Cost
Storage
© Copyright 2011 EMC Corporation. All rights reserved. 6
- 7. Server
Scalability
Network
Performance
Management
Availability
Cost
Storage
© Copyright 2011 EMC Corporation. All rights reserved. 7
- 8. It‟s Day and Night Different
Scale-Out Scale-Up
Shared Nothing Shared Storage
Automated Manual
Linear Scalability Performance Bottlenecks
Operational Efficiency Increasing Complexity
© Copyright 2011 EMC Corporation. All rights reserved. 8
- 10. Big Data Requires:
Tremendous Scalability of
Capacity and Performance.
© Copyright 2011 EMC Corporation. All rights reserved. 10
- 11. VIDEO
144-Node Cluster Build
© Copyright 2011 EMC Corporation. All rights reserved. 11
- 13. Isilon Scale-out NAS Product Portfolio
APPLICATION
SOFTWARE
HARDWARE
ACCELERATOR ISILON
S-SERIES X-SERIES NL-SERIES & BACKUP EX NODES
ACCELERATOR
OPERATING
SYSTEM
© Copyright 2011 EMC Corporation. All rights reserved. 13
- 15. Isilon Solutions For…
• • • •
• • •
•
• •
•
• • •
© Copyright 2011 EMC Corporation. All rights reserved. 15
- 17. Why Isilon?
Because Big Data Demands New Thinking
Product Families Purpose-built to Optimize
for IOps, Throughput and/or $/TB.
Record-breaking Scaling of Capacity and
Performance.
Remarkable Simplicity.
© Copyright 2011 EMC Corporation. All rights reserved. 17
- 18. EMC
Las Vegas 2011
WORLD
© Copyright 2010 EMC Corporation. All rights reserved. 18
- 19. Big Data, Big
Opportunity
Scott Yara
Co-founder Greenplum and VP of
Products, EMC Data Computing
Division
May 9, 2011
© Copyright 2011 EMC Corporation. All rights reserved. 19
- 20. Background:
Greenplum Joins EMC in July, 2010…
J U LY 2 0 1 0 - E M C A C Q U I R E S G R E E N P L U M
“For three years, Gartner has identified Greenplum as
the most advanced vendor in the visionary
quadrant of its data warehouse DBMS Magic Quadrant….”
– Gartner
© Copyright 2011 EMC Corporation. All rights reserved. 20
- 21. Background: EMC + Greenplum,
A Fast Track to Innovation
• EMC Leverage:
– Established new Data Computing Products Division
• Reports directly to Pat Gelsinger, President & COO
– Investing to grow from 150 employees to +600 in 2011
• Increase R&D organization by more than 3X
– Launched new Greenplum Data Computing Appliance
• Built by EMC manufacturing, single-call support
• Simple integration with complimentary EMC products
• Available globally, serviced by EMC
– Established joint R&D with VMware around Enterprise Data Cloud
– Building disruptive solutions with EMC‟s global, tier-1 partners
© Copyright 2011 EMC Corporation. All rights reserved. 21
- 22. Current Success and Market Momentum
• Leaders Quadrant in Gartner
DW 2011
• Mission critical deployments
across multiple industries
• Installations from small (TBs)
to very large (PBs)
• Scalable analytics platform
to complement EDW
•22
© Copyright 2011 EMC Corporation. All rights reserved. 22
- 23. To make step-function
changes, revolutionary
changes, seems to take a very
unique combination
of timing, technology, talent…and
luck to make significant change in
our industry. It hasn't happened
that often.
Steve Jobs, 1995
© Copyright 2011 EMC Corporation. All rights reserved. 23
- 24. Data Is A REVOLUTIONARY Change
PERSONAL INTERNET DATA
COMPUTER
© Copyright 2011 EMC Corporation. All rights reserved. 24
- 25. New Realities. The New Normal.
Medical
Government Internet
Data
Collectors
Phone/TV Retail
Financial
© Copyright 2011 EMC Corporation. All rights reserved. 25
- 26. New Realities. The New Normal.
Data
Devices
Medical
Internet
Government
Law Enforcement
Data Public Education
Collectors
Phone/TV Retail
Financial
© Copyright 2011 EMC Corporation. All rights reserved. 26
- 27. New Realities. The New Normal.
Data
Devices
Analytic Medical Information
Advertising
Services Brokers
Government Internet
Data Websites
Data
Collectors Aggregators
Phone/TV Retail Catalog
Co-Ops
Media Credit List
Archives Bureaus Financial Brokers
© Copyright 2011 EMC Corporation. All rights reserved. 27
- 28. New Realities. The New Normal.
Data
Devices
Individual
Analytic Medical Information
Advertising Marketers Employers
Services Brokers
Law
Enforcement
Government Internet
Data Websites
Data
Collectors Aggregators
Data
Users/Buyers
Catalog
Co-Ops
Media
Phone/TV Retail
Private
Media Credit List
Financial Delivery Investigators
Archives Bureaus Brokers
Banks Service /Lawyers
Government
© Copyright 2011 EMC Corporation. All rights reserved. 28
- 29. But Why Now?
Convergenc
Web e
Innovation
(aka “cloud”)
Networking
X86
Virtualization
Storage
Time
© Copyright 2011 EMC Corporation. All rights reserved. 29
- 30. Processing data is now
100-1000X
FASTER AND CHEAPER
© Copyright 2011 EMC Corporation. All rights reserved. 30
- 31. What do we need?
© Copyright 2011 EMC Corporation. All rights reserved. 31
- 32. We Need…
Data Scientists
Innovation
Community
And a complete big data analytics stack
© Copyright 2011 EMC Corporation. All rights reserved. 32
- 34. EMC HADOOP
Unstructured.
Real-time.
Enterprise-Ready.
© Copyright 2011 EMC Corporation. All rights reserved. 34
- 35. What Is Hadoop?
• Apache Hadoop is an open-source technology inspired
by Google‟s MapReduce and Google File System
papers
• It is a software framework that supports data-intensive
distributed applications and is effective for analyzing
and storing massive amounts of data
• Leading internet companies like
Yahoo!, Facebook, eHarmony, Twitter, and eBay, have
pioneered the use of Hadoop
© Copyright 2011 EMC Corporation. All rights reserved. 35
- 36. Greenplum HD Product Family
• Greenplum HD Community Edition:
– Certified Full-Stack, 100% Open Source
– Virtual Machine Appliance
– All core feature development contributed back to Apache Hadoop
• Greenplum HD Enterprise Edition:
– Differentiated, hybrid distribution, advanced features
– Integrated, tested, hardened
– 100% Hadoop, HBase, HDFS API compatible
• Greenplum HD Data Computing Appliance:
– Optimized appliance configuration
– Eliminates complexity, simplifies deployment and management
– Seamless integration with Greenplum Database
© Copyright 2011 EMC Corporation. All rights reserved. 36
- 37. Greenplum HD Innovations
Major Technical Innovations for Hadoop
Real-time
Pluggable I/O Fault-Tolerance
Processing
• Isilon OneFS • Low latency • Eliminate SPOF
read/write operations for Name-Node
• Atmos
• Realtime data • Job Tracker and
• Cassandra interaction and other key
• MapR analytic processing components
• Enables greater • Integration with
efficiency and Cassandra and
performance MapR
© Copyright 2011 EMC Corporation. All rights reserved. 37
- 38. GREENPLUM HD
DATA COMPUTING
APPLIANCE
The Powerful Combination
of Greenplum Database
and Apache Hadoop
© Copyright 2011 EMC Corporation. All rights reserved. 38
- 39. Building a Complete Big Data Analytics Stack
Analytic Toolsets
(Business Analytics, BI, Statistics, etc.)
Greenplum Chorus
Enterprise Collaboration Platform for Data
Greenplum Data Computing Appliances
Purpose-built for Big Data Analytics
Greenplum Database Greenplum HD
Enterprise & Community Editions Hadoop Enterprise & Community Editions
World‟s Most Scalable MPP Database Platform Enterprise Analytics Platform for Unstructured
Data
© Copyright 2011 EMC Corporation. All rights reserved. 39
- 40. Celebrating Big
Data Innovators
www.DataHeroAwards.com
© Copyright 2011 EMC Corporation. All rights reserved. 40
- 41. Data Hero Award Winners
Silver Spring Networks – Energy Category
© Copyright 2011 EMC Corporation. All rights reserved. 41
- 42. Data Hero Award Winners
Broad Institute of MIT and Harvard – Life Sciences Category
© Copyright 2011 EMC Corporation. All rights reserved. 42
- 43. Data Hero Award Winners
Vivek Kundra, U.S. CIO – Visionary Award
© Copyright 2011 EMC Corporation. All rights reserved. 43
- 45. Big Data = Big Opportunity
© Copyright 2011 EMC Corporation. All rights reserved. 45
- 47. EMC
Las Vegas 2011
WORLD
© Copyright 2011 EMC Corporation. All rights reserved. 47