SlideShare ist ein Scribd-Unternehmen logo
1 von 37
RRain
1
Examine the Real Cost of Storing &
Analyzing your Big Data
Speakers
2
John Mallory
CTO - Analytics, EMC Isilon
Jyothi Swaroop
Director Product Marketing &
Alliances
Structured vs. Unstructured Data Growth
Total Capacity Shipped, Worldwide Unstructured Data
80%
74%
67%
71 EB 133 EB37 EB
2013 2015 2017
Source: IDC
Hadoop – “New Gateway Drug to Big Data”
4
Mature Platform Adoption Speed-up Enterprise Solutions
NAS
SAN CLOUD
TAPE
DAS
OBJECT
TRADITIONAL WORKLOADS EMERGING WORKLOADS
5© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved.
VALUE?
Cost of Storing Big Data - TCO
6
Source: Winter Corp Report: Big Data – What Does it Really Cost? 2014
7
Cost of Storing Big Data – 5 yrs
Source: Winter Corp Report: Big Data – What Does it Really Cost?
Traditional
(Row/ Columnar) Data
Warehouse
TB 10TB 200TB PB
Low Cost to Scale
QueryResponse
Hrs
Mins
Secs
Hadoop
Big Data – Cost to Scale vs. Performance
8
 Big Data Volume (50TB - PB)
 Fast Data Load & Massive Scale
 Fast Query Across Large Scale
 Flexible Deployment Options
??
NAS
SAN
TAPE OBJECT
CLOUD
DAS
RainStor-
Isilon Active
Archive
TRADITIONAL WORKLOADS EMERGING WORKLOADS
9
RainStor®
10
Derive Business Value from Your Historical Data
and Meet Regulatory Demands.
The Data Archive
RainStor® - Proven
11
20of
World’s Largest
Communications
Providers
15Strategic
Solution &
Technology
Partners
10of
World’s Biggest
Banks & Financial
Institutions
EMC Isilon Scale-Out NAS Environment
Clients and Applications
RESTful API
GET PUT POST DELETE
Gig-e
10 Gig-e
Network
OneFS Operating
Environment
Multi-Protocol
Client/Application
Layer
Ethernet Layer
Protocols
SMBNFS
FTPHTTP
HDFS
for
Hadoop
REST
for
Object
Intra-cluster
Communication
12
EMC Isilon - Industry Recognition
Isilon Systems is a successful acquisition for EMC
IDC Marketscape names EMC Isilon a Leader
in Scale-Out File Storage Market
- Worldwide Scale-Out File-Based Storage, December 2012
- Critical Capabilities for Scale-Out File System Storage, January 2013
EMC Isilon “Outstanding” in Critical Capabilities
for Scale-Out File
- Vendor Rating – EMC, May 2014
13
14
Solutions
15
Solutions:
Analytical Archive | Compliance Archive
(DW Offload) (Tape Avoidance)
Teradata
Netezza
Oracle Ex
Sybase IQ
 Data In
 Store
 Query
 Govern
 Data In
 Store
 Query
 Govern
 Comply
 WORM
SEC 17a-4; Dodd Frank
Source App
EDW
DB
Tape
Analytical Archive: End-to-end
16
QUERY/
ANALYZE
SQL
BI Tools; Hive,
MapReduce
SCALE – EMC Isilon
COMPRESSLOAD/
VALIDATE
Billions
Records/Day
10-40X
(90%+)
AVAILABILITY
Replication
DW
Source
Move
RETAIN
/DISPOSE
Rules
Based
IN STORE QUERY GOVERN
SECURE - Enterprise-grade
Database Storage - Compression: Up to 40X
Source: Ratios vs. Raw – RainStor Benchmarks using customer data (2012-13)
3X
0
5
10
15
20
25
30
35
40
45
50
6X
40X
8X
Hadoop LZO Compressed
Relational
(e.g. Oracle)
Flatfile
Gzip
Columnar
(e.g. Vertica)
RainStor
7X
17
Simplicity and Ease of Use
 Single volume and file system that spans nodes
– Directories and files striped across the cluster
 Automation:
– NO manual intervention
– NO reconfiguration
– NO server or client mount point or
application changes
– NO data migrations
– NO RAID
EFFICIENCY
18
More scalable than traditional storage systems
Largest and Most Scalable File System
OneFS scales from 18 TB to 20 PB in a single file system,
single volume
 Under 1 min to scale
with no downtime
Document Query
XQUERY
Query - Pick the Best Tool for the Job
20
BI AnalyticsAd-Hoc Query
Interactive
SQL-92
SQL 2013
BI TOOLS
DASHBOARD
Hadoop Tools
Hadoop on Scale-out NAS
MAPREDUCE
PIG, HIVE
Hadoop & Big Data
21
LOW VALUE DATA
 Recommendation Engines
 Data Sandboxing
 Log Processing
 Audits
 Regulatory Reporting (Eg. SEC, SOX)
 Lawful Intercept
 Social Media
 Logs
 Clickstreams
 Credit Card
 Trade
 Personal Information
HIGH VALUE DATA
SECURITY?
22
Security Capabilities & Features
Secure Large Volumes of Data on Hadoop
 Data Encryption
 Data Masking
 ViewsPrivacy
 Kerberos Authentication
 Authorization
 LDAP / Active Directory
 Linux PAM Support
Trust
 Tamper-proofing
 Audit Trail
 Record-level Delete
 Data Disposition
Integrity
RainStor-Isilon Architecture Overview
23
Apache Projects RainStor
Programming
Languages
Computation
Security
Database Storage
Object/Hardware
Storage
Vendor Specific
Top of Stack
Standard SQL
(with Oracle,
SQLServer, SybaseIQ
extensions)
Security and Compliance
(Encryption, Masking, Audit Trail, Data Disposition,
Kerberos, LDAP/Active Directory, Immutable)
RainStor Database
(up to 40X Data Compression)
HDFS
(Hadoop Distributed File System)
MapReduce – Batch
(Distributed Programming Framework)
Hive Pig Java
NAS, SAN, CAS, NFS
(On-premise, Cloud)
BI Tools, Dashboards
(ODBC/JDBC Connectivity)
Visualization Layer
EMC Isilon
RainStor: Hadoop 2.0 Distro Certifications
 Cloudera CDH 5.0
– Certified April 2014
 Hortonworks HDP 2.1
– April 2014
“We are delighted with the wide range of technology solution partners that have
certified on CDH 5 …it is testament to the maturity of the platform but also the overall
market demand,”
Tim Stevens, VP of Business & Corporate Development
25
Solution
Compliance Archive
SEC 17a-4(f) Compliance Archive Requirements
26
Records stored in non-erasable media (WORM)
Recording process must be verifiable
Fully Accessible to Authorities & Backed-up
Records should be Recognizable & Identifiable
Downloadable to any acceptable medium
27
Case Studies
28
Challenges
 Cost: Data volumes in disparate trading
applications growing at 70-100% / Year - Storage
costs rising @ 60% / Year
 Compliance: Must provide high performance EBS
and other queries for SEC
Solution
 A RainStor Archive for storing and reporting
against historical trade data
 13 years of history loaded from Sybase IQ
 Daily feed from trading application to RainStor
 Runs on low-cost NAS Tier 3 storage and VMs
 RainStor completely replaced Sybase IQ
 90% cost savings - $5MM ROI
 6 Projects live - 13 more in Progress
90%
Storage Cost
Reduction
“ It’s like shrink-wrapping your
data…forever!”
– VP, Technology
 30X Data Compression
 3X Faster Query Compared to Sybase
CONFIDENTIAL
Compliance Archiving: Global Investment Bank
Lower Compliant Data Retention Costs by a Factor of 10
BENEFITS
Enterprise Standard for Data Retention with Faster Analytics
Analytical Archiving : Large Multi-national Bank
Retain Trading Data, Stay Compliant at Lowest Cost
RainStor
Active Archive
Equities
BAR
400TB
FastForward™
29
FastConnect™
Trades
200TB
CONFIDENTIAL
EMC WORM
Storage
 25X Compression
 Meets Query SLAs
BENEFITS
Enterprise Standard for Compliance Driven Analysis
 Runs on EMC Centera & Isilon (WORM)
 Tape Avoidance
Challenges
 Cost: Fast data growth and Costly EDW’s
(Teradata & Netezza) - offload history
 Compliance: Must meet SEC compliance and
retain equities data for query - run on
approved WORM / CAS Storage (EMC)
 Avoid data on offline tape - reinstate older
Teradata data (BAR) and stay compliant.
Solution
 43 Equities apps (Oracle; SQL Server) offload
history to RS
 History offload from Netezza - run on WORM
 Re-instate Tape and bring online for audits. 43 Apps
RainStor + Isilon + Hadoop – TCO
Compression rate 32X (>96% cost savings)
Utilization Rate >80%
Scalability Up to 20 PB per cluster
Query Performance >= Hadoop on DAS
RainStor + Hadoop + Isilon =
Lowest 5yr TCO!
Why RainStor-Isilon?
31
Flexible
Architecture –
Hadoop, Cloud
Extract EDW
data for Active
Archiving
Lower Storage
Costs by at
least 90%
Gain Deeper
Insights – SQL,
Hive, Pig,
Search, BI tools
Reliable –
High
Availability,
Disaster
Recovery
Purpose-built
Security and
Compliance
features
First SQL Compatible,
Enterprise-grade Database
(native to Hadoop) to run on
Isilon Scale-out NAS.
Thank You
jyothi.swaroop@rainstor.com
32
The Active
Archive
33
Where Big Data & Archive Come Together
Network EDWApps TapePlatforms
RainStor – EMC Isilon Solution
RainStor for Teradata Solution - 3 Components
34
FastForward ™
 Reinstates from Offline
Tape Archives
 Handles V2R4, V2R5,
V2R6, TD12, TD13
 Eliminate Tape.
FastConnect™
 Offload history to
Active Archive on
continuous basis.
 Run on Hadoop for
Low Cost Scale.
RainStor Core Database:
• Highly Efficient Data Store - 20-40X Compression.
35
36
Next Steps
 Contact RainStor to find out
more about the joint solution:
info@rainstor.com
 Contact EMC to find out more:
CONFIDENTIAL37

Weitere ähnliche Inhalte

Was ist angesagt?

Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloud
DataWorks Summit
 
Data Orchestration Platform for the Cloud
Data Orchestration Platform for the CloudData Orchestration Platform for the Cloud
Data Orchestration Platform for the Cloud
Alluxio, Inc.
 

Was ist angesagt? (20)

Breakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data StoreBreakout: Hadoop and the Operational Data Store
Breakout: Hadoop and the Operational Data Store
 
Apache Spark Workshop at Hadoop Summit
Apache Spark Workshop at Hadoop SummitApache Spark Workshop at Hadoop Summit
Apache Spark Workshop at Hadoop Summit
 
Keys for Success from Streams to Queries
Keys for Success from Streams to QueriesKeys for Success from Streams to Queries
Keys for Success from Streams to Queries
 
Insights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesInsights into Real World Data Management Challenges
Insights into Real World Data Management Challenges
 
Accelerate Oil & Gas Discovery
Accelerate Oil & Gas DiscoveryAccelerate Oil & Gas Discovery
Accelerate Oil & Gas Discovery
 
Data-In-Motion Unleashed
Data-In-Motion UnleashedData-In-Motion Unleashed
Data-In-Motion Unleashed
 
Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS Enabling big data & AI workloads on the object store at DBS
Enabling big data & AI workloads on the object store at DBS
 
FAQ on Dedupe NetApp
FAQ on Dedupe NetAppFAQ on Dedupe NetApp
FAQ on Dedupe NetApp
 
Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloud
 
Admiral Group
Admiral GroupAdmiral Group
Admiral Group
 
Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...
Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...
Hortonworks Data Platform and IBM Systems - A Complete Solution for Cognitive...
 
Accelerating Big Data Insights
Accelerating Big Data InsightsAccelerating Big Data Insights
Accelerating Big Data Insights
 
Data Orchestration Platform for the Cloud
Data Orchestration Platform for the CloudData Orchestration Platform for the Cloud
Data Orchestration Platform for the Cloud
 
Times ten 18.1_overview_meetup
Times ten 18.1_overview_meetupTimes ten 18.1_overview_meetup
Times ten 18.1_overview_meetup
 
Bridging the gap: achieving fast data synchronization from SAP HANA by levera...
Bridging the gap: achieving fast data synchronization from SAP HANA by levera...Bridging the gap: achieving fast data synchronization from SAP HANA by levera...
Bridging the gap: achieving fast data synchronization from SAP HANA by levera...
 
Panasas® activestor® and ansys
Panasas® activestor® and ansysPanasas® activestor® and ansys
Panasas® activestor® and ansys
 
Hadoop in the Cloud – The What, Why and How from the Experts
Hadoop in the Cloud – The What, Why and How from the ExpertsHadoop in the Cloud – The What, Why and How from the Experts
Hadoop in the Cloud – The What, Why and How from the Experts
 
Big Data Platform Industrialization
Big Data Platform Industrialization Big Data Platform Industrialization
Big Data Platform Industrialization
 
Dave Shuttleworth - Platform performance comparisons, bare metal and cloud ho...
Dave Shuttleworth - Platform performance comparisons, bare metal and cloud ho...Dave Shuttleworth - Platform performance comparisons, bare metal and cloud ho...
Dave Shuttleworth - Platform performance comparisons, bare metal and cloud ho...
 
Panasas ® California Institute of Technology Success Story
Panasas ® California Institute of Technology Success StoryPanasas ® California Institute of Technology Success Story
Panasas ® California Institute of Technology Success Story
 

Ähnlich wie Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your Most Important Data cost_webcast_final[1]

Redis_Labs_Redis_on_Flash_on_Power8_-_INAF_Italy_-_June_2015.pptx
Redis_Labs_Redis_on_Flash_on_Power8_-_INAF_Italy_-_June_2015.pptxRedis_Labs_Redis_on_Flash_on_Power8_-_INAF_Italy_-_June_2015.pptx
Redis_Labs_Redis_on_Flash_on_Power8_-_INAF_Italy_-_June_2015.pptx
YouTubeVideos11
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
StampedeCon
 
20150704 benchmark and user experience in sahara weiting
20150704 benchmark and user experience in sahara weiting20150704 benchmark and user experience in sahara weiting
20150704 benchmark and user experience in sahara weiting
Wei Ting Chen
 
Track 2, session 4, data protection and disaster recovery with riverbed
Track 2, session 4, data protection and disaster recovery with riverbedTrack 2, session 4, data protection and disaster recovery with riverbed
Track 2, session 4, data protection and disaster recovery with riverbed
EMC Forum India
 

Ähnlich wie Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your Most Important Data cost_webcast_final[1] (20)

The Most Trusted In-Memory database in the world- Altibase
The Most Trusted In-Memory database in the world- AltibaseThe Most Trusted In-Memory database in the world- Altibase
The Most Trusted In-Memory database in the world- Altibase
 
From limited Hadoop compute capacity to increased data scientist efficiency
From limited Hadoop compute capacity to increased data scientist efficiencyFrom limited Hadoop compute capacity to increased data scientist efficiency
From limited Hadoop compute capacity to increased data scientist efficiency
 
How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...How the Development Bank of Singapore solves on-prem compute capacity challen...
How the Development Bank of Singapore solves on-prem compute capacity challen...
 
2017 OpenWorld Keynote for Data Integration
2017 OpenWorld Keynote for Data Integration2017 OpenWorld Keynote for Data Integration
2017 OpenWorld Keynote for Data Integration
 
Redis_Labs_Redis_on_Flash_on_Power8_-_INAF_Italy_-_June_2015.pptx
Redis_Labs_Redis_on_Flash_on_Power8_-_INAF_Italy_-_June_2015.pptxRedis_Labs_Redis_on_Flash_on_Power8_-_INAF_Italy_-_June_2015.pptx
Redis_Labs_Redis_on_Flash_on_Power8_-_INAF_Italy_-_June_2015.pptx
 
Disaggregated Hadoop Stacks
Disaggregated Hadoop StacksDisaggregated Hadoop Stacks
Disaggregated Hadoop Stacks
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 
20150704 benchmark and user experience in sahara weiting
20150704 benchmark and user experience in sahara weiting20150704 benchmark and user experience in sahara weiting
20150704 benchmark and user experience in sahara weiting
 
Track 2, session 4, data protection and disaster recovery with riverbed
Track 2, session 4, data protection and disaster recovery with riverbedTrack 2, session 4, data protection and disaster recovery with riverbed
Track 2, session 4, data protection and disaster recovery with riverbed
 
Equinix Big Data Platform and Cassandra - A view into the journey
Equinix Big Data Platform and Cassandra - A view into the journeyEquinix Big Data Platform and Cassandra - A view into the journey
Equinix Big Data Platform and Cassandra - A view into the journey
 
HPE Hadoop Solutions - From use cases to proposal
HPE Hadoop Solutions - From use cases to proposalHPE Hadoop Solutions - From use cases to proposal
HPE Hadoop Solutions - From use cases to proposal
 
What it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! PerspectivesWhat it takes to run Hadoop at Scale: Yahoo! Perspectives
What it takes to run Hadoop at Scale: Yahoo! Perspectives
 
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
AWS Partner Webcast - Hadoop in the Cloud: Unlocking the Potential of Big Dat...
 
Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...
Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...
Hadoop Summit San Jose 2015: What it Takes to Run Hadoop at Scale Yahoo Persp...
 
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the CloudBring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
 
Exploring the Wider World of Big Data
Exploring the Wider World of Big DataExploring the Wider World of Big Data
Exploring the Wider World of Big Data
 
VMworld 2013: Beyond Mission Critical: Virtualizing Big-Data, Hadoop, HPC, Cl...
VMworld 2013: Beyond Mission Critical: Virtualizing Big-Data, Hadoop, HPC, Cl...VMworld 2013: Beyond Mission Critical: Virtualizing Big-Data, Hadoop, HPC, Cl...
VMworld 2013: Beyond Mission Critical: Virtualizing Big-Data, Hadoop, HPC, Cl...
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
 
Managing The Data Deluge By Optimizing Storage
Managing The Data Deluge By Optimizing StorageManaging The Data Deluge By Optimizing Storage
Managing The Data Deluge By Optimizing Storage
 
The Transformation of your Data in modern IT (Presented by DellEMC)
The Transformation of your Data in modern IT (Presented by DellEMC)The Transformation of your Data in modern IT (Presented by DellEMC)
The Transformation of your Data in modern IT (Presented by DellEMC)
 

Mehr von RainStor (6)

Archiving is a No-brainer - Bloor Analyst and RainStor Executive Discuss
Archiving is a No-brainer - Bloor Analyst and RainStor Executive DiscussArchiving is a No-brainer - Bloor Analyst and RainStor Executive Discuss
Archiving is a No-brainer - Bloor Analyst and RainStor Executive Discuss
 
Big Data Analytics on Hadoop RainStor Infographic
Big Data Analytics on Hadoop RainStor InfographicBig Data Analytics on Hadoop RainStor Infographic
Big Data Analytics on Hadoop RainStor Infographic
 
TDWI Checklist Report: Active Data Archiving
TDWI Checklist Report:  Active Data ArchivingTDWI Checklist Report:  Active Data Archiving
TDWI Checklist Report: Active Data Archiving
 
Smarter Management for Your Data Growth
Smarter Management for Your Data GrowthSmarter Management for Your Data Growth
Smarter Management for Your Data Growth
 
Big Data Retention Opportunity or Burden? - Featuring Merv Adrian July 2010
Big Data Retention Opportunity or Burden? - Featuring Merv Adrian July 2010Big Data Retention Opportunity or Burden? - Featuring Merv Adrian July 2010
Big Data Retention Opportunity or Burden? - Featuring Merv Adrian July 2010
 
RainStor 3.5 Overview
RainStor 3.5 OverviewRainStor 3.5 Overview
RainStor 3.5 Overview
 

Kürzlich hochgeladen

introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
VishalKumarJha10
 

Kürzlich hochgeladen (20)

A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...How to Choose the Right Laravel Development Partner in New York City_compress...
How to Choose the Right Laravel Development Partner in New York City_compress...
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 

Rain stor isilon_emc_real_Examine the Real Cost of Storing & Analyzing Your Most Important Data cost_webcast_final[1]

  • 1. RRain 1 Examine the Real Cost of Storing & Analyzing your Big Data
  • 2. Speakers 2 John Mallory CTO - Analytics, EMC Isilon Jyothi Swaroop Director Product Marketing & Alliances
  • 3. Structured vs. Unstructured Data Growth Total Capacity Shipped, Worldwide Unstructured Data 80% 74% 67% 71 EB 133 EB37 EB 2013 2015 2017 Source: IDC
  • 4. Hadoop – “New Gateway Drug to Big Data” 4 Mature Platform Adoption Speed-up Enterprise Solutions
  • 5. NAS SAN CLOUD TAPE DAS OBJECT TRADITIONAL WORKLOADS EMERGING WORKLOADS 5© Copyright 2014 EMC Corporation. All rights reserved.© Copyright 2014 EMC Corporation. All rights reserved. VALUE?
  • 6. Cost of Storing Big Data - TCO 6 Source: Winter Corp Report: Big Data – What Does it Really Cost? 2014
  • 7. 7 Cost of Storing Big Data – 5 yrs Source: Winter Corp Report: Big Data – What Does it Really Cost?
  • 8. Traditional (Row/ Columnar) Data Warehouse TB 10TB 200TB PB Low Cost to Scale QueryResponse Hrs Mins Secs Hadoop Big Data – Cost to Scale vs. Performance 8  Big Data Volume (50TB - PB)  Fast Data Load & Massive Scale  Fast Query Across Large Scale  Flexible Deployment Options ??
  • 10. RainStor® 10 Derive Business Value from Your Historical Data and Meet Regulatory Demands. The Data Archive
  • 11. RainStor® - Proven 11 20of World’s Largest Communications Providers 15Strategic Solution & Technology Partners 10of World’s Biggest Banks & Financial Institutions
  • 12. EMC Isilon Scale-Out NAS Environment Clients and Applications RESTful API GET PUT POST DELETE Gig-e 10 Gig-e Network OneFS Operating Environment Multi-Protocol Client/Application Layer Ethernet Layer Protocols SMBNFS FTPHTTP HDFS for Hadoop REST for Object Intra-cluster Communication 12
  • 13. EMC Isilon - Industry Recognition Isilon Systems is a successful acquisition for EMC IDC Marketscape names EMC Isilon a Leader in Scale-Out File Storage Market - Worldwide Scale-Out File-Based Storage, December 2012 - Critical Capabilities for Scale-Out File System Storage, January 2013 EMC Isilon “Outstanding” in Critical Capabilities for Scale-Out File - Vendor Rating – EMC, May 2014 13
  • 15. 15 Solutions: Analytical Archive | Compliance Archive (DW Offload) (Tape Avoidance) Teradata Netezza Oracle Ex Sybase IQ  Data In  Store  Query  Govern  Data In  Store  Query  Govern  Comply  WORM SEC 17a-4; Dodd Frank Source App EDW DB Tape
  • 16. Analytical Archive: End-to-end 16 QUERY/ ANALYZE SQL BI Tools; Hive, MapReduce SCALE – EMC Isilon COMPRESSLOAD/ VALIDATE Billions Records/Day 10-40X (90%+) AVAILABILITY Replication DW Source Move RETAIN /DISPOSE Rules Based IN STORE QUERY GOVERN SECURE - Enterprise-grade
  • 17. Database Storage - Compression: Up to 40X Source: Ratios vs. Raw – RainStor Benchmarks using customer data (2012-13) 3X 0 5 10 15 20 25 30 35 40 45 50 6X 40X 8X Hadoop LZO Compressed Relational (e.g. Oracle) Flatfile Gzip Columnar (e.g. Vertica) RainStor 7X 17
  • 18. Simplicity and Ease of Use  Single volume and file system that spans nodes – Directories and files striped across the cluster  Automation: – NO manual intervention – NO reconfiguration – NO server or client mount point or application changes – NO data migrations – NO RAID EFFICIENCY 18
  • 19. More scalable than traditional storage systems Largest and Most Scalable File System OneFS scales from 18 TB to 20 PB in a single file system, single volume  Under 1 min to scale with no downtime
  • 20. Document Query XQUERY Query - Pick the Best Tool for the Job 20 BI AnalyticsAd-Hoc Query Interactive SQL-92 SQL 2013 BI TOOLS DASHBOARD Hadoop Tools Hadoop on Scale-out NAS MAPREDUCE PIG, HIVE
  • 21. Hadoop & Big Data 21 LOW VALUE DATA  Recommendation Engines  Data Sandboxing  Log Processing  Audits  Regulatory Reporting (Eg. SEC, SOX)  Lawful Intercept  Social Media  Logs  Clickstreams  Credit Card  Trade  Personal Information HIGH VALUE DATA SECURITY?
  • 22. 22 Security Capabilities & Features Secure Large Volumes of Data on Hadoop  Data Encryption  Data Masking  ViewsPrivacy  Kerberos Authentication  Authorization  LDAP / Active Directory  Linux PAM Support Trust  Tamper-proofing  Audit Trail  Record-level Delete  Data Disposition Integrity
  • 23. RainStor-Isilon Architecture Overview 23 Apache Projects RainStor Programming Languages Computation Security Database Storage Object/Hardware Storage Vendor Specific Top of Stack Standard SQL (with Oracle, SQLServer, SybaseIQ extensions) Security and Compliance (Encryption, Masking, Audit Trail, Data Disposition, Kerberos, LDAP/Active Directory, Immutable) RainStor Database (up to 40X Data Compression) HDFS (Hadoop Distributed File System) MapReduce – Batch (Distributed Programming Framework) Hive Pig Java NAS, SAN, CAS, NFS (On-premise, Cloud) BI Tools, Dashboards (ODBC/JDBC Connectivity) Visualization Layer EMC Isilon
  • 24. RainStor: Hadoop 2.0 Distro Certifications  Cloudera CDH 5.0 – Certified April 2014  Hortonworks HDP 2.1 – April 2014 “We are delighted with the wide range of technology solution partners that have certified on CDH 5 …it is testament to the maturity of the platform but also the overall market demand,” Tim Stevens, VP of Business & Corporate Development
  • 26. SEC 17a-4(f) Compliance Archive Requirements 26 Records stored in non-erasable media (WORM) Recording process must be verifiable Fully Accessible to Authorities & Backed-up Records should be Recognizable & Identifiable Downloadable to any acceptable medium
  • 28. 28 Challenges  Cost: Data volumes in disparate trading applications growing at 70-100% / Year - Storage costs rising @ 60% / Year  Compliance: Must provide high performance EBS and other queries for SEC Solution  A RainStor Archive for storing and reporting against historical trade data  13 years of history loaded from Sybase IQ  Daily feed from trading application to RainStor  Runs on low-cost NAS Tier 3 storage and VMs  RainStor completely replaced Sybase IQ  90% cost savings - $5MM ROI  6 Projects live - 13 more in Progress 90% Storage Cost Reduction “ It’s like shrink-wrapping your data…forever!” – VP, Technology  30X Data Compression  3X Faster Query Compared to Sybase CONFIDENTIAL Compliance Archiving: Global Investment Bank Lower Compliant Data Retention Costs by a Factor of 10 BENEFITS Enterprise Standard for Data Retention with Faster Analytics
  • 29. Analytical Archiving : Large Multi-national Bank Retain Trading Data, Stay Compliant at Lowest Cost RainStor Active Archive Equities BAR 400TB FastForward™ 29 FastConnect™ Trades 200TB CONFIDENTIAL EMC WORM Storage  25X Compression  Meets Query SLAs BENEFITS Enterprise Standard for Compliance Driven Analysis  Runs on EMC Centera & Isilon (WORM)  Tape Avoidance Challenges  Cost: Fast data growth and Costly EDW’s (Teradata & Netezza) - offload history  Compliance: Must meet SEC compliance and retain equities data for query - run on approved WORM / CAS Storage (EMC)  Avoid data on offline tape - reinstate older Teradata data (BAR) and stay compliant. Solution  43 Equities apps (Oracle; SQL Server) offload history to RS  History offload from Netezza - run on WORM  Re-instate Tape and bring online for audits. 43 Apps
  • 30. RainStor + Isilon + Hadoop – TCO Compression rate 32X (>96% cost savings) Utilization Rate >80% Scalability Up to 20 PB per cluster Query Performance >= Hadoop on DAS RainStor + Hadoop + Isilon = Lowest 5yr TCO!
  • 31. Why RainStor-Isilon? 31 Flexible Architecture – Hadoop, Cloud Extract EDW data for Active Archiving Lower Storage Costs by at least 90% Gain Deeper Insights – SQL, Hive, Pig, Search, BI tools Reliable – High Availability, Disaster Recovery Purpose-built Security and Compliance features First SQL Compatible, Enterprise-grade Database (native to Hadoop) to run on Isilon Scale-out NAS.
  • 33. 33 Where Big Data & Archive Come Together Network EDWApps TapePlatforms RainStor – EMC Isilon Solution
  • 34. RainStor for Teradata Solution - 3 Components 34 FastForward ™  Reinstates from Offline Tape Archives  Handles V2R4, V2R5, V2R6, TD12, TD13  Eliminate Tape. FastConnect™  Offload history to Active Archive on continuous basis.  Run on Hadoop for Low Cost Scale. RainStor Core Database: • Highly Efficient Data Store - 20-40X Compression.
  • 35. 35
  • 36. 36
  • 37. Next Steps  Contact RainStor to find out more about the joint solution: info@rainstor.com  Contact EMC to find out more: CONFIDENTIAL37