SlideShare ist ein Scribd-Unternehmen logo
1 von 8
Downloaden Sie, um offline zu lesen
White
Paper
EMC Isilon: A Scalable Storage Platform
for Big Data
By Nik Rouda, Senior Analyst and Terri McClure, Senior Analyst
April 2014
This ESG White Paper was commissioned by EMC Isilon
and is distributed under license from ESG.
© 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved.
White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 2
© 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved.
Contents
Big Data Needs Big Storage ..........................................................................................................................3
Businesses Want Big Data to Have a Big Impact ...................................................................................................... 3
Storage Selection Criteria for Big Data.........................................................................................................4
Advantages of Isilon Scale-out Storage for Hadoop.....................................................................................5
Many Protocols, but Only One Copy of Data............................................................................................................ 6
In-place Analytics with Your Favorite Flavor of Hadoop .......................................................................................... 7
Enterprise-class Storage Increases Efficiency and Safety......................................................................................... 7
The Bigger Truth ...........................................................................................................................................7
All trademark names are property of their respective companies. Information contained in this publication has been obtained by sources The
Enterprise Strategy Group (ESG) considers to be reliable but is not warranted by ESG. This publication may contain opinions of ESG, which are
subject to change from time to time. This publication is copyrighted by The Enterprise Strategy Group, Inc. Any reproduction or redistribution of
this publication, in whole or in part, whether in hard-copy format, electronically, or otherwise to persons not authorized to receive it, without the
express consent of The Enterprise Strategy Group, Inc., is in violation of U.S. copyright law and will be subject to an action for civil damages and,
if applicable, criminal prosecution. Should you have any questions, please contact ESG Client Relations at 508.482.0188.
White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 3
© 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved.
Big Data Needs Big Storage
A rising tide of information is being collected, processed, and analyzed by enterprises around the world, but this
flood of data brings as many challenges as it does solutions. As companies become more data driven in a wide
range of activities, they will need their production big data implementations to meet common enterprise
requirements such as high performance, scalability, availability, security, and compliance. Underlying the databases
and analytics engines, other parts of the technology stack are critical components for providing these qualities, not
the least of which involves the choice of storage platforms. The storage systems can make or break a big data
implementation. EMC Isilon is a leader in scale-out storage and offers many advantages as a foundation for big data
analytics, which are built on the company’s years of experience in large enterprise data centers.
Businesses Want Big Data to Have a Big Impact
There has been no shortage of press on the many practical applications of big data in all industries and across all
lines of business. Many of these stories are compelling anecdotes, and are often specific to the particular
organization’s goals and activities. However, some common data analytics trends can be found across industries.
ESG recently conducted its 2014 IT Spending Intentions Survey and identified the top business benefits desired by
respondent organizations from their investments in business intelligence and analytics.1
Figure 1. Business Benefits from Data Investments
Source: Enterprise Strategy Group, 2014.
Implicit in all these goals is the need to serve the business not just with more data, but also with timelier reporting.
For many companies, the time needed to get an answer is the key criterion for the adoption of data-driven decision
1 Source: ESG Research Report, 2014 IT Spending Intentions Survey, February 2014. All ESG references and charts in this white paper have
been taken from this research report.
26%
30%
31%
34%
35%
36%
39%
41%
42%
59%
0% 10% 20% 30% 40% 50% 60% 70%
Reduced risk of product defects
Quicker time to market for products/services
Uncover new market opportunities
Faster tactical response to shifting customer views
More insights into historical results
More insights into future scenarios or outcomes
Incremental cost savings
Higher quality products/services
Reduced risk around business decisions and strategy
Improved operational efficiency
What business benefits do you expect to gain from your investments in the area of
business intelligence, analytics, and big data? (Percent of respondents, N=187, multiple
responses accepted)
White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 4
© 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved.
making. No longer do quarterly batch reports meet the needs—instead, daily updates, real-time alerts, and ad hoc
querying are becoming standard requirements for analysts and executives.
Vendors are now bringing a breadth of data technologies into play, ranging from traditional relational databases to
NoSQL and Hadoop, and from advanced analytics applications to data visualization and reporting tools. With these
tools come accompanying options in architecture models: commodity servers, ready-made appliances, or cloud
services, and open source or proprietary software. Each of these choices will have an impact on the overall
capabilities of the solution, affecting end-user perceptions of performance, flexibility, and availability.
These high expectations from business executives put a lot of pressure on enterprise IT departments to deliver a
well-implemented solution. This isn’t usually an easy task considering that big data initiatives often involve the
integration of many new data sources, big data platforms, and analytics applications with existing data warehouses
and transactional databases. This architectural complexity spans many IT disciplines, with dependencies on
everything, including applications, servers, networks, and storage. Looking at the ESG research in Figure 2, it is clear
that many of the top ten most-cited IT priorities will be directly related to proper management of enterprise data,
including big data.
Figure 2. Top Ten Most Important IT Priorities for 2014
Source: Enterprise Strategy Group, 2014.
These issues are sometimes glibly underestimated with the assumption that the Hadoop Distributed File System
(HDFS) provides cheap and cheerful provisions for storing and managing massive volumes of big data. The truth is
that storage requirements for the enterprise are becoming increasingly demanding, especially as more decision
makers become reliant on big data insights.
Storage Selection Criteria for Big Data
As noted, the choice of storage platform underpins the overall efficacy of the technology stack, and will have
ramifications that must be carefully evaluated. There are a number of factors to consider, including:
22%
23%
23%
23%
23%
24%
25%
29%
32%
32%
0% 5% 10% 15% 20% 25% 30% 35%
Build a “private cloud” infrastructure
Business intelligence/data analytics initiatives
Use cloud infrastructure services
Major application deployments or upgrades
Regulatory compliance initiatives
Desktop virtualization
Manage data growth
Improve data backup and recovery
Information security initiatives
Increase use of server virtualization
Which of the following would you consider to be your organization’s most important IT
priorities over the next 12 months? (Percent of respondents, N=562, ten responses
accepted)
White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 5
© 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved.
 Scalability and efficiency will have an obvious impact on the ability to ingest and store data. Particular
attention should be paid to mechanisms that reduce total footprint, such as deduplication, compression,
and the redundancy required to preserve data against loss. Human capital required to manage the system
should also be analyzed in the efficiency category because organizations cannot afford to continue to add
staff to manage the environment as data grows.
 Total cost of ownership (TCO) matters as the big data initiative benefits are weighed against both capital
and operating expense, including maintenance, support, footprint, and human capital. A reduced cost
structure should lead to more data stored (because organizations can now afford to) and more valuable
insights realized (as a benefit of having more data to analyze).
 Performance seems like an obvious requirement, but it can be elusive as more users do more
comprehensive analysis with larger data volumes. Finding a storage system that can handle the I/O
demands of the environment, including any extract, transform, and load (ETL) to other data repositories, is
critical. Data location has a significant impact on this, especially if large amounts need to be moved around
before analytics processing.
 Data protection, security, and governance utilities are becoming mandatory for big data environments. As
data lakes or data hubs start to encapsulate all manner of sensitive data in one central location, this clearly
needs to be treated with great care. Compliance with relevant government and industry regulations must
be addressed directly and explicitly. As a newer technology, Hadoop by itself isn’t as mature in these areas
as enterprises may require.
 Accessibility may be one of the least recognized attributes of the storage decision, but it can provide
significant advantages in flexibility of models for enabling different groups or tools to harness the data
without moving it into other platforms before processing can begin. Access controls also must be well
developed and granular.
These are all important factors for deciding how appropriate a storage platform is with big data environments.
Again, a range of traditional options for storage platforms include: commodity direct attached (DAS), storage area
network (SAN), and network attached storage (NAS). Conventional wisdom has been to use commodity storage in
the form of internal drives, but when weighing the impact of storage infrastructure choices on data management
and analytics conventional wisdom falls short on delivery.
Advantages of Isilon Scale-out Storage for Hadoop
Today, there is still a relative immaturity of functionality and robustness in many big data technology stacks when it
comes to storage. Although Hadoop and HDFS can simplify the model for scaling on commodity servers with DAS,
some alternatives provide compelling advantages for the enterprise and help overcome some of the challenges
associated with using the traditional approach.
Challenges with using the embedded storage/DAS approach include data protection, data leverage, elongated
business process, and, surprisingly, cost. On the data protection front, HDFS uses multiple copies of data to provide
data protection, meaning it consumes a lot of storage. Both data leverage and business processes are impacted by
the fact that data is only accessible via HDFS and is not accessible to other applications that require other interfaces
(i.e., RESTful object-based applications or NFS/CIFS/SMB file-based applications). This means ETL operations need
to be performed to ingest data or leverage data in other business processes, thus elongating those processes each
time the ETL process needs to be performed. This also means that organizations must have multiple data
repositories for the same data in multiple data formats to support different business processes. So on the surface,
using commodity DAS configurations may sound attractive and may indeed be a good fit for many organizations,
but those companies that need to analyze data from multiple sources or leverage it to support multiple business
processes incur further costs for additional infrastructure and may need to investigate alternative approaches.
One alternative approach that helps overcome these challenges is the adoption of a shared storage platform that
has been designed to meet enterprise IT operations requirements. EMC Isilon is a prime example of this case, which
White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 6
© 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved.
brings Hadoop to your data, instead of moving all your data to Hadoop clusters. It lets users create a central data
hub that supports multiple applications and business processes, reducing costs and business cycles by eliminating
most ETL requirements.
Many Protocols, but Only One Copy of Data
Isilon is a flexible storage platform that supports multiprotocol access to a single data object, eliminating the up-
front protocol decision because NFS, RESTful objects, HTTP, FTP, SMB, and HDFS are all supported. So users can
ingest an object from a web app and access it via NFS to edit it. Or better yet, a user might access web logs directly
from a web application, rather than exporting to a spreadsheet, and access these via the native HDFS interface to
run analytics. This ability to make only one copy available for multiple uses is a major benefit for reduction in
overall storage costs and cycle time because it means there is no need to export data to multiple systems for the
various use cases. A single repository also greatly simplifies compliance audit requirements, rather than chasing
after many distinct locations and sources.
Figure 3. Multi-protocol Access to Isilon’s OneFS Operating System
Source: EMC/Isilon, 2014.
White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 7
© 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved.
In-place Analytics with Your Favorite Flavor of Hadoop
Building on the support for multiprotocol access to a single data object, organizations can effectively do “in-place”
analytics on data without needing a lengthy data ingest from other primary storage data sources to the Hadoop
system, which very often leads to a faster overall time to results. Although more specialized data layouts and
approaches can sometimes be faster in querying and analysis, with Isilon, data analysis can be started immediately,
and the reduced effort and start time delay without ETL can often overcome the difference. In addition, concurrent
instances of different Hadoop distributions could be run in parallel on the same underlying storage system, giving
much more flexibility to leverage the relative strengths of each, again, without the need to move large quantities of
data around.
Enterprise-class Storage Increases Efficiency and Safety
Although HDFS may be a reliable and scalable model for collecting and storing the high volumes and varieties of
data in a typical big data environment, it isn’t necessarily the most efficient. Some features that provide that
robustness on commodity hardware may actually detract from overall efficiency. Mirroring with Hadoop direct
attached storage is a good example, causing typically three to five times redundancy, which significantly affects the
effective usage ratio of total drive capacity. Isilon, with built-in data protection, high availability, and general
robustness, can instead run at 80% utilization levels of capacity (compared with 20-33% with HDFS) and this is
further improved by data reduction of up to 30% with SmartDedupe. All this helps reduce the storage footprint,
bringing associated cost reductions in energy and space consumption in the data center. Separating server and
storage by growing each independently instead of always adding another fixed unit commodity server also allows
more targeted scaling of the environment to meet the actual workloads.
From a governance and security point of view, the Isilon storage system offers “write once, read many” (WORM)
compliance for archival to meet government and industry regulations, standard Kerberos authentication, and
access control lists (ACLs) to make sure the user touching the central data hub is authorized.
All of these features combine to reduce initial cost of purchase, ongoing operational costs, and risk of failure or
security breach of sensitive information.
The Bigger Truth
Having explored the rapid growth of big data in adoption and importance, and the potential impacts of the
underlying infrastructure, it is clear that enterprises should rethink the architectural implications of their storage
choices for their big data initiatives. There are multiple advantages in taking a shared storage approach, covering a
wide range of desired characteristics including increased efficiency, reduced total cost, overall speed to answer,
reduced risk of data loss or inappropriate access, and analytics flexibility.
Isilon is breaking ground in challenging the default storage paradigm assumptions of big data practitioners, and its
approach is well worth evaluation for its merits compared with the de facto standard of direct attached storage in
commodity server hardware. Coming from a long history of building flexible, scalable storage platforms for
demanding enterprise requirements serves Isilon well in addressing many common challenges of big data storage,
and this experience should serve customers well. Particularly, current Isilon customers should experiment with
running Hadoop on their existing systems; they may well find that the right answer is already in place.
20 Asylum Street | Milford, MA 01757 | Tel: 508.482.0188 Fax: 508.482.0218 | www.esg-global.com

Weitere ähnliche Inhalte

Was ist angesagt?

Msft Top10 Business Practicesfor Es Data Centers April09
Msft Top10 Business Practicesfor Es Data Centers April09Msft Top10 Business Practicesfor Es Data Centers April09
Msft Top10 Business Practicesfor Es Data Centers April09hutuworm
 
Attestor 2.0 Sneak Preview
Attestor 2.0 Sneak PreviewAttestor 2.0 Sneak Preview
Attestor 2.0 Sneak PreviewTrustArc
 
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONBRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONijmnct
 
Big data – A Review
Big data – A ReviewBig data – A Review
Big data – A ReviewIRJET Journal
 
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET Journal
 
Big data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceBig data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceInformation Security Awareness Group
 
Big data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-reportBig data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-reportAravindharamanan S
 
Why You Need to Govern Big Data
Why You Need to Govern Big DataWhy You Need to Govern Big Data
Why You Need to Govern Big DataIBM Analytics
 
IRJET- Analysis of Big Data Technology and its Challenges
IRJET- Analysis of Big Data Technology and its ChallengesIRJET- Analysis of Big Data Technology and its Challenges
IRJET- Analysis of Big Data Technology and its ChallengesIRJET Journal
 
White Paper on IBM MTSS
White Paper on IBM MTSSWhite Paper on IBM MTSS
White Paper on IBM MTSSEd Aussem
 
Storage Economic Principles: Reducing Data Storage Cost & TCO
Storage Economic Principles: Reducing Data Storage Cost & TCOStorage Economic Principles: Reducing Data Storage Cost & TCO
Storage Economic Principles: Reducing Data Storage Cost & TCOHitachi Vantara
 
Navigating Storage in a Cloudy Environment
Navigating Storage in a Cloudy EnvironmentNavigating Storage in a Cloudy Environment
Navigating Storage in a Cloudy EnvironmentHGST Storage
 
Lighten Your Data Center TCO With “Helium” Storage Solutions
Lighten Your Data Center TCO With “Helium” Storage SolutionsLighten Your Data Center TCO With “Helium” Storage Solutions
Lighten Your Data Center TCO With “Helium” Storage SolutionsHGST Storage
 
A technical Introduction to Big Data Analytics
A technical Introduction to Big Data AnalyticsA technical Introduction to Big Data Analytics
A technical Introduction to Big Data AnalyticsPethuru Raj PhD
 
IDC Study on Enterprise Hybrid Cloud Strategies
IDC Study on Enterprise Hybrid Cloud StrategiesIDC Study on Enterprise Hybrid Cloud Strategies
IDC Study on Enterprise Hybrid Cloud StrategiesEMC
 
Enterprise Information Management
Enterprise Information ManagementEnterprise Information Management
Enterprise Information ManagementBilly Cripe
 

Was ist angesagt? (20)

Msft Top10 Business Practicesfor Es Data Centers April09
Msft Top10 Business Practicesfor Es Data Centers April09Msft Top10 Business Practicesfor Es Data Centers April09
Msft Top10 Business Practicesfor Es Data Centers April09
 
Attestor 2.0 Sneak Preview
Attestor 2.0 Sneak PreviewAttestor 2.0 Sneak Preview
Attestor 2.0 Sneak Preview
 
Cloud Analytics Playbook
Cloud Analytics PlaybookCloud Analytics Playbook
Cloud Analytics Playbook
 
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONBRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
 
Data Center
Data Center Data Center
Data Center
 
Big data – A Review
Big data – A ReviewBig data – A Review
Big data – A Review
 
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
 
Big data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceBig data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security Alliance
 
Big data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-reportBig data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-report
 
Why You Need to Govern Big Data
Why You Need to Govern Big DataWhy You Need to Govern Big Data
Why You Need to Govern Big Data
 
IRJET- Analysis of Big Data Technology and its Challenges
IRJET- Analysis of Big Data Technology and its ChallengesIRJET- Analysis of Big Data Technology and its Challenges
IRJET- Analysis of Big Data Technology and its Challenges
 
White Paper on IBM MTSS
White Paper on IBM MTSSWhite Paper on IBM MTSS
White Paper on IBM MTSS
 
Storage Economic Principles: Reducing Data Storage Cost & TCO
Storage Economic Principles: Reducing Data Storage Cost & TCOStorage Economic Principles: Reducing Data Storage Cost & TCO
Storage Economic Principles: Reducing Data Storage Cost & TCO
 
Navigating Storage in a Cloudy Environment
Navigating Storage in a Cloudy EnvironmentNavigating Storage in a Cloudy Environment
Navigating Storage in a Cloudy Environment
 
Lighten Your Data Center TCO With “Helium” Storage Solutions
Lighten Your Data Center TCO With “Helium” Storage SolutionsLighten Your Data Center TCO With “Helium” Storage Solutions
Lighten Your Data Center TCO With “Helium” Storage Solutions
 
A technical Introduction to Big Data Analytics
A technical Introduction to Big Data AnalyticsA technical Introduction to Big Data Analytics
A technical Introduction to Big Data Analytics
 
White Paper on IBM MTSS
White Paper on IBM MTSSWhite Paper on IBM MTSS
White Paper on IBM MTSS
 
IDC Study on Enterprise Hybrid Cloud Strategies
IDC Study on Enterprise Hybrid Cloud StrategiesIDC Study on Enterprise Hybrid Cloud Strategies
IDC Study on Enterprise Hybrid Cloud Strategies
 
Enterprise Information Management
Enterprise Information ManagementEnterprise Information Management
Enterprise Information Management
 
Story_On_Data_Back_Up
Story_On_Data_Back_UpStory_On_Data_Back_Up
Story_On_Data_Back_Up
 

Andere mochten auch

Determinants of supply fri032814
Determinants of supply fri032814Determinants of supply fri032814
Determinants of supply fri032814Travis Klein
 
The Industrial Internet@Work
The Industrial Internet@WorkThe Industrial Internet@Work
The Industrial Internet@WorkEMC
 
Thurs review latin amer and europe
Thurs review latin amer and europeThurs review latin amer and europe
Thurs review latin amer and europeTravis Klein
 
Block mexico conquest
Block mexico conquestBlock mexico conquest
Block mexico conquestTravis Klein
 
PyCon lightning talk on my Toro module for Tornado
PyCon lightning talk on my Toro module for TornadoPyCon lightning talk on my Toro module for Tornado
PyCon lightning talk on my Toro module for Tornadoemptysquare
 
Texas s ta r powerpoint
Texas  s ta r powerpointTexas  s ta r powerpoint
Texas s ta r powerpointHalogen30
 
Mit2 092 f09_lec03
Mit2 092 f09_lec03Mit2 092 f09_lec03
Mit2 092 f09_lec03Rahman Hakim
 
El correu electrònic imad
El correu electrònic imadEl correu electrònic imad
El correu electrònic imadmgonellgomez
 
02 tues demand consumer surplus
02 tues demand consumer surplus02 tues demand consumer surplus
02 tues demand consumer surplusTravis Klein
 
Money supply inflation
Money supply inflationMoney supply inflation
Money supply inflationTravis Klein
 
Mit2 092 f09_lec15
Mit2 092 f09_lec15Mit2 092 f09_lec15
Mit2 092 f09_lec15Rahman Hakim
 

Andere mochten auch (20)

Determinants of supply fri032814
Determinants of supply fri032814Determinants of supply fri032814
Determinants of supply fri032814
 
The Industrial Internet@Work
The Industrial Internet@WorkThe Industrial Internet@Work
The Industrial Internet@Work
 
Thurs review latin amer and europe
Thurs review latin amer and europeThurs review latin amer and europe
Thurs review latin amer and europe
 
Block mexico conquest
Block mexico conquestBlock mexico conquest
Block mexico conquest
 
Mon fall of rome
Mon fall of romeMon fall of rome
Mon fall of rome
 
PyCon lightning talk on my Toro module for Tornado
PyCon lightning talk on my Toro module for TornadoPyCon lightning talk on my Toro module for Tornado
PyCon lightning talk on my Toro module for Tornado
 
Perfect comp
Perfect compPerfect comp
Perfect comp
 
Texas s ta r powerpoint
Texas  s ta r powerpointTexas  s ta r powerpoint
Texas s ta r powerpoint
 
Mon banking
Mon bankingMon banking
Mon banking
 
03 externalities
03 externalities03 externalities
03 externalities
 
Mit2 092 f09_lec03
Mit2 092 f09_lec03Mit2 092 f09_lec03
Mit2 092 f09_lec03
 
El correu electrònic imad
El correu electrònic imadEl correu electrònic imad
El correu electrònic imad
 
02 tues demand consumer surplus
02 tues demand consumer surplus02 tues demand consumer surplus
02 tues demand consumer surplus
 
Min wage 2014
Min wage 2014Min wage 2014
Min wage 2014
 
Chinese rev thur
Chinese rev thurChinese rev thur
Chinese rev thur
 
Money supply inflation
Money supply inflationMoney supply inflation
Money supply inflation
 
Mit2 092 f09_lec15
Mit2 092 f09_lec15Mit2 092 f09_lec15
Mit2 092 f09_lec15
 
Two pots
Two potsTwo pots
Two pots
 
Eduard
EduardEduard
Eduard
 
2015 day 8
2015 day 82015 day 8
2015 day 8
 

Ähnlich wie EMC Isilon: A Scalable Storage Platform for Big Data

Pdf wp-emc-mozyenterprise-hybrid-cloud-backup
Pdf wp-emc-mozyenterprise-hybrid-cloud-backupPdf wp-emc-mozyenterprise-hybrid-cloud-backup
Pdf wp-emc-mozyenterprise-hybrid-cloud-backuplverb
 
A Data-driven Maturity Model for Modernized, Automated, and Transformed IT
A Data-driven Maturity Model for Modernized, Automated, and Transformed ITA Data-driven Maturity Model for Modernized, Automated, and Transformed IT
A Data-driven Maturity Model for Modernized, Automated, and Transformed ITbalejandre
 
How Analytics Has Changed in the Last 10 Years (and How It’s Staye.docx
How Analytics Has Changed in the Last 10 Years (and How It’s Staye.docxHow Analytics Has Changed in the Last 10 Years (and How It’s Staye.docx
How Analytics Has Changed in the Last 10 Years (and How It’s Staye.docxpooleavelina
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...IT Support Engineer
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big dataDigimark
 
Building Smarter, Faster, and Scalable Data-Rich Application
Building Smarter, Faster, and Scalable Data-Rich ApplicationBuilding Smarter, Faster, and Scalable Data-Rich Application
Building Smarter, Faster, and Scalable Data-Rich ApplicationRobert Bira
 
Big Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxBig Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxPrabhaJoshi4
 
Build a Winning Data Strategy in 2022.pdf
Build a Winning Data Strategy in 2022.pdfBuild a Winning Data Strategy in 2022.pdf
Build a Winning Data Strategy in 2022.pdfAvinashBatham
 
9 Steps to Successful Information Lifecycle Management
9 Steps to Successful Information Lifecycle Management9 Steps to Successful Information Lifecycle Management
9 Steps to Successful Information Lifecycle ManagementIron Mountain
 
Big data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sightBig data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sightJyrki Määttä
 
Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...LindaWatson19
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Denodo
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsAbhishek Sood
 
The Big Data Importance – Tools and their Usage
The Big Data Importance – Tools and their UsageThe Big Data Importance – Tools and their Usage
The Big Data Importance – Tools and their UsageIRJET Journal
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANC...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS  IN KNOWLEDGE MANAGEMENT FOR ENHANC...LEVERAGING CLOUD BASED BIG DATA ANALYTICS  IN KNOWLEDGE MANAGEMENT FOR ENHANC...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANC...ijdpsjournal
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...ijdpsjournal
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...ijdpsjournal
 

Ähnlich wie EMC Isilon: A Scalable Storage Platform for Big Data (20)

Pdf wp-emc-mozyenterprise-hybrid-cloud-backup
Pdf wp-emc-mozyenterprise-hybrid-cloud-backupPdf wp-emc-mozyenterprise-hybrid-cloud-backup
Pdf wp-emc-mozyenterprise-hybrid-cloud-backup
 
The ABCs of Big Data
The ABCs of Big DataThe ABCs of Big Data
The ABCs of Big Data
 
A Data-driven Maturity Model for Modernized, Automated, and Transformed IT
A Data-driven Maturity Model for Modernized, Automated, and Transformed ITA Data-driven Maturity Model for Modernized, Automated, and Transformed IT
A Data-driven Maturity Model for Modernized, Automated, and Transformed IT
 
How Analytics Has Changed in the Last 10 Years (and How It’s Staye.docx
How Analytics Has Changed in the Last 10 Years (and How It’s Staye.docxHow Analytics Has Changed in the Last 10 Years (and How It’s Staye.docx
How Analytics Has Changed in the Last 10 Years (and How It’s Staye.docx
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big data
 
Building Smarter, Faster, and Scalable Data-Rich Application
Building Smarter, Faster, and Scalable Data-Rich ApplicationBuilding Smarter, Faster, and Scalable Data-Rich Application
Building Smarter, Faster, and Scalable Data-Rich Application
 
Sgcp14dunlea
Sgcp14dunleaSgcp14dunlea
Sgcp14dunlea
 
Big Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxBig Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptx
 
Build a Winning Data Strategy in 2022.pdf
Build a Winning Data Strategy in 2022.pdfBuild a Winning Data Strategy in 2022.pdf
Build a Winning Data Strategy in 2022.pdf
 
9 Steps to Successful Information Lifecycle Management
9 Steps to Successful Information Lifecycle Management9 Steps to Successful Information Lifecycle Management
9 Steps to Successful Information Lifecycle Management
 
Big data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sightBig data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sight
 
Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data Analytics
 
The Big Data Importance – Tools and their Usage
The Big Data Importance – Tools and their UsageThe Big Data Importance – Tools and their Usage
The Big Data Importance – Tools and their Usage
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANC...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS  IN KNOWLEDGE MANAGEMENT FOR ENHANC...LEVERAGING CLOUD BASED BIG DATA ANALYTICS  IN KNOWLEDGE MANAGEMENT FOR ENHANC...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANC...
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
 
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
LEVERAGING CLOUD BASED BIG DATA ANALYTICS IN KNOWLEDGE MANAGEMENT FOR ENHANCE...
 
Abstract
AbstractAbstract
Abstract
 

Mehr von EMC

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDEMC
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote EMC
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOEMC
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremioEMC
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lakeEMC
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereEMC
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History EMC
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewEMC
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeEMC
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic EMC
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityEMC
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeEMC
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015EMC
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesEMC
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsEMC
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookEMC
 

Mehr von EMC (20)

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremio
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis Openstack
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lake
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop Elsewhere
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical Review
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or Foe
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for Security
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure Age
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education Services
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere Environments
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBook
 

Kürzlich hochgeladen

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 

Kürzlich hochgeladen (20)

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 

EMC Isilon: A Scalable Storage Platform for Big Data

  • 1. White Paper EMC Isilon: A Scalable Storage Platform for Big Data By Nik Rouda, Senior Analyst and Terri McClure, Senior Analyst April 2014 This ESG White Paper was commissioned by EMC Isilon and is distributed under license from ESG. © 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved.
  • 2. White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 2 © 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved. Contents Big Data Needs Big Storage ..........................................................................................................................3 Businesses Want Big Data to Have a Big Impact ...................................................................................................... 3 Storage Selection Criteria for Big Data.........................................................................................................4 Advantages of Isilon Scale-out Storage for Hadoop.....................................................................................5 Many Protocols, but Only One Copy of Data............................................................................................................ 6 In-place Analytics with Your Favorite Flavor of Hadoop .......................................................................................... 7 Enterprise-class Storage Increases Efficiency and Safety......................................................................................... 7 The Bigger Truth ...........................................................................................................................................7 All trademark names are property of their respective companies. Information contained in this publication has been obtained by sources The Enterprise Strategy Group (ESG) considers to be reliable but is not warranted by ESG. This publication may contain opinions of ESG, which are subject to change from time to time. This publication is copyrighted by The Enterprise Strategy Group, Inc. Any reproduction or redistribution of this publication, in whole or in part, whether in hard-copy format, electronically, or otherwise to persons not authorized to receive it, without the express consent of The Enterprise Strategy Group, Inc., is in violation of U.S. copyright law and will be subject to an action for civil damages and, if applicable, criminal prosecution. Should you have any questions, please contact ESG Client Relations at 508.482.0188.
  • 3. White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 3 © 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved. Big Data Needs Big Storage A rising tide of information is being collected, processed, and analyzed by enterprises around the world, but this flood of data brings as many challenges as it does solutions. As companies become more data driven in a wide range of activities, they will need their production big data implementations to meet common enterprise requirements such as high performance, scalability, availability, security, and compliance. Underlying the databases and analytics engines, other parts of the technology stack are critical components for providing these qualities, not the least of which involves the choice of storage platforms. The storage systems can make or break a big data implementation. EMC Isilon is a leader in scale-out storage and offers many advantages as a foundation for big data analytics, which are built on the company’s years of experience in large enterprise data centers. Businesses Want Big Data to Have a Big Impact There has been no shortage of press on the many practical applications of big data in all industries and across all lines of business. Many of these stories are compelling anecdotes, and are often specific to the particular organization’s goals and activities. However, some common data analytics trends can be found across industries. ESG recently conducted its 2014 IT Spending Intentions Survey and identified the top business benefits desired by respondent organizations from their investments in business intelligence and analytics.1 Figure 1. Business Benefits from Data Investments Source: Enterprise Strategy Group, 2014. Implicit in all these goals is the need to serve the business not just with more data, but also with timelier reporting. For many companies, the time needed to get an answer is the key criterion for the adoption of data-driven decision 1 Source: ESG Research Report, 2014 IT Spending Intentions Survey, February 2014. All ESG references and charts in this white paper have been taken from this research report. 26% 30% 31% 34% 35% 36% 39% 41% 42% 59% 0% 10% 20% 30% 40% 50% 60% 70% Reduced risk of product defects Quicker time to market for products/services Uncover new market opportunities Faster tactical response to shifting customer views More insights into historical results More insights into future scenarios or outcomes Incremental cost savings Higher quality products/services Reduced risk around business decisions and strategy Improved operational efficiency What business benefits do you expect to gain from your investments in the area of business intelligence, analytics, and big data? (Percent of respondents, N=187, multiple responses accepted)
  • 4. White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 4 © 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved. making. No longer do quarterly batch reports meet the needs—instead, daily updates, real-time alerts, and ad hoc querying are becoming standard requirements for analysts and executives. Vendors are now bringing a breadth of data technologies into play, ranging from traditional relational databases to NoSQL and Hadoop, and from advanced analytics applications to data visualization and reporting tools. With these tools come accompanying options in architecture models: commodity servers, ready-made appliances, or cloud services, and open source or proprietary software. Each of these choices will have an impact on the overall capabilities of the solution, affecting end-user perceptions of performance, flexibility, and availability. These high expectations from business executives put a lot of pressure on enterprise IT departments to deliver a well-implemented solution. This isn’t usually an easy task considering that big data initiatives often involve the integration of many new data sources, big data platforms, and analytics applications with existing data warehouses and transactional databases. This architectural complexity spans many IT disciplines, with dependencies on everything, including applications, servers, networks, and storage. Looking at the ESG research in Figure 2, it is clear that many of the top ten most-cited IT priorities will be directly related to proper management of enterprise data, including big data. Figure 2. Top Ten Most Important IT Priorities for 2014 Source: Enterprise Strategy Group, 2014. These issues are sometimes glibly underestimated with the assumption that the Hadoop Distributed File System (HDFS) provides cheap and cheerful provisions for storing and managing massive volumes of big data. The truth is that storage requirements for the enterprise are becoming increasingly demanding, especially as more decision makers become reliant on big data insights. Storage Selection Criteria for Big Data As noted, the choice of storage platform underpins the overall efficacy of the technology stack, and will have ramifications that must be carefully evaluated. There are a number of factors to consider, including: 22% 23% 23% 23% 23% 24% 25% 29% 32% 32% 0% 5% 10% 15% 20% 25% 30% 35% Build a “private cloud” infrastructure Business intelligence/data analytics initiatives Use cloud infrastructure services Major application deployments or upgrades Regulatory compliance initiatives Desktop virtualization Manage data growth Improve data backup and recovery Information security initiatives Increase use of server virtualization Which of the following would you consider to be your organization’s most important IT priorities over the next 12 months? (Percent of respondents, N=562, ten responses accepted)
  • 5. White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 5 © 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved.  Scalability and efficiency will have an obvious impact on the ability to ingest and store data. Particular attention should be paid to mechanisms that reduce total footprint, such as deduplication, compression, and the redundancy required to preserve data against loss. Human capital required to manage the system should also be analyzed in the efficiency category because organizations cannot afford to continue to add staff to manage the environment as data grows.  Total cost of ownership (TCO) matters as the big data initiative benefits are weighed against both capital and operating expense, including maintenance, support, footprint, and human capital. A reduced cost structure should lead to more data stored (because organizations can now afford to) and more valuable insights realized (as a benefit of having more data to analyze).  Performance seems like an obvious requirement, but it can be elusive as more users do more comprehensive analysis with larger data volumes. Finding a storage system that can handle the I/O demands of the environment, including any extract, transform, and load (ETL) to other data repositories, is critical. Data location has a significant impact on this, especially if large amounts need to be moved around before analytics processing.  Data protection, security, and governance utilities are becoming mandatory for big data environments. As data lakes or data hubs start to encapsulate all manner of sensitive data in one central location, this clearly needs to be treated with great care. Compliance with relevant government and industry regulations must be addressed directly and explicitly. As a newer technology, Hadoop by itself isn’t as mature in these areas as enterprises may require.  Accessibility may be one of the least recognized attributes of the storage decision, but it can provide significant advantages in flexibility of models for enabling different groups or tools to harness the data without moving it into other platforms before processing can begin. Access controls also must be well developed and granular. These are all important factors for deciding how appropriate a storage platform is with big data environments. Again, a range of traditional options for storage platforms include: commodity direct attached (DAS), storage area network (SAN), and network attached storage (NAS). Conventional wisdom has been to use commodity storage in the form of internal drives, but when weighing the impact of storage infrastructure choices on data management and analytics conventional wisdom falls short on delivery. Advantages of Isilon Scale-out Storage for Hadoop Today, there is still a relative immaturity of functionality and robustness in many big data technology stacks when it comes to storage. Although Hadoop and HDFS can simplify the model for scaling on commodity servers with DAS, some alternatives provide compelling advantages for the enterprise and help overcome some of the challenges associated with using the traditional approach. Challenges with using the embedded storage/DAS approach include data protection, data leverage, elongated business process, and, surprisingly, cost. On the data protection front, HDFS uses multiple copies of data to provide data protection, meaning it consumes a lot of storage. Both data leverage and business processes are impacted by the fact that data is only accessible via HDFS and is not accessible to other applications that require other interfaces (i.e., RESTful object-based applications or NFS/CIFS/SMB file-based applications). This means ETL operations need to be performed to ingest data or leverage data in other business processes, thus elongating those processes each time the ETL process needs to be performed. This also means that organizations must have multiple data repositories for the same data in multiple data formats to support different business processes. So on the surface, using commodity DAS configurations may sound attractive and may indeed be a good fit for many organizations, but those companies that need to analyze data from multiple sources or leverage it to support multiple business processes incur further costs for additional infrastructure and may need to investigate alternative approaches. One alternative approach that helps overcome these challenges is the adoption of a shared storage platform that has been designed to meet enterprise IT operations requirements. EMC Isilon is a prime example of this case, which
  • 6. White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 6 © 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved. brings Hadoop to your data, instead of moving all your data to Hadoop clusters. It lets users create a central data hub that supports multiple applications and business processes, reducing costs and business cycles by eliminating most ETL requirements. Many Protocols, but Only One Copy of Data Isilon is a flexible storage platform that supports multiprotocol access to a single data object, eliminating the up- front protocol decision because NFS, RESTful objects, HTTP, FTP, SMB, and HDFS are all supported. So users can ingest an object from a web app and access it via NFS to edit it. Or better yet, a user might access web logs directly from a web application, rather than exporting to a spreadsheet, and access these via the native HDFS interface to run analytics. This ability to make only one copy available for multiple uses is a major benefit for reduction in overall storage costs and cycle time because it means there is no need to export data to multiple systems for the various use cases. A single repository also greatly simplifies compliance audit requirements, rather than chasing after many distinct locations and sources. Figure 3. Multi-protocol Access to Isilon’s OneFS Operating System Source: EMC/Isilon, 2014.
  • 7. White Paper: EMC Isilon: A Scalable Storage Platform for Big Data 7 © 2014 by The Enterprise Strategy Group, Inc. All Rights Reserved. In-place Analytics with Your Favorite Flavor of Hadoop Building on the support for multiprotocol access to a single data object, organizations can effectively do “in-place” analytics on data without needing a lengthy data ingest from other primary storage data sources to the Hadoop system, which very often leads to a faster overall time to results. Although more specialized data layouts and approaches can sometimes be faster in querying and analysis, with Isilon, data analysis can be started immediately, and the reduced effort and start time delay without ETL can often overcome the difference. In addition, concurrent instances of different Hadoop distributions could be run in parallel on the same underlying storage system, giving much more flexibility to leverage the relative strengths of each, again, without the need to move large quantities of data around. Enterprise-class Storage Increases Efficiency and Safety Although HDFS may be a reliable and scalable model for collecting and storing the high volumes and varieties of data in a typical big data environment, it isn’t necessarily the most efficient. Some features that provide that robustness on commodity hardware may actually detract from overall efficiency. Mirroring with Hadoop direct attached storage is a good example, causing typically three to five times redundancy, which significantly affects the effective usage ratio of total drive capacity. Isilon, with built-in data protection, high availability, and general robustness, can instead run at 80% utilization levels of capacity (compared with 20-33% with HDFS) and this is further improved by data reduction of up to 30% with SmartDedupe. All this helps reduce the storage footprint, bringing associated cost reductions in energy and space consumption in the data center. Separating server and storage by growing each independently instead of always adding another fixed unit commodity server also allows more targeted scaling of the environment to meet the actual workloads. From a governance and security point of view, the Isilon storage system offers “write once, read many” (WORM) compliance for archival to meet government and industry regulations, standard Kerberos authentication, and access control lists (ACLs) to make sure the user touching the central data hub is authorized. All of these features combine to reduce initial cost of purchase, ongoing operational costs, and risk of failure or security breach of sensitive information. The Bigger Truth Having explored the rapid growth of big data in adoption and importance, and the potential impacts of the underlying infrastructure, it is clear that enterprises should rethink the architectural implications of their storage choices for their big data initiatives. There are multiple advantages in taking a shared storage approach, covering a wide range of desired characteristics including increased efficiency, reduced total cost, overall speed to answer, reduced risk of data loss or inappropriate access, and analytics flexibility. Isilon is breaking ground in challenging the default storage paradigm assumptions of big data practitioners, and its approach is well worth evaluation for its merits compared with the de facto standard of direct attached storage in commodity server hardware. Coming from a long history of building flexible, scalable storage platforms for demanding enterprise requirements serves Isilon well in addressing many common challenges of big data storage, and this experience should serve customers well. Particularly, current Isilon customers should experiment with running Hadoop on their existing systems; they may well find that the right answer is already in place.
  • 8. 20 Asylum Street | Milford, MA 01757 | Tel: 508.482.0188 Fax: 508.482.0218 | www.esg-global.com