SlideShare ist ein Scribd-Unternehmen logo
1 von 17
Downloaden Sie, um offline zu lesen
BIG DATA MANAGEMENT
WORK SMARTER NOT HARDER
GET MORE OUT OF YOUR DATA FOR LESS
CONTENTS
Long on Data, Short on Resources   1
Know Your Data   3
ii Reducing Data Maintenance Costs   5
Choose Your Data Platform Wisely   8
ii Reigning in Data Growth Costs   10
Don’t Keep What You Don’t Need   11
ii Overcoming Data Growth and Regulatory Compliance Challenges   12
Getting What You Need to Manage Your Data   14
For More Info   15
1 Big Data Management: Work Smarter Not Harder
We’ve been deluged with
statistics on data’s rapid growth
to the point that the numbers
and bytes have become almost
meaningless. No one would
deny that data growth is an
unstoppable trend. But that’s
not the issue. The real issue is
how organizations can make
big data meaningful when IT
resources are shrinking.
The good news is that business users want more data, and they’re getting
it, but in some cases, more data is actually having an adverse effect on
business. Fifty-six percent of IT decision makers surveyed by IDG said
that their users frequently or occasionally report feeling overwhelmed
by incoming data and information, while 53% said the influx of large
quantities of data has delayed important decisions because they didn’t
have the right tools to properly manage it. Leading companies are
realizing that having the right technology makes all the difference to
assure that data can be used as an asset rather than a liability.
LONG ON DATA, SHORT ON RESOURCES
56% 53%
of IT decision makers said that
their users report feeling
overwhelmed by incoming
data and information
said the influx of data has
delayed decisions
because they didn’t have
the right tools to manage it
IDG Enterprise, 2015
Data management staff
as a percentage of IT staff
has risen a meager
Computer Economics, 2015
0.5%
2 Big Data Management: Work Smarter Not Harder
Long on Data, Short on Resources
However, despite the perceived
value of data, the allocation
of resources to manage and
leverage big data has not kept
pace with its growth. According
to research firm Computer
Economics, data management
staff as a percentage of IT staff
has risen a meager .5% in four
years, and IT spending per user
continues to decline. In fact, the
same study showed that when
adjusted for inflation, spending
decreased from $10,514 in 2012
per user to just $6,847 in 2015.
But it’s not all about the money.
Finding people with the necessary skillsets will only grow more
challenging. The McKinsey Global Institute predicts that by 2018 the US
could face a shortage of 140,000-190,000 people with deep analytical
skills as well as a deficit of 1.5 million people who can leverage big data
analysis to make effective decisions. This drives the need for automation
that reduces the skills and training required to manage data.
As is often the case, the best way to address the big data resource and
skills shortage is to work smarter — not harder. In this ebook, we look
at how IT organizations can manage data smarter — while maintaining
or even reducing costs — so that business users can get real value from
data, faster and easier.
3 Big Data Management: Work Smarter Not Harder
Moving data, transforming data,
and making it available to the
business is a very expensive
process. Given data’s rapid rate
of growth — and the amount
of waste in the current data
management paradigm — it’s
time to transform the economics
of data.
Most enterprises leverage a wide variety of data types in high volumes
for big data analytics projects. These include social media data, internal
data, log data, mobile device data, sensor data, free public external
data — and the list keeps growing. In fact, according to QuinStreet
Research, by 2020, the world will generate 50 times as much data as it
does today, but the IT staff responsible for managing it will only grow
1.5 times. On top of that challenge, only 40-55% of the data
that they load is ever used. When you consider that it
costs $2-6 million to support every 50-100 TB
of new data, supporting dormant data
results in a tremendous amount
of inefficiency.
KNOW YOUR DATA
QuinStreet Enterprise Research, 2014
But the IT staff
who manages it
will only grow 1.5Xthe world will generate 50Xas much data
By 2020
4 Big Data Management: Work Smarter Not Harder
Dormant data also slows down performance since the process of loading
data uses up to 60% of the CPU. A lot of data may need to be retained
in its original form for compliance and undergo ETL and transformation
processes for the prospect of using it for other needs, but never get
used. As a result, it’s unnecessarily impacting costs and performance.
Know Your Data
But the exorbitant cost of not
managing dormant data well isn’t
just about the storage. In fact,
it’s less about the storage and
more about CPU capacity. Most
vendors charge by CPU capacity.
As CPU capacity increases, so do
your licensing costs.
Only
40to
55%
of the data
companies load will ever be used
Every 50-100 TB
of new data costs
$2-6 Million
to support it
Cost of Supporting DataData Waste Cost of CPU
Loading
data uses
up to
60%
(License costs go up as CPU capacity increases)
of the
CPU
Source: Based on Attunity customer implementations/input worldwide, 2015
CUSTOMER SUCCESS STORY
By offloading 43%
of the EDW into Hadoop
$21M
$5M
DECREASE
Source: Based on Attunity customer implementations/input worldwide, 2015
Yearly
maintenance
costs (in three years!)
5 Big Data Management: Work Smarter Not Harder
Reducing Data Maintenance Costs
By looking at and analyzing
EDW use for just one month, an
Attunity customer discovered
that 37 TB of data — 43% of the
EDW — didn’t receive any kind
of analytical query. And yet the
CPU consumption to ingest and
load the data was over 60%.
By offloading that 43% into Hadoop, the customer dramatically
decreased the need for more capacity, reduced the number of EDW
nodes and lowered maintenance costs. In fact, the customer is looking
at driving down yearly maintenance costs from $21 million to $5
million in just three years — all by being more strategic about data
management.
6 Big Data Management: Work Smarter Not Harder
Know Your Data
The data warehouse is a
reflection of the business. It
grows in response to business
needs. It makes sense then to
analyze data activity and usage
accordingly. When you group
applications, data, or users in
the context of the business (for
example, by department or line
of business), you can then begin
to analyze utilization and assign
accountability via chargeback or
showback. For example, when
marketing requests more data
from IT, the IT department may
need to show them how much
data hasn’t been used, along
with the cost to continue to
manage current and new data.
When a business can specify how much it costs to load and maintain
data, and demonstrate how much isn’t being used, the dataset that
seemed so important before may lose some of its significance. The
standing request might just lose its urgency, particularly if the cost to
keep the data comes out of departmental budgets and ROI is lacking.
To figure out what’s used...
look
at what’s been qu
eried
Source: Based on Attunity customer implementations/input worldwide, 2015
43% of data in the
data warehouse never received a
single analytical query in a month
Source: Based on Attunity customer implementations/input worldwide, 2015
Identifying dormant data
recovers storage capacity
nt staff, as a percentage of IT staff, has risen a meager .5%
7 Big Data Management: Work Smarter Not Harder
are consuming CPU capacity. If you do need the data, say for regulatory
reasons, you can offload the processes of ETL to load and transform the
data onto a lower-cost Hadoop cluster. You not only recover storage
capacity, but you also consume less CPU capacity on the system because
of all the data that you’re not loading and ingesting into an EDW.
The key is to gain visibility into the EDW to learn what data is used and
what data is unused.
Identifying dormant data
recovers storage capacity. But it
also helps reduce costs related
to loading and transforming the
data. If you don’t need the data
anymore, you can stop loading
it, which means you eliminate a
portion of the ETL processes that
Know Your Data
8 Big Data Management: Work Smarter Not Harder
As data grows, the platforms that support it increase
in size and multiply because different platforms
optimize different workloads. That’s why placing
data on the right platform is critical to efficiently
managing data as a strategic asset. Enterprises
can realize significant benefits by modernizing and
optimizing data placement.
Not all data is created equal. Some data is of high
value and used for complex analytics while other
data is kept primarily for regulatory purposes —
and then there’s all the data in between. A dataset
should be moved to the most appropriate platform
based on its use case.
CHOOSE YOUR DATA PLATFORM WISELY
Data that’s being loaded, but you
don’t need for the business
Datasets that are being utilized, but
don’t require a high-end data warehouse
Data that should be maintained,
but not used for analytics
Archive or throw away Load and maintain in Hadoop
Load and run batch
analytics in Hadoop
9 Big Data Management: Work Smarter Not Harder
Choose Your Data Platform Wisely
There are three general types of data platforms:
Moving data that’s not queried
but still needs to be maintained
into a lower cost platform
like Hadoop can sometimes
help to support and balance
data growth. As a result, an
enterprise can reduce the need
for more storage capacity and
the number of EDW nodes. This
lowers both maintenance costs
and costs related to adding
more capacity.
The key is to figure out what
you’re loading into each of
these systems, and move
data as necessary to the
most appropriate
platform.
a particular subject area (such
as sales or finance). They may
be fed by data from a data
warehouse or from multiple
source systems. Data marts tend
to be hosted on typical, run-of-
the-mill servers.
ƒƒHadoop
Hadoop is suitable for structured,
unstructured, and semi-
structured data, and can run on
premises or in the cloud. Hadoop
is a great place to load and
maintain high volumes of data
that should be kept but is not
typically used for frequently used,
high-end analytics supporting
many simultaneous users.
Enterprise data warehouse
An enterprise data warehouse
(EDW) is appropriate for
frequently accessed, high-value
data used for complex analytics.
EDWs are high-end engineered
systems designed specifically
for complex analytics and many
simultaneous users — and
they’re priced accordingly. An
EDW is a great place to leverage
high-value data, but it isn’t the
ideal place to store data that you
don’t plan to use anytime soon.
‚‚Data mart
A data mart is more focused
than a data warehouse,
consolidating information for
CUSTOMER SUCCESS STORY
Online
Travel
Company Optimized data and workloads
for Hadoop cluster
Reduced
data footprint
on EDW by 30-40%
10Xin cost
savings
Reigning in Data Growth Costs
An online travel company’s 6+ petabyte production IT systems were
growing rapidly within a multi-platform environment that included
Hadoop and several legacy data warehouse systems. The DB2 data
warehouse was already at 300 TB, and adding more capacity was simply
cost prohibitive.
Using Attunity Visibility to balance workloads and data across the data
warehousing environment had a significant impact on costs associated
with data growth. The online travel company reduced its data footprint
on the EDW by 30-40%. Offloading data and associated workloads to
Hadoop saved the company $6 million.
Furthermore, its IT department
can ensure that these cost
savings are maintained by
providing chargeback reports
to business lines. By showing
business users what data is being
used and at what cost, IT can
make a case for moving data to
lower-cost platforms or making
additional investments in IT.
10 Big Data Management: Work Smarter Not Harder
11 Big Data Management: Work Smarter Not Harder
Even as you move data to the
appropriate platform, it behooves you
to consider whether it’s necessary to
keep specific datasets at all. There’s
great potential to lower costs by
purging unused data. Many Attunity
customers report that more than one-
third of data in the data warehouse
never receives a single analytical query
in a month. That’s a huge chunk of data
— and potential cost savings.
In order to determine what data is
worth keeping, IT must analyze data
usage and collaborate across teams to
classify data into four categories:
Category 1: Data that doesn’t need to be kept at all and
can be purged. This data isn’t used for analysis, and it
doesn’t need to be archived.
Category 2: Data that must be kept for
regulatory or other reasons but isn’t being
used for analytical purposes. These datasets
do not require a high-end engineered EDW.
They can be placed in a Hadoop cluster or something less
cost prohibitive. Hadoop is a perfect option because it’s a less
expensive system that allows you to continue to do all the data
processing and maintenance and still have access to the data,
because it’s a live platform. So when you do need the data,
you can access it directly in Hadoop or move it into the data
warehouse for analysis on premises or in the cloud.
DON’T KEEP WHAT YOU DON’T NEED
CUSTOMER SUCCESS STORY
Large
Financial
Institution
Capped IT infrastructure investment at existing capacity
Avoided $15M
in upgrade costs
Ready to handle faster
rates of data growth
in the future
12 Big Data Management: Work Smarter Not Harder
Overcoming Data Growth and Regulatory Compliance Challenges
Data growth made it difficult for a leading national bank to manage data
and maintain regulatory compliance. With data growing at 100-150% a
year, the bank was quickly running out of capacity. It expected to spend
$10–15 million in 12–18 months on hardware upgrades. Meanwhile,
IT had no way of tracking who accessed what data at the table and
column level, which is necessary to fulfill regulatory compliance and
audit requests. Attunity Visibility enables the IT organization to make
informed decisions about the datasets and related workloads that can
be rebalanced and optimized with Hadoop. As a result, the institution
capped its IT infrastructure investment at existing capacity to avoid
$15 million in upgrade costs while also empowering its teams to handle
faster rates of data growth in
the future. Attunity Visibility also
helps the bank meet regulatory
compliance requirements and
respond to audit requests in
a timely manner. The solution
identifies user activity related
to specific customer data at a
granular level and generates
weekly audit reports.
In order to categorize data, you need
to understand what the datasets are
and what users are doing with them
13 Big Data Management: Work Smarter Not Harder
Category 3: Datasets that are analyzed but don’t require an engineered
EDW, such as large-scale data extracts for offline analytics. SAS is a
good example. Many SAS users access data that’s in a data warehouse,
but they don’t do the analytics in the data warehouse. Instead, they
extract huge amounts of data into the SAS server for data mining. This
use case doesn’t require an engineered system like an EDW. Hadoop
does a great job for batch analytics, and it costs less. You can pull huge
streams of data back to the SAS server and analyze it there.
Category 4: Data that’s widely and repeatedly leveraged by the
business, and therefore suitable for storage in your EDW.
In order to categorize data,
you need to understand what
the datasets are and what
users are doing with them.
You must then get buy-in from
the stakeholders. Show usage
patterns to the business and
collaborate with them to make
decisions in an iterative fashion.
Over time, the returns are
significant.
Don’t Keep What You Don’t Need
14 Big Data Management: Work Smarter Not Harder
Effective data management
requires two primary capabilities:
 Integrate and move data
more easily across all major
relational database systems,
enterprise data warehouses,
and cloud and big data
platforms.
‚‚ Tune performance, optimize
data placement, and reduce
costs with metrics on how
the business is utilizing data
and platform resources.
In addition to getting real value out of data, effective data management
enables IT organizations to reduce big data costs. With visibility into
how data is used, IT can work with the business to make informed
decisions about what data is worth keeping and how it should be
stored, and what data can be purged or archived. This practice has even
enabled some IT organizations to cap their IT infrastructure investments
at existing capacity.
Being called on to do more with
less is nothing new for IT. Time
and again, IT organizations
learn to work smarter and leaner
while delivering key services
to the business. Big data
analytics is no different.
GETTING WHAT YOU NEED TO MANAGE YOUR DATA
Pre
pareData
Move
D
ata
Analyze Usage
Effective
Data Management
Capabilities
FOR MORE INFO
866.288.8648
sales@attunity.com
www.attunity.com
Follow Attunity on
Read the Blog ⟩⟩

Weitere ähnliche Inhalte

Was ist angesagt?

Embracing data science
Embracing data scienceEmbracing data science
Embracing data scienceVipul Kalamkar
 
Odgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White PaperOdgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White PaperRobertson Executive Search
 
Big data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-reportBig data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-reportAravindharamanan S
 
Creating the Foundations for the Internet of Things
Creating the Foundations for the Internet of ThingsCreating the Foundations for the Internet of Things
Creating the Foundations for the Internet of ThingsCapgemini
 
Augmented Data Management
Augmented Data ManagementAugmented Data Management
Augmented Data ManagementFORMCEPT
 
Modern Data Management
Modern Data ManagementModern Data Management
Modern Data ManagementSAP Technology
 
Operationalizing the Buzz: Big Data 2013
Operationalizing the Buzz: Big Data 2013Operationalizing the Buzz: Big Data 2013
Operationalizing the Buzz: Big Data 2013VMware Tanzu
 
Cloud and business agility
Cloud and business agilityCloud and business agility
Cloud and business agilityMike ORourke
 
Unlocking value in your (big) data
Unlocking value in your (big) dataUnlocking value in your (big) data
Unlocking value in your (big) dataOscar Renalias
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)mark madsen
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects FailSense Corp
 
Analytics 3.0 Measurable business impact from analytics & big data
Analytics 3.0 Measurable business impact from analytics & big dataAnalytics 3.0 Measurable business impact from analytics & big data
Analytics 3.0 Measurable business impact from analytics & big dataMicrosoft
 
A better business case for big data with Hadoop
A better business case for big data with HadoopA better business case for big data with Hadoop
A better business case for big data with HadoopAptitude Software
 
How to understand trends in the data & software market
How to understand trends in the data & software marketHow to understand trends in the data & software market
How to understand trends in the data & software marketmark madsen
 

Was ist angesagt? (19)

Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 
Mighty Guides- Data Disruption
Mighty Guides- Data DisruptionMighty Guides- Data Disruption
Mighty Guides- Data Disruption
 
Odgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White PaperOdgers Berndtson and Unico Big Data White Paper
Odgers Berndtson and Unico Big Data White Paper
 
Big data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-reportBig data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-report
 
Creating the Foundations for the Internet of Things
Creating the Foundations for the Internet of ThingsCreating the Foundations for the Internet of Things
Creating the Foundations for the Internet of Things
 
Augmented Data Management
Augmented Data ManagementAugmented Data Management
Augmented Data Management
 
Top 10 BI Trends for 2013
Top 10 BI Trends for 2013Top 10 BI Trends for 2013
Top 10 BI Trends for 2013
 
Modern Data Management
Modern Data ManagementModern Data Management
Modern Data Management
 
Operationalizing the Buzz: Big Data 2013
Operationalizing the Buzz: Big Data 2013Operationalizing the Buzz: Big Data 2013
Operationalizing the Buzz: Big Data 2013
 
Bidata
BidataBidata
Bidata
 
Cloud and business agility
Cloud and business agilityCloud and business agility
Cloud and business agility
 
Analytics3.0 e book
Analytics3.0 e bookAnalytics3.0 e book
Analytics3.0 e book
 
Unlocking value in your (big) data
Unlocking value in your (big) dataUnlocking value in your (big) data
Unlocking value in your (big) data
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
 
Big Data at a Glance
Big Data at a GlanceBig Data at a Glance
Big Data at a Glance
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
 
Analytics 3.0 Measurable business impact from analytics & big data
Analytics 3.0 Measurable business impact from analytics & big dataAnalytics 3.0 Measurable business impact from analytics & big data
Analytics 3.0 Measurable business impact from analytics & big data
 
A better business case for big data with Hadoop
A better business case for big data with HadoopA better business case for big data with Hadoop
A better business case for big data with Hadoop
 
How to understand trends in the data & software market
How to understand trends in the data & software marketHow to understand trends in the data & software market
How to understand trends in the data & software market
 

Ähnlich wie Big Data Management: Work Smarter Not Harder

The New Enterprise Blueprint featuring the Gartner Magic Quadrant
The New Enterprise Blueprint featuring the Gartner Magic QuadrantThe New Enterprise Blueprint featuring the Gartner Magic Quadrant
The New Enterprise Blueprint featuring the Gartner Magic QuadrantLindaWatson19
 
What's the Big Deal About Big Data?
What's the Big Deal About Big Data?What's the Big Deal About Big Data?
What's the Big Deal About Big Data?Logi Analytics
 
Big data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-reportBig data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-reportAravindharamanan S
 
Intel Big Data Analysis Peer Research Slideshare 2013
Intel Big Data Analysis Peer Research Slideshare 2013Intel Big Data Analysis Peer Research Slideshare 2013
Intel Big Data Analysis Peer Research Slideshare 2013Intel IT Center
 
Big Data and Analytics: The New Underpinning for Supply Chain Success? - 17 F...
Big Data and Analytics: The New Underpinning for Supply Chain Success? - 17 F...Big Data and Analytics: The New Underpinning for Supply Chain Success? - 17 F...
Big Data and Analytics: The New Underpinning for Supply Chain Success? - 17 F...Lora Cecere
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analyticsThe Marketing Distillery
 
intelligent-data-lake_executive-brief
intelligent-data-lake_executive-briefintelligent-data-lake_executive-brief
intelligent-data-lake_executive-briefLindy-Anne Botha
 
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...Vasu S
 
Enterprise Business Intelligence & Data Warehousing: The Data Quality Conundrum
Enterprise Business Intelligence & Data Warehousing: The Data Quality ConundrumEnterprise Business Intelligence & Data Warehousing: The Data Quality Conundrum
Enterprise Business Intelligence & Data Warehousing: The Data Quality ConundrumRTTS
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analyticsThe Marketing Distillery
 
Datacenter industry survey 2015
Datacenter industry survey 2015Datacenter industry survey 2015
Datacenter industry survey 2015sher lion
 
Managing The Data Explosion
Managing The Data ExplosionManaging The Data Explosion
Managing The Data ExplosionLaura Hood
 
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...CompTIA
 
Capturing big value in big data
Capturing big value in big data Capturing big value in big data
Capturing big value in big data BSP Media Group
 
Whitepaper - Simplifying Analytics Adoption in Enterprise
Whitepaper - Simplifying Analytics Adoption in EnterpriseWhitepaper - Simplifying Analytics Adoption in Enterprise
Whitepaper - Simplifying Analytics Adoption in EnterpriseBRIDGEi2i Analytics Solutions
 

Ähnlich wie Big Data Management: Work Smarter Not Harder (20)

Research Data Drives Profit
Research Data Drives ProfitResearch Data Drives Profit
Research Data Drives Profit
 
The New Enterprise Blueprint featuring the Gartner Magic Quadrant
The New Enterprise Blueprint featuring the Gartner Magic QuadrantThe New Enterprise Blueprint featuring the Gartner Magic Quadrant
The New Enterprise Blueprint featuring the Gartner Magic Quadrant
 
What's the Big Deal About Big Data?
What's the Big Deal About Big Data?What's the Big Deal About Big Data?
What's the Big Deal About Big Data?
 
Big data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-reportBig data-analytics-2013-peer-research-report
Big data-analytics-2013-peer-research-report
 
Intel Big Data Analysis Peer Research Slideshare 2013
Intel Big Data Analysis Peer Research Slideshare 2013Intel Big Data Analysis Peer Research Slideshare 2013
Intel Big Data Analysis Peer Research Slideshare 2013
 
Big Data and Analytics: The New Underpinning for Supply Chain Success? - 17 F...
Big Data and Analytics: The New Underpinning for Supply Chain Success? - 17 F...Big Data and Analytics: The New Underpinning for Supply Chain Success? - 17 F...
Big Data and Analytics: The New Underpinning for Supply Chain Success? - 17 F...
 
Unlocking big data
Unlocking big dataUnlocking big data
Unlocking big data
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analytics
 
intelligent-data-lake_executive-brief
intelligent-data-lake_executive-briefintelligent-data-lake_executive-brief
intelligent-data-lake_executive-brief
 
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...
Activating Big Data: The Key To Success with Machine Learning Advanced Analyt...
 
Enterprise Business Intelligence & Data Warehousing: The Data Quality Conundrum
Enterprise Business Intelligence & Data Warehousing: The Data Quality ConundrumEnterprise Business Intelligence & Data Warehousing: The Data Quality Conundrum
Enterprise Business Intelligence & Data Warehousing: The Data Quality Conundrum
 
Big data baddata-gooddata
Big data baddata-gooddataBig data baddata-gooddata
Big data baddata-gooddata
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analytics
 
Datacenter industry survey 2015
Datacenter industry survey 2015Datacenter industry survey 2015
Datacenter industry survey 2015
 
Managing The Data Explosion
Managing The Data ExplosionManaging The Data Explosion
Managing The Data Explosion
 
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
 
Big data
Big dataBig data
Big data
 
Capturing big value in big data
Capturing big value in big data Capturing big value in big data
Capturing big value in big data
 
Whitepaper - Simplifying Analytics Adoption in Enterprise
Whitepaper - Simplifying Analytics Adoption in EnterpriseWhitepaper - Simplifying Analytics Adoption in Enterprise
Whitepaper - Simplifying Analytics Adoption in Enterprise
 

Mehr von Jennifer Walker

Apperian 2014 Executive Enterprise Mobility Report
Apperian 2014 Executive Enterprise Mobility ReportApperian 2014 Executive Enterprise Mobility Report
Apperian 2014 Executive Enterprise Mobility ReportJennifer Walker
 
Apperian 2015 Executive Enterprise Mobility Survey
Apperian 2015 Executive Enterprise Mobility SurveyApperian 2015 Executive Enterprise Mobility Survey
Apperian 2015 Executive Enterprise Mobility SurveyJennifer Walker
 
Apperian 2016 Executive Enterprise Mobility Report
Apperian 2016 Executive Enterprise Mobility ReportApperian 2016 Executive Enterprise Mobility Report
Apperian 2016 Executive Enterprise Mobility ReportJennifer Walker
 
Apperian 2017 Executive Enterprise Mobility Report
Apperian 2017 Executive Enterprise Mobility ReportApperian 2017 Executive Enterprise Mobility Report
Apperian 2017 Executive Enterprise Mobility ReportJennifer Walker
 
How Coca-Cola Started Working with Startups
How Coca-Cola Started Working with StartupsHow Coca-Cola Started Working with Startups
How Coca-Cola Started Working with StartupsJennifer Walker
 
Qlik view selfservicebi_2013-08-01_marketing
Qlik view selfservicebi_2013-08-01_marketingQlik view selfservicebi_2013-08-01_marketing
Qlik view selfservicebi_2013-08-01_marketingJennifer Walker
 

Mehr von Jennifer Walker (6)

Apperian 2014 Executive Enterprise Mobility Report
Apperian 2014 Executive Enterprise Mobility ReportApperian 2014 Executive Enterprise Mobility Report
Apperian 2014 Executive Enterprise Mobility Report
 
Apperian 2015 Executive Enterprise Mobility Survey
Apperian 2015 Executive Enterprise Mobility SurveyApperian 2015 Executive Enterprise Mobility Survey
Apperian 2015 Executive Enterprise Mobility Survey
 
Apperian 2016 Executive Enterprise Mobility Report
Apperian 2016 Executive Enterprise Mobility ReportApperian 2016 Executive Enterprise Mobility Report
Apperian 2016 Executive Enterprise Mobility Report
 
Apperian 2017 Executive Enterprise Mobility Report
Apperian 2017 Executive Enterprise Mobility ReportApperian 2017 Executive Enterprise Mobility Report
Apperian 2017 Executive Enterprise Mobility Report
 
How Coca-Cola Started Working with Startups
How Coca-Cola Started Working with StartupsHow Coca-Cola Started Working with Startups
How Coca-Cola Started Working with Startups
 
Qlik view selfservicebi_2013-08-01_marketing
Qlik view selfservicebi_2013-08-01_marketingQlik view selfservicebi_2013-08-01_marketing
Qlik view selfservicebi_2013-08-01_marketing
 

Kürzlich hochgeladen

FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 

Kürzlich hochgeladen (20)

FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

Big Data Management: Work Smarter Not Harder

  • 1. BIG DATA MANAGEMENT WORK SMARTER NOT HARDER GET MORE OUT OF YOUR DATA FOR LESS
  • 2. CONTENTS Long on Data, Short on Resources   1 Know Your Data   3 ii Reducing Data Maintenance Costs   5 Choose Your Data Platform Wisely   8 ii Reigning in Data Growth Costs   10 Don’t Keep What You Don’t Need   11 ii Overcoming Data Growth and Regulatory Compliance Challenges   12 Getting What You Need to Manage Your Data   14 For More Info   15
  • 3. 1 Big Data Management: Work Smarter Not Harder We’ve been deluged with statistics on data’s rapid growth to the point that the numbers and bytes have become almost meaningless. No one would deny that data growth is an unstoppable trend. But that’s not the issue. The real issue is how organizations can make big data meaningful when IT resources are shrinking. The good news is that business users want more data, and they’re getting it, but in some cases, more data is actually having an adverse effect on business. Fifty-six percent of IT decision makers surveyed by IDG said that their users frequently or occasionally report feeling overwhelmed by incoming data and information, while 53% said the influx of large quantities of data has delayed important decisions because they didn’t have the right tools to properly manage it. Leading companies are realizing that having the right technology makes all the difference to assure that data can be used as an asset rather than a liability. LONG ON DATA, SHORT ON RESOURCES 56% 53% of IT decision makers said that their users report feeling overwhelmed by incoming data and information said the influx of data has delayed decisions because they didn’t have the right tools to manage it IDG Enterprise, 2015
  • 4. Data management staff as a percentage of IT staff has risen a meager Computer Economics, 2015 0.5% 2 Big Data Management: Work Smarter Not Harder Long on Data, Short on Resources However, despite the perceived value of data, the allocation of resources to manage and leverage big data has not kept pace with its growth. According to research firm Computer Economics, data management staff as a percentage of IT staff has risen a meager .5% in four years, and IT spending per user continues to decline. In fact, the same study showed that when adjusted for inflation, spending decreased from $10,514 in 2012 per user to just $6,847 in 2015. But it’s not all about the money. Finding people with the necessary skillsets will only grow more challenging. The McKinsey Global Institute predicts that by 2018 the US could face a shortage of 140,000-190,000 people with deep analytical skills as well as a deficit of 1.5 million people who can leverage big data analysis to make effective decisions. This drives the need for automation that reduces the skills and training required to manage data. As is often the case, the best way to address the big data resource and skills shortage is to work smarter — not harder. In this ebook, we look at how IT organizations can manage data smarter — while maintaining or even reducing costs — so that business users can get real value from data, faster and easier.
  • 5. 3 Big Data Management: Work Smarter Not Harder Moving data, transforming data, and making it available to the business is a very expensive process. Given data’s rapid rate of growth — and the amount of waste in the current data management paradigm — it’s time to transform the economics of data. Most enterprises leverage a wide variety of data types in high volumes for big data analytics projects. These include social media data, internal data, log data, mobile device data, sensor data, free public external data — and the list keeps growing. In fact, according to QuinStreet Research, by 2020, the world will generate 50 times as much data as it does today, but the IT staff responsible for managing it will only grow 1.5 times. On top of that challenge, only 40-55% of the data that they load is ever used. When you consider that it costs $2-6 million to support every 50-100 TB of new data, supporting dormant data results in a tremendous amount of inefficiency. KNOW YOUR DATA QuinStreet Enterprise Research, 2014 But the IT staff who manages it will only grow 1.5Xthe world will generate 50Xas much data By 2020
  • 6. 4 Big Data Management: Work Smarter Not Harder Dormant data also slows down performance since the process of loading data uses up to 60% of the CPU. A lot of data may need to be retained in its original form for compliance and undergo ETL and transformation processes for the prospect of using it for other needs, but never get used. As a result, it’s unnecessarily impacting costs and performance. Know Your Data But the exorbitant cost of not managing dormant data well isn’t just about the storage. In fact, it’s less about the storage and more about CPU capacity. Most vendors charge by CPU capacity. As CPU capacity increases, so do your licensing costs. Only 40to 55% of the data companies load will ever be used Every 50-100 TB of new data costs $2-6 Million to support it Cost of Supporting DataData Waste Cost of CPU Loading data uses up to 60% (License costs go up as CPU capacity increases) of the CPU Source: Based on Attunity customer implementations/input worldwide, 2015
  • 7. CUSTOMER SUCCESS STORY By offloading 43% of the EDW into Hadoop $21M $5M DECREASE Source: Based on Attunity customer implementations/input worldwide, 2015 Yearly maintenance costs (in three years!) 5 Big Data Management: Work Smarter Not Harder Reducing Data Maintenance Costs By looking at and analyzing EDW use for just one month, an Attunity customer discovered that 37 TB of data — 43% of the EDW — didn’t receive any kind of analytical query. And yet the CPU consumption to ingest and load the data was over 60%. By offloading that 43% into Hadoop, the customer dramatically decreased the need for more capacity, reduced the number of EDW nodes and lowered maintenance costs. In fact, the customer is looking at driving down yearly maintenance costs from $21 million to $5 million in just three years — all by being more strategic about data management.
  • 8. 6 Big Data Management: Work Smarter Not Harder Know Your Data The data warehouse is a reflection of the business. It grows in response to business needs. It makes sense then to analyze data activity and usage accordingly. When you group applications, data, or users in the context of the business (for example, by department or line of business), you can then begin to analyze utilization and assign accountability via chargeback or showback. For example, when marketing requests more data from IT, the IT department may need to show them how much data hasn’t been used, along with the cost to continue to manage current and new data. When a business can specify how much it costs to load and maintain data, and demonstrate how much isn’t being used, the dataset that seemed so important before may lose some of its significance. The standing request might just lose its urgency, particularly if the cost to keep the data comes out of departmental budgets and ROI is lacking. To figure out what’s used... look at what’s been qu eried Source: Based on Attunity customer implementations/input worldwide, 2015 43% of data in the data warehouse never received a single analytical query in a month
  • 9. Source: Based on Attunity customer implementations/input worldwide, 2015 Identifying dormant data recovers storage capacity nt staff, as a percentage of IT staff, has risen a meager .5% 7 Big Data Management: Work Smarter Not Harder are consuming CPU capacity. If you do need the data, say for regulatory reasons, you can offload the processes of ETL to load and transform the data onto a lower-cost Hadoop cluster. You not only recover storage capacity, but you also consume less CPU capacity on the system because of all the data that you’re not loading and ingesting into an EDW. The key is to gain visibility into the EDW to learn what data is used and what data is unused. Identifying dormant data recovers storage capacity. But it also helps reduce costs related to loading and transforming the data. If you don’t need the data anymore, you can stop loading it, which means you eliminate a portion of the ETL processes that Know Your Data
  • 10. 8 Big Data Management: Work Smarter Not Harder As data grows, the platforms that support it increase in size and multiply because different platforms optimize different workloads. That’s why placing data on the right platform is critical to efficiently managing data as a strategic asset. Enterprises can realize significant benefits by modernizing and optimizing data placement. Not all data is created equal. Some data is of high value and used for complex analytics while other data is kept primarily for regulatory purposes — and then there’s all the data in between. A dataset should be moved to the most appropriate platform based on its use case. CHOOSE YOUR DATA PLATFORM WISELY Data that’s being loaded, but you don’t need for the business Datasets that are being utilized, but don’t require a high-end data warehouse Data that should be maintained, but not used for analytics Archive or throw away Load and maintain in Hadoop Load and run batch analytics in Hadoop
  • 11. 9 Big Data Management: Work Smarter Not Harder Choose Your Data Platform Wisely There are three general types of data platforms: Moving data that’s not queried but still needs to be maintained into a lower cost platform like Hadoop can sometimes help to support and balance data growth. As a result, an enterprise can reduce the need for more storage capacity and the number of EDW nodes. This lowers both maintenance costs and costs related to adding more capacity. The key is to figure out what you’re loading into each of these systems, and move data as necessary to the most appropriate platform. a particular subject area (such as sales or finance). They may be fed by data from a data warehouse or from multiple source systems. Data marts tend to be hosted on typical, run-of- the-mill servers. ƒƒHadoop Hadoop is suitable for structured, unstructured, and semi- structured data, and can run on premises or in the cloud. Hadoop is a great place to load and maintain high volumes of data that should be kept but is not typically used for frequently used, high-end analytics supporting many simultaneous users. Enterprise data warehouse An enterprise data warehouse (EDW) is appropriate for frequently accessed, high-value data used for complex analytics. EDWs are high-end engineered systems designed specifically for complex analytics and many simultaneous users — and they’re priced accordingly. An EDW is a great place to leverage high-value data, but it isn’t the ideal place to store data that you don’t plan to use anytime soon. ‚‚Data mart A data mart is more focused than a data warehouse, consolidating information for
  • 12. CUSTOMER SUCCESS STORY Online Travel Company Optimized data and workloads for Hadoop cluster Reduced data footprint on EDW by 30-40% 10Xin cost savings Reigning in Data Growth Costs An online travel company’s 6+ petabyte production IT systems were growing rapidly within a multi-platform environment that included Hadoop and several legacy data warehouse systems. The DB2 data warehouse was already at 300 TB, and adding more capacity was simply cost prohibitive. Using Attunity Visibility to balance workloads and data across the data warehousing environment had a significant impact on costs associated with data growth. The online travel company reduced its data footprint on the EDW by 30-40%. Offloading data and associated workloads to Hadoop saved the company $6 million. Furthermore, its IT department can ensure that these cost savings are maintained by providing chargeback reports to business lines. By showing business users what data is being used and at what cost, IT can make a case for moving data to lower-cost platforms or making additional investments in IT. 10 Big Data Management: Work Smarter Not Harder
  • 13. 11 Big Data Management: Work Smarter Not Harder Even as you move data to the appropriate platform, it behooves you to consider whether it’s necessary to keep specific datasets at all. There’s great potential to lower costs by purging unused data. Many Attunity customers report that more than one- third of data in the data warehouse never receives a single analytical query in a month. That’s a huge chunk of data — and potential cost savings. In order to determine what data is worth keeping, IT must analyze data usage and collaborate across teams to classify data into four categories: Category 1: Data that doesn’t need to be kept at all and can be purged. This data isn’t used for analysis, and it doesn’t need to be archived. Category 2: Data that must be kept for regulatory or other reasons but isn’t being used for analytical purposes. These datasets do not require a high-end engineered EDW. They can be placed in a Hadoop cluster or something less cost prohibitive. Hadoop is a perfect option because it’s a less expensive system that allows you to continue to do all the data processing and maintenance and still have access to the data, because it’s a live platform. So when you do need the data, you can access it directly in Hadoop or move it into the data warehouse for analysis on premises or in the cloud. DON’T KEEP WHAT YOU DON’T NEED
  • 14. CUSTOMER SUCCESS STORY Large Financial Institution Capped IT infrastructure investment at existing capacity Avoided $15M in upgrade costs Ready to handle faster rates of data growth in the future 12 Big Data Management: Work Smarter Not Harder Overcoming Data Growth and Regulatory Compliance Challenges Data growth made it difficult for a leading national bank to manage data and maintain regulatory compliance. With data growing at 100-150% a year, the bank was quickly running out of capacity. It expected to spend $10–15 million in 12–18 months on hardware upgrades. Meanwhile, IT had no way of tracking who accessed what data at the table and column level, which is necessary to fulfill regulatory compliance and audit requests. Attunity Visibility enables the IT organization to make informed decisions about the datasets and related workloads that can be rebalanced and optimized with Hadoop. As a result, the institution capped its IT infrastructure investment at existing capacity to avoid $15 million in upgrade costs while also empowering its teams to handle faster rates of data growth in the future. Attunity Visibility also helps the bank meet regulatory compliance requirements and respond to audit requests in a timely manner. The solution identifies user activity related to specific customer data at a granular level and generates weekly audit reports.
  • 15. In order to categorize data, you need to understand what the datasets are and what users are doing with them 13 Big Data Management: Work Smarter Not Harder Category 3: Datasets that are analyzed but don’t require an engineered EDW, such as large-scale data extracts for offline analytics. SAS is a good example. Many SAS users access data that’s in a data warehouse, but they don’t do the analytics in the data warehouse. Instead, they extract huge amounts of data into the SAS server for data mining. This use case doesn’t require an engineered system like an EDW. Hadoop does a great job for batch analytics, and it costs less. You can pull huge streams of data back to the SAS server and analyze it there. Category 4: Data that’s widely and repeatedly leveraged by the business, and therefore suitable for storage in your EDW. In order to categorize data, you need to understand what the datasets are and what users are doing with them. You must then get buy-in from the stakeholders. Show usage patterns to the business and collaborate with them to make decisions in an iterative fashion. Over time, the returns are significant. Don’t Keep What You Don’t Need
  • 16. 14 Big Data Management: Work Smarter Not Harder Effective data management requires two primary capabilities:  Integrate and move data more easily across all major relational database systems, enterprise data warehouses, and cloud and big data platforms. ‚‚ Tune performance, optimize data placement, and reduce costs with metrics on how the business is utilizing data and platform resources. In addition to getting real value out of data, effective data management enables IT organizations to reduce big data costs. With visibility into how data is used, IT can work with the business to make informed decisions about what data is worth keeping and how it should be stored, and what data can be purged or archived. This practice has even enabled some IT organizations to cap their IT infrastructure investments at existing capacity. Being called on to do more with less is nothing new for IT. Time and again, IT organizations learn to work smarter and leaner while delivering key services to the business. Big data analytics is no different. GETTING WHAT YOU NEED TO MANAGE YOUR DATA Pre pareData Move D ata Analyze Usage Effective Data Management Capabilities