SlideShare a Scribd company logo
1 of 11
Download to read offline
SAP HANA’s Defining Capabilities
2 SAP HANA’s Defining Capabilities
SAP HANA UNIQUE CAPABILITIES
Overview
The SAP HANA Difference
Architecture
Summary
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
Table of Contents
The information in this document is confidential and proprietary to SAPand may not be disclosed without the permission of SAP. Except for your obligation to protect
confidential information, this document is not subject to your license agreement or any other service or subscription agreement with SAP. SAPhas no obligation to
pursue any course of business outlined in this document or any related document, or to develop or release any functionality mentioned therein.This document, or any
related document and SAP’s strategy and possible future developments, products and or platforms directions and functionality are all subject to change and may be
changed by SAP at any time for any reason without notice.The information in this document is not a commitment, promise or legal obligation to deliver any material, code
or functionality. This document is provided without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability,
fitness for a particular purpose, or non-infringement. This document is for informational purposes and may not be incorporated into a contract. SAP assumes no
responsibility for errors or omissions in this document, except if such damages were caused by SAP’s intentional or gross negligence.
All forward-looking statements are subject to various risks and uncertainties that could cause actual results to differ materially from expectations. Readers are cautioned
not to place undue reliance on these forward-looking statements, which speak only as of their dates, and they should not be relied upon in making purchasing decisions
3
4
8
10
3
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
SAP HANA with its new vision and approach, as with many
things novel and paradigm-shifting, can be met with resistance
and disbelief. Change is not always easy and many who are
comfortable with existing approaches try to compare SAP HANA
with, and to see it in the light of, those older paradigms.This,
however, cannot successfully be done given that SAP HANA
represents a completely new way to address traditional
computing issues.There is a SAP HANA difference and this
paper will focus on some of the unique capabilities of SAP
HANA’s revolutionary approach ­— especially the technological
underpinnings — and how these translates into a very different
way of assessing value for those who deploy it. Some of these
unique SAP HANA capabilities that will be explored here include:
•	 SAP HANA’s Data Locality Aware Processing. SAP
HANA’s CPU-aware optimizer leverages optimizations of
the data’s locality to the CPUs and their caches to reduce
latency of data compute. SAP HANA ensures that data
resides in the DRAM and CPU caches to avoid cache hit
misses, and applies knowledge of all the bandwidth and
latencies between every cache, every CPU, and every node
in the execution plans.
•	 SAP HANA can run real-time OLTP and OLAP on a single
copy of the data. There is no need for unique OLTP and
OLAP copies. Data is stored only once for both
transactional and analytical applications.
•	 Multiple Parallel Processing (MPP) on a shared nothing
architecture along with its support for Single
Instruction, Multiple Data (SIMD).
•	 Even though the SAP HANA true in-memory platform is a
paradigm-shifting new approach, nothing has changed
with regards to ACID compliance or Reliability, Full High
Availability, Disaster Recovery and Supportability. For all its
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
cutting-edge innovations, it is still ACID compliant and it
supports Disaster Recovery (DR) via synchronous or
asynchronous native replication.
•	 Unique on-the-fly schema extension capability allows for
flexible business model changes.
•	 Dynamic Data Tiering optimizing the balance between
data processing and data storage. Data identified as
frequently used is kept in memory. Any data not recently
accessed is purged from memory but persisted on-disk
without write-back.
•	 It is more than just a database. SAP HANA converges
platform, database, data processing capabilities, handles
spatial and textual data analysis, and provides libraries for
predictive, planning and business analytics. All in a single
platform with a built-in application and web server.
•	 SAP HANA provides the most comprehensive data
provisioning for any data from any source.
•	 It is open and agnostic meaning any application and any
data. SAP HANA runs on commodityx86 based hardware
from SAP partners. It supports a wide variety of languages,
certified 3rd party tools, and custom apps created in
almost any language. It is data agnostic with support for
structured, unstructured (i.e. text), spatial, document data,
and sparse data.
•	 SAP HANA has extreme linear scalability scaling up and
scaling out. Adding more nodes to handle more data,
keeping response time relatively unchanged.
Let’s examine these unique and game-changing capabilities in
more detail.
Overview
SAP HANA’s Defining Capabilities
4 SAP HANA’s Defining Capabilities
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
The SAP HANA Difference
SAP HANA is more than just a database,
but at its heart there is columnar database
with a data processing engine at its core.
And one that is constantly evolving.Two
years ago, when it first came out, SAP
HANA was capable of scanning data at 2.0
billion integers per second per core,
already an impressive rate. Today, this has
increased to 3.2 billion integers per
second, per core.This continued drive for
improvement is important because, all
data in SAP HANA gets represented as
integers.This means all data is capable of
being scanned at this incredible speed.
Scanning, while it is a good speed test,
does not indicate actual processing speed.
SAP HANA is also capable of handling 12.5
million aggregations per second per core.
A standard SAP HANA node (i.e. single
server) has 40 cores. Meaning that a single
SAP HANA node can scan data at 128
billion integers per second and aggregate
data at 158 million integers per second.
Where this gets really interesting is
realizing that SAP HANA can ingest data at
1.5 million records (i.e. rows of data) per
second per node, all the while maintaining
and supporting all the ACID properties.
•	 Atomicity: Requires that each
transaction is “all or nothing”. If one
part of the transaction fails, the
entire transaction fails, and the
database state is left unchanged.
•	 Consistency: Ensures that any
transaction will bring the database
from one valid state to another. Any
data written to the database must
be valid according to all defined
rules, including but not limited to,
constraints, cascades, triggers, and
any combination thereof.
•	 Isolation: Ensures that the
concurrent execution of
transactions results in a system
state that would be obtained if
transactions were executed serially.
•	 Durability: Means that once a
transaction has been committed, it
will remain so, even in the event of
power loss, crashes, or errors.
So while it can be said that SAP HANA is
a database capable of running any
application that a relational database can
run, that is just the beginning. Before you
read any further, please look at the
‘Durability’point in the ACID list just above
and recognize that SAP HANA is ACID
compliant and does not lose data in event
of a power loss (or any other type of
failure).Just like any other ACID compliant
database, SAP HANA writes any changes
to its data to non-volatile memory before
acknowledging success. SAP has
‘Durability’and it is fully recoverable.
SAP HANA is more than just a
database. The danger of viewing SAP
HANA simply as a database is that it
associates SAP HANA with the
complexity of most databases. Most
databases have the need for tuning with
indexes, specialty purchased and task
configured hardware, and creation of
aggregate tables to maintain the speed
of transactional systems or to “pre-
answer” queries in analytic ones. But
SAP HANA has none of these drawbacks.
The speed of SAP HANA, while game-
changing and beneficial (cost of FTEs
staring at hour glasses adds up), allows it
to run without indexes, aggregates, or
specially configured hardware and
software. You don’t need to create
indexes to make SAP HANA achieve the
astounding read speeds noted earlier in
SERVER 1
SERVER 2
SERVER 3
SERVER 4
SERVER 5
SERVER 6
Cold Standby Server
SharedStorage
HighAvailability
6 Hours Immediate
SAP
HANA
Traditional: Separated OLTP + OLAP OLTP + OLAP in SAP HANA
Current
Data
24 HR Old
Data
5SAP HANA’s Defining Capabilities
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
the paper. You don’t need to create
materialized aggregates to make the
system achieve acceptable performance.
You don’t need to configure your
hardware and database based on
whether the application will be used for
OLTP versus OLAP (a.k.a. analytics).
Just load data into SAP HANA (or create
virtualized tables to access data in
remote source systems locally) and it will
run faster than any other database –
period. This also means that it costs less
to run, but again more on that later.
One of the benefits of SAP HANA’s
speed is that it is schema agnostic. What
this means is that your OLTP schema
doesn’t need denormalization in order to
perform. This reduces development time
and makes the schema self-describing
and thus supportive of agile
methodologies. Schema agnostic in SAP
HANA also means that you can do full
analytics on this same schema. Let this
sink in. You no longer need to ETL data
into an operational data store in order to
off-load processing from the OLTP
system. That is half the hardware costs
and requires no administrators to
monitor and maintain a second system.
Additionally, analytics data is now 100%
real-time – no batch. Need to know what
orders are open — just ask. Note that
other vendors entering the market are
making similar sounding claims, but with
dramatically fewer benefits. They are
putting very fast columnar tables
alongside their OLTP row tables. This is
good. A second server for operational
reporting is no longer needed. That said,
the rest of the benefits SAP HANA
provides by being schema agnostic are
lost in these competitive systems. The
OLTP data still needs to be migrated to
the read-optimized columnar tables and
thus losing the real-time benefit.
Additional storage is needed as the data
is duplicated. Administrative costs are
higher as you need to monitor and
administer the change data capture
process on the row tables and the delta
data activation process on the columnar
store. How does SAP HANA avoid all this
and provide superior benefits? Simple.
SAP HANA’s table structure has the
OLTP speed of row tables and the read
speed of columnar tables. There is no
cut-over needed, no row table to
The SAP HANA Difference
Easily migrate your applications
(e.g.L: Java, PHP, .NET) in almost any lan-
guage, PHP, Ruby, Java, C,...the list goes on:
•	 Support for ANSI SQL, ODBC,JDBC,
Odata/JSON, and certified 3rd party tools.
•	 Support more standards: JSON and
XMLA over HTTP so it is truly multi-
dimensional platform.
Build new web applications with any open
source
HTML5/JS libraries, Server Side Java Script.
Open Cloud Partner Program
Allows you to select the best SAP HANA cloud
option from partners.
Support advanced text analytics:
Analyze text in all columns of table and text
inside binary files with advanced text analytic
capabilities such as: automatically detecting
31 languages, fuzzy, linguistics, synonymous
search, using SQL.
Analyze streaming data
From integrated ESP in combination with data
in SAP HANA.
Process geospatial data
ODBO http(s), OData/JSON
Browser / Mobile
Web JS Lib Data Viz Lib
Embed sentiment fact extraction in same SQL
App Services
(Web Server)
Any HTML5/JS
Library
SQL Script
DB Services
SAP HANA
Third Party &
Custom Application
HTTP)S), OData, XML/A, ODBC,
JDBC, ADSC, ODBO MDX, SLQ
ODBC, JDBC
Web App Server
Embed fuzzy text search in same SQL
CREATE FULLTEXT INDEX i1 ON PSA_TRANSACTION(
AMOUNT, TRAN_DATE, POST_DATE, DESCRIPTION,
CATEGORY_TEXT ) FUZZY SEARCH INDEX ON SYNC;
SELECT SCORE() AS SCR, * FROM
"SYSTEM"."PSA_TRANSACTION" WHERE CONTAINS (*,
'Sarvice', fuzzy) ORDER BY SCR DESC;
Any Data
Text
Data
Geospatial
Data
Smart
Meter
Mobile Point of
Sale
RFID Machine
Data
Connected
Vehicles
Structured
Data
Click-
Stream
Social
Network
Customer
Data
Embed geospatial in same SQL
6 SAP HANA’s Defining Capabilities
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
columnar table conversion needed, no
batch building for columnar
compression, no nothing. The same
table supports OLTP and OLAP Analytics
– at the same time – with full ACID
properties. This means real real-time
and real-value.
Another advantage SAP HANA has over
traditional databases is openness. SAP
HANA is open and agnostic. It supports
ANSI compliant SQL, ODBC and JDBC
protocols, and a wide range of certified
3rd party query tools, ETLtools, back-up
and recovery tools, monitoring tools, and
even applications.Yes, even ones that
compete directly with SAP applications.
SAP HANA supports custom applications
built on SAP HANA in any language
including PHP, Ruby, Python,Java, C, and
of course,ABAP.And SAPHANA also
supports MDX over ODBO and XMLA over
HTTP if you want to access SAP HANA as a
multi-dimensional database. SAP HANA is
data agnostic as well, not only supporting
tables, but unstructured data (i.e. text),
spatial data, document data, and sparse
data as well.And SAP is getting ready to
release support for property graph data in
SAP HANA as well.With unstructured data,
SAP HANA supports ingestion of a huge
number of document types including all
the usual suspects, language identification
for 31 languages, entity extraction for three
languages, and sentiment analysis for the
top five spoken languages. For spatial
data, SAP HANA is not only screaming fast
but also ships with a geo-coding service,
polygons for all major political and
geographic areas, and an industry
compliant integration layer supporting
ESRI, Navtec, and Google.As for document
data, SAP HANA supports complete
transactions using JSON or XML. For
sparse data with varying attributes, SAP
HANA supports flexible tables in which
columns can be dynamically added on
Insert or Update.All vitally important non-
structured data types explode.
There are several new specialized
databases built specifically for these
data types that have allowed for
applications to address the 80% of all
data that isn’t structured (i.e. not
relational). These purpose built
databases have provided, huge
innovation leaps. But using them is still a
separate database approach with all its
inherent issues: The need for separate
servers, data to be copied between the
systems, and applications having to
merge the data coming from the
different systems via in-app code. All a
lot of work and effort. This may be fine if
you are a specialized web start-up, but
definitely not if you are an enterprise.
Additionally, these complex
infrastructures reduce agility. By being in
the single SAP HANA data platform,
developers can easily take advantage of
data of any kind to build next generation
applications. Data of heterogeneous
data types can be easily combined,
processed, and leveraged.
SAP HANA also brings the analytics
code closer to the data. In and of itself, this
isn’t new or revolutionary. Stored
procedures have been around forever. The
unique aspect is the offering and provision
The SAP HANA Difference
Any Apps
Any App Server
SAP Business Suite and
BW ABAP App Server
SQL MDX JSON
Open
ConnectivityR
HANA PLATFORM
SQL, SQLScript, JavaScript
Integration Services
Spatial Search Text Mining
Stored Procedure
& Data Models
Application & UI
Services
Business Function
Library
Predictive Analysis
Library
Database
Services
Planning
Engine
Rules
Engine
Transaction Unstructured Machine HADOOP Real-time Locations Other Apps
Supports
any Device
7SAP HANA’s Defining Capabilities
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
of pre-built libraries and the speed with
which SAP HANA can run the analytics.
There is a Predictive Analysis Library
(PAL) which supports 27 data preparation
and data mining functions including
clustering and classification algorithms,
regression methods, decision trees, and
forecasting methods. These are all natively
compiled and parallelized as part of the
query execution plan determined by the
SAP HANA optimizer providing blinding
speed. There is also a Business Function
Library (BFL) that contains 55 data
intensive functions that would slow down
traditional three tier applications given
their need for data movement from the
database tier to the app tier.Things like
currency conversion, depreciation, and
moving median calculations.And new
libraries are being added to SAP HANA for
data quality and data transformations.
There is also a high speed integration to
“R”that lets you utilize the over 3500
algorithms that have been built on this
open source analytics platform to leverage
the real-time capability.
Individually these innovations are great,
but what is really amazing is that that you
can use them together. SAP HANA makes
this not only possible, but even easy.Want
to show a billion pieces of data on a map
showing visualizations based on
clustering analysis of that data? How
about a financial system that takes every
transaction for the quarter up until this
last second, does a synthetic close on the
books, and then forecasts where the
financials will be at quarter end?
So SAP HANA is a fully ACID
compliant database that supports real-
time OLTP and OLAP (analytics) without
duplicate copies of the data. It supports
every major data type along with text
analytics and geo-spatial capabilities. It
supports all major industry standard
query languages, and comes with highly
optimized industry functions and
algorithms to build class best
performance (think 1000s of times
faster). And it does this without indexes
or aggregates. When these are
combined in totality, you have a system
that uses less hardware and requires
less maintenance and saves you money
while providing game-changing
capabilities and agility.
The SAP HANA Difference
8 SAP HANA’s Defining Capabilities
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
SAP HANA’s speed is often referred to
as being “in-memory”, where all data can
be placed in the computer’s RAM and
thus avoid slow disk reads. This may be
true, but overshadows other huge
innovations developed by the HANA
team in partnership with Intel. These
innovations deliver orders of magnitude
faster performance over just putting data
a big cache.
SAP HANA’s version of in-memory
works by:
1.	 Having a SAP HANA optimizer that
leverages knowledge of the data’s
locality to the CPUs and their caches.
It leverages this information along
with knowledge of all the bandwidth
and latencies between every cache,
every core, every CPU, and every
node to formulate the most
performant execution plans
2.	 Aligning all data in-memory with the
processor’s cache-line size (the
amount of data always fetched by the
processor’s memory controller). This
allows data to be pipelined from
RAM, through the caches, and into a
core’s registers.
3.	 Aligning data in-memory in vectors
(columns) allowing data to be
compressed highly efficiently by data
type. Therefore SAP HANA is able to
pipeline much larger amounts of data
to the core’s registers.
4.	 SAP HANA acts on compressed data
whenever possible including scans,
aggregates, and joins.
When processing data, SAP HANA
leverages Intel’s SSE vector processing
technology (i.e. SIMD instruction set) to
process multiple pieces of data in a single
instruction cycle. Since SAP HANA uses
variable length integer encoding, this
means that for lower cardinality columns,
it can process 10s of integers per register
per instruction cycle.
Now that we’ve established where SAP
HANA’s speed comes from, we can
address perceived concerns around out-
of-memory conditions, scalability, and data
center readiness. SAP HANA is resilient.
When available memory is filling up, SAP
HANA purges any column partition,
column, or table that hasn’t been accessed
recently from memory. It is not lost.This is
efficient as the data is already persisted
on-disk and no write-back is needed.
BLOBs really don’t need to be in-memory
once their entities have been extracted, so
these can be set to only reside on disk
saving memory. In most instances SAP
recommends leaving all data in memory,
but you don’t have to and be less
frequently accessed data can be stored in
systems such as Hadoop and SAP Sybase
IQ. Further SAP HANA also doesn’t need
that much memory to process very large
amounts of data. Most DBAs quote their
databases in total size, including indexes
and aggregates. So a 1TB database
typically only has in approximately 500 GB
of raw data. SAP HANA achieves
compression of OLTP data in the 3-6X
range and data warehousing data in the
10-20X range. To be conservative, let’s
assume 5X compression. This makes the
500GB of raw data 100 GBs. Now, SAP
HANA doesn’t only store data in-memory,
it also processes all its interim datasets in-
memory, so we recommend a SAP HANA
instance memory of 2X the compressed
raw data, or 200 GB in our example. This
means that you can get all the benefits
SAP HANA offers for a 1TB existing
database in a 200GB SAP HANA node.
Said another way, a standard 1TB SAP
HANA node is capable of handling a 5TB
database on migration. SAP HANA is also
fully elastically scalable allowing you to add
nodes to accommodate more memory
and data. There are certified
configurations up to 56 nodes and 56 TB
of memory and SAP has built a 100 node
100 TB system on which a 1 PB benchmark
(20x compression) was run. Conceive of
needing even more than 56 nodes? Just
ask and SAP will certify more.
What is truly incredible is the linearity
SAP HANA exhibited during scaling tests
Architecture
SOURCE
SSE2 OP
DEST
CLOCK
CYCLE 1
127 0
X4 X3 X2 X1
Y4 Y3 Y2 Y1
X4opY4 X3opY3 X2opY2 X1opY1
Single Node SIMD Operation
9SAP HANA’s Defining Capabilities
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
on the 100 node system noted above.
As you can see from the graph above,
SAP HANA’s response time was essentially
flat as data and nodes were doubled.
Simply stated, scaling SAP HANA just
takes math.Want to double the speed of
your queries? Double the nodes? Want to
double your users and maintain response
times on the same data size? This is
because SAP HANA is able to fully
parallelize any query and fully saturate its
CPU cores. This means that SAP HANA is
fully utilizing all available resources while
wasting none. It means that SAP HANA is
always going to solve your query as fast as
is possible. If you have large concurrency
on SAP HANA, its execution plans will use
fewer cores and memory to allow all users
to be serviced. And a planned innovation
will make SAP HANA even faster. As noted
by Hasso Plattner at this past year’s
Architecture
Sapphire conference in Orlando, FL, SAP
has developed a new object aware cache
that is capable of handling delta changes.
This means that similar queries should get
millisecond response times even under
load once this innovation is released.
SAP HANA offers reliability, full high
availability, disaster recovery (DR) and
supportability. So from a data center
readiness perspective, SAP HANA is
complete. SAP HANA fully supports fail-
over nodes so that if a node fails for any
reason a backup node will kick in. Since
SAP HANA is in-memory, performance
will be slowed to disk-database speeds
until all data is moved from disk to
memory. Note that the fail-over node is
available immediately and data will load
as queries require it. SAP HANA also
supports DR via either synchronous or
asynchronous native replication. This
allows secondary SAP HANA systems to
be located continents away providing
protection from any natural or political
disasters. There are many options for DR
that can allow for near zero down time
upgrades or use of the secondary system
for use as test and dev. Our partners are
also offering log shipping as another DR
option and efficient use of hardware.
As noted earlier, SAP HANA is open
and agnostic and that goes for
hardware as well. SAP HANA runs on
“commodity” components assembled
by our hardware partners to the
minimum specifications given to them
by SAP. These specifications allow very
different architectures by each partner
providing our customers both choice
and a competitive market. We recently
announced an initiative allowing
customers to use their own storage
systems as long as they meet SAP
HANA’s specifications. And SAP HANA
can also be run virtualized to support
testing and dev. All these innovations
are designed to keep hardware costs
down. Lower costs does not mean
cheap or chintzy. SAP HANA hardware
is not inexpensive as standard
components are enterprise class and
the servers do have lots of memory.
But, using compression, hardware and
storage can be conservatively sized.
And you do get what you pay for. As an
instance can handle mixed workloads
of OLTP and operational reporting, one
less server is required. And as SAP
HANA doesn’t require indexes and
aggregates, data duplicates in row and
columnar structures, customers report
lower operating costs. So you get good
value and good TCO. Take SAP’s own
experience in migrating its CRM
system. SAP saw a 30% reduction in
hardware costs due to no longer
needing a separate system for
operational reporting.
16 nodes
(100 billion rows)
51 nodes
(650 billion rows)
95 nodes
(1,200 billion rows)
Extreme Linear Scalability
Query processing time (in seconds)
Query 1 Query 2 Query 3
0.425
0.266 0.142
0.502
3.102
0.491
3.249
0.7
3.816
Sales and Distribution Reports
Query 1: Single Customer and Material for One Month
Query 2: Range of Customers and Materials for Six Months
Query 3: Year-Over-Year Trending Report for Top 100 Customers for Five Years
10
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
Summary
SAP HANA represents such a huge shift in
data management, application
development, and analytics that it is a
paradigm shift.And like most things new
and earthshattering it is often hard to
believe or accept at face value, especially
when looked at through the lenses of
existing paradigms. SAP HANA takes the
most important innovations in hardware
architectures, columnar and row
databases, text analytics, parallel
processing, and much more and blends
them together in an elegantly designed
software platform and appliance. SAP
HANA is convergence. SAP HANA
represents a huge shift for enterprise
computing. SAP HANA is the first
technology in decades that will
simultaneously allow landscape
simplification (i.e. fewer servers) and rapid
innovation. Said another way, SAP HANA
is the next generation data platform.
SAP HANA’s Defining Capabilities
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
13/08
Š 2013 SAP AG or an SAP affiliate company. All rights reserved.
No part of this publication may be reproduced or transmitted in any
form or for any purpose without the express permission of SAP AG.
The information contained herein may be changed without prior notice.
Some software products marketed by SAP AG and its distributors contain
proprietary software components of other software vendors. National
product specifications may vary.
These materials are provided by SAP AG and its affiliated companies
(“SAP Group”) for informational purposes only, without representation
or warranty of any kind, and SAP Group shall not be liable for errors or
omissions with respect to the materials. The only warranties for SAP
Group products and services are those that are set forth in the express
warranty statements accompanying such products and services, if
any. Nothing herein should be construed as constituting an additional
warranty.
SAP and other SAP products and services mentioned herein as well as
their respective logos are trademarks or registered trademarks of SAP AG
in Germany and other countries.
Please see
http://www.sap.com/corporate-en/legal/copyright/index.epx#trademark
for additional trademark information and notices.
www.sap.com/contactsap

More Related Content

What's hot

What's hot (20)

SAP HANA Overview
SAP HANA OverviewSAP HANA Overview
SAP HANA Overview
 
CIO Guide to Using SAP HANA Platform For Big Data
CIO Guide to Using SAP HANA Platform For Big DataCIO Guide to Using SAP HANA Platform For Big Data
CIO Guide to Using SAP HANA Platform For Big Data
 
SAP HANA Interview questions
SAP HANA Interview questionsSAP HANA Interview questions
SAP HANA Interview questions
 
What's new on SAP HANA Smart Data Access
What's new on SAP HANA Smart Data AccessWhat's new on SAP HANA Smart Data Access
What's new on SAP HANA Smart Data Access
 
What's New for SAP HANA Smart Data Integration & Smart Data Quality
What's New for SAP HANA Smart Data Integration & Smart Data QualityWhat's New for SAP HANA Smart Data Integration & Smart Data Quality
What's New for SAP HANA Smart Data Integration & Smart Data Quality
 
SAP HANA SPS10- Security
SAP HANA SPS10- SecuritySAP HANA SPS10- Security
SAP HANA SPS10- Security
 
Sizing sap s 4 hana using the quick sizer tool
Sizing sap s 4 hana using the quick sizer toolSizing sap s 4 hana using the quick sizer tool
Sizing sap s 4 hana using the quick sizer tool
 
SAP HANA SPS09 - HANA IM Services
SAP HANA SPS09 - HANA IM ServicesSAP HANA SPS09 - HANA IM Services
SAP HANA SPS09 - HANA IM Services
 
Autodesk Technical Webinar: SAP HANA in-memory database
Autodesk Technical Webinar: SAP HANA in-memory databaseAutodesk Technical Webinar: SAP HANA in-memory database
Autodesk Technical Webinar: SAP HANA in-memory database
 
Sap hana platform sps 11 introduces new sap hana hadoop integration features
Sap hana platform sps 11 introduces new sap hana hadoop integration featuresSap hana platform sps 11 introduces new sap hana hadoop integration features
Sap hana platform sps 11 introduces new sap hana hadoop integration features
 
SAP HANA SPS09 - Dynamic Tiering
SAP HANA SPS09 - Dynamic TieringSAP HANA SPS09 - Dynamic Tiering
SAP HANA SPS09 - Dynamic Tiering
 
SDA - POC
SDA - POCSDA - POC
SDA - POC
 
HANA SPS07 Smart Data Access
HANA SPS07 Smart Data AccessHANA SPS07 Smart Data Access
HANA SPS07 Smart Data Access
 
SAP HANA – A Technical Snapshot
SAP HANA – A Technical SnapshotSAP HANA – A Technical Snapshot
SAP HANA – A Technical Snapshot
 
Why SAP HANA?
Why SAP HANA?Why SAP HANA?
Why SAP HANA?
 
Leveraging SAP, Hadoop, and Big Data to Redefine Business
Leveraging SAP, Hadoop, and Big Data to Redefine BusinessLeveraging SAP, Hadoop, and Big Data to Redefine Business
Leveraging SAP, Hadoop, and Big Data to Redefine Business
 
SAP HANA SPS09 - Smart Data Streaming
SAP HANA SPS09 - Smart Data StreamingSAP HANA SPS09 - Smart Data Streaming
SAP HANA SPS09 - Smart Data Streaming
 
Best Practices to Administer, Operate, and Monitor an SAP HANA System
Best Practices to Administer, Operate, and Monitor an SAP HANA SystemBest Practices to Administer, Operate, and Monitor an SAP HANA System
Best Practices to Administer, Operate, and Monitor an SAP HANA System
 
SAP HANA Overview
SAP HANA OverviewSAP HANA Overview
SAP HANA Overview
 
Spark Usage in Enterprise Business Operations
Spark Usage in Enterprise Business OperationsSpark Usage in Enterprise Business Operations
Spark Usage in Enterprise Business Operations
 

Similar to 209 hana-defining-capability-whitepaper

Sap hana by jeff_word
Sap hana by jeff_wordSap hana by jeff_word
Sap hana by jeff_word
Sunil Joshi
 
HANA Demystified by DataMagnum
HANA Demystified by DataMagnumHANA Demystified by DataMagnum
HANA Demystified by DataMagnum
Prasad Mavuduri
 
5507832a c074-4013-9d49-6e58befa9c3e-161121113026
5507832a c074-4013-9d49-6e58befa9c3e-1611211130265507832a c074-4013-9d49-6e58befa9c3e-161121113026
5507832a c074-4013-9d49-6e58befa9c3e-161121113026
Krishna Kiran
 
How is sap data services unique for sap hana integration
How is sap data services unique for sap hana integrationHow is sap data services unique for sap hana integration
How is sap data services unique for sap hana integration
Flavio Alejandro Corradini
 

Similar to 209 hana-defining-capability-whitepaper (20)

SAP Lambda Architecture Point of View
SAP Lambda Architecture Point of ViewSAP Lambda Architecture Point of View
SAP Lambda Architecture Point of View
 
SAP HANA SPS09 - SAP HANA Workload Management
SAP HANA SPS09 - SAP HANA Workload ManagementSAP HANA SPS09 - SAP HANA Workload Management
SAP HANA SPS09 - SAP HANA Workload Management
 
What is Sap HANA Convista Consulting Asia.pdf
What is Sap HANA Convista Consulting Asia.pdfWhat is Sap HANA Convista Consulting Asia.pdf
What is Sap HANA Convista Consulting Asia.pdf
 
SAP HANA Accelerator for SAP ASE
SAP HANA Accelerator for SAP ASESAP HANA Accelerator for SAP ASE
SAP HANA Accelerator for SAP ASE
 
What Is SAP HANA And Its Benefits?
What Is SAP HANA And Its Benefits?What Is SAP HANA And Its Benefits?
What Is SAP HANA And Its Benefits?
 
Sap hana by jeff_word
Sap hana by jeff_wordSap hana by jeff_word
Sap hana by jeff_word
 
DMM205.pdf
DMM205.pdfDMM205.pdf
DMM205.pdf
 
SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707
 
Sap bw4 hana
Sap bw4 hanaSap bw4 hana
Sap bw4 hana
 
HANA Demystified by DataMagnum
HANA Demystified by DataMagnumHANA Demystified by DataMagnum
HANA Demystified by DataMagnum
 
5507832a c074-4013-9d49-6e58befa9c3e-161121113026
5507832a c074-4013-9d49-6e58befa9c3e-1611211130265507832a c074-4013-9d49-6e58befa9c3e-161121113026
5507832a c074-4013-9d49-6e58befa9c3e-161121113026
 
HANA
HANAHANA
HANA
 
Analytics Products L2 public 2020-23 Black.pptx
Analytics Products L2 public 2020-23 Black.pptxAnalytics Products L2 public 2020-23 Black.pptx
Analytics Products L2 public 2020-23 Black.pptx
 
Why is Smooth Transition to SAP Hana Migration Important?
Why is Smooth Transition to SAP Hana Migration Important?Why is Smooth Transition to SAP Hana Migration Important?
Why is Smooth Transition to SAP Hana Migration Important?
 
Dev207 berlin
Dev207 berlinDev207 berlin
Dev207 berlin
 
SAP HANA SPS08 Overview
SAP HANA SPS08 OverviewSAP HANA SPS08 Overview
SAP HANA SPS08 Overview
 
HANA SPS07 Architecture & Landscape
HANA SPS07 Architecture & LandscapeHANA SPS07 Architecture & Landscape
HANA SPS07 Architecture & Landscape
 
S4 1610 business value l1
S4 1610 business value l1S4 1610 business value l1
S4 1610 business value l1
 
How is sap data services unique for sap hana integration
How is sap data services unique for sap hana integrationHow is sap data services unique for sap hana integration
How is sap data services unique for sap hana integration
 
Dmm302 - Sap Hana Data Warehousing: Models for Sap Bw and SQL DW on SAP HANA
Dmm302 - Sap Hana Data Warehousing: Models for Sap Bw and SQL DW on SAP HANA Dmm302 - Sap Hana Data Warehousing: Models for Sap Bw and SQL DW on SAP HANA
Dmm302 - Sap Hana Data Warehousing: Models for Sap Bw and SQL DW on SAP HANA
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

209 hana-defining-capability-whitepaper

  • 2. 2 SAP HANA’s Defining Capabilities SAP HANA UNIQUE CAPABILITIES Overview The SAP HANA Difference Architecture Summary Š 2013 SAP AG or an SAP affiliate company. All rights reserved. Table of Contents The information in this document is confidential and proprietary to SAPand may not be disclosed without the permission of SAP. Except for your obligation to protect confidential information, this document is not subject to your license agreement or any other service or subscription agreement with SAP. SAPhas no obligation to pursue any course of business outlined in this document or any related document, or to develop or release any functionality mentioned therein.This document, or any related document and SAP’s strategy and possible future developments, products and or platforms directions and functionality are all subject to change and may be changed by SAP at any time for any reason without notice.The information in this document is not a commitment, promise or legal obligation to deliver any material, code or functionality. This document is provided without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. This document is for informational purposes and may not be incorporated into a contract. SAP assumes no responsibility for errors or omissions in this document, except if such damages were caused by SAP’s intentional or gross negligence. All forward-looking statements are subject to various risks and uncertainties that could cause actual results to differ materially from expectations. Readers are cautioned not to place undue reliance on these forward-looking statements, which speak only as of their dates, and they should not be relied upon in making purchasing decisions 3 4 8 10
  • 3. 3 Š 2013 SAP AG or an SAP affiliate company. All rights reserved. SAP HANA with its new vision and approach, as with many things novel and paradigm-shifting, can be met with resistance and disbelief. Change is not always easy and many who are comfortable with existing approaches try to compare SAP HANA with, and to see it in the light of, those older paradigms.This, however, cannot successfully be done given that SAP HANA represents a completely new way to address traditional computing issues.There is a SAP HANA difference and this paper will focus on some of the unique capabilities of SAP HANA’s revolutionary approach ­— especially the technological underpinnings — and how these translates into a very different way of assessing value for those who deploy it. Some of these unique SAP HANA capabilities that will be explored here include: • SAP HANA’s Data Locality Aware Processing. SAP HANA’s CPU-aware optimizer leverages optimizations of the data’s locality to the CPUs and their caches to reduce latency of data compute. SAP HANA ensures that data resides in the DRAM and CPU caches to avoid cache hit misses, and applies knowledge of all the bandwidth and latencies between every cache, every CPU, and every node in the execution plans. • SAP HANA can run real-time OLTP and OLAP on a single copy of the data. There is no need for unique OLTP and OLAP copies. Data is stored only once for both transactional and analytical applications. • Multiple Parallel Processing (MPP) on a shared nothing architecture along with its support for Single Instruction, Multiple Data (SIMD). • Even though the SAP HANA true in-memory platform is a paradigm-shifting new approach, nothing has changed with regards to ACID compliance or Reliability, Full High Availability, Disaster Recovery and Supportability. For all its Š 2013 SAP AG or an SAP affiliate company. All rights reserved. cutting-edge innovations, it is still ACID compliant and it supports Disaster Recovery (DR) via synchronous or asynchronous native replication. • Unique on-the-fly schema extension capability allows for flexible business model changes. • Dynamic Data Tiering optimizing the balance between data processing and data storage. Data identified as frequently used is kept in memory. Any data not recently accessed is purged from memory but persisted on-disk without write-back. • It is more than just a database. SAP HANA converges platform, database, data processing capabilities, handles spatial and textual data analysis, and provides libraries for predictive, planning and business analytics. All in a single platform with a built-in application and web server. • SAP HANA provides the most comprehensive data provisioning for any data from any source. • It is open and agnostic meaning any application and any data. SAP HANA runs on commodityx86 based hardware from SAP partners. It supports a wide variety of languages, certified 3rd party tools, and custom apps created in almost any language. It is data agnostic with support for structured, unstructured (i.e. text), spatial, document data, and sparse data. • SAP HANA has extreme linear scalability scaling up and scaling out. Adding more nodes to handle more data, keeping response time relatively unchanged. Let’s examine these unique and game-changing capabilities in more detail. Overview SAP HANA’s Defining Capabilities
  • 4. 4 SAP HANA’s Defining Capabilities Š 2013 SAP AG or an SAP affiliate company. All rights reserved. The SAP HANA Difference SAP HANA is more than just a database, but at its heart there is columnar database with a data processing engine at its core. And one that is constantly evolving.Two years ago, when it first came out, SAP HANA was capable of scanning data at 2.0 billion integers per second per core, already an impressive rate. Today, this has increased to 3.2 billion integers per second, per core.This continued drive for improvement is important because, all data in SAP HANA gets represented as integers.This means all data is capable of being scanned at this incredible speed. Scanning, while it is a good speed test, does not indicate actual processing speed. SAP HANA is also capable of handling 12.5 million aggregations per second per core. A standard SAP HANA node (i.e. single server) has 40 cores. Meaning that a single SAP HANA node can scan data at 128 billion integers per second and aggregate data at 158 million integers per second. Where this gets really interesting is realizing that SAP HANA can ingest data at 1.5 million records (i.e. rows of data) per second per node, all the while maintaining and supporting all the ACID properties. • Atomicity: Requires that each transaction is “all or nothing”. If one part of the transaction fails, the entire transaction fails, and the database state is left unchanged. • Consistency: Ensures that any transaction will bring the database from one valid state to another. Any data written to the database must be valid according to all defined rules, including but not limited to, constraints, cascades, triggers, and any combination thereof. • Isolation: Ensures that the concurrent execution of transactions results in a system state that would be obtained if transactions were executed serially. • Durability: Means that once a transaction has been committed, it will remain so, even in the event of power loss, crashes, or errors. So while it can be said that SAP HANA is a database capable of running any application that a relational database can run, that is just the beginning. Before you read any further, please look at the ‘Durability’point in the ACID list just above and recognize that SAP HANA is ACID compliant and does not lose data in event of a power loss (or any other type of failure).Just like any other ACID compliant database, SAP HANA writes any changes to its data to non-volatile memory before acknowledging success. SAP has ‘Durability’and it is fully recoverable. SAP HANA is more than just a database. The danger of viewing SAP HANA simply as a database is that it associates SAP HANA with the complexity of most databases. Most databases have the need for tuning with indexes, specialty purchased and task configured hardware, and creation of aggregate tables to maintain the speed of transactional systems or to “pre- answer” queries in analytic ones. But SAP HANA has none of these drawbacks. The speed of SAP HANA, while game- changing and beneficial (cost of FTEs staring at hour glasses adds up), allows it to run without indexes, aggregates, or specially configured hardware and software. You don’t need to create indexes to make SAP HANA achieve the astounding read speeds noted earlier in SERVER 1 SERVER 2 SERVER 3 SERVER 4 SERVER 5 SERVER 6 Cold Standby Server SharedStorage HighAvailability 6 Hours Immediate SAP HANA Traditional: Separated OLTP + OLAP OLTP + OLAP in SAP HANA Current Data 24 HR Old Data
  • 5. 5SAP HANA’s Defining Capabilities Š 2013 SAP AG or an SAP affiliate company. All rights reserved. the paper. You don’t need to create materialized aggregates to make the system achieve acceptable performance. You don’t need to configure your hardware and database based on whether the application will be used for OLTP versus OLAP (a.k.a. analytics). Just load data into SAP HANA (or create virtualized tables to access data in remote source systems locally) and it will run faster than any other database – period. This also means that it costs less to run, but again more on that later. One of the benefits of SAP HANA’s speed is that it is schema agnostic. What this means is that your OLTP schema doesn’t need denormalization in order to perform. This reduces development time and makes the schema self-describing and thus supportive of agile methodologies. Schema agnostic in SAP HANA also means that you can do full analytics on this same schema. Let this sink in. You no longer need to ETL data into an operational data store in order to off-load processing from the OLTP system. That is half the hardware costs and requires no administrators to monitor and maintain a second system. Additionally, analytics data is now 100% real-time – no batch. Need to know what orders are open — just ask. Note that other vendors entering the market are making similar sounding claims, but with dramatically fewer benefits. They are putting very fast columnar tables alongside their OLTP row tables. This is good. A second server for operational reporting is no longer needed. That said, the rest of the benefits SAP HANA provides by being schema agnostic are lost in these competitive systems. The OLTP data still needs to be migrated to the read-optimized columnar tables and thus losing the real-time benefit. Additional storage is needed as the data is duplicated. Administrative costs are higher as you need to monitor and administer the change data capture process on the row tables and the delta data activation process on the columnar store. How does SAP HANA avoid all this and provide superior benefits? Simple. SAP HANA’s table structure has the OLTP speed of row tables and the read speed of columnar tables. There is no cut-over needed, no row table to The SAP HANA Difference Easily migrate your applications (e.g.L: Java, PHP, .NET) in almost any lan- guage, PHP, Ruby, Java, C,...the list goes on: • Support for ANSI SQL, ODBC,JDBC, Odata/JSON, and certified 3rd party tools. • Support more standards: JSON and XMLA over HTTP so it is truly multi- dimensional platform. Build new web applications with any open source HTML5/JS libraries, Server Side Java Script. Open Cloud Partner Program Allows you to select the best SAP HANA cloud option from partners. Support advanced text analytics: Analyze text in all columns of table and text inside binary files with advanced text analytic capabilities such as: automatically detecting 31 languages, fuzzy, linguistics, synonymous search, using SQL. Analyze streaming data From integrated ESP in combination with data in SAP HANA. Process geospatial data ODBO http(s), OData/JSON Browser / Mobile Web JS Lib Data Viz Lib Embed sentiment fact extraction in same SQL App Services (Web Server) Any HTML5/JS Library SQL Script DB Services SAP HANA Third Party & Custom Application HTTP)S), OData, XML/A, ODBC, JDBC, ADSC, ODBO MDX, SLQ ODBC, JDBC Web App Server Embed fuzzy text search in same SQL CREATE FULLTEXT INDEX i1 ON PSA_TRANSACTION( AMOUNT, TRAN_DATE, POST_DATE, DESCRIPTION, CATEGORY_TEXT ) FUZZY SEARCH INDEX ON SYNC; SELECT SCORE() AS SCR, * FROM "SYSTEM"."PSA_TRANSACTION" WHERE CONTAINS (*, 'Sarvice', fuzzy) ORDER BY SCR DESC; Any Data Text Data Geospatial Data Smart Meter Mobile Point of Sale RFID Machine Data Connected Vehicles Structured Data Click- Stream Social Network Customer Data Embed geospatial in same SQL
  • 6. 6 SAP HANA’s Defining Capabilities Š 2013 SAP AG or an SAP affiliate company. All rights reserved. columnar table conversion needed, no batch building for columnar compression, no nothing. The same table supports OLTP and OLAP Analytics – at the same time – with full ACID properties. This means real real-time and real-value. Another advantage SAP HANA has over traditional databases is openness. SAP HANA is open and agnostic. It supports ANSI compliant SQL, ODBC and JDBC protocols, and a wide range of certified 3rd party query tools, ETLtools, back-up and recovery tools, monitoring tools, and even applications.Yes, even ones that compete directly with SAP applications. SAP HANA supports custom applications built on SAP HANA in any language including PHP, Ruby, Python,Java, C, and of course,ABAP.And SAPHANA also supports MDX over ODBO and XMLA over HTTP if you want to access SAP HANA as a multi-dimensional database. SAP HANA is data agnostic as well, not only supporting tables, but unstructured data (i.e. text), spatial data, document data, and sparse data as well.And SAP is getting ready to release support for property graph data in SAP HANA as well.With unstructured data, SAP HANA supports ingestion of a huge number of document types including all the usual suspects, language identification for 31 languages, entity extraction for three languages, and sentiment analysis for the top five spoken languages. For spatial data, SAP HANA is not only screaming fast but also ships with a geo-coding service, polygons for all major political and geographic areas, and an industry compliant integration layer supporting ESRI, Navtec, and Google.As for document data, SAP HANA supports complete transactions using JSON or XML. For sparse data with varying attributes, SAP HANA supports flexible tables in which columns can be dynamically added on Insert or Update.All vitally important non- structured data types explode. There are several new specialized databases built specifically for these data types that have allowed for applications to address the 80% of all data that isn’t structured (i.e. not relational). These purpose built databases have provided, huge innovation leaps. But using them is still a separate database approach with all its inherent issues: The need for separate servers, data to be copied between the systems, and applications having to merge the data coming from the different systems via in-app code. All a lot of work and effort. This may be fine if you are a specialized web start-up, but definitely not if you are an enterprise. Additionally, these complex infrastructures reduce agility. By being in the single SAP HANA data platform, developers can easily take advantage of data of any kind to build next generation applications. Data of heterogeneous data types can be easily combined, processed, and leveraged. SAP HANA also brings the analytics code closer to the data. In and of itself, this isn’t new or revolutionary. Stored procedures have been around forever. The unique aspect is the offering and provision The SAP HANA Difference Any Apps Any App Server SAP Business Suite and BW ABAP App Server SQL MDX JSON Open ConnectivityR HANA PLATFORM SQL, SQLScript, JavaScript Integration Services Spatial Search Text Mining Stored Procedure & Data Models Application & UI Services Business Function Library Predictive Analysis Library Database Services Planning Engine Rules Engine Transaction Unstructured Machine HADOOP Real-time Locations Other Apps Supports any Device
  • 7. 7SAP HANA’s Defining Capabilities Š 2013 SAP AG or an SAP affiliate company. All rights reserved. of pre-built libraries and the speed with which SAP HANA can run the analytics. There is a Predictive Analysis Library (PAL) which supports 27 data preparation and data mining functions including clustering and classification algorithms, regression methods, decision trees, and forecasting methods. These are all natively compiled and parallelized as part of the query execution plan determined by the SAP HANA optimizer providing blinding speed. There is also a Business Function Library (BFL) that contains 55 data intensive functions that would slow down traditional three tier applications given their need for data movement from the database tier to the app tier.Things like currency conversion, depreciation, and moving median calculations.And new libraries are being added to SAP HANA for data quality and data transformations. There is also a high speed integration to “R”that lets you utilize the over 3500 algorithms that have been built on this open source analytics platform to leverage the real-time capability. Individually these innovations are great, but what is really amazing is that that you can use them together. SAP HANA makes this not only possible, but even easy.Want to show a billion pieces of data on a map showing visualizations based on clustering analysis of that data? How about a financial system that takes every transaction for the quarter up until this last second, does a synthetic close on the books, and then forecasts where the financials will be at quarter end? So SAP HANA is a fully ACID compliant database that supports real- time OLTP and OLAP (analytics) without duplicate copies of the data. It supports every major data type along with text analytics and geo-spatial capabilities. It supports all major industry standard query languages, and comes with highly optimized industry functions and algorithms to build class best performance (think 1000s of times faster). And it does this without indexes or aggregates. When these are combined in totality, you have a system that uses less hardware and requires less maintenance and saves you money while providing game-changing capabilities and agility. The SAP HANA Difference
  • 8. 8 SAP HANA’s Defining Capabilities Š 2013 SAP AG or an SAP affiliate company. All rights reserved. SAP HANA’s speed is often referred to as being “in-memory”, where all data can be placed in the computer’s RAM and thus avoid slow disk reads. This may be true, but overshadows other huge innovations developed by the HANA team in partnership with Intel. These innovations deliver orders of magnitude faster performance over just putting data a big cache. SAP HANA’s version of in-memory works by: 1. Having a SAP HANA optimizer that leverages knowledge of the data’s locality to the CPUs and their caches. It leverages this information along with knowledge of all the bandwidth and latencies between every cache, every core, every CPU, and every node to formulate the most performant execution plans 2. Aligning all data in-memory with the processor’s cache-line size (the amount of data always fetched by the processor’s memory controller). This allows data to be pipelined from RAM, through the caches, and into a core’s registers. 3. Aligning data in-memory in vectors (columns) allowing data to be compressed highly efficiently by data type. Therefore SAP HANA is able to pipeline much larger amounts of data to the core’s registers. 4. SAP HANA acts on compressed data whenever possible including scans, aggregates, and joins. When processing data, SAP HANA leverages Intel’s SSE vector processing technology (i.e. SIMD instruction set) to process multiple pieces of data in a single instruction cycle. Since SAP HANA uses variable length integer encoding, this means that for lower cardinality columns, it can process 10s of integers per register per instruction cycle. Now that we’ve established where SAP HANA’s speed comes from, we can address perceived concerns around out- of-memory conditions, scalability, and data center readiness. SAP HANA is resilient. When available memory is filling up, SAP HANA purges any column partition, column, or table that hasn’t been accessed recently from memory. It is not lost.This is efficient as the data is already persisted on-disk and no write-back is needed. BLOBs really don’t need to be in-memory once their entities have been extracted, so these can be set to only reside on disk saving memory. In most instances SAP recommends leaving all data in memory, but you don’t have to and be less frequently accessed data can be stored in systems such as Hadoop and SAP Sybase IQ. Further SAP HANA also doesn’t need that much memory to process very large amounts of data. Most DBAs quote their databases in total size, including indexes and aggregates. So a 1TB database typically only has in approximately 500 GB of raw data. SAP HANA achieves compression of OLTP data in the 3-6X range and data warehousing data in the 10-20X range. To be conservative, let’s assume 5X compression. This makes the 500GB of raw data 100 GBs. Now, SAP HANA doesn’t only store data in-memory, it also processes all its interim datasets in- memory, so we recommend a SAP HANA instance memory of 2X the compressed raw data, or 200 GB in our example. This means that you can get all the benefits SAP HANA offers for a 1TB existing database in a 200GB SAP HANA node. Said another way, a standard 1TB SAP HANA node is capable of handling a 5TB database on migration. SAP HANA is also fully elastically scalable allowing you to add nodes to accommodate more memory and data. There are certified configurations up to 56 nodes and 56 TB of memory and SAP has built a 100 node 100 TB system on which a 1 PB benchmark (20x compression) was run. Conceive of needing even more than 56 nodes? Just ask and SAP will certify more. What is truly incredible is the linearity SAP HANA exhibited during scaling tests Architecture SOURCE SSE2 OP DEST CLOCK CYCLE 1 127 0 X4 X3 X2 X1 Y4 Y3 Y2 Y1 X4opY4 X3opY3 X2opY2 X1opY1 Single Node SIMD Operation
  • 9. 9SAP HANA’s Defining Capabilities Š 2013 SAP AG or an SAP affiliate company. All rights reserved. on the 100 node system noted above. As you can see from the graph above, SAP HANA’s response time was essentially flat as data and nodes were doubled. Simply stated, scaling SAP HANA just takes math.Want to double the speed of your queries? Double the nodes? Want to double your users and maintain response times on the same data size? This is because SAP HANA is able to fully parallelize any query and fully saturate its CPU cores. This means that SAP HANA is fully utilizing all available resources while wasting none. It means that SAP HANA is always going to solve your query as fast as is possible. If you have large concurrency on SAP HANA, its execution plans will use fewer cores and memory to allow all users to be serviced. And a planned innovation will make SAP HANA even faster. As noted by Hasso Plattner at this past year’s Architecture Sapphire conference in Orlando, FL, SAP has developed a new object aware cache that is capable of handling delta changes. This means that similar queries should get millisecond response times even under load once this innovation is released. SAP HANA offers reliability, full high availability, disaster recovery (DR) and supportability. So from a data center readiness perspective, SAP HANA is complete. SAP HANA fully supports fail- over nodes so that if a node fails for any reason a backup node will kick in. Since SAP HANA is in-memory, performance will be slowed to disk-database speeds until all data is moved from disk to memory. Note that the fail-over node is available immediately and data will load as queries require it. SAP HANA also supports DR via either synchronous or asynchronous native replication. This allows secondary SAP HANA systems to be located continents away providing protection from any natural or political disasters. There are many options for DR that can allow for near zero down time upgrades or use of the secondary system for use as test and dev. Our partners are also offering log shipping as another DR option and efficient use of hardware. As noted earlier, SAP HANA is open and agnostic and that goes for hardware as well. SAP HANA runs on “commodity” components assembled by our hardware partners to the minimum specifications given to them by SAP. These specifications allow very different architectures by each partner providing our customers both choice and a competitive market. We recently announced an initiative allowing customers to use their own storage systems as long as they meet SAP HANA’s specifications. And SAP HANA can also be run virtualized to support testing and dev. All these innovations are designed to keep hardware costs down. Lower costs does not mean cheap or chintzy. SAP HANA hardware is not inexpensive as standard components are enterprise class and the servers do have lots of memory. But, using compression, hardware and storage can be conservatively sized. And you do get what you pay for. As an instance can handle mixed workloads of OLTP and operational reporting, one less server is required. And as SAP HANA doesn’t require indexes and aggregates, data duplicates in row and columnar structures, customers report lower operating costs. So you get good value and good TCO. Take SAP’s own experience in migrating its CRM system. SAP saw a 30% reduction in hardware costs due to no longer needing a separate system for operational reporting. 16 nodes (100 billion rows) 51 nodes (650 billion rows) 95 nodes (1,200 billion rows) Extreme Linear Scalability Query processing time (in seconds) Query 1 Query 2 Query 3 0.425 0.266 0.142 0.502 3.102 0.491 3.249 0.7 3.816 Sales and Distribution Reports Query 1: Single Customer and Material for One Month Query 2: Range of Customers and Materials for Six Months Query 3: Year-Over-Year Trending Report for Top 100 Customers for Five Years
  • 10. 10 Š 2013 SAP AG or an SAP affiliate company. All rights reserved. Summary SAP HANA represents such a huge shift in data management, application development, and analytics that it is a paradigm shift.And like most things new and earthshattering it is often hard to believe or accept at face value, especially when looked at through the lenses of existing paradigms. SAP HANA takes the most important innovations in hardware architectures, columnar and row databases, text analytics, parallel processing, and much more and blends them together in an elegantly designed software platform and appliance. SAP HANA is convergence. SAP HANA represents a huge shift for enterprise computing. SAP HANA is the first technology in decades that will simultaneously allow landscape simplification (i.e. fewer servers) and rapid innovation. Said another way, SAP HANA is the next generation data platform. SAP HANA’s Defining Capabilities
  • 11. Š 2013 SAP AG or an SAP affiliate company. All rights reserved. 13/08 Š 2013 SAP AG or an SAP affiliate company. All rights reserved. No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP AG. The information contained herein may be changed without prior notice. Some software products marketed by SAP AG and its distributors contain proprietary software components of other software vendors. National product specifications may vary. These materials are provided by SAP AG and its affiliated companies (“SAP Group”) for informational purposes only, without representation or warranty of any kind, and SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and other countries. Please see http://www.sap.com/corporate-en/legal/copyright/index.epx#trademark for additional trademark information and notices. www.sap.com/contactsap