Data Grids for Extreme Performance, Scalability and Availability JavaOne 2011 Steve Millidge

Data Grids for Extreme
Performance, Scalability and
Availability
Steve Millidge
Director
C2B2

© C2B2 Consulting Limited 2011
www.c2b2.co.uk
All Rights Reserved

“Reliability, Availability, Scalability
and Performance are prerequisites
for functionality!”

They are Priority 1 Requirements
www.c2b2.co.uk
All Rights Reserved

Availability
• System is available
for customers to use
• No availability results
in no transactions
• Transactions = $$$
• Receive your Pink
Slip if you can’t sort it!

www.c2b2.co.uk
All Rights Reserved

Multipliers in Availability

System System System
1 2 3

99% Availability 99% Availability 99% Availability

Overall Availability = 0.99*0.99*0.99 = 97%

www.c2b2.co.uk
All Rights Reserved

HA Techniques
Redundancy Decoupling
99% Availability
System
System 99% Availability 1

99% Availability 99% Availability System
System
2

Pair = 1 – (0.01*0.01) = 99.99%
System 99% Availability
3
Overall = 0.9999 x 0.9999 x 0.9999 = 99% Overall = 99%

www.c2b2.co.uk
All Rights Reserved

Performance
How fast does a single transaction take to
execute!

• Faster Performance = Happier Customers
• Faster Performance = More Transactions

www.c2b2.co.uk
All Rights Reserved

Barriers to Performance
• Raw Algorithmic Performance
• Resource Limitations
– Not enough cpu, disk, memory
• Resource Contention
– Locks
• IO Latency
– Network, Disk

www.c2b2.co.uk
All Rights Reserved

Latency
Time delay in requesting an operation and it
being initiated

• Key factor in large scale distributed
applications
• Typically not taken into account during
development

www.c2b2.co.uk
All Rights Reserved

Latency Factors
• Network Distance
• Network Reliability
• Data Size
• Operation Granularity
• Resource Contention
• JVM GC

www.c2b2.co.uk
All Rights Reserved

Move the Data and Processing
Close Together

www.c2b2.co.uk
All Rights Reserved

Scalability
Ability to add more hardware in
response to more demand.

Without a reduction in
performance!

www.c2b2.co.uk
All Rights Reserved

Business Imperatives
• Success of the Business or Service
• Growth of Mobile
• Huge Variation of Load through a period
• Sudden Large Spikes due to events

Cloud Enables Elastic Scalability

www.c2b2.co.uk
All Rights Reserved

Scaling Out
Horizontal Scaling
• Add Additional
Servers
• Add Load Balancer
• Distribute traffic
across the servers
• Much Cheaper than
Scale Up
• Has HA benefits

www.c2b2.co.uk
All Rights Reserved

Linear Scalability
(Nirvana)
900
800
700
600
500
Users Linear Scalability
400
Typical Scalability
300
200
100
0
1 2 3 4
Cluster Nodes
www.c2b2.co.uk
All Rights Reserved

Typical Scale Out Architecture
Load Balancer

Nodes Host
Stateless Services

Node Node Node Node
1 2 3 4

Database contains
Database Persistent State

www.c2b2.co.uk
All Rights Reserved

Stateless Services
True Stateless Services Pseudo Stateless
• Static HTML Serving • Read, Update and Store
• Basic Calculations state in the DB
• State Received from • Use sticky session to
Client route to non critical state
• Typical of Most Online
applications
• Push scalability issue to
the database

www.c2b2.co.uk
All Rights Reserved

Scaling a Stateless Middletier is easy
however
Scaling Databases is hard and very
expensive

www.c2b2.co.uk
All Rights Reserved

Radical Idea

Put state back into the Middleware

www.c2b2.co.uk
All Rights Reserved

Caching

www.c2b2.co.uk
All Rights Reserved

Read Through Cache

GET A
Application

Cache A

Cache Loader

Data StoreA

www.c2b2.co.uk
All Rights Reserved

Write Through Cache

GET B
B

PUT
Application

Cache B

Cache Writer

Data Store

www.c2b2.co.uk
All Rights Reserved

Write Behind Cache

GET B
B

PUT
Application

Cache B

Write Behind
Processor

Data Store

www.c2b2.co.uk
All Rights Reserved

Caches
• Caches aren’t New
– Hibernate Session Cache
– Entity Bean Cache
– JPA Cache
– Custom Caches
– Open Source Caches
• Typically Cache Database Data or Page
Fragments
www.c2b2.co.uk
All Rights Reserved

JSR 107
JCACHE - Java Temporary Caching API

• Been around a Long Time
– 10 years
• Focussed on Java SE
– With some JEE Integration for JEE7
• Caching API
– V get(Object key) throws CacheException;
– void put(K key, V value) throws
CacheException;
www.c2b2.co.uk
All Rights Reserved

JSR 107
Get Involved
• Google Group for Discussion
– http://groups.google.com/group/jsr107
• Google Docs for Spec
– https://docs.google.com/document/d/1YZ-
lrH6nW871Vd9Z34Og_EqbX_kxxJi55UrSn4y
L2Ak
• GitHub for Code
– https://github.com/jsr107/jsr107spec

www.c2b2.co.uk
All Rights Reserved

Local Caching (Roll your Own)
Benefits Challenges
• Pretty Simple to Write • Cache Eviction
– Concurrent Hashmap • Cache Loading/Storing
• Used in many • Cache Prefetching
applications • Cache Refresh
• Use JCache API • Write Behind Processing
• Clustering !!

THINK LONG AND HARD!!
www.c2b2.co.uk
All Rights Reserved

Clustering Challenges

GET B

GET B
Application Application

Cache Cache B
B

DataB
Store
www.c2b2.co.uk
All Rights Reserved

Update Replication

UPDATE B
B2

Cache B2
B1 Cache B1

Data Store
www.c2b2.co.uk
All Rights Reserved

Update Invalidation

UPDATE B
B2

Cache B1
B2 Invalidate Cache B1

Data Store
www.c2b2.co.uk
All Rights Reserved

Replication Write Performance
B
PUT B

Application Application Application Application

Cache Cache Cache Cache

B

www.c2b2.co.uk
All Rights Reserved

Cache Partitioning

GET B
PUT C
B
PUT B

Application Application C


B C B

www.c2b2.co.uk
All Rights Reserved

Elasticity in Partitioned Caches

Cache Cache

www.c2b2.co.uk
All Rights Reserved

HA Cache Partitioning
B
PUT B


NODE
B
CRASH
!!!

www.c2b2.co.uk
All Rights Reserved

Partitioned Cache
• Linear Scalability
– 2 hops for Read (Worst Case)
– 2 hops for Write (Worst Case)
• High Availability
– Configurable Duplicates
• Location Independent Access
– Grid knows where data is
• More Nodes = More Data in Memory
www.c2b2.co.uk
All Rights Reserved

Consider a Large Cache

Application Application Application Application Application Application Application
Cache Cache Cache Cache Cache Cache Cache



www.c2b2.co.uk
All Rights Reserved

How Much Can We Store
• 21 Amazon xLargeMemory Instances
– 17Gb RAM
• 3 Nodes Per Instance
– 4Gb 64bit JVM Heap + 5 Gb OS
• 63 Cluster Nodes
• 252 Gb JVM Heap Available
• Approx 125Gb Data in the Grid!
• Cost per Month $9000
www.c2b2.co.uk
All Rights Reserved

Grids can Even Overflow
Application • Passivates Data to a Local
Cache Backing Store (NIO memory
mapped file)

• Use Java NIO for Off Heap
Storage

• Berkely DB local Storage
Local Drive
•Reduces GC overhead

www.c2b2.co.uk
All Rights Reserved

HA In Memory Data

Data Centre

Server Rack 1 Server Rack 2

Application Application Application Application Application Application
Cache Cache Cache Cache Cache Cache
Data Centre
Do We Need the
Server Rack 1
Database? 2
Server Rack

Applicati Applicati Applicati Applicati Applicati Applicati
on
Cache on
Cache on
Cache on
Cache on
Cache on
Cache

www.c2b2.co.uk
All Rights Reserved

Database as Business Audit
Applicati Applicati Applicati Applicati Applicati Applicati Applicati
Cach
on Cach
on Cach
on Cach
on Cach
on Cach
on Cach
on
e e e e e e e

Cach
on Cach
on Cach
on Cach
on Cach
on Cach
on Cach
on
e e e e e e e

Cach
on Cach
on Cach
on Cach
on Cach
on Cach
on Cach
on
e e e e e e e

Business Audit Data

Data Store

www.c2b2.co.uk
All Rights Reserved

Putting it All Together

Web Sockets Load Balancer

JEE JEE JEE JEE JEE
Cluster Cluster Cluster Cluster Cluster
Process
Node Node Node Node Node

Applica Applica Applica Applica Applica Applica Applica
Cac
tion Cac
tion Cac
tion Cac
tion Cac
tion Cac
tion Cac
tion
he he he he he he he

Cac
tion Cac
tion Cac
tion Cac
tion Cac
tion Cac
tion Cac
tion

Cac
tion Cac
tion Cac
tion Cac
tion Cac
tion Cac
tion Cac
tion
Data Grid

www.c2b2.co.uk
All Rights Reserved

Extreme Performance
• Reduced Latency
– Data close to processing
– Reduce roundtrips and expensive calculations
• Parallel Processing
– Distributed Processing (Map-Reduce-like)
– Distributed Query Processing

www.c2b2.co.uk
All Rights Reserved

Extreme Scalability
• O(1) Writes and Reads
– Worst Case two hops
– No increase with number of nodes
• Data Volume Increases with Nodes
– Large data volumes stored in the Data Grid
• Elastic Topology
– Clusters Rebalance with node changes

www.c2b2.co.uk
All Rights Reserved

Extreme Availability
• No Single Point of Failure
– Duplicates prevent data loss
– Duplicate Numbers Configurable
• Write Behind
– decouples Database Availability
• Self Healing
– Removing Nodes causes rebalancing

www.c2b2.co.uk
All Rights Reserved

Data Grids for Extreme Performance, Scalability and Availability JavaOne 2011 Steve Millidge

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (18)

Andere mochten auch

Andere mochten auch (20)

Ähnlich wie Data Grids for Extreme Performance, Scalability and Availability JavaOne 2011 Steve Millidge

Ähnlich wie Data Grids for Extreme Performance, Scalability and Availability JavaOne 2011 Steve Millidge (20)

Mehr von C2B2 Consulting

Mehr von C2B2 Consulting (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Data Grids for Extreme Performance, Scalability and Availability JavaOne 2011 Steve Millidge

Hinweis der Redaktion