SlideShare a Scribd company logo
1 of 30
Download to read offline
Sam  Lightstone
Dis0nguished  Engineer
© 2015 IBM Corporation
The  Science  and  Engineering  
Behind  Super  Fast  Load-­‐and-­‐Go  In-­‐
memory  Analy0cs
IBM Analytics Platform Group
1.  Next generation in-memory SQL data warehousing
" Super	
  Fast	
  (query	
  performance)	
  
" Super	
  Simple	
  (load-­‐and-­‐go)	
  	
  
" Super	
  Small	
  (RAM	
  and	
  storage	
  savings)	
  
2.  Seamlessly integrated
" Built	
  seamlessly	
  into	
  both	
  dashDB	
  (cloud)	
  and	
  DB2	
  (soAware)	
  
" Consistent	
  SQL,	
  language	
  interfaces,	
  administraGon	
  
" DramaGc	
  simplificaGon	
  
3.  Hardware optimized
" Memory	
  opGmized	
  
" CPU-­‐opGmized	
  
" I/O	
  opGmized	
  
BLU	
  Accelera/on	
  
2
35X-73X faster
…than traditional row-organized relational
database technology, on average.
“Compared to our current production system, DB2 10.5 with BLU Acceleration is running 106x
faster for our Admissions and Enrollment workloads. We had one query that we would often
cancel if it didn’t finish in 30 minutes. Now it runs in 56 seconds every time. 32x faster,
predictable response time, no tuning…what more could we ask for?”
- Brenda Boshoff, Sr. DBA
“Wow…unbelievable speedup in query run times! We saw a speedup of 273x
in our Vehicle Tracking report, taking a query from 10 minutes to 2.2 seconds.
That adds value to our business; our end users are going to be ecstatic!”
- Ruel Gonzalez - Information Services
DataProxy LLC
“My largest row-organized, adaptive compressed table gave me 3.2x storage savings. However,
converting this row-organized uncompressed table to a column-organized table in DB2 10.5
delivered a massive 15.4x savings!”
- Iqbal Goralwalla, Head of DB2 Managed Services, Triton
Records: 76M
Columns: 61
Indexes: 10
Load	
  Time
Row-­‐unc 15:39:10
Col 1:10:29(97%)
31.5x storage
savings
(97% less storage required)
13.5x faster load time
Storage Savings
6
Super simple
CREATE. LOAD. GO!
Risk system injects 1/2 TB per night
from 25 different source systems.
“Impressive Load times.”
Some queries achieved an almost
100x speed up with literally no
tuning.
6 hours.
Installing BLU
to query results.
One of the world’s most profitable
and secure rated banks.
Load-­‐and-­‐go	
  simplicity	
  
•  No	
  indexes	
  
•  No	
  storage	
  reclaim	
  	
  (it’s	
  automated)	
  
•  No	
  memory	
  configuraGon	
  (it’s	
  automated)	
  
•  No	
  process	
  model	
  configuraGon	
  (it’s	
  
automated)	
  
•  No	
  staGsGcs	
  collecGon	
  (it’s	
  automated)	
  
•  No	
  MDC	
  or	
  MQTs	
  
•  No	
  StaGsGcal	
  views	
  
•  No	
  opGmizer	
  profiles/guidelines	
  
	
  
“The BLU Acceleration technology has some obvious benefits: … But it’s when I think about
all the things I don't have to do with BLU, it made me appreciate the technology even
more: no tuning, no partitioning, no indexes, no aggregates.”
-Andrew Juarez, Lead SAP Basis and DBA
8
Simple.	
  
•  LOAD	
  and	
  then…	
  run	
  queries	
  
	
  
In-­‐memory	
  speeds	
  with	
  /ny	
  RAM	
  requirements	
  
•  Rule	
  of	
  thumb:	
  RAM	
  requirements	
  are	
  just	
  5%	
  of	
  source	
  data	
  
size.	
  	
  
•  Example:	
  	
  
•  10	
  Terabytes	
  of	
  raw	
  user	
  data	
  
•  500	
  GB	
  of	
  RAM	
  
9
10
Magic: How did they do that?
BLU Acceleration
Easiest. Fastest.
Smallest.
IBM Means
Innovation
BLU	
  Acceleration	
  includes	
  over	
  30	
  new	
  
patents	
  and	
  patents	
  pending	
  from	
  
IBM	
  Research	
  &	
  Development	
  
Laboratories.	
  
11
Looking at Disk Improvements
(courtesy of D. DeWitt, PASS Summit Keynote 2009)
!  Incredibly inexpensive drives (& processors) have made it
possible to collect, store, and analyze huge quantities of data
12
But, consider the metric transfer bandwidth/byte
Over the last 30 years
Capacity:	
  
80MB	
  "	
  800GB	
  
10,000x	
  
	
  
Transfer	
  Rates:	
  
1.2MB/sec	
  "	
  80MB/sec	
  
65x	
  
	
  
!  1980: 1.2 MB/sec / 80 MB = 0.015
!  2009: 80 MB/sec / 800,000 MB =.0001
When relative capacities are factored in, drives are 150X slower
today!!!
We’ve	
  been	
  snookered!	
  
Source: http://www.jcmit.com/mem2013.htm
Main Memory
CPU cache optimized
• RAM is at the bottom of the memory
hierarchy. It is the slowest non-persistent
memory in a server.
• CPU cache is many times faster than
RAM.
• Extreme re-engineering of database
algorithms to be CPU cache optimized
• BLU algorithms adapt automatically to
hardware cache size.
Speed-up versus RAM
15x
41x
173x
1x
0
20
40
60
80
100
120
140
160
180
200
RAM L3 L2 L1
Speed-upversusRAM
Big Idea: RAM is Too Slow
14
More Evil Than Ever
1. Human intervention
2. Voluminous I/O
3. Random I/O
4. Memory stalls
5. Single core processing
6. The cost of RAM
Load-and–go simplicity: System resources
CPU CPUCPU CPU CPU CPU CPU
RAM1. Auto-detect and adapt
to available RAM
2. Auto detect and adapt to
core number and type
3. Auto detect and adapt to
CPU cache size
Load-and-go simplicity: Automatic Workload Management
• Built-in and automated query resource consumption control
• Many queries can be submitted, but effective concurrency, per query RAM,
and CPU are automatically and dynamically controlled
...
Applications and Users
Up to tens of thousands of
SQL queries at once
BLU Runtime
SQL Queries
Moderate number of queries
consume resources
17
Load-and-go simplicity: Automatic Space Reclaim
•  Automatic space reclamation
•  Frees extents with no active values
•  The storage can be subsequently reused by any
table in the table space
•  No need for costly DBA space management and
REORG utility
•  Space is freed online while
work continues
•  Regular space management can result in
increased performance of RUNSTATS and some
queries
Column
3
Column
1
Column
2
2012 2012
2012
2012
DELETE * FROM MyTable
WHERE Year = 2012
These extents hold only
deleted data
Storage
extent
2013 2013 2013
2013
18
•  No CPU for decompression
•  Data flows through memory
and CPU at compressed size
•  “Actionable Compression”
QUERY	

DATA DATA
DATA
•  10-20X smaller is common
•  Compress as small as 1 bit
•  Compress the most frequent
data the smallest
THE DATA
HONEY, I
SHRUNK
BIG tiny
•  Reduce I/O
•  Increase data density
in RAM
•  Increase CPU
efficiency
C1 C2 C3 C4 C5 C6 C7 C8C1 C2 C3 C4 C5 C6 C7 C8
•  Cache intelligently for
analytics
•  Predictive I/O with
“Dynamic List
Prefetching”
•  Massive I/O reduction
RAM	

DISKS	

•  Queries skip
uninteresting data
•  Synopses on every
column,
automatically.
•  “Data Skipping”
•  Use modern SIMD instructions to do
multiple operations with a single
instruction.
•  Use 1 instruction instead of 8.
•  Massive CPU acceleration
•  Store data as vectors
•  “Parallel Vector Processing”
•  RAM is too slow for BLU !
•  Redesign the query engine to
operate at CPU cache speeds
instead
•  CPU cache is 10-75X faster than
RAM access.
BLU Acceleration Illustration
10TB query in seconds or less
10TB data
Actionable Compression
reduces to 1TB
In-memory
Parallel Processing
32MB linear scan
on each core via
Scans as fast as
8MB through SIMD and CPU-
cache optimized algorithms
Result in
seconds or less
Column Processing
reduces to 10GB
Data Skipping
reduces to 1GB
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
   DATA	
   DATA	
  
DATA	
  
DATA	
   DATA	
  
#  The System: 32 cores, 1TB memory, 10TB table with 100 columns and 10 years of data
#  The Query: How many “sales” did we have in 2010?
-  SELECT COUNT(*) from MYTABLE where YEAR = ‘2010’
#  The Result: In seconds or less as each CPU core examines the equivalent of just 8MB of data
22
BLU Acceleration Illustration
10TB query in seconds or less
• 1GB RAM required to cache all data for query over 10TB of data.
• 10,000 times reduction in memory requirements to achieve in-
memory speeds
10TB data
Actionable Compression
reduces to 1TB
In-memory
Parallel Processing
32MB linear scan
on each core via
Scans as fast as
8MB through SIMD and CPU-
cache optimized algorithms
Result in
seconds or less
Column Processing
reduces to 10GB
Data Skipping
reduces to 1GB
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
  
DATA	
   DATA	
   DATA	
  
DATA	
  
DATA	
   DATA	
  
23
IBM	
  dashDB	
  –	
  Data	
  Warehousing	
  as	
  a	
  Service	
  
Terabytes of data ready to analyze
within minutes
•  Fast querying
•  In-memory
•  Columnar
•  SIMD hardware acceleration
•  Actionable compression
•  Advanced analytics
•  Support for OLAP SQL extensions
•  In-database analytics & R for
predictive modeling
•  Spatial analytics
•  Easy to integrate
•  Connect common 3rd party BI tools
keeps data warehouse
infrastructure out of your way
26
The dashDB Cocktail: Three Parts IBM, Shaken…
BLU !
Acceleration!
Netezza !
In-Database
Analytics!
SoftLayer
Infrastructure !
as a Service!
Build More
Grow More
Know More
Focus on the business, "
not the business of data warehousing!
Partnership	
  with	
  IBM	
  Cloudant	
  and	
  dashDB	
  
•  Cloudant is a fully managed distributed !
NoSQL Database as a Service (DBaaS)!
•  Multi-tenant and single (dedicated) tenants!
•  Managed 24x7 by Cloudant expert engineers!
•  Service Level Agreement!
•  Operational data store!
•  dashDB’s partnership with Cloudant presents two opportunities:!
1.  A launching point for new NoSQL customers looking for low-risk, low-cost
avenues for getting started with data warehousing & analytics in a!
multi-tenant environment on the cloud!
2.  A new point of entry for existing Cloudant developers to access !
industry-leading INZA analytics and BLU warehousing!
3.  Automatic schema discovery from theJSON data. One button push to create
warehouse. !
dashDB	
  MPP	
  	
  
~elas/c	
  growth	
  
1.  4TB	
  building	
  blocks	
  (subject	
  to	
  change)	
  
2.  Min	
  3	
  servers.	
  	
  
3.  Elastic	
  growth	
  without	
  re-­‐hashing	
  all	
  row	
  data	
  
4.  Webscale	
  clustering	
  
BLU ACCELERATION
MPP Scale-out of dashDB with BLU Acceleration
CPUsCPUsCPUsCPUs
BLU Acceleration
Dynamic In-Memory Processing
CPUsCPUsCPUsCPUs
BLU Acceleration
Dynamic In-Memory Processing
Now in
Closed
Beta!
The	
  what	
  &	
  the	
  wow	
  
1.  Fast:	
  Superior	
  Performance	
  35-­‐73X	
  
2.  Small	
  RAM	
  requirements:	
  Typically	
  just	
  5%	
  of	
  
the	
  original	
  uncompressed	
  user	
  data.	
  	
  
3.  Small:	
  Superior	
  Compression	
  
4.  Simple:	
  Create.	
  Load.	
  Go.	
  	
  
5.  Introducing	
  dashDB	
  for	
  cloud	
  	
  
•  Fully	
  managed	
  service	
  on	
  IBM	
  BlueMix	
  	
  
•  JSON	
  NoSQL	
  integraGon,	
  Cloudant.com	
  
•  R	
  and	
  SpaGal	
  analyGcs	
  
6.  SAP	
  BW	
  cerGfied	
  
7.  Oracle	
  compaGbility	
  @98%	
  +	
  
	
  
“We cut report runtimes by up
to 98% thanks to IBM DB2 with
BLU Acceleration technology –
without changing operations
processes or investing in new
hardware or software. We were
impressed how easy boosting
database performance can be.”
-Bernhard Herzog, Team Manager
Information Technology SAP, Balluff
Where	
  to	
  find	
  more	
  informa/on:	
  
•  dashDB.com	
  
•  ibmbluhub.com	
  
•  Sam	
  Lightstone’s	
  BLOG:	
  SoAwareTradecraA.com	
  
•  Sam	
  Lightstone’s	
  papers	
  on	
  DBLP:	
  	
  	
  
•  hjp://dblp.uni-­‐trier.de/pers/hd/l/Lightstone:Sam	
  
30

More Related Content

What's hot

SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4
UniFabric
 
SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3
UniFabric
 
Introduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RIntroduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3R
Simon Huang
 

What's hot (20)

Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
 
Ceph Day Beijing - Ceph RDMA Update
Ceph Day Beijing - Ceph RDMA UpdateCeph Day Beijing - Ceph RDMA Update
Ceph Day Beijing - Ceph RDMA Update
 
Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage
 
SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4
 
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory EasyIMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
 
SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3
 
Ceph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Day KL - Delivering cost-effective, high performance Ceph clusterCeph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Day KL - Delivering cost-effective, high performance Ceph cluster
 
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash TechnologyCeph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
 
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
 
UniPlex vScaleDB pat. pending
UniPlex vScaleDB pat. pendingUniPlex vScaleDB pat. pending
UniPlex vScaleDB pat. pending
 
Introduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RIntroduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3R
 
Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce
 
Redis on NVMe SSD - Zvika Guz, Samsung
 Redis on NVMe SSD - Zvika Guz, Samsung Redis on NVMe SSD - Zvika Guz, Samsung
Redis on NVMe SSD - Zvika Guz, Samsung
 
Ceph Day Beijing - Storage Modernization with Intel and Ceph
Ceph Day Beijing - Storage Modernization with Intel and CephCeph Day Beijing - Storage Modernization with Intel and Ceph
Ceph Day Beijing - Storage Modernization with Intel and Ceph
 
Ceph Day San Jose - Object Storage for Big Data
Ceph Day San Jose - Object Storage for Big Data Ceph Day San Jose - Object Storage for Big Data
Ceph Day San Jose - Object Storage for Big Data
 
Webinar: What’s Your Path to NVMe?
Webinar: What’s Your Path to NVMe?Webinar: What’s Your Path to NVMe?
Webinar: What’s Your Path to NVMe?
 
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
 
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
 
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSAccelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
 
Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK
 

Similar to IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind In-Memory Analytics

A5 oracle exadata-the game changer for online transaction processing data w...
A5   oracle exadata-the game changer for online transaction processing data w...A5   oracle exadata-the game changer for online transaction processing data w...
A5 oracle exadata-the game changer for online transaction processing data w...
Dr. Wilfred Lin (Ph.D.)
 
Handling Massive Writes
Handling Massive WritesHandling Massive Writes
Handling Massive Writes
Liran Zelkha
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
Kognitio
 

Similar to IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind In-Memory Analytics (20)

Exadata architecture and internals presentation
Exadata architecture and internals presentationExadata architecture and internals presentation
Exadata architecture and internals presentation
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics Platform
 
Oracle Database in-Memory Overivew
Oracle Database in-Memory OverivewOracle Database in-Memory Overivew
Oracle Database in-Memory Overivew
 
How AI and ML are driving Memory Architecture changes
How AI and ML are driving Memory Architecture changesHow AI and ML are driving Memory Architecture changes
How AI and ML are driving Memory Architecture changes
 
In-Memory Computing: Myths and Facts
In-Memory Computing: Myths and FactsIn-Memory Computing: Myths and Facts
In-Memory Computing: Myths and Facts
 
Caching Methodology & Strategies
Caching Methodology & StrategiesCaching Methodology & Strategies
Caching Methodology & Strategies
 
Caching methodology and strategies
Caching methodology and strategiesCaching methodology and strategies
Caching methodology and strategies
 
Shootout at the PAAS Corral
Shootout at the PAAS CorralShootout at the PAAS Corral
Shootout at the PAAS Corral
 
A5 oracle exadata-the game changer for online transaction processing data w...
A5   oracle exadata-the game changer for online transaction processing data w...A5   oracle exadata-the game changer for online transaction processing data w...
A5 oracle exadata-the game changer for online transaction processing data w...
 
SQream DB - Bigger Data On GPUs: Approaches, Challenges, Successes
SQream DB - Bigger Data On GPUs: Approaches, Challenges, SuccessesSQream DB - Bigger Data On GPUs: Approaches, Challenges, Successes
SQream DB - Bigger Data On GPUs: Approaches, Challenges, Successes
 
Handling Massive Writes
Handling Massive WritesHandling Massive Writes
Handling Massive Writes
 
High Performance Hardware for Data Analysis
High Performance Hardware for Data AnalysisHigh Performance Hardware for Data Analysis
High Performance Hardware for Data Analysis
 
Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
 
Hadoop Operations: Keeping the Elephant Running Smoothly
Hadoop Operations: Keeping the Elephant Running SmoothlyHadoop Operations: Keeping the Elephant Running Smoothly
Hadoop Operations: Keeping the Elephant Running Smoothly
 
Coa presentation3
Coa presentation3Coa presentation3
Coa presentation3
 
MySQL Performance Tuning at COSCUP 2014
MySQL Performance Tuning at COSCUP 2014MySQL Performance Tuning at COSCUP 2014
MySQL Performance Tuning at COSCUP 2014
 
Severalnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IXSeveralnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IX
 
Full scan frenzy at amadeus
Full scan frenzy at amadeusFull scan frenzy at amadeus
Full scan frenzy at amadeus
 
Drupal performance
Drupal performanceDrupal performance
Drupal performance
 

More from In-Memory Computing Summit

IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
In-Memory Computing Summit
 
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X PlatformIMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
In-Memory Computing Summit
 

More from In-Memory Computing Summit (20)

IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
 
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
 
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing HubIMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
 
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
 
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
 
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
 
IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...
IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...
IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...
 
IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...
IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...
IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...
 
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X PlatformIMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
 
IMC Summit 2016 Breakout - Ken Gibson - The In-Place Working Storage Tier
IMC Summit 2016 Breakout - Ken Gibson - The In-Place Working Storage TierIMC Summit 2016 Breakout - Ken Gibson - The In-Place Working Storage Tier
IMC Summit 2016 Breakout - Ken Gibson - The In-Place Working Storage Tier
 
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
IMC Summit 2016 Breakout - Brian Bulkowski - NVMe, Storage Class Memory and O...
 
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
IMC Summit 2016 Breakout - Yanping Wang - Non-volatile Generic Object Program...
 
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
 
IMC Summit 2016 Breakout - Gordon Patrick - Developments in Persistent Memory
IMC Summit 2016 Breakout - Gordon Patrick - Developments in Persistent MemoryIMC Summit 2016 Breakout - Gordon Patrick - Developments in Persistent Memory
IMC Summit 2016 Breakout - Gordon Patrick - Developments in Persistent Memory
 
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
IMC Summit 2016 Breakout - Girish Kathalagiri - Decision Making with MLLIB, S...
 
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise GradeIMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
IMC Summit 2016 Breakout - Steve Wikes - Making IMC Enterprise Grade
 
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
IMC Summit 2016 Breakout - Noah Arliss - The Truth: How to Test Your Distribu...
 
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of StatelessnessIMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
IMC Summit 2016 Breakout - Aleksandar Seovic - The Illusion of Statelessness
 
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
IMC Summit 2016 Breakout - Girish Mutreja - Extreme Transaction Processing in...
 
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
IMC Summit 2016 Breakout - Greg Luck - How to Speed Up Your Application Using...
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind In-Memory Analytics

  • 1. Sam  Lightstone Dis0nguished  Engineer © 2015 IBM Corporation The  Science  and  Engineering   Behind  Super  Fast  Load-­‐and-­‐Go  In-­‐ memory  Analy0cs IBM Analytics Platform Group
  • 2. 1.  Next generation in-memory SQL data warehousing " Super  Fast  (query  performance)   " Super  Simple  (load-­‐and-­‐go)     " Super  Small  (RAM  and  storage  savings)   2.  Seamlessly integrated " Built  seamlessly  into  both  dashDB  (cloud)  and  DB2  (soAware)   " Consistent  SQL,  language  interfaces,  administraGon   " DramaGc  simplificaGon   3.  Hardware optimized " Memory  opGmized   " CPU-­‐opGmized   " I/O  opGmized   BLU  Accelera/on   2
  • 3. 35X-73X faster …than traditional row-organized relational database technology, on average.
  • 4. “Compared to our current production system, DB2 10.5 with BLU Acceleration is running 106x faster for our Admissions and Enrollment workloads. We had one query that we would often cancel if it didn’t finish in 30 minutes. Now it runs in 56 seconds every time. 32x faster, predictable response time, no tuning…what more could we ask for?” - Brenda Boshoff, Sr. DBA “Wow…unbelievable speedup in query run times! We saw a speedup of 273x in our Vehicle Tracking report, taking a query from 10 minutes to 2.2 seconds. That adds value to our business; our end users are going to be ecstatic!” - Ruel Gonzalez - Information Services DataProxy LLC
  • 5. “My largest row-organized, adaptive compressed table gave me 3.2x storage savings. However, converting this row-organized uncompressed table to a column-organized table in DB2 10.5 delivered a massive 15.4x savings!” - Iqbal Goralwalla, Head of DB2 Managed Services, Triton Records: 76M Columns: 61 Indexes: 10 Load  Time Row-­‐unc 15:39:10 Col 1:10:29(97%) 31.5x storage savings (97% less storage required) 13.5x faster load time Storage Savings
  • 7. Risk system injects 1/2 TB per night from 25 different source systems. “Impressive Load times.” Some queries achieved an almost 100x speed up with literally no tuning. 6 hours. Installing BLU to query results. One of the world’s most profitable and secure rated banks.
  • 8. Load-­‐and-­‐go  simplicity   •  No  indexes   •  No  storage  reclaim    (it’s  automated)   •  No  memory  configuraGon  (it’s  automated)   •  No  process  model  configuraGon  (it’s   automated)   •  No  staGsGcs  collecGon  (it’s  automated)   •  No  MDC  or  MQTs   •  No  StaGsGcal  views   •  No  opGmizer  profiles/guidelines     “The BLU Acceleration technology has some obvious benefits: … But it’s when I think about all the things I don't have to do with BLU, it made me appreciate the technology even more: no tuning, no partitioning, no indexes, no aggregates.” -Andrew Juarez, Lead SAP Basis and DBA 8 Simple.   •  LOAD  and  then…  run  queries    
  • 9. In-­‐memory  speeds  with  /ny  RAM  requirements   •  Rule  of  thumb:  RAM  requirements  are  just  5%  of  source  data   size.     •  Example:     •  10  Terabytes  of  raw  user  data   •  500  GB  of  RAM   9
  • 10. 10 Magic: How did they do that?
  • 11. BLU Acceleration Easiest. Fastest. Smallest. IBM Means Innovation BLU  Acceleration  includes  over  30  new   patents  and  patents  pending  from   IBM  Research  &  Development   Laboratories.   11
  • 12. Looking at Disk Improvements (courtesy of D. DeWitt, PASS Summit Keynote 2009) !  Incredibly inexpensive drives (& processors) have made it possible to collect, store, and analyze huge quantities of data 12 But, consider the metric transfer bandwidth/byte Over the last 30 years Capacity:   80MB  "  800GB   10,000x     Transfer  Rates:   1.2MB/sec  "  80MB/sec   65x     !  1980: 1.2 MB/sec / 80 MB = 0.015 !  2009: 80 MB/sec / 800,000 MB =.0001 When relative capacities are factored in, drives are 150X slower today!!!
  • 13. We’ve  been  snookered!   Source: http://www.jcmit.com/mem2013.htm Main Memory
  • 14. CPU cache optimized • RAM is at the bottom of the memory hierarchy. It is the slowest non-persistent memory in a server. • CPU cache is many times faster than RAM. • Extreme re-engineering of database algorithms to be CPU cache optimized • BLU algorithms adapt automatically to hardware cache size. Speed-up versus RAM 15x 41x 173x 1x 0 20 40 60 80 100 120 140 160 180 200 RAM L3 L2 L1 Speed-upversusRAM Big Idea: RAM is Too Slow 14
  • 15. More Evil Than Ever 1. Human intervention 2. Voluminous I/O 3. Random I/O 4. Memory stalls 5. Single core processing 6. The cost of RAM
  • 16. Load-and–go simplicity: System resources CPU CPUCPU CPU CPU CPU CPU RAM1. Auto-detect and adapt to available RAM 2. Auto detect and adapt to core number and type 3. Auto detect and adapt to CPU cache size
  • 17. Load-and-go simplicity: Automatic Workload Management • Built-in and automated query resource consumption control • Many queries can be submitted, but effective concurrency, per query RAM, and CPU are automatically and dynamically controlled ... Applications and Users Up to tens of thousands of SQL queries at once BLU Runtime SQL Queries Moderate number of queries consume resources 17
  • 18. Load-and-go simplicity: Automatic Space Reclaim •  Automatic space reclamation •  Frees extents with no active values •  The storage can be subsequently reused by any table in the table space •  No need for costly DBA space management and REORG utility •  Space is freed online while work continues •  Regular space management can result in increased performance of RUNSTATS and some queries Column 3 Column 1 Column 2 2012 2012 2012 2012 DELETE * FROM MyTable WHERE Year = 2012 These extents hold only deleted data Storage extent 2013 2013 2013 2013 18
  • 19. •  No CPU for decompression •  Data flows through memory and CPU at compressed size •  “Actionable Compression” QUERY DATA DATA DATA •  10-20X smaller is common •  Compress as small as 1 bit •  Compress the most frequent data the smallest THE DATA HONEY, I SHRUNK BIG tiny
  • 20. •  Reduce I/O •  Increase data density in RAM •  Increase CPU efficiency C1 C2 C3 C4 C5 C6 C7 C8C1 C2 C3 C4 C5 C6 C7 C8 •  Cache intelligently for analytics •  Predictive I/O with “Dynamic List Prefetching” •  Massive I/O reduction RAM DISKS •  Queries skip uninteresting data •  Synopses on every column, automatically. •  “Data Skipping”
  • 21. •  Use modern SIMD instructions to do multiple operations with a single instruction. •  Use 1 instruction instead of 8. •  Massive CPU acceleration •  Store data as vectors •  “Parallel Vector Processing” •  RAM is too slow for BLU ! •  Redesign the query engine to operate at CPU cache speeds instead •  CPU cache is 10-75X faster than RAM access.
  • 22. BLU Acceleration Illustration 10TB query in seconds or less 10TB data Actionable Compression reduces to 1TB In-memory Parallel Processing 32MB linear scan on each core via Scans as fast as 8MB through SIMD and CPU- cache optimized algorithms Result in seconds or less Column Processing reduces to 10GB Data Skipping reduces to 1GB DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   #  The System: 32 cores, 1TB memory, 10TB table with 100 columns and 10 years of data #  The Query: How many “sales” did we have in 2010? -  SELECT COUNT(*) from MYTABLE where YEAR = ‘2010’ #  The Result: In seconds or less as each CPU core examines the equivalent of just 8MB of data 22
  • 23. BLU Acceleration Illustration 10TB query in seconds or less • 1GB RAM required to cache all data for query over 10TB of data. • 10,000 times reduction in memory requirements to achieve in- memory speeds 10TB data Actionable Compression reduces to 1TB In-memory Parallel Processing 32MB linear scan on each core via Scans as fast as 8MB through SIMD and CPU- cache optimized algorithms Result in seconds or less Column Processing reduces to 10GB Data Skipping reduces to 1GB DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   DATA   23
  • 24. IBM  dashDB  –  Data  Warehousing  as  a  Service   Terabytes of data ready to analyze within minutes •  Fast querying •  In-memory •  Columnar •  SIMD hardware acceleration •  Actionable compression •  Advanced analytics •  Support for OLAP SQL extensions •  In-database analytics & R for predictive modeling •  Spatial analytics •  Easy to integrate •  Connect common 3rd party BI tools keeps data warehouse infrastructure out of your way
  • 25.
  • 26. 26 The dashDB Cocktail: Three Parts IBM, Shaken… BLU ! Acceleration! Netezza ! In-Database Analytics! SoftLayer Infrastructure ! as a Service! Build More Grow More Know More Focus on the business, " not the business of data warehousing!
  • 27. Partnership  with  IBM  Cloudant  and  dashDB   •  Cloudant is a fully managed distributed ! NoSQL Database as a Service (DBaaS)! •  Multi-tenant and single (dedicated) tenants! •  Managed 24x7 by Cloudant expert engineers! •  Service Level Agreement! •  Operational data store! •  dashDB’s partnership with Cloudant presents two opportunities:! 1.  A launching point for new NoSQL customers looking for low-risk, low-cost avenues for getting started with data warehousing & analytics in a! multi-tenant environment on the cloud! 2.  A new point of entry for existing Cloudant developers to access ! industry-leading INZA analytics and BLU warehousing! 3.  Automatic schema discovery from theJSON data. One button push to create warehouse. !
  • 28. dashDB  MPP     ~elas/c  growth   1.  4TB  building  blocks  (subject  to  change)   2.  Min  3  servers.     3.  Elastic  growth  without  re-­‐hashing  all  row  data   4.  Webscale  clustering   BLU ACCELERATION MPP Scale-out of dashDB with BLU Acceleration CPUsCPUsCPUsCPUs BLU Acceleration Dynamic In-Memory Processing CPUsCPUsCPUsCPUs BLU Acceleration Dynamic In-Memory Processing Now in Closed Beta!
  • 29. The  what  &  the  wow   1.  Fast:  Superior  Performance  35-­‐73X   2.  Small  RAM  requirements:  Typically  just  5%  of   the  original  uncompressed  user  data.     3.  Small:  Superior  Compression   4.  Simple:  Create.  Load.  Go.     5.  Introducing  dashDB  for  cloud     •  Fully  managed  service  on  IBM  BlueMix     •  JSON  NoSQL  integraGon,  Cloudant.com   •  R  and  SpaGal  analyGcs   6.  SAP  BW  cerGfied   7.  Oracle  compaGbility  @98%  +     “We cut report runtimes by up to 98% thanks to IBM DB2 with BLU Acceleration technology – without changing operations processes or investing in new hardware or software. We were impressed how easy boosting database performance can be.” -Bernhard Herzog, Team Manager Information Technology SAP, Balluff
  • 30. Where  to  find  more  informa/on:   •  dashDB.com   •  ibmbluhub.com   •  Sam  Lightstone’s  BLOG:  SoAwareTradecraA.com   •  Sam  Lightstone’s  papers  on  DBLP:       •  hjp://dblp.uni-­‐trier.de/pers/hd/l/Lightstone:Sam   30