SlideShare ist ein Scribd-Unternehmen logo
1 von 89
REMINDER
Check in on the
COLLABORATE mobile app
Top 5 Trends in Database
Technology
Guy Harrison,
Executive Director, Information Mgt R&D,
Dell Software Group
Session ID#: 995
@guyharrison
Top 5 Trends in
Database
Technology
Guy Harrison,
Executive Director, Information Mgt R&D,
Dell Software Group
Dell Software Group3
Web: guyharrison.net
Email: guy.harrison@software.dell.com
Twitter: @guyharrison
Introductions
4
Dell Software Group
5
Dell Software Group
6
Dell Software Group
7
Dell Software Group
Dell and Quest – a brief history
Dell Software Group8
But Seriously
Dell Software Group9
5 Database Technology Trends
The end of “one size fits all”
Big Data and Hadoop
NoSQL
Columnar architectures
The end of disk?
10
Dell Software Group
Trend #1:
The end of “one size fits
all”
Dell Software Group11
History of databases
Magnetic tape
“flat” (sequential) files
Pre-computer
technologies:
Printing press
Dewey
decimal
system
Punched cards
Magnetic Disk
IMS
Relational
Model
defined
Indexed-Sequential
Access Mechanism
(ISAM)
Network Model
IDMS
ADABAS
System R
Oracle V2
Ingres
dBase
DB2
Informix
Sybase
SQL Server
Access
Postgres
MySQL
Cassandra
Hadoop
Vertica
Riak
HBase
Dynamo
MongoDB
Redis
VoltDB
Hana
Neo4J
Aerospike
Hierarchical model
1960-701940-50 1950-60 1970-80 1980-90 1990-2000 2000-2010
Dell Software Group12
Why?
• 3rd Platform drives
new demands on
the database:
– Global High
Availability
– Data volumes
– Unstructured data
– Transaction rates
– Latency
• A single architecture
cannot meet all
those demands
Dell Software Group13
Operational
RDBMS
(Oracle, SQL
Server, …)
In-memory
Analytics
(HANA,
Exalytics …)
In-memory
processing
(Spark)
Hadoop
Web DBMS
(MySQL,
Mongo,
Cassandra)
ERP & in-
house CRM
Analytic/BI
software
(SAS,
Tableau)
Web Server
Data
Warehouse
RDBMS
(Oracle,
Terradata …)
It takes all sorts
Dell Software Group14
Oracle engineered systems
15
Dell Software Group
Trend #2:
Big Data and Hadoop
Dell Software Group16
The 3-4 “V”s
Volume
•Terabytes
•Petabytes
•Exabytes
•Zetabytes
Velocity
•Transaction rates
•User populations
•Machines
Variety
•Structured
•Unstructured
•Human Generated
•Machine Generated
Value
17
Dell Software Group
The Industrial revolution of data
18
Dell Software Group
2005
19
Dell Software Group
2009
Dell Software Group20
The instrumented human
• Bluetooth Personal
Area Network
• 3G/WiFi Wide Area
Network
• GPS
• Storage
• Pulse, temp
monitor
• Silent alarms
• Pedometer, sleep
monitoring
• Compass
• Camera
• Mike/earphones
• Heads up display
• Emotion/Attention
monitor
Dell Software Group21
Dell Software Group22
The instrumented world
Dell Software Group23
Big Data is the culmination of cloud, social and
mobile
Dell Software Group24
More Data
• Storing all data – including machine generated and
sol, Social, community, demographic data in
original format – for ever
To More Effect
• Smarter use of data (data science) to achieve
competitive or human benefit
Dell Software Group25
More Data
• Storing all data – including machine generated
and sol, Social, community, demographic data in
original format – for ever
To More Effect
• Smarter use of data (data science) to achieve
competitive or human benefit
26
Dell Software Group
Pioneers of big data
27
Dell Software Group
28
Dell Software Group
29
Dell Software Group
30
Dell Software Group
31
Dell Software Group
Dell Software Group32
Google File System (GFS)
Map Reduce BigTable
Google Applications
Google Software Architecture (circa 2005)
Dell Software Group33
Start ReduceMap
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map
Map Reduce
Dell Software Group34
Hadoop: 1.0: Open Source Map-Reduce Stack
Dell Software Group35
Hadoop at Yahoo
• 2010(biggest cluster):
• 4000 nodes 16PB disk
• 64 TB of RAM
• 32,000 Cores
• 2014:
– 16 Clusters
– 32,500 nodes
Dell Software Group36
Dell Software Group37
SQOOP
(RDBMS loader)
Hive
(Query)
Pig
(Scripting)
Flume
(Log Loader)
Oozie (Workflow manager)
Hadoop File System (HDFS)
Map Reduce /
YARN
Hbase
(database)
Zookeeper
(locking)
Hadoop family
Dell Software Group39
Economies
$4,911
$750
$0 $1,000 $2,000 $3,000 $4,000 $5,000 $6,000
Exadata
Hadoop
Exadata vs Hadoop $$/TB (Hardware only)
Dell Software Group40
Hadoop is the most concrete Big Data
technology
Toad: your companion in
the Big Data revolution
Dell Software Group41
More Data
• Storing all data – including machine generated and
sol, Social, community, demographic data in
original format – for ever
To More Effect
• Smarter use of data (data science) to achieve
competitive or human benefit
Dell Software Group42
More Data
• Storing all data – including machine generated and
sol, Social, community, demographic data in
original format – for ever
To More Effect
• Smarter use of data (data science) to achieve
competitive or human benefit
Dell Software Group43
Big Data Analytics
AKA Data Science
Machine Learning
• Programs that evolve
with “experience
Predictive
Analytics
• Programs that
extrapolate from
past to future
Collective
Intelligence
• Programs that use
inputs from
“crowds” to
simulate
intelligence
Dell Software Group44
Dell Software Group45
Collective Intelligence
From now on, I’ll call
you ‘An Ambulance’.
OK?
“Siri call me an
ambulance”
47
Dell Software Group
Trend #3:
NoSQL
Dell Software Group48
Web
Servers
Database
Servers
Memcached
Servers
Shard (G-O) Shard (P-Z)Shard (A-F)
Read Only Slaves
Dell Software Group49
CAP Theorem says something has to give
• CAP (Brewer’s) Theorem
says you can only have
two out of three of
Consistency, Partition
Tolerance, Availability Consistency
• Everyone always sees
the same data
Availability
• System stays up
when nodes fail
Partition
Tolerance
• System stays up
when network
between nodes fail
Oracle RAC
lives here
NO
GO
Most NoSQL lives
here
Dell Software Group50
Major influences on non-relational
• Eventually consistent transaction model
• Consistent hashing
Amazon Dynamo
• Column Family model for sparse distributed
columnar data
Google BigTable
• Paved the way for the document database
OODBMS and XML DBs
Dell Software Group51
Amazon Dynamo Model
Dell Software Group52
Name Site Counter
Dick Ebay 507,018
Dick Google 690,414
Jane Google 716,426
Dick Facebook 723,649
Jane Facebook 643,261
Jane ILoveLarry.com 856,767
Dick MadBillFans.com 675,230
NameId Name
1 Dick
2 Jane
SiteId SiteName
1 Ebay
2 Google
3 Facebook
4 ILoveLarry.com
5 MadBillFans.com
NameId SiteId Counter
1 1 507,018
1 3 690,414
2 3 716,426
1 3 723,649
2 3 643,261
2 4 856,767
1 5 675,230
Id Name Ebay Google Facebook (other columns) MadBillFans.com
1 Dick 507,018 690,414 723,649 . . . . . . . . . . . . . . 675,230
Id Name Google Facebook (other columns) ILoveLarry.com
2 Jane 716,426 643,261 . . . . . . . . . . . . . . 856,767
BigTable Data Model
Dell Software Group53
OODBMS -1990s
• The OODBMS Manifesto
(Atkinson/Bancilhon/DeWitt/Dittrich/Maier/Zdo
nik, '90)
• "A relational database is like a garage that forces
you to take your car apart and store the pieces in
little drawers“
– Also SQL is ugly
• “A Object database is like a closet which requires
that you hang up your suit with tie, underwear,
belt socks and shoes all attached” (Dave Ensor)
http://4.bp.blogspot.com/-
IPgd1Tg8ByE/UkOzH-
g1FmI/AAAAAAAACB0/QYg8kE
Vp5_0/s1600/db4o_vs_orm.png
Dell Software Group54
Revenge of the Object Nerds – Document
databases
• Structured documents – XML and
JSON (JavaScript Object Notation)
become more prevalent within
applications
• Web programmers start storing
these in BLOBS in MySQL
• Emergence of XML and JSON
databases
Dell Software Group55
Graph
Database
Neo4J
Infinite
Graph
FlockDB
Document
JSON based
MongoDB
CouchDB
RethinkDB
XML based
MarkLogic
BerkeleyDB
XML
Key Value
Memchache
DB
Oracle
NoSQL
Dynamo
Voldemort
DynamoDB
Riak
Table Based BigTable
Cassandra
Hbase
HyperTable
Accumulo
Dell Software Group56
It’s not a database, it’s a key value store
http://browsertoolkit.com/fault-tolerance.png
Dell Software Group57
No Means Yes!
58
Dell Software Group
Trend #4:
Column-oriented DB
Dell Software Group59
Row orientation vs column orientation
ID Name DOB Salary Sales Expenses
1001 Dick 21/12/60 67,000 78980 3244
1002 Jane 12/12/55 55,000 67840 2333
1003 Robert 17/02/80 22,000 67890 6436
1004 Dan 15/03/75 65,200 98770 2345
1005 Steven 11/11/81 76,000 43240 3214
Block ID Name DOB Salary Sales Expenses
1 1001 Dick 21/12/60 67,000 78980 3244
2 1002 Jane 12/12/55 55,000 67840 2333
3 1003 Robert 17/02/80 22,000 67890 6436
4 1004 Dan 15/03/75 65,200 98770 2345
5 1005 Steven 11/11/81 76,000 43240 3214
Block
1 Dick Jane Robert Dan Steven
2 21/12/60 12/12/55 17/02/80 15/03/75 11/11/81
3 67,000 55,000 22,000 65,200 76,000
4 78980 67840 67890 98770 43240
5 3244 2333 6436 2345 3214
Row oriented database
Column oriented database
Dell Software Group60
Analytical Queries
Block ID Name DOB Salary Sales Expenses
1 1001 Dick 21/12/60 67,000 78980 3244
2 1002 Jane 12/12/55 55,000 67840 2333
3 1003 Robert 17/02/80 22,000 67890 6436
4 1004 Dan 15/03/75 65,200 98770 2345
5 1005 Steven 11/11/81 76,000 43240 3214
Block
1 Dick Jane Robert Dan Steven
2 21/12/60 12/12/55 17/02/80 15/03/75 11/11/81
3 67,000 55,000 22,000 65,200 76,000
4 78980 67840 67890 98770 43240
5 3244 2333 6436 2345 3214
Row oriented database
Column oriented database
SELECT SUM(salary)
FROM saleperson
Dell Software Group61
Compression
Block ID Name DOB Salary Sales Expenses
1 1001 Dick 21/12/60 67,000 78980 3244
2 1002 Jane 12/12/55 55,000 67840 2333
3 1003 Robert 17/02/80 22,000 67890 6436
4 1004 Dan 15/03/75 65,200 98770 2345
5 1005 Steven 11/11/81 76,000 43240 3214
Block
1 Dick Jane Robert Dan Steven
2 21/12/60 12/12/55 17/02/80 15/03/75 11/11/81
3 67,000 55,000 22,000 65,200 76,000
4 78980 67840 67890 98770 43240
5 3244 2333 6436 2345 3214
Row oriented database
Column oriented database
Poor compression ratio (low
repetition)
Good compression ratio (high
repetition)
Dell Software Group62
Inserts
Block ID Name DOB Salary Sales Expenses
1 1001 Dick 21/12/60 67,000 78980 3244
2 1002 Jane 12/12/55 55,000 67840 2333
3 1003 Robert 17/02/80 22,000 67890 6436
4 1004 Dan 15/03/75 65,200 98770 2345
5 1005 Steven 11/11/81 76,000 43240 3214
Block
1 Dick Jane Robert Dan Steven
2 21/12/60 12/12/55 17/02/80 15/03/75 11/11/81
3 67,000 55,000 22,000 65,200 76,000
4 78980 67840 67890 98770 43240
5 3244 2333 6436 2345 3214
Row oriented database
Column oriented database
INSERT INTO
salesperson
Dell Software Group63
C-Store (Vertica) Solution for inserts
Read Optimized Store
• Columnar
• Disk-based
• Highly Compressed
• Bulk loadable
Write Optimized Store
• Row oriented
• Uncompressed
• Single row inserts
Asynchronous Tuple Mover
Bulk sequential loads
Continual Parallel inserts
Merged
Query
Dell Software Group64
Exadata Hybrid Columnar Compression (EHCC)
Compression Unit (~<1M)
Block (8K) Block Block Block
Column 1 Column 2 Column 3 Column 4
Row
Row
Row
Dell Software Group65
Exadata Hybrid Columnar Compression
• Provides high
compression ratio
• Manageable impact
on row read/write
operations
• Some optimization of
analytic queries
SELECT SUM(Column4)
FROM table
66
Dell Software Group
Trend #5:
The End of Disk?
Dell Software Group67
5MB HDD circa 1956
Dell Software Group68
The more that things change....
Dell Software Group69
Faster or slower?
260
1,635
-630
1,013
-390
-1,000 -500 0 500 1,000 1,500 2,000
IO Rate
Disk Capacity
IO/Capacity
CPU
IO/CPU
%age change
Dell Software Group70
Solid state disk to the rescue
• DDR RAM Drive
• SATA flash drive
• PCI flash drive
• SSD storage Server
Dell Software Group71
Cheaper by the IO
4,000
80
25
15
0 1,000 2,000 3,000 4,000 5,000
Magnetic Disk
SSD SATA Flash
SSD PCI flash
SSD DDR-RAM
Seek time (us)
Dell Software Group72
But not by the GB
0.35 0.28 0.21 0.17 0.13
2.9
2.2
1.7
1.3
1
10
7.4
5.3
3.2
2.3
0
2
4
6
8
10
12
2011 2012 2013 2014 2015
$$/GB
HDD MLC SDD SLC SSD
0.35
0.28
0.21
0.17
0.13
2.9
2.2
1.7
1.3
1
2.3
0.1
1
10
2011 2012 2013 2014 2015
$$/GB
HDD MLC SDD SLC SSD
Dell Software Group76
Tiered storage management
Main Memory
DDR SSD
Flash SSD
Fast Disk (SAS, RAID 0+1)
Slow Disk (SATA, RAID 5)
Tape, Flat Files, Hadoop
$/IOP
$/GB
Dell Software Group77
In-Memory databases
• Cost of RAM falling
50% each 18 months.
• Some databases can
fit entirely within the
RAM of a single server
or cluster of servers
0.001
0.01
0.1
1
10
100
$1.00
$10.00
$100.00
$1,000.00
$10,000.00
$100,000.00
1990 1995 2000 2005 2010 2015 2020
Size(GB)
Cost(US$/GB)
Year
US$/GB Size (GB)
Dell Software Group78
Oracle Times Ten
• In-memory transactional database
• Disk-based Checkpoints and disk-
based logging
• By default, COMMITs are not durable
(writes to the transaction log are
asynchronous).
• Can configure synchronous
replication or synchronous log writes
to avoid data loss
• Columnar compression and analytic
functions in the Exalytics version
Clients
Memory
Checkpoints
Transaction
Logs
Commits
Point in time
snapshot
Dell Software Group79
SAP Hana
Memory
Row Store
Column store
Delta store
Persistence Layer
Savepoints
Data files
Txn logs
Note: Table must be either row or column – not both
Dell Software Group80
Exalytics
Instantaneous!
Dell Software Group81
You keep using that word….
I do not think it means what
you think it meansInigo Montoya
Dell Software Group82
Exalytics
Hardware:
• 2 TB RAM
• 4 10GBe , 2
InfiniBand ports
• 6x1.2TB SAS (7.2 TB)
• 3x800GB (2.4TB) SSD
Software:
• Oracle BI
• ESSBase
• Oracle R
• Times-Ten
• 12c In-memory
Dell Software Group83
VoltDB
• Single threaded access
to memory: no
latch/mutex waits
• Transactions in self-
contained stored
procedures: minimal
locking
• K-Safety for COMMIT:
No sync waits
CPU
In-memory
Partition
CPU
In-memory
Partition
CPU
In-memory
Partition
CPU
In-memory
Partition
CPU
In-memory
Partition
CPU
In-memory
Partition
Clients Clients Clients
Dell Software Group84
Spark (sort of) in-memory Hadoop
• In Memory compute
• HDFS compatible
• Libraries for data processing,
machine learning, streaming,
SQL, etc
• Python and Scala interfaces
• Part of the Berkeley Data
Analytic Stack
• Integrating into all Hadoop
distributions (and Cassandra)
HDFS
Tachyon – in memory
File system
Spark: in-memory distributed compute
Spark
Streaming
Mlib
Machine
Learning
SparkSQL
Mesos Cluster manager
Dell Software Group85
Data files
Oracle 12c in-memory
Memory (SGA)
Row store Column Store (IMCU)
OLTP Analytics
(SMU)
database Column store
Redo Logs
86
Dell Software Group
What does all this
mean for me?
87
Dell Software Group
Trend #6:
shameless product
plugs will increase over
the next 120 seconds
Dell Software Group88
Toad: your
companion
in the Big
Data
revolution
Dell Software Group89
Dell Statistica
Dell Software Group90
Dell In-Memory Appliances for Cloudera Enterprise
Mid-Size Configuration
16 Node Cluster
R720- 4 Infrastructure Nodes
R720XD- 12 Data Nodes
Force10- S4810P
Force10- S55
~528TB (disk raw space)
~4.5 TB (raw memory)
Starter Configuration
8 Node Cluster
R720- 4 Infrastructure Nodes
R720XD- 4 Data Nodes
Force10- S55
~176TB (disk raw space)
~1.5TB (raw memory)
Small Enterprise
Configuration
24 Node Cluster
R720- 4 Infrastructure Nodes
R720XD- 20 Data Nodes
~880TB (disk raw space)
~7.5 TB (raw memory)
Expansion Unit- R720XD-4 Data, Cloudera Enterprise Data Hub, Scale in Blocks
Dell Software Group91
Dell appliances for any database
• Dell provides appliances and reference
architectures specifically designed for:
– Oracle
– SQL Server
– HANA
– SSD database acceleration
– Large memory footprints
Dell Software Group92
• Success in Big Data requires
capabilities at multiple
technology levels: hardware,
software infrastructure,
business intelligence and
analytics
• Only Dell can deliver
capabilities at every technology
layer
• Only Dell’s solutions are
designed and priced to suit
mid-market initial deployments
and to scale to the largest
enterprise
Data Integration
Hadoop and database
software
Advanced Analytics
Business Intelligence
Server and Storage
Boomi
Boomi,
Toad Intelligence Central
Dell appliances for Hadoop,
Oracle, etc
Dell servers and storage
arrays
Toad
Data
point
Statistica
Systems Management Dell Foglight and TOAD
Big Data for the rest of us
93
Dell Software Group
Thank you.
Please complete the session
evaluation
We appreciate your feedback and insight
You may complete the session evaluation either
on paper or online via the mobile app

Weitere ähnliche Inhalte

Was ist angesagt?

Big data and hadoop overvew
Big data and hadoop overvewBig data and hadoop overvew
Big data and hadoop overvewKunal Khanna
 
Big Data - A brief introduction
Big Data - A brief introductionBig Data - A brief introduction
Big Data - A brief introductionFrans van Noort
 
NoSQL – Back to the Future or Yet Another DB Feature?
NoSQL – Back to the Future or Yet Another DB Feature?NoSQL – Back to the Future or Yet Another DB Feature?
NoSQL – Back to the Future or Yet Another DB Feature?Martin Scholl
 
Big Data: an introduction
Big Data: an introductionBig Data: an introduction
Big Data: an introductionBart Vandewoestyne
 
Emergent Distributed Data Storage
Emergent Distributed Data StorageEmergent Distributed Data Storage
Emergent Distributed Data Storagehybrid cloud
 
So You Want to Build a Data Lake?
So You Want to Build a Data Lake?So You Want to Build a Data Lake?
So You Want to Build a Data Lake?David P. Moore
 
The Big Data Stack
The Big Data StackThe Big Data Stack
The Big Data StackZubair Nabi
 
Introduction to BIg Data and Hadoop
Introduction to BIg Data and HadoopIntroduction to BIg Data and Hadoop
Introduction to BIg Data and HadoopAmir Shaikh
 
Introduction to Apache Hadoop Eco-System
Introduction to Apache Hadoop Eco-SystemIntroduction to Apache Hadoop Eco-System
Introduction to Apache Hadoop Eco-SystemMd. Hasan Basri (Angel)
 
Data Warehouse on Hadoop Based System In Action
Data Warehouse on Hadoop Based System In ActionData Warehouse on Hadoop Based System In Action
Data Warehouse on Hadoop Based System In ActionFrank Y
 
Big Data Course - BigData HUB
Big Data Course - BigData HUBBig Data Course - BigData HUB
Big Data Course - BigData HUBAhmed Salman
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataHaluan Irsad
 
Big Data Platforms: An Overview
Big Data Platforms: An OverviewBig Data Platforms: An Overview
Big Data Platforms: An OverviewC. Scyphers
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and HadoopFlavio Vit
 
What is hadoop
What is hadoopWhat is hadoop
What is hadoopAsis Mohanty
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...Amr Awadallah
 
Relational databases vs Non-relational databases
Relational databases vs Non-relational databasesRelational databases vs Non-relational databases
Relational databases vs Non-relational databasesJames Serra
 

Was ist angesagt? (20)

A data analyst view of Bigdata
A data analyst view of Bigdata A data analyst view of Bigdata
A data analyst view of Bigdata
 
Big data and hadoop overvew
Big data and hadoop overvewBig data and hadoop overvew
Big data and hadoop overvew
 
Big Data - A brief introduction
Big Data - A brief introductionBig Data - A brief introduction
Big Data - A brief introduction
 
NoSQL – Back to the Future or Yet Another DB Feature?
NoSQL – Back to the Future or Yet Another DB Feature?NoSQL – Back to the Future or Yet Another DB Feature?
NoSQL – Back to the Future or Yet Another DB Feature?
 
Big Data: an introduction
Big Data: an introductionBig Data: an introduction
Big Data: an introduction
 
Emergent Distributed Data Storage
Emergent Distributed Data StorageEmergent Distributed Data Storage
Emergent Distributed Data Storage
 
So You Want to Build a Data Lake?
So You Want to Build a Data Lake?So You Want to Build a Data Lake?
So You Want to Build a Data Lake?
 
The Big Data Stack
The Big Data StackThe Big Data Stack
The Big Data Stack
 
Introduction to BIg Data and Hadoop
Introduction to BIg Data and HadoopIntroduction to BIg Data and Hadoop
Introduction to BIg Data and Hadoop
 
RDBMS vs NoSQL
RDBMS vs NoSQLRDBMS vs NoSQL
RDBMS vs NoSQL
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Introduction to Apache Hadoop Eco-System
Introduction to Apache Hadoop Eco-SystemIntroduction to Apache Hadoop Eco-System
Introduction to Apache Hadoop Eco-System
 
Data Warehouse on Hadoop Based System In Action
Data Warehouse on Hadoop Based System In ActionData Warehouse on Hadoop Based System In Action
Data Warehouse on Hadoop Based System In Action
 
Big Data Course - BigData HUB
Big Data Course - BigData HUBBig Data Course - BigData HUB
Big Data Course - BigData HUB
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big Data Platforms: An Overview
Big Data Platforms: An OverviewBig Data Platforms: An Overview
Big Data Platforms: An Overview
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
 
What is hadoop
What is hadoopWhat is hadoop
What is hadoop
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
 
Relational databases vs Non-relational databases
Relational databases vs Non-relational databasesRelational databases vs Non-relational databases
Relational databases vs Non-relational databases
 

Ähnlich wie Five database trends - updated April 2015

Yahoo!, Big Data, and Microsoft BI: Bigger and Better Together
Yahoo!, Big Data, and Microsoft BI: Bigger and Better TogetherYahoo!, Big Data, and Microsoft BI: Bigger and Better Together
Yahoo!, Big Data, and Microsoft BI: Bigger and Better TogetherDenny Lee
 
Hadoop at the Center: The Next Generation of Hadoop
Hadoop at the Center: The Next Generation of HadoopHadoop at the Center: The Next Generation of Hadoop
Hadoop at the Center: The Next Generation of HadoopAdam Muise
 
Graph Database Use Cases - StampedeCon 2015
Graph Database Use Cases - StampedeCon 2015Graph Database Use Cases - StampedeCon 2015
Graph Database Use Cases - StampedeCon 2015StampedeCon
 
Graph database Use Cases
Graph database Use CasesGraph database Use Cases
Graph database Use CasesMax De Marzi
 
Understanding Big Data And Hadoop
Understanding Big Data And HadoopUnderstanding Big Data And Hadoop
Understanding Big Data And HadoopEdureka!
 
Druid Adoption Tips and Tricks
Druid Adoption Tips and TricksDruid Adoption Tips and Tricks
Druid Adoption Tips and TricksImply
 
Drilling into Data with Apache Drill
Drilling into Data with Apache DrillDrilling into Data with Apache Drill
Drilling into Data with Apache DrillDataWorks Summit
 
Next Generation Hadoop Introduction
Next Generation Hadoop IntroductionNext Generation Hadoop Introduction
Next Generation Hadoop IntroductionAdam Muise
 
Drilling into Data with Apache Drill
Drilling into Data with Apache DrillDrilling into Data with Apache Drill
Drilling into Data with Apache DrillMapR Technologies
 
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...Steven Totman
 
Introduction to Sql on Hadoop
Introduction to Sql on HadoopIntroduction to Sql on Hadoop
Introduction to Sql on HadoopSamuel Yee
 
Big MDM Part 2: Using a Graph Database for MDM and Relationship Management
Big MDM Part 2: Using a Graph Database for MDM and Relationship ManagementBig MDM Part 2: Using a Graph Database for MDM and Relationship Management
Big MDM Part 2: Using a Graph Database for MDM and Relationship ManagementCaserta
 
The modern analytics architecture
The modern analytics architectureThe modern analytics architecture
The modern analytics architectureJoseph D'Antoni
 
IT Arena-2021
IT Arena-2021IT Arena-2021
IT Arena-2021b0ris_1
 
Tableau & MongoDB: Visual Analytics at the Speed of Thought
Tableau & MongoDB: Visual Analytics at the Speed of ThoughtTableau & MongoDB: Visual Analytics at the Speed of Thought
Tableau & MongoDB: Visual Analytics at the Speed of ThoughtMongoDB
 
DF1 - R - Natekin - Improving Daily Analysis with data.table
DF1 - R - Natekin - Improving Daily Analysis with data.tableDF1 - R - Natekin - Improving Daily Analysis with data.table
DF1 - R - Natekin - Improving Daily Analysis with data.tableMoscowDataFest
 
Add Redis to Postgres to Make Your Microservices Go Boom!
Add Redis to Postgres to Make Your Microservices Go Boom!Add Redis to Postgres to Make Your Microservices Go Boom!
Add Redis to Postgres to Make Your Microservices Go Boom!Dave Nielsen
 

Ähnlich wie Five database trends - updated April 2015 (20)

Yahoo!, Big Data, and Microsoft BI: Bigger and Better Together
Yahoo!, Big Data, and Microsoft BI: Bigger and Better TogetherYahoo!, Big Data, and Microsoft BI: Bigger and Better Together
Yahoo!, Big Data, and Microsoft BI: Bigger and Better Together
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop at the Center: The Next Generation of Hadoop
Hadoop at the Center: The Next Generation of HadoopHadoop at the Center: The Next Generation of Hadoop
Hadoop at the Center: The Next Generation of Hadoop
 
Graph Database Use Cases - StampedeCon 2015
Graph Database Use Cases - StampedeCon 2015Graph Database Use Cases - StampedeCon 2015
Graph Database Use Cases - StampedeCon 2015
 
Graph database Use Cases
Graph database Use CasesGraph database Use Cases
Graph database Use Cases
 
Big Data Platform Industrialization
Big Data Platform Industrialization Big Data Platform Industrialization
Big Data Platform Industrialization
 
Big Data Platform Industrialization
Big Data Platform Industrialization Big Data Platform Industrialization
Big Data Platform Industrialization
 
Understanding Big Data And Hadoop
Understanding Big Data And HadoopUnderstanding Big Data And Hadoop
Understanding Big Data And Hadoop
 
Druid Adoption Tips and Tricks
Druid Adoption Tips and TricksDruid Adoption Tips and Tricks
Druid Adoption Tips and Tricks
 
Drilling into Data with Apache Drill
Drilling into Data with Apache DrillDrilling into Data with Apache Drill
Drilling into Data with Apache Drill
 
Next Generation Hadoop Introduction
Next Generation Hadoop IntroductionNext Generation Hadoop Introduction
Next Generation Hadoop Introduction
 
Drilling into Data with Apache Drill
Drilling into Data with Apache DrillDrilling into Data with Apache Drill
Drilling into Data with Apache Drill
 
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
 
Introduction to Sql on Hadoop
Introduction to Sql on HadoopIntroduction to Sql on Hadoop
Introduction to Sql on Hadoop
 
Big MDM Part 2: Using a Graph Database for MDM and Relationship Management
Big MDM Part 2: Using a Graph Database for MDM and Relationship ManagementBig MDM Part 2: Using a Graph Database for MDM and Relationship Management
Big MDM Part 2: Using a Graph Database for MDM and Relationship Management
 
The modern analytics architecture
The modern analytics architectureThe modern analytics architecture
The modern analytics architecture
 
IT Arena-2021
IT Arena-2021IT Arena-2021
IT Arena-2021
 
Tableau & MongoDB: Visual Analytics at the Speed of Thought
Tableau & MongoDB: Visual Analytics at the Speed of ThoughtTableau & MongoDB: Visual Analytics at the Speed of Thought
Tableau & MongoDB: Visual Analytics at the Speed of Thought
 
DF1 - R - Natekin - Improving Daily Analysis with data.table
DF1 - R - Natekin - Improving Daily Analysis with data.tableDF1 - R - Natekin - Improving Daily Analysis with data.table
DF1 - R - Natekin - Improving Daily Analysis with data.table
 
Add Redis to Postgres to Make Your Microservices Go Boom!
Add Redis to Postgres to Make Your Microservices Go Boom!Add Redis to Postgres to Make Your Microservices Go Boom!
Add Redis to Postgres to Make Your Microservices Go Boom!
 

Mehr von Guy Harrison

From oracle to hadoop with Sqoop and other tools
From oracle to hadoop with Sqoop and other toolsFrom oracle to hadoop with Sqoop and other tools
From oracle to hadoop with Sqoop and other toolsGuy Harrison
 
Mega trends in information management
Mega trends in information managementMega trends in information management
Mega trends in information managementGuy Harrison
 
Big datacamp2013 share
Big datacamp2013 shareBig datacamp2013 share
Big datacamp2013 shareGuy Harrison
 
Hadoop, Oracle and the big data revolution collaborate 2013
Hadoop, Oracle and the big data revolution collaborate 2013Hadoop, Oracle and the big data revolution collaborate 2013
Hadoop, Oracle and the big data revolution collaborate 2013Guy Harrison
 
Hadoop, oracle and the industrial revolution of data
Hadoop, oracle and the industrial revolution of data Hadoop, oracle and the industrial revolution of data
Hadoop, oracle and the industrial revolution of data Guy Harrison
 
Making the most of ssd in oracle11g
Making the most of ssd in oracle11gMaking the most of ssd in oracle11g
Making the most of ssd in oracle11gGuy Harrison
 
Oracle sql high performance tuning
Oracle sql high performance tuningOracle sql high performance tuning
Oracle sql high performance tuningGuy Harrison
 
Hadoop and rdbms with sqoop
Hadoop and rdbms with sqoop Hadoop and rdbms with sqoop
Hadoop and rdbms with sqoop Guy Harrison
 
Next generation databases july2010
Next generation databases july2010Next generation databases july2010
Next generation databases july2010Guy Harrison
 
Optimize oracle on VMware (April 2011)
Optimize oracle on VMware (April 2011)Optimize oracle on VMware (April 2011)
Optimize oracle on VMware (April 2011)Guy Harrison
 
Optimizing Oracle databases with SSD - April 2014
Optimizing Oracle databases with SSD - April 2014Optimizing Oracle databases with SSD - April 2014
Optimizing Oracle databases with SSD - April 2014Guy Harrison
 
Understanding Solid State Disk and the Oracle Database Flash Cache (older ver...
Understanding Solid State Disk and the Oracle Database Flash Cache (older ver...Understanding Solid State Disk and the Oracle Database Flash Cache (older ver...
Understanding Solid State Disk and the Oracle Database Flash Cache (older ver...Guy Harrison
 
High Performance Plsql
High Performance PlsqlHigh Performance Plsql
High Performance PlsqlGuy Harrison
 
Performance By Design
Performance By DesignPerformance By Design
Performance By DesignGuy Harrison
 
Optimize Oracle On VMware (Sep 2011)
Optimize Oracle On VMware (Sep 2011)Optimize Oracle On VMware (Sep 2011)
Optimize Oracle On VMware (Sep 2011)Guy Harrison
 
Thanks for the Memory
Thanks for the MemoryThanks for the Memory
Thanks for the MemoryGuy Harrison
 
Top 10 tips for Oracle performance
Top 10 tips for Oracle performanceTop 10 tips for Oracle performance
Top 10 tips for Oracle performanceGuy Harrison
 
How I learned to stop worrying and love Oracle
How I learned to stop worrying and love OracleHow I learned to stop worrying and love Oracle
How I learned to stop worrying and love OracleGuy Harrison
 
Performance By Design
Performance By DesignPerformance By Design
Performance By DesignGuy Harrison
 
High Performance Plsql
High Performance PlsqlHigh Performance Plsql
High Performance PlsqlGuy Harrison
 

Mehr von Guy Harrison (20)

From oracle to hadoop with Sqoop and other tools
From oracle to hadoop with Sqoop and other toolsFrom oracle to hadoop with Sqoop and other tools
From oracle to hadoop with Sqoop and other tools
 
Mega trends in information management
Mega trends in information managementMega trends in information management
Mega trends in information management
 
Big datacamp2013 share
Big datacamp2013 shareBig datacamp2013 share
Big datacamp2013 share
 
Hadoop, Oracle and the big data revolution collaborate 2013
Hadoop, Oracle and the big data revolution collaborate 2013Hadoop, Oracle and the big data revolution collaborate 2013
Hadoop, Oracle and the big data revolution collaborate 2013
 
Hadoop, oracle and the industrial revolution of data
Hadoop, oracle and the industrial revolution of data Hadoop, oracle and the industrial revolution of data
Hadoop, oracle and the industrial revolution of data
 
Making the most of ssd in oracle11g
Making the most of ssd in oracle11gMaking the most of ssd in oracle11g
Making the most of ssd in oracle11g
 
Oracle sql high performance tuning
Oracle sql high performance tuningOracle sql high performance tuning
Oracle sql high performance tuning
 
Hadoop and rdbms with sqoop
Hadoop and rdbms with sqoop Hadoop and rdbms with sqoop
Hadoop and rdbms with sqoop
 
Next generation databases july2010
Next generation databases july2010Next generation databases july2010
Next generation databases july2010
 
Optimize oracle on VMware (April 2011)
Optimize oracle on VMware (April 2011)Optimize oracle on VMware (April 2011)
Optimize oracle on VMware (April 2011)
 
Optimizing Oracle databases with SSD - April 2014
Optimizing Oracle databases with SSD - April 2014Optimizing Oracle databases with SSD - April 2014
Optimizing Oracle databases with SSD - April 2014
 
Understanding Solid State Disk and the Oracle Database Flash Cache (older ver...
Understanding Solid State Disk and the Oracle Database Flash Cache (older ver...Understanding Solid State Disk and the Oracle Database Flash Cache (older ver...
Understanding Solid State Disk and the Oracle Database Flash Cache (older ver...
 
High Performance Plsql
High Performance PlsqlHigh Performance Plsql
High Performance Plsql
 
Performance By Design
Performance By DesignPerformance By Design
Performance By Design
 
Optimize Oracle On VMware (Sep 2011)
Optimize Oracle On VMware (Sep 2011)Optimize Oracle On VMware (Sep 2011)
Optimize Oracle On VMware (Sep 2011)
 
Thanks for the Memory
Thanks for the MemoryThanks for the Memory
Thanks for the Memory
 
Top 10 tips for Oracle performance
Top 10 tips for Oracle performanceTop 10 tips for Oracle performance
Top 10 tips for Oracle performance
 
How I learned to stop worrying and love Oracle
How I learned to stop worrying and love OracleHow I learned to stop worrying and love Oracle
How I learned to stop worrying and love Oracle
 
Performance By Design
Performance By DesignPerformance By Design
Performance By Design
 
High Performance Plsql
High Performance PlsqlHigh Performance Plsql
High Performance Plsql
 

KĂźrzlich hochgeladen

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel AraĂşjo
 

KĂźrzlich hochgeladen (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Five database trends - updated April 2015

  • 1. REMINDER Check in on the COLLABORATE mobile app Top 5 Trends in Database Technology Guy Harrison, Executive Director, Information Mgt R&D, Dell Software Group Session ID#: 995 @guyharrison
  • 2. Top 5 Trends in Database Technology Guy Harrison, Executive Director, Information Mgt R&D, Dell Software Group
  • 3. Dell Software Group3 Web: guyharrison.net Email: guy.harrison@software.dell.com Twitter: @guyharrison Introductions
  • 7. 7 Dell Software Group Dell and Quest – a brief history
  • 9. Dell Software Group9 5 Database Technology Trends The end of “one size fits all” Big Data and Hadoop NoSQL Columnar architectures The end of disk?
  • 10. 10 Dell Software Group Trend #1: The end of “one size fits all”
  • 11. Dell Software Group11 History of databases Magnetic tape “flat” (sequential) files Pre-computer technologies: Printing press Dewey decimal system Punched cards Magnetic Disk IMS Relational Model defined Indexed-Sequential Access Mechanism (ISAM) Network Model IDMS ADABAS System R Oracle V2 Ingres dBase DB2 Informix Sybase SQL Server Access Postgres MySQL Cassandra Hadoop Vertica Riak HBase Dynamo MongoDB Redis VoltDB Hana Neo4J Aerospike Hierarchical model 1960-701940-50 1950-60 1970-80 1980-90 1990-2000 2000-2010
  • 12. Dell Software Group12 Why? • 3rd Platform drives new demands on the database: – Global High Availability – Data volumes – Unstructured data – Transaction rates – Latency • A single architecture cannot meet all those demands
  • 13. Dell Software Group13 Operational RDBMS (Oracle, SQL Server, …) In-memory Analytics (HANA, Exalytics …) In-memory processing (Spark) Hadoop Web DBMS (MySQL, Mongo, Cassandra) ERP & in- house CRM Analytic/BI software (SAS, Tableau) Web Server Data Warehouse RDBMS (Oracle, Terradata …) It takes all sorts
  • 14. Dell Software Group14 Oracle engineered systems
  • 15. 15 Dell Software Group Trend #2: Big Data and Hadoop
  • 16. Dell Software Group16 The 3-4 “V”s Volume •Terabytes •Petabytes •Exabytes •Zetabytes Velocity •Transaction rates •User populations •Machines Variety •Structured •Unstructured •Human Generated •Machine Generated Value
  • 17. 17 Dell Software Group The Industrial revolution of data
  • 20. Dell Software Group20 The instrumented human • Bluetooth Personal Area Network • 3G/WiFi Wide Area Network • GPS • Storage • Pulse, temp monitor • Silent alarms • Pedometer, sleep monitoring • Compass • Camera • Mike/earphones • Heads up display • Emotion/Attention monitor
  • 22. Dell Software Group22 The instrumented world
  • 23. Dell Software Group23 Big Data is the culmination of cloud, social and mobile
  • 24. Dell Software Group24 More Data • Storing all data – including machine generated and sol, Social, community, demographic data in original format – for ever To More Effect • Smarter use of data (data science) to achieve competitive or human benefit
  • 25. Dell Software Group25 More Data • Storing all data – including machine generated and sol, Social, community, demographic data in original format – for ever To More Effect • Smarter use of data (data science) to achieve competitive or human benefit
  • 32. Dell Software Group32 Google File System (GFS) Map Reduce BigTable Google Applications Google Software Architecture (circa 2005)
  • 33. Dell Software Group33 Start ReduceMap Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Map Reduce
  • 34. Dell Software Group34 Hadoop: 1.0: Open Source Map-Reduce Stack
  • 35. Dell Software Group35 Hadoop at Yahoo • 2010(biggest cluster): • 4000 nodes 16PB disk • 64 TB of RAM • 32,000 Cores • 2014: – 16 Clusters – 32,500 nodes
  • 37. Dell Software Group37 SQOOP (RDBMS loader) Hive (Query) Pig (Scripting) Flume (Log Loader) Oozie (Workflow manager) Hadoop File System (HDFS) Map Reduce / YARN Hbase (database) Zookeeper (locking) Hadoop family
  • 38. Dell Software Group39 Economies $4,911 $750 $0 $1,000 $2,000 $3,000 $4,000 $5,000 $6,000 Exadata Hadoop Exadata vs Hadoop $$/TB (Hardware only)
  • 39. Dell Software Group40 Hadoop is the most concrete Big Data technology Toad: your companion in the Big Data revolution
  • 40. Dell Software Group41 More Data • Storing all data – including machine generated and sol, Social, community, demographic data in original format – for ever To More Effect • Smarter use of data (data science) to achieve competitive or human benefit
  • 41. Dell Software Group42 More Data • Storing all data – including machine generated and sol, Social, community, demographic data in original format – for ever To More Effect • Smarter use of data (data science) to achieve competitive or human benefit
  • 42. Dell Software Group43 Big Data Analytics AKA Data Science Machine Learning • Programs that evolve with “experience Predictive Analytics • Programs that extrapolate from past to future Collective Intelligence • Programs that use inputs from “crowds” to simulate intelligence
  • 44. Dell Software Group45 Collective Intelligence From now on, I’ll call you ‘An Ambulance’. OK? “Siri call me an ambulance”
  • 46. Dell Software Group48 Web Servers Database Servers Memcached Servers Shard (G-O) Shard (P-Z)Shard (A-F) Read Only Slaves
  • 47. Dell Software Group49 CAP Theorem says something has to give • CAP (Brewer’s) Theorem says you can only have two out of three of Consistency, Partition Tolerance, Availability Consistency • Everyone always sees the same data Availability • System stays up when nodes fail Partition Tolerance • System stays up when network between nodes fail Oracle RAC lives here NO GO Most NoSQL lives here
  • 48. Dell Software Group50 Major influences on non-relational • Eventually consistent transaction model • Consistent hashing Amazon Dynamo • Column Family model for sparse distributed columnar data Google BigTable • Paved the way for the document database OODBMS and XML DBs
  • 50. Dell Software Group52 Name Site Counter Dick Ebay 507,018 Dick Google 690,414 Jane Google 716,426 Dick Facebook 723,649 Jane Facebook 643,261 Jane ILoveLarry.com 856,767 Dick MadBillFans.com 675,230 NameId Name 1 Dick 2 Jane SiteId SiteName 1 Ebay 2 Google 3 Facebook 4 ILoveLarry.com 5 MadBillFans.com NameId SiteId Counter 1 1 507,018 1 3 690,414 2 3 716,426 1 3 723,649 2 3 643,261 2 4 856,767 1 5 675,230 Id Name Ebay Google Facebook (other columns) MadBillFans.com 1 Dick 507,018 690,414 723,649 . . . . . . . . . . . . . . 675,230 Id Name Google Facebook (other columns) ILoveLarry.com 2 Jane 716,426 643,261 . . . . . . . . . . . . . . 856,767 BigTable Data Model
  • 51. Dell Software Group53 OODBMS -1990s • The OODBMS Manifesto (Atkinson/Bancilhon/DeWitt/Dittrich/Maier/Zdo nik, '90) • "A relational database is like a garage that forces you to take your car apart and store the pieces in little drawers“ – Also SQL is ugly • “A Object database is like a closet which requires that you hang up your suit with tie, underwear, belt socks and shoes all attached” (Dave Ensor) http://4.bp.blogspot.com/- IPgd1Tg8ByE/UkOzH- g1FmI/AAAAAAAACB0/QYg8kE Vp5_0/s1600/db4o_vs_orm.png
  • 52. Dell Software Group54 Revenge of the Object Nerds – Document databases • Structured documents – XML and JSON (JavaScript Object Notation) become more prevalent within applications • Web programmers start storing these in BLOBS in MySQL • Emergence of XML and JSON databases
  • 53. Dell Software Group55 Graph Database Neo4J Infinite Graph FlockDB Document JSON based MongoDB CouchDB RethinkDB XML based MarkLogic BerkeleyDB XML Key Value Memchache DB Oracle NoSQL Dynamo Voldemort DynamoDB Riak Table Based BigTable Cassandra Hbase HyperTable Accumulo
  • 54. Dell Software Group56 It’s not a database, it’s a key value store http://browsertoolkit.com/fault-tolerance.png
  • 56. 58 Dell Software Group Trend #4: Column-oriented DB
  • 57. Dell Software Group59 Row orientation vs column orientation ID Name DOB Salary Sales Expenses 1001 Dick 21/12/60 67,000 78980 3244 1002 Jane 12/12/55 55,000 67840 2333 1003 Robert 17/02/80 22,000 67890 6436 1004 Dan 15/03/75 65,200 98770 2345 1005 Steven 11/11/81 76,000 43240 3214 Block ID Name DOB Salary Sales Expenses 1 1001 Dick 21/12/60 67,000 78980 3244 2 1002 Jane 12/12/55 55,000 67840 2333 3 1003 Robert 17/02/80 22,000 67890 6436 4 1004 Dan 15/03/75 65,200 98770 2345 5 1005 Steven 11/11/81 76,000 43240 3214 Block 1 Dick Jane Robert Dan Steven 2 21/12/60 12/12/55 17/02/80 15/03/75 11/11/81 3 67,000 55,000 22,000 65,200 76,000 4 78980 67840 67890 98770 43240 5 3244 2333 6436 2345 3214 Row oriented database Column oriented database
  • 58. Dell Software Group60 Analytical Queries Block ID Name DOB Salary Sales Expenses 1 1001 Dick 21/12/60 67,000 78980 3244 2 1002 Jane 12/12/55 55,000 67840 2333 3 1003 Robert 17/02/80 22,000 67890 6436 4 1004 Dan 15/03/75 65,200 98770 2345 5 1005 Steven 11/11/81 76,000 43240 3214 Block 1 Dick Jane Robert Dan Steven 2 21/12/60 12/12/55 17/02/80 15/03/75 11/11/81 3 67,000 55,000 22,000 65,200 76,000 4 78980 67840 67890 98770 43240 5 3244 2333 6436 2345 3214 Row oriented database Column oriented database SELECT SUM(salary) FROM saleperson
  • 59. Dell Software Group61 Compression Block ID Name DOB Salary Sales Expenses 1 1001 Dick 21/12/60 67,000 78980 3244 2 1002 Jane 12/12/55 55,000 67840 2333 3 1003 Robert 17/02/80 22,000 67890 6436 4 1004 Dan 15/03/75 65,200 98770 2345 5 1005 Steven 11/11/81 76,000 43240 3214 Block 1 Dick Jane Robert Dan Steven 2 21/12/60 12/12/55 17/02/80 15/03/75 11/11/81 3 67,000 55,000 22,000 65,200 76,000 4 78980 67840 67890 98770 43240 5 3244 2333 6436 2345 3214 Row oriented database Column oriented database Poor compression ratio (low repetition) Good compression ratio (high repetition)
  • 60. Dell Software Group62 Inserts Block ID Name DOB Salary Sales Expenses 1 1001 Dick 21/12/60 67,000 78980 3244 2 1002 Jane 12/12/55 55,000 67840 2333 3 1003 Robert 17/02/80 22,000 67890 6436 4 1004 Dan 15/03/75 65,200 98770 2345 5 1005 Steven 11/11/81 76,000 43240 3214 Block 1 Dick Jane Robert Dan Steven 2 21/12/60 12/12/55 17/02/80 15/03/75 11/11/81 3 67,000 55,000 22,000 65,200 76,000 4 78980 67840 67890 98770 43240 5 3244 2333 6436 2345 3214 Row oriented database Column oriented database INSERT INTO salesperson
  • 61. Dell Software Group63 C-Store (Vertica) Solution for inserts Read Optimized Store • Columnar • Disk-based • Highly Compressed • Bulk loadable Write Optimized Store • Row oriented • Uncompressed • Single row inserts Asynchronous Tuple Mover Bulk sequential loads Continual Parallel inserts Merged Query
  • 62. Dell Software Group64 Exadata Hybrid Columnar Compression (EHCC) Compression Unit (~<1M) Block (8K) Block Block Block Column 1 Column 2 Column 3 Column 4 Row Row Row
  • 63. Dell Software Group65 Exadata Hybrid Columnar Compression • Provides high compression ratio • Manageable impact on row read/write operations • Some optimization of analytic queries SELECT SUM(Column4) FROM table
  • 64. 66 Dell Software Group Trend #5: The End of Disk?
  • 65. Dell Software Group67 5MB HDD circa 1956
  • 66. Dell Software Group68 The more that things change....
  • 67. Dell Software Group69 Faster or slower? 260 1,635 -630 1,013 -390 -1,000 -500 0 500 1,000 1,500 2,000 IO Rate Disk Capacity IO/Capacity CPU IO/CPU %age change
  • 68. Dell Software Group70 Solid state disk to the rescue • DDR RAM Drive • SATA flash drive • PCI flash drive • SSD storage Server
  • 69. Dell Software Group71 Cheaper by the IO 4,000 80 25 15 0 1,000 2,000 3,000 4,000 5,000 Magnetic Disk SSD SATA Flash SSD PCI flash SSD DDR-RAM Seek time (us)
  • 70. Dell Software Group72 But not by the GB 0.35 0.28 0.21 0.17 0.13 2.9 2.2 1.7 1.3 1 10 7.4 5.3 3.2 2.3 0 2 4 6 8 10 12 2011 2012 2013 2014 2015 $$/GB HDD MLC SDD SLC SSD 0.35 0.28 0.21 0.17 0.13 2.9 2.2 1.7 1.3 1 2.3 0.1 1 10 2011 2012 2013 2014 2015 $$/GB HDD MLC SDD SLC SSD
  • 71. Dell Software Group76 Tiered storage management Main Memory DDR SSD Flash SSD Fast Disk (SAS, RAID 0+1) Slow Disk (SATA, RAID 5) Tape, Flat Files, Hadoop $/IOP $/GB
  • 72. Dell Software Group77 In-Memory databases • Cost of RAM falling 50% each 18 months. • Some databases can fit entirely within the RAM of a single server or cluster of servers 0.001 0.01 0.1 1 10 100 $1.00 $10.00 $100.00 $1,000.00 $10,000.00 $100,000.00 1990 1995 2000 2005 2010 2015 2020 Size(GB) Cost(US$/GB) Year US$/GB Size (GB)
  • 73. Dell Software Group78 Oracle Times Ten • In-memory transactional database • Disk-based Checkpoints and disk- based logging • By default, COMMITs are not durable (writes to the transaction log are asynchronous). • Can configure synchronous replication or synchronous log writes to avoid data loss • Columnar compression and analytic functions in the Exalytics version Clients Memory Checkpoints Transaction Logs Commits Point in time snapshot
  • 74. Dell Software Group79 SAP Hana Memory Row Store Column store Delta store Persistence Layer Savepoints Data files Txn logs Note: Table must be either row or column – not both
  • 76. Dell Software Group81 You keep using that word…. I do not think it means what you think it meansInigo Montoya
  • 77. Dell Software Group82 Exalytics Hardware: • 2 TB RAM • 4 10GBe , 2 InfiniBand ports • 6x1.2TB SAS (7.2 TB) • 3x800GB (2.4TB) SSD Software: • Oracle BI • ESSBase • Oracle R • Times-Ten • 12c In-memory
  • 78. Dell Software Group83 VoltDB • Single threaded access to memory: no latch/mutex waits • Transactions in self- contained stored procedures: minimal locking • K-Safety for COMMIT: No sync waits CPU In-memory Partition CPU In-memory Partition CPU In-memory Partition CPU In-memory Partition CPU In-memory Partition CPU In-memory Partition Clients Clients Clients
  • 79. Dell Software Group84 Spark (sort of) in-memory Hadoop • In Memory compute • HDFS compatible • Libraries for data processing, machine learning, streaming, SQL, etc • Python and Scala interfaces • Part of the Berkeley Data Analytic Stack • Integrating into all Hadoop distributions (and Cassandra) HDFS Tachyon – in memory File system Spark: in-memory distributed compute Spark Streaming Mlib Machine Learning SparkSQL Mesos Cluster manager
  • 80. Dell Software Group85 Data files Oracle 12c in-memory Memory (SGA) Row store Column Store (IMCU) OLTP Analytics (SMU) database Column store Redo Logs
  • 81. 86 Dell Software Group What does all this mean for me?
  • 82. 87 Dell Software Group Trend #6: shameless product plugs will increase over the next 120 seconds
  • 83. Dell Software Group88 Toad: your companion in the Big Data revolution
  • 85. Dell Software Group90 Dell In-Memory Appliances for Cloudera Enterprise Mid-Size Configuration 16 Node Cluster R720- 4 Infrastructure Nodes R720XD- 12 Data Nodes Force10- S4810P Force10- S55 ~528TB (disk raw space) ~4.5 TB (raw memory) Starter Configuration 8 Node Cluster R720- 4 Infrastructure Nodes R720XD- 4 Data Nodes Force10- S55 ~176TB (disk raw space) ~1.5TB (raw memory) Small Enterprise Configuration 24 Node Cluster R720- 4 Infrastructure Nodes R720XD- 20 Data Nodes ~880TB (disk raw space) ~7.5 TB (raw memory) Expansion Unit- R720XD-4 Data, Cloudera Enterprise Data Hub, Scale in Blocks
  • 86. Dell Software Group91 Dell appliances for any database • Dell provides appliances and reference architectures specifically designed for: – Oracle – SQL Server – HANA – SSD database acceleration – Large memory footprints
  • 87. Dell Software Group92 • Success in Big Data requires capabilities at multiple technology levels: hardware, software infrastructure, business intelligence and analytics • Only Dell can deliver capabilities at every technology layer • Only Dell’s solutions are designed and priced to suit mid-market initial deployments and to scale to the largest enterprise Data Integration Hadoop and database software Advanced Analytics Business Intelligence Server and Storage Boomi Boomi, Toad Intelligence Central Dell appliances for Hadoop, Oracle, etc Dell servers and storage arrays Toad Data point Statistica Systems Management Dell Foglight and TOAD Big Data for the rest of us
  • 89. Please complete the session evaluation We appreciate your feedback and insight You may complete the session evaluation either on paper or online via the mobile app

Hinweis der Redaktion

  1. When you think about Dell you probably think about laptops
  2. Or servers that might run databases or a Hadoop cluster, but you probably don't think of Dell as having expertise in either Oracle or Hadoop
  3. But actually Dell now has a billion-dollar software arm which includes the world's number one independent database tool – toad – used by millions of users and supporting almost every data platform
  4. NoSQL tends to be strongly coupled with the application. Everybody else is out of luck
  5. In 2000 a 1 TB database would have required 200 500 GB disks - with an aggregate IO capacity of around 2000 IO per second. Today that database could be supported in a single 1 TB disc but which would only support 200 I/O is per second