SlideShare ist ein Scribd-Unternehmen logo
1 von 15
Sunil Sayyaparaju, Citrusleaf Inc
Agenda
 Evolution of SQL RDBMS
 Need to break out
 Fresh Thinking
 Spectrum of databases
 Future
Evolution of SQL RDBMS
   Data management started with flat files
   1960: Navigational DBMS
     Iterate over entire file on tape. No search
   1970: Relational DBMS
       God sent Codd
       Then came tables, keys, normalization
       Adopted tuple calculus to form basis for SQL
       System R and Ingres were born
        ○ gave birth to DB2, Sybase, Informix, Oracle
   1980: Object-oriented databases
   2000: In-memory, XML databases
   2000: Distributed Shared-disk databases
Need to break out
   More and more data continued to pour in
   Storage costs went up
     Were offset by cheaper and larger disks
   Speed went down
     Were offset by powerful machines
     Were offset by several optimizations
   Cost went up
     Large businesses could bear it
     But small businesses ???
   24X7 uptime became necessary
     Uhh ohhh
   Flexibility of DB schema
     Uhh ohhh
Distributed Shared-disk Model
 Multiple machines sharing a disk
 Data copies in cache, single copy on disk
 Advantages
     Could scale well in reads
     Add/Remove individual nodes
   Hauntings
       Write scalability, Locking
       Maintaining transaction semantics
       Communication between nodes
       Invalidating old replicated data on write
   Workaround: Redesign applications
     To exploit this model
     Called well-partitioned applications
   $M Question: If I redesign my application, why not a
    totally new model ?
Evils of 24x7 uptime
   Evils :
     s/w or h/w upgrades
     Failures
     Routine maintenance
     DB Schema changes
   Workaround:
     Replicate data and switch
     Problem: Needs manual intervention
Fresh Thinking
   I want
       24x7 uptime without manual intervention
       Flexibility in my database schema
       Speed and Predictability
       Vertical and horizontal scalability
   I don’t want
     Splurging money on software and hardware
     Overheads unrelated to my use-case
   I can loose (Most important)
     Attitude: I know to manage my data
        ○ Several applications already do that. For e.g SAP R/3
     Joins, Multi-record transactions
     Complex query functionality
     SQL altogether
Let us do some housecleaning
   Full blown RDBMS                    Cutdown RDBMS

Query Compilation      Query Compilation




Query Optimization

Query Execution        Query Execution




Transaction Engine     Transaction Engine




Storage & Access           Storage & Access
Who does not want features ?


                                        Formula1 Car   Sedan Car
 Fuel Efficient ?                            No           Yes
 Can it carry my family ?                    No           Yes
 Does it have a 6 disk audio player ?        No           Yes
 Does it have airbags ?                      No           Yes

   Then why will someone buy F1 Car ?
     Because it goes amazingly fast
     Its does best what it is designed for

Trivia: Why F1 cars don’t have airbags ?
Let there be NoSQL
   Started as No-SQL
   Some evolved into Not-Only-SQL
   Horizontal scalability is assumed
   Supports latest hardware like SSDs etc
   Different flavors of NoSQL
       Targeted for different use-cases
       Key-value stores
       Ordered Key-value stores
       Document stores with text search
       Graph databases
Spectrum of Databases
            NoSQL   Lotus Notes                         Citrusleaf
                    ObjectDB                            Mongo
                    Versant                             Cassandra
                    Zope                                Redis
                                                        MySQL NDB
SQL/NoSQL




            SQL     Oracle              Oracle RAC      HP Nonstop
                    DB2                 Sybase SDC      VoltDB
                    MS-SQL              IBM PureScale
                    Sybase ASE          ScaleDB
                    MySQL

                    Monolithic          Distributed     Distributed
                                        Shared-disk     Shared-nothing

                                  Distributedness
NoSQL Datamodels
Future: Fortunate/Unfortunate ?
             NoSQL                                    Citrusleaf
                                                      Mongo
                                                      Cassandra
                                                      Redis
                                                      MySQL NDB
SQL/NoSQL




             SQL     Oracle
                     DB2
                     MS-SQL
                     Sybase ASE
                     MySQL

                     Monolithic         Distributed   Distributed
                                        Shared-disk   Shared-nothing

                                  Distributedness
Future: More Storage roles

                 Application




 Hadoop   Hadoop           Hadoop   Hadoop
  Job      Job              Job      Job




 HDFS     HDFS             Mongo    Citrusleaf
Conclusion
   You cannot just replace SQL with NoSQL
   You loose some features when you go to NoSQL
   You have to put extra effort to use NoSQL
   Make sure that NoSQL is not as fat as SQL

   NoSQL solves subset of/specific problems but well
   NoSQL is lean and mean
   NoSQL is designed to be highly available
   NoSQL does not demand powerful hardware

Weitere ähnliche Inhalte

Was ist angesagt?

In-memory Database and MySQL Cluster
In-memory Database and MySQL ClusterIn-memory Database and MySQL Cluster
In-memory Database and MySQL Cluster
grandis_au
 
Whitepaper_Cassandra_Datastax_Final
Whitepaper_Cassandra_Datastax_FinalWhitepaper_Cassandra_Datastax_Final
Whitepaper_Cassandra_Datastax_Final
Michele Hunter
 
Postgres_9.0 vs MySQL_5.5
Postgres_9.0 vs MySQL_5.5Postgres_9.0 vs MySQL_5.5
Postgres_9.0 vs MySQL_5.5
Trieu Dao Minh
 
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
Qian Lin
 

Was ist angesagt? (20)

Databases in the Cloud
Databases in the CloudDatabases in the Cloud
Databases in the Cloud
 
NoSQL databases - An introduction
NoSQL databases - An introductionNoSQL databases - An introduction
NoSQL databases - An introduction
 
In-memory Database and MySQL Cluster
In-memory Database and MySQL ClusterIn-memory Database and MySQL Cluster
In-memory Database and MySQL Cluster
 
NoSQL Databases: Why, what and when
NoSQL Databases: Why, what and whenNoSQL Databases: Why, what and when
NoSQL Databases: Why, what and when
 
Whitepaper_Cassandra_Datastax_Final
Whitepaper_Cassandra_Datastax_FinalWhitepaper_Cassandra_Datastax_Final
Whitepaper_Cassandra_Datastax_Final
 
Apache Cassandra
Apache CassandraApache Cassandra
Apache Cassandra
 
NoSQL in Real-time Architectures
NoSQL in Real-time ArchitecturesNoSQL in Real-time Architectures
NoSQL in Real-time Architectures
 
No sql databases explained
No sql databases explainedNo sql databases explained
No sql databases explained
 
What is Apache Cassandra? | Apache Cassandra Tutorial | Apache Cassandra Intr...
What is Apache Cassandra? | Apache Cassandra Tutorial | Apache Cassandra Intr...What is Apache Cassandra? | Apache Cassandra Tutorial | Apache Cassandra Intr...
What is Apache Cassandra? | Apache Cassandra Tutorial | Apache Cassandra Intr...
 
Rdbms vs. no sql
Rdbms vs. no sqlRdbms vs. no sql
Rdbms vs. no sql
 
Running MySQL in AWS
Running MySQL in AWSRunning MySQL in AWS
Running MySQL in AWS
 
NewSQL
NewSQLNewSQL
NewSQL
 
MySQL Cluster Schema management (2014)
MySQL Cluster Schema management (2014)MySQL Cluster Schema management (2014)
MySQL Cluster Schema management (2014)
 
The Great Debate: PostgreSQL vs MySQL
The Great Debate: PostgreSQL vs MySQLThe Great Debate: PostgreSQL vs MySQL
The Great Debate: PostgreSQL vs MySQL
 
Postgres_9.0 vs MySQL_5.5
Postgres_9.0 vs MySQL_5.5Postgres_9.0 vs MySQL_5.5
Postgres_9.0 vs MySQL_5.5
 
SQLBits X Scaling out with SQL Azure Federations
SQLBits X Scaling out with SQL Azure FederationsSQLBits X Scaling out with SQL Azure Federations
SQLBits X Scaling out with SQL Azure Federations
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
Elephants vs. Dolphins: Comparing PostgreSQL and MySQL for use in the DoD
Elephants vs. Dolphins:  Comparing PostgreSQL and MySQL for use in the DoDElephants vs. Dolphins:  Comparing PostgreSQL and MySQL for use in the DoD
Elephants vs. Dolphins: Comparing PostgreSQL and MySQL for use in the DoD
 
NoSQL
NoSQLNoSQL
NoSQL
 
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
 

Andere mochten auch

Andere mochten auch (7)

Winning the big data revolution: what businesses leaders need to know
Winning the big data revolution: what businesses leaders need to knowWinning the big data revolution: what businesses leaders need to know
Winning the big data revolution: what businesses leaders need to know
 
Big Data Revolution - Copec Big Data Event
Big Data Revolution - Copec Big Data EventBig Data Revolution - Copec Big Data Event
Big Data Revolution - Copec Big Data Event
 
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
 
Big Data Revolution: Increasing Transparency to Risk and Valuation
Big Data Revolution: Increasing Transparency to Risk and ValuationBig Data Revolution: Increasing Transparency to Risk and Valuation
Big Data Revolution: Increasing Transparency to Risk and Valuation
 
Hadoop, Oracle and the big data revolution collaborate 2013
Hadoop, Oracle and the big data revolution collaborate 2013Hadoop, Oracle and the big data revolution collaborate 2013
Hadoop, Oracle and the big data revolution collaborate 2013
 
Riot Games Scalable Data Warehouse Lecture at UCSB / UCLA
Riot Games Scalable Data Warehouse Lecture at UCSB / UCLARiot Games Scalable Data Warehouse Lecture at UCSB / UCLA
Riot Games Scalable Data Warehouse Lecture at UCSB / UCLA
 
Big Data Revolution: Are You Ready for the Data Overload?
Big Data Revolution: Are You Ready for the Data Overload?Big Data Revolution: Are You Ready for the Data Overload?
Big Data Revolution: Are You Ready for the Data Overload?
 

Ähnlich wie How big data moved the needle from monolithic SQL RDBMS to distributed NoSQL

NO SQL: What, Why, How
NO SQL: What, Why, HowNO SQL: What, Why, How
NO SQL: What, Why, How
Igor Moochnick
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
Adi Challa
 

Ähnlich wie How big data moved the needle from monolithic SQL RDBMS to distributed NoSQL (20)

Nosql seminar
Nosql seminarNosql seminar
Nosql seminar
 
Minnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with CassandraMinnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with Cassandra
 
Unit II -BIG DATA ANALYTICS.docx
Unit II -BIG DATA ANALYTICS.docxUnit II -BIG DATA ANALYTICS.docx
Unit II -BIG DATA ANALYTICS.docx
 
NoSQL
NoSQLNoSQL
NoSQL
 
Sql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explainedSql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explained
 
Introduction to NoSQL | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to NoSQL | Big Data Hadoop Spark Tutorial | CloudxLabIntroduction to NoSQL | Big Data Hadoop Spark Tutorial | CloudxLab
Introduction to NoSQL | Big Data Hadoop Spark Tutorial | CloudxLab
 
Vote NO for MySQL
Vote NO for MySQLVote NO for MySQL
Vote NO for MySQL
 
SQL vs NoSQL deep dive
SQL vs NoSQL deep diveSQL vs NoSQL deep dive
SQL vs NoSQL deep dive
 
Why no sql ? Why Couchbase ?
Why no sql ? Why Couchbase ?Why no sql ? Why Couchbase ?
Why no sql ? Why Couchbase ?
 
NO SQL: What, Why, How
NO SQL: What, Why, HowNO SQL: What, Why, How
NO SQL: What, Why, How
 
If NoSQL is your answer, you are probably asking the wrong question.
If NoSQL is your answer, you are probably asking the wrong question.If NoSQL is your answer, you are probably asking the wrong question.
If NoSQL is your answer, you are probably asking the wrong question.
 
Unit 3 MongDB
Unit 3 MongDBUnit 3 MongDB
Unit 3 MongDB
 
Navigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skiesNavigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skies
 
NoSQL - Not Only SQL
NoSQL - Not Only SQLNoSQL - Not Only SQL
NoSQL - Not Only SQL
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
 
How and when to use NoSQL
How and when to use NoSQLHow and when to use NoSQL
How and when to use NoSQL
 
Enterprise NoSQL: Silver Bullet or Poison Pill
Enterprise NoSQL: Silver Bullet or Poison PillEnterprise NoSQL: Silver Bullet or Poison Pill
Enterprise NoSQL: Silver Bullet or Poison Pill
 
Nashville analytics summit aug9 no sql mike king dell v1.5
Nashville analytics summit aug9 no sql mike king dell v1.5Nashville analytics summit aug9 no sql mike king dell v1.5
Nashville analytics summit aug9 no sql mike king dell v1.5
 
NoSql Databases
NoSql DatabasesNoSql Databases
NoSql Databases
 
Sql vs NoSQL-Presentation
 Sql vs NoSQL-Presentation Sql vs NoSQL-Presentation
Sql vs NoSQL-Presentation
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

How big data moved the needle from monolithic SQL RDBMS to distributed NoSQL

  • 2. Agenda  Evolution of SQL RDBMS  Need to break out  Fresh Thinking  Spectrum of databases  Future
  • 3. Evolution of SQL RDBMS  Data management started with flat files  1960: Navigational DBMS  Iterate over entire file on tape. No search  1970: Relational DBMS  God sent Codd  Then came tables, keys, normalization  Adopted tuple calculus to form basis for SQL  System R and Ingres were born ○ gave birth to DB2, Sybase, Informix, Oracle  1980: Object-oriented databases  2000: In-memory, XML databases  2000: Distributed Shared-disk databases
  • 4. Need to break out  More and more data continued to pour in  Storage costs went up  Were offset by cheaper and larger disks  Speed went down  Were offset by powerful machines  Were offset by several optimizations  Cost went up  Large businesses could bear it  But small businesses ???  24X7 uptime became necessary  Uhh ohhh  Flexibility of DB schema  Uhh ohhh
  • 5. Distributed Shared-disk Model  Multiple machines sharing a disk  Data copies in cache, single copy on disk  Advantages  Could scale well in reads  Add/Remove individual nodes  Hauntings  Write scalability, Locking  Maintaining transaction semantics  Communication between nodes  Invalidating old replicated data on write  Workaround: Redesign applications  To exploit this model  Called well-partitioned applications  $M Question: If I redesign my application, why not a totally new model ?
  • 6. Evils of 24x7 uptime  Evils :  s/w or h/w upgrades  Failures  Routine maintenance  DB Schema changes  Workaround:  Replicate data and switch  Problem: Needs manual intervention
  • 7. Fresh Thinking  I want  24x7 uptime without manual intervention  Flexibility in my database schema  Speed and Predictability  Vertical and horizontal scalability  I don’t want  Splurging money on software and hardware  Overheads unrelated to my use-case  I can loose (Most important)  Attitude: I know to manage my data ○ Several applications already do that. For e.g SAP R/3  Joins, Multi-record transactions  Complex query functionality  SQL altogether
  • 8. Let us do some housecleaning  Full blown RDBMS  Cutdown RDBMS Query Compilation Query Compilation Query Optimization Query Execution Query Execution Transaction Engine Transaction Engine Storage & Access Storage & Access
  • 9. Who does not want features ? Formula1 Car Sedan Car Fuel Efficient ? No Yes Can it carry my family ? No Yes Does it have a 6 disk audio player ? No Yes Does it have airbags ? No Yes  Then why will someone buy F1 Car ?  Because it goes amazingly fast  Its does best what it is designed for Trivia: Why F1 cars don’t have airbags ?
  • 10. Let there be NoSQL  Started as No-SQL  Some evolved into Not-Only-SQL  Horizontal scalability is assumed  Supports latest hardware like SSDs etc  Different flavors of NoSQL  Targeted for different use-cases  Key-value stores  Ordered Key-value stores  Document stores with text search  Graph databases
  • 11. Spectrum of Databases NoSQL Lotus Notes Citrusleaf ObjectDB Mongo Versant Cassandra Zope Redis MySQL NDB SQL/NoSQL SQL Oracle Oracle RAC HP Nonstop DB2 Sybase SDC VoltDB MS-SQL IBM PureScale Sybase ASE ScaleDB MySQL Monolithic Distributed Distributed Shared-disk Shared-nothing Distributedness
  • 13. Future: Fortunate/Unfortunate ? NoSQL Citrusleaf Mongo Cassandra Redis MySQL NDB SQL/NoSQL SQL Oracle DB2 MS-SQL Sybase ASE MySQL Monolithic Distributed Distributed Shared-disk Shared-nothing Distributedness
  • 14. Future: More Storage roles Application Hadoop Hadoop Hadoop Hadoop Job Job Job Job HDFS HDFS Mongo Citrusleaf
  • 15. Conclusion  You cannot just replace SQL with NoSQL  You loose some features when you go to NoSQL  You have to put extra effort to use NoSQL  Make sure that NoSQL is not as fat as SQL  NoSQL solves subset of/specific problems but well  NoSQL is lean and mean  NoSQL is designed to be highly available  NoSQL does not demand powerful hardware