SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Preparing Your Data For Cloud

                       Narinder Kumar
                    Inphina Technologies




                                    1
Agenda
   Relational DBMS's : Pros & Cons
   Non-Relational DBMS's : Pros & Cons
   Types of Non-Relational DBMS's
   Current Market State
   Applicability of Different Data-Bases in
    different environments



                                               2
Relational DBMS - Pros
   Data Integrity
   ACID Capabilities
   High Level Query Model
   Data Normalization
   Data Independence




                                    3
Relational DBMS - Cons
   Scaling Issues
       By Duplication (Master-Slave)
       By Sharding/Division (Not transparent)
   Fixed Schema
   Mostly disk-oriented (Performance)
   May fair poorly with large data



                                                 4
Non-Relational DBMS - Pros
   Scalability

   Replication / Availability

   Performance

   Deployment Flexibility

   Modelling Flexibility

   Faster Development (?)
                                     5
Non-Relational DBMS - Cons
   Lack of Transactional Support
   Data Integrity is Application's responsibility
   Data Duplication / Application Dependent
   Eventually Consistent (mostly)
   No Standardization
   New Technology



                                                     6
RDBMS's and Cloud




                    7
Cloud Capable RDBMS




       s




Almost every RDBMS can run in a IAAS Cloud Platform

                                                      8
Cloud Native RDBMS




                     9
Types of Non-Relational DBMS

   Key Value Stores

   Document Stores

   Column Stores

   Graph Stores


                                     10
Key Value Data-Bases
   Object is completely Opaque to DB
   Mostly GET, PUT & DELETE operations are
    supported
   There may be limits on size of Objects


         Inspired by Amazon Dynamo Paper



                                              11
Key Value DataBases & Cloud

                Project Voldemort




MemCachedDB       Tokyo Tyrant




                                    12
Document Data-Bases
   Object is not completely opaque to DB
   Every Object has it's own schema
     FirstName="Bob", Address="5 Oak St.", Hobby="sailing".
     FirstName="Jonathan", Children=("Michael,10", "Jennifer,8")

   Can perform queries based on Object's
    attributes
   Possible to describe relationships between
    Objects
   Joins and Transactions are not supported
   Good for XML or JSON objects
                                                                   13
Document Data-Bases & Cloud




                              14
Column-Store Data-Bases
   Richer than Document Stores
   Multi-Dimensional Map
       Tables
           Row
           Column
           Time-Stamp
   Supports Multiple Data Types
   Usually use an Underlying DFS
            Inspired by Google Big Table Paper
                                                 15
Column-Store Data-Bases & Cloud




                              16
Key Factors while Making a Choice
   Application Architecture Requirements
   Platform choices
   Non-Functional Requirements
       Consistency
       Availability
       Partition
       Security
       Data Redemption

                                            17
Different Requirements = Different Solutions




                                          18
Scenario 1
   Feature First
       Corporate Data
       Consistency Requirements
       Business Intelligence
       Legacy Application


    RDBMS on Amazon Cloud, RackSpace (IaaS) or
          Microsoft Azure/Amazon RDS (PaaS)
                                                 19
Scenario 2
   Consumer Facing Application
       Big Files (Images, BLOB's, Files)
       Geographically Distributed
       Mostly writes
       Not heavy requirement on Rich Queries


     Key-Value Data Stores (Amazon S3, Project
                 Voldemort, Redis)

                                                 20
Scenario 3
   Hundreds Of Government Documents with
    different schemas
       Need to serve on Web
       Data Mining


     Document Data-Stores (Amazon SimpleDB,
          Apache Couche DB, MongoDB)


                                              21
Scenario 4
   Scale First
       Huge Data-Set
       Analytical Requirements
       Consumer Facing
       High Availability over Consistency


Column Data-Stores (Google App Engine, Hbase,
                 Cassandra)

                                             22
Mix & Match of Earlier Scenarios
                     Polyglot Persistence
                         RDBMS for low-volume
                          and high value
                         Key-Value DB for large
                          files with little queries
                         Memcached DB for short-
                          lived Data
                         Column DB for Analytics




                                                      23
CAP Theorem




              24
Conclusions
   One Size does not Fit all

   Many choices

   No-SQL DB's providing Alternatives

   RDBMS serve useful purpose



                                         25
nkumar@inphina.com

      www.inphina.com
http://thoughts.inphina.com


                              26
References
   http://nosql-database.org/

   http://www.drdobbs.com/database/218900502

   http://perspectives.mvdirona.com/2009/11/03/OneSizeDoesNotFitAll.aspx

   http://blog.nahurst.com/visual-guide-to-nosql-systems

   http://blog.heroku.com/archives/2010/7/20/nosql/

   http://www.vineetgupta.com/2010/01/nosql-databases-part-1-landscape.html

   http://project-voldemort.com/

   http://code.google.com/p/redis/

   http://memcachedb.org/

   http://aws.amazon.com/simpledb/

   http://couchdb.apache.org/

   http://www.mongodb.org


    http://riak.basho.com/
                                                                               27

Weitere ähnliche Inhalte

Was ist angesagt?

The World of Structured Storage System
The World of Structured Storage SystemThe World of Structured Storage System
The World of Structured Storage System
Schubert Zhang
 

Was ist angesagt? (20)

TCO Comparison MongoDB & Oracle
TCO Comparison MongoDB & OracleTCO Comparison MongoDB & Oracle
TCO Comparison MongoDB & Oracle
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational Databases
 
Comparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse PlatformsComparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse Platforms
 
MongoDB
MongoDBMongoDB
MongoDB
 
No sql
No sqlNo sql
No sql
 
Big data vahidamiri-datastack.ir
Big data vahidamiri-datastack.irBig data vahidamiri-datastack.ir
Big data vahidamiri-datastack.ir
 
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
 
In memory big data management and processing a survey
In memory big data management and processing a surveyIn memory big data management and processing a survey
In memory big data management and processing a survey
 
NoSQL databases and managing big data
NoSQL databases and managing big dataNoSQL databases and managing big data
NoSQL databases and managing big data
 
MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
 MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014 MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
 
An Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDBAn Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDB
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sql
 
Database awareness
Database awarenessDatabase awareness
Database awareness
 
Emergent Distributed Data Storage
Emergent Distributed Data StorageEmergent Distributed Data Storage
Emergent Distributed Data Storage
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
 
Building Big Data Applications
Building Big Data ApplicationsBuilding Big Data Applications
Building Big Data Applications
 
Webinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDBWebinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDB
 
Introduction to Big Data Analytics on Apache Hadoop
Introduction to Big Data Analytics on Apache HadoopIntroduction to Big Data Analytics on Apache Hadoop
Introduction to Big Data Analytics on Apache Hadoop
 
Big Data/Hadoop Infrastructure Considerations
Big Data/Hadoop Infrastructure ConsiderationsBig Data/Hadoop Infrastructure Considerations
Big Data/Hadoop Infrastructure Considerations
 
The World of Structured Storage System
The World of Structured Storage SystemThe World of Structured Storage System
The World of Structured Storage System
 

Andere mochten auch

Innovation manddag
Innovation manddagInnovation manddag
Innovation manddag
SocialOnline
 
2. seminar hvorfor blogge
2. seminar hvorfor blogge2. seminar hvorfor blogge
2. seminar hvorfor blogge
SocialOnline
 
CERCONEWS June 2011
CERCONEWS  June 2011CERCONEWS  June 2011
CERCONEWS June 2011
cercopan
 
Innovation manddag ufilm
Innovation manddag ufilmInnovation manddag ufilm
Innovation manddag ufilm
SocialOnline
 
Social marketing analyse
Social marketing analyseSocial marketing analyse
Social marketing analyse
SocialOnline
 
Hubspot Charts Graphs April2010
Hubspot Charts Graphs April2010Hubspot Charts Graphs April2010
Hubspot Charts Graphs April2010
SocialOnline
 
Social Marketing Analyse
Social Marketing AnalyseSocial Marketing Analyse
Social Marketing Analyse
SocialOnline
 

Andere mochten auch (14)

Innovation manddag
Innovation manddagInnovation manddag
Innovation manddag
 
2. seminar hvorfor blogge
2. seminar hvorfor blogge2. seminar hvorfor blogge
2. seminar hvorfor blogge
 
CERCONEWS June 2011
CERCONEWS  June 2011CERCONEWS  June 2011
CERCONEWS June 2011
 
Innovation manddag ufilm
Innovation manddag ufilmInnovation manddag ufilm
Innovation manddag ufilm
 
CERCONEWS January 2010
CERCONEWS  January 2010CERCONEWS  January 2010
CERCONEWS January 2010
 
Com scoredatapassport 2h10
Com scoredatapassport 2h10Com scoredatapassport 2h10
Com scoredatapassport 2h10
 
CERCONEWS May 2010
CERCONEWS  May 2010CERCONEWS  May 2010
CERCONEWS May 2010
 
Social marketing analyse
Social marketing analyseSocial marketing analyse
Social marketing analyse
 
Hubspot Charts Graphs April2010
Hubspot Charts Graphs April2010Hubspot Charts Graphs April2010
Hubspot Charts Graphs April2010
 
Social Marketing Analyse
Social Marketing AnalyseSocial Marketing Analyse
Social Marketing Analyse
 
Getting started with jClouds
Getting started with jCloudsGetting started with jClouds
Getting started with jClouds
 
Scala collections
Scala collectionsScala collections
Scala collections
 
Cloud slam2011 multi-tenancy
Cloud slam2011 multi-tenancyCloud slam2011 multi-tenancy
Cloud slam2011 multi-tenancy
 
Multi-tenancy in the cloud
Multi-tenancy in the cloudMulti-tenancy in the cloud
Multi-tenancy in the cloud
 

Ähnlich wie Preparing your data for the cloud

A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
Qian Lin
 
NoSQL Options Compared
NoSQL Options ComparedNoSQL Options Compared
NoSQL Options Compared
Sergey Bushik
 
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
Felix Gessert
 
DynamoDB Gluecon 2012
DynamoDB Gluecon 2012DynamoDB Gluecon 2012
DynamoDB Gluecon 2012
Appirio
 
NO SQL: What, Why, How
NO SQL: What, Why, HowNO SQL: What, Why, How
NO SQL: What, Why, How
Igor Moochnick
 
The elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloudThe elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloud
Khazret Sapenov
 

Ähnlich wie Preparing your data for the cloud (20)

SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, EgyptSQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
 
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
 
Presentation on Databases in the Cloud
Presentation on Databases in the CloudPresentation on Databases in the Cloud
Presentation on Databases in the Cloud
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdf
 
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
 
NoSQL Options Compared
NoSQL Options ComparedNoSQL Options Compared
NoSQL Options Compared
 
Rdbms vs. no sql
Rdbms vs. no sqlRdbms vs. no sql
Rdbms vs. no sql
 
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
 
Big Data Basic Concepts | Presented in 2014
Big Data Basic Concepts  | Presented in 2014Big Data Basic Concepts  | Presented in 2014
Big Data Basic Concepts | Presented in 2014
 
عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟
 
Information processing architectures
Information processing architecturesInformation processing architectures
Information processing architectures
 
DynamoDB Gluecon 2012
DynamoDB Gluecon 2012DynamoDB Gluecon 2012
DynamoDB Gluecon 2012
 
Gluecon 2012 - DynamoDB
Gluecon 2012 - DynamoDBGluecon 2012 - DynamoDB
Gluecon 2012 - DynamoDB
 
Nosql Presentation.pdf for DBMS understanding
Nosql Presentation.pdf for DBMS understandingNosql Presentation.pdf for DBMS understanding
Nosql Presentation.pdf for DBMS understanding
 
NO SQL: What, Why, How
NO SQL: What, Why, HowNO SQL: What, Why, How
NO SQL: What, Why, How
 
Minnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with CassandraMinnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with Cassandra
 
Report 2.0.docx
Report 2.0.docxReport 2.0.docx
Report 2.0.docx
 
No sqlpresentation
No sqlpresentationNo sqlpresentation
No sqlpresentation
 
The elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloudThe elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloud
 
NoSQL Architecture Overview
NoSQL Architecture OverviewNoSQL Architecture Overview
NoSQL Architecture Overview
 

Mehr von Inphina Technologies

Google appenginemigrationcasestudy
Google appenginemigrationcasestudyGoogle appenginemigrationcasestudy
Google appenginemigrationcasestudy
Inphina Technologies
 

Mehr von Inphina Technologies (8)

Scala test
Scala testScala test
Scala test
 
Easy ORMness with Objectify-Appengine
Easy ORMness with Objectify-AppengineEasy ORMness with Objectify-Appengine
Easy ORMness with Objectify-Appengine
 
Cloud Foundry Impressions
Cloud Foundry Impressions Cloud Foundry Impressions
Cloud Foundry Impressions
 
Google appenginemigrationcasestudy
Google appenginemigrationcasestudyGoogle appenginemigrationcasestudy
Google appenginemigrationcasestudy
 
Multi-Tenancy in the Cloud
Multi-Tenancy in the CloudMulti-Tenancy in the Cloud
Multi-Tenancy in the Cloud
 
Inphina at a glance
Inphina at a glanceInphina at a glance
Inphina at a glance
 
Inphina cloud
Inphina cloudInphina cloud
Inphina cloud
 
Testing your application on Google App Engine
Testing your application on Google App EngineTesting your application on Google App Engine
Testing your application on Google App Engine
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 

Preparing your data for the cloud

  • 1. Preparing Your Data For Cloud Narinder Kumar Inphina Technologies 1
  • 2. Agenda  Relational DBMS's : Pros & Cons  Non-Relational DBMS's : Pros & Cons  Types of Non-Relational DBMS's  Current Market State  Applicability of Different Data-Bases in different environments 2
  • 3. Relational DBMS - Pros  Data Integrity  ACID Capabilities  High Level Query Model  Data Normalization  Data Independence 3
  • 4. Relational DBMS - Cons  Scaling Issues  By Duplication (Master-Slave)  By Sharding/Division (Not transparent)  Fixed Schema  Mostly disk-oriented (Performance)  May fair poorly with large data 4
  • 5. Non-Relational DBMS - Pros  Scalability  Replication / Availability  Performance  Deployment Flexibility  Modelling Flexibility  Faster Development (?) 5
  • 6. Non-Relational DBMS - Cons  Lack of Transactional Support  Data Integrity is Application's responsibility  Data Duplication / Application Dependent  Eventually Consistent (mostly)  No Standardization  New Technology 6
  • 8. Cloud Capable RDBMS s Almost every RDBMS can run in a IAAS Cloud Platform 8
  • 10. Types of Non-Relational DBMS  Key Value Stores  Document Stores  Column Stores  Graph Stores 10
  • 11. Key Value Data-Bases  Object is completely Opaque to DB  Mostly GET, PUT & DELETE operations are supported  There may be limits on size of Objects Inspired by Amazon Dynamo Paper 11
  • 12. Key Value DataBases & Cloud Project Voldemort MemCachedDB Tokyo Tyrant 12
  • 13. Document Data-Bases  Object is not completely opaque to DB  Every Object has it's own schema FirstName="Bob", Address="5 Oak St.", Hobby="sailing". FirstName="Jonathan", Children=("Michael,10", "Jennifer,8")  Can perform queries based on Object's attributes  Possible to describe relationships between Objects  Joins and Transactions are not supported  Good for XML or JSON objects 13
  • 15. Column-Store Data-Bases  Richer than Document Stores  Multi-Dimensional Map  Tables  Row  Column  Time-Stamp  Supports Multiple Data Types  Usually use an Underlying DFS Inspired by Google Big Table Paper 15
  • 17. Key Factors while Making a Choice  Application Architecture Requirements  Platform choices  Non-Functional Requirements  Consistency  Availability  Partition  Security  Data Redemption 17
  • 18. Different Requirements = Different Solutions 18
  • 19. Scenario 1  Feature First  Corporate Data  Consistency Requirements  Business Intelligence  Legacy Application RDBMS on Amazon Cloud, RackSpace (IaaS) or Microsoft Azure/Amazon RDS (PaaS) 19
  • 20. Scenario 2  Consumer Facing Application  Big Files (Images, BLOB's, Files)  Geographically Distributed  Mostly writes  Not heavy requirement on Rich Queries Key-Value Data Stores (Amazon S3, Project Voldemort, Redis) 20
  • 21. Scenario 3  Hundreds Of Government Documents with different schemas  Need to serve on Web  Data Mining Document Data-Stores (Amazon SimpleDB, Apache Couche DB, MongoDB) 21
  • 22. Scenario 4  Scale First  Huge Data-Set  Analytical Requirements  Consumer Facing  High Availability over Consistency Column Data-Stores (Google App Engine, Hbase, Cassandra) 22
  • 23. Mix & Match of Earlier Scenarios  Polyglot Persistence  RDBMS for low-volume and high value  Key-Value DB for large files with little queries  Memcached DB for short- lived Data  Column DB for Analytics 23
  • 25. Conclusions  One Size does not Fit all  Many choices  No-SQL DB's providing Alternatives  RDBMS serve useful purpose 25
  • 26. nkumar@inphina.com www.inphina.com http://thoughts.inphina.com 26
  • 27. References  http://nosql-database.org/  http://www.drdobbs.com/database/218900502  http://perspectives.mvdirona.com/2009/11/03/OneSizeDoesNotFitAll.aspx  http://blog.nahurst.com/visual-guide-to-nosql-systems  http://blog.heroku.com/archives/2010/7/20/nosql/  http://www.vineetgupta.com/2010/01/nosql-databases-part-1-landscape.html  http://project-voldemort.com/  http://code.google.com/p/redis/  http://memcachedb.org/  http://aws.amazon.com/simpledb/  http://couchdb.apache.org/  http://www.mongodb.org  http://riak.basho.com/ 27