SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Preparing Your Data For Cloud

                       Narinder Kumar
                    Inphina Technologies




                                    1
Agenda
   Relational DBMS's : Pros & Cons
   Non-Relational DBMS's : Pros & Cons
   Types of Non-Relational DBMS's
   Current Market State
   Applicability of Different Data-Bases in
    different environments



                                               2
Relational DBMS - Pros
   Data Integrity
   ACID Capabilities
   High Level Query Model
   Data Normalization
   Data Independence




                                    3
Relational DBMS - Cons
   Scaling Issues
       By Duplication (Master-Slave)
       By Sharding/Division (Not transparent)
   Fixed Schema
   Mostly disk-oriented (Performance)
   May fair poorly with large data



                                                 4
Non-Relational DBMS - Pros
   Scalability

   Replication / Availability

   Performance

   Deployment Flexibility

   Modelling Flexibility

   Faster Development (?)
                                     5
Non-Relational DBMS - Cons
   Lack of Transactional Support
   Data Integrity is Application's responsibility
   Data Duplication / Application Dependent
   Eventually Consistent (mostly)
   No Standardization
   New Technology



                                                     6
RDBMS's and Cloud




                    7
Cloud Capable RDBMS




       s




Almost every RDBMS can run in a IAAS Cloud Platform

                                                      8
Cloud Native RDBMS




                     9
Types of Non-Relational DBMS

   Key Value Stores

   Document Stores

   Column Stores

   Graph Stores


                                     10
Key Value Data-Bases
   Object is completely Opaque to DB
   Mostly GET, PUT & DELETE operations are
    supported
   There may be limits on size of Objects


         Inspired by Amazon Dynamo Paper



                                              11
Key Value DataBases & Cloud

                Project Voldemort




MemCachedDB       Tokyo Tyrant




                                    12
Document Data-Bases
   Object is not completely opaque to DB
   Every Object has it's own schema
     FirstName="Bob", Address="5 Oak St.", Hobby="sailing".
     FirstName="Jonathan", Children=("Michael,10", "Jennifer,8")

   Can perform queries based on Object's
    attributes
   Possible to describe relationships between
    Objects
   Joins and Transactions are not supported
   Good for XML or JSON objects
                                                                   13
Document Data-Bases & Cloud




                              14
Column-Store Data-Bases
   Richer than Document Stores
   Multi-Dimensional Map
       Tables
           Row
           Column
           Time-Stamp
   Supports Multiple Data Types
   Usually use an Underlying DFS
            Inspired by Google Big Table Paper
                                                 15
Column-Store Data-Bases & Cloud




                              16
Key Factors while Making a Choice
   Application Architecture Requirements
   Platform choices
   Non-Functional Requirements
       Consistency
       Availability
       Partition
       Security
       Data Redemption

                                            17
Different Requirements = Different Solutions




                                          18
Scenario 1
   Feature First
       Corporate Data
       Consistency Requirements
       Business Intelligence
       Legacy Application


    RDBMS on Amazon Cloud, RackSpace (IaaS) or
          Microsoft Azure/Amazon RDS (PaaS)
                                                 19
Scenario 2
   Consumer Facing Application
       Big Files (Images, BLOB's, Files)
       Geographically Distributed
       Mostly writes
       Not heavy requirement on Rich Queries


     Key-Value Data Stores (Amazon S3, Project
                 Voldemort, Redis)

                                                 20
Scenario 3
   Hundreds Of Government Documents with
    different schemas
       Need to serve on Web
       Data Mining


     Document Data-Stores (Amazon SimpleDB,
          Apache Couche DB, MongoDB)


                                              21
Scenario 4
   Scale First
       Huge Data-Set
       Analytical Requirements
       Consumer Facing
       High Availability over Consistency


Column Data-Stores (Google App Engine, Hbase,
                 Cassandra)

                                             22
Mix & Match of Earlier Scenarios
                     Polyglot Persistence
                         RDBMS for low-volume
                          and high value
                         Key-Value DB for large
                          files with little queries
                         Memcached DB for short-
                          lived Data
                         Column DB for Analytics




                                                      23
CAP Theorem




              24
Conclusions
   One Size does not Fit all

   Many choices

   No-SQL DB's providing Alternatives

   RDBMS serve useful purpose



                                         25
nkumar@inphina.com

      www.inphina.com
http://thoughts.inphina.com


                              26
References
   http://nosql-database.org/

   http://www.drdobbs.com/database/218900502

   http://perspectives.mvdirona.com/2009/11/03/OneSizeDoesNotFitAll.aspx

   http://blog.nahurst.com/visual-guide-to-nosql-systems

   http://blog.heroku.com/archives/2010/7/20/nosql/

   http://www.vineetgupta.com/2010/01/nosql-databases-part-1-landscape.html

   http://project-voldemort.com/

   http://code.google.com/p/redis/

   http://memcachedb.org/

   http://aws.amazon.com/simpledb/

   http://couchdb.apache.org/

   http://www.mongodb.org


    http://riak.basho.com/
                                                                               27

Weitere ähnliche Inhalte

Was ist angesagt?

TCO Comparison MongoDB & Oracle
TCO Comparison MongoDB & OracleTCO Comparison MongoDB & Oracle
TCO Comparison MongoDB & OracleEl Taller Web
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational DatabasesUdi Bauman
 
Comparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse PlatformsComparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse PlatformsDavid Portnoy
 
Big data vahidamiri-datastack.ir
Big data vahidamiri-datastack.irBig data vahidamiri-datastack.ir
Big data vahidamiri-datastack.irdatastack
 
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...Insight Technology, Inc.
 
In memory big data management and processing a survey
In memory big data management and processing a surveyIn memory big data management and processing a survey
In memory big data management and processing a surveyredpel dot com
 
NoSQL databases and managing big data
NoSQL databases and managing big dataNoSQL databases and managing big data
NoSQL databases and managing big dataSteven Francia
 
MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
 MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014 MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014Big Data Spain
 
An Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDBAn Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDBMongoDB
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sqlRam kumar
 
Database awareness
Database awarenessDatabase awareness
Database awarenesskloia
 
Emergent Distributed Data Storage
Emergent Distributed Data StorageEmergent Distributed Data Storage
Emergent Distributed Data Storagehybrid cloud
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLRamakant Soni
 
Building Big Data Applications
Building Big Data ApplicationsBuilding Big Data Applications
Building Big Data ApplicationsRichard McDougall
 
Webinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDBWebinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDBMongoDB
 
Introduction to Big Data Analytics on Apache Hadoop
Introduction to Big Data Analytics on Apache HadoopIntroduction to Big Data Analytics on Apache Hadoop
Introduction to Big Data Analytics on Apache HadoopAvkash Chauhan
 
Big Data/Hadoop Infrastructure Considerations
Big Data/Hadoop Infrastructure ConsiderationsBig Data/Hadoop Infrastructure Considerations
Big Data/Hadoop Infrastructure ConsiderationsRichard McDougall
 
The World of Structured Storage System
The World of Structured Storage SystemThe World of Structured Storage System
The World of Structured Storage SystemSchubert Zhang
 

Was ist angesagt? (20)

TCO Comparison MongoDB & Oracle
TCO Comparison MongoDB & OracleTCO Comparison MongoDB & Oracle
TCO Comparison MongoDB & Oracle
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational Databases
 
Comparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse PlatformsComparison of MPP Data Warehouse Platforms
Comparison of MPP Data Warehouse Platforms
 
MongoDB
MongoDBMongoDB
MongoDB
 
No sql
No sqlNo sql
No sql
 
Big data vahidamiri-datastack.ir
Big data vahidamiri-datastack.irBig data vahidamiri-datastack.ir
Big data vahidamiri-datastack.ir
 
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
 
In memory big data management and processing a survey
In memory big data management and processing a surveyIn memory big data management and processing a survey
In memory big data management and processing a survey
 
NoSQL databases and managing big data
NoSQL databases and managing big dataNoSQL databases and managing big data
NoSQL databases and managing big data
 
MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
 MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014 MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
 
An Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDBAn Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDB
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sql
 
Database awareness
Database awarenessDatabase awareness
Database awareness
 
Emergent Distributed Data Storage
Emergent Distributed Data StorageEmergent Distributed Data Storage
Emergent Distributed Data Storage
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
 
Building Big Data Applications
Building Big Data ApplicationsBuilding Big Data Applications
Building Big Data Applications
 
Webinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDBWebinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDB
 
Introduction to Big Data Analytics on Apache Hadoop
Introduction to Big Data Analytics on Apache HadoopIntroduction to Big Data Analytics on Apache Hadoop
Introduction to Big Data Analytics on Apache Hadoop
 
Big Data/Hadoop Infrastructure Considerations
Big Data/Hadoop Infrastructure ConsiderationsBig Data/Hadoop Infrastructure Considerations
Big Data/Hadoop Infrastructure Considerations
 
The World of Structured Storage System
The World of Structured Storage SystemThe World of Structured Storage System
The World of Structured Storage System
 

Andere mochten auch

Innovation manddag
Innovation manddagInnovation manddag
Innovation manddagSocialOnline
 
2. seminar hvorfor blogge
2. seminar hvorfor blogge2. seminar hvorfor blogge
2. seminar hvorfor bloggeSocialOnline
 
CERCONEWS June 2011
CERCONEWS  June 2011CERCONEWS  June 2011
CERCONEWS June 2011cercopan
 
Innovation manddag ufilm
Innovation manddag ufilmInnovation manddag ufilm
Innovation manddag ufilmSocialOnline
 
CERCONEWS January 2010
CERCONEWS  January 2010CERCONEWS  January 2010
CERCONEWS January 2010cercopan
 
Com scoredatapassport 2h10
Com scoredatapassport 2h10Com scoredatapassport 2h10
Com scoredatapassport 2h10SocialOnline
 
CERCONEWS May 2010
CERCONEWS  May 2010CERCONEWS  May 2010
CERCONEWS May 2010cercopan
 
Social marketing analyse
Social marketing analyseSocial marketing analyse
Social marketing analyseSocialOnline
 
Hubspot Charts Graphs April2010
Hubspot Charts Graphs April2010Hubspot Charts Graphs April2010
Hubspot Charts Graphs April2010SocialOnline
 
Social Marketing Analyse
Social Marketing AnalyseSocial Marketing Analyse
Social Marketing AnalyseSocialOnline
 

Andere mochten auch (14)

Innovation manddag
Innovation manddagInnovation manddag
Innovation manddag
 
2. seminar hvorfor blogge
2. seminar hvorfor blogge2. seminar hvorfor blogge
2. seminar hvorfor blogge
 
CERCONEWS June 2011
CERCONEWS  June 2011CERCONEWS  June 2011
CERCONEWS June 2011
 
Innovation manddag ufilm
Innovation manddag ufilmInnovation manddag ufilm
Innovation manddag ufilm
 
CERCONEWS January 2010
CERCONEWS  January 2010CERCONEWS  January 2010
CERCONEWS January 2010
 
Com scoredatapassport 2h10
Com scoredatapassport 2h10Com scoredatapassport 2h10
Com scoredatapassport 2h10
 
CERCONEWS May 2010
CERCONEWS  May 2010CERCONEWS  May 2010
CERCONEWS May 2010
 
Social marketing analyse
Social marketing analyseSocial marketing analyse
Social marketing analyse
 
Hubspot Charts Graphs April2010
Hubspot Charts Graphs April2010Hubspot Charts Graphs April2010
Hubspot Charts Graphs April2010
 
Social Marketing Analyse
Social Marketing AnalyseSocial Marketing Analyse
Social Marketing Analyse
 
Getting started with jClouds
Getting started with jCloudsGetting started with jClouds
Getting started with jClouds
 
Scala collections
Scala collectionsScala collections
Scala collections
 
Cloud slam2011 multi-tenancy
Cloud slam2011 multi-tenancyCloud slam2011 multi-tenancy
Cloud slam2011 multi-tenancy
 
Multi-tenancy in the cloud
Multi-tenancy in the cloudMulti-tenancy in the cloud
Multi-tenancy in the cloud
 

Ähnlich wie Preparing your data for the cloud

SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, EgyptSQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, EgyptChris Richardson
 
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...InfiniteGraph
 
Presentation on Databases in the Cloud
Presentation on Databases in the CloudPresentation on Databases in the Cloud
Presentation on Databases in the Cloudmoshfiq
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfajajkhan16
 
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...Qian Lin
 
NoSQL Options Compared
NoSQL Options ComparedNoSQL Options Compared
NoSQL Options ComparedSergey Bushik
 
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...Felix Gessert
 
Big Data Basic Concepts | Presented in 2014
Big Data Basic Concepts  | Presented in 2014Big Data Basic Concepts  | Presented in 2014
Big Data Basic Concepts | Presented in 2014Kenneth Igiri
 
عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟datastack
 
Information processing architectures
Information processing architecturesInformation processing architectures
Information processing architecturesRaji Gogulapati
 
DynamoDB Gluecon 2012
DynamoDB Gluecon 2012DynamoDB Gluecon 2012
DynamoDB Gluecon 2012Appirio
 
Gluecon 2012 - DynamoDB
Gluecon 2012 - DynamoDBGluecon 2012 - DynamoDB
Gluecon 2012 - DynamoDBJeff Douglas
 
NO SQL: What, Why, How
NO SQL: What, Why, HowNO SQL: What, Why, How
NO SQL: What, Why, HowIgor Moochnick
 
Minnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with CassandraMinnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with CassandraJeff Bollinger
 
No sqlpresentation
No sqlpresentationNo sqlpresentation
No sqlpresentationSalma Gouia
 
The elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloudThe elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloudKhazret Sapenov
 
NoSQL Architecture Overview
NoSQL Architecture OverviewNoSQL Architecture Overview
NoSQL Architecture OverviewChristopher Foot
 

Ähnlich wie Preparing your data for the cloud (20)

SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, EgyptSQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
 
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
 
Presentation on Databases in the Cloud
Presentation on Databases in the CloudPresentation on Databases in the Cloud
Presentation on Databases in the Cloud
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdf
 
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
 
NoSQL Options Compared
NoSQL Options ComparedNoSQL Options Compared
NoSQL Options Compared
 
Rdbms vs. no sql
Rdbms vs. no sqlRdbms vs. no sql
Rdbms vs. no sql
 
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
 
Big Data Basic Concepts | Presented in 2014
Big Data Basic Concepts  | Presented in 2014Big Data Basic Concepts  | Presented in 2014
Big Data Basic Concepts | Presented in 2014
 
عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟
 
Information processing architectures
Information processing architecturesInformation processing architectures
Information processing architectures
 
DynamoDB Gluecon 2012
DynamoDB Gluecon 2012DynamoDB Gluecon 2012
DynamoDB Gluecon 2012
 
Gluecon 2012 - DynamoDB
Gluecon 2012 - DynamoDBGluecon 2012 - DynamoDB
Gluecon 2012 - DynamoDB
 
NO SQL: What, Why, How
NO SQL: What, Why, HowNO SQL: What, Why, How
NO SQL: What, Why, How
 
Minnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with CassandraMinnebar 2013 - Scaling with Cassandra
Minnebar 2013 - Scaling with Cassandra
 
Report 2.0.docx
Report 2.0.docxReport 2.0.docx
Report 2.0.docx
 
No sqlpresentation
No sqlpresentationNo sqlpresentation
No sqlpresentation
 
The elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloudThe elephantintheroom bigdataanalyticsinthecloud
The elephantintheroom bigdataanalyticsinthecloud
 
NoSQL Architecture Overview
NoSQL Architecture OverviewNoSQL Architecture Overview
NoSQL Architecture Overview
 
Nosql seminar
Nosql seminarNosql seminar
Nosql seminar
 

Mehr von Inphina Technologies

Mehr von Inphina Technologies (8)

Scala test
Scala testScala test
Scala test
 
Easy ORMness with Objectify-Appengine
Easy ORMness with Objectify-AppengineEasy ORMness with Objectify-Appengine
Easy ORMness with Objectify-Appengine
 
Cloud Foundry Impressions
Cloud Foundry Impressions Cloud Foundry Impressions
Cloud Foundry Impressions
 
Google appenginemigrationcasestudy
Google appenginemigrationcasestudyGoogle appenginemigrationcasestudy
Google appenginemigrationcasestudy
 
Multi-Tenancy in the Cloud
Multi-Tenancy in the CloudMulti-Tenancy in the Cloud
Multi-Tenancy in the Cloud
 
Inphina at a glance
Inphina at a glanceInphina at a glance
Inphina at a glance
 
Inphina cloud
Inphina cloudInphina cloud
Inphina cloud
 
Testing your application on Google App Engine
Testing your application on Google App EngineTesting your application on Google App Engine
Testing your application on Google App Engine
 

Kürzlich hochgeladen

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Kürzlich hochgeladen (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

Preparing your data for the cloud

  • 1. Preparing Your Data For Cloud Narinder Kumar Inphina Technologies 1
  • 2. Agenda  Relational DBMS's : Pros & Cons  Non-Relational DBMS's : Pros & Cons  Types of Non-Relational DBMS's  Current Market State  Applicability of Different Data-Bases in different environments 2
  • 3. Relational DBMS - Pros  Data Integrity  ACID Capabilities  High Level Query Model  Data Normalization  Data Independence 3
  • 4. Relational DBMS - Cons  Scaling Issues  By Duplication (Master-Slave)  By Sharding/Division (Not transparent)  Fixed Schema  Mostly disk-oriented (Performance)  May fair poorly with large data 4
  • 5. Non-Relational DBMS - Pros  Scalability  Replication / Availability  Performance  Deployment Flexibility  Modelling Flexibility  Faster Development (?) 5
  • 6. Non-Relational DBMS - Cons  Lack of Transactional Support  Data Integrity is Application's responsibility  Data Duplication / Application Dependent  Eventually Consistent (mostly)  No Standardization  New Technology 6
  • 8. Cloud Capable RDBMS s Almost every RDBMS can run in a IAAS Cloud Platform 8
  • 10. Types of Non-Relational DBMS  Key Value Stores  Document Stores  Column Stores  Graph Stores 10
  • 11. Key Value Data-Bases  Object is completely Opaque to DB  Mostly GET, PUT & DELETE operations are supported  There may be limits on size of Objects Inspired by Amazon Dynamo Paper 11
  • 12. Key Value DataBases & Cloud Project Voldemort MemCachedDB Tokyo Tyrant 12
  • 13. Document Data-Bases  Object is not completely opaque to DB  Every Object has it's own schema FirstName="Bob", Address="5 Oak St.", Hobby="sailing". FirstName="Jonathan", Children=("Michael,10", "Jennifer,8")  Can perform queries based on Object's attributes  Possible to describe relationships between Objects  Joins and Transactions are not supported  Good for XML or JSON objects 13
  • 15. Column-Store Data-Bases  Richer than Document Stores  Multi-Dimensional Map  Tables  Row  Column  Time-Stamp  Supports Multiple Data Types  Usually use an Underlying DFS Inspired by Google Big Table Paper 15
  • 17. Key Factors while Making a Choice  Application Architecture Requirements  Platform choices  Non-Functional Requirements  Consistency  Availability  Partition  Security  Data Redemption 17
  • 18. Different Requirements = Different Solutions 18
  • 19. Scenario 1  Feature First  Corporate Data  Consistency Requirements  Business Intelligence  Legacy Application RDBMS on Amazon Cloud, RackSpace (IaaS) or Microsoft Azure/Amazon RDS (PaaS) 19
  • 20. Scenario 2  Consumer Facing Application  Big Files (Images, BLOB's, Files)  Geographically Distributed  Mostly writes  Not heavy requirement on Rich Queries Key-Value Data Stores (Amazon S3, Project Voldemort, Redis) 20
  • 21. Scenario 3  Hundreds Of Government Documents with different schemas  Need to serve on Web  Data Mining Document Data-Stores (Amazon SimpleDB, Apache Couche DB, MongoDB) 21
  • 22. Scenario 4  Scale First  Huge Data-Set  Analytical Requirements  Consumer Facing  High Availability over Consistency Column Data-Stores (Google App Engine, Hbase, Cassandra) 22
  • 23. Mix & Match of Earlier Scenarios  Polyglot Persistence  RDBMS for low-volume and high value  Key-Value DB for large files with little queries  Memcached DB for short- lived Data  Column DB for Analytics 23
  • 25. Conclusions  One Size does not Fit all  Many choices  No-SQL DB's providing Alternatives  RDBMS serve useful purpose 25
  • 26. nkumar@inphina.com www.inphina.com http://thoughts.inphina.com 26
  • 27. References  http://nosql-database.org/  http://www.drdobbs.com/database/218900502  http://perspectives.mvdirona.com/2009/11/03/OneSizeDoesNotFitAll.aspx  http://blog.nahurst.com/visual-guide-to-nosql-systems  http://blog.heroku.com/archives/2010/7/20/nosql/  http://www.vineetgupta.com/2010/01/nosql-databases-part-1-landscape.html  http://project-voldemort.com/  http://code.google.com/p/redis/  http://memcachedb.org/  http://aws.amazon.com/simpledb/  http://couchdb.apache.org/  http://www.mongodb.org  http://riak.basho.com/ 27