SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Distributed RDBMS 
Data Distribution Policy: Part 3 
Changing your data distribution policy 
October 2014
Data Distribution Policy: Part 3 
Distributed RDBMSs provide many scalability, availability 
and performance advantages. 
This presentation takes a deeper look at distributed 
RDBMS efficiency over the long haul as application usage 
patterns, user requirements, and workloads change. 
The presentation discusses: 
• Three stages of your data distribution policy’s lifecycle. 
• Adapting the distributed RDBMS to match application changes. 
• Ensuring that your distributed relational database is flexible and 
2 
elastic enough to accommodate endless growth and change.
3 
Why is a Distributed Relational Database Good? 
Distributed relational databases are a perfect match for 
Cloud computing models and distributed Cloud 
infrastructure. 
They are the way forward for delivering web scale 
applications and keeping ACID properties. 
• Social apps 
• Games 
• Many concurrent users 
• High transaction throughput 
• Very large data volumes
What Is a Data Distribution Policy? – Recap 
A data distribution policy describes the rules under which 
data is distributed across a distributed RDBMS. 
(a virtual database made up of many database instances, or “shards”). 
A good data distribution policy aims to: 
1. Maintain full relational database integrity 
2. Distribute workloads in an even and predictable manner 
3. Minimize the amount of joins across the array of 
4 
database instances 
4. Align with workflow and application usage patterns 
5. Yield database scalability
5 
“Change, nothing stays the same…” 
...shouted 80’s rock band Van 
Halen proudly in their song 
“Unchained”. 
Just as music fashions change, 
we know databases must adapt 
to follow new usage patterns. 
Unexpected influxes of data or 
transactions are difficult to 
predict. You may have only 
anticipated and planed for a 
specific amount of growth and 
capacity. 
But what if you 
underestimate 
your success?
Imagine your application is 
taking off with great 
success. Sounds like good 
news, right? 
However, it might be hard 
on your database, as your 
business success can 
generate significantly more 
transactions, concurrent 
users and data that all 
needs to be 
accommodated. 
6 
Taking Additional Data into Consideration 
These types of situations 
are hard to predict and 
occur on a daily basis.
If you already have a distributed RDBMS, the original data 
distribution policy was (hopefully) created based on specific 
application usage patterns and workflows (see Part 2 of this 
presentation series). 
Over time, application workflows and application usage 
patterns can change. 
This can lead to database hotspots, bottlenecks, and 
database clusters that are overloaded compared to other 
clusters. 
7 
Adapting to Change
Data Distribution Policy Lifecycle 
8 
There are three main situations to accommodate during a 
data distribution policy’s lifecycle: 
1. Changing Demand and Traffic Loads 
2. Changing Application Usage 
3. New Product Capabilities 
The answer to these changes is typically the same: 
“Rebalance” the distributed database 
Distribution policy management through lifecycle changes is 
a key issue to test in any distributed RDBMS technology.
9 
Rebalancing the Distributed Database 
In the past, changing a data distribution policy has been 
hard to address. Manually changing sharding code within 
an application was the frontline battle zone of changing data 
distribution. 
Today, software like ScaleBase can accommodate all these 
changes easily for you, quickly and with minimal disruptions 
to live systems.
Scenario 1: Adapting to Changing Demand and 
Traffic Loads 
Data distribution policies should always be designed so 
that data that is frequently accessed together is aggregated 
into the same database instance (or shard) as this provides 
the greatest efficiency and scalability benefits. 
Data distributions are built according to anticipated traffic 
predictions (both reads and writes), but traffic loads 
change. 
10
Scenario 1: Adapting to Changing Demand and 
Traffic Loads – Typical Challenges 
1. Be aware of changes in workload patterns and understand 
11 
their impact on your distributed relational database. 
2. A specific application function’s sudden popularity, or 
changes in your business environment can lead to usage 
spikes and transaction bottlenecks from increased demand 
and unexpected transaction patterns. 
3. If workloads appear where the distribution policy was not 
optimized, new and unplanned operations may cause more 
costly execution paths that result in sub-par performance and 
scalability. 
* Automated threshold alerting and various other monitoring 
can help you stay ahead of peaks and bottlenecks, so look for 
these facilities in any distributed solution you choose.
Scenario 2: Adapting to Application Usage 
Changes 
Over time, it’s quite common for application usage to 
change. When this happens: 
1. The system’s new behavior patterns need to be understood 
12 
in order to make appropriate changes and optimizations. 
2. Adapting to change is typically where do-it-yourself 
home-grown sharding fails. 
3. Re-writing the custom application code that did the initial 
data distribution to provide new data distribution can lead to 
errors that are easy to make, hard to uncover, and hard to 
recover from. 
4. Identifying the distribution policy changes required to 
optimally re-balance workloads around new application 
usage patterns needs some of the analysis that we 
described in Part 2.
Scenario 3: Adapting to New Product 
Capabilities 
The final challenge comes is modifying an application that 
add new capabilities to your product or service. 
1. Updated business requirements can necessitate different 
13 
functions to integrate new solutions with existing systems, 
extending the current application and database to 
accommodate relevant new business needs. 
2. Old-fashioned do-it-yourself distribution policy hardcoding 
eliminates flexibility and often does not allow changes to be 
made, turning an implementation attempt into a very 
complicated and daunting task.
Scenario 3: Adapting to New Product 
Capabilities (Continued) 
You can’t stop your business while you’re rebalancing the 
distributed database with data placement changes. Rewriting 
application data redistribution code creates yet another 
challenge in implementing a change while keeping existing data 
and operations intact. 
As a result, many cases companies have opted to rebuild their 
system again from scratch instead of attempting to make 
modifications. 
If data distribution logic is built into the actual application, it can 
be very hard to make system modifications on the fly. This is 
costly and ultimately results in maintenance nightmares, 
performance degradation, and downtime. 
This is not good! 
14
What Can You Do? 
To simplify the management of the data distribution policy 
that underlies your distributed RDBMS you MUST make a 
strict separation between your application and where the 
database distribution policy is defined, managed, and 
maintained. 
• If you’re a startup building a new app, or if you have an 
15 
existing app that needs to scale for growth, you want to 
“hit the road, running!” (again, to quote Van Halen). 
ScaleBase software was created to handle changes like 
the ones previously mentioned, providing customers with 
the peace of mind they need to grow successfully, in any 
manner, at any rate and to any scale.
ScaleBase Software 
• ScaleBase is a distributed database built on MySQL and 
16 
optimized for the cloud. It deploys in minutes so your 
database can handle an unlimited number of users, 
humongous volumes of data, and faster transactions. 
• It dynamically optimizes workloads and availability by 
logically distributing data across public, private, and geo-distributed 
clouds.
Try ScaleBase Today 
ScaleBase software is available for free: 
• ScaleBase Website 
• Amazon Marketplace 
• Rackspace Marketplace 
• IBM Cloud marketplace 
• ScaleBase’s free online Analysis Genie service 
AWS Marketplace Guide and a AWS Getting Started 
Tutorial are available from the documentation section of the 
ScaleBase website. 
17 
Contact ScaleBase 
sales@scalebase.com
Data Distribution Policy: Part 1 and 2 
Data Distribution Policy Part 1: 
• What a data distribution policy is 
• The challenges faced when data is distributed via sharding 
• What defines a good data distribution policy 
• The best way to distribute data for your application and 
18 
workload 
Data Distribution Policy Part 2: 
• The different approaches to data distribution 
• How to create your own data distribution policy, whether you 
are scaling an existing application or creating a new app. 
• How ScaleBase can help you create your policy
Distributed RDBMS 
Data Distribution Policy: Part 3 
Changing your data distribution policy 
October 2014

Weitere ähnliche Inhalte

Was ist angesagt?

Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Jonathan Seidman
 
Big Data using NoSQL Technologies
Big Data using NoSQL TechnologiesBig Data using NoSQL Technologies
Big Data using NoSQL TechnologiesAmit Singh
 
5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data LakeMetroStar
 
Open Development
Open DevelopmentOpen Development
Open DevelopmentMedsphere
 
Introduction to Microsoft’s Master Data Services (MDS)
Introduction to Microsoft’s Master Data Services (MDS)Introduction to Microsoft’s Master Data Services (MDS)
Introduction to Microsoft’s Master Data Services (MDS)James Serra
 
Hybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsHybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsDavid Portnoy
 
Should I move my database to the cloud?
Should I move my database to the cloud?Should I move my database to the cloud?
Should I move my database to the cloud?James Serra
 
Designing For Occasionally Connected Apps Slideshare
Designing For Occasionally Connected Apps SlideshareDesigning For Occasionally Connected Apps Slideshare
Designing For Occasionally Connected Apps SlideshareDean Willson
 
Pervasive analytics through data & analytic centricity
Pervasive analytics through data & analytic centricityPervasive analytics through data & analytic centricity
Pervasive analytics through data & analytic centricityCloudera, Inc.
 
The Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameThe Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameCloudera, Inc.
 
BigData & CDN - OOP2011 (Pavlo Baron)
BigData & CDN - OOP2011 (Pavlo Baron)BigData & CDN - OOP2011 (Pavlo Baron)
BigData & CDN - OOP2011 (Pavlo Baron)Pavlo Baron
 
Hadoop Integration into Data Warehousing Architectures
Hadoop Integration into Data Warehousing ArchitecturesHadoop Integration into Data Warehousing Architectures
Hadoop Integration into Data Warehousing ArchitecturesHumza Naseer
 
Data Warehouse Optimization
Data Warehouse OptimizationData Warehouse Optimization
Data Warehouse OptimizationCloudera, Inc.
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationDenodo
 
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAmazon Web Services
 

Was ist angesagt? (20)

Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013
 
Big Data using NoSQL Technologies
Big Data using NoSQL TechnologiesBig Data using NoSQL Technologies
Big Data using NoSQL Technologies
 
5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake
 
Data Federation
Data FederationData Federation
Data Federation
 
Open Development
Open DevelopmentOpen Development
Open Development
 
Introduction to Microsoft’s Master Data Services (MDS)
Introduction to Microsoft’s Master Data Services (MDS)Introduction to Microsoft’s Master Data Services (MDS)
Introduction to Microsoft’s Master Data Services (MDS)
 
Hybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop ImplementationsHybrid Data Warehouse Hadoop Implementations
Hybrid Data Warehouse Hadoop Implementations
 
Should I move my database to the cloud?
Should I move my database to the cloud?Should I move my database to the cloud?
Should I move my database to the cloud?
 
Designing For Occasionally Connected Apps Slideshare
Designing For Occasionally Connected Apps SlideshareDesigning For Occasionally Connected Apps Slideshare
Designing For Occasionally Connected Apps Slideshare
 
Pervasive analytics through data & analytic centricity
Pervasive analytics through data & analytic centricityPervasive analytics through data & analytic centricity
Pervasive analytics through data & analytic centricity
 
The Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameThe Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the Same
 
BigData & CDN - OOP2011 (Pavlo Baron)
BigData & CDN - OOP2011 (Pavlo Baron)BigData & CDN - OOP2011 (Pavlo Baron)
BigData & CDN - OOP2011 (Pavlo Baron)
 
Disaster Recovery Site Implementation with MySQL
Disaster Recovery Site Implementation with MySQLDisaster Recovery Site Implementation with MySQL
Disaster Recovery Site Implementation with MySQL
 
Hadoop Integration into Data Warehousing Architectures
Hadoop Integration into Data Warehousing ArchitecturesHadoop Integration into Data Warehousing Architectures
Hadoop Integration into Data Warehousing Architectures
 
Data Warehouse Optimization
Data Warehouse OptimizationData Warehouse Optimization
Data Warehouse Optimization
 
Enabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data VirtualizationEnabling a Data Mesh Architecture with Data Virtualization
Enabling a Data Mesh Architecture with Data Virtualization
 
SQL Server Disaster Recovery Implementation
SQL Server Disaster Recovery ImplementationSQL Server Disaster Recovery Implementation
SQL Server Disaster Recovery Implementation
 
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWSAWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
AWS Cloud Kata 2013 | Singapore - Getting to Scale on AWS
 
Flexible Design
Flexible DesignFlexible Design
Flexible Design
 

Ähnlich wie Distributed RDBMS: Data Distribution Policy: Part 3 - Changing Your Data Distribution Policy

Challenges in Querying a Distributed Relational Database
Challenges in Querying a Distributed Relational DatabaseChallenges in Querying a Distributed Relational Database
Challenges in Querying a Distributed Relational DatabaseScaleBase
 
GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017Jeremy Maranitch
 
Nosql-Module 1 PPT.pptx
Nosql-Module 1 PPT.pptxNosql-Module 1 PPT.pptx
Nosql-Module 1 PPT.pptxRadhika R
 
Microsoft SQL Azure - Scaling Out with SQL Azure Whitepaper
Microsoft SQL Azure - Scaling Out with SQL Azure WhitepaperMicrosoft SQL Azure - Scaling Out with SQL Azure Whitepaper
Microsoft SQL Azure - Scaling Out with SQL Azure WhitepaperMicrosoft Private Cloud
 
Cloud application services (saa s) – multi tenant data architecture
Cloud application services (saa s) – multi tenant data architectureCloud application services (saa s) – multi tenant data architecture
Cloud application services (saa s) – multi tenant data architectureJohnny Le
 
How to Build a Scalable Web Application for Your Project
How to Build a Scalable Web Application for Your ProjectHow to Build a Scalable Web Application for Your Project
How to Build a Scalable Web Application for Your ProjectBitCot
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Precisely
 
Real Time Analytics
Real Time AnalyticsReal Time Analytics
Real Time AnalyticsMohsin Hakim
 
LEGO EMBRACING CHANGE BY COMBINING BI WITH FLEXIBLE INFORMATION SYSTEM
LEGO EMBRACING CHANGE BY COMBINING BI WITH FLEXIBLE INFORMATION SYSTEMLEGO EMBRACING CHANGE BY COMBINING BI WITH FLEXIBLE INFORMATION SYSTEM
LEGO EMBRACING CHANGE BY COMBINING BI WITH FLEXIBLE INFORMATION SYSTEMmyteratak
 
Real Time Analytics
Real Time AnalyticsReal Time Analytics
Real Time AnalyticsMohsin Hakim
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundationshktripathy
 
The technology of the business data lake
The technology of the business data lakeThe technology of the business data lake
The technology of the business data lakeCapgemini
 
Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10Harsha Gowda B R
 
Ieee-no sql distributed db and cloud architecture report
Ieee-no sql distributed db and cloud architecture reportIeee-no sql distributed db and cloud architecture report
Ieee-no sql distributed db and cloud architecture reportOutsource Portfolio
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 

Ähnlich wie Distributed RDBMS: Data Distribution Policy: Part 3 - Changing Your Data Distribution Policy (20)

Challenges in Querying a Distributed Relational Database
Challenges in Querying a Distributed Relational DatabaseChallenges in Querying a Distributed Relational Database
Challenges in Querying a Distributed Relational Database
 
GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017
 
Nosql-Module 1 PPT.pptx
Nosql-Module 1 PPT.pptxNosql-Module 1 PPT.pptx
Nosql-Module 1 PPT.pptx
 
Microsoft SQL Azure - Scaling Out with SQL Azure Whitepaper
Microsoft SQL Azure - Scaling Out with SQL Azure WhitepaperMicrosoft SQL Azure - Scaling Out with SQL Azure Whitepaper
Microsoft SQL Azure - Scaling Out with SQL Azure Whitepaper
 
ADBMS 19MCA8125.pdf
ADBMS 19MCA8125.pdfADBMS 19MCA8125.pdf
ADBMS 19MCA8125.pdf
 
M 94 4
M 94 4M 94 4
M 94 4
 
Cloud application services (saa s) – multi tenant data architecture
Cloud application services (saa s) – multi tenant data architectureCloud application services (saa s) – multi tenant data architecture
Cloud application services (saa s) – multi tenant data architecture
 
How to Build a Scalable Web Application for Your Project
How to Build a Scalable Web Application for Your ProjectHow to Build a Scalable Web Application for Your Project
How to Build a Scalable Web Application for Your Project
 
S18 das
S18 dasS18 das
S18 das
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?
 
Real Time Analytics
Real Time AnalyticsReal Time Analytics
Real Time Analytics
 
Database management system
Database management systemDatabase management system
Database management system
 
NoSQL and Couchbase
NoSQL and CouchbaseNoSQL and Couchbase
NoSQL and Couchbase
 
LEGO EMBRACING CHANGE BY COMBINING BI WITH FLEXIBLE INFORMATION SYSTEM
LEGO EMBRACING CHANGE BY COMBINING BI WITH FLEXIBLE INFORMATION SYSTEMLEGO EMBRACING CHANGE BY COMBINING BI WITH FLEXIBLE INFORMATION SYSTEM
LEGO EMBRACING CHANGE BY COMBINING BI WITH FLEXIBLE INFORMATION SYSTEM
 
Real Time Analytics
Real Time AnalyticsReal Time Analytics
Real Time Analytics
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundations
 
The technology of the business data lake
The technology of the business data lakeThe technology of the business data lake
The technology of the business data lake
 
Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10
 
Ieee-no sql distributed db and cloud architecture report
Ieee-no sql distributed db and cloud architecture reportIeee-no sql distributed db and cloud architecture report
Ieee-no sql distributed db and cloud architecture report
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 

Mehr von ScaleBase

Database Scalability - The Shard Conflict
Database Scalability - The Shard ConflictDatabase Scalability - The Shard Conflict
Database Scalability - The Shard ConflictScaleBase
 
ScaleBase Webinar: Scaling MySQL - Sharding Made Easy!
ScaleBase Webinar: Scaling MySQL - Sharding Made Easy!ScaleBase Webinar: Scaling MySQL - Sharding Made Easy!
ScaleBase Webinar: Scaling MySQL - Sharding Made Easy!ScaleBase
 
ScaleBase Webinar: Strategies for scaling MySQL
ScaleBase Webinar: Strategies for scaling MySQLScaleBase Webinar: Strategies for scaling MySQL
ScaleBase Webinar: Strategies for scaling MySQLScaleBase
 
Scaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaleBase
 
Scaling MySQL: Benefits of Automatic Data Distribution
Scaling MySQL: Benefits of Automatic Data DistributionScaling MySQL: Benefits of Automatic Data Distribution
Scaling MySQL: Benefits of Automatic Data DistributionScaleBase
 
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQL
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQLChoosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQL
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQLScaleBase
 
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL DatabaseScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL DatabaseScaleBase
 
ScaleBase Backs Mozilla's new app store
ScaleBase Backs Mozilla's new app storeScaleBase Backs Mozilla's new app store
ScaleBase Backs Mozilla's new app storeScaleBase
 
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOutScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOutScaleBase
 

Mehr von ScaleBase (9)

Database Scalability - The Shard Conflict
Database Scalability - The Shard ConflictDatabase Scalability - The Shard Conflict
Database Scalability - The Shard Conflict
 
ScaleBase Webinar: Scaling MySQL - Sharding Made Easy!
ScaleBase Webinar: Scaling MySQL - Sharding Made Easy!ScaleBase Webinar: Scaling MySQL - Sharding Made Easy!
ScaleBase Webinar: Scaling MySQL - Sharding Made Easy!
 
ScaleBase Webinar: Strategies for scaling MySQL
ScaleBase Webinar: Strategies for scaling MySQLScaleBase Webinar: Strategies for scaling MySQL
ScaleBase Webinar: Strategies for scaling MySQL
 
Scaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write SplittingScaling MySQL: Catch 22 of Read Write Splitting
Scaling MySQL: Catch 22 of Read Write Splitting
 
Scaling MySQL: Benefits of Automatic Data Distribution
Scaling MySQL: Benefits of Automatic Data DistributionScaling MySQL: Benefits of Automatic Data Distribution
Scaling MySQL: Benefits of Automatic Data Distribution
 
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQL
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQLChoosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQL
Choosing a Next Gen Database: the New World Order of NoSQL, NewSQL, and MySQL
 
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL DatabaseScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
 
ScaleBase Backs Mozilla's new app store
ScaleBase Backs Mozilla's new app storeScaleBase Backs Mozilla's new app store
ScaleBase Backs Mozilla's new app store
 
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOutScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
 

Kürzlich hochgeladen

Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZABSYZ Inc
 
Patterns for automating API delivery. API conference
Patterns for automating API delivery. API conferencePatterns for automating API delivery. API conference
Patterns for automating API delivery. API conferencessuser9e7c64
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slidesvaideheekore1
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Rob Geurden
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfDrew Moseley
 
Amazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilitiesAmazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilitiesKrzysztofKkol1
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...OnePlan Solutions
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
VK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web DevelopmentVK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web Developmentvyaparkranti
 
SoftTeco - Software Development Company Profile
SoftTeco - Software Development Company ProfileSoftTeco - Software Development Company Profile
SoftTeco - Software Development Company Profileakrivarotava
 
Ronisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited CatalogueRonisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited Catalogueitservices996
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLionel Briand
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identityteam-WIBU
 
Keeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository worldKeeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository worldRoberto Pérez Alcolea
 
VictoriaMetrics Anomaly Detection Updates: Q1 2024
VictoriaMetrics Anomaly Detection Updates: Q1 2024VictoriaMetrics Anomaly Detection Updates: Q1 2024
VictoriaMetrics Anomaly Detection Updates: Q1 2024VictoriaMetrics
 
Not a Kubernetes fan? The state of PaaS in 2024
Not a Kubernetes fan? The state of PaaS in 2024Not a Kubernetes fan? The state of PaaS in 2024
Not a Kubernetes fan? The state of PaaS in 2024Anthony Dahanne
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringHironori Washizaki
 

Kürzlich hochgeladen (20)

Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZ
 
Patterns for automating API delivery. API conference
Patterns for automating API delivery. API conferencePatterns for automating API delivery. API conference
Patterns for automating API delivery. API conference
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slides
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdf
 
Amazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilitiesAmazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilities
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
VK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web DevelopmentVK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web Development
 
SoftTeco - Software Development Company Profile
SoftTeco - Software Development Company ProfileSoftTeco - Software Development Company Profile
SoftTeco - Software Development Company Profile
 
Ronisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited CatalogueRonisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited Catalogue
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and Repair
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identity
 
Keeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository worldKeeping your build tool updated in a multi repository world
Keeping your build tool updated in a multi repository world
 
VictoriaMetrics Anomaly Detection Updates: Q1 2024
VictoriaMetrics Anomaly Detection Updates: Q1 2024VictoriaMetrics Anomaly Detection Updates: Q1 2024
VictoriaMetrics Anomaly Detection Updates: Q1 2024
 
Not a Kubernetes fan? The state of PaaS in 2024
Not a Kubernetes fan? The state of PaaS in 2024Not a Kubernetes fan? The state of PaaS in 2024
Not a Kubernetes fan? The state of PaaS in 2024
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their Engineering
 

Distributed RDBMS: Data Distribution Policy: Part 3 - Changing Your Data Distribution Policy

  • 1. Distributed RDBMS Data Distribution Policy: Part 3 Changing your data distribution policy October 2014
  • 2. Data Distribution Policy: Part 3 Distributed RDBMSs provide many scalability, availability and performance advantages. This presentation takes a deeper look at distributed RDBMS efficiency over the long haul as application usage patterns, user requirements, and workloads change. The presentation discusses: • Three stages of your data distribution policy’s lifecycle. • Adapting the distributed RDBMS to match application changes. • Ensuring that your distributed relational database is flexible and 2 elastic enough to accommodate endless growth and change.
  • 3. 3 Why is a Distributed Relational Database Good? Distributed relational databases are a perfect match for Cloud computing models and distributed Cloud infrastructure. They are the way forward for delivering web scale applications and keeping ACID properties. • Social apps • Games • Many concurrent users • High transaction throughput • Very large data volumes
  • 4. What Is a Data Distribution Policy? – Recap A data distribution policy describes the rules under which data is distributed across a distributed RDBMS. (a virtual database made up of many database instances, or “shards”). A good data distribution policy aims to: 1. Maintain full relational database integrity 2. Distribute workloads in an even and predictable manner 3. Minimize the amount of joins across the array of 4 database instances 4. Align with workflow and application usage patterns 5. Yield database scalability
  • 5. 5 “Change, nothing stays the same…” ...shouted 80’s rock band Van Halen proudly in their song “Unchained”. Just as music fashions change, we know databases must adapt to follow new usage patterns. Unexpected influxes of data or transactions are difficult to predict. You may have only anticipated and planed for a specific amount of growth and capacity. But what if you underestimate your success?
  • 6. Imagine your application is taking off with great success. Sounds like good news, right? However, it might be hard on your database, as your business success can generate significantly more transactions, concurrent users and data that all needs to be accommodated. 6 Taking Additional Data into Consideration These types of situations are hard to predict and occur on a daily basis.
  • 7. If you already have a distributed RDBMS, the original data distribution policy was (hopefully) created based on specific application usage patterns and workflows (see Part 2 of this presentation series). Over time, application workflows and application usage patterns can change. This can lead to database hotspots, bottlenecks, and database clusters that are overloaded compared to other clusters. 7 Adapting to Change
  • 8. Data Distribution Policy Lifecycle 8 There are three main situations to accommodate during a data distribution policy’s lifecycle: 1. Changing Demand and Traffic Loads 2. Changing Application Usage 3. New Product Capabilities The answer to these changes is typically the same: “Rebalance” the distributed database Distribution policy management through lifecycle changes is a key issue to test in any distributed RDBMS technology.
  • 9. 9 Rebalancing the Distributed Database In the past, changing a data distribution policy has been hard to address. Manually changing sharding code within an application was the frontline battle zone of changing data distribution. Today, software like ScaleBase can accommodate all these changes easily for you, quickly and with minimal disruptions to live systems.
  • 10. Scenario 1: Adapting to Changing Demand and Traffic Loads Data distribution policies should always be designed so that data that is frequently accessed together is aggregated into the same database instance (or shard) as this provides the greatest efficiency and scalability benefits. Data distributions are built according to anticipated traffic predictions (both reads and writes), but traffic loads change. 10
  • 11. Scenario 1: Adapting to Changing Demand and Traffic Loads – Typical Challenges 1. Be aware of changes in workload patterns and understand 11 their impact on your distributed relational database. 2. A specific application function’s sudden popularity, or changes in your business environment can lead to usage spikes and transaction bottlenecks from increased demand and unexpected transaction patterns. 3. If workloads appear where the distribution policy was not optimized, new and unplanned operations may cause more costly execution paths that result in sub-par performance and scalability. * Automated threshold alerting and various other monitoring can help you stay ahead of peaks and bottlenecks, so look for these facilities in any distributed solution you choose.
  • 12. Scenario 2: Adapting to Application Usage Changes Over time, it’s quite common for application usage to change. When this happens: 1. The system’s new behavior patterns need to be understood 12 in order to make appropriate changes and optimizations. 2. Adapting to change is typically where do-it-yourself home-grown sharding fails. 3. Re-writing the custom application code that did the initial data distribution to provide new data distribution can lead to errors that are easy to make, hard to uncover, and hard to recover from. 4. Identifying the distribution policy changes required to optimally re-balance workloads around new application usage patterns needs some of the analysis that we described in Part 2.
  • 13. Scenario 3: Adapting to New Product Capabilities The final challenge comes is modifying an application that add new capabilities to your product or service. 1. Updated business requirements can necessitate different 13 functions to integrate new solutions with existing systems, extending the current application and database to accommodate relevant new business needs. 2. Old-fashioned do-it-yourself distribution policy hardcoding eliminates flexibility and often does not allow changes to be made, turning an implementation attempt into a very complicated and daunting task.
  • 14. Scenario 3: Adapting to New Product Capabilities (Continued) You can’t stop your business while you’re rebalancing the distributed database with data placement changes. Rewriting application data redistribution code creates yet another challenge in implementing a change while keeping existing data and operations intact. As a result, many cases companies have opted to rebuild their system again from scratch instead of attempting to make modifications. If data distribution logic is built into the actual application, it can be very hard to make system modifications on the fly. This is costly and ultimately results in maintenance nightmares, performance degradation, and downtime. This is not good! 14
  • 15. What Can You Do? To simplify the management of the data distribution policy that underlies your distributed RDBMS you MUST make a strict separation between your application and where the database distribution policy is defined, managed, and maintained. • If you’re a startup building a new app, or if you have an 15 existing app that needs to scale for growth, you want to “hit the road, running!” (again, to quote Van Halen). ScaleBase software was created to handle changes like the ones previously mentioned, providing customers with the peace of mind they need to grow successfully, in any manner, at any rate and to any scale.
  • 16. ScaleBase Software • ScaleBase is a distributed database built on MySQL and 16 optimized for the cloud. It deploys in minutes so your database can handle an unlimited number of users, humongous volumes of data, and faster transactions. • It dynamically optimizes workloads and availability by logically distributing data across public, private, and geo-distributed clouds.
  • 17. Try ScaleBase Today ScaleBase software is available for free: • ScaleBase Website • Amazon Marketplace • Rackspace Marketplace • IBM Cloud marketplace • ScaleBase’s free online Analysis Genie service AWS Marketplace Guide and a AWS Getting Started Tutorial are available from the documentation section of the ScaleBase website. 17 Contact ScaleBase sales@scalebase.com
  • 18. Data Distribution Policy: Part 1 and 2 Data Distribution Policy Part 1: • What a data distribution policy is • The challenges faced when data is distributed via sharding • What defines a good data distribution policy • The best way to distribute data for your application and 18 workload Data Distribution Policy Part 2: • The different approaches to data distribution • How to create your own data distribution policy, whether you are scaling an existing application or creating a new app. • How ScaleBase can help you create your policy
  • 19. Distributed RDBMS Data Distribution Policy: Part 3 Changing your data distribution policy October 2014