Writing Space and the Cassandra NoSQL DBMS
Brian King
(with thanks to Michael Aillon)
Writing Space
Why A Writing Space?
“Writing is one of the most effective tools available to develop a student's critical thinking.”
The Business Needs
•  Efficient Administration Of Writing Assignments
•  Scalable Classrooms (500+)
•  Workflow Optimization / Automation
•  Integrated Access to Assessment Tools
o  Grammar Checking
o  Auto-Scoring
o  Plagiarism Detection (Source Check)
•  Grading Rubrics
•  Online Editing and Document Upload
•  Peer Review
•  Group Projects
The Technical Goals
•  Highly "Internet" Scalable
•  Global Presence
•  Continuous Availability (Fault Tolerance)
•  Broad OS And Browser Support
•  Mobile Device Support - "Mobile First"
•  Low Cost (Systems, Maintenance, Integration)
•  Write Once, Integrate “Anywhere”
•  Gain Experience With Modern NoSQL Technologies
•  REST Service-Based Architecture
•  Model UI
Writing Space - Instructor
Writing Space - Student
Cassandra
What We Like
•  Highly Scalable
•  Easy Multi-Data Center Support
•  Performance
•  Distributed Ring Configuration (Master-less)
•  Dynamic Schema, “Schema-less”
•  Slice Queries
What Challenged Us
•  Eventual / Tunable Consistency
•  Key-Name-Value Data Store (Column Based)
•  Data Modeling Based On Core Queries
•  All Rows in a Column Family (CF) Typically Don't Live On 1 Server
•  However, All Columns For a Row Do
•  RDBMS Mindset
•  No Ad Hoc Queries
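The storage layout in the bullets above can be pictured with a small sketch. It is a toy model in plain Python, not Cassandra's implementation: the four-node count, the hash-based placement, the class and method names (`ToyColumnFamily`, `node_for`, `slice`), and the sample row key and column names are all assumptions for illustration. It shows that the row key decides which node holds a row, that all columns of a row stay together, and that a slice query returns a contiguous range of sorted column names within one row.

```python
import bisect
import hashlib


class ToyColumnFamily:
    """Toy key -> (column name -> value) store spread over several nodes."""

    def __init__(self, node_count=4):
        # One dict per node; each maps row key -> {column name: value}.
        self.nodes = [{} for _ in range(node_count)]

    def node_for(self, row_key):
        # Hypothetical placement rule: hash the row key and pick a node.
        digest = hashlib.md5(row_key.encode("utf-8")).hexdigest()
        return int(digest, 16) % len(self.nodes)

    def insert(self, row_key, column_name, value):
        # Every column of a row lands on the node chosen by the row key.
        row = self.nodes[self.node_for(row_key)].setdefault(row_key, {})
        row[column_name] = value

    def slice(self, row_key, start, finish):
        """Return columns of one row with start <= name <= finish, in name order."""
        row = self.nodes[self.node_for(row_key)].get(row_key, {})
        names = sorted(row)
        lo = bisect.bisect_left(names, start)
        hi = bisect.bisect_right(names, finish)
        return [(name, row[name]) for name in names[lo:hi]]


cf = ToyColumnFamily()
# Hypothetical row: one key, several named columns.
for version in ("v001", "v002", "v003"):
    cf.insert("doc:42", version, f"text of {version}")
cf.insert("doc:42", "title", "My Essay")

# Slice query: only the columns whose names fall in the requested range.
versions = cf.slice("doc:42", "v001", "v999")
```

Because lookups always start from a row key, this kind of model answers the queries it was designed for and nothing else, which is one way to read the "No Ad Hoc Queries" point.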
What Is Consistency?
•  Write Consistency: Number Of Replicas Written To
•  Read Consistency: Number Of Replicas Queried
•  Replication Factor: Number Of Replicas For A Row
•  Quorum Consistency Level (Read And Write):
o  Option In Specifying Read/Write Consistency
o  Quorum = (Replication_Factor / 2) + 1, Rounded Down To A Whole Number
o  Ensures Strong Consistency
o  While Maintaining High Availability
•  With 4 Servers, Writing Space uses:
o  Replication Factor = 3
o  Read and Write Quorum Consistency
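The quorum arithmetic on this slide can be worked through in a few lines. This is a minimal sketch, assuming integer division in the quorum formula and the usual rule that reads see the latest write when read replicas plus write replicas exceed the replication factor; the function names are illustrative only.

```python
def quorum(replication_factor):
    """Replicas that must respond at QUORUM: (RF / 2) + 1 with integer division."""
    return replication_factor // 2 + 1


def is_strongly_consistent(read_replicas, write_replicas, replication_factor):
    # A read overlaps the latest write on at least one replica when R + W > RF.
    return read_replicas + write_replicas > replication_factor


rf = 3                       # replication factor used by Writing Space
q = quorum(rf)               # replicas needed per read and per write
tolerated_down = rf - q      # replicas of a row that can be down at QUORUM
strong = is_strongly_consistent(q, q, rf)
```

With a replication factor of 3, quorum is 2, so quorum reads and writes still succeed with one replica of a row unavailable, and 2 + 2 > 3 gives the overlap behind the "Strong Consistency" bullet.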
What's Not In Cassandra...
Typical RDBMS Features Not Available (Yet):
•  Referential Integrity Constraints / Foreign Keys
•  Commit / Rollback
•  Stored Procedures
•  Joins
•  Views
•  Triggers
•  Functions
•  Security Privileges
•  Rules
•  Partitioned Table Definitions
Cassandra In Writing Space
Document Versioning...
How We Modeled Our Data...
Storage Strategy: Document-oriented
(Diagram: 1:M And 1:1 Relationships)
The Writing Space DB Infrastructure
The Hardware
•  Many Inexpensive Servers (Actually 4 + 1)
•  Our Configuration:
o  Processor: Xeon E5630, 2.53 GHz, 4 Cores
o  Memory: 96 GB
o  Storage: Two Mirrored Spinning Disks For OS / Binaries
o  Storage: Three Striped 480 GB Solid State Drives (Providing 1.3 TB Local DB Storage)
•  Peer to Peer Ring
•  Hot Swappable - Fault Tolerant
•  "What's Your Insurance Company?"
Why DataStax Cassandra?
•  A Certified, Production Ready Version Of Cassandra
•  24/7 World Class Support
•  Integration With Hadoop
•  Integration With Solr
•  OpsCenter (Multi-Data Center Management Tool)
Performance
•  Doc Store and UI
•  Load: 3x Anticipated Load
•  Total Time Of Run: 1.75 hours
•  Max Document Size: 10k (25k, 50k and 75k DS)
Results
Average Response Time: < 300ms
Maximum Running Vusers: 684
Total Throughput (bytes): 7,176,727,121
Average Throughput (bytes/sec): 1,993,535
Total Hits: 342,833
Average Hits per Second: 95
DB Server CPU < 0.3%
Performance
•  Document Store only
•  Load: 100x Anticipated Load
•  Total Time Of Run: 1 hour
•  Document Size: 25k, 50k and 75k
Results
Average Response Time: < 100ms
Maximum Running Vusers: 2,200
Total Throughput (bytes): 2,291,522,553
Average Throughput (bytes/sec): 565,808
Total Hits: 834,640
Average Hits per Second: 206
DB Server CPU < 1%
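As a quick cross-check on the two result sets, dividing each total by its reported per-second average gives the length of the window those averages imply. The sketch below uses only the figures from the slides; the function name and the run labels are illustrative, and reading the quotient as a "measurement window" is an assumption about how the load tool computed its averages.

```python
def implied_seconds(total, per_second):
    """Window length implied by a total and its per-second average."""
    return total / per_second


# (total, reported per-second average) pairs copied from the two result slides.
runs = {
    "doc store and UI": {
        "bytes": (7_176_727_121, 1_993_535),
        "hits": (342_833, 95),
    },
    "doc store only": {
        "bytes": (2_291_522_553, 565_808),
        "hits": (834_640, 206),
    },
}

for name, metrics in runs.items():
    for metric, (total, rate) in metrics.items():
        minutes = implied_seconds(total, rate) / 60
        print(f"{name:17s} {metric:5s} -> {minutes:5.1f} min")
```

Within each run the byte and hit figures agree to within a minute (roughly 60 minutes for the first run and roughly 67.5 minutes for the second). That may differ from the stated run lengths if, for example, ramp-up or cool-down time was counted differently.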
Wrapping It Up
Cloud Decision Points
•  Cost Savings
•  Continuous Availability
•  Performance / Dynamic (Elastic) Scalability
•  Global Distribution Of Access Points
•  Redundancy
•  Disaster Recovery
•  Resiliency To Node / Connectivity Losses Is A Must
What Would We Do Differently?
•  Think About Reporting Up Front
•  Data Analytics – Hadoop and Solr Are Heavy Duty
•  More Expensive Hardware?
•  Different RAID Configuration (Not Striping)
•  Get Training – Especially About Schema Design
Final Thoughts...
Consider The Human Element...
•  Mind Shift For RDBMS Folks
•  Need To “Let Go” Of The Idea That Data Needs To Be Normalized
•  Experience Of Operations Team
•  Netflix - 4 People Managing 800+ Nodes
Global Enterprise
•  Global Presence
•  Disaster Recovery
•  Internet Scale
Writing Space and the Cassandra NoSQL DBMS
Thank you!
Questions?
Brian.King@Pearson.com
