SlideShare ist ein Scribd-Unternehmen logo
1 von 40
Downloaden Sie, um offline zu lesen
reimagining the business of
apps
#Cassandra13	
  	
  
©2013 NativeX Holdings, LLC
The Perils and Triumphs of using
Cassandra at a .NET/Microsoft Shop
#Cassandra13	
  	
  
About the Presenters
Jeff Smoley – Infrastructure Architect
Derek Bromenshenkel – Infrastructure Architect
#Cassandra13	
  	
  
Agenda
•  About NativeX
•  Why Cassandra?
•  Challenges
•  Auto Id Generation
•  FluentCassandra
•  Hector
•  IKVM.NET
•  HectorNet
•  Reporting Integration
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
About NativeX
•  Formerly W3i
•  Home Office in Sartell, MN
•  75 miles NW of Minneapolis
•  Remote Offices in MSP and SF
•  150 Employees
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
What NativeX Does
•  Marketing technology
platform that enables
developers to build
successful business around
their apps.
•  We provide Publishers with
a way to monetize and
Advertisers with a way to
gain distribution.
#Cassandra13	
  	
  
Mobile Vanity Metrics
•  Over 700M unique devices
•  1000s of Apps
•  > 100M Monthly Active Users
•  > 200GB of data ingest per week
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Backstory
•  From 100M session/quarter
to 5B.
•  Anticipate 7B sessions in
Q2.
•  Growth was anticipated.
•  Realized infrastructure
needed to change to support
this. 0	
  
1	
  
2	
  
3	
  
4	
  
5	
  
6	
  
2011	
  Q4	
   2012	
  Q1	
   2012	
  Q2	
   2012	
  Q3	
   2012	
  Q4	
   2013	
  Q1	
  
Billions	
  
API	
  Requests	
  
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Original OLTP Architecture
•  Microsoft SQL Server
•  2 Node Cluster (Failover)
•  12 cores / node
•  192 GB mem / node
•  Compellent SAN
•  172 Tiered Disk
•  SSD, FC, SATA
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Objectives
Scale	
  
• Horizontal	
  
• Incremental	
  cost	
  
structure	
  
Resiliency	
  
• No	
  single	
  point	
  
of	
  failure	
  
• Geographically	
  
distributed	
  
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
What is NoSQL
•  Stands for Not Only SQL.
•  The NoSQL movement is about understanding problems and
focusing on solutions.
•  It’s not about silver bullets and black boxes.
•  It is about using the right tool for the right problem.
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Researched Products
•  Compared features like:
•  Distributed / Shared Nothing
•  Multi-Cluster Support
•  Maturity & Popularity
•  Documentation
•  .NET Support
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Why Cassandra?
•  Multi-node
•  Multi-cluster
•  Highly Available
•  Durable
•  Shared Nothing
•  Tunable Consistency
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Cassandra at NativeX
•  C* was not a replacement DB system.
•  We continue to use MS SQL Server alongside C*.
•  SQL Server used for storing configuration data.
•  C* solves a very specific problem for us.
•  Writing large volumes of data quickly.
•  Reading very specific data out of a large record set.
#Cassandra13	
  	
  
Challenges
•  C* does not have Auto Id generation.
•  How to connect to C* with C#?
•  Finding a connector with good Failure Tolerance.
•  How to integrate our reporting system?
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Auto ID Generation
•  Pre-existing requirements
•  Unique, 64-bit positive integers
•  Increasing (sortable) a plus
•  Previously SQL Server Identity column
•  A Time-based UUID is sortable and unique
•  Changed everything we could
•  The future for us
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Auto ID – What are the options?
•  SQL dummy table
•  Easy & familiar, but limited
•  Pre-generated range
•  Proposed by @mdennis
•  Distributed, but more complicated to implement
•  Sharding [Instagram]
•  Discovered too late
•  Unfamiliar with Postgres
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
We chose Snowflake
•  Built by Twitter, Apache 2.0 license
•  https://github.com/twitter/snowflake
•  “… network service for generating unique ID numbers at high
scale..”
•  Same motivation; MySQL -> C*
•  A few tweaks for our Windows environment
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Technical reasons for Snowflake
•  Meets all requirements
•  Tested in high transaction system
•  Java based [Scala] implementation
•  Thrift server
•  Run as a Windows service with Apache Daemon
•  Con: Requires Apache Zookeeper
•  Coordinate the worker id
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Connecting to Snowflake
•  Built our own .NET
Snowflake Client
•  Snowflake server on each
web node
•  Local instance is primary
•  Round robin failover to other
nodes
•  Auto failover AND recovery
•  “Circuit Breaker” pattern
Web	
  
App	
  
SF	
  
Server	
  1	
  
Web	
  
App	
  
SF	
  
Server	
  3	
  
Web	
  
App	
  
SF	
  
Server	
  2	
  
Web	
  
App	
  
SF	
  
Server	
  4	
  
#Cassandra13	
  	
  
Challenges
•  C* does not have Auto Id generation.
•  How to connect to C* with C#?
•  Finding a connector with good Failure Tolerance.
•  How to integrate our reporting system?
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Connecting to Cassandra with C#
•  Thrift alone too low level
•  Needs
•  CQL support
•  Active development / support
•  Wants
•  ADO.NET / LINQ feel
•  ????
•  FluentCassandra is where we started
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Vetting FluentCassandra
•  Pros
•  Open source -
https://github.com/fluentcassandra/fluentcassandra
•  Nick Berardi, project owner, is excellent
•  Designed for CQL
•  Familiar feel
•  Were able to start project development with it
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Vetting FluentCassandra
•  Cons
•  Immaturity
•  Few users with high transaction system
•  Permanent node blacklisting
•  Lacked auto retry
•  Couldn’t live with these limitations
•  Tried hiring independent contractor to help us mature it
#Cassandra13	
  	
  
Challenges
•  C* does not have Auto Id generation.
•  How to connect to C* with C#?
•  Finding a connector with good Failure Tolerance.
•  How to integrate our reporting system?
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Hector: Yes, please
•  Popular C* connector
•  Use cases matching ours
•  Good maturity
•  Auto node discovery
•  Auto retry
•  Auto failure recovery
•  Written in Java – major roadblock
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Help!
•  We knew we still needed help.
•  We found a company named Concord.
•  Based out of the Twin Cites.
•  Specialize in System, Process, and Data Integration.
•  http://concordusa.com/
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Concord’s Recommendation
•  Concord recommended that we use IKVM.NET to port Hector to
a .NET assembly.
•  They had previous success using IKVM for other Java to .NET
ports.
•  They felt that maturing FluentCassandra was going to take
longer than our timeline allowed.
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
About the IKVM.NET Project
•  http://www.ikvm.net/
•  Open Source Project.
•  Main contributor is Jeroen Frijters.
•  He is actively contributing to the project.
•  License allows for use in commercial applications.
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
What is IKVM.NET?
•  IKVM.NET includes the following components:
•  A Java Virtual Machine implemented in .NET.
•  A .NET implementation of the Java class libraries.
•  Set of tools that enable Java and .NET interoperability.
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Uses for IKVM
•  Drop-in JVM
•  Included is a distribution of a .NET implementation of a Java
Virtual Machine.
•  Allows you to run jar files using the .NET stack.
•  Example: ikvm -jar myapp.jar
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Uses for IKVM
•  Use Java libraries in your .NET applications
•  Using ikvmc you can compile Java bytecode to .NET IL.
•  Example: ikvmc -target:library mylib.jar
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Uses for IKVM
•  Develop .NET applications in Java
•  Write code in Java.
•  Compile to JVM bytecode.
•  Use ikvmc to produce a .NET Executable.
•  Can also use .NET API’s in Java code using the ikvmstub
application to generate a Java jar file.
•  Example: ikvmstub MyDotNetAssemblyName
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Hector Converted to .NET
•  Per Concord’s recommendation we chose to compile the Hector
jar into a .NET Assembly.
•  Hector and all of it’s dependencies are pulled into one .NET
dll that can be referenced by any .NET assembly.
•  In addition you will have to reference some core IKVM
assemblies.
•  Each Java dependency is given it’s own namespace with in
the .NET dll.
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
HectorNet
•  Concord also created a dll called HectorNet that wraps some of
the Hector behaviors and makes it feel more like .NET.
•  Such as supporting connection strings.
•  Mapping Thrift byte arrays to .NET data types.
•  Mapping to native .NET collections instead of using Java
collections.
#Cassandra13	
  	
  
Challenges
•  C* does not have Auto Id generation.
•  How to connect to C* with C#?
•  Finding a connector with good Failure Tolerance.
•  How to integrate our reporting system?
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Integrating Reporting
OLTP	
  
C*	
  
Extract	
   Transform	
  
CUBE	
  
SSAS	
  
OLAP	
  
MS	
  SQL	
  
Load	
  
ETL	
  -­‐	
  SSIS	
  
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Integrating Reporting
•  The SSIS Extract process uses C# Script Tasks.
•  Script Task needs references to HectorNet and all of its
dependencies.
•  SSIS can only reference assemblies that are in the GAC.
•  Assemblies in the GAC have to be Signed.
#Cassandra13	
  	
  
Why Not DataStax C# Driver?
•  We built everything using CQL 2.0.
•  Wasn’t ready in time for our launch date.
#Cassandra13	
  	
  
DSE for the Win!
•  We use DataStax Enterprise.
•  Mainly for support, which continues to be a life saver.
©2013 NativeX Holdings, LLC#Cassandra13	
  	
  
Thank you!
•  We are hiring
•  http://nativex.com/careers/
•  Join the MSP C* Meetup
•  http://www.meetup.com/Minneapolis-St-Paul-Cassandra-Meetup/
•  Contact us
•  Jeff.Smoley@nativex.com
•  Derek.Bromenshenkel@nativex.com @breakingtrail
•  Slide Deck
•  http://www.slideshare.net/jjsmoley/the-perils-and-triumphs-of-using-
cassandra-at-a-netmicrosoft-shop

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Melbourne Chef Meetup: Automating Azure Compliance with InSpec
Melbourne Chef Meetup: Automating Azure Compliance with InSpecMelbourne Chef Meetup: Automating Azure Compliance with InSpec
Melbourne Chef Meetup: Automating Azure Compliance with InSpec
 
Cassandra and security
Cassandra and securityCassandra and security
Cassandra and security
 
Microsoft az-104 Dumps
Microsoft az-104 DumpsMicrosoft az-104 Dumps
Microsoft az-104 Dumps
 
Overview of HDFS Transparent Encryption
Overview of HDFS Transparent Encryption Overview of HDFS Transparent Encryption
Overview of HDFS Transparent Encryption
 
Surviving the Azure Avalanche
Surviving the Azure AvalancheSurviving the Azure Avalanche
Surviving the Azure Avalanche
 
Refactoring Into Microservices. Chicago Coders Conference 2017-06-26
Refactoring Into Microservices. Chicago Coders Conference 2017-06-26Refactoring Into Microservices. Chicago Coders Conference 2017-06-26
Refactoring Into Microservices. Chicago Coders Conference 2017-06-26
 
Docker - The Big Picture
Docker  - The Big PictureDocker  - The Big Picture
Docker - The Big Picture
 
Protect your private data with ORC column encryption
Protect your private data with ORC column encryptionProtect your private data with ORC column encryption
Protect your private data with ORC column encryption
 
Securing kafka with 500 billion messages a day
Securing kafka with 500 billion messages a daySecuring kafka with 500 billion messages a day
Securing kafka with 500 billion messages a day
 
Az 104 session 8 azure monitoring
Az 104 session 8 azure monitoringAz 104 session 8 azure monitoring
Az 104 session 8 azure monitoring
 
Database-as-a-Service with Oracle Enterprise Manager Cloud Control 12c and Or...
Database-as-a-Service with Oracle Enterprise Manager Cloud Control 12c and Or...Database-as-a-Service with Oracle Enterprise Manager Cloud Control 12c and Or...
Database-as-a-Service with Oracle Enterprise Manager Cloud Control 12c and Or...
 
Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...
Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...
Securing Hadoop's REST APIs with Apache Knox Gateway Hadoop Summit June 6th, ...
 
Integrating Okta with Anypoint Platform for a mobile security use case
Integrating Okta with Anypoint Platform for a mobile security use caseIntegrating Okta with Anypoint Platform for a mobile security use case
Integrating Okta with Anypoint Platform for a mobile security use case
 
Turbocharging php applications with zend server
Turbocharging php applications with zend serverTurbocharging php applications with zend server
Turbocharging php applications with zend server
 
Transparent Encryption in HDFS
Transparent Encryption in HDFSTransparent Encryption in HDFS
Transparent Encryption in HDFS
 
Hdp security overview
Hdp security overview Hdp security overview
Hdp security overview
 
Az 104 session 4: azure storage
Az 104 session 4: azure storageAz 104 session 4: azure storage
Az 104 session 4: azure storage
 
An Approach for Multi-Tenancy Through Apache Knox
An Approach for Multi-Tenancy Through Apache KnoxAn Approach for Multi-Tenancy Through Apache Knox
An Approach for Multi-Tenancy Through Apache Knox
 
Net core vs. node.js what to choose when
Net core vs. node.js  what to choose when Net core vs. node.js  what to choose when
Net core vs. node.js what to choose when
 
Hadoop security
Hadoop securityHadoop security
Hadoop security
 

Ähnlich wie C* Summit 2013: The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop by Derek Bromenshenkel and Jeff Smoley

Docker Birthday #3 Slides - Overview
Docker Birthday #3 Slides - OverviewDocker Birthday #3 Slides - Overview
Docker Birthday #3 Slides - Overview
Chris Ciborowski
 

Ähnlich wie C* Summit 2013: The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop by Derek Bromenshenkel and Jeff Smoley (20)

Rami Sayar - Node microservices with Docker
Rami Sayar - Node microservices with DockerRami Sayar - Node microservices with Docker
Rami Sayar - Node microservices with Docker
 
Docker for the enterprise
Docker for the enterpriseDocker for the enterprise
Docker for the enterprise
 
Apache Cassandra in the Cloud
Apache Cassandra in the CloudApache Cassandra in the Cloud
Apache Cassandra in the Cloud
 
Docker Birthday #3 - Intro to Docker Slides
Docker Birthday #3 - Intro to Docker SlidesDocker Birthday #3 - Intro to Docker Slides
Docker Birthday #3 - Intro to Docker Slides
 
Docker Birthday #3 Slides - Overview
Docker Birthday #3 Slides - OverviewDocker Birthday #3 Slides - Overview
Docker Birthday #3 Slides - Overview
 
Stay productive_while_slicing_up_the_monolith
Stay productive_while_slicing_up_the_monolithStay productive_while_slicing_up_the_monolith
Stay productive_while_slicing_up_the_monolith
 
Edge 2016 Session 1886 Building your own docker container cloud on ibm power...
Edge 2016 Session 1886  Building your own docker container cloud on ibm power...Edge 2016 Session 1886  Building your own docker container cloud on ibm power...
Edge 2016 Session 1886 Building your own docker container cloud on ibm power...
 
Mongo DB at Community Engine
Mongo DB at Community EngineMongo DB at Community Engine
Mongo DB at Community Engine
 
MongoDB at community engine
MongoDB at community engineMongoDB at community engine
MongoDB at community engine
 
Migrating a build farm from on-prem to AWS
Migrating a build farm from on-prem to AWSMigrating a build farm from on-prem to AWS
Migrating a build farm from on-prem to AWS
 
Cloud-native Data: Every Microservice Needs a Cache
Cloud-native Data: Every Microservice Needs a CacheCloud-native Data: Every Microservice Needs a Cache
Cloud-native Data: Every Microservice Needs a Cache
 
Containers, microservices and serverless for realists
Containers, microservices and serverless for realistsContainers, microservices and serverless for realists
Containers, microservices and serverless for realists
 
MongoDB World 2019: Why NBCUniversal Migrated to MongoDB Atlas
MongoDB World 2019: Why NBCUniversal Migrated to MongoDB AtlasMongoDB World 2019: Why NBCUniversal Migrated to MongoDB Atlas
MongoDB World 2019: Why NBCUniversal Migrated to MongoDB Atlas
 
Basics of Java Cloud
Basics of Java CloudBasics of Java Cloud
Basics of Java Cloud
 
Virtualization Vs. Containers
Virtualization Vs. ContainersVirtualization Vs. Containers
Virtualization Vs. Containers
 
Microservices Development - ICP Workshop Batch II
Microservices Development - ICP Workshop Batch IIMicroservices Development - ICP Workshop Batch II
Microservices Development - ICP Workshop Batch II
 
Enterprise Java on Azure: From Java EE to Spring, we have you covered
Enterprise Java on Azure: From Java EE to Spring, we have you coveredEnterprise Java on Azure: From Java EE to Spring, we have you covered
Enterprise Java on Azure: From Java EE to Spring, we have you covered
 
Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015
 
Why kubernetes matters
Why kubernetes mattersWhy kubernetes matters
Why kubernetes matters
 
Hindsight is 20/20: MySQL to Cassandra
Hindsight is 20/20: MySQL to CassandraHindsight is 20/20: MySQL to Cassandra
Hindsight is 20/20: MySQL to Cassandra
 

Mehr von DataStax Academy

Cassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsCassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart Labs
DataStax Academy
 
Cassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackCassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stack
DataStax Academy
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonCassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
DataStax Academy
 
Standing Up Your First Cluster
Standing Up Your First ClusterStanding Up Your First Cluster
Standing Up Your First Cluster
DataStax Academy
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with Dse
DataStax Academy
 
Introduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraIntroduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache Cassandra
DataStax Academy
 
Enabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseEnabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax Enterprise
DataStax Academy
 
Advanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraAdvanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache Cassandra
DataStax Academy
 

Mehr von DataStax Academy (20)

Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftForrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
 
Introduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseIntroduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph Database
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraIntroduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
 
Cassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsCassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart Labs
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data Modeling
 
Cassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackCassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stack
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
 
Coursera Cassandra Driver
Coursera Cassandra DriverCoursera Cassandra Driver
Coursera Cassandra Driver
 
Production Ready Cassandra
Production Ready CassandraProduction Ready Cassandra
Production Ready Cassandra
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonCassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2
 
Standing Up Your First Cluster
Standing Up Your First ClusterStanding Up Your First Cluster
Standing Up Your First Cluster
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with Dse
 
Introduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraIntroduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache Cassandra
 
Cassandra Core Concepts
Cassandra Core ConceptsCassandra Core Concepts
Cassandra Core Concepts
 
Enabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseEnabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax Enterprise
 
Bad Habits Die Hard
Bad Habits Die Hard Bad Habits Die Hard
Bad Habits Die Hard
 
Advanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraAdvanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache Cassandra
 
Advanced Cassandra
Advanced CassandraAdvanced Cassandra
Advanced Cassandra
 

Kürzlich hochgeladen

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Kürzlich hochgeladen (20)

Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 

C* Summit 2013: The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop by Derek Bromenshenkel and Jeff Smoley

  • 1. reimagining the business of apps #Cassandra13     ©2013 NativeX Holdings, LLC The Perils and Triumphs of using Cassandra at a .NET/Microsoft Shop
  • 2. #Cassandra13     About the Presenters Jeff Smoley – Infrastructure Architect Derek Bromenshenkel – Infrastructure Architect
  • 3. #Cassandra13     Agenda •  About NativeX •  Why Cassandra? •  Challenges •  Auto Id Generation •  FluentCassandra •  Hector •  IKVM.NET •  HectorNet •  Reporting Integration
  • 4. ©2013 NativeX Holdings, LLC#Cassandra13     About NativeX •  Formerly W3i •  Home Office in Sartell, MN •  75 miles NW of Minneapolis •  Remote Offices in MSP and SF •  150 Employees
  • 5. ©2013 NativeX Holdings, LLC#Cassandra13     What NativeX Does •  Marketing technology platform that enables developers to build successful business around their apps. •  We provide Publishers with a way to monetize and Advertisers with a way to gain distribution.
  • 6. #Cassandra13     Mobile Vanity Metrics •  Over 700M unique devices •  1000s of Apps •  > 100M Monthly Active Users •  > 200GB of data ingest per week
  • 7. ©2013 NativeX Holdings, LLC#Cassandra13     Backstory •  From 100M session/quarter to 5B. •  Anticipate 7B sessions in Q2. •  Growth was anticipated. •  Realized infrastructure needed to change to support this. 0   1   2   3   4   5   6   2011  Q4   2012  Q1   2012  Q2   2012  Q3   2012  Q4   2013  Q1   Billions   API  Requests  
  • 8. ©2013 NativeX Holdings, LLC#Cassandra13     Original OLTP Architecture •  Microsoft SQL Server •  2 Node Cluster (Failover) •  12 cores / node •  192 GB mem / node •  Compellent SAN •  172 Tiered Disk •  SSD, FC, SATA
  • 9. ©2013 NativeX Holdings, LLC#Cassandra13     Objectives Scale   • Horizontal   • Incremental  cost   structure   Resiliency   • No  single  point   of  failure   • Geographically   distributed  
  • 10. ©2013 NativeX Holdings, LLC#Cassandra13     What is NoSQL •  Stands for Not Only SQL. •  The NoSQL movement is about understanding problems and focusing on solutions. •  It’s not about silver bullets and black boxes. •  It is about using the right tool for the right problem.
  • 11. ©2013 NativeX Holdings, LLC#Cassandra13     Researched Products •  Compared features like: •  Distributed / Shared Nothing •  Multi-Cluster Support •  Maturity & Popularity •  Documentation •  .NET Support
  • 12. ©2013 NativeX Holdings, LLC#Cassandra13     Why Cassandra? •  Multi-node •  Multi-cluster •  Highly Available •  Durable •  Shared Nothing •  Tunable Consistency
  • 13. ©2013 NativeX Holdings, LLC#Cassandra13     Cassandra at NativeX •  C* was not a replacement DB system. •  We continue to use MS SQL Server alongside C*. •  SQL Server used for storing configuration data. •  C* solves a very specific problem for us. •  Writing large volumes of data quickly. •  Reading very specific data out of a large record set.
  • 14. #Cassandra13     Challenges •  C* does not have Auto Id generation. •  How to connect to C* with C#? •  Finding a connector with good Failure Tolerance. •  How to integrate our reporting system?
  • 15. ©2013 NativeX Holdings, LLC#Cassandra13     Auto ID Generation •  Pre-existing requirements •  Unique, 64-bit positive integers •  Increasing (sortable) a plus •  Previously SQL Server Identity column •  A Time-based UUID is sortable and unique •  Changed everything we could •  The future for us
  • 16. ©2013 NativeX Holdings, LLC#Cassandra13     Auto ID – What are the options? •  SQL dummy table •  Easy & familiar, but limited •  Pre-generated range •  Proposed by @mdennis •  Distributed, but more complicated to implement •  Sharding [Instagram] •  Discovered too late •  Unfamiliar with Postgres
  • 17. ©2013 NativeX Holdings, LLC#Cassandra13     We chose Snowflake •  Built by Twitter, Apache 2.0 license •  https://github.com/twitter/snowflake •  “… network service for generating unique ID numbers at high scale..” •  Same motivation; MySQL -> C* •  A few tweaks for our Windows environment
  • 18. ©2013 NativeX Holdings, LLC#Cassandra13     Technical reasons for Snowflake •  Meets all requirements •  Tested in high transaction system •  Java based [Scala] implementation •  Thrift server •  Run as a Windows service with Apache Daemon •  Con: Requires Apache Zookeeper •  Coordinate the worker id
  • 19. ©2013 NativeX Holdings, LLC#Cassandra13     Connecting to Snowflake •  Built our own .NET Snowflake Client •  Snowflake server on each web node •  Local instance is primary •  Round robin failover to other nodes •  Auto failover AND recovery •  “Circuit Breaker” pattern Web   App   SF   Server  1   Web   App   SF   Server  3   Web   App   SF   Server  2   Web   App   SF   Server  4  
  • 20. #Cassandra13     Challenges •  C* does not have Auto Id generation. •  How to connect to C* with C#? •  Finding a connector with good Failure Tolerance. •  How to integrate our reporting system?
  • 21. ©2013 NativeX Holdings, LLC#Cassandra13     Connecting to Cassandra with C# •  Thrift alone too low level •  Needs •  CQL support •  Active development / support •  Wants •  ADO.NET / LINQ feel •  ???? •  FluentCassandra is where we started
  • 22. ©2013 NativeX Holdings, LLC#Cassandra13     Vetting FluentCassandra •  Pros •  Open source - https://github.com/fluentcassandra/fluentcassandra •  Nick Berardi, project owner, is excellent •  Designed for CQL •  Familiar feel •  Were able to start project development with it
  • 23. ©2013 NativeX Holdings, LLC#Cassandra13     Vetting FluentCassandra •  Cons •  Immaturity •  Few users with high transaction system •  Permanent node blacklisting •  Lacked auto retry •  Couldn’t live with these limitations •  Tried hiring independent contractor to help us mature it
  • 24. #Cassandra13     Challenges •  C* does not have Auto Id generation. •  How to connect to C* with C#? •  Finding a connector with good Failure Tolerance. •  How to integrate our reporting system?
  • 25. ©2013 NativeX Holdings, LLC#Cassandra13     Hector: Yes, please •  Popular C* connector •  Use cases matching ours •  Good maturity •  Auto node discovery •  Auto retry •  Auto failure recovery •  Written in Java – major roadblock
  • 26. ©2013 NativeX Holdings, LLC#Cassandra13     Help! •  We knew we still needed help. •  We found a company named Concord. •  Based out of the Twin Cites. •  Specialize in System, Process, and Data Integration. •  http://concordusa.com/
  • 27. ©2013 NativeX Holdings, LLC#Cassandra13     Concord’s Recommendation •  Concord recommended that we use IKVM.NET to port Hector to a .NET assembly. •  They had previous success using IKVM for other Java to .NET ports. •  They felt that maturing FluentCassandra was going to take longer than our timeline allowed.
  • 28. ©2013 NativeX Holdings, LLC#Cassandra13     About the IKVM.NET Project •  http://www.ikvm.net/ •  Open Source Project. •  Main contributor is Jeroen Frijters. •  He is actively contributing to the project. •  License allows for use in commercial applications.
  • 29. ©2013 NativeX Holdings, LLC#Cassandra13     What is IKVM.NET? •  IKVM.NET includes the following components: •  A Java Virtual Machine implemented in .NET. •  A .NET implementation of the Java class libraries. •  Set of tools that enable Java and .NET interoperability.
  • 30. ©2013 NativeX Holdings, LLC#Cassandra13     Uses for IKVM •  Drop-in JVM •  Included is a distribution of a .NET implementation of a Java Virtual Machine. •  Allows you to run jar files using the .NET stack. •  Example: ikvm -jar myapp.jar
  • 31. ©2013 NativeX Holdings, LLC#Cassandra13     Uses for IKVM •  Use Java libraries in your .NET applications •  Using ikvmc you can compile Java bytecode to .NET IL. •  Example: ikvmc -target:library mylib.jar
  • 32. ©2013 NativeX Holdings, LLC#Cassandra13     Uses for IKVM •  Develop .NET applications in Java •  Write code in Java. •  Compile to JVM bytecode. •  Use ikvmc to produce a .NET Executable. •  Can also use .NET API’s in Java code using the ikvmstub application to generate a Java jar file. •  Example: ikvmstub MyDotNetAssemblyName
  • 33. ©2013 NativeX Holdings, LLC#Cassandra13     Hector Converted to .NET •  Per Concord’s recommendation we chose to compile the Hector jar into a .NET Assembly. •  Hector and all of it’s dependencies are pulled into one .NET dll that can be referenced by any .NET assembly. •  In addition you will have to reference some core IKVM assemblies. •  Each Java dependency is given it’s own namespace with in the .NET dll.
  • 34. ©2013 NativeX Holdings, LLC#Cassandra13     HectorNet •  Concord also created a dll called HectorNet that wraps some of the Hector behaviors and makes it feel more like .NET. •  Such as supporting connection strings. •  Mapping Thrift byte arrays to .NET data types. •  Mapping to native .NET collections instead of using Java collections.
  • 35. #Cassandra13     Challenges •  C* does not have Auto Id generation. •  How to connect to C* with C#? •  Finding a connector with good Failure Tolerance. •  How to integrate our reporting system?
  • 36. ©2013 NativeX Holdings, LLC#Cassandra13     Integrating Reporting OLTP   C*   Extract   Transform   CUBE   SSAS   OLAP   MS  SQL   Load   ETL  -­‐  SSIS  
  • 37. ©2013 NativeX Holdings, LLC#Cassandra13     Integrating Reporting •  The SSIS Extract process uses C# Script Tasks. •  Script Task needs references to HectorNet and all of its dependencies. •  SSIS can only reference assemblies that are in the GAC. •  Assemblies in the GAC have to be Signed.
  • 38. #Cassandra13     Why Not DataStax C# Driver? •  We built everything using CQL 2.0. •  Wasn’t ready in time for our launch date.
  • 39. #Cassandra13     DSE for the Win! •  We use DataStax Enterprise. •  Mainly for support, which continues to be a life saver.
  • 40. ©2013 NativeX Holdings, LLC#Cassandra13     Thank you! •  We are hiring •  http://nativex.com/careers/ •  Join the MSP C* Meetup •  http://www.meetup.com/Minneapolis-St-Paul-Cassandra-Meetup/ •  Contact us •  Jeff.Smoley@nativex.com •  Derek.Bromenshenkel@nativex.com @breakingtrail •  Slide Deck •  http://www.slideshare.net/jjsmoley/the-perils-and-triumphs-of-using- cassandra-at-a-netmicrosoft-shop