SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Downloaden Sie, um offline zu lesen
CONFIDENTIAL
1
Praveen Kumar
Emerging Software Platforms,
Global Software Engineering
Mar 2014
Equinix Big Data Platform & Cassandra
Confidential – © 2013 Equinix Inc. www.equinix.com 2
Big Data at Equinix
~2 million
Alarms
~200k
interconnections
~250k
Electrical circuits
Sensors across 95+ IBXs
~40k
Infrastructure objects
Confidential – © 2013 Equinix Inc. www.equinix.com 3
Big Data at Equinix
Sensors across 95+ IBXs
Lead to / produce
Support for multiple protocols
Push as well pull methods
Time series data
Cross sectional dataNot so clean data
High velocity
Clean data Lots and lots of noise
Some useful intel
Confidential – © 2013 Equinix Inc. www.equinix.com 4
Big Data at Equinix
What do we use(or plan to use) this data for?
Customer Presentment Billing
Operations New Product & Services
Confidential – © 2013 Equinix Inc. www.equinix.com 5
Big Data at Equinix
Use-case analysis : 80-20 rule
~80% of use-cases analyzed act upon “Hot Data”
~80% of data for most of use-cases analyzed is time-series.
All “quick win” use-cases need data mediation, aggregation and roll-up for
presentment.
Real-time to near real-time processing of events
Collection, processing and storage technologies suitable for
time-series data.
Collection, mediation, cross-referencing and co-relation of
data from different sources; roll-up and aggregate.
Confidential – © 2013 Equinix Inc. www.equinix.com 6
Big Data at Equinix
Our Approach : Equinix Big Data Platform
§  Common platform to be shared by all initial Big
Data use cases – multi tenancy
§  Built on inexpensive hardware using free or
inexpensive software
§  Seamless & massive scalability using scale-out
§  High reliability - partial failover, graceful
degradation, self-healing, self-balancing
§  Data ingestion and processing capabilities for
high volumes at high velocity
§  Support for structured and semi-structured data
§  Provides real-time processing abilities
§  Provides parallel processing capabilities
§  Support for low latency queries, wide range
scan queries and search
§  Provides abstraction via connectors,
frameworks and libraries
§  Support for low latency queries, wide range
scan queries and search
§  Support for predictive analytics using machine
learning
Immediate requirements
Long term goals
Big Data Platform - Logical Architecture (technology agnostic)
Confidential – © 2013 Equinix Inc. www.equinix.com 7
Big Data at Equinix
Requirements & Technologies considered for Big Data Platform
Confidential – © 2013 Equinix Inc. www.equinix.com 8
Big Data at Equinix
Grand Finale
Hadoop Ecosystem vs. DataStax Enterprise
SearchSearch
SearchSearch
AnalyticsAnalytics
StorageStorageAnalyticsAnalytics
StorageStorage
StorageStorage
Hadoop	
  Distributed	
  File	
  System
(Storage/Analytics)
NameNode Secondary	
  Name	
  Node
Data	
  Nodes	
  (Storage)
HBase	
  (Storage/Analytics)
Hbase	
  Master
Hbase	
  Region	
  Servers
Hbase	
  Master
Search
Management	
  
Services
Cloudera	
  Manager
Solr	
  Nodes
Zookeeper
Pros
•  Scalability
•  Cloud readiness
•  Resource availability
•  Industry momentum
•  Product eco-system
maturity
•  Technical support
Cons
•  Infrastructure footprint
•  Operational Complexity
•  Learning curve
•  Availability
•  Total cost of ownership
Pros
•  Infrastructure footprint
•  Operational ease
•  Scalability
•  Availability
•  Cloud readiness
•  Learning curve
•  Resource availability
•  Technical support
•  Total cost of ownership
Cons
•  Industry momentum
•  Product eco-system
maturity
Confidential – © 2013 Equinix Inc. www.equinix.com 9
Criteria	
   Cassandra	
   HBase	
  
CAP Theorem Focus Availability, Partition-Tolerance Consistency, Availability
Data Partitioning
Supports ordered & random partitioning, random
partitioning is recommended.
Ordered Partitioning. Load balancing
achieved through resharding.
Distributed System P2P architecture (Amazon Dynamo)
Master / Slave via HDFS, Zookeeper for
coordination
Administration & Maintenance Medium High
Single Write Master No (R+W+1 to get Strong Consistency) Yes
Multi-tenancy Yes Yes
Secondary indexes
Supports secondary indexes on CF where column
name is known.
Does not natively support secondary indexes.
Consistency Tunable Consistency Strict consistency (Not ACID)
Hot Spot Problem
No, distributes load across nodes using random
partition strategy.
Yes, one node may handle most of the traffic
due to ordered partition.
Multi-Data Center Support
and Disaster Recovery
Asynchronous replication via WAN Asynchronous replication via WAN
Single point of failure Ring topology, there is no single point of failure.
Although there exists a concept of a master
server, HBase itself does not depend on it
heavily. HBase cluster can keep serving data
even if the master goes down. Hadoop
namenode is a single point of failure.
Commercial vendors Datastax, Acunu Clodera, Hortonworks
Cassandra Vs. HBase
Big Data at Equinix
Confidential – © 2013 Equinix Inc. www.equinix.com 10
Why DSE Cassandra
Big Data at Equinix
Support for Analytics
Integrated search using Solr
Security features
Cluster management capabilities
Commercial support
DataStax would probably list lots of more reasons, these are the reasons
relevant to us.
Confidential – © 2013 Equinix Inc. www.equinix.com 11
Big Data at Equinix
Grand Finale
Hadoop Ecosystem vs. DataStax Enterprise
SearchSearch
SearchSearch
AnalyticsAnalytics
StorageStorageAnalyticsAnalytics
StorageStorage
StorageStorage
Hadoop	
  Distributed	
  File	
  System
(Storage/Analytics)
NameNode Secondary	
  Name	
  Node
Data	
  Nodes	
  (Storage)
HBase	
  (Storage/Analytics)
Hbase	
  Master
Hbase	
  Region	
  Servers
Hbase	
  Master
Search
Management	
  
Services
Cloudera	
  Manager
Solr	
  Nodes
Zookeeper
Pros
•  Scalability
•  Cloud readiness
•  Resource availability
•  Industry momentum
•  Product eco-system
maturity
•  Technical support
Cons
•  Infrastructure footprint
•  Operational Complexity
•  Learning curve
•  Availability
•  Total cost of ownership
Pros
•  Infrastructure footprint
•  Operational ease
•  Scalability
•  Availability
•  Cloud readiness
•  Learning curve
•  Resource availability
•  Technical support
•  Total cost of ownership
Cons
•  Industry momentum
•  Product eco-system
maturity
ß Sold
Confidential – © 2013 Equinix Inc. www.equinix.com 12
Big Data at Equinix
How far are we on our Big Data journey?
ü  Pilot use-case from PoC to Production
ü  Moved network statistics use case from RRD
based solution to DSE Cassandra
ü  Build in progress for
§  power monitoring use cases
§  data center monitoring
§  network monitoring
In-plans
Ø  Recommendation engine on interconnection
platform
Ø  Use case analysis and technology selection for
connected data sets
Ø  Building data science capabilities for use cases
requiring predictive modeling
A few data points
Physical bare metal boxes for DSE
nodes
Densely packed data nodes with 4TB
storage on each node, 96GB RAM
About ~250 million records a day
Also used for log analysis for internal
IT systems monitoring use-cases
Confidential – © 2013 Equinix Inc. www.equinix.com 13
Big Data at Equinix
Experience so far
Lack of standards based connectors / drivers
DataStax has developed a Java Driver, but doesn’t support JDBC
No data visualization tools to access from Cassandra for low-latency access
No data access tools (Toad equivalent) available yet; DevCenter is not there yet
We
used Astyanax and are evaluating DataStax java driver
built libraries to abstract Astyanax for application engineering teams
built rest services for data access by applications
Good reliability
Not many instances of nodes being down
Handled loads even when nodes were down
Confidential – © 2013 Equinix Inc. www.equinix.com 14
Big Data at Equinix
Where do we go from here??
Graph databases
Batch processing (Hadoop, Spark , MapReduce ??)
Interactive queries
Online data processing
Data analytics
Data science and machine learning
Data visualization tools and applications
Developer toolkits
We are hiring
Big Data Architect
Big Data Engineers
Data Scientists
send resume at
pkumar@equinix.com
CONFIDENTIAL
15
Thank you!
•  pkmr.work@gmail.com
•  pkumar@equinix.com
•  www.equinix.com
EQUINIX?
Confidential – © 2013 Equinix Inc. www.equinix.com 17
WHO IS EQUINIX?
Confidential – © 2013 Equinix Inc. www.equinix.com 18
GLOBAL
DATA CENTERS
95+ Data Centers
9M+ Square Feet
99.999% Uptime Record
INTERCONNECTION
950+ Networks
110,000+ Cross Connects
BUSINESS
ECOSYSTEMS
Equinix Marketplace™
4,000+ Businesses
Revenue Opportunities
MOVING TOWARDS THE FUTURE | PLATFORM
Equinix: A Platform for Growth
Solid. Powerful. Growing.
$1.8B
IN ANNUALIZED
REVENUE
MEMBER OF THE NASDAQ 100
$7B
INVESTMENTS
IN EXPANSION
15 COUNTRIES
5 CONTINENTS
31 MARKETS
Confidential – © 2013 Equinix Inc. www.equinix.com 21
HOW WE’RE DIFFERENT | GLOBAL FOOTPRINT
Where You Are. Where You Need To Be.
90%
PASS THROUGH EQUINIX DATA CENTERS
OVER
OF INTERNET ROUTES
950+NETWORK PROVIDERS
450+
CLOUD & SaaS
PROVIDERS
CONFIDENTIAL
24
Thank you!
•  pkmr.work@gmail.com
•  pkumar@equinix.com
•  www.equinix.com

Weitere ähnliche Inhalte

Was ist angesagt?

Part 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to EndPart 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to EndCloudera, Inc.
 
Standing Up an Effective Enterprise Data Hub -- Technology and Beyond
Standing Up an Effective Enterprise Data Hub -- Technology and BeyondStanding Up an Effective Enterprise Data Hub -- Technology and Beyond
Standing Up an Effective Enterprise Data Hub -- Technology and BeyondCloudera, Inc.
 
The Vision & Challenge of Applied Machine Learning
The Vision & Challenge of Applied Machine LearningThe Vision & Challenge of Applied Machine Learning
The Vision & Challenge of Applied Machine LearningCloudera, Inc.
 
Customer Best Practices: Optimizing Cloudera on AWS
Customer Best Practices: Optimizing Cloudera on AWSCustomer Best Practices: Optimizing Cloudera on AWS
Customer Best Practices: Optimizing Cloudera on AWSCloudera, Inc.
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduCloudera, Inc.
 
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac... Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...Cloudera, Inc.
 
Part 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchPart 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchCloudera, Inc.
 
How Data Drives Business at Choice Hotels
How Data Drives Business at Choice HotelsHow Data Drives Business at Choice Hotels
How Data Drives Business at Choice HotelsCloudera, Inc.
 
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18Cloudera, Inc.
 
Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?Cloudera, Inc.
 
Self-service Big Data Analytics on Microsoft Azure
Self-service Big Data Analytics on Microsoft AzureSelf-service Big Data Analytics on Microsoft Azure
Self-service Big Data Analytics on Microsoft AzureCloudera, Inc.
 
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Cloudera, Inc.
 
Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester WebinarCloudera, Inc.
 
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...ArabNet ME
 
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...Cloudera, Inc.
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...Cloudera, Inc.
 
Data Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the EnterpriseData Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the EnterpriseCloudera, Inc.
 
Data Engineering: Elastic, Low-Cost Data Processing in the Cloud
Data Engineering: Elastic, Low-Cost Data Processing in the CloudData Engineering: Elastic, Low-Cost Data Processing in the Cloud
Data Engineering: Elastic, Low-Cost Data Processing in the CloudCloudera, Inc.
 
Big data journey to the cloud 5.30.18 asher bartch
Big data journey to the cloud 5.30.18   asher bartchBig data journey to the cloud 5.30.18   asher bartch
Big data journey to the cloud 5.30.18 asher bartchCloudera, Inc.
 
A Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber ThreatsA Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber ThreatsCloudera, Inc.
 

Was ist angesagt? (20)

Part 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to EndPart 3: Models in Production: A Look From Beginning to End
Part 3: Models in Production: A Look From Beginning to End
 
Standing Up an Effective Enterprise Data Hub -- Technology and Beyond
Standing Up an Effective Enterprise Data Hub -- Technology and BeyondStanding Up an Effective Enterprise Data Hub -- Technology and Beyond
Standing Up an Effective Enterprise Data Hub -- Technology and Beyond
 
The Vision & Challenge of Applied Machine Learning
The Vision & Challenge of Applied Machine LearningThe Vision & Challenge of Applied Machine Learning
The Vision & Challenge of Applied Machine Learning
 
Customer Best Practices: Optimizing Cloudera on AWS
Customer Best Practices: Optimizing Cloudera on AWSCustomer Best Practices: Optimizing Cloudera on AWS
Customer Best Practices: Optimizing Cloudera on AWS
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache Kudu
 
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac... Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
Cloudera Data Science Workbench: sparklyr, implyr, and More - dplyr Interfac...
 
Part 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchPart 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science Workbench
 
How Data Drives Business at Choice Hotels
How Data Drives Business at Choice HotelsHow Data Drives Business at Choice Hotels
How Data Drives Business at Choice Hotels
 
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
 
Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?Hadoop on Cloud: Why and How?
Hadoop on Cloud: Why and How?
 
Self-service Big Data Analytics on Microsoft Azure
Self-service Big Data Analytics on Microsoft AzureSelf-service Big Data Analytics on Microsoft Azure
Self-service Big Data Analytics on Microsoft Azure
 
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera

 
Kudu Forrester Webinar
Kudu Forrester WebinarKudu Forrester Webinar
Kudu Forrester Webinar
 
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...
Evolution from Apache Hadoop to the Enterprise Data Hub by Cloudera - ArabNet...
 
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
 
Data Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the EnterpriseData Science and Machine Learning for the Enterprise
Data Science and Machine Learning for the Enterprise
 
Data Engineering: Elastic, Low-Cost Data Processing in the Cloud
Data Engineering: Elastic, Low-Cost Data Processing in the CloudData Engineering: Elastic, Low-Cost Data Processing in the Cloud
Data Engineering: Elastic, Low-Cost Data Processing in the Cloud
 
Big data journey to the cloud 5.30.18 asher bartch
Big data journey to the cloud 5.30.18   asher bartchBig data journey to the cloud 5.30.18   asher bartch
Big data journey to the cloud 5.30.18 asher bartch
 
A Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber ThreatsA Community Approach to Fighting Cyber Threats
A Community Approach to Fighting Cyber Threats
 

Ähnlich wie Cassandra Day SV 2014: Apache Cassandra at Equinix for High Performance, Scalability and Short Response Time

Equinix Big Data Platform and Cassandra - A view into the journey
Equinix Big Data Platform and Cassandra - A view into the journeyEquinix Big Data Platform and Cassandra - A view into the journey
Equinix Big Data Platform and Cassandra - A view into the journeyPraveen Kumar
 
Comment dĂŠvelopper une stratĂŠgie Big Data dans le cloud public avec l'offre P...
Comment dĂŠvelopper une stratĂŠgie Big Data dans le cloud public avec l'offre P...Comment dĂŠvelopper une stratĂŠgie Big Data dans le cloud public avec l'offre P...
Comment dĂŠvelopper une stratĂŠgie Big Data dans le cloud public avec l'offre P...Cloudera, Inc.
 
Analytics and Lakehouse Integration Options for Oracle Applications
Analytics and Lakehouse Integration Options for Oracle ApplicationsAnalytics and Lakehouse Integration Options for Oracle Applications
Analytics and Lakehouse Integration Options for Oracle ApplicationsRay FĂŠvrier
 
Actian Analytics Platform - Hadoop SQL Edition
Actian Analytics Platform - Hadoop SQL EditionActian Analytics Platform - Hadoop SQL Edition
Actian Analytics Platform - Hadoop SQL EditionAlessandro Salvatico
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceSnowflake Computing
 
Azure Data.pptx
Azure Data.pptxAzure Data.pptx
Azure Data.pptxFedoRam1
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWKent Graziano
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database RoundtableEric Kavanagh
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Cloudera, Inc.
 
High-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache ImpalaHigh-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache ImpalaCloudera, Inc.
 
Seamless Migration of Public Sector Data and Workloads to the AWS Cloud - AWS...
Seamless Migration of Public Sector Data and Workloads to the AWS Cloud - AWS...Seamless Migration of Public Sector Data and Workloads to the AWS Cloud - AWS...
Seamless Migration of Public Sector Data and Workloads to the AWS Cloud - AWS...Amazon Web Services
 
Build Big Data Enterprise Solutions Faster on Azure HDInsight
Build Big Data Enterprise Solutions Faster on Azure HDInsightBuild Big Data Enterprise Solutions Faster on Azure HDInsight
Build Big Data Enterprise Solutions Faster on Azure HDInsightDataWorks Summit/Hadoop Summit
 
Get Started with Cloudera’s Cyber Solution
Get Started with Cloudera’s Cyber SolutionGet Started with Cloudera’s Cyber Solution
Get Started with Cloudera’s Cyber SolutionCloudera, Inc.
 
How to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
How to Architect a Serverless Cloud Data Lake for Enhanced Data AnalyticsHow to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
How to Architect a Serverless Cloud Data Lake for Enhanced Data AnalyticsInformatica
 
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...confluent
 
SQL + Hadoop: The High Performance Advantageďż˝
SQL + Hadoop:  The High Performance Advantage�SQL + Hadoop:  The High Performance Advantage�
SQL + Hadoop: The High Performance Advantage�Actian Corporation
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubCloudera, Inc.
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution DaysAmazon Web Services
 
Revolutionizing the customer experience - Hello Engagement Database
Revolutionizing the customer experience - Hello Engagement DatabaseRevolutionizing the customer experience - Hello Engagement Database
Revolutionizing the customer experience - Hello Engagement DatabaseDipti Borkar
 

Ähnlich wie Cassandra Day SV 2014: Apache Cassandra at Equinix for High Performance, Scalability and Short Response Time (20)

Equinix Big Data Platform and Cassandra - A view into the journey
Equinix Big Data Platform and Cassandra - A view into the journeyEquinix Big Data Platform and Cassandra - A view into the journey
Equinix Big Data Platform and Cassandra - A view into the journey
 
Comment dĂŠvelopper une stratĂŠgie Big Data dans le cloud public avec l'offre P...
Comment dĂŠvelopper une stratĂŠgie Big Data dans le cloud public avec l'offre P...Comment dĂŠvelopper une stratĂŠgie Big Data dans le cloud public avec l'offre P...
Comment dĂŠvelopper une stratĂŠgie Big Data dans le cloud public avec l'offre P...
 
Analytics and Lakehouse Integration Options for Oracle Applications
Analytics and Lakehouse Integration Options for Oracle ApplicationsAnalytics and Lakehouse Integration Options for Oracle Applications
Analytics and Lakehouse Integration Options for Oracle Applications
 
Actian Analytics Platform - Hadoop SQL Edition
Actian Analytics Platform - Hadoop SQL EditionActian Analytics Platform - Hadoop SQL Edition
Actian Analytics Platform - Hadoop SQL Edition
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a Service
 
Azure Data.pptx
Azure Data.pptxAzure Data.pptx
Azure Data.pptx
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
 
High-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache ImpalaHigh-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache Impala
 
Seamless Migration of Public Sector Data and Workloads to the AWS Cloud - AWS...
Seamless Migration of Public Sector Data and Workloads to the AWS Cloud - AWS...Seamless Migration of Public Sector Data and Workloads to the AWS Cloud - AWS...
Seamless Migration of Public Sector Data and Workloads to the AWS Cloud - AWS...
 
Build Big Data Enterprise Solutions Faster on Azure HDInsight
Build Big Data Enterprise Solutions Faster on Azure HDInsightBuild Big Data Enterprise Solutions Faster on Azure HDInsight
Build Big Data Enterprise Solutions Faster on Azure HDInsight
 
Get Started with Cloudera’s Cyber Solution
Get Started with Cloudera’s Cyber SolutionGet Started with Cloudera’s Cyber Solution
Get Started with Cloudera’s Cyber Solution
 
How to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
How to Architect a Serverless Cloud Data Lake for Enhanced Data AnalyticsHow to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
How to Architect a Serverless Cloud Data Lake for Enhanced Data Analytics
 
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
 
SQL + Hadoop: The High Performance Advantageďż˝
SQL + Hadoop:  The High Performance Advantage�SQL + Hadoop:  The High Performance Advantage�
SQL + Hadoop: The High Performance Advantageďż˝
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution Days
 
Revolutionizing the customer experience - Hello Engagement Database
Revolutionizing the customer experience - Hello Engagement DatabaseRevolutionizing the customer experience - Hello Engagement Database
Revolutionizing the customer experience - Hello Engagement Database
 

Mehr von DataStax Academy

Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftForrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftDataStax Academy
 
Introduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseIntroduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseDataStax Academy
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraIntroduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraDataStax Academy
 
Cassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsCassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsDataStax Academy
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingDataStax Academy
 
Cassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackCassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackDataStax Academy
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache CassandraDataStax Academy
 
Coursera Cassandra Driver
Coursera Cassandra DriverCoursera Cassandra Driver
Coursera Cassandra DriverDataStax Academy
 
Production Ready Cassandra
Production Ready CassandraProduction Ready Cassandra
Production Ready CassandraDataStax Academy
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonCassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonDataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2DataStax Academy
 
Standing Up Your First Cluster
Standing Up Your First ClusterStanding Up Your First Cluster
Standing Up Your First ClusterDataStax Academy
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with DseDataStax Academy
 
Introduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraIntroduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraDataStax Academy
 
Cassandra Core Concepts
Cassandra Core ConceptsCassandra Core Concepts
Cassandra Core ConceptsDataStax Academy
 
Enabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseEnabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseDataStax Academy
 
Bad Habits Die Hard
Bad Habits Die Hard Bad Habits Die Hard
Bad Habits Die Hard DataStax Academy
 
Advanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraAdvanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraDataStax Academy
 

Mehr von DataStax Academy (20)

Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftForrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
 
Introduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseIntroduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph Database
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraIntroduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
 
Cassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsCassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart Labs
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data Modeling
 
Cassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackCassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stack
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
 
Coursera Cassandra Driver
Coursera Cassandra DriverCoursera Cassandra Driver
Coursera Cassandra Driver
 
Production Ready Cassandra
Production Ready CassandraProduction Ready Cassandra
Production Ready Cassandra
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonCassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2
 
Standing Up Your First Cluster
Standing Up Your First ClusterStanding Up Your First Cluster
Standing Up Your First Cluster
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with Dse
 
Introduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraIntroduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache Cassandra
 
Cassandra Core Concepts
Cassandra Core ConceptsCassandra Core Concepts
Cassandra Core Concepts
 
Enabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseEnabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax Enterprise
 
Bad Habits Die Hard
Bad Habits Die Hard Bad Habits Die Hard
Bad Habits Die Hard
 
Advanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraAdvanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache Cassandra
 
Advanced Cassandra
Advanced CassandraAdvanced Cassandra
Advanced Cassandra
 

KĂźrzlich hochgeladen

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel AraĂşjo
 

KĂźrzlich hochgeladen (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Cassandra Day SV 2014: Apache Cassandra at Equinix for High Performance, Scalability and Short Response Time

  • 1. CONFIDENTIAL 1 Praveen Kumar Emerging Software Platforms, Global Software Engineering Mar 2014 Equinix Big Data Platform & Cassandra
  • 2. Confidential – Š 2013 Equinix Inc. www.equinix.com 2 Big Data at Equinix ~2 million Alarms ~200k interconnections ~250k Electrical circuits Sensors across 95+ IBXs ~40k Infrastructure objects
  • 3. Confidential – Š 2013 Equinix Inc. www.equinix.com 3 Big Data at Equinix Sensors across 95+ IBXs Lead to / produce Support for multiple protocols Push as well pull methods Time series data Cross sectional dataNot so clean data High velocity Clean data Lots and lots of noise Some useful intel
  • 4. Confidential – Š 2013 Equinix Inc. www.equinix.com 4 Big Data at Equinix What do we use(or plan to use) this data for? Customer Presentment Billing Operations New Product & Services
  • 5. Confidential – Š 2013 Equinix Inc. www.equinix.com 5 Big Data at Equinix Use-case analysis : 80-20 rule ~80% of use-cases analyzed act upon “Hot Data” ~80% of data for most of use-cases analyzed is time-series. All “quick win” use-cases need data mediation, aggregation and roll-up for presentment. Real-time to near real-time processing of events Collection, processing and storage technologies suitable for time-series data. Collection, mediation, cross-referencing and co-relation of data from different sources; roll-up and aggregate.
  • 6. Confidential – Š 2013 Equinix Inc. www.equinix.com 6 Big Data at Equinix Our Approach : Equinix Big Data Platform §  Common platform to be shared by all initial Big Data use cases – multi tenancy §  Built on inexpensive hardware using free or inexpensive software §  Seamless & massive scalability using scale-out §  High reliability - partial failover, graceful degradation, self-healing, self-balancing §  Data ingestion and processing capabilities for high volumes at high velocity §  Support for structured and semi-structured data §  Provides real-time processing abilities §  Provides parallel processing capabilities §  Support for low latency queries, wide range scan queries and search §  Provides abstraction via connectors, frameworks and libraries §  Support for low latency queries, wide range scan queries and search §  Support for predictive analytics using machine learning Immediate requirements Long term goals Big Data Platform - Logical Architecture (technology agnostic)
  • 7. Confidential – Š 2013 Equinix Inc. www.equinix.com 7 Big Data at Equinix Requirements & Technologies considered for Big Data Platform
  • 8. Confidential – Š 2013 Equinix Inc. www.equinix.com 8 Big Data at Equinix Grand Finale Hadoop Ecosystem vs. DataStax Enterprise SearchSearch SearchSearch AnalyticsAnalytics StorageStorageAnalyticsAnalytics StorageStorage StorageStorage Hadoop  Distributed  File  System (Storage/Analytics) NameNode Secondary  Name  Node Data  Nodes  (Storage) HBase  (Storage/Analytics) Hbase  Master Hbase  Region  Servers Hbase  Master Search Management   Services Cloudera  Manager Solr  Nodes Zookeeper Pros •  Scalability •  Cloud readiness •  Resource availability •  Industry momentum •  Product eco-system maturity •  Technical support Cons •  Infrastructure footprint •  Operational Complexity •  Learning curve •  Availability •  Total cost of ownership Pros •  Infrastructure footprint •  Operational ease •  Scalability •  Availability •  Cloud readiness •  Learning curve •  Resource availability •  Technical support •  Total cost of ownership Cons •  Industry momentum •  Product eco-system maturity
  • 9. Confidential – Š 2013 Equinix Inc. www.equinix.com 9 Criteria   Cassandra   HBase   CAP Theorem Focus Availability, Partition-Tolerance Consistency, Availability Data Partitioning Supports ordered & random partitioning, random partitioning is recommended. Ordered Partitioning. Load balancing achieved through resharding. Distributed System P2P architecture (Amazon Dynamo) Master / Slave via HDFS, Zookeeper for coordination Administration & Maintenance Medium High Single Write Master No (R+W+1 to get Strong Consistency) Yes Multi-tenancy Yes Yes Secondary indexes Supports secondary indexes on CF where column name is known. Does not natively support secondary indexes. Consistency Tunable Consistency Strict consistency (Not ACID) Hot Spot Problem No, distributes load across nodes using random partition strategy. Yes, one node may handle most of the traffic due to ordered partition. Multi-Data Center Support and Disaster Recovery Asynchronous replication via WAN Asynchronous replication via WAN Single point of failure Ring topology, there is no single point of failure. Although there exists a concept of a master server, HBase itself does not depend on it heavily. HBase cluster can keep serving data even if the master goes down. Hadoop namenode is a single point of failure. Commercial vendors Datastax, Acunu Clodera, Hortonworks Cassandra Vs. HBase Big Data at Equinix
  • 10. Confidential – Š 2013 Equinix Inc. www.equinix.com 10 Why DSE Cassandra Big Data at Equinix Support for Analytics Integrated search using Solr Security features Cluster management capabilities Commercial support DataStax would probably list lots of more reasons, these are the reasons relevant to us.
  • 11. Confidential – Š 2013 Equinix Inc. www.equinix.com 11 Big Data at Equinix Grand Finale Hadoop Ecosystem vs. DataStax Enterprise SearchSearch SearchSearch AnalyticsAnalytics StorageStorageAnalyticsAnalytics StorageStorage StorageStorage Hadoop  Distributed  File  System (Storage/Analytics) NameNode Secondary  Name  Node Data  Nodes  (Storage) HBase  (Storage/Analytics) Hbase  Master Hbase  Region  Servers Hbase  Master Search Management   Services Cloudera  Manager Solr  Nodes Zookeeper Pros •  Scalability •  Cloud readiness •  Resource availability •  Industry momentum •  Product eco-system maturity •  Technical support Cons •  Infrastructure footprint •  Operational Complexity •  Learning curve •  Availability •  Total cost of ownership Pros •  Infrastructure footprint •  Operational ease •  Scalability •  Availability •  Cloud readiness •  Learning curve •  Resource availability •  Technical support •  Total cost of ownership Cons •  Industry momentum •  Product eco-system maturity ß Sold
  • 12. Confidential – Š 2013 Equinix Inc. www.equinix.com 12 Big Data at Equinix How far are we on our Big Data journey? ü  Pilot use-case from PoC to Production ü  Moved network statistics use case from RRD based solution to DSE Cassandra ü  Build in progress for §  power monitoring use cases §  data center monitoring §  network monitoring In-plans Ø  Recommendation engine on interconnection platform Ø  Use case analysis and technology selection for connected data sets Ø  Building data science capabilities for use cases requiring predictive modeling A few data points Physical bare metal boxes for DSE nodes Densely packed data nodes with 4TB storage on each node, 96GB RAM About ~250 million records a day Also used for log analysis for internal IT systems monitoring use-cases
  • 13. Confidential – Š 2013 Equinix Inc. www.equinix.com 13 Big Data at Equinix Experience so far Lack of standards based connectors / drivers DataStax has developed a Java Driver, but doesn’t support JDBC No data visualization tools to access from Cassandra for low-latency access No data access tools (Toad equivalent) available yet; DevCenter is not there yet We used Astyanax and are evaluating DataStax java driver built libraries to abstract Astyanax for application engineering teams built rest services for data access by applications Good reliability Not many instances of nodes being down Handled loads even when nodes were down
  • 14. Confidential – Š 2013 Equinix Inc. www.equinix.com 14 Big Data at Equinix Where do we go from here?? Graph databases Batch processing (Hadoop, Spark , MapReduce ??) Interactive queries Online data processing Data analytics Data science and machine learning Data visualization tools and applications Developer toolkits We are hiring Big Data Architect Big Data Engineers Data Scientists send resume at pkumar@equinix.com
  • 15. CONFIDENTIAL 15 Thank you! •  pkmr.work@gmail.com •  pkumar@equinix.com •  www.equinix.com
  • 17. Confidential – Š 2013 Equinix Inc. www.equinix.com 17 WHO IS EQUINIX?
  • 18. Confidential – Š 2013 Equinix Inc. www.equinix.com 18 GLOBAL DATA CENTERS 95+ Data Centers 9M+ Square Feet 99.999% Uptime Record INTERCONNECTION 950+ Networks 110,000+ Cross Connects BUSINESS ECOSYSTEMS Equinix Marketplace™ 4,000+ Businesses Revenue Opportunities MOVING TOWARDS THE FUTURE | PLATFORM Equinix: A Platform for Growth
  • 19. Solid. Powerful. Growing. $1.8B IN ANNUALIZED REVENUE MEMBER OF THE NASDAQ 100 $7B INVESTMENTS IN EXPANSION
  • 21. Confidential – Š 2013 Equinix Inc. www.equinix.com 21 HOW WE’RE DIFFERENT | GLOBAL FOOTPRINT Where You Are. Where You Need To Be.
  • 22. 90% PASS THROUGH EQUINIX DATA CENTERS OVER OF INTERNET ROUTES 950+NETWORK PROVIDERS
  • 24. CONFIDENTIAL 24 Thank you! •  pkmr.work@gmail.com •  pkumar@equinix.com •  www.equinix.com