SlideShare a Scribd company logo
1 of 47
MAY 21 - 23, 2019
Gaylord National Resort & Convention Center Maryland
Implementation of a Big Data
Architecture for Real-Time Analytics
with DataStax Enterprise Graph,
Analytics and Search
Joseph Arriola
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
About me…
11+ 3 30+ 5 15+
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Agenda
● Challenges
● Context of Architectures of Business
Intelligence
● What does the Big Data world offer?
● Tools based on the use case
● What did DataStax do?
● Where to start with DSE?
● Use Case - Building it
CHALLENGES
Context of Architectures of Business Intelligence
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Challenges
● Create an architecture Real-Time Analytics
● Without degrade the transactional service of legacy
● Scalability: Process and storage
● Allow connection with enterprise tools of Business
Intelligences
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Old Architecture
Data BaseApplications Analysis
Read & Write Read
Real Time
Analytics
Without
degrade
Scalability Allow connection
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Business Intelligence Architecture
Data
Warehouse
Data Base
Legacy
Systems
Data Source
ETL Cube
Process Data Model & Storage OLAP Data Presentation
Dashboard
Reports
Ad-hoc Queries
1 Day of Delay
Real Time
Analytics
Without
degrade
Scalability Allow connection
BIG DATA WORLD
Tools based on the use case
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
What does the Big Data world offer?
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Tools based on the use case
Batch Layer
Speed Layer
Serving
Layer
PipelineReal Time
Events
Lambda Architecture
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Tools based on the use case
Serving
Layer
Batch
Layer
Speed
Layer
Serving
Layer
Serving
Layer
Batch
Layer
Speed
Layer
Batch
Layer
Speed
Layer
Pipeline
Speed
Layer
DATASTAX ENTERPRISE
Tools based on the use case
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
What did DataStax do?
Apache Cassandra, Spark, Lucene, Solr, TinkerPop ® Apache Software Foundation
DSE OpsCenter
DataStax Studio / Drivers
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Where to start with DSE?
● Recap Cassandra Topology
● Data replication
● Type of workload Datacenter
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Recap Cassandra Topology
DC1 DC2
Cluster
● Node: A single instance
● Datacenter: A logical grouping of nodes
● Cluster: A logical grouping of data centers
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Data replication
● Replication automatically handled
● SimpleStrategy
● NetworkTopologyStrategy
80
Client
CREATE KEYSPACE keyspace_name
WITH REPLICATION = { 'class' : 'SimpleStrategy',
'replication_factor' : 3 };
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Data replication
● Replication automatically handled
● SimpleStrategy
● NetworkTopologyStrategy
CREATE KEYSPACE keyspace_name
WITH REPLICATION = {'class' : 'NetworkTopologyStrategy',
'DC1' : 3, 'DC2’ : 3 };
Client
Client
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Type of workload Datacenter
● Transactional
CQL
SELECT id, artist_name FROM music.solr WHERE id = 123 LIMIT 10
● Restriction of Partition Key
● There is no
−GROUP BY – Sum, avg, etc…
−JOINS
−LIKE ‘%%’
https://docs.datastax.com/en/dse/6.7/cql/cql/cqlAbout.html
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Type of workload Datacenter
● Transactional
● DSE Analytics
Spark
Cassandra
Connector
val result = sqlContext.sql("SELECT category, count(1) FROM
demo GROUP BY category ")
https://spark.apache.org/
https://docs.datastax.com/en/dse/6.7/dse-dev/datastax_enterprise/analytics/analyticsTOC.html
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Type of workload Datacenter
● Transactional
● DSE Analytics
https://spark.apache.org/docs/latest/streaming-programming-guide.html
https://docs.datastax.com/en/dse/6.7/dse-dev/datastax_enterprise/spark/sparkStreamingIntro.html
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Type of workload Datacenter
● Transactional
● DSE Analytics
● DSE Search
SELECT id, artist_name FROM music.solr WHERE solr_query =
'artist_name:Miles*' LIMIT 10
CQL + Solr_query
https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/search/searchAbout.html
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Type of workload Datacenter
● Transactional
● DSE Analytics
● DSE Search
● DSE SearchAnalytics
val result = sqlContext.sql("SELECT artist_country, sum(1) FROM
music.solr
GROUP BY artist_country
WHERE solr_query = 'artist_name:A*’ ")
https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/analytics/dseSearchAnalyticsOverview.html
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Type of workload Datacenter
● Transactional
● DSE Analytics
● DSE Search
● DSE SearchAnalytics
● DSE Graph
https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/graph/graphTOC.html
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
DataStax Enterprise Graph Analytics
https://academy.datastax.com/resources/ds332
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
DataStax Enterprise Graph Analytics
https://academy.datastax.com/resources/ds332
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
DataStax Enterprise Graph Analytics
https://academy.datastax.com/resources/ds332
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
DataStax Enterprise Graph Analytics
https://academy.datastax.com/resources/ds332
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
DataStax Enterprise Graph Analytics
https://academy.datastax.com/resources/ds332
1
1
2
2
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Type of workload Datacenter
● Transactional
● DSE Analytics
● DSE Search
● DSE SearchAnalytics
● DSE Graph
/etc/default/dse
GRAPH_ENABLED=1
SPARK_ENABLED=1
SOLR_ENABLED=1
https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/production/initializeDCPerType.html
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
DSE - AlwaysOn SQL
● Is a high availability service that responds to SQL queries
from JDBC and ODBC applications.
● It is built on top of the Spark SQL Thriftserver.
● DSE Graph data is also available via SQL in the form of
vertex tables and edge tables, just like in DseGraphFrames.
https://www.datastax.com/2018/05/introducing-alwayson-sql-for-dse-analytics
https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/spark/alwaysOnSql.html?#alwaysOnSql__alwaysOnSqlEnabling
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
DSE - AlwaysOn SQL
● What do we need to have?
−A running datacenter with DSE Analytics nodes enabled.
−Setup and Enabled AlwaysOn SQL in the alwayson_sql_options
section in dse.yaml.
−Start the Service.
https://www.datastax.com/2018/05/introducing-alwayson-sql-for-dse-analytics
https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/spark/alwaysOnSql.html?#alwaysOnSql__alwaysOnSqlEnabling
USE CASE
Building it
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Use Case
● A telco company needs to implement a Big Data Real Time
analytics architecture in order to monitor effective SMS
campaigns.
● It is important to have a real-time dashboard and available
connections to perform custom analyzes.
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Proposed architecture – Transactional
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Proposed architecture – Transactional
Central EDR
https://streamsets.com/
https://streamsets.com/documentation/datacollector/latest/help/index.html
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Proposed architecture
Analytics + Search + Graph
ASG
DataStax Studio 2.0
CREATE KEYSPACE sms_campaigns
WITH REPLICATION = {'class' : 'NetworkTopologyStrategy', ‘DC_T' : 3, ‘DC_ANLTCS’ : 3 };
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Proposed architecture
Analytics + Search + Graph
https://docs.datastax.com/en/dse/5.1/dse-dev/datastax_enterprise/studio/installStudio.html
https://www.datastax.com/2017/04/announcing-datastax-studio-2-0-a-powerful-developer-environment-for-datastax-enterprise
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Proposed architecture
Analytics + Search + Graph
https://docs.datastax.com/en/dse/5.1/dse-dev/datastax_enterprise/studio/installStudio.html
DataStax Studio 2.0
ASG
AlwaysOn SQL
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Proposed architecture - Real Time Analytics
ASG
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Proposed architecture - Real Time Analytics
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Proposed architecture - Real Time Analytics
ASG
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Proposed architecture - Real Time Analytics
Real-Time Card
Real-Time and Historic Data Analysis
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Proposed architecture - Real Time Analytics
https://www.pubnub.com/tutorials/microsoft-power-bi/streaming-business-data-to-dashboards/
© DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved.
Whole Architecture
ASG
AlwaysOn SQL
BI Tools
AD-HOC Queries
Data exploration
Real-Time Data
DseGraphFrames
Gremlin Query
Real Time
Analytics
Without
degrade
Scalability Allow connection
© DataStax, All Rights Reserved.Confidential
Q&A
Joseph Arriola
jcarriolaa@gmail.com
jcarriola@solcomp.com
https://www.linkedin.com/in/jcarriolaa/
THANK YOU

More Related Content

What's hot

IDERA Live | Have No Fear the DBA is Here: Protecting Data Resources
IDERA Live | Have No Fear the DBA is Here: Protecting Data ResourcesIDERA Live | Have No Fear the DBA is Here: Protecting Data Resources
IDERA Live | Have No Fear the DBA is Here: Protecting Data Resources
IDERA Software
 
Real Time Business Intelligence with Cassandra, Kafka and Hadoop - A Real Sto...
Real Time Business Intelligence with Cassandra, Kafka and Hadoop - A Real Sto...Real Time Business Intelligence with Cassandra, Kafka and Hadoop - A Real Sto...
Real Time Business Intelligence with Cassandra, Kafka and Hadoop - A Real Sto...
DataStax
 

What's hot (20)

Using Event-Driven Architectures with Cassandra
Using Event-Driven Architectures with CassandraUsing Event-Driven Architectures with Cassandra
Using Event-Driven Architectures with Cassandra
 
IDERA Live | Have No Fear the DBA is Here: Protecting Data Resources
IDERA Live | Have No Fear the DBA is Here: Protecting Data ResourcesIDERA Live | Have No Fear the DBA is Here: Protecting Data Resources
IDERA Live | Have No Fear the DBA is Here: Protecting Data Resources
 
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
Webinar  |  How to Understand Apache Cassandra™ Performance Through Read/Writ...Webinar  |  How to Understand Apache Cassandra™ Performance Through Read/Writ...
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
 
Using Approximate Data for Small, Insightful Analytics (Ben Kornmeier, Protec...
Using Approximate Data for Small, Insightful Analytics (Ben Kornmeier, Protec...Using Approximate Data for Small, Insightful Analytics (Ben Kornmeier, Protec...
Using Approximate Data for Small, Insightful Analytics (Ben Kornmeier, Protec...
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
 
GPS Insight on Using Presto with Scylla for Data Analytics and Data Archival
GPS Insight on Using Presto with Scylla for Data Analytics and Data ArchivalGPS Insight on Using Presto with Scylla for Data Analytics and Data Archival
GPS Insight on Using Presto with Scylla for Data Analytics and Data Archival
 
Azure Data Lake and Azure Data Lake Analytics
Azure Data Lake and Azure Data Lake AnalyticsAzure Data Lake and Azure Data Lake Analytics
Azure Data Lake and Azure Data Lake Analytics
 
Data stax webinar cassandra and titandb insights into datastax graph strategy...
Data stax webinar cassandra and titandb insights into datastax graph strategy...Data stax webinar cassandra and titandb insights into datastax graph strategy...
Data stax webinar cassandra and titandb insights into datastax graph strategy...
 
Real Time Business Intelligence with Cassandra, Kafka and Hadoop - A Real Sto...
Real Time Business Intelligence with Cassandra, Kafka and Hadoop - A Real Sto...Real Time Business Intelligence with Cassandra, Kafka and Hadoop - A Real Sto...
Real Time Business Intelligence with Cassandra, Kafka and Hadoop - A Real Sto...
 
Why data warehouses cannot support hot analytics
Why data warehouses cannot support hot analyticsWhy data warehouses cannot support hot analytics
Why data warehouses cannot support hot analytics
 
Data Privacy with Apache Spark: Defensive and Offensive Approaches
Data Privacy with Apache Spark: Defensive and Offensive ApproachesData Privacy with Apache Spark: Defensive and Offensive Approaches
Data Privacy with Apache Spark: Defensive and Offensive Approaches
 
Changing the game with cloud dw
Changing the game with cloud dwChanging the game with cloud dw
Changing the game with cloud dw
 
Deletes Without Tombstones or TTLs (Eric Stevens, ProtectWise) | Cassandra Su...
Deletes Without Tombstones or TTLs (Eric Stevens, ProtectWise) | Cassandra Su...Deletes Without Tombstones or TTLs (Eric Stevens, ProtectWise) | Cassandra Su...
Deletes Without Tombstones or TTLs (Eric Stevens, ProtectWise) | Cassandra Su...
 
Hello DataStax Enterprise Graph
Hello DataStax Enterprise Graph Hello DataStax Enterprise Graph
Hello DataStax Enterprise Graph
 
Unleash the power of Azure Data Factory
Unleash the power of Azure Data Factory Unleash the power of Azure Data Factory
Unleash the power of Azure Data Factory
 
Elastic Data Warehousing
Elastic Data WarehousingElastic Data Warehousing
Elastic Data Warehousing
 
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODSAgile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
 
Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)
 
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your DataCloudera Breakfast Series, Analytics Part 1: Use All Your Data
Cloudera Breakfast Series, Analytics Part 1: Use All Your Data
 
Analyzing Semi-Structured Data At Volume In The Cloud
Analyzing Semi-Structured Data At Volume In The CloudAnalyzing Semi-Structured Data At Volume In The Cloud
Analyzing Semi-Structured Data At Volume In The Cloud
 

Similar to implementation of a big data architecture for real-time analytics with data stax enterprise graph, analytics and search

IDERA Live | The Ever Growing Science of Database Migrations
IDERA Live | The Ever Growing Science of Database MigrationsIDERA Live | The Ever Growing Science of Database Migrations
IDERA Live | The Ever Growing Science of Database Migrations
IDERA Software
 
IndyCodeCamp SDS May 16th 2009
IndyCodeCamp SDS May 16th 2009IndyCodeCamp SDS May 16th 2009
IndyCodeCamp SDS May 16th 2009
Aaron King
 

Similar to implementation of a big data architecture for real-time analytics with data stax enterprise graph, analytics and search (20)

DataStax 6 and Beyond
DataStax 6 and BeyondDataStax 6 and Beyond
DataStax 6 and Beyond
 
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life EasierWebinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
 
Live traffic capture and replay in cassandra 4.0
Live traffic capture and replay in cassandra 4.0Live traffic capture and replay in cassandra 4.0
Live traffic capture and replay in cassandra 4.0
 
Oracle Database Cloud Service - Provisioning Your First DBaaS Instance
Oracle Database Cloud Service - Provisioning Your First DBaaS InstanceOracle Database Cloud Service - Provisioning Your First DBaaS Instance
Oracle Database Cloud Service - Provisioning Your First DBaaS Instance
 
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
Top 10 Best Practices for Apache Cassandra and DataStax EnterpriseTop 10 Best Practices for Apache Cassandra and DataStax Enterprise
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
 
Project Casquatch: An Open Source Java Abstraction Framework for Cassandra Da...
Project Casquatch: An Open Source Java Abstraction Framework for Cassandra Da...Project Casquatch: An Open Source Java Abstraction Framework for Cassandra Da...
Project Casquatch: An Open Source Java Abstraction Framework for Cassandra Da...
 
Webinar: The Performance Challenge: Providing an Amazing Customer Experience ...
Webinar: The Performance Challenge: Providing an Amazing Customer Experience ...Webinar: The Performance Challenge: Providing an Amazing Customer Experience ...
Webinar: The Performance Challenge: Providing an Amazing Customer Experience ...
 
Slides: Relational to NoSQL Migration
Slides: Relational to NoSQL MigrationSlides: Relational to NoSQL Migration
Slides: Relational to NoSQL Migration
 
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
 
Introducing DataStax Enterprise 4.7
Introducing DataStax Enterprise 4.7Introducing DataStax Enterprise 4.7
Introducing DataStax Enterprise 4.7
 
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
 
Five Lessons in Distributed Databases
Five Lessons  in Distributed DatabasesFive Lessons  in Distributed Databases
Five Lessons in Distributed Databases
 
Data day texas: Cassandra and the Cloud
Data day texas: Cassandra and the CloudData day texas: Cassandra and the Cloud
Data day texas: Cassandra and the Cloud
 
Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...
Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...
Powerful Spatial Features You Never Knew Existed in Oracle Spatial and Graph ...
 
IDERA Live | The Ever Growing Science of Database Migrations
IDERA Live | The Ever Growing Science of Database MigrationsIDERA Live | The Ever Growing Science of Database Migrations
IDERA Live | The Ever Growing Science of Database Migrations
 
How to get Real-Time Value from your IoT Data - Datastax
How to get Real-Time Value from your IoT Data - DatastaxHow to get Real-Time Value from your IoT Data - Datastax
How to get Real-Time Value from your IoT Data - Datastax
 
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
 
IndyCodeCamp SDS May 16th 2009
IndyCodeCamp SDS May 16th 2009IndyCodeCamp SDS May 16th 2009
IndyCodeCamp SDS May 16th 2009
 
Apache Cassandra Lunch #75: Getting Started with DataStax Enterprise on Docker
Apache Cassandra Lunch #75: Getting Started with DataStax Enterprise on DockerApache Cassandra Lunch #75: Getting Started with DataStax Enterprise on Docker
Apache Cassandra Lunch #75: Getting Started with DataStax Enterprise on Docker
 
Developing Enterprise Consciousness: Building Modern Open Data Platforms
Developing Enterprise Consciousness: Building Modern Open Data PlatformsDeveloping Enterprise Consciousness: Building Modern Open Data Platforms
Developing Enterprise Consciousness: Building Modern Open Data Platforms
 

Recently uploaded

Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 

Recently uploaded (20)

Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 

implementation of a big data architecture for real-time analytics with data stax enterprise graph, analytics and search

  • 1. MAY 21 - 23, 2019 Gaylord National Resort & Convention Center Maryland
  • 2. Implementation of a Big Data Architecture for Real-Time Analytics with DataStax Enterprise Graph, Analytics and Search Joseph Arriola
  • 3. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. About me… 11+ 3 30+ 5 15+
  • 4. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Agenda ● Challenges ● Context of Architectures of Business Intelligence ● What does the Big Data world offer? ● Tools based on the use case ● What did DataStax do? ● Where to start with DSE? ● Use Case - Building it
  • 5. CHALLENGES Context of Architectures of Business Intelligence
  • 6. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Challenges ● Create an architecture Real-Time Analytics ● Without degrade the transactional service of legacy ● Scalability: Process and storage ● Allow connection with enterprise tools of Business Intelligences
  • 7. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Old Architecture Data BaseApplications Analysis Read & Write Read Real Time Analytics Without degrade Scalability Allow connection
  • 8. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Business Intelligence Architecture Data Warehouse Data Base Legacy Systems Data Source ETL Cube Process Data Model & Storage OLAP Data Presentation Dashboard Reports Ad-hoc Queries 1 Day of Delay Real Time Analytics Without degrade Scalability Allow connection
  • 9. BIG DATA WORLD Tools based on the use case
  • 10. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. What does the Big Data world offer?
  • 11. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Tools based on the use case Batch Layer Speed Layer Serving Layer PipelineReal Time Events Lambda Architecture
  • 12. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Tools based on the use case Serving Layer Batch Layer Speed Layer Serving Layer Serving Layer Batch Layer Speed Layer Batch Layer Speed Layer Pipeline Speed Layer
  • 14. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. What did DataStax do? Apache Cassandra, Spark, Lucene, Solr, TinkerPop ® Apache Software Foundation DSE OpsCenter DataStax Studio / Drivers
  • 15. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Where to start with DSE? ● Recap Cassandra Topology ● Data replication ● Type of workload Datacenter
  • 16. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Recap Cassandra Topology DC1 DC2 Cluster ● Node: A single instance ● Datacenter: A logical grouping of nodes ● Cluster: A logical grouping of data centers
  • 17. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Data replication ● Replication automatically handled ● SimpleStrategy ● NetworkTopologyStrategy 80 Client CREATE KEYSPACE keyspace_name WITH REPLICATION = { 'class' : 'SimpleStrategy', 'replication_factor' : 3 };
  • 18. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Data replication ● Replication automatically handled ● SimpleStrategy ● NetworkTopologyStrategy CREATE KEYSPACE keyspace_name WITH REPLICATION = {'class' : 'NetworkTopologyStrategy', 'DC1' : 3, 'DC2’ : 3 }; Client Client
  • 19. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Type of workload Datacenter ● Transactional CQL SELECT id, artist_name FROM music.solr WHERE id = 123 LIMIT 10 ● Restriction of Partition Key ● There is no −GROUP BY – Sum, avg, etc… −JOINS −LIKE ‘%%’ https://docs.datastax.com/en/dse/6.7/cql/cql/cqlAbout.html
  • 20. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Type of workload Datacenter ● Transactional ● DSE Analytics Spark Cassandra Connector val result = sqlContext.sql("SELECT category, count(1) FROM demo GROUP BY category ") https://spark.apache.org/ https://docs.datastax.com/en/dse/6.7/dse-dev/datastax_enterprise/analytics/analyticsTOC.html
  • 21. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Type of workload Datacenter ● Transactional ● DSE Analytics https://spark.apache.org/docs/latest/streaming-programming-guide.html https://docs.datastax.com/en/dse/6.7/dse-dev/datastax_enterprise/spark/sparkStreamingIntro.html
  • 22. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Type of workload Datacenter ● Transactional ● DSE Analytics ● DSE Search SELECT id, artist_name FROM music.solr WHERE solr_query = 'artist_name:Miles*' LIMIT 10 CQL + Solr_query https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/search/searchAbout.html
  • 23. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Type of workload Datacenter ● Transactional ● DSE Analytics ● DSE Search ● DSE SearchAnalytics val result = sqlContext.sql("SELECT artist_country, sum(1) FROM music.solr GROUP BY artist_country WHERE solr_query = 'artist_name:A*’ ") https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/analytics/dseSearchAnalyticsOverview.html
  • 24. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Type of workload Datacenter ● Transactional ● DSE Analytics ● DSE Search ● DSE SearchAnalytics ● DSE Graph https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/graph/graphTOC.html
  • 25. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. DataStax Enterprise Graph Analytics https://academy.datastax.com/resources/ds332
  • 26. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. DataStax Enterprise Graph Analytics https://academy.datastax.com/resources/ds332
  • 27. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. DataStax Enterprise Graph Analytics https://academy.datastax.com/resources/ds332
  • 28. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. DataStax Enterprise Graph Analytics https://academy.datastax.com/resources/ds332
  • 29. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. DataStax Enterprise Graph Analytics https://academy.datastax.com/resources/ds332 1 1 2 2
  • 30. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Type of workload Datacenter ● Transactional ● DSE Analytics ● DSE Search ● DSE SearchAnalytics ● DSE Graph /etc/default/dse GRAPH_ENABLED=1 SPARK_ENABLED=1 SOLR_ENABLED=1 https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/production/initializeDCPerType.html
  • 31. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. DSE - AlwaysOn SQL ● Is a high availability service that responds to SQL queries from JDBC and ODBC applications. ● It is built on top of the Spark SQL Thriftserver. ● DSE Graph data is also available via SQL in the form of vertex tables and edge tables, just like in DseGraphFrames. https://www.datastax.com/2018/05/introducing-alwayson-sql-for-dse-analytics https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/spark/alwaysOnSql.html?#alwaysOnSql__alwaysOnSqlEnabling
  • 32. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. DSE - AlwaysOn SQL ● What do we need to have? −A running datacenter with DSE Analytics nodes enabled. −Setup and Enabled AlwaysOn SQL in the alwayson_sql_options section in dse.yaml. −Start the Service. https://www.datastax.com/2018/05/introducing-alwayson-sql-for-dse-analytics https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/spark/alwaysOnSql.html?#alwaysOnSql__alwaysOnSqlEnabling
  • 34. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Use Case ● A telco company needs to implement a Big Data Real Time analytics architecture in order to monitor effective SMS campaigns. ● It is important to have a real-time dashboard and available connections to perform custom analyzes.
  • 35. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Proposed architecture – Transactional
  • 36. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Proposed architecture – Transactional Central EDR https://streamsets.com/ https://streamsets.com/documentation/datacollector/latest/help/index.html
  • 37. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Proposed architecture Analytics + Search + Graph ASG DataStax Studio 2.0 CREATE KEYSPACE sms_campaigns WITH REPLICATION = {'class' : 'NetworkTopologyStrategy', ‘DC_T' : 3, ‘DC_ANLTCS’ : 3 };
  • 38. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Proposed architecture Analytics + Search + Graph https://docs.datastax.com/en/dse/5.1/dse-dev/datastax_enterprise/studio/installStudio.html https://www.datastax.com/2017/04/announcing-datastax-studio-2-0-a-powerful-developer-environment-for-datastax-enterprise
  • 39. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Proposed architecture Analytics + Search + Graph https://docs.datastax.com/en/dse/5.1/dse-dev/datastax_enterprise/studio/installStudio.html DataStax Studio 2.0 ASG AlwaysOn SQL
  • 40. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Proposed architecture - Real Time Analytics ASG
  • 41. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Proposed architecture - Real Time Analytics
  • 42. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Proposed architecture - Real Time Analytics ASG
  • 43. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Proposed architecture - Real Time Analytics Real-Time Card Real-Time and Historic Data Analysis
  • 44. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Proposed architecture - Real Time Analytics https://www.pubnub.com/tutorials/microsoft-power-bi/streaming-business-data-to-dashboards/
  • 45. © DataStax, All Rights Reserved.ConfidentialConfidential © DataStax, All Rights Reserved. Whole Architecture ASG AlwaysOn SQL BI Tools AD-HOC Queries Data exploration Real-Time Data DseGraphFrames Gremlin Query Real Time Analytics Without degrade Scalability Allow connection
  • 46. © DataStax, All Rights Reserved.Confidential Q&A Joseph Arriola jcarriolaa@gmail.com jcarriola@solcomp.com https://www.linkedin.com/in/jcarriolaa/

Editor's Notes

  1. I have been working in IT for around 11 years, in different industries such as Public Sector, Bank, Reteilers and telco company. I started as developer en java, C++, C#. and then in data werouse. that´s allowed to me start in the world of big data. I am an Information System Engineering, I studied a master's degree in business intelligece and also in philosophy. In order to prepare myself in the world of big data, I have studied different online certifications including "Apache Cassandra professional" by DataStax. I have been working around 5 years in the digital transformation and big data. implementing projects, providing consultancies and trannings. I am the founder of the Big Data Guatemala community, with the purpose of making known different technologies. in my speaker profile there are already around 15 conferences in different countries such as Guatemala, El Salvador, Mexico and for the first time in the USA.
  2. https://www.datastax.com/2018/05/introducing-alwayson-sql-for-dse-analytics https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/spark/alwaysOnSql.html?#alwaysOnSql__alwaysOnSqlEnabling
  3. https://www.datastax.com/2018/05/introducing-alwayson-sql-for-dse-analytics https://docs.datastax.com/en/dse/6.7/dse-admin/datastax_enterprise/spark/alwaysOnSql.html?#alwaysOnSql__alwaysOnSqlEnabling