We recently launched DataStax Enterprise 4.5 - the fastest, most scalable distributed database technology with blazing performance, 100x faster analytics and automated diagnostics.
Join DataStax’s product gurus Martin Van Ryswyk, EVP of Engineering, and Robin Schumacher, VP of Products, in an open dialog as they discuss the importance of:
- Selecting the right database technology for today’s digital world
- Integrated analytics for lightning fast customer interactions
- Merging operational and historical data for the most accurate insights possible
4. Agenda
• The Connected World Needs a Revolutionary New DBMS
• Introducing DataStax Enterprise 4.5
- Integrated Analytics
- New Performance Service
- Enhanced Visual Management
• Transactional Analytics Transforms Businesses
• Open dialog
7. What is Apache Cassandra?
[Diagram: C* (Cassandra) at San Francisco, Stockholm, and New York]
Open Source NoSQL Database
Industry Leading Performance
Predictable Scalability
Geographical Data Distribution
Operational Simplicity
Business Flexibility
8. What is DataStax Enterprise?
[Diagram: Security, Analytics, Search, Visual Monitoring, Management Services, In-Memory, Dev. IDE & Drivers, Professional Services, Support & Training]
Integrated analytics, OLTP, Search
In-memory analytics and OLTP
Built-in data protection
Production-ready Cassandra
Multi-workload, use case capable
Automated management services
9. Announcing DataStax Enterprise 4.5
1. New Analytics options for Cassandra
• Near real-time analytics on Cassandra data with Spark
• Seamless integration with Hadoop data warehouses/lakes
• Hive compatible SQL-like language
2. New Performance Service simplifies performance tuning
3. Enhanced OpsCenter supplies greater scale, built-in security, and easy
best-practice enforcement
11. Why Analytics in DataStax Enterprise?
• Modern online Web and Mobile applications have outgrown legacy RDBMS transactions
• Today’s OLTP is more of an “interaction” needing customization via analytics
• The end goal is to influence the next action a customer takes on a website or mobile app
12. Cassandra Analytics Powered by Spark
• Spark now integrated into DataStax Enterprise.
• Supplies much faster response times for analytics queries than
standard Hadoop.
• Provides near real-time analytics directly on Cassandra data.
• Includes Shark for ad hoc, Hive-style query use cases.
• Fully supports workload isolation.
13. Cassandra Analytics Powered by Spark
[Bar chart: query time in seconds (axis 0 to 60), Hadoop vs. Spark, for the Extreme and Simple cases]
Extreme: data not in memory and fetched from multiple nodes
Simple: data set fits in memory
14. Cassandra Analytics Powered by Spark
• Open Sourced
- Spark->Cassandra connectivity layer
- Datatype mappings
- Performance Optimizations
• Only in DataStax Enterprise
- Certified Spark + Cassandra
- Visual cluster creation and management
- Built-in analytics high availability
15. Integration with Hadoop Platforms
• Easily integrates operational data stored in Cassandra with historical
data stored in Hadoop data warehouses/lakes.
• Join Cassandra tables with Hadoop objects (e.g., a Hive table) in the
same query.
• Transfer data back to the Hadoop deployment if desired.
16. Performance Service
• New performance diagnostic data dictionary.
• CQL-based.
• Thirty tables that contain everything needed to troubleshoot
performance.
• Includes detection of the worst-performing queries on a cluster.
17. OpsCenter 5.0
• Scalable – Certified to support 1,000-node database clusters (tested on AWS and Google Compute Engine)
• Secure – New built-in security controls for managing clusters and tasks
• Smarter – Improved visual interface for enforcing best practices
Available end of July.
21. Get Started Today
• Download DataStax Enterprise from
http://www.datastax.com/download.
• Try our new all-in-one smart installer!
• Online documentation available at:
http://www.datastax.com/docs.