Weitere ähnliche Inhalte
Ähnlich wie C* Summit EU 2013: Leveraging the Power of Cassandra: Operational Reporting and Interactive Analysis (20)
Mehr von DataStax Academy (20)
Kürzlich hochgeladen (20)
C* Summit EU 2013: Leveraging the Power of Cassandra: Operational Reporting and Interactive Analysis
- 1. Leveraging the Power of Cassandra:
Operational Reporting & Interactive
Analysis
Ernesto Ongaro
BI Consultant Jaspersoft
- 3. Requirements for Cassandra
reporting and analysis
People want access to the data in Cassandra
Most consumers of data are not technical
Traditional reporting and analytics tools don’t work with
Cassandra
Building reports from scratch is not easy or fun
Providing ad-hoc analytics is very complicated
©2013 Jaspersoft Corporation.
3
- 4. Current State of Reporting &
Analytics
Connectors are for RDBMS only
Expensive
Desktop
Standalone
©2013 Jaspersoft Corporation. Proprietary and Confidential
4
- 5. Advantages to using a reporting
and analysis framework
Build it yourself
Use a framework
Visual report designer
✖
✔
Security
✖
✔
Scheduling
✖
✔
Web access
✖
✔
API
✖
✔
Self-service queries
✖
✔
Charting libraries
✖
✔
Metadata layer
✖
✔
Input controls
✖
✔
Flexibility
✔
✔
©2013 Jaspersoft Corporation.
5
- 6. Architectural Approaches
Four methods to visualize your Cassandra data
① ETL Approach (Extract, Transform, Load)
② Direct access reports and dashboards
③ Direct access data exploration
④ 1-3 with Hadoop Hive
©2013 Jaspersoft Corporation. Proprietary and Confidential
6
- 7. 1 – ETL Approach
Most traditional
approach
Data is extracted via
batch method
Option with most
connectors
ETL process is most
robust option
BI Platform
SQL
ETL
RDBMS
©2013 Jaspersoft Corporation. Proprietary and Confidential
7
- 8. JaspersoftETL
Powered by
Over 450 connectors
Data quality, transformations, aggregations
©2013 Jaspersoft Corporation. Proprietary and Confidential
8
- 9. 2 – Direct Access Reports and
Dashboards
Reports are developed
using Jaspersoft Studio
(Eclipse based designer)
Lowest latency
Good supplement to ETL
when “near time” is
required
Connector based on
https://github.com/Netflix/
astyanax
©2013 Jaspersoft Corporation. Proprietary and Confidential
BI Platform
CQL3 Native
Connector
9
- 11. 3 – Direct Access Exploration
Allows users to
explore data (vs
pre-defined reports
+ dashboards)
Loads results of a
query into memory
where further
filtering, grouping
and agg. occurs
In Memory
OLAP Engine
BI Platform
CQL3 Native
Connector
©2013 Jaspersoft Corporation. Proprietary and Confidential
11
- 13. 4 – Hadoop Hive
Good for massive
data
Batch process
Native Hadoop
Hive connector as
well
BI Platform
HQL
SQL
ETL
RDBMS
©2013 Jaspersoft Corporation. Proprietary and Confidential
13
libhive
- 14. Demonstration
Demo flow:
•
•
•
Example Dashboard +
report
Jaspersoft Studio
Ad-hoc Exploration
Demo environment:
•
•
Jaspersoft 5.5 – runs on
Tomcat 7
DataStax Enterprise 3.1
(Cassandra 1.2.10.1)
©2013 Jaspersoft Corporation. Proprietary and Confidential
14
- 16. Conclusion
•
•
•
•
Four different ways to get insights from Cassandra
Commercial open-source software
Get started at http://jaspersoft.com
Thank you!
©2013 Jaspersoft Corporation. Proprietary and Confidential
16