TIBCO Jaspersoft® for AWS is a business intelligence suite that helps you deliver stunning interactive reports and dashboards inside your app that make it easy for your customers to get answers. Purpose-built for AWS, our reporting and analytics server quickly and easily connects to Amazon Relational Database Service (RDS), Amazon Redshift, and Amazon EMR. It includes ad-hoc reporting, dashboards, data analysis, data visualization, and data blending. In less than 10 minutes, you can be analyzing and reporting on your data. You get a full Cloud BI server starting at less than $1/hour, with no user or data limits and no additional fees.
This webinar deck shows how embeddable analytics with TIBCO Jaspersoft for AWS gives you the power to create the experience your end users demand and how to scale and manage that experience across your customer base with AWS.
2. Today’s Presenters
David Potes, Partner Solutions Architect, Big Data, AWS
Mariano Luna, Sr. Manager, Technical Alliances & Cloud, TIBCO Software
Patrick Brown, VP of Digital Marketing, Waggle
Raj Chary, VP Technology/Architecture, Waggle
3. Relational data warehouse
Massively parallel; petabyte scale
Fully managed
HDD and SSD platforms
$1,000/TB/year; starts at $0.25/hour
Amazon
Redshift
a lot faster
a lot simpler
a lot cheaper
4. The Amazon Redshift view of data warehousing
10x cheaper
Easy to provision
Higher DBA productivity
10x faster
No programming
Easily leverage BI tools,
Hadoop, machine learning,
streaming
Analysis inline with process
flows
Pay as you go, grow as you
need
Managed availability and
disaster recovery
Enterprise Big data SaaS
5. Amazon Redshift architecture
Leader node
Simple SQL endpoint
Stores metadata
Optimizes query plan
Coordinates query execution
Compute nodes
Local columnar storage
Parallel/distributed execution of all queries, loads,
backups, restores, resizes
Start at just $0.25/hour, grow to 2 PB
(compressed)
DC1: SSD; scale from 160 GB to 326 TB
DS2: HDD; scale from 2 TB to 2 PB
Ingestion/backup
Backup
Restore
JDBC/ODBC
10 GigE
(HPC)
7. Benefit #1: Amazon Redshift is fast
Parallel and distributed
Query
Load
Export
Backup
Restore
Resize
8. Benefit #1: Amazon Redshift is fast
Hardware optimized for I/O intensive workloads, 4 GB/sec/node
Enhanced networking, over 1 million packets/sec/node
Choice of storage type, instance size
Regular cadence of autopatched improvements
9. Benefit #2: Amazon Redshift is inexpensive
Ds2 (HDD)
Price per hour for
DW1.XL single node
Effective annual
price per TB compressed
On demand $ 0.850 $ 3,725
1-year reservation $ 0.500 $ 2,190
3-year reservation $ 0.228 $ 999
Dc1 (SSD)
Price per hour for
DW2.L single node
Effective annual
price per TB compressed
On demand $ 0.250 $ 13,690
1-year reservation $ 0.161 $ 8,795
3-year reservation $ 0.100 $ 5,500
Pricing is simple
Number of nodes x price/hour
No charge for leader node
No upfront costs
Pay as you go
10. Benefit #3: Amazon Redshift is fully managed
Continuous/incremental backups
Multiple copies within cluster
Continuous and incremental backups
to Amazon S3
Continuous and incremental backups
across regions
Streaming restore
Amazon S3
Amazon S3
Region 1
Region 2
11. Benefit #4: Security is built in
• Load encrypted from Amazon S3
• SSL to secure data in transit
• ECDHE perfect forward security
• Amazon VPC for network isolation
• Encryption to secure data at rest
– All blocks on disks and in Amazon S3 encrypted
– Block key, cluster key, master key (AES-256)
– On-premises HSM and AWS CloudHSM support
• Audit logging and AWS CloudTrail integration
• SOC 1/2/3, PCI-DSS, FedRAMP, BAA
10 GigE
(HPC)
Ingestion
Backup
Restore
Customer VPC
Internal
VPC
JDBC/ODBC
12. Benefit #5: We innovate quickly
Well over 125 new features added since launch
Release every two weeks
Automatic patching
Service Launch (2/14)
PDX (4/2)
Temp Credentials (4/11)
DUB (4/25)
SOC1/2/3 (5/8)
Unload Encrypted Files
NRT (6/5)
JDBC Fetch Size (6/27)
Unload logs (7/5)
SHA1 Builtin (7/15)
4 byte UTF-8 (7/18)
Sharing snapshots (7/18)
Statement Timeout (7/22)
Timezone, Epoch, Autoformat (7/25)
WLM Timeout/Wildcards (8/1)
CRC32 Builtin, CSV, Restore Progress
(8/9)
Resource Level IAM (8/9)
PCI (8/22)
UTF-8 Substitution (8/29)
JSON, Regex, Cursors (9/10)
Split_part, Audit tables (10/3)
SIN/SYD (10/8)
HSM Support (11/11)
Kinesis EMR/HDFS/SSH copy,
Distributed Tables, Audit
Logging/CloudTrail, Concurrency, Resize
Perf., Approximate Count Distinct, SNS
Alerts, Cross Region Backup (11/13)
Distributed Tables, Single Node Cursor
Support, Maximum Connections to 500
(12/13)
EIP Support for VPC Clusters (12/28)
New query monitoring system tables and
diststyle all (1/13)
Redshift on DW2 (SSD) Nodes (1/23)
Compression for COPY from SSH, Fetch
size support for single node clusters, new
system tables with commit stats,
row_number(), strotol() and query
termination (2/13)
Resize progress indicator & Cluster
Version (3/21)
Regex_Substr, COPY from JSON (3/25)
50 slots, COPY from EMR, ECDHE
ciphers (4/22)
3 new regex features, Unload to single
file, FedRAMP(5/6)
Rename Cluster (6/2)
Copy from multiple regions,
percentile_cont, percentile_disc (6/30)
Free Trial (7/1)
pg_last_unload_count (9/15)
AES-128 S3 encryption (9/29)
UTF-16 support (9/29)
13. Benefit #6: Amazon Redshift is powerful
• Approximate functions
• User-defined functions
• Machine learning
• Data science
15. Hello, it’s me
Mariano Luna
Sr. Manager, Technical Alliances & Cloud
TIBCO Software (Houston, TX)
16. Do you feel like your users are always looking for more?
17. The Rise of Modern Applications
The Old Way Modern Applications
18. The Rise of Modern Applications
Your app
Analytics
Knowledge
worker
The Old Way Modern Applications
19. The Rise of Modern Applications
Your app
Analytics
Put answers
in context
Increase
adoption of BI
Give consumers
actionable data
Embedded Analytics
Knowledge
worker
The Old Way Modern Applications
21. Jaspersoft Overview
An embeddable analytics platform
designed for software companies
Jaspersoft Studio
Desktop report designer
for JasperReports
JasperReports Server
Powerful BI platform and server
Reports Dashboards Self-service
Jaspersoft ETL
Data integration for improved
reporting & analysis
JasperReports
World’s most popular Java
reporting library
22. From Data to Delivery
Connect to & manage
your platform
Server
Connect to
your data
Data Tier
Inside any app
or process
Delivery
Your app
Output
Create beautiful
reports & dashboards
23. Why Jaspersoft for AWS
OOTB multi-tenant
support
Built to 100% modern web
standards
RESTful web
service APIs
Visualize.js for
advanced
embedding
JavaScript/
HTML5 UI
10 minutes
to deploy
Autodetect for:
• RDS
• Redshift
• EMR
Starts at less
than $1/hour
Pay-as-you-go with
no user or data limits
Save with discounted
annual subscriptions
Autoscaling
clusters
Infinite elastic
scalability
Multi-AZ
enterprise
deployments
24. Patrick Brown - Vice President of Marketing
Raj Chary - Vice President of Technology/Architecture
25. Smart, responsive practice
Math and ELA (Grades 2-8)
Provides students the right
challenge at the right time
What is Waggle?
26. Right Challenge, Right
Time Waggle looks for more
than correct answers.
Waggle continually analyzes
each student’s decisions and
progress. That way, students
get tougher material right
when they’re ready.
What is Waggle?
27. Productive Struggle
Waggle motivates students to
push themselves forward.
How? Through helpful hints,
supportive feedback, and
achievement badges that
build grit and confidence.
What is Waggle?
28. Constructive Grouping
Waggle’s insights means you
can easily group students
together based on learning
needs. All without sacrificing
the quality of individual
instruction.
What is Waggle?
29. Waggle: Product Demo
• Data Creators
Differentiated learning experience
Fun and engaging
• Data Visualizers
Seamless integration with application
Analytics with a Story
Actionable Data
With integration deep dives into TIBCO Jaspersoft (Visualize.js) and AWS services (Redshift)
31. How did we build Waggle?
• Data Modelling
Lens-based model (distribution
keys and sort keys)
Ask and validate
Performance Efficiency
(compression, load/unload,
vacuum/analyze, in-memory
processing, WLM)
32. How did we build Waggle?
• Data Modelling
Lens-based model (distribution
keys and sort keys)
Ask and validate
Performance Efficiency
(compression, load/unload,
vacuum/analyze, in-memory
processing, WLM)
• Contextual Design
Prototype and
storyboard
Capture the “Intent”
33. Amazon Redshift: Data Warehouse
Layout
Write Cluster
Compute – dw2.large
Redshift
Read Cluster
Compute – dw2.large
Redshift
History Cluster
Density – dw1.xlarge
Redshift
Initial and
Incremental
{processed}
data loads
Periodic Data
Snapshots for
historical
analysis
Data
sources
For serving Jaspersoft
reports
APIs
OLTP
S3 COPY
S3
UnLoad
and Load
S3
UnLoad
and Load
Data mart
(aggregations)
NodesNodes
Staging
Datamart
(aggregations)
Nodes
S3 UnLoad and
Load
S3 UnLoad and
Load
+ UPSERTS
34. Results and Lessons Learned
• Performance Metrics
– Millions of records are processed in <1 minute
• LOAD/UNLOAD commands | UPSERTS | S3 COPY
Command
– Report queries average < 1 to ~1.5 seconds
– {compression} – gained 20+% efficiencies in data retrieval
• Best Practices
– {sort keys} – lens-based data model: visualize data in variety of ways
– {commit stats} – Redshift is not a transactional system
– {nested loops} – no Cartesian products, ensure joins well managed
– {queries that queue} – tune the WLM configuration
– {query runtimes} – faster query means less queuing
– {stats missing} – analyze and vacuum when possible
– {alerts with tables} – monitor to ensure queries running optimally