1. DATA INTELLIGENCE FOR ALL
Adatao Live Demo
at the First Spark Summit
Dec 2, 2013, San Francisco
(Video at the end of this deck)
Christopher Nguyen, PhD
Co-Founder & CEO
2. Big-Data Compute Engines, Google Apps
Engineering Director, Google Founders’ Award,
HKUST Prof, 2 successful enterprise exits,
Stanford PhD
Deep engineering &
business experience from
Google, Yahoo et al.
PhD’s in DM & ML from
UIUC, Georgia Tech,
Stanford, Berkeley, ...
Hadoop distributed/streaming analytics,Yahoo
Hadoop Eng, UIUC PhD
Machine learning & machine vision, US Army
Research Lab, Johns Hopkins PhD
3. Business Users
Data Scientists
Data Engineers
ONE Integrated Platform for Business & Data Science & Engineering
BIG
INSIGHTS
001 001
0 1 1 00 1 1 0
1 1 1 01 1 1 0
1 0 0 01 0 0 0
0 0 0 10 0 0 1
0 0 0 10 0 0 1
0 1 1 00 1 1 0
1 1 1 01 1 1 0
Visually Beautiful
Interactive Data
Exploration
Narrative Web App
BIG
COMPUTE
Powerful In-Memory Data Mining
Machine Learning Big Analytics Platform
(Hadoop HDFS, Cassandra, SQL DMBS, Streaming Data)
BIG
DATA
4. Architecture Design
One Integrated Platform
for Business & Data Science & Engineering
Business Users
Data Scientists
Data Engineers
001 001
0 1 1 00 1 1 0
1 1 1 01 1 1 0
1 0 0 01 0 0 0
0 0 0 10 0 0 1
0 0 0 10 0 0 1
0 1 1 00 1 1 0
1 1 1 01 1 1 0
Business Users
VS
Data Scientists
Data Engineers
stack
for
business
users
stack
for
data
science
stack
for
data
eng
OTHERS
5. 001 001
0 1 1 00 1 1 0
1 1 1 01 1 1 0
1 0 0 01 0 0 0
0 0 0 10 0 0 1
0 0 0 10 0 0 1
0 1 1 00 1 1 0
1 1 1 01 1 1 0
for Data Scientists & Engineers
Big Data Mining & Machine Learning
Powerful In-Memory Data Mining & Machine
Learning—Model Terabytes in Seconds
Interactive, Cluster-Scale Data Munging &
Modeling with Native R, R-Studio, Python, SQL,
and Java Front-ends
Real-Time Scoring Directly From Trained Models
Share reproducible, live data analysis documents
Hadoop, Cassandra, RDBMS, Streaming Data
6. for Business Users
Predictive Decision Making
A Beautiful New Way to Create & Share
Visual Narratives of Your Analysis
!
Perform Ad Hoc Queries in Plain English
!
Publish Streaming, Interactive Dashboards
!
Collaborate With Others In Real Time
!
Query Terabytes in Seconds.