The Briefing Room with Dr. Robin Bloor and Platfora
Live Webcast on October 28, 2014
Watch the archive:
https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=0a3c69090358622b0acbf58c474c2df0
The future of big data analytics depends heavily on two factors: access and performance. Within the current landscape, business analysts can be limited by the data preparation process, which is often greatly slowed when requesting data from multi-structured sources such as Hadoop. The result? An encumbered workflow. Fortunately, a new solution built on Apache Spark, the open source cluster computing framework, has emerged and has the potential to disrupt the current analytics paradigm.
Register for this episode of The Briefing Room to hear veteran Analyst Dr. Robin Bloor as he explains how big data has forced a sea change in analytical processes. He'll be briefed by Denise Hemke of Platfora, who will tout her company's Big Data Analytic Platform for Hadoop. She will provide a demo and show how Platfora's end-to-end platform can bring next generation capabilities to analytical workflows, including faster access for analysts and more robust development for data scientists.
Visit InsideAnlaysis.com for more information.
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Fire in the Hole: How a Spark-Powered Platform Charges Analytics
1. Grab some
coffee and
enjoy the
pre-show
banter
before the
top of the
hour!
2. Fire in the Hole: How a Spark-Powered Platform Charges Analytics
The Briefing Room
3. Twitter Tag: #briefr
The Briefing Room
Welcome
Host:
Eric Kavanagh
eric.kavanagh@bloorgroup.com
@eric_kavanagh
4. ! Reveal the essential characteristics of enterprise
software, good and bad
! Provide a forum for detailed analysis of today’s innovative
technologies
! Give vendors a chance to explain their product to savvy
analysts
! Allow audience members to pose serious questions... and
get answers!
Twitter Tag: #briefr
The Briefing Room
Mission
5. This Month: ANALYTIC PLATFORMS
November: DISCOVERY & VISUALIZATION
December: INNOVATORS
Twitter Tag: #briefr
The Briefing Room
Topics
2014 Editorial Calendar at
www.insideanalysis.com/webcasts/the-briefing-room
6. Twitter Tag: #briefr
The Briefing Room
Executive Summary
Ø The earth is SHAKING
Ø Remain FLEXIBLE
Ø EXPECT the future
Ø Prepare for CHANGE
7. Twitter Tag: #briefr
The Briefing Room
Analyst: Robin Bloor
Robin Bloor is
Chief Analyst at
The Bloor Group
robin.bloor@bloorgroup.com
@robinbloor
8. Twitter Tag: #briefr
The Briefing Room
Platfora
! Platfora is a big data analytics company
! Its Big Data Analytics platform runs natively on the open
source Apache Hadoop framework, and it delivers analytics
over petabyte-scale data
! The recently released Platfora 4.0, built on Apache Spark,
includes Geo Analytics capabilities and added Advanced
Visualizations
9. Twitter Tag: #briefr
The Briefing Room
Guest: Denise Hemke
As a Director of Product Management, Denise Hemke
focuses on building BI so business analysts have self-service
access to petabyte scale data. Denise has been
building enterprise products for the last 13 years in a
variety of different industries. She is passionate about
partnering with customers, engineering and design. She
enjoys building enterprise products that solve real
customer use cases with a consumer-quality aesthetic.
Early in her career, she focused on the development of BI
and management applications for AT&T data centers and
their customers. At Salesforce, Denise managed teams
responsible for building monitoring & management, debugging, and productivity tools for
use by R&D, Operations and customers. Denise also served as the Director of Engineering
at Platfora. In that role, she was responsible for innovation and delivery of the customer
surface area, which includes rendering large-scale visualizations.
10. Disrupting the
traditional analyst
workflow with
Platfora and Spark.
Denise Hemke, Director of Products
October 28, 2014
@denisehemke @platfora
10
11. Introducing Platfora
MISSION
LEAD THE INDUSTRY TRANSITION FROM BUSINESS
INTELLIGENCE TO BIG DATA ANALYTICS.
@denisehemke @platfora
11
#1 Big Data
Analytics platform
native on Hadoop
End-to-end platform built
for Multi-Structured Data
Self-service, iterative,
interactive, and fast
12. Platfora is the Only End-to-End Big Data
Analytics Platform
@denisehemke @platfora
Analysis
Data
Preparation
10
%
100%
TODAY: “MULTI-STRUCTURED
DATA ANALYSIS”
Analyst
2005 - TODAY
IT
“DATA
DISCOVERY”
Analyst
13. “Apache Spark is Hadoop's speedy Swiss Army knife.”
“Apache lights a fire under Hadoop with
Spark.”
“Spark is making waves because it’s putting
MapReduce on the endangered species list.”
@denisehemke @platfora 13
14. Spark: The New Foundation of Big Data
Analytics
And, why it matters for your business
@denisehemke @platfora
“Out of the box” advanced analytics
Beyond SQL only
Easier to access
Vendor neutral and open source
In-memory speed
Spark is winning
14
15. Spark: The New Foundation of Big Data
Analytics
And, why it matters for your business
@denisehemke @platfora
“Out of the box” advanced analytics
Beyond SQL only
Easier to access
Vendor neutral and open source
In-memory speed
Spark is winning
15
16. Spark: The New Foundation of Big Data
Analytics
And, why it matters for your business
@denisehemke @platfora
“Out of the box” advanced analytics
Beyond SQL only
Easier to access
Vendor neutral and open source
In-memory speed
Spark is winning
16
17. Spark: The New Foundation of Big Data
Analytics
And, why it matters for your business
@denisehemke @platfora
“Out of the box” advanced analytics
Beyond SQL only
Easier to access
Vendor neutral and open source
In-memory speed
Spark is winning
17
18. Spark: The New Foundation of Big Data
Analytics
And, why it matters for your business
@denisehemke @platfora
“Out of the box” advanced analytics
Beyond SQL only
Easier to access
Vendor neutral and open source
In-memory speed
Spark is winning
18
19. Spark: The New Foundation of Big Data
Analytics
And, why it matters for your business
@denisehemke @platfora
“Out of the box” advanced analytics
Beyond SQL only
Easier to access
Vendor neutral and open source
In-memory speed
Spark is winning
19
20. Let’s Avoid the Pitfalls of the Past
The big data analytics workflow is broken and needs to be fixed
@denisehemke @platfora
20
DATA ADMIN
He’s
overwhelmed by
the number
of data
preparation
requests.
DATA
SCIENTIST
She’s focused on
mundane work
instead of high
value projects.
C-SUITE
He’s falling
behind the
competition.
BUSINESS
ANALYST
He can’t answer
his questions fast
enough.
21. Let’s Avoid the Pitfalls of the Past
The big data analytics workflow is broken and needs to be fixed
@denisehemke @platfora
21
DATA ADMIN
He’s
overwhelmed by
the number
of data
preparation
requests.
DATA
SCIENTIST
She’s focused on
mundane work
instead of high
value projects.
C-SUITE
He’s falling
behind the
competition.
BUSINESS
ANALYST
He can’t answer
his questions fast
enough.
22. Let’s Avoid the Pitfalls of the Past
The big data analytics workflow is broken and needs to be fixed
@denisehemke @platfora
22
DATA ADMIN
He’s
overwhelmed by
the number
of data
preparation
requests.
DATA
SCIENTIST
She’s focused on
mundane work
instead of high
value projects.
C-SUITE
He’s falling
behind the
competition.
BUSINESS
ANALYST
He can’t answer
his questions fast
enough.
23. Let’s Avoid the Pitfalls of the Past
The big data analytics workflow is broken and needs to be fixed
@denisehemke @platfora
23
DATA ADMIN
He’s
overwhelmed by
the number
of data
preparation
requests.
DATA
SCIENTIST
She’s focused on
mundane work
instead of high
value projects.
C-SUITE
He’s falling
behind the
competition.
BUSINESS
ANALYST
He can’t answer
his questions fast
enough.
24. Let’s Avoid the Pitfalls of the Past
The big data analytics workflow is broken and needs to be fixed
@denisehemke @platfora
24
DATA ADMIN
He’s
overwhelmed by
the number
of data
preparation
requests.
DATA
SCIENTIST
She’s focused on
mundane work
instead of high
value projects.
C-SUITE
He’s falling
behind the
competition.
BUSINESS
ANALYST
He can’t answer
his questions fast
enough.
25. Platfora is Laying the Foundation for the
Future of Big Data Analytics
@denisehemke @platfora
25
Built on Spark
The definitive
end-to-end Business
Analyst workflow
built on Spark
Next-gen
Data Preparation
Fast, smart, and powerful
data preparation
seamlessly integrated into
the full-stack
Platfora Platform
Extensions
Adapt the platform to
your data and questions
while amplifying the work
of developers
26. The Definitive End-to-End Business Analyst
Workflow
@denisehemke @platfora
26
You can use data science to get
better answers
• Access to advanced analytics
processing models
You don’t have to write the code
yourself
• Integrated into the full-stack platform
You’re not stuck in history
• Always running on the latest technology
Built on Spark
27. Data Preparation Integrated into an End-to-
End Platform
@denisehemke @platfora
27
You get revolutionary time
to value
• Natively integrated full-stack solution
So simple your business
users can do it
• Making data prep visual, safe, and
intelligent
Powerful for your enterprise
• Built to handle enterprise scale big data
projects
Next-gen data preparation
28. Configurable Platform Extensions That Meet
the Needs of any Business
@denisehemke @platfora
28
Platfora Platform Extensions (PPE)
You can stop trying to find a BI
system that can answer your
questions out
of the box
• Adapt the platform to your data and
questions
You don’t have to repeat the
process, every time your
questions change
• Reusable and configurable
You have all the power in the world
• Utilize all of Spark, including access any
data source
34. Data Analytics: A Process
u Data Analytics is a multi-disciplinary
end-to-end
repetitive process
u It changed, because of:
• Data availability ++
• Parallel technology
• Scalable software
• Open source tools
• M/C Learning
Data Access
Data Prep
Model
Analyze
Execute
Deploy
35. Does it Matter if it’s Not the Fastest?
The situation is COMPLEX
rather than simple.
Analytics is not Formula 1
racing, but:
u It is ITERATIVE
u The speed of the END-TO-END
PROCESS matters
u And, the impact of the
technology on the
ANALYTICAL PROCESS
matters
37. Analytical Latencies
1. Data access
2. Data preparation
3. Model development
4. Execution
5. Implementation
6. Model audit & update
This is where the
rubber meets the road:
SPEED = VALUE
38. The Impending Reality
Technology can speed up analytics by
two orders of magnitude
(on the IT side)
THIS WILL CHANGE ANALYTICS
39. u Are we really short on data scientists? Or are we
short on fast analytics tools?
u Is the data really big?
u Please comment on analytical workloads:
• What do you see as the natural IT bottlenecks?
• What do you see as the natural business
bottlenecks?
u Do we want business analysts to become ersatz
data scientists?
40. u In respect to scale, what is your largest
implementation by data volume, and what was
the industry sector/problem space?
u What do you see as the largest barrier to
adoption of Platfora?