The Briefing Room with Radiant Advisors and IBM
Live Webcast on February 25, 2014
Watch the archive: https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=53c9b7fa2000f98f5b236747e3602511
The power of Big Data depends heavily upon the context in which it's used, and most organizations are just beginning to figure out where, how and when to leverage it. One key to success is integration with existing information systems, many of which still rely on relational database technologies. Finding ways to blend these two worlds can help companies generate measurable business value in fairly short order.
Register for this episode of The Briefing Room to hear Analysts Lindy Ryan and John O'Brien as they explain how the combination of traditional Business Intelligence with Big Data Analytics can provide game-changing results in today's information economy. They'll be briefed by Eric Poulin and Paul Flach of Stream Integration who will share best practices for designing and implementing Big Data solutions. They'll discuss the components of IBM BigInsights, and explain how BigSheets can empower non-technical users who need to explore self-structured data.
Visit InsideAnlaysis.com for more information.
1. Grab some coffee and enjoy
the pre-show banter before
the top of the hour!
2. Big Data in Action: Real-World Solution Showcase
The Briefing Room
3. Twitter Tag: #briefr
The Briefing Room
Welcome
Host:
Eric Kavanagh
eric.kavanagh@bloorgroup.com
4. Mission
! Reveal the essential characteristics of enterprise software,
Twitter Tag: #briefr
The Briefing Room
good and bad
! Provide a forum for detailed analysis of today’s innovative
technologies
! Give vendors a chance to explain their product to savvy
analysts
! Allow audience members to pose serious questions... and get
answers!
5. Twitter Tag: #briefr
The Briefing Room
Topics
This Month: BIG DATA
March: CLOUD
April: BIG DATA
2014 Editorial Calendar at
www.insideanalysis.com/webcasts/the-briefing-room
7. Analysts: Lindy Ryan and John O’Brien
Twitter Tag: #briefr
Lindy Ryan is the Research Director for Radiant Advisor’s Data
Discovery and Visualization practice and leads research and analyst
activities in the confluence of data discovery, visualization, and data
science from a business needs perspective. She also retains the role
of Editor in Chief of RediscoveringBI Magazine. As Radiant Advisors’
Editor in Chief for three years, Lindy participated in in-depth
discussions and analysis with industry thought leaders and vendors
while maturing her position and perspectives in the BI industry.
John O’Brien is Principal and CEO of Radiant Advisors. With over 25
years of experience delivering value through data warehousing and BI
programs, John’s unique perspective comes from the combination of
his roles as a practitioner, consultant, and vendor in the BI industry.
His knowledge in designing, building, and growing enterprise BI
systems and teams brings real world insights to each role and phase
within a BI program. Today, through Radiant Advisors John provides
research and advisory services that guide companies in meeting the
demands of next generation information management, architecture,
and emerging technologies.
The Briefing Room
8. ! IBM offers a full suite of Big Data solutions, including
InfoSphere Streams, InfoSphere BigInsights and InfoSphere
Data Explorer
! IBM also offers a series of products designed to leverage the
Twitter Tag: #briefr
The Briefing Room
power of Hadoop
! Stream Integration is a Premier Business Partner with IBM
and focuses its consultancy exclusively on IBM products
IBM
9. Twitter Tag: #briefr
The Briefing Room
Guests:
Eric Poulin
VP of Business Analytics,
Stream Integration
Paul Flach
VP of Enterprise Analytics,
Stream Integration
10. 10
Big
Data
Performance
for
Analy3cs
Eric
Poulin
VP,
Analy3cs
Big
Data
Eric.poulin@streamintegra3on.com
11. 11
11
Agenda
• Overview
of
Stream
Integra3on
• Big
Data
Performance
for
Analy3cs
• Modular
Analy3cs
13. LINKING
DATA
13
TO
THE
BUSINESS
REQUIREMENTS
TRANSACTIONAL
COLLABORATIVE
APPLICATIONS
MANAGE
CONTENT
ANALYZE
BIG
DATA
STRUCTURED
DATA
INTEGRATE
INFOSPHERE
MDM
GOVERN
DATA
BUSINESS
ANALYTICS
APPLICATIONS
STREAMS
EXTERNAL
INFORMATION
SOURCES
ww
QUALITY
LIFECYCLE
MANAGEMENT
SECURITY
PRIVACY
INFORMATION
SERVER
DESIGN
★
DEPLOY
★
OPERATE
★
MANAGE
★
EXTEND
BIG
INSIGHTS
TRADITIONAL
SOURCES
PUREDATA/NETEZZA
STREAMING
INFORMATION
15. 15
Capabili3es
Required
for
Hadoop
Style
Workloads
Applica3on
Support
and
Development
Cluster
and
Workload
Management
Run3me
Visualiza3on
Discovery
Data
Ingest
Analy3cs
Engines
Data
Store
File
System
Tooling
Security
15
16. 16
Big
SQL
provides
na3ve
SQL
for
Hadoop
ANSI
SQL
92+
support
17. 17
Coordinator node
Map
Reduce
MPP
RunKme
n+2
User
Data
temp(s)
HDFS
Hadoop Data Node(s)
SQL sub-sections
Map
Reduce
MPP
RunKme
n+n
User
Data
temp(s)
HDFS
Head Node
Catalog
Host 2 Host n
Host 1
Cluster
network
Local
fs
(temps)
Local
fs
(catalog
tables)
Distributed
fs
sync
Map
Reduce
MPP
RunKme
n+1
User
Data
temp(s)
HDFS
Direct
Hadoop
data
access
sync
sync
Big
AcceleraKon
Query
OpKmizer
Common
SQL
BigInsights
–
DB2
–
Netezza
Oracle
–
Teradata
Next
Gen
Big
SQL
will
provide
first
MPP
query
engine
for
Hadoop
18. 18
BigSheets
provides
business
users
with
access
to
data
without
programming
Spreadsheet-‐style
interface
Data
VisualizaKon
Graphs
19. 19
Watson
Explorer
included
in
BigInsights
Faceted
Search,
NavigaKon
Discovery
20. 20
AnalyKcs
Accelerators
provide
ability
to
extract
insights
more
quickly
Text
Social
Media
Machine
Data
21. 21
App
Store
reduces
development
effort
and
enables
reusability
Combine
Hadoop
Apps
22. 22
Open
Source
Hadoop
Components
Visualization Discovery Data Ingest
Open
Source
Analytics Engines
Cluster Optimization and Management
Nutch
Runtime
Data Store HBase
File System
MapReduce
HDFS
Application Support and Development Tooling
MapReduce
Pig
Hive
ZooKeeper
Sqoop
Security
HCatalog
Flume
Avro
Lucene
Oozie
Derby
22
23. 23
BigInsights
Enterprise
Edi3on
Components
Visualization Discovery Data Ingest
Netezza
DB2
Analytics Engines
Cluster Optimization and Management Streams
Derby
Private
firewall
Open
Source
DataStage
Nutch
IBM
IBM InfoSphere BigInsights
Integrated
Installer
Runtime
Admin
Console
Data Store HBase
File System
MapReduce
HDFS
Text
Processing
Engine
and
Extractor
Library
(AQL+HIL)
JDBC
Application Support and Development Tooling
App
infrastructure
MapReduce
Pig
Hive
Splicable
Text
Compression
High
Availability
ZooKeeper
Sqoop
SystemML
Eclipse
Big
SQL
Security
HCatalog
R
Gnip
BoardReader
GPFS-‐FPO
LDAP
Guardium
Flume
Jaql
Avro
BigSheets
Dashboard
/
visualiza3on
Data
Explorer
Lucene
Oozie
PAM
Enhanced
Monitoring
Adap3ve
MapReduce
Teradata
23
36. This Month: BIG DATA
March: CLOUD
April: BIG DATA
www.insideanalysis.com/webcasts/the-briefing-room
Twitter Tag: #briefr
The Briefing Room
Upcoming Topics
2014 Editorial Calendar at
www.insideanalysis.com