Weitere Ă€hnliche Inhalte Ăhnlich wie Hortonworks kognitio webinar 10 dec 2013 (20) KĂŒrzlich hochgeladen (20) Hortonworks kognitio webinar 10 dec 20131. Hadoop and the new BI:
The Modern Data Architecture
âŠfor in memory Big Data Analytics
10 December 2013
2. Quick Housekeeping
Q&A box is available for your questions
Webinar will be recorded for future viewing
Thank You for joining!
© Hortonworks Inc. 2013
4. Your Presenters
âą Paul Groom (@datagroom)
â Chief Innovation Officer
â 28 years buried in the big data of the data
guiding business users to value
â Two wheels are more fun than four
âą John Kreisa (@marked_man)
â VP Strategic Marketing, Hortonworks
â Over 20 years in data management as a
developer and a marketer
â Avid camper
© Hortonworks Inc. 2013
Page 4
5. Todayâs Topics
âą Introduction
âą Drivers for the Modern Data Architecture (MDA)
âą Apache Hadoop in the MDA
âą Kognitioâs role in the MDA
âą Q&A
© Hortonworks Inc. 2013
Page 5
7. APPLICATIONS
Modern Data Architecture Enabled
BusinessÂ
Analytics
CustomÂ
Applications
Packaged
Applications
DEVÂ &Â DATA
TOOLS
SOURCES
DATAÂ SYSTEM
BUILDÂ &Â
TEST
OPERATIONAL
TOOLS
RDBMS
EDW
MANAGEÂ &Â
MONITOR
MPP
REPOSITORIES
Existing SourcesÂ
Emerging SourcesÂ
(CRM, ERP, Clickstream, Logs)
(Sensor, Sentiment, Geo, Unstructured)
© Hortonworks Inc. 2013
Page 7
8. Hadoop Powers Modern Data Architecture
Hadoop Cluster
compute
&
storage
.
.
.
.
.
.
.
.
.
.
compute
&
storage
Hadoop clusters provide
scale-out storage and
distributed data processing
on commodity hardware
Apache Hadoop is an open source project
governed by the Apache Software Foundation
(ASF) that allows you to gain insight from massive
amounts of structured and unstructured data
quickly and without significant investment.
© Hortonworks Inc. 2013
Page 8
9. Drivers of Hadoop Adoption
New Business
Applications
From NEW types of
Data (or existing
types for longer)
© Hortonworks Inc. 2013
Page 9
10. Most Common NEW TYPES OF DATA
1. Sentiment
Understand how your customers feel about your brand and
products â right now
2. Clickstream
Capture and analyze website visitorsâ data trails and
optimize your website
3. Sensor/Machine
Discover patterns in data streaming automatically from
remote sensors and machines
4. Geographic
Analyze location-based data to manage operations where
they occur
5. Server Logs
Research logs to diagnose process failures and prevent
security breaches
6. Unstructured (txt, video, pictures, etc..)
Understand patterns in files across millions of web pages,
emails, and documents
© Hortonworks Inc. 2013
Value
11. Keep Existing Data Around Longer
âą Online archive
â Data that was once moved to tape can
now be queried to understand long term trends
âą Compliance retention
â Industry specific requirements for retention
of data
Value
âą Combine with external historical data sources
â Weather, survey, research, purchased, etc.
© Hortonworks Inc. 2013
12. Drivers of Hadoop Adoption
Architectural
A Modern Data
Architecture
Complement your existing data
systems: the right workload in the
right place
New Business
Applications
© Hortonworks Inc. 2013
Page 12
13. Requirements for Hadoop Adoption
Requirements for Hadoopâs Role
in the Modern Data Architecture
Integrated
Key Services
Interoperable with
existing data center
investments
Platform, operational and
data services essential for
the enterprise
Skills
Leverage your existing
skills: development,
operations, analytics
© Hortonworks Inc. 2013
Page 13
14. Requirements for Enterprise Hadoop
1
2
3
Key Services
Platform, Operational and
Data services essential
for the enterprise
OPERATIONALÂ
SERVICES
AMBARI
HBASE
PIG
SQOOP
HIVEÂ &
HCATALOG
LOADÂ &Â
EXTRACT
Skills
NFS
CORE
PLATFORMÂ
SERVICES
Integrated
WebHDFS
KNOX*
MAPÂ
REDUCE
TEZ
YARNÂ Â
HDFS
Enterprise Readiness
High Availability, Disaster
Recovery, Rolling Upgrades,
Security and Snapshots
HORTONWORKSÂ
DATAÂ PLATFORMÂ (HDP)
Engineered with existing
data center investments
OS/VM
© Hortonworks Inc. 2013
FLUME
FALCON*
OOZIE
Leverage your existing
skills: development,
analytics, operations
DATA
SERVICES
Cloud
Appliance
Page 14
15. Requirements for Enterprise Hadoop
3
Leverage your existing
skills: development,
analytics, operations
Integration
DEVELOP
ANALYZE
2
Skills
Platform, operational and
data services essential
for the enterprise
OPERATE
1
Key Services
COLLECT
PROCESS
BUILD
EXPLORE
QUERY
DELIVER
PROVISION
MANAGE
MONITOR
Engineered with existing
data center investments
© Hortonworks Inc. 2013
Page 15
16. Familiar and Existing Tools
3
Leverage your existing
skills: development,
analytics, operations
Integration
DEVELOP
ANALYZE
2
Skills
Platform, operational and
data services essential
for the enterprise
OPERATE
1
Key Services
COLLECT
PROCESS
BUILD
EXPLORE
QUERY
DELIVER
PROVISION
MANAGE
MONITOR
Engineered with existing
data center investments
© Hortonworks Inc. 2013
Page 16
17. APPLICATIONS
Requirements for Enterprise Hadoop
BusinessÂ
Analytics
CustomÂ
Applications
Packaged
Applications
Integrated with
DEVÂ &Â DATA
TOOLS
Applications
BUILDÂ &Â
DATAÂ SYSTEM
Business Intelligence,
TEST
Developer IDEs,
Data Integration
SOURCES
3
OPERATIONAL
TOOLS
RDBMS
EDW
MANAGEÂ &Â
Systems
MONITOR
MPP
Data Systems & Storage,
Systems Management
REPOSITORIES
Platforms
Integration
Existing SourcesÂ
Engineered with existing
(CRM, ERP, Clickstream, Logs)
data center investments
© Hortonworks Inc. 2013
Emerging SourcesÂ
(Sensor, Sentiment, Geo, Unstructured)
Operating Systems,
Virtualization, Cloud,
Appliances
Page 17
18. SOURCES
DATAÂ SYSTEM
APPLICATIONS
A Modern Data Architecture Applied
BusinessÂ
Analytics
CustomÂ
Applications
Packaged
Applications
Complement data systems
RDBMS
EDW
MPP
Right workload right place
REPOSITORIES
Existing SourcesÂ
Emerging SourcesÂ
(CRM, ERP, Clickstream, Logs)
(Sensor, Sentiment, Geo, Unstructured)
© Hortonworks Inc. 2013 - Confidential
Page 18
19. APPLICATIONS
Kognitio in the Modern Data Architecture
BusinessÂ
Analytics
BusinessÂ
Intelligence Tools
OLAPÂ Clients
DEVÂ &Â DATA
TOOLS
SOURCES
DATAÂ SYSTEM
Inâmemory MPP Accelerator
BUILDÂ &Â
TEST
OPERATIONAL
TOOLS
RDBMS
EDW
MANAGEÂ &Â
MONITOR
MPP
REPOSITORIES
Existing SourcesÂ
Emerging SourcesÂ
(CRM, ERP, Clickstream, Logs)
(Sensor, Sentiment, Geo, Unstructured)
© Hortonworks Inc. 2013 - Confidential
Page 19
20. APPLICATIONS
Kognitio in the Modern Data Architecture
BusinessObjects BI
DEVÂ &Â DATA TOOLS
DATAÂ SYSTEM
Inâmemory MPP Accelerator
OPERATIONALÂ TOOLS
RDBMS
HANA
EDW
MPP
SOURCES
INFRASTRUCTURE
Existing SourcesÂ
Emerging SourcesÂ
(CRM, ERP, Clickstream, Logs)
(Sensor, Sentiment, Geo, Unstructured)
© Hortonworks Inc. 2013 - Confidential
Page 20
21. Todayâs Topics
âą Introduction
âą Drivers for the Modern Data Architecture (MDA)
âą Apache Hadoopâs role in the MDA
âą Kognitioâs role in the MDA
âą Q&A
© Hortonworks Inc. 2013
Page 21
22. Hadoop and the new BI
Requirements for Hadoopâs Role
in the Modern Data Architecture
1
Integrated
Interoperable with
existing data center
investments
© Hortonworks Inc. 2013
2
Skills
3
Key Services
Platform, operational and
data services essential for
the enterprise
Leverage your existing
skills: development,
operations, analytics
Page 22
23. Motivation
âą Historical architecture = Existing investment
1
Key Services
Platform, Operational a
Data services essential
for the enterprise
Cognos
âą Must plug-and-play with MDA
â Do not disrupt, enhance!
âą Performance and behavior expectations
â Dynamic ad-hoc access
â Drill unlimited
â Report on-demand
© Hortonworks Inc. 2013
Page 23
26. In-memory analytical platform
âą Software only
â Easy to deploy alongside HDP
â Simple two stage install
âą Commodity Hardware
3
Integration
Engineered with existing
data center investments
â X86/64 Linux Platform with 10GbE network â same as HDP
â Biased to more RAM and less disk
âą Scale-out MPP
â Same compute model as Hadoop
â Strong focus on 100% effective CPU utilization for any given query
âą Exploits features of underlying persistent store
â Simple âPull dataâ access methods
â Parallelism â all HDP nodes intercommunicating with all Kognitio nodes
âą ANSI 2011 SQL
â Mature fully featured
â Transaction processing capable
âą Not-only-SQL
2
Skills
Leverage your existing
skills: development,
analytics, operations
â Any script or binaries executed in-line within SQL queries
© Hortonworks Inc. 2013
Page 26
27. Tight Integration
3
âą Map-reduce Connector
â Filtered access
© Hortonworks Inc. 2013
Integration
Engineered with existing
data center investments
âą HDFS Connector
â Low Latency access
Page 27
28. So why In-memory?
INSTANT WAIT
âą Exploit the âDynamicâ access element of âDâ-RAM
â Data placed in memory in structures best suited for CPUs, not for disks
© Hortonworks Inc. 2013
Page 28
30. Building Data Models
âą Hadoop is a great repository
âą Perfect to handle volume and variability without effort
âą Perfect to âtriageâ the data, to reshape, filter and project intoâŠ
âą Data Virtualisation / Logical Data Warehouse
⊠but with the associated horsepower to dynamically analyse the data
âą Plug standard tools straight in â not a Java programmer in sight!
âą Central control and security
âą Data model shelf life getting shorter â sandboxes and workbenches
â Build on-demand to meet todays needs â just pull data from your HDP
â Lots of project based discovery and analytics
â World is changing rapidly
â Ever tighter feedback loops
© Hortonworks Inc. 2013
Page 30
31. Analytical Complexity
Increasing Computation
Machine learning
algorithms
Behaviour
modelling
Statistical
Analysis
Dynamic
Simulation
Clustering
Dynamic
Interaction
Reporting &
BPM
Campaign
Management
Fraud
detection
Technology/Automation
© Hortonworks Inc. 2013
Page 31
33. Mature SQL atop Hadoop
Kognitio is an inâmemoryÂ
analytical platform that is tightlyÂ
integrated with Hadoop for highâ
performance advanced analyticsÂ
that make Big Data moreÂ
consumable for enterprises,Â
especially those with mature BIÂ
environments or engrainedÂ
tools.Â
âą Powering advanced analytics atÂ
organizations worldwide, such as:Â
⹠Privately held
âą Invented the inâmemory analytical platform
âą Labs in the UK â HQ in New York, NYÂ
© Hortonworks Inc. 2013
Page 33
34. APPLICATIONS
Kognitio in the Modern Data Architecture
BusinessÂ
Analytics
BusinessÂ
Intelligence Tools
OLAPÂ Clients
DEVÂ &Â DATA
TOOLS
SOURCES
DATAÂ SYSTEM
Inâmemory MPP Accelerator
BUILDÂ &Â
TEST
OPERATIONAL
TOOLS
RDBMS
EDW
MANAGEÂ &Â
MONITOR
MPP
REPOSITORIES
Existing SourcesÂ
Emerging SourcesÂ
(CRM, ERP, Clickstream, Logs)
(Sensor, Sentiment, Geo, Unstructured)
© Hortonworks Inc. 2013
Page 34
35. Forrester Wave: a âstrong performerâ
âą
âą
Kognitioâs EDW is a strong, cost-effective
alternative to SAP HANA.
âą
KognitioâŠwas designed from the start as an
MPP (distributed) in-memory RDBMS,
making extensive use of RAM-based
processing for maximum performance.
âą
© Forrester Corp. Used with permission.
© Hortonworks Inc. 2013
Kognitioâs entirely in-memory, distributed
EDW is appealing for customers looking for
fast performance on commodity hardware
Download a complimentary copy of the
full report at www.kognitio.com/wave
Page 35
36. The Modern Data Architecture
âŠfor in memory Big Data Analytics
More about Kognito and Hortonworks
http://hortonworks.com/partner/kognitio
Get started with Hortonworks Sandbox
http://hortonworks.com/hadoop-tutorial/
Follow us:
@hortonworks @kognitio
Question & Answer session will be conducted electronically,
using the panel to the right of your screen
Todayâs Slides available at: www.slideshare.net/kognitio