SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Optimizing Log
Analytics from the Edge
April 2016
© Hortonworks Inc. 2011 – 2015. All Rights Reserved
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
About Hortonworks
Customer Momentum
~800 customers (as of Feb 10, 2016)
Publicly traded on NASDAQ: HDP
Hortonworks Data Platform
Completely open multi-tenant platform
for any app and any data
Consistent enterprise services for security,
operations, and governance
Partner for Customer Success
Leader in open-source community, focused
on innovation to meet enterprise needs
Unrivaled Hadoop support subscriptions
Founded in 2011
Original 24 architects, developers,
operators of Hadoop from Yahoo!
800+
E M P L O Y E E S
1500+
E C O S Y S T E M
PA R T N E R S
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
EMBRACE AN
OPEN APPROACH
MASTER THE
VALUE OF DATA
EVERY BUSINESS
IS A DATA BUSINESS
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
DATA
AT REST
DATA
IN MOTION
ACTIONABLE
INTELLIGENCE
MODERN DATA APPLICATIONS
Actionable
Intelligence from
Connected Data
Platforms
Capturing perishable
insights from data in motion
Ensuring rich, historical insights on
data at rest
Necessary for modern data
applications
Hortonworks
DataFlow
Hortonworks
Data Platform
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Optimizing Log Ingest with
Hortonworks DataFlow
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Why Hortonworks DataFlow?
Because even the best data scientists
and most powerful platforms need
the right data to analyze
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Store Data
Process and Analyze
Data
Acquire Data
Perception of DataFlows: Easy, Definitive
Dataflow
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Reality of Dataflows: Complex, Convoluted
Store Data
Process and Analyze
Data
Acquire Data
Store DataStore Data
Store Data
Store Data
Acquire Data
Acquire Data
Acquire Data
Dataflow
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDF has 130+ Processors - Multiple for Log Analytics
HTTP
Syslog
Email
HTML
Image
Hash Encrypt
Extract
TailMerge
Evaluate
Duplicate Execute
Scan
GeoEnrich
Replace
ConvertSplit
Translate
HL7
FTP
UDP
XML
SFTP
Route Content
Route Context
Route Text
Control Rate
Distribute Load
AMQP
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Log Analytics Systems Today
LOG
ANALYTICS
PLATFORMNetwork
Device Logs
• Not all data can be captured
• Not all captured data is valuable
• Transport all data
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Cost Effectively Expand Storage Options of Log Data
LOG
ANALYTICS
PLATFORM
Network
Device Logs
HDP
HDF
3. Cost effectively
expand collection and
grow timescale of logs
collected
2. Content-based routing
based on dynamic
evaluation of content,
attributes, priority
1. Integrate and
enrich logs across
data centers and
security zones
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Efficiently Expand Log Ingestion from the Edge
LOG
ANALYTICS
PLATFORM
Network
Device Logs
HDF
HDF
HDF
HDPHDF
• Expand collection to new sources of machine data
• Edge analytics to transform, enrich and prioritize content based routing
• Capture and transport only valuable data
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Expand Analytics and Reporting Options with HDP
LOG
ANALYTICS
PLATFORM
Network
Device Logs
HDF
HDF
HDF
HDPHDF
ODBC interface
traditional BI tools
Easy access to log analytics data
through traditional BI tools
Give data scientists better
tooling – Spark, Storm etc
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Expand to small scale, remote systems
LOG
ANALYTICS
PLATFORM
Network
Device Logs
HDF
HDF
HDF
HDPHDF
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Optimize Log Analytics with Content Based Routing
LOG
ANALYTICS
PLATFORM
Edge analytics for cost-effective
and efficient movement of
machine data
HDF
Intelligent, content based
routing, transformation
and enrichment
Send data to alternative
systems based on value,
content, priority
HDP
HDF
HDF
HDF
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Splunk Optimization:
Using HDP as Data Refinery
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Splunk Hadoop Connect
17
 Reliable bi-directional integration
Import
Browse
Export
Splunk Hadoop Connect
>2000 downloads
HA Indexes and
Storage
Commodity
Servers
Hadoop
(MapReduce &
HDFS)
Report &
analyze
Custom
dashboards
Monitor
and alert
Ad hoc
search
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Splunk, Hunk & Hortonworks
YARN Ready Partner
Certified on Hortonworks Data Platform
Existing Sandbox tutorial
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Splunk, Part of the Modern Data Architecture
• Bi-directional data integration
between Splunk & HDP
• Collect data from across the
organization, deliver it to Hadoop
for refining data and batch
analytics
• Output of Hadoop jobs can be
imported into Splunk Enterprise
for rapid analysis and visualization
• Archiving from Splunk Enterprise
to Hadoop
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Splunk, Part of the Modern Data Architecture
• Bi-directional data integration
between Splunk & HDP
• Collect data from across the
organization, deliver it to
Hadoop for refining data and
batch analytics
• Output of Hadoop jobs can be
imported into Splunk Enterprise
for rapid analysis and
visualization
• Archiving from Splunk Enterprise
to Hadoop
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Hunk + Hortonworks
21
Explore, analyze and visualize data in
HDP from one integrated platform
Simply point Hunk at your HDP cluster(s)
and start exploring data immediately
Search data, change perspectives and
preview results as MapReduce jobs run
INTERACTIVE
EXPLORATION
RICH DEVELOPER
ENVIRONMENT
Build big data apps on data in HDP using
standard web languages and frameworks
FULL-FEATURED
ANALYTICS
FAST TO DEPLOY
AND DRIVE VALUE
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Augment Splunk Deployment with Hortonworks Data Platform
Heavy Indexer
Universal
Forwarders
HDP
Enables
Splunk Storage
• Expansion to more data than previously feasible
• Archive data from Splunk into Hadoop
• Query archived Splunk data in Hadoop
• Focus Splunk infrastructure on what really matters
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Find out how you much can optimize
your log analytics infrastructure today.
Contact sales@hortonworks.com

Weitere ähnliche Inhalte

Was ist angesagt?

The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFiThe First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
DataWorks Summit
 

Was ist angesagt? (17)

MiNiFi 0.0.1 MeetUp talk
MiNiFi 0.0.1 MeetUp talkMiNiFi 0.0.1 MeetUp talk
MiNiFi 0.0.1 MeetUp talk
 
State of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityState of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & Community
 
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
 
Data ingestion and distribution with apache NiFi
Data ingestion and distribution with apache NiFiData ingestion and distribution with apache NiFi
Data ingestion and distribution with apache NiFi
 
NJ Hadoop Meetup - Apache NiFi Deep Dive
NJ Hadoop Meetup - Apache NiFi Deep DiveNJ Hadoop Meetup - Apache NiFi Deep Dive
NJ Hadoop Meetup - Apache NiFi Deep Dive
 
Apache NiFi Meetup - Introduction to NiFi Registry
Apache NiFi Meetup - Introduction to NiFi RegistryApache NiFi Meetup - Introduction to NiFi Registry
Apache NiFi Meetup - Introduction to NiFi Registry
 
Running Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration OptionsRunning Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration Options
 
Apache Nifi Crash Course
Apache Nifi Crash CourseApache Nifi Crash Course
Apache Nifi Crash Course
 
Streaming analytics manager
Streaming analytics managerStreaming analytics manager
Streaming analytics manager
 
Data on the Move - DataCon DC
Data on the Move - DataCon DCData on the Move - DataCon DC
Data on the Move - DataCon DC
 
What’s new in Apache Spark 2.3 and Spark 2.4
What’s new in Apache Spark 2.3 and Spark 2.4What’s new in Apache Spark 2.3 and Spark 2.4
What’s new in Apache Spark 2.3 and Spark 2.4
 
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFiThe First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
The First Mile - Edge and IoT Data Collection With Apache Nifi and MiniFi
 
Hortonworks Data In Motion Series Part 3 - HDF Ambari
Hortonworks Data In Motion Series Part 3 - HDF Ambari Hortonworks Data In Motion Series Part 3 - HDF Ambari
Hortonworks Data In Motion Series Part 3 - HDF Ambari
 
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFiData at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
Data at Scales and the Values of Starting Small with Apache NiFi & MiNiFi
 
Difference between apache spark and apache nifi
Difference between apache spark and apache nifiDifference between apache spark and apache nifi
Difference between apache spark and apache nifi
 
You Can't Search Without Data
You Can't Search Without DataYou Can't Search Without Data
You Can't Search Without Data
 
Building a Smarter Home with Apache NiFi and Spark
Building a Smarter Home with Apache NiFi and SparkBuilding a Smarter Home with Apache NiFi and Spark
Building a Smarter Home with Apache NiFi and Spark
 

Ähnlich wie Log Analytics Optimization

Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Innovative Management Services
 
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUGReal-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
skumpf
 

Ähnlich wie Log Analytics Optimization (20)

Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
 
Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014
 
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
Data in Motion - Data at Rest - Hortonworks a Modern ArchitectureData in Motion - Data at Rest - Hortonworks a Modern Architecture
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
 
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopRescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
 
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?
 
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015
 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - WebinarHortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - Webinar
 
Apache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudApache Hadoop on the Open Cloud
Apache Hadoop on the Open Cloud
 
Building a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise HadoopBuilding a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise Hadoop
 
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUGReal-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
 
Enterprise Apache Hadoop: State of the Union
Enterprise Apache Hadoop: State of the UnionEnterprise Apache Hadoop: State of the Union
Enterprise Apache Hadoop: State of the Union
 
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks Data in Motion Webinar Series - Part 1
 
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFIHarnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
 
Apache Atlas: Why Big Data Management Requires Hierarchical Taxonomies
Apache Atlas: Why Big Data Management Requires Hierarchical Taxonomies Apache Atlas: Why Big Data Management Requires Hierarchical Taxonomies
Apache Atlas: Why Big Data Management Requires Hierarchical Taxonomies
 
Internet of things Crash Course Workshop
Internet of things Crash Course WorkshopInternet of things Crash Course Workshop
Internet of things Crash Course Workshop
 
Internet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop SummitInternet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop Summit
 
HDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi IntroductionHDF Powered by Apache NiFi Introduction
HDF Powered by Apache NiFi Introduction
 
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
 
Storm Demo Talk - Denver Apr 2015
Storm Demo Talk - Denver Apr 2015Storm Demo Talk - Denver Apr 2015
Storm Demo Talk - Denver Apr 2015
 

Mehr von Isheeta Sanghi

Mehr von Isheeta Sanghi (6)

Apache NiFi- MiNiFi meetup Slides
Apache NiFi- MiNiFi meetup SlidesApache NiFi- MiNiFi meetup Slides
Apache NiFi- MiNiFi meetup Slides
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache Flink
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache Flink
 
Integrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache FlinkIntegrating Apache NiFi and Apache Flink
Integrating Apache NiFi and Apache Flink
 
Apache Hadoop Security - Ranger
Apache Hadoop Security - RangerApache Hadoop Security - Ranger
Apache Hadoop Security - Ranger
 
Spark + Hadoop Perfect together
Spark + Hadoop Perfect togetherSpark + Hadoop Perfect together
Spark + Hadoop Perfect together
 

Kürzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

Log Analytics Optimization

  • 1. Optimizing Log Analytics from the Edge April 2016 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
  • 2. 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved About Hortonworks Customer Momentum ~800 customers (as of Feb 10, 2016) Publicly traded on NASDAQ: HDP Hortonworks Data Platform Completely open multi-tenant platform for any app and any data Consistent enterprise services for security, operations, and governance Partner for Customer Success Leader in open-source community, focused on innovation to meet enterprise needs Unrivaled Hadoop support subscriptions Founded in 2011 Original 24 architects, developers, operators of Hadoop from Yahoo! 800+ E M P L O Y E E S 1500+ E C O S Y S T E M PA R T N E R S
  • 3. 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved EMBRACE AN OPEN APPROACH MASTER THE VALUE OF DATA EVERY BUSINESS IS A DATA BUSINESS
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved DATA AT REST DATA IN MOTION ACTIONABLE INTELLIGENCE MODERN DATA APPLICATIONS Actionable Intelligence from Connected Data Platforms Capturing perishable insights from data in motion Ensuring rich, historical insights on data at rest Necessary for modern data applications Hortonworks DataFlow Hortonworks Data Platform
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Optimizing Log Ingest with Hortonworks DataFlow
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Why Hortonworks DataFlow? Because even the best data scientists and most powerful platforms need the right data to analyze
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Store Data Process and Analyze Data Acquire Data Perception of DataFlows: Easy, Definitive Dataflow
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Reality of Dataflows: Complex, Convoluted Store Data Process and Analyze Data Acquire Data Store DataStore Data Store Data Store Data Acquire Data Acquire Data Acquire Data Dataflow
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDF has 130+ Processors - Multiple for Log Analytics HTTP Syslog Email HTML Image Hash Encrypt Extract TailMerge Evaluate Duplicate Execute Scan GeoEnrich Replace ConvertSplit Translate HL7 FTP UDP XML SFTP Route Content Route Context Route Text Control Rate Distribute Load AMQP
  • 10. 10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Log Analytics Systems Today LOG ANALYTICS PLATFORMNetwork Device Logs • Not all data can be captured • Not all captured data is valuable • Transport all data
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Cost Effectively Expand Storage Options of Log Data LOG ANALYTICS PLATFORM Network Device Logs HDP HDF 3. Cost effectively expand collection and grow timescale of logs collected 2. Content-based routing based on dynamic evaluation of content, attributes, priority 1. Integrate and enrich logs across data centers and security zones
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Efficiently Expand Log Ingestion from the Edge LOG ANALYTICS PLATFORM Network Device Logs HDF HDF HDF HDPHDF • Expand collection to new sources of machine data • Edge analytics to transform, enrich and prioritize content based routing • Capture and transport only valuable data
  • 13. 13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Expand Analytics and Reporting Options with HDP LOG ANALYTICS PLATFORM Network Device Logs HDF HDF HDF HDPHDF ODBC interface traditional BI tools Easy access to log analytics data through traditional BI tools Give data scientists better tooling – Spark, Storm etc
  • 14. 14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Expand to small scale, remote systems LOG ANALYTICS PLATFORM Network Device Logs HDF HDF HDF HDPHDF
  • 15. 15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Optimize Log Analytics with Content Based Routing LOG ANALYTICS PLATFORM Edge analytics for cost-effective and efficient movement of machine data HDF Intelligent, content based routing, transformation and enrichment Send data to alternative systems based on value, content, priority HDP HDF HDF HDF
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Splunk Optimization: Using HDP as Data Refinery
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Splunk Hadoop Connect 17  Reliable bi-directional integration Import Browse Export Splunk Hadoop Connect >2000 downloads HA Indexes and Storage Commodity Servers Hadoop (MapReduce & HDFS) Report & analyze Custom dashboards Monitor and alert Ad hoc search
  • 18. 18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Splunk, Hunk & Hortonworks YARN Ready Partner Certified on Hortonworks Data Platform Existing Sandbox tutorial
  • 19. 19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Splunk, Part of the Modern Data Architecture • Bi-directional data integration between Splunk & HDP • Collect data from across the organization, deliver it to Hadoop for refining data and batch analytics • Output of Hadoop jobs can be imported into Splunk Enterprise for rapid analysis and visualization • Archiving from Splunk Enterprise to Hadoop
  • 20. 20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Splunk, Part of the Modern Data Architecture • Bi-directional data integration between Splunk & HDP • Collect data from across the organization, deliver it to Hadoop for refining data and batch analytics • Output of Hadoop jobs can be imported into Splunk Enterprise for rapid analysis and visualization • Archiving from Splunk Enterprise to Hadoop
  • 21. 21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Hunk + Hortonworks 21 Explore, analyze and visualize data in HDP from one integrated platform Simply point Hunk at your HDP cluster(s) and start exploring data immediately Search data, change perspectives and preview results as MapReduce jobs run INTERACTIVE EXPLORATION RICH DEVELOPER ENVIRONMENT Build big data apps on data in HDP using standard web languages and frameworks FULL-FEATURED ANALYTICS FAST TO DEPLOY AND DRIVE VALUE
  • 22. 22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Augment Splunk Deployment with Hortonworks Data Platform Heavy Indexer Universal Forwarders HDP Enables Splunk Storage • Expansion to more data than previously feasible • Archive data from Splunk into Hadoop • Query archived Splunk data in Hadoop • Focus Splunk infrastructure on what really matters
  • 23. 23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Find out how you much can optimize your log analytics infrastructure today. Contact sales@hortonworks.com

Hinweis der Redaktion

  1. In reality, dataflows move all over. Data is moved and stored in multiple places – sometimes interim, sometimes longterm. Data is procesed in different places, and then moved again. Complicated, convoluted, messy.
  2. Interactively search without fixed schemas or moving data. Preview results and accelerate reports for fast search and improved cluster performance. Provide self-service analytics for business and IT stakeholders with data models and pivot. Rapidly build big data apps with a rich developer environment.
  3. Interactively search without fixed schemas or moving data. Preview results and accelerate reports for fast search and improved cluster performance. Provide self-service analytics for business and IT stakeholders with data models and pivot. Rapidly build big data apps with a rich developer environment.