SlideShare ist ein Scribd-Unternehmen logo
1 von 17
INTEL CONFIDENTIAL, FOR INTERNAL USE ONLY
1
Getting Started with Big Data
How to Move Forward with
Apache Hadoop* Software
INTEL CONFIDENTIAL
2
Five Things to Know
Big data is a disruptive force that can drive
competitive advantage
Apache Hadoop* software is an emerging
technology for big data analytics
There are two approaches to implementing
big data projects
Intel® technologies and software support big data
Optimize and tune your big data environment
for best performance
1
2
3
4
5
INTEL CONFIDENTIAL
3
Big Data
Volume, Variety, and Velocity
Volume: Data sets that are orders of magnitude larger
than you have handled before
• The digital universe of data could reach 8 zettabytes of data by 20151
• That equals the data held by 18 million U.S. Libraries of Congress2
Variety: More diverse data types, including:
• Structured (transactions, customer information)
• Semistructured and unstructured (web logs, e-mails, documents,
images, video)
Velocity: Arriving faster than ever before
• Real-time streaming data
1 Gens, Frank. IDC Predictions 2012: Competing for 2020. IDC (December 2011).
2 “Big Data Infographic and Gartner 2012 Top 10 Strategic TechTrends.” Business Analytics 3.0 (blog) (November 11, 2011).
INTEL CONFIDENTIAL
4
Getting Bigger
Billions of Connected Devices and Internet Users
Source: Savitz, Eric. “Cisco Predicts the Rise of the Zettabyte Era.” Forbes (May 30, 2012).
forbes.com/sites/ericsavitz/2012/05/30/cisco-predicts-the-rise-of-the-zettabyte-era/
By 2016,
19 billion connected
devices—including 3.4
billion Internet users and
machine-to-machine
connections−will contribute
to the flood of
big data.
INTEL CONFIDENTIAL
5
The Reason for All the Buzz
Big Data Drives Competitive Advantage
The real value of big data is in the insights it produces when analyzed:
Finding patterns
Deriving meaning
Making decisions
Responding to the world with intelligence
INTEL CONFIDENTIAL
6
The Apache Hadoop* Framework
An Emerging Approach to Big Data Analytics
Open-source software that provides a simple programming model for
distributed processing of large data sets
• Provides a massively scalable storage and a data processing system (not a
database) built on clusters of computers
• Supplements your existing systems by handling data that’s typically
a problem for them
- Too large
- Unstructured
- Mix of types
- Real-time streaming
INTEL CONFIDENTIAL
7
It handles all kinds of data.
It scales quickly and affordably.
It reveals new insight. .
It reduces costs.
It delivers higher availability.
It lowers organizational risk.
Apache Hadoop* Breakthroughs
Advantages over Traditional Systems
No need to develop specific schemas.
Add more servers and storage as you need it!
Find hidden relationships that were difficult—
or even impossible—to find in the past.
• Open-source software that runs on standard servers.
• Lower cost per terabyte for storage and processing.
Fault tolerant; designed to recover from hardware,
software, and system failures.
Apache Hadoop* innovations continue through an active
and diverse global community.
INTEL CONFIDENTIAL
8
Two Approaches to Apache Hadoop*
What’s Right for Your Organization?
Apache Hadoop* software-only deployments
• Free Apache Hadoop open-source software
• Vendor distributions that prepackage Hadoop*
software with value-added enhancements and
services
1
Hadoop software integrated with
traditional databases
• Extend existing data warehousing and analytics
platforms to include Hadoop software
2
INTEL CONFIDENTIAL
9
Apache Hadoop* Deployment
Put the Right Infrastructure in Place
Clusters of standard servers
10 gigabit Ethernet networking
Intelligent storage
Apache Hadoop* software
INTEL CONFIDENTIAL
10
Intel® Technologies for Big Data
Get Maximum Performance
Server clusters: Intel® Xeon® processor E5 family
Networking: Intel ® Ethernet 10 Gigabit Converged
Network Adapters
Storage: Intel ® Solid-State Drives
Software: Intel ® Distribution for Apache Hadoop*
software
(Intel Distribution)1
1 Currently available in China, Taiwan, and the United States.
INTEL CONFIDENTIAL
11
Intel® Distribution for
Apache Hadoop* Software
Enterprise ready for a variety of use cases1
Supports a wide range of analytics
• Enhances Apache Hive* and Apache HBase* software
Introduces graph analytics capabilities with Intel® GraphBuilder soft ware
• Provides a Java library for constructing graphs that help visualize data relationships
Optimizes open-source Apache Hadoop* components
• Takes advantage of Intel Xeon® processor capabilities
Hadoop* security, scalability, and management enhancements
• Tightly integrated into the platform
Support and services from Intel and its partners
Find out more about the Intel Distribution
1 Currently available in China, Taiwan, and the United States.
INTEL CONFIDENTIAL
12
Apache Hadoop* Optimization
Practical Trade-offs for Hardware, Software, and System Settings
Fine-tune your solution for best performance:
Maximize productivity
Limit energy consumption
Maximize resource utilization
Reduce operating costs
Lower your total cost of ownership
INTEL CONFIDENTIAL
13
Benchmark Performance
Intel’s HiBench Suite
Comprehensive set of benchmark tests for Apache
Hadoop*software
Represents important Hadoop* workloads and analytics
with a mix of hardware usage characteristics
Available as open-source software under Apache License
2.0 at https://github.com/hibench/HiBench-2.1
INTEL CONFIDENTIAL
14
Get Started
Five Steps for IT Managers
Work with your business users to articulate the big opportunities
Do your research to get up to speed on the technology
Develop use case(s) for your project
Identify gaps between current- and future-state capabilities
Develop a test environment for a production version
1
2
3
4
5
INTEL CONFIDENTIAL
15
Big Data Planning Guide
Everything You Need to Get Started
Intel.com/ITCenter
Read the full planning guide at Intel.com/bigdata
Learn more about the Intel® Distribution for
Apache Hadoop* software at hadoop.intel.com
INTEL CONFIDENTIAL
16
Legal
This presentation is for informational purposes only. THIS DOCUMENT IS PROVIDED “AS IS”
WITH NO WARRANTIES WHATSOEVER, INCLUDING ANY WARRANTY OF MERCHANTABILITY,
NONINFRINGEMENT, FITNESS FOR ANY PARTICULAR PURPOSE, OR ANY WARRANTY
OTHERWISE ARISING OUT OF ANY PROPOSAL, SPECIFICATION, OR SAMPLE. Intel disclaims
all liability, including liability for infringement of any property rights, relating to use of this
information. No license, express or implied, by estoppel or otherwise, to any intellectual
property rights is granted herein.
Copyright © 2013 Intel Corporation. Intel, the Intel logo, and Xeon are trademarks of Intel
Corporation in the U.S. and other countries.
*Other names and brands may be claimed as the property of others.
Getting Started with Big Data: Planning Guide

Weitere ähnliche Inhalte

Was ist angesagt?

Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloud
DataWorks Summit
 
Adding structure to your streaming pipelines: moving from Spark streaming to ...
Adding structure to your streaming pipelines: moving from Spark streaming to ...Adding structure to your streaming pipelines: moving from Spark streaming to ...
Adding structure to your streaming pipelines: moving from Spark streaming to ...
DataWorks Summit
 
Iasi code camp 20 april 2013 testing big data-anca sfecla - embarcadero
Iasi code camp 20 april 2013 testing big data-anca sfecla - embarcaderoIasi code camp 20 april 2013 testing big data-anca sfecla - embarcadero
Iasi code camp 20 april 2013 testing big data-anca sfecla - embarcadero
Codecamp Romania
 
IoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldIoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected World
DataWorks Summit
 

Was ist angesagt? (16)

DataOps or how I learned to love production - Michael Hausenblas
DataOps or how I learned to love production  - Michael HausenblasDataOps or how I learned to love production  - Michael Hausenblas
DataOps or how I learned to love production - Michael Hausenblas
 
Solr consistency and recovery internals
Solr consistency and recovery internalsSolr consistency and recovery internals
Solr consistency and recovery internals
 
Splunking configfiles 20211208_daniel_wilson
Splunking configfiles 20211208_daniel_wilsonSplunking configfiles 20211208_daniel_wilson
Splunking configfiles 20211208_daniel_wilson
 
Hadoop Hadoop & Spark meetup - Altiscale
Hadoop Hadoop & Spark meetup - AltiscaleHadoop Hadoop & Spark meetup - Altiscale
Hadoop Hadoop & Spark meetup - Altiscale
 
Designing Data Pipelines for Automous and Trusted Analytics
Designing Data Pipelines for Automous and Trusted AnalyticsDesigning Data Pipelines for Automous and Trusted Analytics
Designing Data Pipelines for Automous and Trusted Analytics
 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Part 2: A Visual Dive into Machine Learning and Deep Learning 
Part 2: A Visual Dive into Machine Learning and Deep Learning 

Part 2: A Visual Dive into Machine Learning and Deep Learning 

 
Sqrrl Overview for Stac Research
Sqrrl Overview for Stac ResearchSqrrl Overview for Stac Research
Sqrrl Overview for Stac Research
 
Lessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloudLessons learned processing 70 billion data points a day using the hybrid cloud
Lessons learned processing 70 billion data points a day using the hybrid cloud
 
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
 
Big Data: Myths and Realities
Big Data: Myths and RealitiesBig Data: Myths and Realities
Big Data: Myths and Realities
 
Adding structure to your streaming pipelines: moving from Spark streaming to ...
Adding structure to your streaming pipelines: moving from Spark streaming to ...Adding structure to your streaming pipelines: moving from Spark streaming to ...
Adding structure to your streaming pipelines: moving from Spark streaming to ...
 
Iasi code camp 20 april 2013 testing big data-anca sfecla - embarcadero
Iasi code camp 20 april 2013 testing big data-anca sfecla - embarcaderoIasi code camp 20 april 2013 testing big data-anca sfecla - embarcadero
Iasi code camp 20 april 2013 testing big data-anca sfecla - embarcadero
 
Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...
Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...
Digitalising the Core – How Analytics is Shaping the Energy Industry Daniel J...
 
Evolving Hadoop for the Data Society
Evolving Hadoop for the Data SocietyEvolving Hadoop for the Data Society
Evolving Hadoop for the Data Society
 
IoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected WorldIoT: How Data Science Driven Software is Eating the Connected World
IoT: How Data Science Driven Software is Eating the Connected World
 
Cloudera Federal Forum 2014: Hadoop's Impact on the Future of Data Management
Cloudera Federal Forum 2014: Hadoop's Impact on the Future of Data ManagementCloudera Federal Forum 2014: Hadoop's Impact on the Future of Data Management
Cloudera Federal Forum 2014: Hadoop's Impact on the Future of Data Management
 

Andere mochten auch

Andere mochten auch (6)

Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
Speed up Interactive Analytic Queries over Existing Big Data on Hadoop with P...
 
Identity Protection for the Digital Age
Identity Protection for the Digital AgeIdentity Protection for the Digital Age
Identity Protection for the Digital Age
 
Real Time Interactive Queries IN HADOOP: Big Data Warehousing Meetup
Real Time Interactive Queries IN HADOOP: Big Data Warehousing MeetupReal Time Interactive Queries IN HADOOP: Big Data Warehousing Meetup
Real Time Interactive Queries IN HADOOP: Big Data Warehousing Meetup
 
Big data analysis concepts and references
Big data analysis concepts and referencesBig data analysis concepts and references
Big data analysis concepts and references
 
Demonetization
DemonetizationDemonetization
Demonetization
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 

Ähnlich wie Getting Started with Big Data: Planning Guide

Cloudwatt pioneers big_data
Cloudwatt pioneers big_dataCloudwatt pioneers big_data
Cloudwatt pioneers big_data
xband
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Innovative Management Services
 
Big data intel platform commenting
Big data   intel platform commentingBig data   intel platform commenting
Big data intel platform commenting
Intel IT Center
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and Hortonworks
Hortonworks
 
Carpe Datum: Building Big Data Analytical Applications with HP Haven
Carpe Datum: Building Big Data Analytical Applications with HP HavenCarpe Datum: Building Big Data Analytical Applications with HP Haven
Carpe Datum: Building Big Data Analytical Applications with HP Haven
DataWorks Summit
 
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
MLconf
 

Ähnlich wie Getting Started with Big Data: Planning Guide (20)

A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...
 
Big Data Intel® Platform
Big Data Intel® PlatformBig Data Intel® Platform
Big Data Intel® Platform
 
Cloudwatt pioneers big_data
Cloudwatt pioneers big_dataCloudwatt pioneers big_data
Cloudwatt pioneers big_data
 
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
 
Business Intelligence and Big Data Analytics with Pentaho
Business Intelligence and Big Data Analytics with Pentaho Business Intelligence and Big Data Analytics with Pentaho
Business Intelligence and Big Data Analytics with Pentaho
 
Haven 2 0
Haven 2 0 Haven 2 0
Haven 2 0
 
Big data intel platform commenting
Big data   intel platform commentingBig data   intel platform commenting
Big data intel platform commenting
 
Hadoop
HadoopHadoop
Hadoop
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks
 
Hadoop and SQL: Delivery Analytics Across the Organization
Hadoop and SQL:  Delivery Analytics Across the OrganizationHadoop and SQL:  Delivery Analytics Across the Organization
Hadoop and SQL: Delivery Analytics Across the Organization
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and Hortonworks
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
Hortonworks and Voltage Security webinar
Hortonworks and Voltage Security webinarHortonworks and Voltage Security webinar
Hortonworks and Voltage Security webinar
 
Carpe Datum: Building Big Data Analytical Applications with HP Haven
Carpe Datum: Building Big Data Analytical Applications with HP HavenCarpe Datum: Building Big Data Analytical Applications with HP Haven
Carpe Datum: Building Big Data Analytical Applications with HP Haven
 
Bridging the Big Data Gap in the Software-Driven World
Bridging the Big Data Gap in the Software-Driven WorldBridging the Big Data Gap in the Software-Driven World
Bridging the Big Data Gap in the Software-Driven World
 
Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2
 
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
Arun Rathinasabapathy, Senior Software Engineer, LexisNexis at MLconf ATL 2016
 
Big Data Open Source Technologies
Big Data Open Source TechnologiesBig Data Open Source Technologies
Big Data Open Source Technologies
 
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power SystemsDelivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
Delivering a Flexible IT Infrastructure for Analytics on IBM Power Systems
 

Mehr von Intel IT Center

Mehr von Intel IT Center (20)

AI Crash Course- Supercomputing
AI Crash Course- SupercomputingAI Crash Course- Supercomputing
AI Crash Course- Supercomputing
 
FPGA Inference - DellEMC SURFsara
FPGA Inference - DellEMC SURFsaraFPGA Inference - DellEMC SURFsara
FPGA Inference - DellEMC SURFsara
 
High Memory Bandwidth Demo @ One Intel Station
High Memory Bandwidth Demo @ One Intel StationHigh Memory Bandwidth Demo @ One Intel Station
High Memory Bandwidth Demo @ One Intel Station
 
INFOGRAPHIC: Advantages of Intel vs. IBM Power on SAP HANA solutions
INFOGRAPHIC: Advantages of Intel vs. IBM Power on SAP HANA solutionsINFOGRAPHIC: Advantages of Intel vs. IBM Power on SAP HANA solutions
INFOGRAPHIC: Advantages of Intel vs. IBM Power on SAP HANA solutions
 
Disrupt Hackers With Robust User Authentication
Disrupt Hackers With Robust User AuthenticationDisrupt Hackers With Robust User Authentication
Disrupt Hackers With Robust User Authentication
 
Strengthen Your Enterprise Arsenal Against Cyber Attacks With Hardware-Enhanc...
Strengthen Your Enterprise Arsenal Against Cyber Attacks With Hardware-Enhanc...Strengthen Your Enterprise Arsenal Against Cyber Attacks With Hardware-Enhanc...
Strengthen Your Enterprise Arsenal Against Cyber Attacks With Hardware-Enhanc...
 
Harness Digital Disruption to Create 2022’s Workplace Today
Harness Digital Disruption to Create 2022’s Workplace TodayHarness Digital Disruption to Create 2022’s Workplace Today
Harness Digital Disruption to Create 2022’s Workplace Today
 
Don't Rely on Software Alone. Protect Endpoints with Hardware-Enhanced Security.
Don't Rely on Software Alone.Protect Endpoints with Hardware-Enhanced Security.Don't Rely on Software Alone.Protect Endpoints with Hardware-Enhanced Security.
Don't Rely on Software Alone. Protect Endpoints with Hardware-Enhanced Security.
 
Achieve Unconstrained Collaboration in a Digital World
Achieve Unconstrained Collaboration in a Digital WorldAchieve Unconstrained Collaboration in a Digital World
Achieve Unconstrained Collaboration in a Digital World
 
Intel® Xeon® Scalable Processors Enabled Applications Marketing Guide
Intel® Xeon® Scalable Processors Enabled Applications Marketing GuideIntel® Xeon® Scalable Processors Enabled Applications Marketing Guide
Intel® Xeon® Scalable Processors Enabled Applications Marketing Guide
 
#NABshow: National Association of Broadcasters 2017 Super Session Presentatio...
#NABshow: National Association of Broadcasters 2017 Super Session Presentatio...#NABshow: National Association of Broadcasters 2017 Super Session Presentatio...
#NABshow: National Association of Broadcasters 2017 Super Session Presentatio...
 
Three Steps to Making a Digital Workplace a Reality
Three Steps to Making a Digital Workplace a RealityThree Steps to Making a Digital Workplace a Reality
Three Steps to Making a Digital Workplace a Reality
 
Three Steps to Making The Digital Workplace a Reality - by Intel’s Chad Const...
Three Steps to Making The Digital Workplace a Reality - by Intel’s Chad Const...Three Steps to Making The Digital Workplace a Reality - by Intel’s Chad Const...
Three Steps to Making The Digital Workplace a Reality - by Intel’s Chad Const...
 
Intel® Xeon® Processor E7-8800/4800 v4 EAMG 2.0
Intel® Xeon® Processor E7-8800/4800 v4 EAMG 2.0Intel® Xeon® Processor E7-8800/4800 v4 EAMG 2.0
Intel® Xeon® Processor E7-8800/4800 v4 EAMG 2.0
 
Intel® Xeon® Processor E5-2600 v4 Enterprise Database Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Enterprise Database Applications ShowcaseIntel® Xeon® Processor E5-2600 v4 Enterprise Database Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Enterprise Database Applications Showcase
 
Intel® Xeon® Processor E5-2600 v4 Core Business Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Core Business Applications ShowcaseIntel® Xeon® Processor E5-2600 v4 Core Business Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Core Business Applications Showcase
 
Intel® Xeon® Processor E5-2600 v4 Financial Security Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Financial Security Applications ShowcaseIntel® Xeon® Processor E5-2600 v4 Financial Security Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Financial Security Applications Showcase
 
Intel® Xeon® Processor E5-2600 v4 Telco Cloud Digital Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Telco Cloud Digital Applications ShowcaseIntel® Xeon® Processor E5-2600 v4 Telco Cloud Digital Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Telco Cloud Digital Applications Showcase
 
Intel® Xeon® Processor E5-2600 v4 Tech Computing Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Tech Computing Applications ShowcaseIntel® Xeon® Processor E5-2600 v4 Tech Computing Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Tech Computing Applications Showcase
 
Intel® Xeon® Processor E5-2600 v4 Big Data Analytics Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Big Data Analytics Applications ShowcaseIntel® Xeon® Processor E5-2600 v4 Big Data Analytics Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Big Data Analytics Applications Showcase
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 

Getting Started with Big Data: Planning Guide

  • 1. INTEL CONFIDENTIAL, FOR INTERNAL USE ONLY 1 Getting Started with Big Data How to Move Forward with Apache Hadoop* Software
  • 2. INTEL CONFIDENTIAL 2 Five Things to Know Big data is a disruptive force that can drive competitive advantage Apache Hadoop* software is an emerging technology for big data analytics There are two approaches to implementing big data projects Intel® technologies and software support big data Optimize and tune your big data environment for best performance 1 2 3 4 5
  • 3. INTEL CONFIDENTIAL 3 Big Data Volume, Variety, and Velocity Volume: Data sets that are orders of magnitude larger than you have handled before • The digital universe of data could reach 8 zettabytes of data by 20151 • That equals the data held by 18 million U.S. Libraries of Congress2 Variety: More diverse data types, including: • Structured (transactions, customer information) • Semistructured and unstructured (web logs, e-mails, documents, images, video) Velocity: Arriving faster than ever before • Real-time streaming data 1 Gens, Frank. IDC Predictions 2012: Competing for 2020. IDC (December 2011). 2 “Big Data Infographic and Gartner 2012 Top 10 Strategic TechTrends.” Business Analytics 3.0 (blog) (November 11, 2011).
  • 4. INTEL CONFIDENTIAL 4 Getting Bigger Billions of Connected Devices and Internet Users Source: Savitz, Eric. “Cisco Predicts the Rise of the Zettabyte Era.” Forbes (May 30, 2012). forbes.com/sites/ericsavitz/2012/05/30/cisco-predicts-the-rise-of-the-zettabyte-era/ By 2016, 19 billion connected devices—including 3.4 billion Internet users and machine-to-machine connections−will contribute to the flood of big data.
  • 5. INTEL CONFIDENTIAL 5 The Reason for All the Buzz Big Data Drives Competitive Advantage The real value of big data is in the insights it produces when analyzed: Finding patterns Deriving meaning Making decisions Responding to the world with intelligence
  • 6. INTEL CONFIDENTIAL 6 The Apache Hadoop* Framework An Emerging Approach to Big Data Analytics Open-source software that provides a simple programming model for distributed processing of large data sets • Provides a massively scalable storage and a data processing system (not a database) built on clusters of computers • Supplements your existing systems by handling data that’s typically a problem for them - Too large - Unstructured - Mix of types - Real-time streaming
  • 7. INTEL CONFIDENTIAL 7 It handles all kinds of data. It scales quickly and affordably. It reveals new insight. . It reduces costs. It delivers higher availability. It lowers organizational risk. Apache Hadoop* Breakthroughs Advantages over Traditional Systems No need to develop specific schemas. Add more servers and storage as you need it! Find hidden relationships that were difficult— or even impossible—to find in the past. • Open-source software that runs on standard servers. • Lower cost per terabyte for storage and processing. Fault tolerant; designed to recover from hardware, software, and system failures. Apache Hadoop* innovations continue through an active and diverse global community.
  • 8. INTEL CONFIDENTIAL 8 Two Approaches to Apache Hadoop* What’s Right for Your Organization? Apache Hadoop* software-only deployments • Free Apache Hadoop open-source software • Vendor distributions that prepackage Hadoop* software with value-added enhancements and services 1 Hadoop software integrated with traditional databases • Extend existing data warehousing and analytics platforms to include Hadoop software 2
  • 9. INTEL CONFIDENTIAL 9 Apache Hadoop* Deployment Put the Right Infrastructure in Place Clusters of standard servers 10 gigabit Ethernet networking Intelligent storage Apache Hadoop* software
  • 10. INTEL CONFIDENTIAL 10 Intel® Technologies for Big Data Get Maximum Performance Server clusters: Intel® Xeon® processor E5 family Networking: Intel ® Ethernet 10 Gigabit Converged Network Adapters Storage: Intel ® Solid-State Drives Software: Intel ® Distribution for Apache Hadoop* software (Intel Distribution)1 1 Currently available in China, Taiwan, and the United States.
  • 11. INTEL CONFIDENTIAL 11 Intel® Distribution for Apache Hadoop* Software Enterprise ready for a variety of use cases1 Supports a wide range of analytics • Enhances Apache Hive* and Apache HBase* software Introduces graph analytics capabilities with Intel® GraphBuilder soft ware • Provides a Java library for constructing graphs that help visualize data relationships Optimizes open-source Apache Hadoop* components • Takes advantage of Intel Xeon® processor capabilities Hadoop* security, scalability, and management enhancements • Tightly integrated into the platform Support and services from Intel and its partners Find out more about the Intel Distribution 1 Currently available in China, Taiwan, and the United States.
  • 12. INTEL CONFIDENTIAL 12 Apache Hadoop* Optimization Practical Trade-offs for Hardware, Software, and System Settings Fine-tune your solution for best performance: Maximize productivity Limit energy consumption Maximize resource utilization Reduce operating costs Lower your total cost of ownership
  • 13. INTEL CONFIDENTIAL 13 Benchmark Performance Intel’s HiBench Suite Comprehensive set of benchmark tests for Apache Hadoop*software Represents important Hadoop* workloads and analytics with a mix of hardware usage characteristics Available as open-source software under Apache License 2.0 at https://github.com/hibench/HiBench-2.1
  • 14. INTEL CONFIDENTIAL 14 Get Started Five Steps for IT Managers Work with your business users to articulate the big opportunities Do your research to get up to speed on the technology Develop use case(s) for your project Identify gaps between current- and future-state capabilities Develop a test environment for a production version 1 2 3 4 5
  • 15. INTEL CONFIDENTIAL 15 Big Data Planning Guide Everything You Need to Get Started Intel.com/ITCenter Read the full planning guide at Intel.com/bigdata Learn more about the Intel® Distribution for Apache Hadoop* software at hadoop.intel.com
  • 16. INTEL CONFIDENTIAL 16 Legal This presentation is for informational purposes only. THIS DOCUMENT IS PROVIDED “AS IS” WITH NO WARRANTIES WHATSOEVER, INCLUDING ANY WARRANTY OF MERCHANTABILITY, NONINFRINGEMENT, FITNESS FOR ANY PARTICULAR PURPOSE, OR ANY WARRANTY OTHERWISE ARISING OUT OF ANY PROPOSAL, SPECIFICATION, OR SAMPLE. Intel disclaims all liability, including liability for infringement of any property rights, relating to use of this information. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted herein. Copyright © 2013 Intel Corporation. Intel, the Intel logo, and Xeon are trademarks of Intel Corporation in the U.S. and other countries. *Other names and brands may be claimed as the property of others.