Hadoop - Simple. Scalable.
elliando dias
1.
Hadoop - Simple. Scalable.
2.
@markgunnels mark@catamorphiclabs.com
3.
Java. Clojure. Ruby.
Cloudera Certified
4.
posscon.org
April 15, 16, and 17
5.
Agenda
Overview
Massively Large Data Sets and the problems therein
Distributed File System
MapReduce
Pig
6.
Overview
7.
Doug Cutting
Genius
8.
Favorite Hadoop Story
New York Times
9.
4 Terabytes of Source Articles.
10.
24 Hours.
11.
5.5 Terabytes of PDFs.
12.
Did it again.
13.
$240.
14.
Infoporn from Yahoo
73 hours
490 TB Shuffling
280 TB Output
4000 Nodes
16 PB Disk Space
32K Cores
64 TB RAM
15.
Hadoop solves...
16.
Analyzing Massively Large Datasets
17.
Two Problems
You have to distribute.
18.
Data Storage
Capacity has grown far faster than read speeds.
Datasets won't fit on one disk.
Tolerate node failure.
19.
Data Analysis
Combine data from many machines.
Tolerate node failure.
20.
How Hadoop solves these problems.
21.
Send Code to Data. Not Data to Code.
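A minimal sketch of the idea on this slide, not Hadoop's actual scheduler: given a record of which nodes hold a block, the task goes to a free node that already stores the data, and the data only travels when no such node is available. The names `pick_node`, `block_locations`, and `free_nodes`, and the node and block labels, are hypothetical and chosen only for illustration.

```python
def pick_node(block_id, block_locations, free_nodes):
    """Prefer a free node that already stores the block (data-local)."""
    local = [n for n in block_locations.get(block_id, []) if n in free_nodes]
    if local:
        return local[0], "data-local"
    # No free node holds the block, so the data has to travel to the code.
    return sorted(free_nodes)[0], "remote"


# Hypothetical cluster state: which nodes store which block.
block_locations = {"blk_1": ["node-a", "node-c"], "blk_2": ["node-b"]}
free_nodes = {"node-a", "node-d"}

choice_1 = pick_node("blk_1", block_locations, free_nodes)
choice_2 = pick_node("blk_2", block_locations, free_nodes)
print(choice_1, choice_2)
```

The first task runs where its block already lives; the second cannot, because the only node holding `blk_2` is busy.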
22.
Data Storage
HDFS
23.
Name Node. Data Nodes.
Master - Slave Relationship
24.
Shard massive files across multiple machines.
MB, GB, and TB
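The sharding described on this slide can be sketched roughly as follows. This is an illustration under stated assumptions, not HDFS code: the 64 MB block size, the round-robin assignment, and the helper names `split_into_blocks` and `assign_round_robin` are all chosen for the example.

```python
def split_into_blocks(file_size, block_size):
    """Return (offset, length) pairs that cover a file of file_size bytes."""
    blocks = []
    offset = 0
    while offset < file_size:
        length = min(block_size, file_size - offset)
        blocks.append((offset, length))
        offset += length
    return blocks


def assign_round_robin(blocks, nodes):
    """Spread block indexes across nodes in order (a deliberate simplification)."""
    return {i: nodes[i % len(nodes)] for i in range(len(blocks))}


MB = 1024 * 1024
blocks = split_into_blocks(200 * MB, 64 * MB)  # assumed block size for the sketch
placement = assign_round_robin(blocks, ["node-a", "node-b", "node-c"])
print(len(blocks), placement)
```

A 200 MB file becomes three full blocks plus one 8 MB remainder, and no single machine has to hold the whole file.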
25.
Tolerant of Node Failure
Files are replicated across 3 nodes by default.
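To make the fault-tolerance claim concrete, here is a small sketch that assumes three replicas per block and a simple rotating placement. Real HDFS placement is more involved (it takes racks into account), so `place_replicas` and `readable_after_failure` are hypothetical helpers, not the HDFS algorithm.

```python
def place_replicas(num_blocks, nodes, replication=3):
    """Put each block on `replication` distinct nodes, rotating the start node."""
    if replication > len(nodes):
        raise ValueError("need at least as many nodes as replicas")
    return {
        b: [nodes[(b + r) % len(nodes)] for r in range(replication)]
        for b in range(num_blocks)
    }


def readable_after_failure(placement, failed):
    """Every block stays readable if at least one replica is on a live node."""
    return all(
        any(node not in failed for node in replicas)
        for replicas in placement.values()
    )


nodes = ["node-a", "node-b", "node-c", "node-d"]
placement = place_replicas(4, nodes)
survives_one = readable_after_failure(placement, {"node-b"})
survives_two = readable_after_failure(placement, {"node-a", "node-b"})
survives_three = readable_after_failure(placement, {"node-a", "node-b", "node-c"})
print(survives_one, survives_two, survives_three)
```

With three replicas, any two nodes can fail without data loss; in this layout, losing three nodes takes out every copy of the first block.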
26.
HDFS behaves like a normal file system.
No true appends yet.
27.
Demonstration.
28.
Data Analysis
MapReduce
29.
Job Tracker. Task Nodes.
Master - Slave Relationship.
30.
map
31.
Demonstration
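The live demonstration is not part of this transcript. As a stand-in, here is a minimal sketch of what `map` does, written in Python for illustration (the original demo may well have used another language): it applies one function to every element independently, which is what makes the work easy to split up. The word list is made up.

```python
words = ["hadoop", "pig", "hive"]

# map applies the function to each element on its own; no element
# depends on any other, so the work could be handed out in pieces.
lengths = list(map(len, words))
pairs = list(map(lambda w: (w, 1), words))

print(lengths)
print(pairs)
```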
32.
pmap
33.
Demonstration
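The demonstration for `pmap` (a parallel `map`) is likewise missing from the transcript. A rough Python analogue, assuming a thread pool stands in for the parallel workers, is `Executor.map` from the standard library: same inputs, same ordered results, but the calls may run concurrently. The function `square` and the input list are invented for the example.

```python
from concurrent.futures import ThreadPoolExecutor


def square(n):
    return n * n


numbers = [1, 2, 3, 4, 5]

with ThreadPoolExecutor(max_workers=4) as pool:
    # Executor.map yields results in input order, like the sequential map.
    parallel = list(pool.map(square, numbers))

sequential = list(map(square, numbers))
print(parallel == sequential, parallel)
```

Because each call is independent, switching from sequential to parallel changes the running time but not the answer.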
34.
reduce
35.
Demonstration
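The `reduce` demonstration is not in the transcript either. A minimal Python sketch of the idea: `reduce` folds a sequence into a single value by repeatedly combining an accumulator with the next element. The helper `tally` and the sample data are hypothetical.

```python
from functools import reduce
import operator

# Fold a list of numbers into one sum, starting from 0.
total = reduce(operator.add, [1, 2, 3, 4], 0)


def tally(counts, word):
    """Return a new dict with the count for `word` increased by one."""
    updated = dict(counts)
    updated[word] = updated.get(word, 0) + 1
    return updated


# Fold a list of words into a word-count table, starting from an empty dict.
counts = reduce(tally, ["pig", "hive", "pig"], {})
print(total, counts)
```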
36.
(reduce (pmap))
37.
Demonstration.
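The combined form on this slide, `(reduce (pmap))`, is the MapReduce pattern in miniature: map over the pieces in parallel, then reduce the partial results into one answer. The sketch below is a single-machine Python analogue under that reading; the chunks of text and the helpers `count_words` and `merge` are made up for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def count_words(chunk):
    """Map step: count the words in one chunk of text."""
    return Counter(chunk.split())


def merge(left, right):
    """Reduce step: combine two partial counts."""
    return left + right


chunks = ["pig hive pig", "hbase pig", "hive hbase hbase"]

with ThreadPoolExecutor() as pool:
    partial_counts = list(pool.map(count_words, chunks))

totals = reduce(merge, partial_counts, Counter())
print(dict(totals))
```

Each chunk could live on a different machine; only the small partial counts need to be brought together for the final merge.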
38.
MapReduce
Java
39.
Nobody likes it.
:-)
40.
MapReduce
Ruby. Python. Unix Utilities.
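This slide presumably refers to Hadoop Streaming, where the mapper and reducer are ordinary programs that read lines on standard input and write tab-separated key/value lines on standard output. The sketch below assumes that reading and shows a word count in that style; `mapper`, `reducer`, and `run_locally` are hypothetical names, and the sort between the two steps stands in for Hadoop's shuffle.

```python
from itertools import groupby


def mapper(lines):
    """Emit one 'word<TAB>1' line per word."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reducer(sorted_lines):
    """Sum the counts for each word; input must be sorted by key."""
    pairs = (line.rstrip("\n").split("\t", 1) for line in sorted_lines)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


def run_locally(lines):
    """Simulate map -> sort -> reduce in a single process."""
    return list(reducer(sorted(mapper(lines))))


result = run_locally(["pig hive pig", "hive hbase"])
print(result)
```

In a real streaming job each function would live in its own script that reads `sys.stdin` and prints its output, and the two scripts would be passed to the job as the `-mapper` and `-reducer` arguments.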
41.
MapReduce Clojure
42.
Hadoop Ecosystem
Pigkeeper. Hive. Cascading.
43.
Pig
44.
HBase