Suche senden
Hochladen
Hadoopを業務で使ってみました
•
Als KEY, PDF herunterladen
•
4 gefällt mir
•
2,196 views
Tatsuya Sasaki
Folgen
テックライフLT #tllt で使ったスライドです
Weniger lesen
Mehr lesen
Technologie
Bildung
Melden
Teilen
Melden
Teilen
1 von 23
Jetzt herunterladen
Empfohlen
800万人の"食べたい"をHadoopで分散処理
800万人の"食べたい"をHadoopで分散処理
Tatsuya Sasaki
マーケティングのためのHadoop利用
マーケティングのためのHadoop利用
Tatsuya Sasaki
961万人の食卓を支えるデータ解析
961万人の食卓を支えるデータ解析
Tatsuya Sasaki
Big Data in the Microsoft Platform
Big Data in the Microsoft Platform
Jesus Rodriguez
Cloud Friendly Hadoop and Hive
Cloud Friendly Hadoop and Hive
DataWorks Summit
Intro to cassandra + hadoop
Intro to cassandra + hadoop
Jeremy Hanna
Yahoo! - Arun Murthy - Hadoop World 2010
Yahoo! - Arun Murthy - Hadoop World 2010
Cloudera, Inc.
Hadoop_content_by_sasidhar2
Hadoop_content_by_sasidhar2
Akshara Technologies Training by Industry Experts
Empfohlen
800万人の"食べたい"をHadoopで分散処理
800万人の"食べたい"をHadoopで分散処理
Tatsuya Sasaki
マーケティングのためのHadoop利用
マーケティングのためのHadoop利用
Tatsuya Sasaki
961万人の食卓を支えるデータ解析
961万人の食卓を支えるデータ解析
Tatsuya Sasaki
Big Data in the Microsoft Platform
Big Data in the Microsoft Platform
Jesus Rodriguez
Cloud Friendly Hadoop and Hive
Cloud Friendly Hadoop and Hive
DataWorks Summit
Intro to cassandra + hadoop
Intro to cassandra + hadoop
Jeremy Hanna
Yahoo! - Arun Murthy - Hadoop World 2010
Yahoo! - Arun Murthy - Hadoop World 2010
Cloudera, Inc.
Hadoop_content_by_sasidhar2
Hadoop_content_by_sasidhar2
Akshara Technologies Training by Industry Experts
Qubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant Conference
Joydeep Sen Sarma
Cassandra/Hadoop Integration
Cassandra/Hadoop Integration
Jeremy Hanna
Cloud Optimized Big Data
Cloud Optimized Big Data
Joydeep Sen Sarma
PySpark Cassandra - Amsterdam Spark Meetup
PySpark Cassandra - Amsterdam Spark Meetup
Frens Jan Rumph
Hadoop big data online training
Hadoop big data online training
Magnific Trainings
Hadoop 101 - Big Data Technology
Hadoop 101 - Big Data Technology
Firman Gautama
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Yahoo Developer Network
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Big Data Joe™ Rossi
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Uwe Printz
Introduction to Apache Hivemall v0.5.2 and v0.6
Introduction to Apache Hivemall v0.5.2 and v0.6
Makoto Yui
Productive data engineer
Productive data engineer
Rafał Wojdyła
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Mitsuharu Hamba
Hadoop introduction
Hadoop introduction
shubham kuwar
Drill at the Chug 9-19-12
Drill at the Chug 9-19-12
Ted Dunning
Hadoop basics
Hadoop basics
Antonio Silveira
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Robert Stupp
Hadoop training
Hadoop training
TIB Academy
Introduction to pig & pig latin
Introduction to pig & pig latin
knowbigdata
Map reduce and hadoop at mylife
Map reduce and hadoop at mylife
responseteam
からあげエンジニアについて
からあげエンジニアについて
Tatsuya Sasaki
クックパッドでのemr利用事例
クックパッドでのemr利用事例
Tatsuya Sasaki
からあげとビーチと私
からあげとビーチと私
Tatsuya Sasaki
Weitere ähnliche Inhalte
Was ist angesagt?
Qubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant Conference
Joydeep Sen Sarma
Cassandra/Hadoop Integration
Cassandra/Hadoop Integration
Jeremy Hanna
Cloud Optimized Big Data
Cloud Optimized Big Data
Joydeep Sen Sarma
PySpark Cassandra - Amsterdam Spark Meetup
PySpark Cassandra - Amsterdam Spark Meetup
Frens Jan Rumph
Hadoop big data online training
Hadoop big data online training
Magnific Trainings
Hadoop 101 - Big Data Technology
Hadoop 101 - Big Data Technology
Firman Gautama
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Yahoo Developer Network
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Big Data Joe™ Rossi
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Uwe Printz
Introduction to Apache Hivemall v0.5.2 and v0.6
Introduction to Apache Hivemall v0.5.2 and v0.6
Makoto Yui
Productive data engineer
Productive data engineer
Rafał Wojdyła
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Mitsuharu Hamba
Hadoop introduction
Hadoop introduction
shubham kuwar
Drill at the Chug 9-19-12
Drill at the Chug 9-19-12
Ted Dunning
Hadoop basics
Hadoop basics
Antonio Silveira
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Robert Stupp
Hadoop training
Hadoop training
TIB Academy
Introduction to pig & pig latin
Introduction to pig & pig latin
knowbigdata
Map reduce and hadoop at mylife
Map reduce and hadoop at mylife
responseteam
Was ist angesagt?
(19)
Qubole Overview at the Fifth Elephant Conference
Qubole Overview at the Fifth Elephant Conference
Cassandra/Hadoop Integration
Cassandra/Hadoop Integration
Cloud Optimized Big Data
Cloud Optimized Big Data
PySpark Cassandra - Amsterdam Spark Meetup
PySpark Cassandra - Amsterdam Spark Meetup
Hadoop big data online training
Hadoop big data online training
Hadoop 101 - Big Data Technology
Hadoop 101 - Big Data Technology
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Flickr: Computer vision at scale with Hadoop and Storm (Huy Nguyen)
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Hadoop: Past, Present and Future - v2.1 - SQLSaturday #340
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to the Hadoop Ecosystem (codemotion Edition)
Introduction to Apache Hivemall v0.5.2 and v0.6
Introduction to Apache Hivemall v0.5.2 and v0.6
Productive data engineer
Productive data engineer
Hive vs Pig for HadoopSourceCodeReading
Hive vs Pig for HadoopSourceCodeReading
Hadoop introduction
Hadoop introduction
Drill at the Chug 9-19-12
Drill at the Chug 9-19-12
Hadoop basics
Hadoop basics
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Cassandra + Spark (You’ve got the lighter, let’s start a fire)
Hadoop training
Hadoop training
Introduction to pig & pig latin
Introduction to pig & pig latin
Map reduce and hadoop at mylife
Map reduce and hadoop at mylife
Mehr von Tatsuya Sasaki
からあげエンジニアについて
からあげエンジニアについて
Tatsuya Sasaki
クックパッドでのemr利用事例
クックパッドでのemr利用事例
Tatsuya Sasaki
からあげとビーチと私
からあげとビーチと私
Tatsuya Sasaki
メタプログラミングでDSLを書こう
メタプログラミングでDSLを書こう
Tatsuya Sasaki
NoSQLデータベースが登場した背景と特徴
NoSQLデータベースが登場した背景と特徴
Tatsuya Sasaki
Hadoopをemr経由で利用する方法
Hadoopをemr経由で利用する方法
Tatsuya Sasaki
COOKPADでのHadoop利用
COOKPADでのHadoop利用
Tatsuya Sasaki
Hadoop導入事例 in クックパッド
Hadoop導入事例 in クックパッド
Tatsuya Sasaki
Hadoopを業務で使ってみた
Hadoopを業務で使ってみた
Tatsuya Sasaki
YUI
YUI
Tatsuya Sasaki
Mehr von Tatsuya Sasaki
(10)
からあげエンジニアについて
からあげエンジニアについて
クックパッドでのemr利用事例
クックパッドでのemr利用事例
からあげとビーチと私
からあげとビーチと私
メタプログラミングでDSLを書こう
メタプログラミングでDSLを書こう
NoSQLデータベースが登場した背景と特徴
NoSQLデータベースが登場した背景と特徴
Hadoopをemr経由で利用する方法
Hadoopをemr経由で利用する方法
COOKPADでのHadoop利用
COOKPADでのHadoop利用
Hadoop導入事例 in クックパッド
Hadoop導入事例 in クックパッド
Hadoopを業務で使ってみた
Hadoopを業務で使ってみた
YUI
YUI
Kürzlich hochgeladen
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
UiPathCommunity
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
Inflectra
How to write a Business Continuity Plan
How to write a Business Continuity Plan
Databarracks
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Mark Goldstein
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
Hiroshi SHIBATA
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
panagenda
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
DianaGray10
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
ThousandEyes
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
Knoldus Inc.
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
AliaaTarek5
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
ThousandEyes
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Pim van der Noll
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
LoriGlavin3
2024 April Patch Tuesday
2024 April Patch Tuesday
Ivanti
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
Sergiu Bodiu
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
IES VE
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
Rick Flair
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
Farhan Tariq
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
Ravi Sanghani
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
Curtis Poe
Kürzlich hochgeladen
(20)
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
How to write a Business Continuity Plan
How to write a Business Continuity Plan
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
2024 April Patch Tuesday
2024 April Patch Tuesday
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
Hadoopを業務で使ってみました
1.
Hadoop
2.
• id:sasata299 • • Ruby
Perl •
3.
Hadoop
4.
816 30
3 1
5.
(
)
6.
7.
• • GROUP BY
( ( Д`) • 7000 ( )
8.
9.
Hadoop
10.
Hadoop • Hadoop Streaming •
Ruby • Amazon EC2 Hadoop • 50
11.
Hadoop Streaming
12.
•
( ) • Mapper Reducer
13.
14.
HDFS Mapper, Reducer
15.
Java ( or
JRuby ) Java API
16.
Hadoop Streaming
…orz
17.
18.
Hadoop
cat `hadoop dfs -cat s3://xxxx/user/root/in/hoge` HDFS
19.
7000
( )→
20.
7000
( )→ 30
21.
Hadoop
!!
22.
• Hadoop Streaming
HDFS (Hadoop cat ) • 7000 30 Hadoop
Jetzt herunterladen