21. Infrastructure Comparison for Data Analysis
Previous Analysis Tools
• Data Analysis Tool
• Local: Data Aggregation
MapReduce based Analysis
• Data Processing/Extraction and Analysis Tool
• Data Mining / Data Processing/Extraction
• MapReduce Framework
• Distributed Big Data System based on Hadoop
……
23. Definition of MapReduce
From Wikipedia:
MapReduce is a programming model for processing large data sets with a parallel, distributed algorithm on a cluster.
A MapReduce program comprises a Map() procedure that performs filtering and sorting and a Reduce() procedure that performs a summary operation.
• The data processing model of big data platforms, of which Hadoop is the representative example
• Map/Reduce: a distributed data processing structure (framework) built on key/value pairs
• First published by Google in 2004
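
The key/value structure of Map/Reduce can be illustrated with a small word count. This is a minimal single-process sketch in plain Python, not Hadoop's API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample sentences are assumptions made for illustration, and a real cluster would run the map and reduce calls in parallel on many nodes.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) key/value pair for every word."""
    for word in document.lower().split():
        yield (word, 1)


def shuffle(pairs):
    """Group values by key, as a framework would do between Map and Reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: summarize all values of one key (here, sum the counts)."""
    return (key, sum(values))


def word_count(documents):
    """Run the three steps in sequence in a single process (hypothetical driver)."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Illustrative input only; not data from the slides.
counts = word_count([
    "big data on Hadoop",
    "Hadoop runs MapReduce",
    "big clusters",
])
print(counts)
```

Each map call only sees its own input and each reduce call only sees one key, which is what allows a framework to distribute both steps across a cluster.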