This document summarizes an agenda for the SARA Hadoop Hackathon on December 7, 2010. It provides background on Hadoop and how it relates to earlier technologies like Nutch and MapReduce. It then outlines the agenda for the day which includes introductions, presentations on MapReduce at University of Twente and a kickoff for the hackathon project building period. An optional tour of the SARA facilities is also included. The day will conclude with presentations of hackathon results.
2. DJOERD HIEMSTRA
(UTwente)
EDGAR MEIJ
(UvA)
SARA Hadoop Hackathon, December 7, 2010
3. 2002 2004 2006
Nutch* MR/GFS** Hadoop
* http://nutch.apache.org/
** http://labs.google.com/papers/mapreduce.html
http://labs.google.com/papers/gfs.html
SARA Hadoop Hackathon, December 7, 2010
4. 2010: A Hype in Production
http://wiki.apache.org/hadoop/PoweredBy
SARA Hadoop Hackathon, December 7, 2010
5. Super computing
Cloud computing Grid computing
Cluster computing GPU computing
http://www.sara.nl/
SARA Hadoop Hackathon, December 7, 2010
6. :-(
Data Expensive!
Computation
:-)
Data Cheaper!
Computation
Ref: Luiz André Barroso and Urs Hölzle, Google Inc.
The Datacenter as a Computer: An Introduction to the Design of WarehouseScale Machines
SARA Hadoop Hackathon, December 7, 2010