1. Living on a Cloud Dr Keith Marlow Chief Architect, APAC Region [email_address]
2.
3.
4.
5.
6.
7.
8.
9. How does Hadoop scale? Map/Reduce Input Map Map Map Map Transient Data Results Reduce Reduce Reduce Reduce Split into ‘ bits’ Process the ‘bits’ on each node Collate each ‘bin’ on each node Shuffle into ‘ bins’ Join it all together
10.
11.
12. Hadoop – How do we use it? What 20,0000 nodes look like