2. Storm is a free and open source distributed realtime
computation system
2
What is Storm
Hadoop : Batch Processing :: Storm : Real-Time Processing
Provides general primitive to do real time computation
Scalable and Fault-tolerant
Guaranteed Message Processing atleast once
Can be used with any programming language
11. 11
DRPC
Distributed Remote Procedure Call
DRPC parallelizes the computation of really intense
functions on the fly using Storm
The Storm topology takes in as input a stream of
function arguments, and it emits an output stream of
the results for each of those function calls.
DRPC is not so much a feature of Storm as it is a
pattern expressed from Storm's primitives of streams,
spouts, bolts, and topologies.
13. 13
Trident
What is Trident?
Trident is a high-level abstraction for doing realtime
computing on top of Storm.
Similar to Cascading or Pig in Hadoop
Easier to build topologies
Trident has joins, aggregations, grouping, functions,
and filters
Trident lets you express realtime computations in a
natural way while still getting maximal performance.