15. Pulling Data From Twitter
• Custom source, using twitter4j
• Sources process data as discrete events
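The "discrete events" idea can be sketched in plain Java: each tweet that arrives becomes exactly one event, made of string headers plus a byte body, and is handed downstream right away. This is only an illustration under stated assumptions. `TweetEvent`, `TweetSourceSketch`, and `onTweet` are hypothetical names, not Flume or twitter4j classes, and the `Consumer` stands in for whatever receives events from the source. The `timestamp` header (milliseconds since the epoch) is included because Flume's time-based path escapes read a header of that name.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Consumer;

// Hypothetical stand-in for an event: string headers plus an opaque byte body.
final class TweetEvent {
    final Map<String, String> headers;
    final byte[] body;

    TweetEvent(Map<String, String> headers, byte[] body) {
        this.headers = headers;
        this.body = body;
    }
}

// Hypothetical source: turns each incoming tweet into exactly one event.
final class TweetSourceSketch {
    // Stand-in for the downstream consumer of events (assumption, not a Flume type).
    private final Consumer<TweetEvent> downstream;

    TweetSourceSketch(Consumer<TweetEvent> downstream) {
        this.downstream = downstream;
    }

    // Called once per tweet, for example from a streaming listener callback.
    void onTweet(String rawJson, long createdAtMillis) {
        Map<String, String> headers = new HashMap<>();
        // Epoch milliseconds, so a sink can later bucket the event by time.
        headers.put("timestamp", Long.toString(createdAtMillis));
        byte[] body = rawJson.getBytes(StandardCharsets.UTF_8);
        downstream.accept(new TweetEvent(headers, body));
    }

    public static void main(String[] args) {
        List<TweetEvent> delivered = new ArrayList<>();
        TweetSourceSketch source = new TweetSourceSketch(delivered::add);
        source.onTweet("{\"text\":\"hello\"}", 1356998400000L);
        source.onTweet("{\"text\":\"world\"}", 1356998401000L);
        System.out.println(delivered.size() + " events delivered");
    }
}
```

In a real custom source the callback would presumably be driven by twitter4j's streaming listener, and the event would be passed to Flume rather than to a list. The one-tweet-to-one-event shape is the part this sketch is meant to show.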
16. Loading Data Into HDFS
• HDFS Sink comes stock with Flume
• Easily separate files by creation time
• hdfs://hadoop1:8020/user/flume/tweets/%Y/%m/%d/%H/
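To make the `%Y/%m/%d/%H` placeholders in that path concrete, here is a minimal Java sketch of how such a template could be expanded from an event's timestamp. It is not Flume's implementation: `BucketPathSketch` and `expand` are hypothetical names, only the four escapes used on the slide are handled, and the time zone is passed in explicitly (UTC in the example, as an assumption).

```java
import java.time.Instant;
import java.time.ZoneId;
import java.time.ZoneOffset;
import java.time.ZonedDateTime;

// Hypothetical helper illustrating time-based bucketing of an output path.
final class BucketPathSketch {

    // Replaces %Y, %m, %d and %H with the zero-padded year, month, day and
    // hour of the given timestamp, interpreted in the given zone.
    static String expand(String template, long epochMillis, ZoneId zone) {
        ZonedDateTime t = Instant.ofEpochMilli(epochMillis).atZone(zone);
        return template
                .replace("%Y", String.format("%04d", t.getYear()))
                .replace("%m", String.format("%02d", t.getMonthValue()))
                .replace("%d", String.format("%02d", t.getDayOfMonth()))
                .replace("%H", String.format("%02d", t.getHour()));
    }

    public static void main(String[] args) {
        String template = "hdfs://hadoop1:8020/user/flume/tweets/%Y/%m/%d/%H/";
        // 1357045200000 ms is 2013-01-01T13:00:00Z.
        System.out.println(expand(template, 1357045200000L, ZoneOffset.UTC));
        // prints hdfs://hadoop1:8020/user/flume/tweets/2013/01/01/13/
    }
}
```

Every event whose timestamp falls in the same hour maps to the same directory, which is what gives one directory per hour of tweets.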