3. Big data experience
Big Data Platforms
Technologies
Google BigQuery
Hadoop MapReduce/HDFS
Amazon EC2, EMR
Army Cloud
DIA Cloud
NoSQL: Hbase, Accumulo, MongoDB
Apache Storm: data stream processing
DIA Cloud
4. Symptoms of a big data problem
o If what you are doing works for you, don’t change it!
o Storage space
o Data throughput
o Computations take too long
o Queries take too long
o You have lots of disparate data
5. Real world problem example
o Customer receives daily “deliveries” of textual data
o Couldn’t get all the data loaded into the server. Wouldn’t fit
and it was taking a long time
o Unable to run algorithms on in a timely manner
o User interfaces were sluggish because of slow query times