2. www.edureka.co/apache-spark-scala-training
What will you learn today ?
What is Apache Spark ?
How Spark fits into Hadoop Ecosystem ?
Why Spark for Big Data Analytics ?
Spark’s popularity
Hands-On : Analyzing data with Spark
4. www.edureka.co/apache-spark-scala-training
How Spark fits into Hadoop Ecosystem ?
Spark is intended to enhance, not replace, the Hadoop stack
Spark is designed to read and write data to HDFS as well as other storage systems such as
CSV files, Amazon S3 and NoSQL databases
6. www.edureka.co/apache-spark-scala-training
Why Spark for Big Data Analytics ?
Following features make Spark, the best fit for Big Data Analytics :
Spark simplifies data analysis
Spark provides built-in libraries to do advanced analytics
Spark speaks more than one language
Spark provides faster results
Spark allows you to use different Hadoop vendors
8. www.edureka.co/apache-spark-scala-training
Word Count Problem - Spark
Spark Scala Code for Word Count Problem
Spark Python Code for Word Count Problem
Clearly processing data with Spark is much
easier than MapReduce and Spark gives you
the flexibility to choose your favorite
language Scala, Java, Python etc.
10. www.edureka.co/apache-spark-scala-training
Spark Libraries
Spark SQL : Spark’s module for working with structured data
MLlib : Spark’s machine learning library
GraphX : Spark’s API for graph computation
Spark Streaming : Spark’s API to process streaming data
15. www.edureka.co/apache-spark-scala-training
Spark is here to stay
Spark is not one of those "here today, gone tomorrow". Spark is here to stay
for the foreseeable future, and it is well worth to get your teeth into it in
order to get some value out of your data
17. www.edureka.co/apache-spark-scala-training
References
IBM backs Apache Spark for Big Data Analytics :
http://www.forbes.com/sites/paulmiller/2015/06/15/ibm-backs-apache-spark-for-big-data-analytics/
How eBay uses Spark to ignite Data Analytics :
http://www.ebaytechblog.com/2014/05/28/using-spark-to-ignite-data-analytics/
Why Cloudera is saying 'Goodbye, MapReduce' and 'Hello, Spark' :
http://fortune.com/2015/09/09/cloudera-spark-mapreduce/
5 reasons to turn to Spark for Big Data Analytics :
http://www.infoworld.com/article/2897287/big-data/5-reasons-to-turn-to-spark-for-big-data-analytics.html