TALK TRACK
Ad-hoc experimentation: Spark, Hive, Shell, Flink, Tajo, Ignite, Lens, etc.
Deeply integrated with Spark + Hadoop: can be managed via Ambari Stacks
Supports multiple language backends: pluggable “Interpreters”
Incubating at Apache: 100% open source and open community
[NEXT SLIDE]
TALK TRACK
Ensuring Spark is well integrated with YARN, Ambari, and Ranger enables enterprises to deploy Spark apps with confidence. And because HDP is available on Windows and Linux, on-premises and in the cloud, it is that much easier for enterprises to adopt.
[NEXT SLIDE]
http://www.cs.berkeley.edu/~matei/papers/2010/hotcloud_spark.pdf
http://www.cs.berkeley.edu/~matei/papers/2012/nsdi_spark.pd
Key idea: add “variables” to the “functions” in functional programming
NEED SPEAKER NOTES
NEED SPEAKER NOTES
NEED SPEAKER NOTES
Spark DataFrames represent tabular data
NEED SPEAKER NOTES
NEED SPEAKER NOTES
NEED SPEAKER NOTES
TALK TRACK
[NEXT SLIDE]
NEED SPEAKER NOTES
NEED SPEAKER NOTES
[RESOURCES]
A vertex is an entity that can carry a bag of data (generally small)
An edge connects vertices and can also own a bag of data
https://amplab.cs.berkeley.edu/wp-content/uploads/2013/05/grades-graphx_with_fonts.pdf
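The vertex and edge description above can be sketched in plain Python. This is only an illustration of the property-graph idea, not the GraphX API (GraphX itself is a Scala library); the class names `Vertex`, `Edge`, and `PropertyGraph`, the helper methods, and the sample data are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Vertex:
    vertex_id: int
    # The "bag of data" attached to the vertex (generally small).
    properties: Dict[str, Any] = field(default_factory=dict)


@dataclass
class Edge:
    src_id: int
    dst_id: int
    # An edge can own a bag of data too.
    properties: Dict[str, Any] = field(default_factory=dict)


@dataclass
class PropertyGraph:
    vertices: Dict[int, Vertex] = field(default_factory=dict)
    edges: List[Edge] = field(default_factory=list)

    def add_vertex(self, vertex_id: int, **properties: Any) -> None:
        self.vertices[vertex_id] = Vertex(vertex_id, properties)

    def add_edge(self, src_id: int, dst_id: int, **properties: Any) -> None:
        # An edge connects vertices, so both endpoints must already exist.
        if src_id not in self.vertices or dst_id not in self.vertices:
            raise KeyError("both endpoints must exist before adding an edge")
        self.edges.append(Edge(src_id, dst_id, properties))

    def out_neighbors(self, vertex_id: int) -> List[int]:
        # Follow outgoing edges from the given vertex.
        return [e.dst_id for e in self.edges if e.src_id == vertex_id]


graph = PropertyGraph()
graph.add_vertex(1, name="alice", role="student")
graph.add_vertex(2, name="bob", role="professor")
graph.add_edge(1, 2, relationship="advisee_of")

print(graph.out_neighbors(1))
print(graph.edges[0].properties)
```

Keeping the data bags as plain dictionaries mirrors the "generally small" note: the structure of the graph carries the relationships, while the per-vertex and per-edge data stays lightweight.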
Takeaways
Change order of interoperability slide
Flesh out no lock-in slide to talk about “proprietary open source”