SlideShare ist ein Scribd-Unternehmen logo
1 von 45
Downloaden Sie, um offline zu lesen
David	Taieb	
STSM	-	IBM	Cloud	Data	Services	
Developer	advocate		
david_taieb@us.ibm.com	
HANDS-ON	SESSION:		
DEVELOPING	ANALYTIC	APPLICATIONS	
USING	APACHE	SPARK™	AND	PYTHON	
	
Part	2:	Analyzing	car	twiQer	data	with	Spark	
and	DashDb	
PyCon	2016,	Portland
©2016	IBM	Corpora6on		
	
Agenda
•  Provision	the	applica6on	services	on	Bluemix:	
Spark,	DashDb,	IBM	Insight	for	TwiJer	
•  Load	car	related	tweets	from	IBM	Insight	for	
TwiJer	into	DashDb	warehouse	
•  Run	Analy6cs	in	Python	Notebook	and	
discover	new	insights
©2016	IBM	Corpora6on		
	
Sign up for Bluemix
•  Access	IBM	Bluemix	website	on	hJps://console.ng.bluemix.net	
•  Click	on	Get	Started	for	Free	
•  Complete	the	form	and	click	Create	account	
•  Look	for	confirma6on	email	and	click	on	confirm	you	account	link	
Create	new	Space
©2016	IBM	Corpora6on		
	
Create a new space on Bluemix
In	prepara6on	for	running	the	project,	we	create	a	new	space	on	Bluemix		
Create	a	Spark	Instance	
Op6onal:	You	can	skip	this	step	if	you	already	have	a	
space	with	Spark	instance	that	you	would	like	to	reuse
©2016	IBM	Corpora6on		
	
Create a Spark Instance
Op6onal:	You	can	skip	this	step	if	you	already	have	a	
space	with	Spark	instance	that	you	would	like	to	reuse
©2016	IBM	Corpora6on		
	
Create New Spark Instance
Op6onal:	You	can	skip	this	step	if	you	already	have	a	
space	with	Spark	instance	that	you	would	like	to	reuse
©2016	IBM	Corpora6on		
	
Acquiring the data
•  In	the	next	sec6on,	we	show	how	to	acquire	
the	twiJer	data	and	store	it	into	DashDb.	
•  We	use	the	TwiJer	loading	connector	
available	as	a	menu	in	DashDb	console	
Create	a	DashDb	instance
©2016	IBM	Corpora6on		
	
Create an instance of IBM Dash DB on Bluemix
Create	an	IBM	Insight	for	TwiJer	instance
©2016	IBM	Corpora6on		
	
Create an instance of IBM Insight for Twitter on Bluemix
©2016	IBM	Corpora6on		
	
Agenda
•  Provision	the	applica6on	services	on	Bluemix:	
Spark,	DashDb,	IBM	Insight	for	TwiJer	
•  Load	car	related	tweets	from	IBM	Insight	for	
TwiJer	into	DashDb	warehouse	
•  Run	Analy6cs	in	Python	Notebook	and	
discover	new	insights
©2016	IBM	Corpora6on		
	
Launch DashDb Console
Click	on	the	DashDb	Service	6le	to	open	this	dashboard,	then	click	on	Launch	buJon		
Load	TwiJer	Data
©2016	IBM	Corpora6on		
	
Load Twitter Data
DashDb	Console	offered	mul6ple	data	
connectors	including	a	TwiJer	
connector	that	automa6cally	
connects	to	IBM	Insight	for	TwiJer	
Connect	to	TwiJer
©2016	IBM	Corpora6on		
	
Connect to Twitter
Reusing	the	TwiJer	
service	instance	
created	in	previous	
step
©2016	IBM	Corpora6on		
	
Select the data to be loaded
TwiJer	Query	being	used:		
posted:2015-01-01,2015-12-31	followers_count:2000	listed_count:1000	(volkswagen	OR	vw	OR	toyota	
OR	daimler	OR	mercedes	OR	bmw	OR	gm	OR	"general	motors"	OR	tesla)	
Specify	twiJer	query	
Provide	preview	count	of	
output	data
©2016	IBM	Corpora6on		
	
Select the DashDb Table
Name	of	the	schema	under	which	the	tables	will	be	created	
Prefix	(Namespace)	for	the	created	tables	
List	of	tables	that	will	be	created
©2016	IBM	Corpora6on		
	
Loading data monitoring page
Warning:	loading	6me	may	vary	
based	on	bandwidth.	It	may	take	
between	15	mns	and	1hour
©2016	IBM	Corpora6on		
	
Complete the load: Statistics
©2016	IBM	Corpora6on		
	
Complete the load: explore the data
©2016	IBM	Corpora6on		
	
Get connection information
Copy	the	User	id,	password	and	jdbc	url,	you’ll	need	this	informa6on	later
©2016	IBM	Corpora6on		
	
Agenda
•  Provision	the	applica6on	services	on	Bluemix:	
Spark,	DashDb,	IBM	Insight	for	TwiJer	
•  Load	car	related	tweets	from	IBM	Insight	for	
TwiJer	into	DashDb	warehouse	
•  Run	Analy6cs	in	Python	Notebook	and	
discover	new	insights
©2016	IBM	Corpora6on		
	
Create new Notebook from URL
Import	required	Python	packages	
• Create	notebook	from	URL	
• Use	hJps://github.com/ibm-cds-labs/spark.samples/raw/master/notebook/
DashDB%20TwiJer%20Car%202015%20Python%20Notebook.ipynb
©2016	IBM	Corpora6on		
	
Step 1: Import Python Packages
• Install	nltk	package	(Natural	language	toolkit)	
• We	will	use	it	to	filter	stop	words	later	in	the	tutorial
©2016	IBM	Corpora6on		
	
Import Python modules and setup the SQLContext
©2016	IBM	Corpora6on		
	
Step 2: Define global Variables
Set	up	various	data	structures	we’ll	
need	throughout	the	Notebook	
This	is	the	SCHEMA	and	PREFIX	you	used	in	
Step	3	of	the	TwiJer	connector	wizard
©2016	IBM	Corpora6on		
	
Set up some global helper functions
JavaScript	Google	map	visualiza6on	
Misc	helper	that	fill	in	missing	dates
©2016	IBM	Corpora6on		
	
Step 3: Acquire the data from DashDB
User	ID	and	password	from	Connec6on	page	
User	ID	and	password	from	Connec6on	page
©2016	IBM	Corpora6on		
	
Join the Tweets and Sentiment Table
In	this	step,	we	want	to	add	a	sen6ment	score	for	each	tweet	record:	
•  Join	the	Tweets	and	Sen6ments	table	
•  Encode	the	sen6ment	into	a	number	e.g.	POSITIVE=+1,	NEGATIVE=-1,	AMBIVALENT=0	
•  Create	an	average	for	each	sen6ment	associated	with	a	tweet	
•  %6me	instruments	the	code	to	provide	profile	execu6on	stats.
©2016	IBM	Corpora6on		
	
Step 4: Transform the data
Create	a	clean	Working	dataframe	
that	will	be	easier	to	use	in	our	
analy6cs
©2016	IBM	Corpora6on		
	
Step 5: Geographic distribution of tweets
GroupBy	countries	and	aggregate	the	tweets	count	
Convert	Spark	SQL	dataframe	to	Pandas	data	
structure	for	visualiza6on
©2016	IBM	Corpora6on		
	
Bar chart visualization of Tweet distribution by Geo
©2016	IBM	Corpora6on		
	
Google map visualization of tweet distribution by Geos
Call	GeoChart	Helper	that	set	up	the	JavaScript	code
©2016	IBM	Corpora6on		
	
Clean up memory before next analytics
Resources	including	memory	on	the	Spark	Driver	machine	are	not	infinite.		
It	is	good	prac6ce	to	clean	up	when	data	is	not	needed	anymore
©2016	IBM	Corpora6on		
	
Step 6: Analyzing tweets sentiment
GroupBy	Sen6ments	and	aggregate	the	tweets	count	
Convert	Spark	SQL	dataframe	to	Pandas	data	
structure	for	visualiza6on
©2016	IBM	Corpora6on		
	
Sentiment visualization
Use	Matplot	pie	chart
©2016	IBM	Corpora6on		
	
Step 7: Analyze Tweet timeline
Convert	Spark	SQL	dataframe	to	Pandas	data	
structure	for	visualiza6on	
GroupBy	Pos6ng	6me	and	sen6ment	tuples	
Aggregate	the	tweet	counts	
GroupBy	Pos6ng	6me	and	sen6ment	tuples	
Aggregate	the	sum	of	the	tweet	counts
©2016	IBM	Corpora6on		
	
Prepare the timeline data structures
©2016	IBM	Corpora6on		
	
Time series visualization for all tweets
©2016	IBM	Corpora6on		
	
Deep dive into car manufacturers
Create	new	DataFrame	that	enrich	tweets	with	
extra	metadata:	
-Boolean	for	each	car	manufacturer	
-Boolean	for	electric	car	
-Boolean	for	self	driving	car
©2016	IBM	Corpora6on		
	
Re-analyze tweeter timeline for each car manufacturer
Create	new	DataFrame	for	each	car	manufacturer	
Aggregate	the	tweet	counts,	order	by	pos6ng	6me
©2016	IBM	Corpora6on		
	
Timeline series visualization
No6ce	the	peak	of	tweets	for	VW	between	
September	and	October	2015
©2016	IBM	Corpora6on		
	
Explain why the peak of tweets for VW between September and October 2015
Filter	for	all	VW	tweets	between	Sept	and	Oct	2015	
Pie	chart	visualiza6on	of	the	top	10	words	being	
used	in	these	tweets	
Create	map	count	of	all	non-stop	words	
used	in	the	tweets	
Use	NLTK	stopwords	module	to	filter	out	stop	words
©2016	IBM	Corpora6on		
	
Peak explained
We	can	clearly	see	from	the	list	of	
most	used	words	that	the	peak	
correspond	to	the	VW	scandal	
around	fraudulent	emissions	
tes6ng
©2016	IBM	Corpora6on		
	
Follow the notebook for many more
interesting analytics
©2016	IBM	Corpora6on		
	
Resource
•  hJps://developer.ibm.com/clouddataservices/	
•  hJps://github.com/ibm-cds-labs/simple-data-pipe	
•  hJps://github.com/ibm-cds-labs/pipes-connector-flightstats	
•  hJp://spark.apache.org/docs/latest/mllib-guide.html	
•  hJps://console.ng.bluemix.net/data/analy6cs/
©2016	IBM	Corpora6on		
	
Thank You

Weitere ähnliche Inhalte

Was ist angesagt?

How Disney+ uses fast data ubiquity to improve the customer experience
 How Disney+ uses fast data ubiquity to improve the customer experience  How Disney+ uses fast data ubiquity to improve the customer experience
How Disney+ uses fast data ubiquity to improve the customer experience Martin Zapletal
 
Machine learning in the physical world by Kip Larson from AWS IoT
Machine learning in the physical world by  Kip Larson from AWS IoTMachine learning in the physical world by  Kip Larson from AWS IoT
Machine learning in the physical world by Kip Larson from AWS IoTBill Liu
 
Introduction to Google Cloud Platform (GCP) | Google Cloud Tutorial for Begin...
Introduction to Google Cloud Platform (GCP) | Google Cloud Tutorial for Begin...Introduction to Google Cloud Platform (GCP) | Google Cloud Tutorial for Begin...
Introduction to Google Cloud Platform (GCP) | Google Cloud Tutorial for Begin...Edureka!
 
GDG Heraklion - Architecting for the Google Cloud Platform
GDG Heraklion - Architecting for the Google Cloud PlatformGDG Heraklion - Architecting for the Google Cloud Platform
GDG Heraklion - Architecting for the Google Cloud PlatformMárton Kodok
 
NYC WebPerf Meetup Feb 2020 - Measuring the Adoption of Web Performance Techn...
NYC WebPerf Meetup Feb 2020 - Measuring the Adoption of Web Performance Techn...NYC WebPerf Meetup Feb 2020 - Measuring the Adoption of Web Performance Techn...
NYC WebPerf Meetup Feb 2020 - Measuring the Adoption of Web Performance Techn...Paul Calvano
 
Scale with a smile with Google Cloud Platform At DevConTLV (June 2014)
Scale with a smile with Google Cloud Platform At DevConTLV (June 2014)Scale with a smile with Google Cloud Platform At DevConTLV (June 2014)
Scale with a smile with Google Cloud Platform At DevConTLV (June 2014)Ido Green
 
How to deploy machine learning models in the Cloud
How to deploy machine learning models in the CloudHow to deploy machine learning models in the Cloud
How to deploy machine learning models in the CloudAlex Casalboni
 
Getting started with Google Cloud Training Material - 2018
Getting started with Google Cloud Training Material - 2018Getting started with Google Cloud Training Material - 2018
Getting started with Google Cloud Training Material - 2018JK Baseer
 
Top Advantages of Using Google Cloud Platform
Top Advantages of Using Google Cloud PlatformTop Advantages of Using Google Cloud Platform
Top Advantages of Using Google Cloud PlatformKinsta WordPress Hosting
 
Whizlabs webinar - Deploying Portfolio Site with AWS Serverless
Whizlabs webinar - Deploying Portfolio Site with AWS ServerlessWhizlabs webinar - Deploying Portfolio Site with AWS Serverless
Whizlabs webinar - Deploying Portfolio Site with AWS ServerlessDhaval Nagar
 
Fluent 2018: Tracking Performance of the Web with HTTP Archive
Fluent 2018: Tracking Performance of the Web with HTTP ArchiveFluent 2018: Tracking Performance of the Web with HTTP Archive
Fluent 2018: Tracking Performance of the Web with HTTP ArchivePaul Calvano
 
DbyDx Software Corporate Presentation
DbyDx Software Corporate PresentationDbyDx Software Corporate Presentation
DbyDx Software Corporate PresentationDbyDx Software
 
Google cloud platform
Google cloud platformGoogle cloud platform
Google cloud platformrajdeep
 
AOF Diagram Synchronization in Papyrus Neon
AOF Diagram Synchronization in Papyrus NeonAOF Diagram Synchronization in Papyrus Neon
AOF Diagram Synchronization in Papyrus Neoncdamus
 
Building your own calendly using amazon app sync
Building your own calendly using amazon app syncBuilding your own calendly using amazon app sync
Building your own calendly using amazon app syncDhaval Nagar
 
Make your own Pokédex with the Pokéapi & Node/Express!
Make your own Pokédex with the Pokéapi & Node/Express! Make your own Pokédex with the Pokéapi & Node/Express!
Make your own Pokédex with the Pokéapi & Node/Express! Autodesk
 
Norwegian SharePoint Community - What happened at Ignite
Norwegian SharePoint Community - What happened at IgniteNorwegian SharePoint Community - What happened at Ignite
Norwegian SharePoint Community - What happened at IgniteMikael Svenson
 
Build SPFx Solutions for SharePoint 2019
Build SPFx Solutions for SharePoint 2019Build SPFx Solutions for SharePoint 2019
Build SPFx Solutions for SharePoint 2019Suhail Jamaldeen
 
Forge - DevCon 2016: Building Value-Added Integrations with Autodesk’s IoT APIs
Forge - DevCon 2016: Building Value-Added Integrations with Autodesk’s IoT APIsForge - DevCon 2016: Building Value-Added Integrations with Autodesk’s IoT APIs
Forge - DevCon 2016: Building Value-Added Integrations with Autodesk’s IoT APIsAutodesk
 

Was ist angesagt? (20)

How Disney+ uses fast data ubiquity to improve the customer experience
 How Disney+ uses fast data ubiquity to improve the customer experience  How Disney+ uses fast data ubiquity to improve the customer experience
How Disney+ uses fast data ubiquity to improve the customer experience
 
Machine learning in the physical world by Kip Larson from AWS IoT
Machine learning in the physical world by  Kip Larson from AWS IoTMachine learning in the physical world by  Kip Larson from AWS IoT
Machine learning in the physical world by Kip Larson from AWS IoT
 
Introduction to Google Cloud Platform (GCP) | Google Cloud Tutorial for Begin...
Introduction to Google Cloud Platform (GCP) | Google Cloud Tutorial for Begin...Introduction to Google Cloud Platform (GCP) | Google Cloud Tutorial for Begin...
Introduction to Google Cloud Platform (GCP) | Google Cloud Tutorial for Begin...
 
GDG Heraklion - Architecting for the Google Cloud Platform
GDG Heraklion - Architecting for the Google Cloud PlatformGDG Heraklion - Architecting for the Google Cloud Platform
GDG Heraklion - Architecting for the Google Cloud Platform
 
NYC WebPerf Meetup Feb 2020 - Measuring the Adoption of Web Performance Techn...
NYC WebPerf Meetup Feb 2020 - Measuring the Adoption of Web Performance Techn...NYC WebPerf Meetup Feb 2020 - Measuring the Adoption of Web Performance Techn...
NYC WebPerf Meetup Feb 2020 - Measuring the Adoption of Web Performance Techn...
 
Scale with a smile with Google Cloud Platform At DevConTLV (June 2014)
Scale with a smile with Google Cloud Platform At DevConTLV (June 2014)Scale with a smile with Google Cloud Platform At DevConTLV (June 2014)
Scale with a smile with Google Cloud Platform At DevConTLV (June 2014)
 
How to deploy machine learning models in the Cloud
How to deploy machine learning models in the CloudHow to deploy machine learning models in the Cloud
How to deploy machine learning models in the Cloud
 
Getting started with Google Cloud Training Material - 2018
Getting started with Google Cloud Training Material - 2018Getting started with Google Cloud Training Material - 2018
Getting started with Google Cloud Training Material - 2018
 
Top Advantages of Using Google Cloud Platform
Top Advantages of Using Google Cloud PlatformTop Advantages of Using Google Cloud Platform
Top Advantages of Using Google Cloud Platform
 
Whizlabs webinar - Deploying Portfolio Site with AWS Serverless
Whizlabs webinar - Deploying Portfolio Site with AWS ServerlessWhizlabs webinar - Deploying Portfolio Site with AWS Serverless
Whizlabs webinar - Deploying Portfolio Site with AWS Serverless
 
Fluent 2018: Tracking Performance of the Web with HTTP Archive
Fluent 2018: Tracking Performance of the Web with HTTP ArchiveFluent 2018: Tracking Performance of the Web with HTTP Archive
Fluent 2018: Tracking Performance of the Web with HTTP Archive
 
DbyDx Software Corporate Presentation
DbyDx Software Corporate PresentationDbyDx Software Corporate Presentation
DbyDx Software Corporate Presentation
 
Google cloud platform
Google cloud platformGoogle cloud platform
Google cloud platform
 
AOF Diagram Synchronization in Papyrus Neon
AOF Diagram Synchronization in Papyrus NeonAOF Diagram Synchronization in Papyrus Neon
AOF Diagram Synchronization in Papyrus Neon
 
Building your own calendly using amazon app sync
Building your own calendly using amazon app syncBuilding your own calendly using amazon app sync
Building your own calendly using amazon app sync
 
Make your own Pokédex with the Pokéapi & Node/Express!
Make your own Pokédex with the Pokéapi & Node/Express! Make your own Pokédex with the Pokéapi & Node/Express!
Make your own Pokédex with the Pokéapi & Node/Express!
 
Norwegian SharePoint Community - What happened at Ignite
Norwegian SharePoint Community - What happened at IgniteNorwegian SharePoint Community - What happened at Ignite
Norwegian SharePoint Community - What happened at Ignite
 
Build SPFx Solutions for SharePoint 2019
Build SPFx Solutions for SharePoint 2019Build SPFx Solutions for SharePoint 2019
Build SPFx Solutions for SharePoint 2019
 
Tour of Dapr
Tour of DaprTour of Dapr
Tour of Dapr
 
Forge - DevCon 2016: Building Value-Added Integrations with Autodesk’s IoT APIs
Forge - DevCon 2016: Building Value-Added Integrations with Autodesk’s IoT APIsForge - DevCon 2016: Building Value-Added Integrations with Autodesk’s IoT APIs
Forge - DevCon 2016: Building Value-Added Integrations with Autodesk’s IoT APIs
 

Andere mochten auch

Data Science with Spark - Training at SparkSummit (East)
Data Science with Spark - Training at SparkSummit (East)Data Science with Spark - Training at SparkSummit (East)
Data Science with Spark - Training at SparkSummit (East)Krishna Sankar
 
Scala meetup - Intro to spark
Scala meetup - Intro to sparkScala meetup - Intro to spark
Scala meetup - Intro to sparkJavier Arrieta
 
Apache Spark Tutorial
Apache Spark TutorialApache Spark Tutorial
Apache Spark TutorialAhmet Bulut
 
Spark tutorial @ KCC 2015
Spark tutorial @ KCC 2015Spark tutorial @ KCC 2015
Spark tutorial @ KCC 2015Jongwook Woo
 
Preso spark leadership
Preso spark leadershipPreso spark leadership
Preso spark leadershipsjoerdluteyn
 
Spark, the new age of data scientist
Spark, the new age of data scientistSpark, the new age of data scientist
Spark, the new age of data scientistMassimiliano Martella
 
Spark introduction - In Chinese
Spark introduction - In ChineseSpark introduction - In Chinese
Spark introduction - In Chinesecolorant
 
Spark the next top compute model
Spark   the next top compute modelSpark   the next top compute model
Spark the next top compute modelDean Wampler
 
An Introduct to Spark - Atlanta Spark Meetup
An Introduct to Spark - Atlanta Spark MeetupAn Introduct to Spark - Atlanta Spark Meetup
An Introduct to Spark - Atlanta Spark Meetupjlacefie
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Sciencesarith divakar
 
A Deeper Understanding of Spark Internals (Hadoop Conference Japan 2014)
A Deeper Understanding of Spark Internals (Hadoop Conference Japan 2014)A Deeper Understanding of Spark Internals (Hadoop Conference Japan 2014)
A Deeper Understanding of Spark Internals (Hadoop Conference Japan 2014)Hadoop / Spark Conference Japan
 
Scala presentation by Aleksandar Prokopec
Scala presentation by Aleksandar ProkopecScala presentation by Aleksandar Prokopec
Scala presentation by Aleksandar ProkopecLoïc Descotte
 
Why dont you_create_new_spark_jl
Why dont you_create_new_spark_jlWhy dont you_create_new_spark_jl
Why dont you_create_new_spark_jlShintaro Fukushima
 

Andere mochten auch (20)

Data Science with Spark - Training at SparkSummit (East)
Data Science with Spark - Training at SparkSummit (East)Data Science with Spark - Training at SparkSummit (East)
Data Science with Spark - Training at SparkSummit (East)
 
Scala meetup - Intro to spark
Scala meetup - Intro to sparkScala meetup - Intro to spark
Scala meetup - Intro to spark
 
Apache Spark Tutorial
Apache Spark TutorialApache Spark Tutorial
Apache Spark Tutorial
 
Spark tutorial @ KCC 2015
Spark tutorial @ KCC 2015Spark tutorial @ KCC 2015
Spark tutorial @ KCC 2015
 
Spark - Philly JUG
Spark  - Philly JUGSpark  - Philly JUG
Spark - Philly JUG
 
Performance
PerformancePerformance
Performance
 
Preso spark leadership
Preso spark leadershipPreso spark leadership
Preso spark leadership
 
Spark, the new age of data scientist
Spark, the new age of data scientistSpark, the new age of data scientist
Spark, the new age of data scientist
 
Spark introduction - In Chinese
Spark introduction - In ChineseSpark introduction - In Chinese
Spark introduction - In Chinese
 
Apache Spark with Scala
Apache Spark with ScalaApache Spark with Scala
Apache Spark with Scala
 
Spark the next top compute model
Spark   the next top compute modelSpark   the next top compute model
Spark the next top compute model
 
NYC_2016_slides
NYC_2016_slidesNYC_2016_slides
NYC_2016_slides
 
Intro to Apache Spark
Intro to Apache SparkIntro to Apache Spark
Intro to Apache Spark
 
An Introduct to Spark - Atlanta Spark Meetup
An Introduct to Spark - Atlanta Spark MeetupAn Introduct to Spark - Atlanta Spark Meetup
An Introduct to Spark - Atlanta Spark Meetup
 
Scala in practice
Scala in practiceScala in practice
Scala in practice
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 
A Deeper Understanding of Spark Internals (Hadoop Conference Japan 2014)
A Deeper Understanding of Spark Internals (Hadoop Conference Japan 2014)A Deeper Understanding of Spark Internals (Hadoop Conference Japan 2014)
A Deeper Understanding of Spark Internals (Hadoop Conference Japan 2014)
 
Scala+RDD
Scala+RDDScala+RDD
Scala+RDD
 
Scala presentation by Aleksandar Prokopec
Scala presentation by Aleksandar ProkopecScala presentation by Aleksandar Prokopec
Scala presentation by Aleksandar Prokopec
 
Why dont you_create_new_spark_jl
Why dont you_create_new_spark_jlWhy dont you_create_new_spark_jl
Why dont you_create_new_spark_jl
 

Ähnlich wie Spark tutorial py con 2016 part 2

Rock the activity stream api
Rock the activity stream api Rock the activity stream api
Rock the activity stream api Wannes Rams
 
A301 ctu madrid2016-monitoring
A301 ctu madrid2016-monitoringA301 ctu madrid2016-monitoring
A301 ctu madrid2016-monitoringMichael Dawson
 
AquaHacking Challenge - Bluemix Introduction
AquaHacking Challenge - Bluemix IntroductionAquaHacking Challenge - Bluemix Introduction
AquaHacking Challenge - Bluemix IntroductionJean-Louis (JL) Marechaux
 
Accelerating your application development with IBM BlueMix (Your dream devel...
Accelerating your application development with IBM BlueMix (Your dream devel...Accelerating your application development with IBM BlueMix (Your dream devel...
Accelerating your application development with IBM BlueMix (Your dream devel...Supal Chowdhury
 
Acando Seminar Best of ignite 2016
Acando Seminar Best of ignite 2016Acando Seminar Best of ignite 2016
Acando Seminar Best of ignite 2016Acando Sweden
 
Reactive to Proactive: Intelligent Troubleshooting and Monitoring with Splunk
Reactive to Proactive: Intelligent Troubleshooting and Monitoring with SplunkReactive to Proactive: Intelligent Troubleshooting and Monitoring with Splunk
Reactive to Proactive: Intelligent Troubleshooting and Monitoring with SplunkSplunk
 
IBM-SAP Partnership: Driving the Digital Transformation
IBM-SAP Partnership: Driving the Digital Transformation IBM-SAP Partnership: Driving the Digital Transformation
IBM-SAP Partnership: Driving the Digital Transformation INDUSCommunity
 
IBM-SAP Partnership: Driving the Digital Transformation
IBM-SAP Partnership: Driving the Digital Transformation IBM-SAP Partnership: Driving the Digital Transformation
IBM-SAP Partnership: Driving the Digital Transformation INDUSCommunity
 
Platform Showcase: Making the Ultimate Live Demo, by Gabriel Michaud
Platform Showcase: Making the Ultimate Live Demo, by Gabriel MichaudPlatform Showcase: Making the Ultimate Live Demo, by Gabriel Michaud
Platform Showcase: Making the Ultimate Live Demo, by Gabriel MichaudAcumatica Cloud ERP
 
Odsc london data science bootcamp with pixie dust
Odsc london data science bootcamp with pixie dustOdsc london data science bootcamp with pixie dust
Odsc london data science bootcamp with pixie dustDavid Taieb
 
MissionGraph QTD Training Materials
MissionGraph QTD Training MaterialsMissionGraph QTD Training Materials
MissionGraph QTD Training MaterialsBenjamin Huston
 
IBM Bluemix Garage in London
IBM Bluemix Garage in LondonIBM Bluemix Garage in London
IBM Bluemix Garage in LondonSonia Cyrus
 
IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA - Serverless & Open...
IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA - Serverless & Open...IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA - Serverless & Open...
IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA - Serverless & Open...IBM France Lab
 
Final create an app to perform intelligent searched on your data
Final  create an app to perform intelligent searched on your dataFinal  create an app to perform intelligent searched on your data
Final create an app to perform intelligent searched on your dataAnam Mahmood
 
How to build and deploy app on Replit
How to build and deploy app on ReplitHow to build and deploy app on Replit
How to build and deploy app on Replitmatiasfund
 
Service Design and Change in Corporate Contexts - Service Experience Camp 2016
Service Design and Change in Corporate Contexts - Service Experience Camp 2016Service Design and Change in Corporate Contexts - Service Experience Camp 2016
Service Design and Change in Corporate Contexts - Service Experience Camp 2016Klaus Rüggenmann
 
Challenge accepted: IBM Cloud onboarding & Upgrades to IBM Notes Browser Plug-in
Challenge accepted: IBM Cloud onboarding & Upgrades to IBM Notes Browser Plug-inChallenge accepted: IBM Cloud onboarding & Upgrades to IBM Notes Browser Plug-in
Challenge accepted: IBM Cloud onboarding & Upgrades to IBM Notes Browser Plug-inDaniel Reimann
 
TTN things connected acount creation
TTN things connected acount creationTTN things connected acount creation
TTN things connected acount creationJisc
 
New IT Steps to Accelerate Agility
New IT Steps to Accelerate AgilityNew IT Steps to Accelerate Agility
New IT Steps to Accelerate AgilityWSO2
 

Ähnlich wie Spark tutorial py con 2016 part 2 (20)

Rock the activity stream api
Rock the activity stream api Rock the activity stream api
Rock the activity stream api
 
A301 ctu madrid2016-monitoring
A301 ctu madrid2016-monitoringA301 ctu madrid2016-monitoring
A301 ctu madrid2016-monitoring
 
AquaHacking Challenge - Bluemix Introduction
AquaHacking Challenge - Bluemix IntroductionAquaHacking Challenge - Bluemix Introduction
AquaHacking Challenge - Bluemix Introduction
 
ABP.pptx
ABP.pptxABP.pptx
ABP.pptx
 
Accelerating your application development with IBM BlueMix (Your dream devel...
Accelerating your application development with IBM BlueMix (Your dream devel...Accelerating your application development with IBM BlueMix (Your dream devel...
Accelerating your application development with IBM BlueMix (Your dream devel...
 
Acando Seminar Best of ignite 2016
Acando Seminar Best of ignite 2016Acando Seminar Best of ignite 2016
Acando Seminar Best of ignite 2016
 
Reactive to Proactive: Intelligent Troubleshooting and Monitoring with Splunk
Reactive to Proactive: Intelligent Troubleshooting and Monitoring with SplunkReactive to Proactive: Intelligent Troubleshooting and Monitoring with Splunk
Reactive to Proactive: Intelligent Troubleshooting and Monitoring with Splunk
 
IBM-SAP Partnership: Driving the Digital Transformation
IBM-SAP Partnership: Driving the Digital Transformation IBM-SAP Partnership: Driving the Digital Transformation
IBM-SAP Partnership: Driving the Digital Transformation
 
IBM-SAP Partnership: Driving the Digital Transformation
IBM-SAP Partnership: Driving the Digital Transformation IBM-SAP Partnership: Driving the Digital Transformation
IBM-SAP Partnership: Driving the Digital Transformation
 
Platform Showcase: Making the Ultimate Live Demo, by Gabriel Michaud
Platform Showcase: Making the Ultimate Live Demo, by Gabriel MichaudPlatform Showcase: Making the Ultimate Live Demo, by Gabriel Michaud
Platform Showcase: Making the Ultimate Live Demo, by Gabriel Michaud
 
Odsc london data science bootcamp with pixie dust
Odsc london data science bootcamp with pixie dustOdsc london data science bootcamp with pixie dust
Odsc london data science bootcamp with pixie dust
 
MissionGraph QTD Training Materials
MissionGraph QTD Training MaterialsMissionGraph QTD Training Materials
MissionGraph QTD Training Materials
 
IBM Bluemix Garage in London
IBM Bluemix Garage in LondonIBM Bluemix Garage in London
IBM Bluemix Garage in London
 
IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA - Serverless & Open...
IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA - Serverless & Open...IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA - Serverless & Open...
IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA - Serverless & Open...
 
Final create an app to perform intelligent searched on your data
Final  create an app to perform intelligent searched on your dataFinal  create an app to perform intelligent searched on your data
Final create an app to perform intelligent searched on your data
 
How to build and deploy app on Replit
How to build and deploy app on ReplitHow to build and deploy app on Replit
How to build and deploy app on Replit
 
Service Design and Change in Corporate Contexts - Service Experience Camp 2016
Service Design and Change in Corporate Contexts - Service Experience Camp 2016Service Design and Change in Corporate Contexts - Service Experience Camp 2016
Service Design and Change in Corporate Contexts - Service Experience Camp 2016
 
Challenge accepted: IBM Cloud onboarding & Upgrades to IBM Notes Browser Plug-in
Challenge accepted: IBM Cloud onboarding & Upgrades to IBM Notes Browser Plug-inChallenge accepted: IBM Cloud onboarding & Upgrades to IBM Notes Browser Plug-in
Challenge accepted: IBM Cloud onboarding & Upgrades to IBM Notes Browser Plug-in
 
TTN things connected acount creation
TTN things connected acount creationTTN things connected acount creation
TTN things connected acount creation
 
New IT Steps to Accelerate Agility
New IT Steps to Accelerate AgilityNew IT Steps to Accelerate Agility
New IT Steps to Accelerate Agility
 

Kürzlich hochgeladen

6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingsocarem879
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...KarteekMane1
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 

Kürzlich hochgeladen (20)

6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 

Spark tutorial py con 2016 part 2