SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
October	2016
Predictive	Analytics
Big	Data	&	Artificial	Intelligence
Agenda
Artificial	Intelligence AI
Big	Data
Machine	Learning
Deep	Learning
Neural	Networks
NLPNatural	Language	Processing
Demystify	the	following	buzzwords.
Image	Recognition
2
Ultimate	Goal:	Predictive	Analytics
Predict	what	users	will	want	to	buy.
A	consumer	searches	
for	a	TV	and	based	on	
previous	customers	
data,	show	a	product	
that	has	a	high	
probability	of	being	
bought	as	well.
3
Evolution	of	Data	Analytics
1990s 2000s
Excel Business	Intelligence	(BI)
Dashboards
2015	and	beyond
Actionable
Insights
What	Happened? What’s	Happening? What	Will	Happen?
4
The	Process
Structured	and	
unstructured	(ex.	
video)	data
Data	is	stored	in	
databases	and	
servers
Data	
Generated
Data
Stored
Actionable
Insights
Data
Processing
Process	the	data	
using	CPU/GPUs	
and	AI	algorithms	
to	detect	patterns
Predictive
signals	are	
generated
Central	Processing	Unit	(CPU)	/	Graphics	Processing	Unit	(GPU)
Big	Data Artificial	Intelligence
5
How	Did	We	Get	Here?
Databases
(the	80s)
Data	Warehousing
(the	90s)
• Relational	databases
• Gigabytes	in	size
• Low	latency
• Terabytes	in	size
• Custom	hardware
6
Today,	it’s	Big	Data
7
Artificial	Intelligence	(AI)
8
Artificial	Intelligence	(AI)
9
When	To	Use	Machine	Learning
A	pattern	exists1
We	cannot	pin	down	the	pattern	
mathematically
2
We	have	data	and	hopefully	lots	of	
data
10
Types	of	Machine	Learning
11
Supervised	Learning
X
X
X
X
X
Price
Square	Feet
We	know	what	we	are	trying	to	
predict.		We	use	some	examples	that	
we	and	the	model	know	the	answers	
to	“train”	our	model.	It	can	then	
generate	predictions	to	examples	we	
don’t	know	the	answer	to.
Example:	Predict	the	price	of	a	house	
based	on	the	size	of	the	house.	
X
X
12
Unsupervised	Learning
O
O O
O
O
O
O
OO
O
X
Y
OO
O O
O
We	don’t	know	what	we	are	trying	to	
predict.	We	are	trying	to	identify	
some	naturally	occurring	patterns	in	
the	data	which	may	be	informative.
Example:	Try	to	identify	“clusters”	of	
customers	based	on	the	data	we	have	
on	them.
13
What	is	Deep	Learning?
• Deep	Learning	and	Neural	Networks	are	synonymous
• It’s	a	branch	of	machine	learning	based	on	a	set	of	algorithms	that	
attempt	to	model	high	level	abstractions	in	data	by	using	a	deep	graph	
with	multiple	processing	layers,	composed	of	multiple	linear	and	non-
linear	transformations
What	we	see What	the	computer	“sees”
14
Tools	of	The	Trade
Apache	SystemML
Google	Cloud
Machine	Learning
15
mrjain@gmail.com
Questions?
version:	draft
Appendix
17
AI	Researchers
Geoffrey	Hinton
University	of	Toronto
Google
Yoshua Bengio
University	of	Montreal
Yann	LeCun
New	York	University
Facebook
Andrew	Ng
Stanford	University
Baidu
18
CPU	vs	GPU	Performance
19
MapReduce
20
The	Name…Hadoop
Named	after	the	yellow	toy	elephant	of	Doug	Cutting’s	son.	
In	2006	while	working	at	Yahoo,	Doug	came	up	with	the	Hadoop	
framework.	In	2008,	it	was	taken	over	by	the	open	source	group	
Apache,	hence	the	official	name	is	Apache	Hadoop.
21
Hadoop	to	the	Rescue
“an	open	source	framework	written	in	Java	for	storing	and	
processing	massive	amounts	of	data	in	a	distributed	manner”
1
Hadoop	Distributed	File	System	
(HDFS).	Scalable	file	system	that	
distributes	and	stores	data	across	
many	machines	in	a	cluster.
MapReduce – framework	for	
distributed	processing.
2	Key	Components	of	the	Framework:
Storage 2 Analysis
22
Hadoop Architecture
Hadoop	can	run	on	cheap	commoditized	
hardware	on	premise	or	in	the	cloud.
Stores	files	in	large	
blocks	(64MB)	across	
multiple	machines	for	
fault	tolerance.	By	
default,	data	is	stored	
on	3	separate	machines
HDFS
MapReduce
Breaks	large	data	processing	
problems	into	multiple	steps,	
namely	Mappers	(DataNode)	
and	Reducers	(TaskTrackers)	
that	can	be	worked	on	in	
parallel	on	multiple	machines
23
MapReduce Store	Sales	Data	
(100MB)
Mappers Name	Node	1 Data	Node	1
(64MB)
Data	Node	2
(36MB)
LA NYC LA NYC
Reducers Job	Tracker Task	Tracker
1
LA LA
Task	Tracker
2
NYC NYC
Shuffle	and	Sort
24
MapReduce
Map Shuffle	&	Sort Reduce Result
25
Hadoop	1.0	vs	2.0
26
The	Future…
27

Weitere ähnliche Inhalte

Was ist angesagt?

Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Simplilearn
 

Was ist angesagt? (20)

Big Data Analytics for Healthcare
Big Data Analytics for HealthcareBig Data Analytics for Healthcare
Big Data Analytics for Healthcare
 
Machine learning
Machine learningMachine learning
Machine learning
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Big Data applications in Health Care
Big Data applications in Health CareBig Data applications in Health Care
Big Data applications in Health Care
 
Three Big Data Case Studies
Three Big Data Case StudiesThree Big Data Case Studies
Three Big Data Case Studies
 
Machine Learning in Healthcare
Machine Learning in HealthcareMachine Learning in Healthcare
Machine Learning in Healthcare
 
Data science
Data scienceData science
Data science
 
Data Analytics
Data AnalyticsData Analytics
Data Analytics
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An Overview
 
Data analytics
Data analyticsData analytics
Data analytics
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
AI Governance Playbook
AI Governance PlaybookAI Governance Playbook
AI Governance Playbook
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
The Evolution of Data Science
The Evolution of Data ScienceThe Evolution of Data Science
The Evolution of Data Science
 
Big Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation SlidesBig Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation Slides
 
Deep learning health care
Deep learning health care  Deep learning health care
Deep learning health care
 
Data Science: Past, Present, and Future
Data Science: Past, Present, and FutureData Science: Past, Present, and Future
Data Science: Past, Present, and Future
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
 
AI in Healthcare: From Hype to Impact (updated)
AI in Healthcare: From Hype to Impact (updated)AI in Healthcare: From Hype to Impact (updated)
AI in Healthcare: From Hype to Impact (updated)
 
Applications of Big Data
Applications of Big DataApplications of Big Data
Applications of Big Data
 

Andere mochten auch

MachineLearning.ppt
MachineLearning.pptMachineLearning.ppt
MachineLearning.ppt
butest
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.
butest
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learning
butest
 
Basics of Machine Learning
Basics of Machine LearningBasics of Machine Learning
Basics of Machine Learning
butest
 
IBM Watson Health: How cognitive technologies have begun transforming clinica...
IBM Watson Health: How cognitive technologies have begun transforming clinica...IBM Watson Health: How cognitive technologies have begun transforming clinica...
IBM Watson Health: How cognitive technologies have begun transforming clinica...
Maged N. Kamel Boulos
 
Big Data to Artificial Intelligence in Healthcare
Big Data to Artificial Intelligence in HealthcareBig Data to Artificial Intelligence in Healthcare
Big Data to Artificial Intelligence in Healthcare
jetweedy
 

Andere mochten auch (14)

MachineLearning.ppt
MachineLearning.pptMachineLearning.ppt
MachineLearning.ppt
 
Machine Learning for Dummies
Machine Learning for DummiesMachine Learning for Dummies
Machine Learning for Dummies
 
Machine Learning presentation.
Machine Learning presentation.Machine Learning presentation.
Machine Learning presentation.
 
An introduction to Machine Learning
An introduction to Machine LearningAn introduction to Machine Learning
An introduction to Machine Learning
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Basics of Machine Learning
Basics of Machine LearningBasics of Machine Learning
Basics of Machine Learning
 
IBM Watson Health: How cognitive technologies have begun transforming clinica...
IBM Watson Health: How cognitive technologies have begun transforming clinica...IBM Watson Health: How cognitive technologies have begun transforming clinica...
IBM Watson Health: How cognitive technologies have begun transforming clinica...
 
Big Data to Artificial Intelligence in Healthcare
Big Data to Artificial Intelligence in HealthcareBig Data to Artificial Intelligence in Healthcare
Big Data to Artificial Intelligence in Healthcare
 
The Hive Think Tank: Unpacking AI for Healthcare
The Hive Think Tank: Unpacking AI for Healthcare The Hive Think Tank: Unpacking AI for Healthcare
The Hive Think Tank: Unpacking AI for Healthcare
 
IBM Watson for Healthcare
IBM Watson for HealthcareIBM Watson for Healthcare
IBM Watson for Healthcare
 
IBM Watson in Healthcare
IBM Watson in HealthcareIBM Watson in Healthcare
IBM Watson in Healthcare
 
Big Data & Artificial Intelligence
Big Data & Artificial IntelligenceBig Data & Artificial Intelligence
Big Data & Artificial Intelligence
 
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
IBM Watson: How it Works, and What it means for Society beyond winning Jeopardy!
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine Learning
 

Ähnlich wie Predictive Analytics - Big Data & Artificial Intelligence

Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
Vamshikrishna Goud
 

Ähnlich wie Predictive Analytics - Big Data & Artificial Intelligence (20)

Predictive Analytics World Chicago 2015
Predictive Analytics World Chicago 2015Predictive Analytics World Chicago 2015
Predictive Analytics World Chicago 2015
 
Advanced Analytics for Any Data at Real-Time Speed
Advanced Analytics for Any Data at Real-Time SpeedAdvanced Analytics for Any Data at Real-Time Speed
Advanced Analytics for Any Data at Real-Time Speed
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 
AI in the Enterprise at Scale
AI in the Enterprise at ScaleAI in the Enterprise at Scale
AI in the Enterprise at Scale
 
Dell AI and HPC University Roadshow
Dell AI and HPC University RoadshowDell AI and HPC University Roadshow
Dell AI and HPC University Roadshow
 
SuanIct-Bigdata desktop-final
SuanIct-Bigdata desktop-finalSuanIct-Bigdata desktop-final
SuanIct-Bigdata desktop-final
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
 
Workshop_Presentation.pptx
Workshop_Presentation.pptxWorkshop_Presentation.pptx
Workshop_Presentation.pptx
 
Predictive modelling with azure ml
Predictive modelling with azure mlPredictive modelling with azure ml
Predictive modelling with azure ml
 
Database Shootout: What's best for BI?
Database Shootout: What's best for BI?Database Shootout: What's best for BI?
Database Shootout: What's best for BI?
 
Big data Introduction by Mohan
Big data Introduction by MohanBig data Introduction by Mohan
Big data Introduction by Mohan
 
Big data Analytics
Big data Analytics Big data Analytics
Big data Analytics
 
Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture
 
Big Data - A Real Life Revolution
Big Data - A Real Life RevolutionBig Data - A Real Life Revolution
Big Data - A Real Life Revolution
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion
 
Introduction to Big Data and its Trends
Introduction to Big Data and its TrendsIntroduction to Big Data and its Trends
Introduction to Big Data and its Trends
 
Internet of Things: Lightning Round, Hite
Internet of Things: Lightning Round, HiteInternet of Things: Lightning Round, Hite
Internet of Things: Lightning Round, Hite
 
CS8091_BDA_Unit_I_Analytical_Architecture
CS8091_BDA_Unit_I_Analytical_ArchitectureCS8091_BDA_Unit_I_Analytical_Architecture
CS8091_BDA_Unit_I_Analytical_Architecture
 

Mehr von Manish Jain

Mehr von Manish Jain (7)

DeFi 101
DeFi 101DeFi 101
DeFi 101
 
Cookbook for Building An App
Cookbook for Building An AppCookbook for Building An App
Cookbook for Building An App
 
Startup Engineering Cookbook for Mobile Apps
Startup Engineering Cookbook for Mobile AppsStartup Engineering Cookbook for Mobile Apps
Startup Engineering Cookbook for Mobile Apps
 
Startup Engineering Cookbook
Startup Engineering CookbookStartup Engineering Cookbook
Startup Engineering Cookbook
 
Installing WordPress on AWS
Installing WordPress on AWSInstalling WordPress on AWS
Installing WordPress on AWS
 
10 Things about Aadhaar
10 Things about Aadhaar10 Things about Aadhaar
10 Things about Aadhaar
 
The Road to Financial Freedom
The Road to Financial FreedomThe Road to Financial Freedom
The Road to Financial Freedom
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Predictive Analytics - Big Data & Artificial Intelligence