3. #sqlsatistanbul
Agenda
3 Trends about Future
What’s IoT
Introduction to Big Data
Welcome to Data World!
Machine Learning Overview
AzureML Strengths
Microsoft Azure Overview
Setting Up An AzureML Workspace
Exploring AzureML Studio
Azure ML API Services
4. #sqlsatistanbul
Target Audience
Data Professional
Developer
Business Intelligence
Trend Hunter
Programming
Probability / Statistics
Calculus
Database
Minimal Prerequsities
6. #sqlsatistanbul
"Internet Of Thing" connects the physical world to the Internet
Within 6 years, 60.000 new
"things" will be added to the
Internet every 1 second!!!
212 billion devices will be part
of the Internet of Things by
2020 (IDC)
IDC predicts that IoT will
generate nearly $9 Trillion in
annual sales by 2020
7. #sqlsatistanbul
IoT starts to be more visible in our everyday lives.
A Car produced in 2014 has an average of 60 - 100 built-
in sensors
The Internet of Things isn't coming soon. It's already
here...
IoT will soon be everywhere. Like that:
• Home (Consumer)
• Fashion (Style)
• Transport (Traffic)
• Retail (Inventory)
• Logistic (Real Time)
• Manufacturing (M2M)
• Cities (Industry)
• Health (Body)
• Environment (Prevention)
• Building (Infrastructure)
• Agriculture (Control)
• Security (Detection)
9. #sqlsatistanbul
What is Big Data?
Data that is too large or complex for analysis in
traditional relational databases
Typified by the “3 V’s”:
Volume – Huge amounts of data to process
Variety – A mixture of structured and unstructured data
Velocity – New data generated extremely frequently
Web server log reporting Social media sentiment analysis Sensor anomaly detection
10. #sqlsatistanbul
Big Data Technologies
Hadoop
Open source distributed data processing cluster
Data processed in Hadoop Distributed File System
(HDFS)
Related projects
Hive
Pig
HCatalog
Oozie
Sqoop
Others
HDFS
Name Node Data Nodes
Hadoop Cluster
11. #sqlsatistanbul
Map Reduce
1. Source data is divided
among data nodes
2. Map phase generates
key/value pairs
3. Reduce phase
aggregates values for
each key
Lorem ipsum sit amet magma sit elit
Fusce magna sed sit amet magna
Key Value
Lorem 1
ipsum 1
sit 1
amet 1
magma 1
sit 1
elit 1
Key Value
Fusce 1
magma 1
sed 1
sit 1
amet 1
magma 1
Key Value
Lorem 1
ipsum 1
sit 3
amet 2
magma 3
elit 1
Fusce 1
sed 1
MAPREDUCE
14. #sqlsatistanbul
1980
(«Neuro»)
1990
(«Symbolic»)
2000
(«Kernel Machines»)
2005
(«Graphical Models»)
2011
(«Big Data, DNN»)
Neural
Networks
Artificial
Intelligence
Learning =
Adaptation of
Neurons
based on
External
Stimuli
Expert
Systems
Decision
Tree
Learning
Learning =
Methods to
automaticaly
build Expert
Systems
Statistical Learning
Theory
Scoring Systems
Learning =
Optimization of
Convex Functions
Wide application in
products
Statistical Modeling
of Data
Learning =
Parameter
Estimation or
Inference
Distributed
computing and
storage
Deep Neural
Networks
Learning = Scalable,
Adaptive,
Computation for
Various Big Data
History
15. #sqlsatistanbul
Azure
Machine
Learning
allows us to
Solve extremely hard problems better
Extract more value from Big Data
Drive a shift in Data Analytics
The goal of machine learning is to program computers to use example data or past
experience to solve a given problem - Introduction to Machine Learning, 2nd
Edition, MIT Press
18. #sqlsatistanbul
• Accessible through a web browser, no
software to install
• Collaborative, work with anyone,
anywhere via Azure workspace
• Visual composition with end2end
support for data science workflow
• Extensible, support for R OSS