SlideShare a Scribd company logo
1 of 30
Download to read offline
Open Data, Big Data
and Machine Learning
Steven Van Vaerenbergh
Universidad de Cantabria
May 31, 2016
#EMWeek16 - Santander
About me
Researcher in machine learning
gtas.unican.es/people/steven
Open Data, Big Data and Machine Learning 2
twitter.com/steven2358
Steven Van Vaerenbergh
1. Open Data
Open Data, Big Data and Machine Learning 3Steven Van Vaerenbergh
Denmark’s Open Address Data Set
• Making public data
“free of charge”
Open Data, Big Data and Machine Learning 4
Period Benefits Costs Return on
Investment
2004-2009 (including
setup)
>€60M ~€2M 22:1
2010 (steady state) ~€14M €0.2M 70:1
Source: http://odimpact.org/static/files/case-study-denmark.pdf
Steven Van Vaerenbergh
Open Data, Big Data and Machine Learning 5Steven Van Vaerenbergh
Steven Van Vaerenbergh Open Data, Big Data and Machine Learning 6
Open Data in Santander
• Santander Datos Abiertos http://datos.santander.es/
• FIWARE lab: https://www.fiware.org/lab/
• FIWARE Academy: http://edu.fiware.org
Open Data, Big Data and Machine Learning 7Steven Van Vaerenbergh
Open data
• “A data set is open if it is available under a free
license to everyone”.
• Providers: Governments, public services,
companies, individuals.
• Tendency: Many data providers stop making apps
and leave this to third parties.
Open data improves transparency
Not all data should be open though (privacy)
Open Data, Big Data and Machine Learning 8Steven Van Vaerenbergh
2. Big Data
Open Data, Big Data and Machine Learning 9Steven Van Vaerenbergh
Big Data
• Scientific definition: “Data sets that are so large
that traditional data processing techniques cannot
be applied to them”.
• Terabytes, Petabytes, Exabytes, etc.
• “Big Data” is also used to refer to novel analysis
techniques for such data.
• Typically not open data.
Open Data, Big Data and Machine Learning 10Steven Van Vaerenbergh
Big Data = Data Science with Lots of Data
Source: http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram
Open Data, Big Data and Machine Learning 11Steven Van Vaerenbergh
Big Data
• Many frameworks are being developed:
• Apache Hadoop
• Apache Mahout
• NoSQL
• Caution: The science behind big data is in its infancy.
E.g. most methods are not able to produce error
bars, which is paramount in many applications.
Open Data, Big Data and Machine Learning 12Steven Van Vaerenbergh
Big Data
• Media and press often use “big data” to refer to
data science even if the amount of data is relatively
small.
 “Big data” is often simply a marketing term.
Open Data, Big Data and Machine Learning 13Steven Van Vaerenbergh
Big Data = Data Science with Lots of Data
Source: http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram
Open Data, Big Data and Machine Learning 14
?
Steven Van Vaerenbergh
3. Machine Learning
Open Data, Big Data and Machine Learning 15Steven Van Vaerenbergh
Traditional Machine Intelligence
• Example: decision tree for determining access
Program consists of a set of rules (logic)
Open Data, Big Data and Machine Learning 16
Input: age, gender, occupation,… Permission to enter Juanito’s tree house?
Yes No No
Steven Van Vaerenbergh
Traditional Machine Intelligence
• Example: decision tree for digit recognition
Set of rules is very hard to design by hand
Open Data, Big Data and Machine Learning 17
Input: images (MNIST) Which digit is represented?
Steven Van Vaerenbergh
Traditional Machine Intelligence
• Example: decision tree for image recognition
Set of rules is impossible to design by hand
Open Data, Big Data and Machine Learning 18
Input: images (CIFAR10) What does the image represent?
Correct
answer
?
Steven Van Vaerenbergh
Machine Learning
• Solution: Let the program itself determine its
internal set of rules.
• Provide the program with inputs and correct
answers for these rules, and let it “learn”.
“Machine Learning is the study of
computer algorithms that
improve their performance
on a task automatically
through experience.”
- Tom Mitchell
Open Data, Big Data and Machine Learning 19Steven Van Vaerenbergh
Open Data, Big Data and Machine Learning 20
Traditional Machine Intelligence
Computer
Input
Program
Output
Machine Learning (ML)
ML algorithm
Input
Output
Program
Steven Van Vaerenbergh
Machine Learning Applications
• Spam filters detect unsolicited emails
Open Data, Big Data and Machine Learning 21
SPAM
Steven Van Vaerenbergh
Machine Learning Applications
• Biomedicine: pattern detection in images
Open Data, Big Data and Machine Learning 22Steven Van Vaerenbergh
Machine Learning Applications
• Computer Vision: Kinect body tracking
Open Data, Big Data and Machine Learning 23Steven Van Vaerenbergh
Machine Learning Applications
• Natural Language Processing (NLP)
Open Data, Big Data and Machine Learning 24Steven Van Vaerenbergh
Machine Learning Applications
1996: IBM’s Deep Blue
(Chess)
• Intelligence based on
manually-entered rules
2016: Google Deepmind’s
AlphaGo (Go)
• Program learns
autonomously
Open Data, Big Data and Machine Learning 25Steven Van Vaerenbergh
Machine Learning Applications
• Human activity recognition
Open Data, Big Data and Machine Learning 26
Running
Walking
Steven Van Vaerenbergh
Internal representation
How to represent the function from input to output?
• Neural networks
• Support vector machines
• Sets of rules / Logic programs
• Bayes/Markov nets
• Model ensembles
• Decision trees
• Etc.
Neural net demo: http://playground.tensorflow.org/
Open Data, Big Data and Machine Learning 27Steven Van Vaerenbergh
Tools and Frameworks
• Machine learning toolkits:
• Scikit Learn (Python) http://scikit-learn.org/
• Weka (Java) http://www.cs.waikato.ac.nz/ml/weka/
• Shogun http://www.shogun-toolbox.org/
• Cloud-based machine learning
• IBM Watson https://developer.ibm.com/watson/
• Amazon ML https://aws.amazon.com/machine-learning/
• Microsoft Azure ML https://azure.microsoft.com/en-
us/services/machine-learning/
• Google Cloud ML
https://cloud.google.com/products/machine-learning/
Open Data, Big Data and Machine Learning 28Steven Van Vaerenbergh
Takeaways
• Open data, big data and machine learning are
components of the current technological wave that
resembles an industrial revolution.
• Big data requires a rigorous scientific engineering
framework that is currently unfinished.
• Machine learning algorithms create intelligent
programs by automatically learning from example
data.
Open Data, Big Data and Machine Learning 29Steven Van Vaerenbergh
Join us on Meetup
Meetup group for people
in Santander & Cantabria
interested in everything
related to data science
www.meetup.com/Data-Science-Santander
Open Data, Big Data and Machine Learning 30Steven Van Vaerenbergh

More Related Content

What's hot

[Webinar] How Big Data and Machine Learning Are Transforming ITSM
[Webinar] How Big Data and Machine Learning Are Transforming ITSM[Webinar] How Big Data and Machine Learning Are Transforming ITSM
[Webinar] How Big Data and Machine Learning Are Transforming ITSMSunView Software, Inc.
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learningPruet Boonma
 
Creating a Data Science Ecosystem for Scientific, Societal and Educational Im...
Creating a Data Science Ecosystem for Scientific, Societal and Educational Im...Creating a Data Science Ecosystem for Scientific, Societal and Educational Im...
Creating a Data Science Ecosystem for Scientific, Societal and Educational Im...Ilkay Altintas, Ph.D.
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloOCTO Technology
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big DataDataWorks Summit
 
Machine Learning Introduction for Digital Business Leaders
Machine Learning Introduction for Digital Business LeadersMachine Learning Introduction for Digital Business Leaders
Machine Learning Introduction for Digital Business LeadersSudha Jamthe
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRaveen Perera
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI dayMohammed Barakat
 
Data science presentation
Data science presentationData science presentation
Data science presentationMSDEVMTL
 
Programming for data science in python
Programming for data science in pythonProgramming for data science in python
Programming for data science in pythonUmmeSalmaM1
 
Introduction to data science intro,ch(1,2,3)
Introduction to data science intro,ch(1,2,3)Introduction to data science intro,ch(1,2,3)
Introduction to data science intro,ch(1,2,3)heba_ahmad
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceSampath Kumar
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data ScienceSpotle.ai
 
Big Data and Data Science: The Technologies Shaping Our Lives
Big Data and Data Science: The Technologies Shaping Our LivesBig Data and Data Science: The Technologies Shaping Our Lives
Big Data and Data Science: The Technologies Shaping Our LivesRukshan Batuwita
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceANOOP V S
 
Self Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxShanmugasundaram M
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceCaserta
 

What's hot (20)

[Webinar] How Big Data and Machine Learning Are Transforming ITSM
[Webinar] How Big Data and Machine Learning Are Transforming ITSM[Webinar] How Big Data and Machine Learning Are Transforming ITSM
[Webinar] How Big Data and Machine Learning Are Transforming ITSM
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Creating a Data Science Ecosystem for Scientific, Societal and Educational Im...
Creating a Data Science Ecosystem for Scientific, Societal and Educational Im...Creating a Data Science Ecosystem for Scientific, Societal and Educational Im...
Creating a Data Science Ecosystem for Scientific, Societal and Educational Im...
 
Big Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao PauloBig Data & Machine Learning - TDC2013 Sao Paulo
Big Data & Machine Learning - TDC2013 Sao Paulo
 
Machine Learning in Big Data
Machine Learning in Big DataMachine Learning in Big Data
Machine Learning in Big Data
 
Machine Learning Introduction for Digital Business Leaders
Machine Learning Introduction for Digital Business LeadersMachine Learning Introduction for Digital Business Leaders
Machine Learning Introduction for Digital Business Leaders
 
Unit 3 part 2
Unit  3 part 2Unit  3 part 2
Unit 3 part 2
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI day
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Programming for data science in python
Programming for data science in pythonProgramming for data science in python
Programming for data science in python
 
Introduction to data science intro,ch(1,2,3)
Introduction to data science intro,ch(1,2,3)Introduction to data science intro,ch(1,2,3)
Introduction to data science intro,ch(1,2,3)
 
Data Science
Data ScienceData Science
Data Science
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
 
Big Data and Data Science: The Technologies Shaping Our Lives
Big Data and Data Science: The Technologies Shaping Our LivesBig Data and Data Science: The Technologies Shaping Our Lives
Big Data and Data Science: The Technologies Shaping Our Lives
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Self Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docx
 
Intro to Data Science by DatalentTeam at Data Science Clinic#11
Intro to Data Science by DatalentTeam at Data Science Clinic#11Intro to Data Science by DatalentTeam at Data Science Clinic#11
Intro to Data Science by DatalentTeam at Data Science Clinic#11
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 

Viewers also liked

Big data, Machine learning and the Auditor
Big data, Machine learning and the AuditorBig data, Machine learning and the Auditor
Big data, Machine learning and the AuditorBharath Rao
 
Big Data and Machine Learning
Big Data and Machine LearningBig Data and Machine Learning
Big Data and Machine LearningMichel Bruley
 
Big data and machine learning / Gil Chamiel
Big data and machine learning / Gil Chamiel   Big data and machine learning / Gil Chamiel
Big data and machine learning / Gil Chamiel geektimecoil
 
Machine learning for dummies
Machine learning for dummiesMachine learning for dummies
Machine learning for dummiesAlexandre Uehara
 
An introduction to Machine Learning (and a little bit of Deep Learning)
An introduction to Machine Learning (and a little bit of Deep Learning)An introduction to Machine Learning (and a little bit of Deep Learning)
An introduction to Machine Learning (and a little bit of Deep Learning)Thomas da Silva Paula
 
Machine Learning for Recommender Systems MLSS 2015 Sydney
Machine Learning for Recommender Systems MLSS 2015 SydneyMachine Learning for Recommender Systems MLSS 2015 Sydney
Machine Learning for Recommender Systems MLSS 2015 SydneyAlexandros Karatzoglou
 
Machine Learning on Big Data
Machine Learning on Big DataMachine Learning on Big Data
Machine Learning on Big DataMax Lin
 
Pybcn machine learning for dummies with python
Pybcn machine learning for dummies with pythonPybcn machine learning for dummies with python
Pybcn machine learning for dummies with pythonJavier Arias Losada
 
QCon Rio - Machine Learning for Everyone
QCon Rio - Machine Learning for EveryoneQCon Rio - Machine Learning for Everyone
QCon Rio - Machine Learning for EveryoneDhiana Deva
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningLior Rokach
 
Introduction to Machine Learning and Deep Learning
Introduction to Machine Learning and Deep LearningIntroduction to Machine Learning and Deep Learning
Introduction to Machine Learning and Deep LearningTerry Taewoong Um
 
Machine learning ~ Forecasting
Machine learning ~ ForecastingMachine learning ~ Forecasting
Machine learning ~ ForecastingShaswat Mandhanya
 
[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程台灣資料科學年會
 

Viewers also liked (13)

Big data, Machine learning and the Auditor
Big data, Machine learning and the AuditorBig data, Machine learning and the Auditor
Big data, Machine learning and the Auditor
 
Big Data and Machine Learning
Big Data and Machine LearningBig Data and Machine Learning
Big Data and Machine Learning
 
Big data and machine learning / Gil Chamiel
Big data and machine learning / Gil Chamiel   Big data and machine learning / Gil Chamiel
Big data and machine learning / Gil Chamiel
 
Machine learning for dummies
Machine learning for dummiesMachine learning for dummies
Machine learning for dummies
 
An introduction to Machine Learning (and a little bit of Deep Learning)
An introduction to Machine Learning (and a little bit of Deep Learning)An introduction to Machine Learning (and a little bit of Deep Learning)
An introduction to Machine Learning (and a little bit of Deep Learning)
 
Machine Learning for Recommender Systems MLSS 2015 Sydney
Machine Learning for Recommender Systems MLSS 2015 SydneyMachine Learning for Recommender Systems MLSS 2015 Sydney
Machine Learning for Recommender Systems MLSS 2015 Sydney
 
Machine Learning on Big Data
Machine Learning on Big DataMachine Learning on Big Data
Machine Learning on Big Data
 
Pybcn machine learning for dummies with python
Pybcn machine learning for dummies with pythonPybcn machine learning for dummies with python
Pybcn machine learning for dummies with python
 
QCon Rio - Machine Learning for Everyone
QCon Rio - Machine Learning for EveryoneQCon Rio - Machine Learning for Everyone
QCon Rio - Machine Learning for Everyone
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Introduction to Machine Learning and Deep Learning
Introduction to Machine Learning and Deep LearningIntroduction to Machine Learning and Deep Learning
Introduction to Machine Learning and Deep Learning
 
Machine learning ~ Forecasting
Machine learning ~ ForecastingMachine learning ~ Forecasting
Machine learning ~ Forecasting
 
[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程[系列活動] Machine Learning 機器學習課程
[系列活動] Machine Learning 機器學習課程
 

Similar to Open Data, Big Data and Machine Learning

Bigdata and Hadoop with applications
Bigdata and Hadoop with applicationsBigdata and Hadoop with applications
Bigdata and Hadoop with applicationsPadma Metta
 
1345 track 1 chen_using our laptop
1345 track 1 chen_using our laptop1345 track 1 chen_using our laptop
1345 track 1 chen_using our laptopRising Media, Inc.
 
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)Betacowork
 
AI for Software Engineering:
Research & Innovation
AI for Software Engineering:
Research & InnovationAI for Software Engineering:
Research & Innovation
AI for Software Engineering:
Research & InnovationOleksandr Zaitsev
 
Data Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAData Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAjaved75
 
Data science / Big Data
Data science / Big DataData science / Big Data
Data science / Big DataYasas Senarath
 
Beltug philippe van impe - opendata
Beltug   philippe van impe - opendataBeltug   philippe van impe - opendata
Beltug philippe van impe - opendataDigitYser
 
How to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceHow to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceJuuso Parkkinen
 
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...DATAVERSITY
 
Bringing Cities, Libraries and Citizens Together through Open Data Hackathons
Bringing Cities, Libraries and Citizens Together through Open Data HackathonsBringing Cities, Libraries and Citizens Together through Open Data Hackathons
Bringing Cities, Libraries and Citizens Together through Open Data Hackathonsacecarruthers
 
Data Mining Intro
Data Mining IntroData Mining Intro
Data Mining IntroAsma CHERIF
 
Filth and lies: analysing social media
Filth and lies: analysing social mediaFilth and lies: analysing social media
Filth and lies: analysing social mediaDiana Maynard
 
MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.stelligence
 
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...BigData_Europe
 
SuanIct-Bigdata desktop-final
SuanIct-Bigdata desktop-finalSuanIct-Bigdata desktop-final
SuanIct-Bigdata desktop-finalstelligence
 
predictive analysis and usage in procurement ppt 2017
predictive analysis and usage in procurement  ppt 2017predictive analysis and usage in procurement  ppt 2017
predictive analysis and usage in procurement ppt 2017Prashant Bhatmule
 
Seminaire bigdata23102014
Seminaire bigdata23102014Seminaire bigdata23102014
Seminaire bigdata23102014Raja Chiky
 
How to make use of social media data to support marketing decisions - Vleric...
How to make use of social media data to support marketing decisions -  Vleric...How to make use of social media data to support marketing decisions -  Vleric...
How to make use of social media data to support marketing decisions - Vleric...Ayman van Bregt
 

Similar to Open Data, Big Data and Machine Learning (20)

Bigdata and Hadoop with applications
Bigdata and Hadoop with applicationsBigdata and Hadoop with applications
Bigdata and Hadoop with applications
 
Data Mining With Big Data
Data Mining With Big DataData Mining With Big Data
Data Mining With Big Data
 
1345 track 1 chen_using our laptop
1345 track 1 chen_using our laptop1345 track 1 chen_using our laptop
1345 track 1 chen_using our laptop
 
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)
Course 1 - Introduction to Big Data by Toon Vanagt ( #BigDataBXL)
 
AI for Software Engineering:
Research & Innovation
AI for Software Engineering:
Research & InnovationAI for Software Engineering:
Research & Innovation
AI for Software Engineering:
Research & Innovation
 
Data Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAData Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATA
 
Big Data
Big DataBig Data
Big Data
 
Data science / Big Data
Data science / Big DataData science / Big Data
Data science / Big Data
 
Beltug philippe van impe - opendata
Beltug   philippe van impe - opendataBeltug   philippe van impe - opendata
Beltug philippe van impe - opendata
 
How to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceHow to Prepare for a Career in Data Science
How to Prepare for a Career in Data Science
 
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
SmartData Webinar Slides: How to analyze 72 billion messages a day to find tr...
 
Bringing Cities, Libraries and Citizens Together through Open Data Hackathons
Bringing Cities, Libraries and Citizens Together through Open Data HackathonsBringing Cities, Libraries and Citizens Together through Open Data Hackathons
Bringing Cities, Libraries and Citizens Together through Open Data Hackathons
 
Data Mining Intro
Data Mining IntroData Mining Intro
Data Mining Intro
 
Filth and lies: analysing social media
Filth and lies: analysing social mediaFilth and lies: analysing social media
Filth and lies: analysing social media
 
MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.
 
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
 
SuanIct-Bigdata desktop-final
SuanIct-Bigdata desktop-finalSuanIct-Bigdata desktop-final
SuanIct-Bigdata desktop-final
 
predictive analysis and usage in procurement ppt 2017
predictive analysis and usage in procurement  ppt 2017predictive analysis and usage in procurement  ppt 2017
predictive analysis and usage in procurement ppt 2017
 
Seminaire bigdata23102014
Seminaire bigdata23102014Seminaire bigdata23102014
Seminaire bigdata23102014
 
How to make use of social media data to support marketing decisions - Vleric...
How to make use of social media data to support marketing decisions -  Vleric...How to make use of social media data to support marketing decisions -  Vleric...
How to make use of social media data to support marketing decisions - Vleric...
 

Recently uploaded

Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 

Recently uploaded (20)

Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 

Open Data, Big Data and Machine Learning

  • 1. Open Data, Big Data and Machine Learning Steven Van Vaerenbergh Universidad de Cantabria May 31, 2016 #EMWeek16 - Santander
  • 2. About me Researcher in machine learning gtas.unican.es/people/steven Open Data, Big Data and Machine Learning 2 twitter.com/steven2358 Steven Van Vaerenbergh
  • 3. 1. Open Data Open Data, Big Data and Machine Learning 3Steven Van Vaerenbergh
  • 4. Denmark’s Open Address Data Set • Making public data “free of charge” Open Data, Big Data and Machine Learning 4 Period Benefits Costs Return on Investment 2004-2009 (including setup) >€60M ~€2M 22:1 2010 (steady state) ~€14M €0.2M 70:1 Source: http://odimpact.org/static/files/case-study-denmark.pdf Steven Van Vaerenbergh
  • 5. Open Data, Big Data and Machine Learning 5Steven Van Vaerenbergh
  • 6. Steven Van Vaerenbergh Open Data, Big Data and Machine Learning 6
  • 7. Open Data in Santander • Santander Datos Abiertos http://datos.santander.es/ • FIWARE lab: https://www.fiware.org/lab/ • FIWARE Academy: http://edu.fiware.org Open Data, Big Data and Machine Learning 7Steven Van Vaerenbergh
  • 8. Open data • “A data set is open if it is available under a free license to everyone”. • Providers: Governments, public services, companies, individuals. • Tendency: Many data providers stop making apps and leave this to third parties. Open data improves transparency Not all data should be open though (privacy) Open Data, Big Data and Machine Learning 8Steven Van Vaerenbergh
  • 9. 2. Big Data Open Data, Big Data and Machine Learning 9Steven Van Vaerenbergh
  • 10. Big Data • Scientific definition: “Data sets that are so large that traditional data processing techniques cannot be applied to them”. • Terabytes, Petabytes, Exabytes, etc. • “Big Data” is also used to refer to novel analysis techniques for such data. • Typically not open data. Open Data, Big Data and Machine Learning 10Steven Van Vaerenbergh
  • 11. Big Data = Data Science with Lots of Data Source: http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram Open Data, Big Data and Machine Learning 11Steven Van Vaerenbergh
  • 12. Big Data • Many frameworks are being developed: • Apache Hadoop • Apache Mahout • NoSQL • Caution: The science behind big data is in its infancy. E.g. most methods are not able to produce error bars, which is paramount in many applications. Open Data, Big Data and Machine Learning 12Steven Van Vaerenbergh
  • 13. Big Data • Media and press often use “big data” to refer to data science even if the amount of data is relatively small.  “Big data” is often simply a marketing term. Open Data, Big Data and Machine Learning 13Steven Van Vaerenbergh
  • 14. Big Data = Data Science with Lots of Data Source: http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram Open Data, Big Data and Machine Learning 14 ? Steven Van Vaerenbergh
  • 15. 3. Machine Learning Open Data, Big Data and Machine Learning 15Steven Van Vaerenbergh
  • 16. Traditional Machine Intelligence • Example: decision tree for determining access Program consists of a set of rules (logic) Open Data, Big Data and Machine Learning 16 Input: age, gender, occupation,… Permission to enter Juanito’s tree house? Yes No No Steven Van Vaerenbergh
  • 17. Traditional Machine Intelligence • Example: decision tree for digit recognition Set of rules is very hard to design by hand Open Data, Big Data and Machine Learning 17 Input: images (MNIST) Which digit is represented? Steven Van Vaerenbergh
  • 18. Traditional Machine Intelligence • Example: decision tree for image recognition Set of rules is impossible to design by hand Open Data, Big Data and Machine Learning 18 Input: images (CIFAR10) What does the image represent? Correct answer ? Steven Van Vaerenbergh
  • 19. Machine Learning • Solution: Let the program itself determine its internal set of rules. • Provide the program with inputs and correct answers for these rules, and let it “learn”. “Machine Learning is the study of computer algorithms that improve their performance on a task automatically through experience.” - Tom Mitchell Open Data, Big Data and Machine Learning 19Steven Van Vaerenbergh
  • 20. Open Data, Big Data and Machine Learning 20 Traditional Machine Intelligence Computer Input Program Output Machine Learning (ML) ML algorithm Input Output Program Steven Van Vaerenbergh
  • 21. Machine Learning Applications • Spam filters detect unsolicited emails Open Data, Big Data and Machine Learning 21 SPAM Steven Van Vaerenbergh
  • 22. Machine Learning Applications • Biomedicine: pattern detection in images Open Data, Big Data and Machine Learning 22Steven Van Vaerenbergh
  • 23. Machine Learning Applications • Computer Vision: Kinect body tracking Open Data, Big Data and Machine Learning 23Steven Van Vaerenbergh
  • 24. Machine Learning Applications • Natural Language Processing (NLP) Open Data, Big Data and Machine Learning 24Steven Van Vaerenbergh
  • 25. Machine Learning Applications 1996: IBM’s Deep Blue (Chess) • Intelligence based on manually-entered rules 2016: Google Deepmind’s AlphaGo (Go) • Program learns autonomously Open Data, Big Data and Machine Learning 25Steven Van Vaerenbergh
  • 26. Machine Learning Applications • Human activity recognition Open Data, Big Data and Machine Learning 26 Running Walking Steven Van Vaerenbergh
  • 27. Internal representation How to represent the function from input to output? • Neural networks • Support vector machines • Sets of rules / Logic programs • Bayes/Markov nets • Model ensembles • Decision trees • Etc. Neural net demo: http://playground.tensorflow.org/ Open Data, Big Data and Machine Learning 27Steven Van Vaerenbergh
  • 28. Tools and Frameworks • Machine learning toolkits: • Scikit Learn (Python) http://scikit-learn.org/ • Weka (Java) http://www.cs.waikato.ac.nz/ml/weka/ • Shogun http://www.shogun-toolbox.org/ • Cloud-based machine learning • IBM Watson https://developer.ibm.com/watson/ • Amazon ML https://aws.amazon.com/machine-learning/ • Microsoft Azure ML https://azure.microsoft.com/en- us/services/machine-learning/ • Google Cloud ML https://cloud.google.com/products/machine-learning/ Open Data, Big Data and Machine Learning 28Steven Van Vaerenbergh
  • 29. Takeaways • Open data, big data and machine learning are components of the current technological wave that resembles an industrial revolution. • Big data requires a rigorous scientific engineering framework that is currently unfinished. • Machine learning algorithms create intelligent programs by automatically learning from example data. Open Data, Big Data and Machine Learning 29Steven Van Vaerenbergh
  • 30. Join us on Meetup Meetup group for people in Santander & Cantabria interested in everything related to data science www.meetup.com/Data-Science-Santander Open Data, Big Data and Machine Learning 30Steven Van Vaerenbergh