The Art of Intelligence
A Practical Introduction to Machine Learning
50 Shades of Data 1
Lucas Jellema, CTO of AMIS
ODevC Yatra, July 2018
Lucas Jellema
Architect / Developer
1994 started in IT at Oracle
2002 joined AMIS
Currently CTO & Solution Architect
Presenting
• Oracle OpenWorld
• JavaOne
• Oracle Code
• Devoxx
• Java and Oracle User Group meetups
• Java Rockstar (JavaOne 2015)
• OTN Yatra 2013
• ODevC Yatra 2018
Writing
• Blogs at http://technology.amis.nl
• 1500 articles – from UI to Middle Tier, Database and Infrastructure
• Articles at Medium, DZone and Oracle Technology Network
• Books for McGraw Hill (Oracle Press)
• Oracle ACE Director & Developer Champion
From The Netherlands
X = [X1,X2,X3,…,XN]
AGENDA
• What is Machine Learning?
• Why could it be relevant [to you]?
• What does it entail?
• With which algorithms, tools and technologies?
• Oracle and Machine Learning?
• How do you embark on Machine Learning?
• Hands-on
• Functional/non-technical
• Technical
LEARNING
• How do we learn?
• Try something (else) => get feedback => learn
• Eventually:
• We get it (understanding) so we can predict the outcome
of a certain action in a new situation
• Or we have experienced enough situations to predict
the outcome in most situations with high confidence
• Through interpolation, extrapolation, etc.
• We remain clueless
MACHINE LEARNING
• Analyze Historical Data (input and result – training set) to discover
Patterns & Models
• Iteratively apply Models to [additional] Input (test set) and compare
model outcome with known actual result to improve the model
• Use Model to predict outcome for entirely new data
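The three steps above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the data points are made up, and a straight-line fit (ordinary least squares) stands in for whatever model a real project would select.

```python
# Toy historical data: (input, known result). Values are invented for illustration.
history = [(1, 3.1), (2, 4.9), (3, 7.2), (4, 9.0),
           (5, 10.8), (6, 13.1), (7, 15.0), (8, 16.9)]

# Hold back part of the known data to check the model afterwards.
train, test = history[:6], history[6:]

def fit_line(points):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def predict(model, x):
    slope, intercept = model
    return slope * x + intercept

def mean_squared_error(model, points):
    """Average squared gap between model outcome and known actual result."""
    return sum((predict(model, x) - y) ** 2 for x, y in points) / len(points)

model = fit_line(train)                      # 1. discover the pattern
test_error = mean_squared_error(model, test) # 2. compare with known results
new_prediction = predict(model, 10)          # 3. predict for entirely new data

print(f"slope={model[0]:.2f} intercept={model[1]:.2f} test MSE={test_error:.4f}")
print(f"prediction for x=10: {new_prediction:.1f}")
```

A low error on the held-back test set is what justifies trusting the prediction for an input the model has never seen.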
WHY IS IT RELEVANT (NOW)?
• Data
• big, fast, open
• Machine Learning has become feasible
and accessible
• Available
• Affordable (software & hardware)
• Doable (Citizen Data Scientist)
• Fast enough
• Business Cases & Opportunities => Demands
• End users, Consumers, Competitive pressure, Society
WHY IS IT RELEVANT (NOW)?
GARTNER – STRATEGIC
TECHNOLOGY TRENDS 2018
EXAMPLE USE CASES
• Speech recognition
• Identify churn candidates
• Intent & Sentiment analysis on social media
• Upsell & Cross Sell
• Target Marketing
• Customer Service
• Chat bots & voice response systems
• Predictive Maintenance
• Gaming
• Captcha
• Medical Diagnosis
• Anomaly Detection (find the odd one out)
• Autonomous Cars
• Voter Segment Analysis
• Customer Recommendations
• Smart Data Capture
• Face Detection
• Fraud Prevention
• (really good) OCR
• Traffic light control
• Navigation
• Should we investigate | do lab test?
• Spam filtering
• Propose friends | contacts
• Troll detection
• Auto correct
• Photo Tagging and Album organization
READY-TO-RUN ML APPS
Someone else selected, configured and trained an ML model
and makes it available for you to use against your own data
READY TO RUN ML APPS – SAAS POWERED BY ML
#DevoxxMA
PRODUCTS WITH ML INSIDE
Do It Yourself
Machine Learning
THE DATA SCIENCE WORKFLOW
• Set Business Goal – research scope, objectives
• Gather data
• Prepare data
• Cleanse, transform (wrangle), combine (merge, enrich)
• Explore data
• Model Data
• Select model, train model, test model
• Present findings and recommend next steps
• Apply:
• Make use of insights in business decisions
• Automate Data Gathering & Preparation, Deploy Model, Embed Model in
operational systems
DATA DISCOVERY | EXPLORATION
A        B    C     D       E  F   G
1104534  ZTR  0.1   anijs   2  36  T
631148   ESE  132   rivier  0  21  S
-3       WGN  71    appel   0  1   -
1262300  ZTR  56    zes     2  41  T
315529   HVN  1290  hamer   0  11  -
788914   ASM  676   zwaluw  0  26  T
157762   HVN  9482  wie     0  6   -
946681   DHG  42    rond    1  31  T
-31539   WGN  2423  bruin   0  0   -
47338    HVN  54    hamer   0  16  P
SCATTER PLOT: ATTRIBUTE F (Y-AXIS) VS ATTRIBUTE A (X-AXIS)
[Figure: scatter plot; x-axis from -200000 to 1400000, y-axis from 0 to 45]
SCATTER PLOT: ATTRIBUTE F (Y-AXIS) VS ATTRIBUTE A (X-AXIS)
[Figure: scatter plot titled "Age of Lucas Jellema vs Year"; x-axis from 1965 to 2015, y-axis from 0 to 45]
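The step from the raw values of attribute A to the year axis of the second plot can be sketched as follows. The (A, F) pairs are copied from the sample table; the interpretation of A as thousands of seconds since 1 January 1970 is an assumption that the 1965 to 2015 axis suggests, not something the slides state.

```python
from collections import Counter

SECONDS_PER_YEAR = 365.25 * 24 * 3600

# (attribute A, attribute F) pairs copied from the sample table.
rows = [
    (1104534, 36), (631148, 21), (-3, 1), (1262300, 41), (315529, 11),
    (788914, 26), (157762, 6), (946681, 31), (-31539, 0), (47338, 16),
]

def to_year(kiloseconds):
    """Assumption: A counts thousands of seconds since 1 January 1970."""
    return 1970 + kiloseconds * 1000 / SECONDS_PER_YEAR

# If F is an age measured in the year derived from A, then (year - F)
# should be the same constant for every row.
implied_start = [round(to_year(a) - f) for a, f in rows]
most_common_year, support = Counter(implied_start).most_common(1)[0]
outliers = [row for row, year in zip(rows, implied_start)
            if year != most_common_year]

print(f"{support} of {len(rows)} rows agree on {most_common_year}")
print("rows that do not fit the pattern:", outliers)
```

Nine of the ten rows line up on a single straight line; the remaining row is the kind of odd value that data exploration is meant to surface before any modeling starts.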
DATA DISCOVERY – ATTRIBUTES IDENTIFIED
Time of Birth  City  ?     ?       #Kids  Age  Level of Education
1104534        ZTR   0.1   anijs   2      36   T
631148         ESE   132   rivier  0      21   S
-3             WGN   71    appel   0      1    -
1262300        ZTR   56    zes     2      41   T
315529         HVN   1290  hamer   0      11   -
788914         ASM   676   zwaluw  0      26   T
157762         HVN   9482  wie     0      6    -
946681         DHG   42    rond    1      31   T
-31539         WGN   2423  bruin   0      0    -
47338          HVN   54    hamer   0      16   P
TYPES OF MACHINE LEARNING
• Supervised
• Train and test model from known data (both features and target)
• Unsupervised
• Analyze unlabeled data – see if you can find anything
• Semi-Supervised
• Interactive flow, for example human identifying clusters
• Reinforcement
• Continuously improve algorithm (model) as time progresses, based on new
experience
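To make the unsupervised case concrete, here is a minimal sketch of k-means clustering on one-dimensional data. The values and the starting centers are invented for illustration, and a real project would normally use a library implementation rather than this hand-rolled loop.

```python
def k_means_1d(values, centers, iterations=10):
    """Repeatedly assign each value to its nearest center, then move each
    center to the mean of the values assigned to it."""
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # An empty cluster keeps its previous center.
        centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return centers, clusters

# Unlabeled data: nobody told the algorithm that there are three groups here.
values = [1, 2, 3, 20, 21, 22, 40, 41, 43]
centers, clusters = k_means_1d(values, centers=[0, 25, 50])

for center, members in zip(centers, clusters):
    print(f"center {center:.1f}: {members}")
```

No target column is involved: the grouping emerges from the distances between the values alone, which is what separates this from the supervised case.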
MACHINE LEARNING ALGORITHMS
• Clustering
• Hierarchical k-means, Orthogonal Partitioning Clustering, Expectation-Maximization
• Feature Extraction/Attribute Importance/Principal Component Analysis
• Classification
• Decision Tree, Naïve Bayes, Random Forest, Logistic Regression, Support Vector Machine
• Regression
• Multiple Regression, Support Vector Machine, Linear Model, LASSO,
Random Forest, Ridge Regression, Generalized Linear Model,
Stepwise Linear Regression
• Association & Collaborative Filtering
(market basket analysis, apriori)
• Reinforcement Learning – brute force, value function,
Monte Carlo, temporal difference, ..
• Neural network and Deep Learning with
Deep Neural Network
• Can be used for many different use cases
MODELING PHASE
• Select a model to try to create a fit with (predict target well)
• Set configuration parameters for model
• Divide data in training set and test set
• Train model with training set
• Evaluate performance of trained model on the test set
• Confusion matrix, mean square error, support, lift, false positives, false negatives
• Optionally: tweak model parameters, add attributes, feed in more training data,
choose different model
• Eventually (hopefully): pick model plus parameters plus attributes
that will reliably predict the target variable given new data
• Possibly combine multiple models to collaborate on target value
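The evaluation step can be illustrated with a small confusion matrix. The actual and predicted labels below are invented; in practice they would come from running the trained model on the test set.

```python
from collections import Counter

# Invented test-set outcome for a two-class (spam / ham) classifier.
actual    = ["spam", "ham", "spam", "ham", "ham",  "spam", "ham", "ham"]
predicted = ["spam", "ham", "ham",  "ham", "spam", "spam", "ham", "ham"]

# Confusion matrix as a count of (actual, predicted) pairs.
confusion = Counter(zip(actual, predicted))

true_positives  = confusion[("spam", "spam")]
false_negatives = confusion[("spam", "ham")]   # spam that slipped through
false_positives = confusion[("ham", "spam")]   # ham wrongly flagged
true_negatives  = confusion[("ham", "ham")]

accuracy  = (true_positives + true_negatives) / len(actual)
precision = true_positives / (true_positives + false_positives)
recall    = true_positives / (true_positives + false_negatives)

print(f"TP={true_positives} FN={false_negatives} "
      f"FP={false_positives} TN={true_negatives}")
print(f"accuracy={accuracy:.2f} precision={precision:.2f} recall={recall:.2f}")
```

Which of these numbers matters most depends on the business goal: a fraud detector may accept more false positives to avoid false negatives, while a spam filter usually prefers the opposite.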
OPTICAL DIGIT RECOGNITION == CLASSIFICATION
[Figure: confusion matrices (actual digit vs. predicted digit, 0 to 9) for three classifiers: Naïve Bayes, Decision Tree and Deep Neural Network]
CLASSIFICATION GONE WRONG
• Machine learning applied to millions of drawings
on QuickDraw
• to classify drawings
• For example: drawings of beds
• See for example:
• https://aiexperiments.withgoogle.com/quick-draw
MACHINE LEARNING => OPERATIONAL SYSTEMS
• “We have a model that will choose best chess move based on
certain input”
MACHINE LEARNING => OPERATIONAL SYSTEMS
• Discovery => Model => Deploy
• “We have a model that will predict a class (classification) or value
(regression) based on certain input with a meaningful degree of
accuracy” – how can we make use of that model?
DEPLOY MODEL AND EXPOSE
• Model is usually created on Big Data in Data Science environment using the
Data Scientist’s tools
• Model itself is typically fairly small
• Model will be applied in operational systems against single data items (not
huge collections nor the entire Big Data set)
• Running the model online may not require extensive resources
• Implementing the model at production run time
• Export model (from Data Scientist environment) and import (into production
environment)
• Reimplement the model in the development technology and deploy (in the regular
way) to the production environment
• Expose model through API
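The export and import route can be sketched with nothing more than JSON. The model here is a hypothetical two-parameter linear model and the function names are made up for this illustration; real tools have their own export formats.

```python
import json

def export_model(slope, intercept):
    """Data science side: serialize the trained parameters (hypothetical format)."""
    return json.dumps({"type": "linear", "slope": slope, "intercept": intercept})

def load_model(payload):
    """Production side: rebuild a scoring function from the exported parameters."""
    params = json.loads(payload)
    if params["type"] != "linear":
        raise ValueError(f"unsupported model type: {params['type']}")

    def score(x):
        return params["slope"] * x + params["intercept"]

    return score

exported = export_model(slope=2.0, intercept=1.0)
score = load_model(exported)

print(f"exported model is {len(exported)} characters")
print("score for a single data item:", score(10))
```

However large the training set was, the exported artifact holds only the learned parameters, which is why scoring a single item at run time can be cheap. A REST endpoint would simply wrap the `score` function.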
80M PICTURES OF ROAD
BIG DATA => SMALL ML MODELS
DEPLOY MODEL AND EXPOSE
REST
API
MODEL MANAGEMENT
• Governance (new versions, testing and approval)
• A/B testing
• Auditing (what did the model decide and why? notifying humans? )
• Evaluation (how well did the model’s output match the reality)
to help evolve the model
• for example recommendations followed
• Monitor self learning models (to detect rogue models)
WHAT TO DO IT WITH?
• Mathematics (Statistics)
• Gauss (normal distribution)
• Bayes’ Theorem
• Euclidean Distance
• Perceptron
• Mean Square Error
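Each of the building blocks listed above fits in a few lines. The functions below are a minimal sketch using only the Python standard library; the function names and the example numbers are chosen for illustration.

```python
import math

def normal_pdf(x, mean, std):
    """Gauss: density of the normal distribution at x."""
    return math.exp(-((x - mean) ** 2) / (2 * std ** 2)) / (std * math.sqrt(2 * math.pi))

def bayes_posterior(prior, likelihood, evidence):
    """Bayes' theorem: P(A|B) = P(B|A) * P(A) / P(B)."""
    return likelihood * prior / evidence

def euclidean_distance(p, q):
    """Straight-line distance between two points of equal dimension."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def perceptron_output(weights, bias, inputs):
    """Perceptron: fire (1) when the weighted sum plus bias is positive."""
    return 1 if sum(w * x for w, x in zip(weights, inputs)) + bias > 0 else 0

def mean_squared_error(actual, predicted):
    """Average of the squared differences between actual and predicted values."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)

# Example: a condition with 1% prevalence, a test with 90% sensitivity
# and a 5% false-positive rate (numbers invented for illustration).
evidence = 0.9 * 0.01 + 0.05 * 0.99
print("P(condition | positive test):", round(bayes_posterior(0.01, 0.9, evidence), 3))
print("distance:", euclidean_distance((0, 0), (3, 4)))
print("AND gate:", [perceptron_output((1, 1), -1.5, pair)
                    for pair in [(0, 0), (0, 1), (1, 0), (1, 1)]])
```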
WHAT TO DO IT WITH?
TOOLS AND LIBRARIES IMPLEMENTING
MACHINE LEARNING ALGORITHMS
+
AND OF COURSE
DATA
HOW TO PICK TOOLS FOR THE JOB
• What are the jobs?
• Gather data
• Prepare data
• Explore and (hopefully) Discover
• Present
• Embed & Deploy Model
• What are considerations?
• Volume
• Speed and Time
• Skills
• Platform
• Cost
POPULAR TECHNOLOGIES
POPULAR FRAMEWORKS & LIBRARIES
• TensorFlow
• MXNet
• Caffe
• DL4J
• Keras
• … many more…
Oracle Database Option
Advanced Analytics
NOTEBOOK –
THE LAB JOURNAL FROM THE DATALAB
• Common format for data exploration and presentation
• User friendly interface on top of powerful technologies
• Most popular implementations
• Jupyter (fka IPython)
• Apache Zeppelin
• Spark Notebook
• Beaker
• SageMath (SageMathCloud => CoCalc)
• Oracle Machine Learning Notebook UI
• Try out Jupyter at: https://mybinder.org/
EXAMPLE NOTEBOOK EXPLORATION
OPEN DATA
• Governments and NGOs, scientific and even commercial
organizations are publishing data
• Inviting anyone who wants to join in to help make
sense of the data – understand driving factors,
identify categories, help predict
• Many areas
• Economy, health, public safety, sports, traffic &
transportation, games, environment, maps, …
OPEN DATA – SOME EXAMPLES
• Kaggle - Data Sets and [Samples of] Data Discovery: www.kaggle.com
• India Government - data.gov.in
• US, EU and UK Government Data: data.gov, open-data.europa.eu and data.gov.uk
• ImageNet image data set: www.image-net.org
• Open Data From World Bank: data.worldbank.org
• Historic Football Data: api.football-data.org
• New York City Open Data - opendata.cityofnewyork.us
• Airports, Airlines, Flight Routes: openflights.org
• Open Database – machine counterpart to Wikipedia: www.wikidata.org
• Google Audio Set (manually annotated audio events): research.google.com/audioset/
• Movielens - Movies, viewers and ratings: files.grouplens.org/datasets/movielens/
WHAT IS HADOOP?
• Big Data means Big Computing and Big Storage
• Big requires scalable => horizontal scale out
• Moving data is very expensive (network, disk IO)
• Rather than move data to processor – move processing to data: distributed
processing
• Horizontal scale out => Hadoop:
distributed data & distributed processing
• HDFS – Hadoop Distributed File System
• Map Reduce – parallel, distributed processing
• Map-Reduce operates on data locally, then
persists and aggregates results
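The map and reduce idea can be imitated on a single machine. In the sketch below each inner list stands in for a data partition stored on a different node; this is plain Python for illustration, not the Hadoop API, and the sample sentences are made up.

```python
from collections import Counter

# Each inner list plays the role of a data block living on a different node.
partitions = [
    ["big data needs big storage", "big compute"],
    ["move processing to data"],
]

def map_partition(lines):
    """Map step: runs next to the data and produces local word counts."""
    counts = Counter()
    for line in lines:
        counts.update(line.split())
    return counts

def reduce_counts(partials):
    """Reduce step: merges the small partial results into one total."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total

partials = [map_partition(p) for p in partitions]  # would run in parallel
word_counts = reduce_counts(partials)

print(word_counts.most_common(3))
```

Only the compact partial counts travel between the steps, never the raw lines, which mirrors the point that moving processing to the data is cheaper than moving the data.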
WHAT IS SPARK?
• Developing and orchestrating Map-Reduce on Hadoop is not simple
• Running jobs can be slow due to frequent disk writing
• Spark is for managing and orchestrating distributed processing on a
variety of cluster systems
• with Hadoop as the most obvious target
• through APIs in Java, Python, R, Scala
• Spark uses lazy operations and distributed in-memory data
structures – offering much better performance
• Through Spark – cluster based processing can be used interactively
• Spark has additional modules that leverage distributed
processing for running prepackaged jobs (SQL, Graph, ML, …)
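Lazy operations are easiest to appreciate in a small experiment. The sketch below uses plain Python generators as an analogy; it is not Spark code, and the names are invented for this illustration. Defining the pipeline does no work, and only the items actually requested are ever read.

```python
import itertools

reads_done = []  # records which source items were actually read

def read_numbers(limit):
    """Stand-in for an expensive data source."""
    for i in range(limit):
        reads_done.append(i)
        yield i

# Defining the pipeline (filter, then transform) executes nothing yet.
pipeline = (x * x for x in read_numbers(1_000_000) if x % 2 == 0)
reads_before_action = len(reads_done)

# Asking for results is the "action" that triggers the computation,
# and only as much of it as the request needs.
first_three = list(itertools.islice(pipeline, 3))

print("reads before action:", reads_before_action)
print("first three results:", first_three)
print("source items read:", len(reads_done))
```

Spark applies the same principle at cluster scale: transformations are recorded first and executed only when an action asks for a result, which lets it plan the work and avoid unnecessary intermediate writes.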
APACHE SPARK OVERVIEW
EXAMPLE RUNNING AGAINST SPARK
• https://github.com/jadianes/spark-movie-lens/blob/master/notebooks/building-recommender.ipynb
WHAT IS ORACLE DOING AROUND
MACHINE LEARNING?
• Oracle Advanced Analytics in Oracle Database
• Data Mining, Enterprise R
• Text (ESA), Spatial, Graph
• SQL
DEMO: CLASSIFICATION
DEMO: CONFERENCE ABSTRACT
CLASSIFICATION CHALLENGE
• Take all conference abstracts for
• Train a Classification Model on
picking the Conference Track
• Based on Title, Summary [, Speaker, Level,…]
• Use the Model to pick the Track
for sessions at
DEMONSTRATION OF ORACLE ADVANCED
ANALYTICS
• Using Text Mining and Naive Bayes Data Mining Classification
• Train model for classifying conference abstracts into tracks
• Use model to propose a track for new abstracts
• Steps
• Gather data
• Import, cleanse, enrich, …
• Prepare training set and test set
• Select and configure model
• Combining Text and Mining
using Naive Bayes
• Train model
• Test and apply model
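What the demo does inside the database can be imitated in miniature. The sketch below is a hand-written multinomial Naive Bayes text classifier in plain Python, not Oracle Advanced Analytics; the session titles and track names are invented, and word splitting on whitespace replaces real text mining.

```python
import math
from collections import Counter, defaultdict

# Invented training set: (abstract text, conference track).
training = [
    ("Tuning SQL queries in the database", "Database"),
    ("Partitioning and indexing large database tables", "Database"),
    ("Building reactive web apps with JavaScript", "Web"),
    ("Progressive web apps and JavaScript frameworks", "Web"),
]

def tokenize(text):
    return text.lower().split()

def train(samples):
    """Count documents per track and word occurrences per track."""
    track_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for text, track in samples:
        track_counts[track] += 1
        for word in tokenize(text):
            word_counts[track][word] += 1
            vocabulary.add(word)
    return track_counts, word_counts, vocabulary

def classify(model, text):
    """Pick the track with the highest log-probability (add-one smoothing)."""
    track_counts, word_counts, vocabulary = model
    total_docs = sum(track_counts.values())
    best_track, best_score = None, -math.inf
    for track, n_docs in track_counts.items():
        score = math.log(n_docs / total_docs)
        total_words = sum(word_counts[track].values())
        for word in tokenize(text):
            if word not in vocabulary:
                continue  # words never seen in training carry no information
            score += math.log((word_counts[track][word] + 1)
                              / (total_words + len(vocabulary)))
        if score > best_score:
            best_track, best_score = track, score
    return best_track

model = train(training)
print(classify(model, "Indexing strategies for SQL"))
print(classify(model, "JavaScript apps for the web"))
```

With only four training abstracts this is a toy, but the shape is the same as in the demo: text is turned into tokens, the model learns which tokens go with which track, and a new abstract gets the most probable track proposed.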
PREPARE DATABASE (IN THE CLOUD)
COMPLETING THE DATABASE INSTANCE
ONCE THE INSTANCE IS RUNNING...
USE AS ANY ORACLE DATABASE INSTANCE –
LOCAL, ON PREMISES, ... – ACCESSIBLE VIA SQL*NET
TRAIN MODEL
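-- Assumes table J1_SESSIONS (session_title, abstract, session_track) exists
-- and that the settings table SESSION_CLASS_NB_SETTINGS was created beforehand.
DECLARE
  xformlist dbms_data_mining_transform.TRANSFORM_LIST;
BEGIN
  -- Mark the ABSTRACT column as unstructured text that must be tokenized
  DBMS_DATA_MINING_TRANSFORM.SET_TRANSFORM(
    xformlist, 'abstract', NULL, 'abstract', NULL,
    'TEXT(TOKEN_TYPE:NORMAL)');
  -- Train a classification model that predicts SESSION_TRACK per session
  DBMS_DATA_MINING.CREATE_MODEL
  ( model_name          => 'SESSION_CLASS_NB'
  , mining_function     => dbms_data_mining.classification
  , data_table_name     => 'J1_SESSIONS'
  , case_id_column_name => 'session_title'
  , target_column_name  => 'session_track'
  , settings_table_name => 'session_class_nb_settings'
  , xform_list          => xformlist);
END;
/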
APPLY MODEL
BIG DATA SQL
ORACLE DATABASE AS SINGLE POINT OF ENTRY
MANY CLOUD SERVICES AROUND BIG DATA &
[PREDICTIVE] ANALYTICS & MACHINE LEARNING
WHAT IS ORACLE DOING AROUND
MACHINE LEARNING?
• Big Data Discovery (fka Endeca), Big Data Preparation and Big Data Compute
• Big Data Appliance
• Data Visualization Cloud
• Analytics Cloud
• Industry specific Analytics Clouds (Sales, Marketing, HCM) on top of SaaS
• RTD – Real Time Decisions
• DaaS
• Oracle Labs (labs.oracle.com)
• Machine Learning Research Group (link)
• Machine Learning CS – “Oracle Notebook”
ORACLE AI PLATFORM CLOUD SERVICE
(COMING SHORTLY)
HUMANS LEARNING MACHINE
LEARNING: YOUR FIRST STEPS
HUMANS LEARNING MACHINE LEARNING:
YOUR FIRST STEPS
• Jupyter Notebooks and Python – https://mybinder.org/
• Hortonworks Sandbox VM – Hadoop & Spark & Hive, Ambari
• Databricks Cloud Environment with Apache Spark (free trial)
• Katacoda – tutorials & live environment for TensorFlow
• Oracle Big Data Lite – Prebuilt Virtual Machine
• Data Visualization Desktop – ready to run desktop tool
• Tutorials, Courses (Udacity, Coursera, edX)
• Books
• Introducing Data Science
• Learning Apache Spark 2
• Python Machine Learning
HANDS ON MACHINE LEARNING (BABY STEPS)
• All materials are in: https://github.com/AMIS-Services
Non Technical Technical
Decision Trees
SUMMARY
• IoT, Big Data, Machine Learning => AI
• Recent and Rapid Democratization of Machine Learning
• Algorithms, Storage and Compute Resources, High Level Machine Learning
Frameworks, Education resources, Open Data, Trained ML Models, Out of the
Box SaaS capabilities – powered by ML
• Produce business value today
• Machine Learning by computers helps us(ers) understand historic
data and apply that insight to new data
• Developers have to learn how to incorporate Machine Learning
into their applications – for smarter UIs, more automation, faster
(p)reactions
SUMMARY
• R and Python are most popular technologies for data exploration
and ML model discovery [on small subsets of Big Data]
• Apache Spark (on Hadoop) is frequently used to powercrunch data
(wrangling) and run ML models on Big Data sets
• Notebooks are a popular vehicle in the Data Science lab
• To explore and report
• Oracle is quite active on Machine Learning
• Power PaaS and SaaS with ML
• Provide us with the Machine Learning Data Lab & Run Time (on the cloud)
• Getting started on Machine Learning is fun, smart & well supported
Thank you!
• Blog: technology.amis.nl
• Email: lucas.jellema@amis.nl
• Twitter: @lucasjellema
• : lucas-jellema
• AMIS: www.amis.nl, info@amis.nl
HANDS ON
• All materials are in: https://github.com/AMIS-Services
Non Technical
REFERENCES
• AI Adventures (Google) https://www.youtube.com/watch?v=RJudqel8DVA
• Twitch TV
https://www.twitch.tv/videos/179940629
and sources on GitHub:
https://github.com/sunilmallya/dl-twitch-series
• TensorFlow & Deep Learning without a PhD (Devoxx)
https://www.youtube.com/watch?v=vq2nnJ4g6N0
• Katacoda Browser Based Runtime for TensorFlow
https://www.katacoda.com/courses/tensorflow
• And many more

Weitere ähnliche Inhalte

Was ist angesagt?

Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEA
Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEAIntroduction to Machine Learning - From DBA's to Data Scientists - OGBEMEA
Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEASandesh Rao
 
Introduction to AutoML and Data Science using the Oracle Autonomous Database ...
Introduction to AutoML and Data Science using the Oracle Autonomous Database ...Introduction to AutoML and Data Science using the Oracle Autonomous Database ...
Introduction to AutoML and Data Science using the Oracle Autonomous Database ...Sandesh Rao
 
Machine Learning in Autonomous Data Warehouse
 Machine Learning in Autonomous Data Warehouse Machine Learning in Autonomous Data Warehouse
Machine Learning in Autonomous Data WarehouseSandesh Rao
 
#dbhouseparty - Graph Technologies - More than just Social (Distancing) Networks
#dbhouseparty - Graph Technologies - More than just Social (Distancing) Networks#dbhouseparty - Graph Technologies - More than just Social (Distancing) Networks
#dbhouseparty - Graph Technologies - More than just Social (Distancing) NetworksTammy Bednar
 
20 tips and tricks with the Autonomous Database
20 tips and tricks with the Autonomous Database20 tips and tricks with the Autonomous Database
20 tips and tricks with the Autonomous DatabaseSandesh Rao
 
How to use Exachk effectively to manage Exadata environments OGBEmea
How to use Exachk effectively to manage Exadata environments OGBEmeaHow to use Exachk effectively to manage Exadata environments OGBEmea
How to use Exachk effectively to manage Exadata environments OGBEmeaSandesh Rao
 
Database Cloud Services Office Hours : Oracle sharding hyperscale globally d...
Database Cloud Services Office Hours : Oracle sharding  hyperscale globally d...Database Cloud Services Office Hours : Oracle sharding  hyperscale globally d...
Database Cloud Services Office Hours : Oracle sharding hyperscale globally d...Tammy Bednar
 
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - 19c RAC
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - 19c RACAUSOUG - NZOUG-GroundBreakers-Jun 2019 - 19c RAC
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - 19c RACSandesh Rao
 
Top 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous DatabaseTop 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous DatabaseSandesh Rao
 
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021Sandesh Rao
 
AIOUG -GroundBreakers-Jul 2019 - Introduction to Machine Learning - From DBA'...
AIOUG -GroundBreakers-Jul 2019 - Introduction to Machine Learning - From DBA'...AIOUG -GroundBreakers-Jul 2019 - Introduction to Machine Learning - From DBA'...
AIOUG -GroundBreakers-Jul 2019 - Introduction to Machine Learning - From DBA'...Sandesh Rao
 
#dbhouseparty - Should I be building Microservices?
#dbhouseparty - Should I be building Microservices?#dbhouseparty - Should I be building Microservices?
#dbhouseparty - Should I be building Microservices?Tammy Bednar
 
Introduction to Machine Learning and Data Science using Autonomous Database ...
Introduction to Machine Learning and Data Science using Autonomous Database  ...Introduction to Machine Learning and Data Science using Autonomous Database  ...
Introduction to Machine Learning and Data Science using Autonomous Database ...Sandesh Rao
 
Introduction to Machine Learning for Oracle Database Professionals
Introduction to Machine Learning for Oracle Database ProfessionalsIntroduction to Machine Learning for Oracle Database Professionals
Introduction to Machine Learning for Oracle Database ProfessionalsAlex Gorbachev
 
ORAchk EXAchk what's new in 12.1.0.2.7
ORAchk EXAchk what's new in 12.1.0.2.7ORAchk EXAchk what's new in 12.1.0.2.7
ORAchk EXAchk what's new in 12.1.0.2.7Sandesh Rao
 
The Top 5 Reasons to Deploy Your Applications on Oracle RAC
The Top 5 Reasons to Deploy Your Applications on Oracle RACThe Top 5 Reasons to Deploy Your Applications on Oracle RAC
The Top 5 Reasons to Deploy Your Applications on Oracle RACMarkus Michalewicz
 
Ground Breakers Romania: Oracle Autonomous Database
Ground Breakers Romania: Oracle Autonomous DatabaseGround Breakers Romania: Oracle Autonomous Database
Ground Breakers Romania: Oracle Autonomous DatabaseMaria Colgan
 
#dbhouseparty - Spatial Technologies - @Home and Everywhere Else on the Map
#dbhouseparty - Spatial Technologies - @Home and Everywhere Else on the Map#dbhouseparty - Spatial Technologies - @Home and Everywhere Else on the Map
#dbhouseparty - Spatial Technologies - @Home and Everywhere Else on the MapTammy Bednar
 
Oracle RAC 19c - the Basis for the Autonomous Database
Oracle RAC 19c - the Basis for the Autonomous DatabaseOracle RAC 19c - the Basis for the Autonomous Database
Oracle RAC 19c - the Basis for the Autonomous DatabaseMarkus Michalewicz
 
Database@Home : Data Driven Apps : Core-dev or Low Code UI
Database@Home : Data Driven Apps : Core-dev or Low Code UIDatabase@Home : Data Driven Apps : Core-dev or Low Code UI
Database@Home : Data Driven Apps : Core-dev or Low Code UITammy Bednar
 

Was ist angesagt? (20)

Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEA
Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEAIntroduction to Machine Learning - From DBA's to Data Scientists - OGBEMEA
Introduction to Machine Learning - From DBA's to Data Scientists - OGBEMEA
 
Introduction to AutoML and Data Science using the Oracle Autonomous Database ...
Introduction to AutoML and Data Science using the Oracle Autonomous Database ...Introduction to AutoML and Data Science using the Oracle Autonomous Database ...
Introduction to AutoML and Data Science using the Oracle Autonomous Database ...
 
Machine Learning in Autonomous Data Warehouse
 Machine Learning in Autonomous Data Warehouse Machine Learning in Autonomous Data Warehouse
Machine Learning in Autonomous Data Warehouse
 
#dbhouseparty - Graph Technologies - More than just Social (Distancing) Networks
#dbhouseparty - Graph Technologies - More than just Social (Distancing) Networks#dbhouseparty - Graph Technologies - More than just Social (Distancing) Networks
#dbhouseparty - Graph Technologies - More than just Social (Distancing) Networks
 
20 tips and tricks with the Autonomous Database
20 tips and tricks with the Autonomous Database20 tips and tricks with the Autonomous Database
20 tips and tricks with the Autonomous Database
 
How to use Exachk effectively to manage Exadata environments OGBEmea
How to use Exachk effectively to manage Exadata environments OGBEmeaHow to use Exachk effectively to manage Exadata environments OGBEmea
How to use Exachk effectively to manage Exadata environments OGBEmea
 
Database Cloud Services Office Hours : Oracle sharding hyperscale globally d...
Database Cloud Services Office Hours : Oracle sharding  hyperscale globally d...Database Cloud Services Office Hours : Oracle sharding  hyperscale globally d...
Database Cloud Services Office Hours : Oracle sharding hyperscale globally d...
 
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - 19c RAC
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - 19c RACAUSOUG - NZOUG-GroundBreakers-Jun 2019 - 19c RAC
AUSOUG - NZOUG-GroundBreakers-Jun 2019 - 19c RAC
 
Top 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous DatabaseTop 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous Database
 
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
 
AIOUG -GroundBreakers-Jul 2019 - Introduction to Machine Learning - From DBA'...
AIOUG -GroundBreakers-Jul 2019 - Introduction to Machine Learning - From DBA'...AIOUG -GroundBreakers-Jul 2019 - Introduction to Machine Learning - From DBA'...
AIOUG -GroundBreakers-Jul 2019 - Introduction to Machine Learning - From DBA'...
 
#dbhouseparty - Should I be building Microservices?
#dbhouseparty - Should I be building Microservices?#dbhouseparty - Should I be building Microservices?
#dbhouseparty - Should I be building Microservices?
 
Introduction to Machine Learning and Data Science using Autonomous Database ...
Introduction to Machine Learning and Data Science using Autonomous Database  ...Introduction to Machine Learning and Data Science using Autonomous Database  ...
Introduction to Machine Learning and Data Science using Autonomous Database ...
 
Introduction to Machine Learning for Oracle Database Professionals
Introduction to Machine Learning for Oracle Database ProfessionalsIntroduction to Machine Learning for Oracle Database Professionals
Introduction to Machine Learning for Oracle Database Professionals
 
ORAchk EXAchk what's new in 12.1.0.2.7
ORAchk EXAchk what's new in 12.1.0.2.7ORAchk EXAchk what's new in 12.1.0.2.7
ORAchk EXAchk what's new in 12.1.0.2.7
 
The Top 5 Reasons to Deploy Your Applications on Oracle RAC
The Top 5 Reasons to Deploy Your Applications on Oracle RACThe Top 5 Reasons to Deploy Your Applications on Oracle RAC
The Top 5 Reasons to Deploy Your Applications on Oracle RAC
 
Ground Breakers Romania: Oracle Autonomous Database
Ground Breakers Romania: Oracle Autonomous DatabaseGround Breakers Romania: Oracle Autonomous Database
Ground Breakers Romania: Oracle Autonomous Database
 
#dbhouseparty - Spatial Technologies - @Home and Everywhere Else on the Map
#dbhouseparty - Spatial Technologies - @Home and Everywhere Else on the Map#dbhouseparty - Spatial Technologies - @Home and Everywhere Else on the Map
#dbhouseparty - Spatial Technologies - @Home and Everywhere Else on the Map
 
Oracle RAC 19c - the Basis for the Autonomous Database
Oracle RAC 19c - the Basis for the Autonomous DatabaseOracle RAC 19c - the Basis for the Autonomous Database
Oracle RAC 19c - the Basis for the Autonomous Database
 
Database@Home : Data Driven Apps : Core-dev or Low Code UI
Database@Home : Data Driven Apps : Core-dev or Low Code UIDatabase@Home : Data Driven Apps : Core-dev or Low Code UI
Database@Home : Data Driven Apps : Core-dev or Low Code UI
 

Ähnlich wie The Art of Intelligence – Introduction Machine Learning for Oracle professionals (ODevCYatra 2018, Hyderabad, Pune, Mumbai)

The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...Lucas Jellema
 
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...Lucas Jellema
 
Introduction to Machine Learning - An overview and first step for candidate d...
Introduction to Machine Learning - An overview and first step for candidate d...Introduction to Machine Learning - An overview and first step for candidate d...
Introduction to Machine Learning - An overview and first step for candidate d...Lucas Jellema
 
The Art of Intelligence – Introduction Machine Learning for Java professional...
The Art of Intelligence – Introduction Machine Learning for Java professional...The Art of Intelligence – Introduction Machine Learning for Java professional...
The Art of Intelligence – Introduction Machine Learning for Java professional...Lucas Jellema
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Ali Alkan
 
Neo4j GraphTalk Basel - Building intelligent Software with Graphs
Neo4j GraphTalk Basel - Building intelligent Software with GraphsNeo4j GraphTalk Basel - Building intelligent Software with Graphs
Neo4j GraphTalk Basel - Building intelligent Software with GraphsNeo4j
 
Azure machine learning tech mela
Azure machine learning tech melaAzure machine learning tech mela
Azure machine learning tech melaYogendra Tamang
 
Productionising Machine Learning Models
Productionising Machine Learning ModelsProductionising Machine Learning Models
Productionising Machine Learning ModelsTash Bickley
 
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with GraphsNeo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with GraphsNeo4j
 
Azure Machine Learning
Azure Machine LearningAzure Machine Learning
Azure Machine LearningMostafa
 
Introducing new AIOps innovations in Oracle 19c - San Jose AICUG
Introducing new AIOps innovations in Oracle 19c - San Jose AICUGIntroducing new AIOps innovations in Oracle 19c - San Jose AICUG
Introducing new AIOps innovations in Oracle 19c - San Jose AICUGSandesh Rao
 
GraphTalk Wien - Intelligente Lösungen mit Graphen erstellen
GraphTalk Wien - Intelligente Lösungen mit Graphen erstellenGraphTalk Wien - Intelligente Lösungen mit Graphen erstellen
GraphTalk Wien - Intelligente Lösungen mit Graphen erstellenNeo4j
 
Microsoft DevOps for AI with GoDataDriven
Microsoft DevOps for AI with GoDataDrivenMicrosoft DevOps for AI with GoDataDriven
Microsoft DevOps for AI with GoDataDrivenGoDataDriven
 
Apache Spark Model Deployment
Apache Spark Model Deployment Apache Spark Model Deployment
Apache Spark Model Deployment Databricks
 
PAC 2019 virtual Alexander Podelko
PAC 2019 virtual Alexander Podelko PAC 2019 virtual Alexander Podelko
PAC 2019 virtual Alexander Podelko Neotys
 
Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...
Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...
Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...Sri Ambati
 

The Art of Intelligence – Introduction Machine Learning for Oracle professionals (ODevCYatra 2018, Hyderabad, Pune, Mumbai)

  • 1. The Art of Intelligence A Practical Introduction Machine Learning 50 Shades of Data 1 Lucas Jellema, CTO of AMIS ODevC Yatra, July 2018
  • 2. Lucas Jellema Architect / Developer 1994 started in IT at Oracle 2002 joined AMIS Currently CTO & Solution Architect
  • 3. Presenting • Oracle OpenWorld • JavaOne • Oracle Code • Devoxx • Java and Oracle User Group meetups • Java Rockstar (JavaOne 2015) • OTN Yatra 2013 • ODevC Yatra 2018
  • 4. Writing • Blogs at http://technology.amis.nl • 1500 articles – from UI to Middle Tier, Database and Infrastructure • Articles at Medium, DZone and Oracle Technology Network • Books for McGraw Hill (Oracle Press) • Oracle ACE Director & Developer Champion
  • 5. From The Netherlands
  • 12. AGENDA • What is Machine Learning? • Why could it be relevant [to you]? • What does it entail? • With which algorithms, tools and technologies? • Oracle and Machine Learning? • How do you embark on Machine Learning? • Hands-on • Functional/non-technical • Technical
  • 13. LEARNING • How do we learn? • Try something (else) => get feedback => learn • Eventually: • We get it (understanding) so we can predict the outcome of a certain action in a new situation • Or we have experienced enough situations to predict the outcome in most situations with high confidence • Through interpolation, extrapolation, etc. • We remain clueless
  • 14. MACHINE LEARNING • Analyze Historical Data (input and result – training set) to discover Patterns & Models • Iteratively apply Models to [additional] Input (test set) and compare model outcome with known actual result to improve the model • Use Model to predict outcome for entirely new data
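
The train, test and predict cycle on slide 14 can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the data points are invented, and a one-feature least-squares line stands in for whatever model would really be used.

```python
# Minimal sketch: fit y = a*x + b on a training set, check it on a test set,
# then predict for unseen input. All numbers are made-up illustration data.

def fit_line(xs, ys):
    """Ordinary least squares for one feature; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def predict(model, x):
    """Apply the fitted line to a single input value."""
    slope, intercept = model
    return slope * x + intercept

def mean_squared_error(model, xs, ys):
    """Average squared difference between model outcome and known result."""
    return sum((predict(model, x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Historical data: (input, known result) pairs.
history = [(1, 3.1), (2, 4.9), (3, 7.2), (4, 8.8),
           (5, 11.1), (6, 13.0), (7, 14.9), (8, 17.2)]

# Hold back part of the history as a test set.
train, test = history[:6], history[6:]
model = fit_line([x for x, _ in train], [y for _, y in train])

# Compare model outcome with the known actual results of the test set.
test_error = mean_squared_error(model, [x for x, _ in test], [y for _, y in test])

# Use the model on entirely new input.
new_prediction = predict(model, 10)
print(f"test MSE: {test_error:.3f}, prediction for x=10: {new_prediction:.2f}")
```

If the test error were large, the next step in the loop would be to adjust the model or feed in more training data before trusting any prediction.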
  • 15. WHY IS IT RELEVANT (NOW)? • Data • big, fast, open • Machine Learning has become feasible and accessible • Available • Affordable (software & hardware) • Doable (Citizen Data Scientist) • Fast enough • Business Cases & Opportunities => Demands • End users, Consumers, Competitive pressure, Society
  • 16. WHY IS IT RELEVANT (NOW)?
  • 18. EXAMPLE USE CASES • Speech recognition • Identify churn candidates • Intent & Sentiment analysis on social media • Upsell & Cross Sell • Target Marketing • Customer Service • Chat bots & voice response systems • Predictive Maintenance • Gaming • Captcha • Medical Diagnosis • Anomaly Detection (find the odd one out) • Autonomous Cars • Voter Segment Analysis • Customer Recommendations • Smart Data Capture • Face Detection • Fraud Prevention • (really good) OCR • Traffic light control • Navigation • Should we investigate | do lab test? • Spam filtering • Propose friends | contacts • Troll detection • Auto correct • Photo Tagging and Album organization
  • 19. READY-TO-RUN ML APPS Someone else selected, configured and trained an ML model and makes it available for you to use against your own data
  • 20. READY TO RUN ML APPS – SAAS POWERED BY ML #DevoxxMA
  • 21. PRODUCTS WITH ML INSIDE
  • 23. THE DATA SCIENCE WORKFLOW • Set Business Goal – research scope, objectives • Gather data • Prepare data • Cleanse, transform (wrangle), combine (merge, enrich) • Explore data • Model Data • Select model, train model, test model • Present findings and recommend next steps • Apply: • Make use of insights in business decisions • Automate Data Gathering & Preparation, Deploy Model, Embed Model in operational systems
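
The "prepare data" step on slide 23 (cleanse, transform, combine) might look roughly like the sketch below. The record layout, field names and values are all hypothetical; real preparation would normally be done with a data wrangling tool or library rather than hand-written loops.

```python
# Hypothetical raw records; field names and values are invented for illustration.
customers = [
    {"id": 1, "city": " ztr ", "age": "36"},
    {"id": 2, "city": "ESE", "age": ""},      # missing age
    {"id": 3, "city": "wgn", "age": "41"},
]
orders = [
    {"customer_id": 1, "amount": 120.0},
    {"customer_id": 1, "amount": 80.0},
    {"customer_id": 3, "amount": 45.5},
]

def cleanse(record):
    """Cleanse and transform: normalize text, turn an empty age into None."""
    return {
        "id": record["id"],
        "city": record["city"].strip().upper(),
        "age": int(record["age"]) if record["age"] else None,
    }

def enrich(customer, all_orders):
    """Combine: add each customer's total order amount."""
    total = sum(o["amount"] for o in all_orders if o["customer_id"] == customer["id"])
    return {**customer, "total_spent": total}

prepared = [enrich(cleanse(c), orders) for c in customers]

# Drop rows that still lack a value the model would need.
usable = [row for row in prepared if row["age"] is not None]
print(usable)
```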
  • 24. DATA DISCOVERY | EXPLORATION
      A         B     C      D        E   F    G
      1104534   ZTR   0.1    anijs    2   36   T
      631148    ESE   132    rivier   0   21   S
      -3        WGN   71     appel    0   1    -
      1262300   ZTR   56     zes      2   41   T
      315529    HVN   1290   hamer    0   11   -
      788914    ASM   676    zwaluw   0   26   T
      157762    HVN   9482   wie      0   6    -
      946681    DHG   42     rond     1   31   T
      -31539    WGN   2423   bruin    0   0    -
      47338     HVN   54     hamer    0   16   P
  • 25. SCATTER PLOT ATTRIBUTE F (Y-AXIS) VS ATTRIBUTE A [figure: scatter plot with attribute F on the y-axis and attribute A on the x-axis]
  • 26. SCATTER PLOT ATTRIBUTE F (Y-AXIS) VS ATTRIBUTE A [figure: the same scatter plot, titled "Age of Lucas Jellema vs Year", with the x-axis shown as years]
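
The relationship the scatter plots suggest can also be checked numerically. The sketch below computes the Pearson correlation between columns A and F, using the ten rows exactly as they appear on the data discovery slide; the helper function is written out by hand only to keep the example self-contained.

```python
from math import sqrt

# Columns A and F from the ten rows on the data discovery slide.
a = [1104534, 631148, -3, 1262300, 315529, 788914, 157762, 946681, -31539, 47338]
f = [36, 21, 1, 41, 11, 26, 6, 31, 0, 16]

def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

r = pearson(a, f)
print(f"Pearson correlation between A and F: {r:.2f}")
```

A coefficient close to 1 is a hint that the two attributes carry much the same information, which is the kind of observation that helps identify what an unlabeled column means.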
  • 27. DATA DISCOVERY – ATTRIBUTES IDENTIFIED
      Time of Birth   City   ?      ?        #Kids   Age   Level of Education
      1104534         ZTR    0.1    anijs    2       36    T
      631148          ESE    132    rivier   0       21    S
      -3              WGN    71     appel    0       1     -
      1262300         ZTR    56     zes      2       41    T
      315529          HVN    1290   hamer    0       11    -
      788914          ASM    676    zwaluw   0       26    T
      157762          HVN    9482   wie      0       6     -
      946681          DHG    42     rond     1       31    T
      -31539          WGN    2423   bruin    0       0     -
      47338           HVN    54     hamer    0       16    P
  • 28. TYPES OF MACHINE LEARNING • Supervised • Train and test model from known data (both features and target) • Unsupervised • Analyze unlabeled data – see if you can find anything • Semi-Supervised • Interactive flow, for example human identifying clusters • Reinforcement • Continuously improve algorithm (model) as time progresses, based on new experience
  • 29. MACHINE LEARNING ALGORITHMS • Clustering • Hierarchical k-means, Orthogonal Partitioning Clustering, Expectation-Maximization • Feature Extraction/Attribute Importance/Principal Component Analysis • Classification • Decision Tree, Naïve Bayes, Random Forest, Logistic Regression, Support Vector Machine • Regression • Multiple Regression, Support Vector Machine, Linear Model, LASSO, Random Forest, Ridge Regression, Generalized Linear Model, Stepwise Linear Regression • Association & Collaborative Filtering (market basket analysis, apriori) • Reinforcement Learning – brute force, value function, Monte Carlo, temporal difference, .. • Neural network and Deep Learning with Deep Neural Network • Can be used for many different use cases
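
To give a feel for the clustering family on slide 29, here is a minimal sketch of plain (non-hierarchical) k-means. The points and the starting centroids are invented, and the starting centroids are passed in explicitly so that the run is deterministic; production implementations pick them more carefully.

```python
from math import dist

def k_means(points, centroids, iterations=10):
    """Plain k-means: assign each point to its nearest centroid, then move
    each centroid to the mean of the points assigned to it."""
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # An empty cluster keeps its previous centroid.
        centroids = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster)) if cluster else c
            for cluster, c in zip(clusters, centroids)
        ]
    return centroids, clusters

# Made-up 2D points forming two visibly separate groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, clusters = k_means(points, centroids=[points[0], points[3]])
print(centroids)
print(clusters)
```

No labels are supplied anywhere: the grouping comes purely from Euclidean distance between the points, which is what makes this an unsupervised technique.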
  • 30. MODELING PHASE • Select a model to try to create a fit with (predict target well) • Set configuration parameters for model • Divide data into a training set and a test set • Train model with training set • Evaluate performance of trained model on the test set • Confusion matrix, mean square error, support, lift, false positives, false negatives • Optionally: tweak model parameters, add attributes, feed in more training data, choose different model • Eventually (hopefully): pick model plus parameters plus attributes that will reliably predict the target variable given new data • Possibly combine multiple models to collaborate on target value
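
The evaluation step on slide 30 can be illustrated with a small confusion matrix. The labels and predictions below are invented; the point is only to show how false positives, false negatives and the derived scores are counted for a two-class problem.

```python
from collections import Counter

def confusion_matrix(actual, predicted):
    """Count how often each (actual, predicted) label pair occurs."""
    return Counter(zip(actual, predicted))

# Made-up test set labels and the predictions of some trained classifier.
actual    = ["spam", "ham", "spam", "ham", "ham",  "spam", "ham", "ham"]
predicted = ["spam", "ham", "ham",  "ham", "spam", "spam", "ham", "ham"]

cm = confusion_matrix(actual, predicted)
tp = cm[("spam", "spam")]  # spam correctly flagged
fn = cm[("spam", "ham")]   # false negatives: spam that slipped through
fp = cm[("ham", "spam")]   # false positives: ham wrongly flagged
tn = cm[("ham", "ham")]    # ham correctly passed

accuracy = (tp + tn) / len(actual)
precision = tp / (tp + fp)
recall = tp / (tp + fn)
print(f"accuracy={accuracy:.2f} precision={precision:.2f} recall={recall:.2f}")
```

Which of these numbers matters most depends on the business goal: for spam filtering a false positive (a lost legitimate mail) is usually worse than a false negative.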
  • 31. OPTICAL DIGIT RECOGNITION == CLASSIFICATION [figure: grid of predicted vs actual digits 0 to 9; models shown: Naïve Bayes, Decision Tree, Deep Neural Network]
  • 32. CLASSIFICATION GONE WRONG • Machine learning applied to millions of drawings on QuickDraw • to classify drawings • For example: drawings of beds • See for example: • https://aiexperiments.withgoogle.com/quick-draw
  • 33. MACHINE LEARNING => OPERATIONAL SYSTEMS • “We have a model that will choose best chess move based on certain input”
  • 34. MACHINE LEARNING => OPERATIONAL SYSTEMS • Discovery => Model => Deploy • “We have a model that will predict a class (classification) or value (regression) based on certain input with a meaningful degree of accuracy” – how can we make use of that model?
  • 35. DEPLOY MODEL AND EXPOSE • Model is usually created on Big Data in Data Science environment using the Data Scientist’s tools • Model itself is typically fairly small • Model will be applied in operational systems against single data items (not huge collections nor the entire Big Data set) • Running the model online may not require extensive resources • Implementing the model at production run time • Export model (from Data Scientist environment) and import (into production environment) • Reimplement the model in the development technology and deploy (in the regular way) to the production environment • Expose model through API
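A minimal sketch of the "export model, import, expose through API" route, assuming the trained model is a small linear model whose parameters fit in a JSON document. The function names, the JSON layout and the `/predict` route mentioned in the comment are hypothetical and chosen only for illustration; real tools have their own export formats.

```python
import json

def export_model(weights, bias):
    """Data science side: serialize the (small) trained model parameters."""
    return json.dumps({"type": "linear", "weights": weights, "bias": bias})

def import_model(serialized):
    """Production side: rebuild a scoring function from the exported parameters."""
    params = json.loads(serialized)
    weights, bias = params["weights"], params["bias"]

    def score(features):
        # The model is applied to a single data item, not to the Big Data set.
        return bias + sum(w * x for w, x in zip(weights, features))

    return score

serialized = export_model(weights=[0.5, -1.0], bias=2.0)
score = import_model(serialized)

def handle_predict_request(body):
    """What an API endpoint (for example a hypothetical POST /predict) would do per call."""
    features = json.loads(body)["features"]
    return json.dumps({"prediction": score(features)})

response = handle_predict_request('{"features": [4.0, 1.0]}')
print(response)
```

The exported document is a few dozen bytes, which illustrates the point that the model itself is typically small even when it was trained on a very large data set.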
  • 37. BIG DATA => SMALL ML MODELS
  • 38. DEPLOY MODEL AND EXPOSE REST API
  • 39. MODEL MANAGEMENT • Governance (new versions, testing and approval) • A/B testing • Auditing (what did the model decide and why? notifying humans?) • Evaluation (how well did the model’s output match reality?) to help evolve the model • for example: were recommendations followed? • Monitor self-learning models (to detect rogue models)
  • 40. WHAT TO DO IT WITH? • Mathematics (Statistics) • Gauss (normal distribution) • Bayes’ Theorem • Euclidean Distance • Perceptron • Mean Square Error
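The mathematical building blocks named on this slide are compact enough to write out directly. The sketch below uses only the Python standard library; the function names and the numbers in the example are made up for illustration.

```python
import math

def normal_pdf(x, mean, std):
    """Gauss: density of the normal distribution at x."""
    return math.exp(-((x - mean) ** 2) / (2 * std ** 2)) / (std * math.sqrt(2 * math.pi))

def bayes_posterior(prior, likelihood, evidence):
    """Bayes' theorem: P(A|B) = P(B|A) * P(A) / P(B)."""
    return likelihood * prior / evidence

def euclidean_distance(a, b):
    """Straight-line distance between two points with the same number of dimensions."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def perceptron(weights, bias, inputs):
    """Perceptron: fire (1) when the weighted sum of the inputs exceeds zero."""
    return 1 if bias + sum(w * x for w, x in zip(weights, inputs)) > 0 else 0

def mean_square_error(actual, predicted):
    """Average of the squared differences between actual and predicted values."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)

# Made-up example for Bayes: a condition with 1% prevalence, a test that detects it
# 90% of the time and gives a false alarm 5% of the time.
evidence = 0.9 * 0.01 + 0.05 * 0.99          # P(positive test)
posterior = bayes_posterior(prior=0.01, likelihood=0.9, evidence=evidence)
print(round(posterior, 3))                   # probability of the condition given a positive test
print(euclidean_distance((0, 0), (3, 4)))
```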
  • 41. WHAT TO DO IT WITH?
  • 42. TOOLS AND LIBRARIES IMPLEMENTING MACHINE LEARNING ALGORITHMS
  • 44. HOW TO PICK TOOLS FOR THE JOB • What are the jobs? • Gather data • Prepare data • Explore and (hopefully) Discover • Present • Embed & Deploy Model • What are considerations? • Volume • Speed and Time • Skills • Platform • Cost
  • 46. POPULAR FRAMEWORKS & LIBRARIES • TensorFlow • MXNet • Caffe • DL4J • Keras • … many more… Oracle Database Option Advanced Analytics #DevoxxMA
  • 47. NOTEBOOK – THE LAB JOURNAL FROM THE DATALAB • Common format for data exploration and presentation • User friendly interface on top of powerful technologies • Most popular implementations • Jupyter (fka IPython) • Apache Zeppelin • Spark Notebook • Beaker • SageMath (SageMathCloud => CoCalc) • Oracle Machine Learning Notebook UI • Try out Jupyter at: https://mybinder.org/
  • 49. OPEN DATA • Governments and NGOs, scientific and even commercial organizations are publishing data • Inviting anyone who wants to join in to help make sense of the data – understand driving factors, identify categories, help predict • Many areas • Economy, health, public safety, sports, traffic & transportation, games, environment, maps, …
  • 50. OPEN DATA – SOME EXAMPLES • Kaggle - Data Sets and [Samples of] Data Discovery: www.kaggle.com • India Government - data.gov.in • US, EU and UK Government Data: data.gov, open-data.europa.eu and data.gov.uk • Open Images Data Set: www.image-net.org • Open Data From World Bank: data.worldbank.org • Historic Football Data: api.football-data.org • New York City Open Data - opendata.cityofnewyork.us • Airports, Airlines, Flight Routes: openflights.org • Open Database – machine counterpart to Wikipedia: www.wikidata.org • Google Audio Set (manually annotated audio events) - research.google.com/audioset/ • Movielens - Movies, viewers and ratings: files.grouplens.org/datasets/movielens/
  • 51. WHAT IS HADOOP? • Big Data means Big Computing and Big Storage • Big requires scalable => horizontal scale out • Moving data is very expensive (network, disk IO) • Rather than move data to processor – move processing to data: distributed processing • Horizontal scale out => Hadoop: distributed data & distributed processing • HDFS – Hadoop Distributed File System • Map Reduce – parallel, distributed processing • Map-Reduce operates on data locally, then persists and aggregates results
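The map-reduce idea from this slide can be imitated on a single machine. The sketch below is a word count in plain Python, not Hadoop code: each string in `blocks` stands in for a data block stored on a different node, and the function names are illustrative.

```python
from collections import defaultdict

def map_phase(block):
    """Runs where the data block lives: emit a (word, 1) pair for every word."""
    return [(word.lower(), 1) for word in block.split()]

def shuffle(mapped_blocks):
    """Group the emitted values by key; only these small pairs travel over the network."""
    groups = defaultdict(list)
    for pairs in mapped_blocks:
        for key, value in pairs:
            groups[key].append(value)
    return groups

def reduce_phase(groups):
    """Aggregate the values per key into the final result."""
    return {key: sum(values) for key, values in groups.items()}

# Three made-up blocks, each imagined to be stored on a different node.
blocks = ["big data big storage", "big computing", "move processing to data"]
word_counts = reduce_phase(shuffle(map_phase(block) for block in blocks))
print(word_counts)
```

In a real cluster the map step runs in parallel on the nodes that hold the blocks, which is what "move processing to data" refers to.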
  • 52. WHAT IS SPARK? • Developing and orchestrating Map-Reduce on Hadoop is not simple • Running jobs can be slow due to frequent disk writing • Spark is for managing and orchestrating distributed processing on a variety of cluster systems • with Hadoop as the most obvious target • through APIs in Java, Python, R, Scala • Spark uses lazy operations and distributed in-memory data structures – offering much better performance • Through Spark – cluster based processing can be used interactively • Spark has additional modules that leverage distributed processing for running prepackaged jobs (SQL, Graph, ML, …)
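To make "lazy operations" concrete: transformations only describe work, and nothing is computed until a result is actually requested. The class below is a toy, single-machine analogy written for this purpose. It is not the Spark API, and `LazyDataset` and its methods are hypothetical names.

```python
class LazyDataset:
    """Toy stand-in for a lazily evaluated dataset (an analogy, not Spark itself)."""

    def __init__(self, source, steps=()):
        self._source = source
        self._steps = steps

    def map(self, fn):
        # Transformation: only record the step, do not run it yet.
        return LazyDataset(self._source, self._steps + (("map", fn),))

    def filter(self, fn):
        # Transformation: only record the step, do not run it yet.
        return LazyDataset(self._source, self._steps + (("filter", fn),))

    def collect(self):
        # Action: now run all recorded steps over the data in a single pass.
        result = []
        for item in self._source:
            keep = True
            for kind, fn in self._steps:
                if kind == "map":
                    item = fn(item)
                elif not fn(item):
                    keep = False
                    break
            if keep:
                result.append(item)
        return result

calls = []

def square(x):
    calls.append(x)   # record that the function was actually executed
    return x * x

pipeline = LazyDataset(range(6)).map(square).filter(lambda x: x % 2 == 0)
calls_before_action = len(calls)   # still 0: nothing has been computed yet
result = pipeline.collect()        # the action triggers the computation
print(calls_before_action, result)
```

Because the whole chain is known before anything runs, an engine can combine steps and keep intermediate data in memory instead of writing it to disk between stages.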
  • 54. EXAMPLE RUNNING AGAINST SPARK • https://github.com/jadianes/spark-movie-lens/blob/master/notebooks/building-recommender.ipynb
  • 55. WHAT IS ORACLE DOING AROUND MACHINE LEARNING? • Oracle Advanced Analytics in Oracle Database • Data Mining, Enterprise R • Text (ESA), Spatial, Graph • SQL
  • 57. DEMO: CONFERENCE ABSTRACT CLASSIFICATION CHALLENGE • Take all conference abstracts for • Train a Classification Model on picking the Conference Track • Based on Title, Summary [, Speaker, Level,…] • Use the Model to pick the Track for sessions at
  • 58. DEMONSTRATION OF ORACLE ADVANCED ANALYTICS • Using Text Mining and Naive Bayes Data Mining Classification • Train model for classifying conference abstracts into tracks • Use model to propose a track for new abstracts • Steps • Gather data • Import, cleanse, enrich, … • Prepare training set and test set • Select and configure model • Combining Text and Mining using Naive Bayes • Train model • Test and apply model
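The demo itself runs inside the Oracle Database. As a language-neutral illustration of the same idea (tokenize abstracts, train Naive Bayes, propose a track), here is a small standard-library Python sketch. The four training abstracts and the track names are invented, and whitespace tokenization with Laplace smoothing is an assumption of this sketch, not a description of what Oracle Advanced Analytics does internally.

```python
import math
from collections import Counter, defaultdict

def tokenize(text):
    return text.lower().split()

def train_naive_bayes(examples):
    """examples: list of (abstract_text, track). Returns the counts the classifier needs."""
    track_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for text, track in examples:
        track_counts[track] += 1
        for word in tokenize(text):
            word_counts[track][word] += 1
            vocabulary.add(word)
    return track_counts, word_counts, vocabulary

def classify(model, text):
    """Return the track with the highest log-probability for the given text."""
    track_counts, word_counts, vocabulary = model
    total_docs = sum(track_counts.values())
    best_track, best_score = None, -math.inf
    for track, doc_count in track_counts.items():
        score = math.log(doc_count / total_docs)          # prior P(track)
        total_words = sum(word_counts[track].values())
        for word in tokenize(text):
            if word in vocabulary:                        # ignore words never seen in training
                # Laplace smoothing avoids a zero probability for unseen word/track pairs.
                score += math.log((word_counts[track][word] + 1)
                                  / (total_words + len(vocabulary)))
        if score > best_score:
            best_track, best_score = track, score
    return best_track

# Invented training data: (abstract, track)
training_set = [
    ("sql tuning in the oracle database", "Database"),
    ("pl/sql and sql performance", "Database"),
    ("microservices with java and spring", "Java"),
    ("java streams and lambdas", "Java"),
]
model = train_naive_bayes(training_set)
proposed_track = classify(model, "advanced sql for the database developer")
print(proposed_track)
```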
  • 59. PREPARE DATABASE (IN THE CLOUD)
  • 60. PREPARE DATABASE (IN THE CLOUD)
  • 62. ONCE THE INSTANCE IS RUNNING...
  • 63. USE AS ANY ORACLE DATABASE INSTANCE – LOCAL, ON PREMISES, ... – ACCESSIBLE VIA SQL*NET
  • 64. TRAIN MODEL
      DECLARE
        xformlist dbms_data_mining_transform.TRANSFORM_LIST;
      BEGIN
        -- Mark the ABSTRACT column as unstructured text to be tokenized
        DBMS_DATA_MINING_TRANSFORM.SET_TRANSFORM(
          xformlist, 'abstract', NULL, 'abstract', NULL, 'TEXT(TOKEN_TYPE:NORMAL)');
        -- Train a classification model that predicts SESSION_TRACK;
        -- algorithm settings are read from the settings table
        DBMS_DATA_MINING.CREATE_MODEL (
            model_name          => 'SESSION_CLASS_NB'
          , mining_function     => dbms_data_mining.classification
          , data_table_name     => 'J1_SESSIONS'
          , case_id_column_name => 'session_title'
          , target_column_name  => 'session_track'
          , settings_table_name => 'session_class_nb_settings'
          , xform_list          => xformlist);
      END;
  • 68. BIG DATA SQL ORACLE DATABASE AS SINGLE POINT OF ENTRY
  • 69. MANY CLOUD SERVICES AROUND BIG DATA & [PREDICTIVE] ANALYTICS & MACHINE LEARNING
  • 70. WHAT IS ORACLE DOING AROUND MACHINE LEARNING? • Big Data Discovery (fka Endeca), Big Data Preparation and Big Data Compute • Big Data Appliance • Data Visualization Cloud • Analytics Cloud • Industry specific Analytics Clouds (Sales, Marketing, HCM) on top of SaaS • RTD – Real Time Decisions • DaaS • Oracle Labs (labs.oracle.com) • Machine Learning Research Group (link) • Machine Learning CS – “Oracle Notebook”
  • 71. ORACLE AI PLATFORM CLOUD SERVICE (COMING SHORTLY)
  • 72. HUMANS LEARNING MACHINE LEARNING: YOUR FIRST STEPS
  • 73. HUMANS LEARNING MACHINE LEARNING: YOUR FIRST STEPS • Jupyter Notebooks and Python – https://mybinder.org/ • Hortonworks Sandbox VM – Hadoop & Spark & Hive, Ambari • Databricks Cloud Environment with Apache Spark (free trial) • Katacoda – tutorials & live environment for TensorFlow • Oracle Big Data Lite – Prebuilt Virtual Machine • Data Visualization Desktop – ready to run desktop tool • Tutorials, Courses (Udacity, Coursera, edX) • Books • Introducing Data Science • Learning Apache Spark 2 • Python Machine Learning
  • 74. HANDS ON MACHINE LEARNING (BABY STEPS) • All materials are in: https://github.com/AMIS-Services Non Technical Technical Decision Trees
  • 75. SUMMARY • IoT, Big Data, Machine Learning => AI • Recent and Rapid Democratization of Machine Learning • Algorithms, Storage and Compute Resources, High Level Machine Learning Frameworks, Education resources, Open Data, Trained ML Models, Out of the Box SaaS capabilities – powered by ML • Produce business value today • Machine Learning by computers helps us(ers) understand historic data and apply that insight to new data • Developers have to learn how to incorporate Machine Learning into their applications – for smarter UIs, more automation, faster (p)reactions
  • 76. SUMMARY • R and Python are most popular technologies for data exploration and ML model discovery [on small subsets of Big Data] • Apache Spark (on Hadoop) is frequently used to powercrunch data (wrangling) and run ML models on Big Data sets • Notebooks are a popular vehicle in the Data Science lab • To explore and report • Oracle is quite active on Machine Learning • Power PaaS and SaaS with ML • Provide us with the Machine Learning Data Lab & Run Time (on the cloud) • Getting started on Machine Learning is fun, smart & well supported
  • 77. Thank you! • Blog: technology.amis.nl • Email: lucas.jellema@amis.nl • @lucasjellema • lucas-jellema • www.amis.nl, info@amis.nl
  • 78. HANDS ON • All materials are in: https://github.com/AMIS-Services Non Technical
  • 79. REFERENCES • AI Adventures (Google) https://www.youtube.com/watch?v=RJudqel8DVA • Twitch TV https://www.twitch.tv/videos/179940629 and sources on GitHub: https://github.com/sunilmallya/dl-twitch-series • TensorFlow & Deep Learning without a PhD (Devoxx) https://www.youtube.com/watch?v=vq2nnJ4g6N0 • Katacoda Browser Based Runtime for TensorFlow https://www.katacoda.com/courses/tensorflow • And many more

Editor's Notes

  1. Why do we study history? To understand the present and predict the future (from current events)
  2. IoT Social Media
  4. Market Basket Analysis: https://www.linkedin.com/pulse/using-machine-learning-market-basket-analysis-thomsen
  5. http://yann.lecun.com/exdb/mnist/ MNIST – handwritten images
  6. https://aiexperiments.withgoogle.com/quick-draw
  7. https://www.slideshare.net/databricks/apache-spark-model-deployment
  10. https://www.slideshare.net/AshishBansal17/tensorflow-vs-mxnet
  11. https://github.com/lucasjellema/theArtOfMachineLearning/blob/master/LinearRegression.ipynb https://nbviewer.jupyter.org/github/lucasjellema/devoxx17-intro-machine-learning/blob/master/LinearRegression.ipynb https://github.com/lucasjellema/jupyter-notebook-eredivisie/blob/master/EredivisieResults_2016_2017.ipynb https://github.com/jadianes/spark-movie-lens/blob/master/notebooks/building-recommender.ipynb https://github.com/justmarkham/DAT4/blob/master/notebooks/08_linear_regression.ipynb
  13. Open data sources • Airports, airlines, flight routes: https://openflights.org/data.html • Google Audio Set (a large-scale dataset of manually annotated audio events): https://research.google.com/audioset/ • Open Images Data Set: https://github.com/openimages/dataset, www.image-net.org • Historic football data: http://api.football-data.org/index • MovieLens: http://files.grouplens.org/datasets/movielens/ml-latest-small-README.html • UK Data: https://data.gov.uk/ • Open Data Sets (Kaggle): https://www.kaggle.com/datasets • CBS Open Data: https://www.cbs.nl/nl-nl/onze-diensten/open-data • Open Data Sets for Deep Learning: https://deeplearning4j.org/opendata • Data.gov: the home of the US Government’s open data • https://open-data.europa.eu/: the home of the European Commission’s open data • Wikidata (in part originated from Freebase.org, an open database that retrieves its information from sites like Wikipedia, MusicBrainz, and the SEC archive): https://www.wikidata.org • Data.worldbank.org: open data initiative from the World Bank • Aiddata.org: open data for international development • Open.fda.gov: open data from the US Food and Drug Administration • Google Knowledge Graph API: https://developers.google.com/knowledge-graph/ • Detroit Open Data Portal: https://data.detroitmi.gov/ • Example, Detroit Police crime statistics: https://data.detroitmi.gov/Public-Safety/-Archived-All-Crime-Incidents-2009-May-5-2017/b4hw-v6w2
  14. https://github.com/jadianes/spark-movie-lens/blob/master/notebooks/building-recommender.ipynb
  15. https://www.oracle.com/big-data/big-data-discovery/index.html https://labs.oracle.com/pls/apex/f?p=labs:49:::::P49_PROJECT_ID:7 https://technology.amis.nl/2004/10/16/hidden-plsql-gem-in-10g-dbms_frequent_itemset-for-plsql-based-data-mining/ http://oracledmt.blogspot.nl/2006/05/sql-of-analytics-1-data-mining.html
  16. https://www.oracle.com/big-data/big-data-discovery/index.html https://labs.oracle.com/pls/apex/f?p=labs:49:::::P49_PROJECT_ID:7
  17. http://tmpnb.org http://www.oracle.com/technetwork/database/bigdata-appliance/oracle-bigdatalite-2104726.html https://www.udacity.com/course/intro-to-machine-learning--ud120 https://www.coursera.org/learn/machine-learning#%20 https://www.edx.org/course/machine-learning-columbiax-csmm-102x-0 https://technology.amis.nl/2017/05/06/the-hello-world-of-machine-learning-with-python-pandas-jupyter-doing-iris-classification-based-on-quintessential-set-of-flower-data/ https://github.com/rhiever/Data-Analysis-and-Machine-Learning-Projects/blob/master/example-data-science-notebook/Example%20Machine%20Learning%20Notebook.ipynb https://databricks.com/try-databricks https://hortonworks.com/products/sandbox/ http://www.oracle.com/technetwork/middleware/oracle-data-visualization/downloads/oracle-data-visualization-desktop-2938957.html