What's The Role of Machine Learning In Fast Data and Streaming Applications?
WEBINAR
Emre Velipasaoglu, Ph.D., Principal Data Scientist
What is Machine Learning?
A computer program is a set of explicit
instructions that produce an output for a
given input.
Machine Learning (ML) is about how to
program computers to improve
automatically with experience rather
than explicit instructions.
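// Explicit instructions: the rule y = x/2 + 1 is written by hand.
// Long literals are needed because 10^12 overflows Int; the iterator is lazy,
// so the 10^12 inputs are never held in memory at once.
val x = Iterator.iterate(0L)(_ + 1).takeWhile(_ <= 1000000000000L)
val y = x.map(x => x / 2.0 + 1)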
1 -> 1.5
2 -> 2.0
=>
10^12 -> ?
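The contrast on this slide can be sketched in code: instead of hand-writing the rule, fit it from the two example pairs and then predict the unseen input. This is a minimal sketch, assuming the unknown rule is a straight line and using ordinary least squares; the names `LearnFromExamples`, `Line` and `fitLine` are illustrative and do not come from the talk.

```scala
object LearnFromExamples {
  // A learned line y = slope * x + intercept.
  final case class Line(slope: Double, intercept: Double) {
    def predict(x: Double): Double = slope * x + intercept
  }

  // Ordinary least squares over (x, y) example pairs.
  // Assumes at least two examples with distinct x values.
  def fitLine(examples: Seq[(Double, Double)]): Line = {
    val n = examples.size.toDouble
    val meanX = examples.map(_._1).sum / n
    val meanY = examples.map(_._2).sum / n
    val cov = examples.map { case (x, y) => (x - meanX) * (y - meanY) }.sum
    val varX = examples.map { case (x, _) => (x - meanX) * (x - meanX) }.sum
    val slope = cov / varX
    Line(slope, meanY - slope * meanX)
  }

  // The two examples from the slide: 1 -> 1.5 and 2 -> 2.0.
  val examples: Seq[(Double, Double)] = Seq((1.0, 1.5), (2.0, 2.0))
  val model: Line = fitLine(examples)
}
```

Calling `LearnFromExamples.model.predict(1e12)` then answers the slide's `10^12 -> ?` from the examples alone; with these two points the fit recovers the same rule as `x/2.0 + 1`.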
Why does everyone want to use it?
A lot of the recent transformative technologies are based on ML:
• optical character recognition
• speech recognition
• fraud detection
• web search
• personalized marketing and advertising
• Computer Aided medical Diagnosis (CADx)
Why does everyone want to use it?
Emerging trends that will leverage ML:
• Internet of Things (IoT)
• Augmented Reality (AR) and Virtual Reality (VR)
• Autonomous vehicles
• Customer service chat bots
• Security: Face/voice/biometrics recognition
• Healthcare: drug discovery, outcome prediction, personalized care
• Democratization of ML and the long tail of ML applications
What are some of the use cases?
Machines do certain tasks better than humans.
• IBM Deep Blue in chess
• IBM Watson in Jeopardy!
• Google AlphaGo in Go
• Lip reading: LipNet 93% vs. humans 65%
What are some of the use cases?
Machines are more cost efficient in certain tasks:
• Transcribing: Microsoft 89% vs. humans 89%
• Computer-aided diagnosis: e.g. "Dermatologist-level classification of skin cancer with deep neural networks", Esteva et al., published in Nature, February 2017.
What are some of the use cases?
Machines are the only way to scale up processing in certain tasks.
• Previewing video: Clarifai can analyze 3.5 minutes of video in 10 seconds for detecting objects.
• Commercial loan agreements review: AI in seconds vs. humans in 360,000 hours.
Why should you care?
Information gives competitive advantage.
ML unlocks information from your data
It is your product, your data, your operations.
ML is being democratized.
It is not for a handful of giant tech companies anymore.
How does it work?
E.g. Augmented Reality (AR) Shopping Personalization:
• Shopping is one area where AR is expected to have an impact.
• IBM CeBIT 2013 app:
  • scans a shelf
  • recognizes products
  • overlays nutritional info
• Add a recommender system, tailored to
  • your customers,
  • your product catalog.
Learning a Recommender
User Rating Matrix
A B C D
Alan 5 1 1
Emre 4 2 3
Vishal 5 1
Matrix Factorization
User Latent Factor Model
f1 f2
Alan 1.63 0.89
Emre 0.89 2.10
Vishal 2.03 1.01
Item Latent Factor Model
A B C D
f1 2.21 1.88 -0.24 0.33
f2 1.68 1.22 1.51 0.74
Scoring Items
Query User -> User Latent Factors
f1 f2
Vishal 2.03 1.01
Query Item -> Item Latent Factors
A C D
f1 2.21 -0.24 0.33
f2 1.68 1.51 0.74
Estimate Ratings -> Ranked Items
score
A 6.00
D 1.41
C 1.05
Application / User / Items
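The scoring step on this slide can be sketched as a dot product between a user's latent factors and each candidate item's latent factors, followed by a sort. This is only a sketch: the factor values are the rounded ones printed on the slides, and the names `ScoreItems`, `dot` and `rank` are illustrative. With these rounded inputs the dot products come out at roughly 6.18 (A), 1.42 (D) and 1.04 (C), which gives the same ranking as the slide but not exactly the same scores; presumably the slide's scores were computed from unrounded factors.

```scala
object ScoreItems {
  // Rounded latent factors as printed on the slides.
  val userFactors: Map[String, Vector[Double]] = Map(
    "Alan"   -> Vector(1.63, 0.89),
    "Emre"   -> Vector(0.89, 2.10),
    "Vishal" -> Vector(2.03, 1.01)
  )
  val itemFactors: Map[String, Vector[Double]] = Map(
    "A" -> Vector(2.21, 1.68),
    "B" -> Vector(1.88, 1.22),
    "C" -> Vector(-0.24, 1.51),
    "D" -> Vector(0.33, 0.74)
  )

  // Estimated rating = dot product of user and item factor vectors.
  def dot(u: Vector[Double], v: Vector[Double]): Double =
    u.zip(v).map { case (a, b) => a * b }.sum

  // Score the candidate items for one user and sort best-first.
  // Unknown users or items are skipped rather than guessed.
  def rank(user: String, candidates: Seq[String]): Seq[(String, Double)] =
    userFactors.get(user) match {
      case None => Seq.empty
      case Some(u) =>
        candidates
          .flatMap(i => itemFactors.get(i).map(q => (i, dot(u, q))))
          .sortBy { case (_, score) => -score }
    }
}
```

For example, `ScoreItems.rank("Vishal", Seq("A", "C", "D"))` returns the three items ordered A, D, C.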
Modern ML
• Size of data
  • Does not fit in one node, must distribute
  • E.g. billions of user x item ratings (several orders of magnitude more historical events to aggregate ratings from)
• Size of model
  • Does not fit in one node, must distribute
  • E.g. millions of users, thousands of products
• Learning speed
  • Anywhere from real-time model updates to batch updates in minutes
• Operational latency and throughput
  • Low milliseconds response time for millions of transactions
ML Lifecycle - Development
Early research
• Exploration of modeling techniques
Iterations
• Feature selection
• Training parameter tuning
Productization
• Feature computations, model updating, scoring, caching, optimizations, etc. (e.g. update and query of latent factors)
Example: recommender system
• Modeling techniques: collaborative filtering, content-based filtering, hybrid models, Bayesian networks, clustering, latent semantic models, Markov decision process …
• Factorization methods: singular value decomposition, alternating least squares, non-negative matrix factorization
• Training parameters: number of latent factors, step size, convergence criteria
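To make the three tuning parameters above concrete (number of latent factors, step size, convergence criteria), here is a minimal sketch of matrix factorization trained by stochastic gradient descent. SGD is swapped in for illustration because it exposes a step size directly; the slides name SVD, ALS and NMF but do not say which method the talk's example used. The regularization weight `reg`, the fixed seed, and the placement of ratings in `sampleRatings` are assumptions (the extracted rating matrix lost its blank cells).

```scala
import scala.util.Random

object SgdFactorization {
  final case class Rating(user: Int, item: Int, value: Double)

  final case class Model(userF: Array[Array[Double]], itemF: Array[Array[Double]]) {
    def predict(u: Int, i: Int): Double =
      userF(u).indices.map(k => userF(u)(k) * itemF(i)(k)).sum

    def rmse(data: Seq[Rating]): Double =
      math.sqrt(data.map { r =>
        val e = r.value - predict(r.user, r.item)
        e * e
      }.sum / data.size)
  }

  // Hypothetical cell placement: users 0..2 = Alan, Emre, Vishal; items 0..3 = A..D.
  val sampleRatings: Seq[Rating] = Seq(
    Rating(0, 0, 5.0), Rating(0, 2, 1.0), Rating(0, 3, 1.0),
    Rating(1, 0, 4.0), Rating(1, 1, 2.0), Rating(1, 3, 3.0),
    Rating(2, 1, 5.0), Rating(2, 2, 1.0)
  )

  // Random starting point in [0, 1); a fixed seed keeps runs repeatable.
  def init(numUsers: Int, numItems: Int, numFactors: Int, seed: Long): Model = {
    val rng = new Random(seed)
    Model(
      Array.fill(numUsers, numFactors)(rng.nextDouble()),
      Array.fill(numItems, numFactors)(rng.nextDouble())
    )
  }

  // Updates the model in place; returns the number of epochs run.
  // Stops when the training RMSE changes by less than `tolerance` between epochs.
  def train(model: Model, data: Seq[Rating], stepSize: Double, reg: Double,
            tolerance: Double, maxEpochs: Int): Int = {
    var prev = model.rmse(data)
    var epoch = 0
    var converged = false
    while (epoch < maxEpochs && !converged) {
      for (r <- data) {
        val p = model.userF(r.user)
        val q = model.itemF(r.item)
        val err = r.value - model.predict(r.user, r.item)
        for (k <- p.indices) {
          val pk = p(k) // keep the old value so both updates use pre-step factors
          p(k) += stepSize * (err * q(k) - reg * pk)
          q(k) += stepSize * (err * pk - reg * q(k))
        }
      }
      val cur = model.rmse(data)
      converged = math.abs(prev - cur) < tolerance
      prev = cur
      epoch += 1
    }
    epoch
  }
}
```

A run such as `train(init(3, 4, numFactors = 2, seed = 42L), sampleRatings, stepSize = 0.01, reg = 0.02, tolerance = 1e-6, maxEpochs = 5000)` exercises all three knobs: more factors fit the ratings more closely, a larger step size learns faster but can become unstable, and a tighter tolerance trades training time for fit.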
ML Lifecycle - Management
Monitoring
• Model performance
• Latency
• Throughput
• Model quality
• Drift (e.g. have the user’s tastes changed recently?)
• Security and robustness
Controlling
• Model optimization (for performance)
• Model update (for quality)
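One way to monitor model quality for drift is to track prediction error over a sliding window of recent observations and compare it with the error measured at training time. The sketch below is an assumption about how such a check might look, not something described in the talk; `DriftMonitor`, `baselineRmse`, `windowSize` and `ratioThreshold` are hypothetical names and parameters.

```scala
import scala.collection.immutable.Queue

// Rolling-window check on squared prediction error.
final case class DriftMonitor(
    baselineRmse: Double,   // RMSE measured when the model was last trained (assumed known)
    windowSize: Int,        // number of most recent observations to keep
    ratioThreshold: Double, // flag drift when recent RMSE exceeds baseline by this factor
    errors: Queue[Double] = Queue.empty
) {
  // Record one (predicted, actual) pair, dropping the oldest beyond the window.
  def observe(predicted: Double, actual: Double): DriftMonitor = {
    val e = predicted - actual
    val next = errors.enqueue(e * e)
    copy(errors = if (next.size > windowSize) next.tail else next)
  }

  def recentRmse: Option[Double] =
    if (errors.isEmpty) None else Some(math.sqrt(errors.sum / errors.size))

  // Only judge once the window is full, to avoid alarms on a handful of samples.
  def drifted: Boolean =
    errors.size >= windowSize && recentRmse.exists(_ > baselineRmse * ratioThreshold)
}
```

A `drifted` result of `true` would be the signal to trigger the "model update (for quality)" control action listed above.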
Which tools are available and what do they do?
Machine Learning
Spark MLlib      ML library for Spark
Flink ML         ML library for Flink
Mahout           Distributed or scalable ML algorithms
TensorFlow       Google's open source deep learning library
Theano           Numerical library for Python, especially for deep learning
Deeplearning4j   Deep learning library in Java
BigDL            Intel’s distributed deep learning library on Spark
scikit-learn     Main ML library for Python
OpenNLP          ML-based toolkit for the processing of natural language text
Which tools are available and what do they do?
Streaming
Flink             Stream processing framework with sophisticated handling of late-arriving data
Spark Streaming   Dataset-based computing framework with mini-batch streaming support
Beam              API for data processing pipelines
Data Ingestion
Kafka             Distributed stream processing for high-throughput, low-latency, real-time data feeds
Flume             Log processing
Which tools are available and what do they do?
Persistence and Storage
HDFS            Hadoop-based distributed file system
Cassandra       Distributed NoSQL database management system
Elasticsearch   Distributed, RESTful search and analytics engine
AWS S3          Cloud-based object store
How does the Fast Data Platform tie it all together?
[Architecture diagram]
Batch: HDFS (User Rating Matrix) -> Spark (Matrix Factorization) -> Cassandra (Latent Item Model, Latent User Model)
Streaming: Application (User, Items) -> Kafka -> Flink / Akka Streams (Query Item, Query User -> Item Factors, User Factors -> Score) -> Kafka (Ranked Items) -> Application
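The hand-off between the batch and streaming halves of this architecture can be sketched with in-memory stand-ins: the batch job publishes a complete factor model, and the streaming scorer reads whichever model is current for each request. This is an illustrative sketch only. An `AtomicReference` stands in for the factor tables in Cassandra and a plain `Iterator` stands in for the Kafka topics; none of the names below come from the Fast Data Platform itself.

```scala
import java.util.concurrent.atomic.AtomicReference

object ScoringService {
  final case class FactorModel(
      users: Map[String, Vector[Double]],
      items: Map[String, Vector[Double]]
  )
  final case class Request(user: String, candidates: Seq[String])
  final case class Ranked(user: String, items: Seq[(String, Double)])

  // Stand-in for the latent factor tables the batch job writes to storage.
  private val current = new AtomicReference(FactorModel(Map.empty, Map.empty))

  // Batch side: publish a freshly trained model as one atomic swap,
  // so a request never sees user factors from one model and item factors from another.
  def publish(model: FactorModel): Unit = current.set(model)

  // Streaming side: score each request against whichever model is current.
  // Users or items without factors yield no scores instead of a guess.
  def handle(requests: Iterator[Request]): Iterator[Ranked] =
    requests.map { req =>
      val model = current.get()
      val scored = for {
        u <- model.users.get(req.user).toSeq
        i <- req.candidates
        q <- model.items.get(i).toSeq
      } yield (i, u.zip(q).map { case (a, b) => a * b }.sum)
      Ranked(req.user, scored.sortBy { case (_, s) => -s })
    }
}
```

Keeping the model behind a single reference is one simple design choice for this split: training can take minutes while scoring stays a cheap in-memory lookup per request.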
What else does FDP provide?
• data persistence & storage
• stream processing
• machine learning
• cluster analysis
• infrastructure
• durable messaging backplane
• microservices
• intelligent management
In Summary
• Machine Learning is the way to build transformative, data-driven products that would otherwise be impossible to build.
• It is not difficult to build Machine Learning based solutions, thanks to new open source tools.
• Lightbend’s Fast Data Platform provides an easy on-ramp for building, deploying and running Fast Data clusters and services, leveraging best-of-breed tools.
Upgrade your grey matter!
Get the free O’Reilly book by Dr. Dean Wampler, VP of Fast Data Engineering at Lightbend
bit.ly/lightbend-fast-data
lightbend.com/fast-data-platform
What's The Role Of Machine Learning In Fast Data And Streaming Applications?

  • 1. What's The Role of Machine Learning In Fast Data and Streaming Applications?
       WEBINAR
       Emre Velipasaoglu, Ph.D., Principal Data Scientist
  • 2. What is Machine Learning?
       A computer program is a set of explicit instructions that produce an output for a given input.
       Machine Learning (ML) is about how to program computers to improve automatically with experience rather than explicit instructions.
           val x = 0 to 1000000000000
           val y = x.map(x => x/2.0 + 1)
       1 -> 1.5, 2 -> 2.0 => 10^12 -> ?
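One way to read the slide's example is that, instead of being handed the rule `x/2.0 + 1`, a learner sees only the pairs `1 -> 1.5` and `2 -> 2.0` and has to estimate the rule before it can answer `10^12 -> ?`. The sketch below is a minimal illustration of that reading, assuming a straight-line model fitted by ordinary least squares. The object name `LearnFromExamples` and the helper `fitLine` are hypothetical and do not come from the deck.

```scala
object LearnFromExamples {
  // Fit y = a * x + b by ordinary least squares on (x, y) example pairs.
  // Assumes at least two examples with distinct x values.
  def fitLine(points: Seq[(Double, Double)]): (Double, Double) = {
    val n = points.size.toDouble
    val meanX = points.map(_._1).sum / n
    val meanY = points.map(_._2).sum / n
    val sxy = points.map { case (x, y) => (x - meanX) * (y - meanY) }.sum
    val sxx = points.map { case (x, _) => (x - meanX) * (x - meanX) }.sum
    val a = sxy / sxx
    (a, meanY - a * meanX)
  }

  def main(args: Array[String]): Unit = {
    // The two examples shown on the slide: 1 -> 1.5 and 2 -> 2.0.
    val examples = Seq((1.0, 1.5), (2.0, 2.0))
    val (a, b) = fitLine(examples)
    // The learned rule can now be applied to an input that was never seen.
    val predict = (x: Double) => a * x + b
    println(f"learned: y = $a%.2f * x + $b%.2f")
    println(predict(1e12))
  }
}
```

With only two examples the fit is exact. Real training data is noisy, which is where the choice of model and training procedure starts to matter.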
  • 3. Why does everyone want to use it?
       A lot of the recent transformative technologies are based on ML:
       • optical character recognition
       • speech recognition
       • fraud detection
       • web search
       • personalized marketing and advertising
       • Computer Aided medical Diagnosis (CADx)
  • 4. Why does everyone want to use it?
       Emerging trends that will leverage ML:
       • Internet of Things (IoT)
       • Augmented Reality (AR) and Virtual Reality (VR)
       • Autonomous vehicles
       • Customer service chat bots
       • Security: Face/voice/biometrics recognition
       • Healthcare: drug discovery, outcome prediction, personalized care
       • Democratization of ML and the long tail of ML applications
  • 5. What are some of the use cases?
       Machines do certain tasks better than humans.
       • IBM Deep Blue in chess
       • IBM Watson in Jeopardy
       • Google AlphaGo in Go
       • Lip reading: LipNet 93% vs. humans 65%
  • 6. What are some of the use cases?
       Machines are more cost efficient in certain tasks:
       • Transcribing: Microsoft 89% vs. humans 89%
       • Computer-aided diagnosis: e.g. "Dermatologist-level classification of skin cancer with deep neural networks", Esteva et al., published in Nature, June 2017.
  • 7. What are some of the use cases?
       Machines are the only way to scale up processing in certain tasks.
       • Previewing video: Clarifai can analyze 3.5 minutes of video in 10 seconds for detecting objects.
       • Commercial loan agreements review: AI in seconds vs. humans in 360,000 hours.
  • 8. Why should you care?
       • Information gives competitive advantage. ML unlocks information from your data.
       • It is your product, your data, your operations.
       • ML is being democratized. It is not for a handful of giant tech companies anymore.
  • 9. How does it work?
       E.g. Augmented Reality (AR) Shopping Personalization:
       • Shopping is one area where AR is expected to have an impact.
       • IBM CeBIT 2013 app:
           • scans a shelf
           • recognizes products
           • overlays nutritional info
       • Add a recommender system, tailored to
           • your customers,
           • your product catalog.
  • 10. Learning a Recommender
       User Rating Matrix -> Matrix Factorization -> User Latent Factor Model, Item Latent Factor Model

       User Rating Matrix (items A, B, C, D; unrated cells are blank)
           Alan     5  1  1
           Emre     4  2  3
           Vishal   5  1

       User Latent Factor Model
                    f1     f2
           Alan     1.63   0.89
           Emre     0.89   2.10
           Vishal   2.03   1.01

       Item Latent Factor Model
                 A      B      C      D
           f1    2.21   1.88  -0.24   0.33
           f2    1.68   1.22   1.51   0.74
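The factorization step on this slide can be sketched as follows. This is a minimal illustration, assuming plain stochastic gradient descent on squared error with L2 regularization. The deck does not say which algorithm produced its numbers, and slide 13 also lists alternatives such as alternating least squares. The placement of ratings in item columns is hypothetical, because the extracted matrix has lost its blank cells, and all names (`MatrixFactorizationSketch`, `factorize`, `rmse`) and hyperparameter values are assumptions.

```scala
import scala.util.Random

object MatrixFactorizationSketch {
  // One observed rating: (user index, item index, rating). Unrated cells are simply absent.
  type Rating = (Int, Int, Double)
  type Factors = Array[Array[Double]]

  def dot(a: Array[Double], b: Array[Double]): Double =
    a.indices.map(k => a(k) * b(k)).sum

  // Learn user and item latent factors by stochastic gradient descent on squared error.
  def factorize(ratings: Seq[Rating], nUsers: Int, nItems: Int, nFactors: Int,
                stepSize: Double, reg: Double, epochs: Int, seed: Long): (Factors, Factors) = {
    val rnd = new Random(seed)
    val users = Array.fill(nUsers, nFactors)(rnd.nextDouble())
    val items = Array.fill(nItems, nFactors)(rnd.nextDouble())
    for (_ <- 1 to epochs; (u, i, r) <- ratings) {
      val err = r - dot(users(u), items(i))
      for (k <- 0 until nFactors) {
        val uk = users(u)(k)
        val ik = items(i)(k)
        // Move both factors toward a lower squared error, shrinking them slightly (L2 penalty).
        users(u)(k) += stepSize * (err * ik - reg * uk)
        items(i)(k) += stepSize * (err * uk - reg * ik)
      }
    }
    (users, items)
  }

  // Root mean squared error over the observed ratings only.
  def rmse(ratings: Seq[Rating], users: Factors, items: Factors): Double =
    math.sqrt(ratings.map { case (u, i, r) => math.pow(r - dot(users(u), items(i)), 2) }.sum / ratings.size)

  def main(args: Array[String]): Unit = {
    // HYPOTHETICAL cell placement: users 0..2 = Alan, Emre, Vishal; items 0..3 = A, B, C, D.
    val ratings: Seq[Rating] = Seq(
      (0, 0, 5.0), (0, 2, 1.0), (0, 3, 1.0),
      (1, 1, 4.0), (1, 2, 2.0), (1, 3, 3.0),
      (2, 1, 5.0), (2, 2, 1.0))
    val (users, items) = factorize(ratings, nUsers = 3, nItems = 4, nFactors = 2,
      stepSize = 0.01, reg = 0.02, epochs = 5000, seed = 42L)
    println(f"training RMSE: ${rmse(ratings, users, items)}%.3f")
  }
}
```

The learned factors will not match the values printed on the slide, because factorizations are not unique and the input here is assumed. The point is only that two small dense tables can stand in for a sparse rating matrix.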
  • 11. Scoring Items
       Application input: User, Items
       Query User (against the User Latent Factor Model from slide 10) -> User Latent Factors
                    f1     f2
           Vishal   2.03   1.01
       Query Item (against the Item Latent Factor Model from slide 10) -> Item Latent Factors
                 A      C      D
           f1    2.21  -0.24   0.33
           f2    1.68   1.51   0.74
       Estimate Ratings -> Ranked Items
           item   score
           A      6.00
           D      1.41
           C      1.05
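In a latent factor model, the estimated rating for a user and an item is the dot product of their factor vectors, and ranking is a sort over those estimates. The sketch below applies that to the factor values printed on the slide for Vishal and items A, C, and D. The names `ScoringSketch`, `score`, and `rank` are hypothetical. Scores computed from the rounded factors as printed may differ slightly from the scores displayed on the slide, although the resulting order (A, D, C) is the same.

```scala
object ScoringSketch {
  // Estimated rating = dot product of the user's and the item's latent factors.
  def score(user: Seq[Double], item: Seq[Double]): Double =
    user.zip(item).map { case (u, i) => u * i }.sum

  // Score every candidate item for one user and sort by descending score.
  def rank(user: Seq[Double], items: Map[String, Seq[Double]]): Seq[(String, Double)] =
    items.toSeq
      .map { case (name, factors) => name -> score(user, factors) }
      .sortBy { case (_, s) => -s }

  def main(args: Array[String]): Unit = {
    // Factor values copied from the slide (f1, f2).
    val vishal = Seq(2.03, 1.01)
    val candidates = Map(
      "A" -> Seq(2.21, 1.68),
      "C" -> Seq(-0.24, 1.51),
      "D" -> Seq(0.33, 0.74))
    rank(vishal, candidates).foreach { case (name, s) => println(f"$name%s $s%.2f") }
  }
}
```

Because scoring needs only two small vectors per request, it suits the low-latency serving path, while the expensive factorization can run separately in batch.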
  • 12. Modern ML
       • Size of data
           • Does not fit in one node, must distribute
           • E.g. billions of user x item ratings (several orders of magnitude more historical events to aggregate ratings from)
       • Size of model
           • Does not fit in one node, must distribute
           • E.g. millions of users, thousands of products
       • Learning speed
           • Anywhere from real-time model updates to batch updates in minutes
       • Operational latency and throughput
           • Low milliseconds response time for millions of transactions
  • 13. ML Lifecycle - Development
       Early research
       • Exploration of modeling techniques
       Iterations
       • Feature selection
       • Training parameter tuning
       Productization
       • Feature computations, model updating, scoring, caching, optimizations, etc. (e.g. update and query of latent factors)
       Examples for a recommender system:
       • Modeling techniques: collaborative filtering, content-based filtering, hybrid models, Bayesian networks, clustering, latent semantic models, Markov decision process, …
       • Algorithms: singular value decomposition, alternating least squares, non-negative matrix factorization
       • Training parameters: number of latent factors, step size, convergence criteria
  • 14. ML Lifecycle - Management
       Monitoring
       • Model performance
       • Latency
       • Throughput
       • Model quality
       • Drift (e.g. have the user’s tastes changed recently?)
       • Security and robustness
       Controlling
       • Model optimization (for performance)
       • Model update (for quality)
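One simple way to monitor for the drift mentioned above is to compare the model's error on recent feedback with its error on an earlier baseline window. The sketch below is a minimal illustration of that idea, not a method described in the deck. The names `DriftCheckSketch`, `meanAbsError`, and `hasDrifted`, and the `tolerance` threshold, are all assumptions.

```scala
object DriftCheckSketch {
  // Mean absolute error over (predicted, observed) rating pairs. Assumes a non-empty window.
  def meanAbsError(pairs: Seq[(Double, Double)]): Double =
    pairs.map { case (predicted, observed) => math.abs(predicted - observed) }.sum / pairs.size

  // Flag drift when the recent error exceeds the baseline error by more than `tolerance`.
  def hasDrifted(baseline: Seq[(Double, Double)],
                 recent: Seq[(Double, Double)],
                 tolerance: Double): Boolean =
    meanAbsError(recent) > meanAbsError(baseline) + tolerance

  def main(args: Array[String]): Unit = {
    // Illustrative numbers only: predictions were close before and are far off now.
    val baseline = Seq((5.0, 5.0), (1.0, 1.5))
    val recent = Seq((5.0, 2.0), (1.0, 4.0))
    println(hasDrifted(baseline, recent, tolerance = 0.5))
  }
}
```

A drift flag like this would typically trigger the "Model update (for quality)" control action rather than any change to serving performance.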
  • 15. Which tools are available and what do they do?
       Machine Learning
       • Spark MLlib: ML library for Spark
       • Flink ML: ML library for Flink
       • Mahout: Distributed or scalable ML algorithms
       • TensorFlow: Google's open source deep learning library
       • Theano: Numerical library for Python, especially for deep learning
       • Deeplearning4j: Deep learning library in Java
       • BigDL: Intel’s distributed deep learning library on Spark
       • scikit-learn: Main ML library for Python
       • OpenNLP: ML based toolkit for the processing of natural language text
  • 16. Which tools are available and what do they do?
       Streaming
       • Flink: Stream processing framework with sophisticated handling of late arriving data
       • Spark Streaming: Dataset based computing framework with mini-batch streaming support
       • Beam: API for data processing pipelines
       Data Ingestion
       • Kafka: Distributed stream processing for high-throughput, low-latency, real-time data feeds
       • Flume: Log processing
  • 17. Which tools are available and what do they do?
       Persistence and Storage
       • HDFS: Hadoop based Distributed File System
       • Cassandra: Distributed NoSQL database management system
       • ElasticSearch: Distributed, RESTful search and analytics engine
       • AWS S3: Cloud based object store
  • 18. How does the Fast Data Platform tie it all together?
       Batch: HDFS (User Rating Matrix) -> Spark (Matrix Factorization) -> Cassandra (Latent Item Model, Latent User Model)
       Streaming: Application -> Kafka (User, Items) -> Flink / Akka Streams (Query Item and Query User against Cassandra -> Item Factors, User Factors -> Score) -> Kafka (Ranked Items) -> Application
  • 19. What else does FDP provide?
       • data persistence & storage
       • stream processing
       • machine learning
       • cluster analysis
       • infrastructure
       • durable messaging backplane
       • microservices
       • intelligent management
  • 20. In Summary
       • Machine Learning is the way to build transformative, data-driven products that would otherwise be impossible to build.
       • It is not difficult to build Machine Learning based solutions, thanks to new open source tools.
       • Lightbend’s Fast Data Platform provides an easy onramp for building, deploying and running Fast Data clusters and services leveraging best of breed tools.
  • 21. Upgrade your grey matter!
       Get the free O’Reilly book by Dr. Dean Wampler, VP of Fast Data Engineering at Lightbend: bit.ly/lightbend-fast-data