Data intensive applications with Apache Flink - Simone Robutti, Radicalbit

•

1 gefällt mir•747 views

Data Science Milan

"Data intensive applications with Apache Flink" by Simone Robutti, Machine Learning Engineer @ Radicalbit In the last 10 years, the IT industry has seen a complete revolution in the perceived value that computing has on businesses and how engineers think about applications: in several application domains, the need for data has outgrown the capacity of commodity hardware and the need for information has outpaced traditional processing technologies and approaches. In this talk we'll introduce Apache Flink, a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams. It is an open source project that builds on top of proven approaches, as well as innovative algorithms. We will go in-depth on how this tool can be used to implement data-intensive applications, in particular regarding present tools and future perspectives to use machine learning algorithms in a distributed context. Simone Robutti, 27, Machine Learning Engineer at Radicalbit. He achieved a Master’s Degree at Università degli studi di Milano with a thesis on SVM for noisy labeled datasets. From then on his interests shifted towards the engineering side of Machine Learning and Big Data: implementation, deploy, portability and maintainability of ML-intensive systems. Right now his focus in Radicalbit is Flink and its Machine Learning library FlinkML.

Daten & Analysen

Milan – July 13 2016
Data Intensive Applications with Apache Flink
Simone Robutti
Machine Learning Engineer at Radicalbit
@SimoneRobutti

Agenda
1. Brief Introduction to Apache Flink
○ Why
○ What
○ How
2. Machine Learning on Flink
○ Present landscape
○ Future of the Ecosystem
3. Closing notes on Radicalbit (shameless plug ahead)

100% Buzzword-free guaranteed
Big Data
Machine
Intelligence
Web-scale
400x
It’s like the
human brain
Exactly-once
Exactly-once

Why Flink (and not Spark/Storm/Samza...)
Because it’s
production-ready
streaming-first
low-latency
fault-tolerant
high-throughput
processing engine

Flink: what is it?
From Flink’s Documentation

Connectors and integrations

Flink’s Runtime
From Flink’s Documentation

Flink’s DataFlow
From Flink’s Documentation
Written by the user through DataSet/DataStream API
Compiled and optimized in the client

Flink’s DataFlow
From Flink’s Documentation
The compiled job is translated to distributed tasks by
the master and executed by workers

Machine Learning on Flink

Ready and awesome for parallel ML
Work in progress for distributed ML
ML on Flink

Flink for Model Evaluation Pipelines
Source
Data
Preparation
Evaluation Sink
Source
Post
process
-ing
Composable, modular Flink Operator

Evaluation with Flink-JPMML
Source
Operator
Flink -
JPMML
Operator
Sink
Operator
Source
Operator
model.pmml
Small library that implements basic model eval.
Data
Preparation

“I have seen people insisting on using Hadoop for
datasets that could easily fit on a flash drive and could
easily be processed on a laptop.”
- Yann LeCun
-
ML on Flink

FlinkML
What: Out-of-the-box workhorse algorithms (ALS,
SVM, LinReg, LogReg …)
Status: early phase, slow development

FlinkML
Pro: available out of the box, written with Flink API
Cons: reinvents the wheel, only a few algorithms,
no model persistence

Samsara
What: Linear algebra framework
Status: mature

Samsara
Pro: generic algorithms with platform-specific
bindings, skilled community
Cons: covers only a few use cases

SAMOA
What: Online learning algorithm framework (VHT,
AMR, …)
Status: early phase, complicated relationship with
the industry

SAMOA
Pro: many powerful generic online learning
algorithms, backed by academics (MOA, Weka)
Cons: not production ready, academic focus

ML on Flink: the future of the ecosystem

Apache Beam
Programming model for data processing pipelines
● Streaming first, batch as a bounded stream
● Layered API: What, Where, When, How
● Platform agnostic: same program, different
runners

Apache Beam - Runners
● Flink
● Spark (Partial)
● Google Cloud Dataflow
● Plain Java
● Gearpump (WIP)
● Apex (WIP)

BeamML: a runner-agnostic ML library

FlinkML Roadmap
● More algorithms!
● Evaluation framework
● Persistence/export
● Online Learning Framework

Proteus
Online Learning Platform - based on Flink
Source: Proteus’ website

The role of Radicalbit

Contributions
● Cassandra Connector
● Scala API extensions
● FlinkML (Linear Algebra Framework, MinHash)
● Akka Connector

Our vision
Flink can become the ideal choice to build real-time decision-
heavy applications with high data-throughput
To achieve this:
● Ambitious applications (aim for real-time services)
● Reliable distributed online learning (Proteus?)
● A Pipelining Framework (experiment fast, increase testability and
modularity)

Q&A

THANKS!
Simone Robutti
Mail: simone.robutti@radicalbit.io Medium: @simone.robutti
Twitter: @SimoneRobutti — @weareradicalbit

Weitere ähnliche Inhalte

Was ist angesagt?

Deploying your Predictive Models as a Service via Domino

Deploying your Predictive Models as a Service via Domino

Deploying your Predictive Models as a Service via Domino

Geo Python16 keynote

Geo Python16 keynote

Geo Python16 keynote

Project "Deep Water"

Project "Deep Water"

Project "Deep Water"

Some "challenges" on the open-source/open-data front

Some "challenges" on the open-source/open-data front

Some "challenges" on the open-source/open-data front

Data Programming: Creating Large Datasets, Quickly -- Presented at JPL MLRG

Data Programming: Creating Large Datasets, Quickly -- Presented at JPL MLRG

Data Programming: Creating Large Datasets, Quickly -- Presented at JPL MLRG

Deep Water - Bringing Tensorflow, Caffe, Mxnet to H2O

Deep Water - Bringing Tensorflow, Caffe, Mxnet to H2O

Deep Water - Bringing Tensorflow, Caffe, Mxnet to H2O

Deep Learning with MXNet - Dmitry Larko

Deep Learning with MXNet - Dmitry Larko

Deep Learning with MXNet - Dmitry Larko

Presented at #H2OWorld 2017 in Mountain View, CA. Enjoy the video: https://youtu.be/-rGRHrED94Y. Learn more about H2O.ai: https://www.h2o.ai/. Follow @h2oai: https://twitter.com/h2oai. - - - Abstract: Most machine learning systems enable two essential processes: creating a model and applying the model in a repeatable and controlled fashion. These two processes are interrelated and pose technological and organizational challenges as they evolve from research to prototype to production. This presentation outlines common design patterns for tackling such challenges while implementing machine learning in a production environment. Sergei's Bio: Dr. Sergei Izrailev is Chief Data Scientist at BeeswaxIO, where he is responsible for data strategy and building AI applications powering the next generation of real-time bidding technology. Before Beeswax, Sergei led data science teams at Integral Ad Science and Collective, where he focused on architecture, development and scaling of data science based advertising technology products. Prior to advertising, Sergei was a quant/trader and developed trading strategies and portfolio optimization methodologies. Previously, he worked as a senior scientist at Johnson & Johnson, where he developed intelligent tools for structure-based drug discovery. Sergei holds a Ph.D. in Physics and Master of Computer Science degrees from the University of Illinois at Urbana-Champaign.

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...

We've all been there before... you hear the announcement that your flight is canceled. Fellow passengers race to the gate agent to rebook on the next available flight. How do they quickly determine the best route from Berlin to San Francisco? Ultimately the flight route network is best solved as a graph problem. We will discuss our lessons learned from working with a major airline to solve this problem using JanusGraph database. JanusGraph is an open source graph database designed for massive scale. It is compatible with several pieces of the open source big data stack: Apache TinkerPop (graph computing framework), HBase, Cassandra, and Solr. We will go into depth about our approach to benchmarking graph performance and discuss the utilities we developed. We will share our comparison results for evaluating which storage backend use with JanusGraph. Whether you are productizing a new database or you are a frustrated traveler, a fast resolution is needed to satisfy everybody involved. Presented at DataWorks Summit Berlin on April 18, 2018

Airline Reservations and Routing: A Graph Use Case

Airline Reservations and Routing: A Graph Use Case

Airline Reservations and Routing: A Graph Use Case

Graph Computing with JanusGraph

Graph Computing with JanusGraph

Graph Computing with JanusGraph

Ensemble machine learning methods are often used when the true prediction function is not easily approximated by a single algorithm. Practitioners may prefer ensemble algorithms when model performance is valued above other factors such as model complexity and training time. The Super Learner algorithm, also called "stacking", learns the optimal combination of the base learner fits. The latest version of H2O now contains a "Stacked Ensemble" method, which allows the user to stack H2O models into a Super Learner. The Stacked Ensemble method is the the native H2O version of stacking, previously only available in the h2oEnsemble R package, and now enables stacking from all the H2O APIs: Python, R, Scala, etc. Erin is a Statistician and Machine Learning Scientist at H2O.ai. Before joining H2O, she was the Principal Data Scientist at Wise.io (acquired by GE Digital) and Marvin Mobile Security (acquired by Veracode) and the founder of DataScientific, Inc. Erin received her Ph.D. from University of California, Berkeley. Her research focuses on ensemble machine learning, learning from imbalanced binary-outcome data, influence curve based variance estimation and statistical computing.

Stacked Ensembles in H2O

Stacked Ensembles in H2O

Stacked Ensembles in H2O

Towards the Cytoscape Cyberinfrastructure

Towards the Cytoscape Cyberinfrastructure

Towards the Cytoscape Cyberinfrastructure

cyREST: Cytoscape as a Service

cyREST: Cytoscape as a Service

cyREST: Cytoscape as a Service

Graph databases are relative newcomers in the NoSQL database landscape. What are some graph model and design considerations when choosing a graph database in your architecture? Let's take a tour of a couple graph use cases that we've collaborated on recently with our clients to help you better understand how and why a graph database can be integrated to help solve problems found with connected data. Presented at DataWorks Summit San Jose - IBM Meetup on June 18, 2018. https://www.meetup.com/BigDataDevelopers/events/251307524/

Exploring Graph Use Cases with JanusGraph

Exploring Graph Use Cases with JanusGraph

Exploring Graph Use Cases with JanusGraph

Distributed deep learning

Distributed deep learning

Distributed deep learning

Alireza Shafaei

Cloud Computing - examples

Cloud Computing - examples

Cloud Computing - examples

EUBrasilCloudFORUM .

Cytoscape: Now and Future

Cytoscape: Now and Future

Cytoscape: Now and Future

Metaflow was started at Netflix to answer a pressing business need: How to enable an organization of data scientists, who are not software engineers by training, build and deploy end-to-end machine learning workflows and applications independently. We wanted to provide the best possible user experience for data scientists, allowing them to focus on parts they like (modeling using their favorite off-the-shelf libraries) while providing robust built-in solutions for the foundational infrastructure: data, compute, orchestration, and versioning. Today, the open-source Metaflow powers hundreds of business-critical ML projects at Netflix and other companies from bioinformatics to real estate. In this talk, you will learn about: - What to expect from a modern ML infrastructure stack. - Using Metaflow to boost the productivity of your data science organization, based on lessons learned from Netflix. - Deployment strategies for a full stack of ML infrastructure that plays nicely with your existing systems and policies. https://www.aicamp.ai/event/eventdetails/W2021080510

Metaflow: The ML Infrastructure at Netflix

Metaflow: The ML Infrastructure at Netflix

Metaflow: The ML Infrastructure at Netflix

Data science isn't an easy task to pull of. You start with exploring data and experimenting with models. Finally, you find some amazing insight! What now? How do you transform a little experiment to a production ready workflow? Better yet, how do you scale it from a small sample in R/Python to TBs of production data? Building a BIG ML Workflow - from zero to hero, is about the work process you need to take in order to have a production ready workflow up and running. Covering : * Small - Medium experimentation (R) * Big data implementation (Spark Mllib /+ pipeline) * Setting Metrics and checks in place * Ad hoc querying and exploring your results (Zeppelin) * Pain points & Lessons learned the hard way (is there any other way?)

Production-Ready BIG ML Workflows - from zero to hero

Production-Ready BIG ML Workflows - from zero to hero

Production-Ready BIG ML Workflows - from zero to hero

ETL & Machine Learning

ETL & Machine Learning

ETL & Machine Learning

Was ist angesagt? (20)

Deploying your Predictive Models as a Service via Domino

Deploying your Predictive Models as a Service via Domino

Deploying your Predictive Models as a Service via Domino

Geo Python16 keynote

Geo Python16 keynote

Geo Python16 keynote

Project "Deep Water"

Project "Deep Water"

Project "Deep Water"

Some "challenges" on the open-source/open-data front

Some "challenges" on the open-source/open-data front

Some "challenges" on the open-source/open-data front

Data Programming: Creating Large Datasets, Quickly -- Presented at JPL MLRG

Data Programming: Creating Large Datasets, Quickly -- Presented at JPL MLRG

Data Programming: Creating Large Datasets, Quickly -- Presented at JPL MLRG

Deep Water - Bringing Tensorflow, Caffe, Mxnet to H2O

Deep Water - Bringing Tensorflow, Caffe, Mxnet to H2O

Deep Water - Bringing Tensorflow, Caffe, Mxnet to H2O

Deep Learning with MXNet - Dmitry Larko

Deep Learning with MXNet - Dmitry Larko

Deep Learning with MXNet - Dmitry Larko

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...

Airline Reservations and Routing: A Graph Use Case

Airline Reservations and Routing: A Graph Use Case

Airline Reservations and Routing: A Graph Use Case

Graph Computing with JanusGraph

Graph Computing with JanusGraph

Graph Computing with JanusGraph

Stacked Ensembles in H2O

Stacked Ensembles in H2O

Stacked Ensembles in H2O

Towards the Cytoscape Cyberinfrastructure

Towards the Cytoscape Cyberinfrastructure

Towards the Cytoscape Cyberinfrastructure

cyREST: Cytoscape as a Service

cyREST: Cytoscape as a Service

cyREST: Cytoscape as a Service

Exploring Graph Use Cases with JanusGraph

Exploring Graph Use Cases with JanusGraph

Exploring Graph Use Cases with JanusGraph

Distributed deep learning

Distributed deep learning

Distributed deep learning

Cloud Computing - examples

Cloud Computing - examples

Cloud Computing - examples

Cytoscape: Now and Future

Cytoscape: Now and Future

Cytoscape: Now and Future

Metaflow: The ML Infrastructure at Netflix

Metaflow: The ML Infrastructure at Netflix

Metaflow: The ML Infrastructure at Netflix

Production-Ready BIG ML Workflows - from zero to hero

Production-Ready BIG ML Workflows - from zero to hero

Production-Ready BIG ML Workflows - from zero to hero

ETL & Machine Learning

ETL & Machine Learning

ETL & Machine Learning

Andere mochten auch

Inaugural talk Data Science Milan - Gianmario Spacagna

Inaugural talk Data Science Milan - Gianmario Spacagna

Inaugural talk Data Science Milan - Gianmario Spacagna

Data Science Milan

In the depths of the last cold, wet British winter, the Advanced Data Analytics team from Barclays escaped to a villa on Lanzarote, Canary Islands, for a one week hackathon where they collaboratively developed a recommendation system on top of Apache Spark. The contest consisted on using Bristol customer shopping behaviour data to make personalised recommendations in a sort of Kaggle-like competition where each team's goal was to build an MVP and then repeatedly iterate on it using common interfaces defined by a specifically built framework. The talk will cover: • How to rapidly prototype in Spark (via the native Scala API) on your laptop and magically scale to a production cluster without huge re-engineering effort. • The benefits of doing type-safe ETLs representing data in hybrid, and possibly nested, structures like case classes. • Enhanced collaboration and fair performance comparison by sharing ad-hoc APIs plugged into a common evaluation framework. • The co-existence of machine learning models available in MLlib and domain-specific bespoke algorithms implemented from scratch. • A showcase of different families of recommender models (business-to-business similarity, customer-to-customer similarity, matrix factorisation, random forest and ensembling techniques). • How Scala (and functional programming) helped our cause. Gianmario is a Senior Data Scientist at Pirelli Tyre, processing telemetry data for smart manufacturing and connected vehicles applications. His main expertise is on building production-oriented machine learning systems. Co-author of the Professional Manifesto for Data Science, he loves evangelising his passion for best practices and effective methodologies amongst the community. Prior to Pirelli, he worked in Financial Services (Barclays), Cyber Security (Cisco) and Predictive Marketing (AgilOne).

The Barclays Data Science Hackathon: Building Retail Recommender Systems base...

The Barclays Data Science Hackathon: Building Retail Recommender Systems base...

The Barclays Data Science Hackathon: Building Retail Recommender Systems base...

Data Science Milan

Risking Everything with Akka Streams

Risking Everything with Akka Streams

Risking Everything with Akka Streams

Apache Gearpump - Lightweight Real-time Streaming Engine

Apache Gearpump - Lightweight Real-time Streaming Engine

Apache Gearpump - Lightweight Real-time Streaming Engine

Osobistist kerivnika dnz

Osobistist kerivnika dnz

Osobistist kerivnika dnz

Barrett Wissman – Breaking Barriers

Barrett Wissman – Breaking Barriers

Barrett Wissman – Breaking Barriers

Barrett Wissman

Developing JavaScript Widgets

Developing JavaScript Widgets

Developing JavaScript Widgets

Expara Business Canvas Workshop

Expara Business Canvas Workshop

Expara Business Canvas Workshop

ICBO 2014, October 8, 2014

ICBO 2014, October 8, 2014

ICBO 2014, October 8, 2014

History of the guitar

History of the guitar

History of the guitar

Czy designerzy powinni uczyć się kodować - Dribbble Warsaw #3

Czy designerzy powinni uczyć się kodować - Dribbble Warsaw #3

Czy designerzy powinni uczyć się kodować - Dribbble Warsaw #3

HDL-32E High Definition LiDAR™ Sensor

HDL-32E High Definition LiDAR™ Sensor

HDL-32E High Definition LiDAR™ Sensor

Din itex 10_09_2012

Din itex 10_09_2012

Din itex 10_09_2012

Denis Bychkovsky

Historia del museo de telégrafos.

Historia del museo de telégrafos.

Historia del museo de telégrafos.

victoriacrespog

Preparing for the Zombie Apocalypse

Preparing for the Zombie Apocalypse

Preparing for the Zombie Apocalypse

Andere mochten auch (15)

Inaugural talk Data Science Milan - Gianmario Spacagna

Inaugural talk Data Science Milan - Gianmario Spacagna

Inaugural talk Data Science Milan - Gianmario Spacagna

The Barclays Data Science Hackathon: Building Retail Recommender Systems base...

The Barclays Data Science Hackathon: Building Retail Recommender Systems base...

The Barclays Data Science Hackathon: Building Retail Recommender Systems base...

Risking Everything with Akka Streams

Risking Everything with Akka Streams

Risking Everything with Akka Streams

Apache Gearpump - Lightweight Real-time Streaming Engine

Apache Gearpump - Lightweight Real-time Streaming Engine

Apache Gearpump - Lightweight Real-time Streaming Engine

Osobistist kerivnika dnz

Osobistist kerivnika dnz

Osobistist kerivnika dnz

Barrett Wissman – Breaking Barriers

Barrett Wissman – Breaking Barriers

Barrett Wissman – Breaking Barriers

Developing JavaScript Widgets

Developing JavaScript Widgets

Developing JavaScript Widgets

Expara Business Canvas Workshop

Expara Business Canvas Workshop

Expara Business Canvas Workshop

ICBO 2014, October 8, 2014

ICBO 2014, October 8, 2014

ICBO 2014, October 8, 2014

History of the guitar

History of the guitar

History of the guitar

Czy designerzy powinni uczyć się kodować - Dribbble Warsaw #3

Czy designerzy powinni uczyć się kodować - Dribbble Warsaw #3

Czy designerzy powinni uczyć się kodować - Dribbble Warsaw #3

HDL-32E High Definition LiDAR™ Sensor

HDL-32E High Definition LiDAR™ Sensor

HDL-32E High Definition LiDAR™ Sensor

Din itex 10_09_2012

Din itex 10_09_2012

Din itex 10_09_2012

Historia del museo de telégrafos.

Historia del museo de telégrafos.

Historia del museo de telégrafos.

Preparing for the Zombie Apocalypse

Preparing for the Zombie Apocalypse

Preparing for the Zombie Apocalypse

Ähnlich wie Data intensive applications with Apache Flink - Simone Robutti, Radicalbit

Apache Fink 1.0: A New Era for Real-World Streaming Analytics

Apache Fink 1.0: A New Era for Real-World Streaming Analytics

Apache Fink 1.0: A New Era for Real-World Streaming Analytics

Apache Flink - Overview and Use cases of a Distributed Dataflow System (at pr...

Apache Flink - Overview and Use cases of a Distributed Dataflow System (at pr...

Apache Flink - Overview and Use cases of a Distributed Dataflow System (at pr...

Apache Spark vs Apache Flink

Apache Spark vs Apache Flink

Apache Spark vs Apache Flink

Overview of Apache Flink: the 4G of Big Data Analytics Frameworks

Overview of Apache Flink: the 4G of Big Data Analytics Frameworks

Overview of Apache Flink: the 4G of Big Data Analytics Frameworks

DataWorks Summit/Hadoop Summit

Slides of my talk at the Hadoop Summit Europe in Dublin, Ireland on April 13th, 2016. The talk introduces Apache Flink as both a multi-purpose Big Data analytics framework and real-world streaming analytics framework. It is focusing on Flink's key differentiators and suitability for streaming analytics use cases. It also shows how Flink enables novel use cases such as distributed CEP (Complex Event Processing) and querying the state by behaving like a key value data store.

Overview of Apache Fink: the 4 G of Big Data Analytics Frameworks

Overview of Apache Fink: the 4 G of Big Data Analytics Frameworks

Overview of Apache Fink: the 4 G of Big Data Analytics Frameworks

Slides of my talk at the Hadoop Summit Europe in Dublin, Ireland on April 13th, 2016. The talk introduces Apache Flink as both a multi-purpose Big Data analytics framework and real-world streaming analytics framework. It is focusing on Flink's key differentiators and suitability for streaming analytics use cases. It also shows how Flink enables novel use cases such as distributed CEP (Complex Event Processing) and querying the state by behaving like a key value data store.

Overview of Apache Fink: The 4G of Big Data Analytics Frameworks

Overview of Apache Fink: The 4G of Big Data Analytics Frameworks

Overview of Apache Fink: The 4G of Big Data Analytics Frameworks

Portable Streaming Pipelines with Apache Beam

Portable Streaming Pipelines with Apache Beam

Portable Streaming Pipelines with Apache Beam

The world of big data involves an ever-changing field of players. Much as SQL stands as a lingua franca for declarative data analysis, Apache Beam aims to provide a portable standard for expressing robust, out-of-order data processing pipelines in a variety of languages across a variety of platforms. In a way, Apache Beam is a glue that can connect the big data ecosystem together; it enables users to "run any data processing pipeline anywhere." This talk will briefly cover the capabilities of the Beam model for data processing and discuss its architecture, including the portability model. We’ll focus on the present state of the community and the current status of the Beam ecosystem. We’ll cover the state of the art in data processing and discuss where Beam is going next, including completion of the portability framework and the Streaming SQL. Finally, we’ll discuss areas of improvement and how anybody can join us on the path of creating the glue that interconnects the big data ecosystem. Speaker Davor Bonaci, Apache Software Foundation; Simbly, V.P. of Apache Beam; Founder/CEO at Operiant

Present and future of unified, portable, and efficient data processing with A...

Present and future of unified, portable, and efficient data processing with A...

Present and future of unified, portable, and efficient data processing with A...

DataWorks Summit

The world of big data involves an ever changing field of players. Much as SQL stands as a lingua franca for declarative data analysis, Apache Beam (incubating) aims to provide a portable standard for expressing robust, out-of-order data processing pipelines in a variety of languages across a variety of platforms. In this talk, I will: Cover briefly the capabilities of the Beam model for data processing and integration with IOs, as well as the current state of the Beam ecosystem. Discuss the benefits Beam provides regarding portability and ease-of-use. Demo the same Beam pipeline running on multiple runners in multiple deployment scenarios (e.g. Apache Flink on Google Cloud, Apache Spark on AWS, Apache Apex on-premise). Give a glimpse at some of the challenges Beam aims to address in the future.

Realizing the promise of portability with Apache Beam

Realizing the promise of portability with Apache Beam

Realizing the promise of portability with Apache Beam

Apache Beam is a top-level Apache project which aims at providing a unified API for efficient and portable data processing pipeline. Beam handles both batch and streaming use cases and neatly separates properties of the data from runtime characteristics, allowing pipelines to be portable across multiple runtimes, both open-source (e.g., Apache Flink, Apache Spark, Apache Apex, ...) and proprietary (e.g., Google Cloud Dataflow). This talk will cover the basics of Apache Beam, describe the main concepts of the programming model and talk about the current state of the project (new python support, first stable version). We'll illustrate the concepts with a use case running on several runners.

Portable batch and streaming pipelines with Apache Beam (Big Data Application...

Portable batch and streaming pipelines with Apache Beam (Big Data Application...

Portable batch and streaming pipelines with Apache Beam (Big Data Application...

Flink in action

Flink in action

Flink in action

Artem Semenenko

Data Summer Conf 2018, “Building unified Batch and Stream processing pipeline...

Data Summer Conf 2018, “Building unified Batch and Stream processing pipeline...

Data Summer Conf 2018, “Building unified Batch and Stream processing pipeline...

Near real-time anomaly detection at Lyft

Near real-time anomaly detection at Lyft

Near real-time anomaly detection at Lyft

Introduction to Apache Flink

Introduction to Apache Flink

Introduction to Apache Flink

This talk was given at Capital One on September 15, 2015 at the launch of the Washington DC Area Apache Flink Meetup. Apache flink is positioned at the forefront of 2 major trends in Big Data Analytics: - Unification of Batch and Stream processing - Multi-purpose Big Data Analytics frameworks In these slides, we will also find answers to the burning question: Why Apache Flink? You will also learn more about how Apache Flink compares to Hadoop MapReduce, Apache Spark and Apache Storm.

Unified Batch and Real-Time Stream Processing Using Apache Flink

Unified Batch and Real-Time Stream Processing Using Apache Flink

Unified Batch and Real-Time Stream Processing Using Apache Flink

Python Streaming Pipelines on Flink - Beam Meetup at Lyft 2019

Python Streaming Pipelines on Flink - Beam Meetup at Lyft 2019

Python Streaming Pipelines on Flink - Beam Meetup at Lyft 2019

LAMP is so yesterday, MEAN is so tomorrow! :)

LAMP is so yesterday, MEAN is so tomorrow! :)

LAMP is so yesterday, MEAN is so tomorrow! :)

Apache Arrow at DataEngConf Barcelona 2018

Apache Arrow at DataEngConf Barcelona 2018

Apache Arrow at DataEngConf Barcelona 2018

Technology Stack Discussion

Technology Stack Discussion

Technology Stack Discussion

Flink history, roadmap and vision

Flink history, roadmap and vision

Flink history, roadmap and vision

Ähnlich wie Data intensive applications with Apache Flink - Simone Robutti, Radicalbit (20)

Apache Fink 1.0: A New Era for Real-World Streaming Analytics

Apache Fink 1.0: A New Era for Real-World Streaming Analytics

Apache Fink 1.0: A New Era for Real-World Streaming Analytics

Apache Flink - Overview and Use cases of a Distributed Dataflow System (at pr...

Apache Flink - Overview and Use cases of a Distributed Dataflow System (at pr...

Apache Flink - Overview and Use cases of a Distributed Dataflow System (at pr...

Apache Spark vs Apache Flink

Apache Spark vs Apache Flink

Apache Spark vs Apache Flink

Overview of Apache Flink: the 4G of Big Data Analytics Frameworks

Overview of Apache Flink: the 4G of Big Data Analytics Frameworks

Overview of Apache Flink: the 4G of Big Data Analytics Frameworks

Overview of Apache Fink: the 4 G of Big Data Analytics Frameworks

Overview of Apache Fink: the 4 G of Big Data Analytics Frameworks

Overview of Apache Fink: the 4 G of Big Data Analytics Frameworks

Overview of Apache Fink: The 4G of Big Data Analytics Frameworks

Overview of Apache Fink: The 4G of Big Data Analytics Frameworks

Overview of Apache Fink: The 4G of Big Data Analytics Frameworks

Portable Streaming Pipelines with Apache Beam

Portable Streaming Pipelines with Apache Beam

Portable Streaming Pipelines with Apache Beam

Present and future of unified, portable, and efficient data processing with A...

Present and future of unified, portable, and efficient data processing with A...

Present and future of unified, portable, and efficient data processing with A...

Realizing the promise of portability with Apache Beam

Realizing the promise of portability with Apache Beam

Realizing the promise of portability with Apache Beam

Portable batch and streaming pipelines with Apache Beam (Big Data Application...

Portable batch and streaming pipelines with Apache Beam (Big Data Application...

Portable batch and streaming pipelines with Apache Beam (Big Data Application...

Flink in action

Flink in action

Flink in action

Data Summer Conf 2018, “Building unified Batch and Stream processing pipeline...

Data Summer Conf 2018, “Building unified Batch and Stream processing pipeline...

Data Summer Conf 2018, “Building unified Batch and Stream processing pipeline...

Near real-time anomaly detection at Lyft

Near real-time anomaly detection at Lyft

Near real-time anomaly detection at Lyft

Introduction to Apache Flink

Introduction to Apache Flink

Introduction to Apache Flink

Unified Batch and Real-Time Stream Processing Using Apache Flink

Unified Batch and Real-Time Stream Processing Using Apache Flink

Unified Batch and Real-Time Stream Processing Using Apache Flink

Python Streaming Pipelines on Flink - Beam Meetup at Lyft 2019

Python Streaming Pipelines on Flink - Beam Meetup at Lyft 2019

Python Streaming Pipelines on Flink - Beam Meetup at Lyft 2019

LAMP is so yesterday, MEAN is so tomorrow! :)

LAMP is so yesterday, MEAN is so tomorrow! :)

LAMP is so yesterday, MEAN is so tomorrow! :)

Apache Arrow at DataEngConf Barcelona 2018

Apache Arrow at DataEngConf Barcelona 2018

Apache Arrow at DataEngConf Barcelona 2018

Technology Stack Discussion

Technology Stack Discussion

Technology Stack Discussion

Flink history, roadmap and vision

Flink history, roadmap and vision

Flink history, roadmap and vision

Mehr von Data Science Milan

ML & Graph algorithms to prevent financial crime in digital payments

ML & Graph algorithms to prevent financial crime in digital payments

ML & Graph algorithms to prevent financial crime in digital payments

Data Science Milan

In this talk Mauro Pelucchi will present the Economic Complexity Index (ECI) and the Product Complexity Index (PCI), two network measures that provide unique insights into economic development patterns.We will show how to compute these metrics and explore the network theory behind these indices (Hidalgo and Hausmann, 2009). The measures are also related to various dimensionality reduction methods and can be used to determine distances between nodes based on their nodes based on their similarity.Finally, we will discover how to interpret these metrics to compare countries, markets, products, and guide our plans in a data-driven context.

How to use the Economic Complexity Index to guide innovation plans

How to use the Economic Complexity Index to guide innovation plans

How to use the Economic Complexity Index to guide innovation plans

Data Science Milan

Robustness Metrics for ML Models based on Deep Learning Methods

Robustness Metrics for ML Models based on Deep Learning Methods

Robustness Metrics for ML Models based on Deep Learning Methods

Data Science Milan

It is indeed a wonderful time to build machine learning systems, as the growing ecosystems of tools and shared best practices make even small teams incredibly productive at scale. In this talk, we present our philosophy for modern, no-nonsense data pipelines, highlighting the advantages of a (almost) pure serverless and open-source approach, and showing how the entire toolchain works - from raw data to model serving - on a real-world dataset. Finally, we argue that the crucial component for analyzing data pipelines is not the model per se, but the surrounding DAG, and present our proposal for producing automated "DAG cards" from Metaflow classes. Bio: Jacopo Tagliabue was co-founder and CTO of Tooso, an A.I. company in San Francisco acquired by Coveo in 2019. Jacopo is currently the Lead A.I. Scientist at Coveo. When not busy building A.I. products, he is exploring research topics at the intersection of language, reasoning and learning, with several publications at major conferences (e.g. WWW, SIGIR, RecSys, NAACL). In previous lives, he managed to get a Ph.D., do scienc-y things for a pro basketball team, and simulate a pre-Columbian civilization. Topics: MLOps, Metaflow, model cards.

"You don't need a bigger boat": serverless MLOps for reasonable companies

"You don't need a bigger boat": serverless MLOps for reasonable companies

"You don't need a bigger boat": serverless MLOps for reasonable companies

Data Science Milan

Manual question generation (worksheets and quizzes) in edtech is not scalable for online transformation and leads to increased workload on teachers due to the pandemic. In this session, we will explore natural language processing (NLP) techniques to generate Multiple Choice Questions automatically from any text content using the T5 transformer model. We will also explore methods to deploy the T5 question generation model for fast CPU inference using ONNX conversion and quantization. Bio: Ramsri is a Lead Data Scientist with 8+ years of work experience across Silicon Valley, Singapore, and India. Most recently he had been a co-founder and CTO of a funded AI-assisted assessments startup. He has spent the last 2 years developing question generation models in edtech and also released an open-source library on the same.

Question generation using Natural Language Processing by QuestGen.AI

Question generation using Natural Language Processing by QuestGen.AI

Question generation using Natural Language Processing by QuestGen.AI

Data Science Milan

Abstract: Data preparation and modelling are the activities that take most of the time in a typical data scientist workday. In this session we’ll see how AWS services for Analytics and data management can be effectively used and integrated in AI/ML pipelines. We’ll focus on AWS Glue, AWS Glue DataBrew and AWS Data Wrangler with a bit of theory and hands-on demos. Bio: Francesco Marelli is a senior solutions architect at Amazon Web Services. He has lived and worked in UK, italy, Switzerland and other countries in EMEA. He is specialized in the design and implementation of Analytics, Data Management and Big Data systems. Francesco also has a strong experience in systems integration and design and implementation of applications. Topics: machine learning pipelines, AWS, cloud.

Speed up data preparation for ML pipelines on AWS

Speed up data preparation for ML pipelines on AWS

Speed up data preparation for ML pipelines on AWS

Data Science Milan

Serverless machine learning architectures at Helixa

Serverless machine learning architectures at Helixa

Serverless machine learning architectures at Helixa

Data Science Milan

A Feature Store enables machine learning (ML) features to be registered, discovered, and used as part of ML pipelines, thus making it easier to transform and validate the training data that is fed into machine learning systems. Feature stores can also enable consistent engineering of features between training and inference, but to do so, they need a common data processing platform. The first Feature Stores, developed at hyperscale AI companies such as Uber, Airbnb, and Facebook, enabled feature engineering using domain specific languages, providing abstractions tailored to the companies’ feature engineering domains. However, a general purpose Feature Store needs a general purpose feature engineering, feature selection, and feature transformation platform. In this talk, we describe how we built a general purpose, open-source Feature Store for ML around dataframes and Apache Spark. We will demonstrate how data engineers can transform and engineers features from backend databases and data lakes, while data scientists can use PySpark to select and transform features into train/test data in a file format of choice (.tfrecords, .npy, .petastorm, etc) on a file system of choice (S3, HDFS). Finally, we will show how the Feature Store enables end-to-end ML pipelines to be factored into feature engineering and data science stages that each can run at different cadences. Bio: Fabio Buso is the head of engineering at Logical Clocks AB, where he leads the Feature Store development. Fabio holds a master's degree in cloud computing and services with a focus on data intensive applications, awarded by a joint program between KTH Stockholm and TU Berlin. Topics: feature store, MLOps.

MLOps with a Feature Store: Filling the Gap in ML Infrastructure

MLOps with a Feature Store: Filling the Gap in ML Infrastructure

MLOps with a Feature Store: Filling the Gap in ML Infrastructure

Data Science Milan

Reinforcement Learning is a growing subset of Machine Learning and one of the most important frontiers of Artificial Intelligence. Its goal is to capture higher logic and use more adaptable algorithms than classical Machine Learning. Formally it denotes a set of algorithms that deal with sequential decision-making and have the potential capability to make highly intelligent decisions depending on their local environment. Reinforcement Learning problems can be described as an agent that has to make decisions in its environment in order to optimize a cumulative reward, and it is clear that this formalization applies to a great variety of tasks in many different fields. In this talk, the main features of the most important Reinforcement Learning algorithms will be illustrated and deepened, with some concrete and explanatory examples. Bio: Marco Del Pra Marco was born in Venice 41 years ago, has two master's degrees (Computer Science and Mathematics), and has two important publications in applied mathematics. He has been working in Artificial Intelligence for 10 years, mainly as a freelancer. Among others, he worked for the European Commission's Joint Research Center, for Cuebiq, and as Data Science Lead for Microsoft's Artificial Intelligence projects in Italy.

Reinforcement Learning Overview | Marco Del Pra

Reinforcement Learning Overview | Marco Del Pra

Reinforcement Learning Overview | Marco Del Pra

Data Science Milan

Today there are a lot of data that are stored in the form of time series, and with the actual large diffusion of real-time applications many areas are strongly increasing their interest in applications based on this kind of data, like for example finance, advertising, marketing, health care, automated disease detection, biometrics, retail, and identification of anomalies of any kind. It is therefore very interesting to understand the role and potential of machine learning in this sector. Many methods can be used for the classification of the time series, but all of them, apart from deep learning, require some kind of feature engineering as a separate stage before the classification is performed, and this can imply the loss of some important information and the increase of the development and test time. On the contrary, deep learning models such as recurrent and convolutional neural networks already incorporate this kind of feature engineering internally, optimizing it and eliminating the need to do it manually. Therefore they are able to extract information from the time series in a faster, more direct, and more complete way. Bio: Marco Del Pra I am 41 years old, I was born in Venice, I have 2 master's degrees (Computer Science and Mathematics). I have been working for about 10 years in Artificial Intelligence, first as Data Scientist, then as Team Leader and finally as Head of Data. Among others, I worked for Microsoft, for the European Commission (JRC of Ispra) and for Cuebiq. I am currently working as a freelancer and I am creating with 2 other cofounders an innovative AI startup. I have 2 important publications in applied mathematics. Topics: recurrent and convolutional neural networks, deep learning, time-series.

Time Series Classification with Deep Learning | Marco Del Pra

Time Series Classification with Deep Learning | Marco Del Pra

Time Series Classification with Deep Learning | Marco Del Pra

Data Science Milan

The talk will introduce Ludwig, a deep learning toolbox that allows to train models and to use them for prediction without the need to write code. It is unique in its ability to help make deep learning easier to understand for non-experts and enable faster model improvement iteration cycles for experienced machine learning developers and researchers alike. By using Ludwig, experts and researchers can simplify the prototyping process and streamline data processing so that they can focus on developing deep learning architectures. Bio: Piero Molino is a Senior Research Scientist at Uber AI with focus on machine learning for language and dialogue. Piero completed a PhD on Question Answering at the University of Bari, Italy. Founded QuestionCube, a startup that built a framework for semantic search and QA. Worked for Yahoo Labs in Barcelona on learning to rank, IBM Watson in New York on natural language processing with deep learning and then joined Geometric Intelligence, where he worked on grounded language understanding. After Uber acquired Geometric Intelligence, he became one of the founding members of Uber AI Labs.

Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AI

Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AI

Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AI

Data Science Milan

Traditional market research is generally conducted by questionnaires or other forms of explicit feedback, directly asked to an ad hoc panel of individuals that in aggregate are representative of a larger group of people. Unfortunately, those traditional approaches are often invasive, nonscalable, and biased. Indirect approaches based on sparse and implicit consumer feedback (e.g., social network interactions, web browsing, or online purchases) are more scalable, authentic, and more suitable for real-time consumer insights. Although those sources of implicit consumer feedback provide relevant and detailed pictures of the population, they individually provide only a limited set of observable behaviors. The Holy Grail of market research is the ability to merge different sources of consumers interests into an augmented view that connects all the dots across multiple domains. Unfortunately, user-centric "fusion" algorithms present many limitations in the case of heterogeneous datasets strongly differing in terms of size and density and when the number of sources to merge increases. We propose a novel approach of Audience Projection able to define a target audience as a subset of the population in a source domain and to project this target to a set of users into a destination dataset. We will show how libraries such as spaCy can provide Deep Learning implementations for Named Entity Recognition (NER) to match related brands and we will use Bayesian Inference to transfer knowledge from the source domain. This way, we can estimate the probability of the user to belong to the target using the source distribution of volume of interests of common entities as model evidence and the source target size as prior probability. Bio: Gianmario Spacagna is the chief scientist and head of AI at Helixa. His team’s mission is building the next generation of behavior algorithms and models of human decision making with careful attention to their potential and effects on society. His experience covers a diverse portfolio of machine learning algorithms and data products across different industries. Previously, he worked as a data scientist in IoT automotive (Pirelli Cyber Technology), retail and business banking (Barclays Analytics Centre of Excellence), threat intelligence (Cisco Talos), predictive marketing (AgilOne), plus some occasional freelancing. He’s a co-author of the book Python Deep Learning, contributor to the “Professional Manifesto for Data Science,” and founder of the Data Science Milan community. Gianmario holds a master’s degree in telematics (Polytechnic of Turin) and software engineering of distributed systems (KTH of Stockholm). After having spent half of his career abroad, he now lives in Milan. His favorite hobbies include home cooking, hiking, and exploring the surrounding nature on his motorcycle.

Audience projection of target consumers over multiple domains a ner and baye...

Audience projection of target consumers over multiple domains a ner and baye...

Audience projection of target consumers over multiple domains a ner and baye...

Data Science Milan

Weakly Supervised Learning: Introduction and Best Practices In the talk we will introduce the definition of three main types of weakly supervised learning: incomplete, inexact and inaccurate; we examine how the models can be trained in case of weak supervision and view the real application of weakly supervised learning, how it can improve results and decrease the costs. Bio: Kristina Khvatova works as a Software Engineer at Softec S.p.A. Currently she is involved in the development of a project for data analysis and visualisation; it includes quantitative and qualitative analysis based on classification, optimisation, time series prediction, anomaly detection techniques. She obtained a master degree in Mathematics at the Saint-Petersburg State University and a master degree in Computer Science at the University of Milano-Bicocca.

Weak supervised learning - Kristina Khvatova

Weak supervised learning - Kristina Khvatova

Weak supervised learning - Kristina Khvatova

Data Science Milan

GANs beyond nice pictures: real value of data generation (theory and business applications) About the speaker, Alex Honchar: I am machine learning expert currently applying AI in medtech, fintech and other areas. I also enjoy teaching and blogging (50k+ views monthly) about deep learning applications. As an academia member, I have a track of scientific publications as well. Beside sciences, I travel, do sports and perform card magic.

GANs beyond nice pictures: real value of data generation, Alex Honchar

GANs beyond nice pictures: real value of data generation, Alex Honchar

GANs beyond nice pictures: real value of data generation, Alex Honchar

Data Science Milan

Humans have the extraordinary ability to learn continually from experience. Not only can we apply previously learned knowledge and skills to new situations, we can also use these as the foundation for later learning. One of the grand goals of AI is building an artificial continually learning agent that constructs a sophisticated understanding of the world from its own experience through the autonomous incremental development of ever more complex skills and knowledge. "Continual Learning" (CL) is indeed a fast emerging topic in AI concerning the ability to efficiently improve the performance of a deep model over time, dealing with a long (and possibly unlimited) sequence of data/tasks. In this workshop, after a brief introduction of the topic, we’ll implement different Continual Learning strategies and assess them on common vision benchmarks. We’ll conclude the workshop with a look at possible real world applications of CL. Vincenzo Lomonaco is a Deep Learning PhD student at the University of Bologna and founder of ContinualAI.org. He is also the PhD students representative at the Department of Computer Science of Engineering (DISI) and teaching assistant of the courses “Machine Learning” and “Computer Architectures” in the same department. Previously, he was a Machine Learning software engineer at IDL in-line Devices and a Master Student at the University of Bologna where he graduated cum laude in 2015 with the dissertation “Deep Learning for Computer Vision: a Comparison Between CNNs and HTMs on Object Recognition Tasks".

Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco

Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco

Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco

Data Science Milan

Processing 3D images has many use cases. For example, to improve autonomous car driving, to enable digital conversions of old factory buildings, to enable augmented reality solutions for medical surgeries, etc. Also 3D images help in 3D modeling and safety evaluation of products. 3D image processing brings enormous benefits but also amplifies computing cost. The size of the point cloud, the number of points, sparse and irregular point cloud, and the adverse impact of the light reflections, (partial) occlusions, etc., make it difficult for engineers to process point clouds. Moving from using hand crafted features to using deep learning techniques to semantically segment the images, to classify objects, to detect objects, to detect actions in 3D videos, etc., we have come a long way in 3D image processing. 3D Point Cloud image processing is increasingly used to solve Industry 4.0 use cases to help architects, builders and product managers. I will share some of the innovations that are helping the progress of 3D point cloud processing. I will share the practical implementation issues we faced while developing deep learning models to make sense of 3D Point Clouds. Attendees: Beginners and Intermediate skilled in Image Processing and 3D Point Clouds Profile of the speaker: SK Reddy is the Chief Product Officer AI in Hexagon (www.hexagon.com). He is an AI and ML expert and a successful twice startup entrepreneur. He is an AI startup advisor too. Also he is a frequent speaker in conferences and is an AI blogger.

3D Point Cloud analysis using Deep Learning

3D Point Cloud analysis using Deep Learning

3D Point Cloud analysis using Deep Learning

Data Science Milan

The notebook and documentation of the original tutorial is available at https://github.com/gm-spacagna/deep-ttf. Deep Time-to-Failure: predicting failures, churns and customer lifetime using recurrent neural networks. Machineries and customers are among the most valuable assets for many businesses. A common trait of these assets is that sooner or later they will fail or, in the case of customers, they will churn. In order to catch those failure events we would ideally consider the whole history of the machine/customer available information and learn smart representations of the system status over time. Traditional machine learning and statistical models approach the prediction of time-to-failure, aka. expected lifetime, as a supervised regression problem using handcrafted features. Training those models is hard because of three main reasons: The complexity of extracting predictive features from time-series without overfitting. The difficulty of modeling uncertainty and confidence levels in the predictions. The scarcity of labeled data, failure events are by definition rare and that results in highly unbalanced training datasets. The first issue can be solved adopting recurrent neural architectures. A solution to the the last two problems could be to exploit censored data and to build survival regression models. In this talk we will present a novel technique based on recurrent neural networks that can turn any length-variable sequence of data into a probability distribution representing the estimated remaining time to the failure event. The network will be trained in presence of ground truth as well as with right-censored data. We will demonstrate using a case study regarding 100 jet engine simulated degradation provided by NASA. During the tutorial you will learn: What is Survival Analysis and what are the most popular Survival Regression techniques. How a Weibull distribution can be used as generic distribution for modeling Time-to-Failure events. How to build a deep learning algorithm in Keras leveraging recurrent units (LSTM or GRU) that can map raw time-series of covariates into Weibull probability distributions. The tutorial will also cover a few common pitfalls, visualizations and evaluation tools useful for testing and adapting this approach to generic use cases. You are free to bring your laptop if you would like to do some live coding and experiment yourself. In this case we strongly encourage to check you have all of the requirements installed in your machine. More details on the required packages can be found on the Github repository gm-spacagna/deep-ttf.

Deep time-to-failure: predicting failures, churns and customer lifetime with ...

Deep time-to-failure: predicting failures, churns and customer lifetime with ...

Deep time-to-failure: predicting failures, churns and customer lifetime with ...

Data Science Milan

50 Shades of Text - Leveraging Natural Language Processing (NLP) to validate, improve, and expand the functionalities of a product Nowadays, every company either stores or produces text data: from web logs and user queries, to translations and support tickets, yet not everyone knows how to extract valuable insights from it. In this session, we will present a practical case on how to move from raw text data to a valuable business application leveraging upon some of the major NLP methodologies (word embedding, word2vec, doc2vec, fastText, etc.) Bio: Alessandro is a data veteran. He holds two Master’s degrees in computer engineering, one from Politecnico di Milano and the other from University of Illinois at Chicago (UIC). He started his career in data consultancy, where he mastered Apache Spark for Machine Learning projects and subsequently joined WW Grainger, one of the largest MRO e-commerce companies in the United States. In September 2017, after more than 5 years in the USA, Alessandro returned to his native country, Italy, where he is now leading a team of data scientists. His current work focuses on achieving energy efficiency through the automation of energy management processes for commercial customers.

50 Shades of Text - Leveraging Natural Language Processing (NLP), Alessandro ...

50 Shades of Text - Leveraging Natural Language Processing (NLP), Alessandro ...

50 Shades of Text - Leveraging Natural Language Processing (NLP), Alessandro ...

Data Science Milan

“Product close-out strategy” by Ilaria Gianoli, Data Scientist, Data Reply Abstract: How to deal with products in their decline phase? Ilaria will share her experience in optimizing the close-out strategy for a multinational retail leader, with a particular focus on the price optimization. Bio: Ilaria is a Data Scientist at Data Reply, where she works as a consultant across different industries, in particular in the Retail. She uses her mathematical, statistical and machine learning background to turning data into business opportunities. She also works closely to the business to provide quantitative support for decision making, adapting the complexity of the mathematical models to customer needs. She holds a MSc in Applied Statistics - Mathematical Engineering from Politecnico di Milano. “Online pricing: from theory to application” by Giovanni Corradini, Data Scientist, Data Reply Abstract: Multi-Armed Bandit algorithms are populating the world of e-commerce. How do they work? Giovanni will share the basic of this field and an application of a state-of-the-art algorithm on real world simulation of the ticket industry. Bio: Giovanni is a Data Scientist at Data Reply. He holds a MSc in Applied Statistics - Mathematical Engineering from Politecnico di Milano. He has a background in statistics, machine learning and data mining and he provides decision making support to industries in many different fields. “Renewal Price Optimization for Subscription products” by Riccardo Lorenzon, Data Scientist, Data Reply Abstract: We are observing a huge shift in modern economy from a pay-per-product model to a subscription-based model. When it comes to pricing strategies, it is important both to close the single deal and monetize long-term relationships with the customer. Riccardo will present an application of subscription renewal pricing optimization models for a company belonging to the publishing industry. Bio: Riccardo holds a MSc in Mathematical Models for Decision Making from Politecnico di Milano. He developed hands-on experience on end-to-end data projects across multiple industries. His proactive creativity helps him be very effective in the business case design and early stages of projects.

Pricing Optimization: Close-out, Online and Renewal strategies, Data Reply

Pricing Optimization: Close-out, Online and Renewal strategies, Data Reply

Pricing Optimization: Close-out, Online and Renewal strategies, Data Reply

Data Science Milan

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrigoni, Senior Data Scientist, Pirelli (pirelli.com) Abstract: Pirelli, a global performance tire manufacturer, uses data science in its 20 factories to improve quality and efficiency, and reduce energy consumption. For this “Smart Manufacturing” initiative, Pirelli’s data science team has developed predictive models and analytics tools to monitor processes, machines and materials on the factory floors. In this talk we will show some of the solutions we deploy, demonstrate how we used Domino’s data science platform and Plot.ly to build these solutions, and discuss the next steps in this journey towards predictive maintenance. Bio: Alberto Arrigoni is a data scientist at Pirelli, where he works to process sensors and telemetry data for IoT, Smart Factories and connected-vehicle applications. He works closely with all major business units such as R&D, industrial engineering and BI to develop tailored machine learning algorithms and production systems. He holds a PhD in biostatistics from the University of Milan Bicocca and prior to joining Pirelli was a staff data scientist at the National Institute of Molecular Genetics (Milan), as well as a Fulbright student at the Santa Clara University and visiting PhD student at Pacific Biosciences (Menlo Park, CA).

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...

Data Science Milan

Mehr von Data Science Milan (20)

ML & Graph algorithms to prevent financial crime in digital payments

ML & Graph algorithms to prevent financial crime in digital payments

ML & Graph algorithms to prevent financial crime in digital payments

How to use the Economic Complexity Index to guide innovation plans

How to use the Economic Complexity Index to guide innovation plans

How to use the Economic Complexity Index to guide innovation plans

Robustness Metrics for ML Models based on Deep Learning Methods

Robustness Metrics for ML Models based on Deep Learning Methods

Robustness Metrics for ML Models based on Deep Learning Methods

"You don't need a bigger boat": serverless MLOps for reasonable companies

"You don't need a bigger boat": serverless MLOps for reasonable companies

"You don't need a bigger boat": serverless MLOps for reasonable companies

Question generation using Natural Language Processing by QuestGen.AI

Question generation using Natural Language Processing by QuestGen.AI

Question generation using Natural Language Processing by QuestGen.AI

Speed up data preparation for ML pipelines on AWS

Speed up data preparation for ML pipelines on AWS

Speed up data preparation for ML pipelines on AWS

Serverless machine learning architectures at Helixa

Serverless machine learning architectures at Helixa

Serverless machine learning architectures at Helixa

MLOps with a Feature Store: Filling the Gap in ML Infrastructure

MLOps with a Feature Store: Filling the Gap in ML Infrastructure

MLOps with a Feature Store: Filling the Gap in ML Infrastructure

Reinforcement Learning Overview | Marco Del Pra

Reinforcement Learning Overview | Marco Del Pra

Reinforcement Learning Overview | Marco Del Pra

Time Series Classification with Deep Learning | Marco Del Pra

Time Series Classification with Deep Learning | Marco Del Pra

Time Series Classification with Deep Learning | Marco Del Pra

Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AI

Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AI

Ludwig: A code-free deep learning toolbox | Piero Molino, Uber AI

Audience projection of target consumers over multiple domains a ner and baye...

Audience projection of target consumers over multiple domains a ner and baye...

Audience projection of target consumers over multiple domains a ner and baye...

Weak supervised learning - Kristina Khvatova

Weak supervised learning - Kristina Khvatova

Weak supervised learning - Kristina Khvatova

GANs beyond nice pictures: real value of data generation, Alex Honchar

GANs beyond nice pictures: real value of data generation, Alex Honchar

GANs beyond nice pictures: real value of data generation, Alex Honchar

Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco

Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco

Continual/Lifelong Learning with Deep Architectures, Vincenzo Lomonaco

3D Point Cloud analysis using Deep Learning

3D Point Cloud analysis using Deep Learning

3D Point Cloud analysis using Deep Learning

Deep time-to-failure: predicting failures, churns and customer lifetime with ...

Deep time-to-failure: predicting failures, churns and customer lifetime with ...

Deep time-to-failure: predicting failures, churns and customer lifetime with ...

50 Shades of Text - Leveraging Natural Language Processing (NLP), Alessandro ...

50 Shades of Text - Leveraging Natural Language Processing (NLP), Alessandro ...

50 Shades of Text - Leveraging Natural Language Processing (NLP), Alessandro ...

Pricing Optimization: Close-out, Online and Renewal strategies, Data Reply

Pricing Optimization: Close-out, Online and Renewal strategies, Data Reply

Pricing Optimization: Close-out, Online and Renewal strategies, Data Reply

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...

"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...

Kürzlich hochgeladen

Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl (Bellary ) Booking Contact Details :- WhatsApp Chat :- +91-9352988975 If you're looking for India Call girls you've come to the right place. You'll find some of the most beautiful call girls in our location with. These ladies have pleasing personalities, hot figures, and a passion for physical pleasure. Call girls in India Lucknow Many men have booked them for their erotic and soul-mixing performances, which are sure to leave you with unforgettable memories. #K09 Escort Service India is available in the city for men and women of all ages. They can satisfy your sexual needs and will make your experience even more enjoyable and memorable. Whether you're looking for a blow-job, stripping, lovemaking, or other dirty acts, you'll be able to find a match for your tastes and budget. These highly trained professionals will help you have an unforgettable night. One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —9352988975 We are available 24*7 all days of the year. Call us — 9352988975 Thank you for Visiting.

Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...

Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...

Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Escorts Service Booking Contact Details :- WhatsApp Chat :- +91-7737669865 Call Girls In Model Towh +91-7737669865 !! Best Woman Seeking Man Call Girls Service, Escorts Service in Home Hotel in NCR 24 Hours Available Service Call Girls, Contact Us +91-7737669865 (Any Time. Any Where) Call Girls in , Noida, Gurgaon, Ghaziabad,Sexy Indian Female Escorts Service NCRWelcome To Escorts Service – An All Over New Very Sexy Hot Call Girls Agency Service Escorts In South NCR’s No. 1 High Profile Independent Female Escorts Service. We Provide Good Quality Educated Profile At #K09 Very Regnebal Price 100% Safe And Original.We Are Provide Escorts Service All OYO Hotels ,3*,4*,5* Star Hotel And Home Flat, Apartment. Guest-House. Services In -Call And Out – Call Both Are Services Available. 24Hrs. Any Time Any Where. In All Over Noida Gurgaon Ghaziabad Faridabad.More Information And Contact Profile Real Pic Visit Our Website City Wise Escorts Service Agency.Good Looking Cheap And Best Models Girls U Can Get Best Click On Link……Night Call Girls Now In Hotel Le Meridien Gurgaon Near Female Escort One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —7737669865 We are available 24*7 all days of the year. Call us — 7737669865 Thank you for Visiting.

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...

Saudi Arabia [ Abortion pills) Jeddah/riaydh/dammam/+966572737505☎️] cytotec tablets uses abortion pills 💊💊 How effective is the abortion pill? 💊💊 +966572737505) "Abortion pills in Jeddah" how to get cytotec tablets in Riyadh " Abortion pills in dammam*💊💊 The abortion pill is very effective. If you’re taking mifepristone and misoprostol, it depends on how far along the pregnancy is, and how many doses of medicine you take:💊💊 +966572737505) how to buy cytotec pills At 8 weeks pregnant or less, it works about 94-98% of the time. +966572737505[ 💊💊💊 At 8-9 weeks pregnant, it works about 94-96% of the time. +966572737505) At 9-10 weeks pregnant, it works about 91-93% of the time. +966572737505)💊💊 If you take an extra dose of misoprostol, it works about 99% of the time. At 10-11 weeks pregnant, it works about 87% of the time. +966572737505) If you take an extra dose of misoprostol, it works about 98% of the time. In general, taking both mifepristone and+966572737505 misoprostol works a bit better than taking misoprostol only. +966572737505 Taking misoprostol alone works to end the+966572737505 pregnancy about 85-95% of the time — depending on how far along the+966572737505 pregnancy is and how you take the medicine. +966572737505 The abortion pill usually works, but if it doesn’t, you can take more medicine or have an in-clinic abortion. +966572737505 When can I take the abortion pill?+966572737505 In general, you can have a medication abortion up to 77 days (11 weeks)+966572737505 after the first day of your last period. If it’s been 78 days or more since the first day of your last+966572737505 period, you can have an in-clinic abortion to end your pregnancy.+966572737505 Why do people choose the abortion pill? Which kind of abortion you choose all depends on your personal+966572737505 preference and situation. With+966572737505 medication+966572737505 abortion, some people like that you don’t need to have a procedure in a doctor’s office. You can have your medication abortion on your own+966572737505 schedule, at home or in another comfortable place that you choose.+966572737505 You get to decide who you want to be with during your abortion, or you can go it alone. Because+966572737505 medication abortion is similar to a miscarriage, many people feel like it’s more “natural” and less invasive. And some+966572737505 people may not have an in-clinic abortion provider close by, so abortion pills are more available to+966572737505 them. +966572737505 Your doctor, nurse, or health center staff can help you decide which kind of abortion is best for you. +966572737505 More questions from patients: Saudi Arabia+966572737505 CYTOTEC Misoprostol Tablets. Misoprostol is a medication that can prevent stomach ulcers if you also take NSAID medications. It reduces the amount of acid in your stomach, which protects your stomach lining. The brand name of this medication is Cytotec®.+966573737505) Unwanted Kit is a combination of two medicin

Abortion pills in Jeddah | +966572737505 | Get Cytotec

Abortion pills in Jeddah | +966572737505 | Get Cytotec

Abortion pills in Jeddah | +966572737505 | Get Cytotec

Abortion pills in Riyadh +966572737505 get cytotec

➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Escorts Service Booking Contact Details :- WhatsApp Chat :- +91-7737669865 Call Girls In Model Towh +91-7737669865 !! Best Woman Seeking Man Call Girls Service, Escorts Service in Home Hotel in NCR 24 Hours Available Service Call Girls, Contact Us +91-7737669865 (Any Time. Any Where) Call Girls in , India,Sexy Indian Female Escorts Service NCRWelcome To Escorts Service – An All Over New Very Sexy Hot Call Girls Agency Service Escorts In South NCR’s No. 1 High Profile Independent Female Escorts Service. We Provide Good Quality Educated Profile At #K09 Very Regnebal Price 100% Safe And Original.We Are Provide Escorts Service All OYO Hotels ,3*,4*,5* Star Hotel And Home Flat, Apartment. Guest-House. Services In -Call And Out – Call Both Are Services Available. 24Hrs. Any Time Any Where. In All Over Noida Gurgaon Ghaziabad Faridabad.More Information And Contact Profile Real Pic Visit Our Website City Wise Escorts Service Agency.Good Looking Cheap And Best Models Girls U Can Get Best Click On Link……Night Call Girls Now In Hotel Le Meridien Gurgaon Near Female Escort One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —7737669865 We are available 24*7 all days of the year. Call us — 7737669865 Thank you for Visiting.

➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...

➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...

➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore Booking Contact Details :- WhatsApp Chat :- +91-7737669865 2-May-2024(SMW) Call Girls In Model Towh Bangalore +91-7737669865 !! Best Woman Seeking Man Call Girls Service, Escorts Service in Home Hotel in Bangalore NCR 24 Hours Available Service Call Girls, Contact Us +91-7737669865 (Any Time. Any Where) Call Girls in Bangalore, Noida, Gurgaon, Ghaziabad,Sexy Indian Female Escorts Service Bangalore NCRWelcome To Bangalore Escorts Service – An All Over New Bangalore Very Sexy Hot Call Girls Agency Service Escorts In South BangaloreNCRBangalore’s No. 1 High Profile Independent Female Escorts Service. We Provide Good Quality Educated Profile At Very Regnebal Price 100% Safe And Original.We Are Provide Escorts Service All OYO Hotels ,3*,4*,5* Star Hotel And Home Flat, Apartment. Guest-House. Services In -Call And Out – Call Both Are Services Available. 24Hrs. Any Time Any Where. In All Over Bangalore Noida Gurgaon Ghaziabad Faridabad.More Information And Contact Profile Real Pic Visit Our Website City Wise Escorts Service Agency.Good Looking Cheap And Best Models Girls U Can Get Best Click On Link……Night Call Girls Now In Hotel Le Meridien Gurgaon Near Female Escort One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —7737669865 We are available 24*7 all days of the year. Call us — 7737669865 Thank you for Visiting.

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore Booking Contact Details :- WhatsApp Chat :- +91-7737669865 2-May-2024(SMW) Call Girls In Model Towh Bangalore +91-7737669865 !! Best Woman Seeking Man Call Girls Service, Escorts Service in Home Hotel in Bangalore NCR 24 Hours Available Service Call Girls, Contact Us +91-7737669865 (Any Time. Any Where) Call Girls in Bangalore, Noida, Gurgaon, Ghaziabad,Sexy Indian Female Escorts Service Bangalore NCRWelcome To Bangalore Escorts Service – An All Over New Bangalore Very Sexy Hot Call Girls Agency Service Escorts In South BangaloreNCRBangalore’s No. 1 High Profile Independent Female Escorts Service. We Provide Good Quality Educated Profile At Very Regnebal Price 100% Safe And Original.We Are Provide Escorts Service All OYO Hotels ,3*,4*,5* Star Hotel And Home Flat, Apartment. Guest-House. Services In -Call And Out – Call Both Are Services Available. 24Hrs. Any Time Any Where. In All Over Bangalore Noida Gurgaon Ghaziabad Faridabad.More Information And Contact Profile Real Pic Visit Our Website City Wise Escorts Service Agency.Good Looking Cheap And Best Models Girls U Can Get Best Click On Link……Night Call Girls Now In Hotel Le Meridien Gurgaon Near Female Escort One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —7737669865 We are available 24*7 all days of the year. Call us — 7737669865 Thank you for Visiting.

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...

Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand Booking Contact Details :- WhatsApp Chat :- +91-7737669865 Call Girls In Model Towh +91-7737669865 !! Best Woman Seeking Man Call Girls Service, Escorts Service in Home Hotel in NCR 24 Hours Available Service Call Girls, Contact Us +91-7737669865 (Any Time. Any Where) Call Girls in , Noida, Gurgaon, Ghaziabad,Sexy Indian Female Escorts Service NCRWelcome To Escorts Service – An All Over New Very Sexy Hot Call Girls Agency Service Escorts In South NCR’s No. 1 High Profile Independent Female Escorts Service. We Provide Good Quality Educated Profile At #K09 Very Regnebal Price 100% Safe And Original.We Are Provide Escorts Service All OYO Hotels ,3*,4*,5* Star Hotel And Home Flat, Apartment. Guest-House. Services In -Call And Out – Call Both Are Services Available. 24Hrs. Any Time Any Where. In All Over Noida Gurgaon Ghaziabad Faridabad.More Information And Contact Profile Real Pic Visit Our Website City Wise Escorts Service Agency.Good Looking Cheap And Best Models Girls U Can Get Best Click On Link……Night Call Girls Now In Hotel Le Meridien Gurgaon Near Female Escort One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —7737669865 We are available 24*7 all days of the year. Call us — 7737669865 Thank you for Visiting.

Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand

Anomaly detection and data imputation within time series

Anomaly detection and data imputation within time series

Anomaly detection and data imputation within time series

Paris Women in Machine Learning and Data Science

Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl (roorkee ) Booking Contact Details :- WhatsApp Chat :- +91-9352988975 If you're looking for India Call girls you've come to the right place. You'll find some of the most beautiful call girls in our location with. These ladies have pleasing personalities, hot figures, and a passion for physical pleasure. Call girls in India Lucknow Many men have booked them for their erotic and soul-mixing performances, which are sure to leave you with unforgettable memories. #K09 Escort Service India is available in the city for men and women of all ages. They can satisfy your sexual needs and will make your experience even more enjoyable and memorable. Whether you're looking for a blow-job, stripping, lovemaking, or other dirty acts, you'll be able to find a match for your tastes and budget. These highly trained professionals will help you have an unforgettable night. One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —9352988975 We are available 24*7 all days of the year. Call us — 9352988975 Thank you for Visiting.

Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...

Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...

Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...

Aspirational Block Program Block Syaldey District - Almora

Aspirational Block Program Block Syaldey District - Almora

Aspirational Block Program Block Syaldey District - Almora

GovindSinghDasila

Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (Erode ) Booking Contact Details :- WhatsApp Chat :- +91-9352988975 If you're looking for India Call girls you've come to the right place. You'll find some of the most beautiful call girls in our location with. These ladies have pleasing personalities, hot figures, and a passion for physical pleasure. Call girls in India Lucknow Many men have booked them for their erotic and soul-mixing performances, which are sure to leave you with unforgettable memories. #K09 Escort Service India is available in the city for men and women of all ages. They can satisfy your sexual needs and will make your experience even more enjoyable and memorable. Whether you're looking for a blow-job, stripping, lovemaking, or other dirty acts, you'll be able to find a match for your tastes and budget. These highly trained professionals will help you have an unforgettable night. One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —9352988975 We are available 24*7 all days of the year. Call us — 9352988975 Thank you for Visiting.

Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...

Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...

Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...

This project aims to predict whether a loan application will be approved or denied based on various factors such as applicant's income, credit score, loan amount, etc. Using a dataset containing historical loan application data, we employed machine learning algorithms to build a predictive model. The model was trained on features such as applicant's income, credit history, loan amount, loan term, and others. After training the model, we evaluated its performance using metrics like accuracy, precision, recall, and F1 score. The insights from this project can help financial institutions streamline their loan approval process and make informed decisions. Visit for more information: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/

Predicting Loan Approval: A Data Science Project

Predicting Loan Approval: A Data Science Project

Predicting Loan Approval: A Data Science Project

Boston Institute of Analytics

Klinik_ Apotek Onlin 085657271886 Solusi Menggugurkan Masalah Kehamilan Anda Jual Obat Aborsi Asli KLINIK ABORSI TERPEECAYA _ Jual Obat Aborsi Cytotec Misoprostol Asli 100% Ampuh Hanya 3 Jam Langsung Gugur || OBAT PENGGUGUR KANDUNGAN AMPUH MANJUR OBAT ABORSI OLINE" APOTIK Jual Obat Cytotec, Gastrul, Gynecoside Asli Ampuh. JUAL ” Obat Aborsi Tuntas | Obat Aborsi Manjur | Obat Aborsi Ampuh | Obat Penggugur Janin | Obat Pencegah Kehamilan | Obat Pelancar Haid | Obat terlambat Bulan | Ciri Obat Aborsi Asli | Obat Telat Bulan | Pil Aborsi Asli | Cara Menggugurkan Konten | Cara Aborsi Tuntas | Harga Obat Aborsi Asli | Pil Aborsi | Jual Obat Aborsi Cytotec | Cara Aborsi Sendiri | Cara Aborsi Usia 1 Bulan | Cara Aborsi Usia 2 Tahun | Cara Aborsi Usia 3 Bulan | Obat Aborsi Usia 4 Bulan | Cara Abrasi Usia 5 Bulan | Cara Menggugurkan Konten | Kandungan Obat Penggugur | Cara Menghitung Usia Konten | Cara Mengatasi Terlambat Bulan | Penjual Obat Aborsi Asli | Obat Aborsi Garansi | Kandungan Obat Peluntur | Obat Telat Datang Bulan | Obat Telat Haid | Obat Aborsi Paling Murah | Klinik Jual Obat Aborsi | Jual Pil Cytotec | Apotik Jual Obat Aborsi | Kandungan Dokter Abrasi | Cara Aborsi Cepat | Jual Obat Aborsi Bergaransi | Jual Obat Cytotec Asli | Obat Aborsi Aman Manjur | Obat Misoprostol Cytotec Asli. "APA ITU ABORSI" “Aborsi Adalah dengan membendung hormon yang di perlukan untuk mempertahankan kehamilan yaitu hormon progesteron, karena hormon ini dibendung, maka jalur kehamilan mulai membuka dan leher rahim menjadi melunak,sehingga mengeluarkan darah yang merupakan tanda bahwa obat telah bekerja || maksimal 1 jam obat diminum || PENJELASAN OBAT ABORSI USIA 1 _7 BULAN Pada usia kandungan ini, pasien akan merasakan sakit yang sedikit tidak berlebihan || sekitar 1 jam ||. namun hanya akan terjadi pada saatdarah keluar merupakan pertanda menstruasi. Hal ini dikarenakan pada usiakandungan 3 bulan,janin sudah terbentuk sebesar kepalan tangan orang dewasa. Cara kerja obat aborsi : JUAL OBAT ABORSI AMPUH dosis 3 bulan secara umum sama dengan cara kerja || DOSIS OBAT ABORSI 2 bulan”, hanya berbedanya selain mengisolasijanin juga menghancurkan janin dengan formula methotrexate dikandungdidalamnya. Formula methotrexate ini sangat ampuh untuk menghancurkan janinmenjadi serpihan-serpihan kecil akan sangat berguna pada saat dikeluarkan nanti. APA ALASAN WANITA MELAKUKAN ABORSI? Aborsi di lakukan wanita hamil baik yang sudah menikah maupun belum menikah dengan berbagai alasan , akan tetapi alasan yang utama adalah alasan-alasan non medis (termasuk aborsi sendiri / di sengaja/ buatan] MELAYANI PEMESANAN OBAT ABORSI SETIAP HARI, SIAP KIRIM KESELURUH KOTA BESAR DI INDONESIA DAN LUAR NEGERI. HUBUNGI PEMESANAN LEBIH NYAMAN VIA WA/: 085657271886

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage with sex Booking Contact Details :- WhatsApp Chat :- +91-9920725232 4-May-2024(SMW) Call Girls In Model Towh +91-9920725232 !! Best Woman Seeking Man Call Girls Service, Escorts Service in Home Hotel in NCR 24 Hours Available Service Call Girls, Contact Us +91-9920725232 (Any Time. Any Where) Call Girls in , Noida, Gurgaon, Ghaziabad,Sexy Indian Female Escorts Service NCRWelcome To Escorts Service – An All Over New Very Sexy Hot Call Girls Agency Service Escorts In South NCR’s No. 1 High Profile Independent Female Escorts Service. We Provide Good Quality Educated Profile At Very Regnebal Price 100% Safe And Original.We Are Provide Escorts Service All OYO Hotels ,3*,4*,5* Star Hotel And Home Flat, Apartment. Guest-House. Services In -Call And Out – Call Both Are Services Available. 24Hrs. Any Time Any Where. In All Over Noida Gurgaon Ghaziabad Faridabad.More Information And Contact Profile Real Pic Visit Our Website City Wise Escorts Service Agency.Good Looking Cheap And Best Models Girls U Can Get Best Click On Link……Night Call Girls Now In Hotel Le Meridien Gurgaon Near Female Escort One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —9920725232 We are available 24*7 all days of the year. Call us — 9920725232 Thank you for Visiting.

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...

Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service Bangalore Booking Contact Details :- WhatsApp Chat :- +91-9155563397 2-May-2024(SMW) Call Girls In Model Towh Bangalore +91-9155563397 !! Best Woman Seeking Man Call Girls Service, Escorts Service in Home Hotel in Bangalore NCR 24 Hours Available Service Call Girls, Contact Us +91-9155563397 (Any Time. Any Where) Call Girls in Bangalore, Noida, Gurgaon, Ghaziabad,Sexy Indian Female Escorts Service Bangalore NCRWelcome To Bangalore Escorts Service – An All Over New Bangalore Very Sexy Hot Call Girls Agency Service Escorts In South BangaloreNCRBangalore’s No. 1 High Profile Independent Female Escorts Service. We Provide Good Quality Educated Profile At Very Regnebal Price 100% Safe And Original.We Are Provide Escorts Service All OYO Hotels ,3*,4*,5* Star Hotel And Home Flat, Apartment. Guest-House. Services In -Call And Out – Call Both Are Services Available. 24Hrs. Any Time Any Where. In All Over Bangalore Noida Gurgaon Ghaziabad Faridabad.More Information And Contact Profile Real Pic Visit Our Website City Wise Escorts Service Agency.Good Looking Cheap And Best Models Girls U Can Get Best Click On Link……Night Call Girls Now In Hotel Le Meridien Gurgaon Near Female Escort One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —9155563397 We are available 24*7 all days of the year.

Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...

Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...

Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...

only4webmaster01

Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -

Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -

Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore Booking Contact Details :- WhatsApp Chat :- +91-7737669865 2-May-2024(SMW) Call Girls In Model Towh Bangalore +91-7737669865 !! Best Woman Seeking Man Call Girls Service, Escorts Service in Home Hotel in Bangalore NCR 24 Hours Available Service Call Girls, Contact Us +91-7737669865 (Any Time. Any Where) Call Girls in Bangalore, Noida, Gurgaon, Ghaziabad,Sexy Indian Female Escorts Service Bangalore NCRWelcome To Bangalore Escorts Service – An All Over New Bangalore Very Sexy Hot Call Girls Agency Service Escorts In South BangaloreNCRBangalore’s No. 1 High Profile Independent Female Escorts Service. We Provide Good Quality Educated Profile At Very Regnebal Price 100% Safe And Original.We Are Provide Escorts Service All OYO Hotels ,3*,4*,5* Star Hotel And Home Flat, Apartment. Guest-House. Services In -Call And Out – Call Both Are Services Available. 24Hrs. Any Time Any Where. In All Over Bangalore Noida Gurgaon Ghaziabad Faridabad.More Information And Contact Profile Real Pic Visit Our Website City Wise Escorts Service Agency.Good Looking Cheap And Best Models Girls U Can Get Best Click On Link……Night Call Girls Now In Hotel Le Meridien Gurgaon Near Female Escort One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —7737669865 We are available 24*7 all days of the year. Call us — 7737669865 Thank you for Visiting.

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amritsar No💰Advance Cash On Delivery Service Escorts Service Available Whatsapp Chaya ☎️ : [+91-6367187148] Escorts Service Amritsar are always ready to make their clients happy. Their exotic looks and sexy personalities are sure to turn heads. You can enjoy with them, including massages and erotic encounters.#P12Our area Escorts are young and sexy, so you can expect to have an exotic time with them. They are trained to satiate your naughty nerves and they can handle anything that you want. They are also intelligent, so they know how to make you feel comfortable and relaxed SERVICE ✅ ❣️ ⭐➡️HOT & SEXY MODELS // COLLEGE GIRLS HOUSE WIFE RUSSIAN , AIR HOSTES ,VIP MODELS . AVAILABLE FOR COMPLETE ENJOYMENT WITH HIGH PROFILE INDIAN MODEL AVAILABLE HOTEL & HOME ★ SAFE AND SECURE HIGH CLASS SERVICE AFFORDABLE RATE ★ SATISFACTION,UNLIMITED ENJOYMENT. ★ All Meetings are confidential and no information is provided to any one at any cost. ★ EXCLUSIVE PROFILes Are Safe and Consensual with Most Limits Respected ★ Service Available In: - HOME & HOTEL Star Hotel Service .In Call & Out call SeRvIcEs : ★ A-Level (star escort) ★ Strip-tease ★ BBBJ (Bareback Blowjob)Receive advanced sexual techniques in different mode make their life more pleasurable. ★ Spending time in hotel rooms ★ BJ (Blowjob Without a Condom) ★ Completion (Oral to completion) ★ Covered (Covered blowjob Without condom ★ANAL SERVICES.

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

karishmasinghjnh

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escorts Service Booking Contact Details :- WhatsApp Chat :- +91-7737669865 Call Girls In Model Towh +91-7737669865 !! Best Woman Seeking Man Call Girls Service, Escorts Service in Home Hotel in NCR 24 Hours Available Service Call Girls, Contact Us +91-7737669865 (Any Time. Any Where) Call Girls in , Noida, Gurgaon, Ghaziabad,Sexy Indian Female Escorts Service NCRWelcome To Escorts Service – An All Over New Very Sexy Hot Call Girls Agency Service Escorts In South NCR’s No. 1 High Profile Independent Female Escorts Service. We Provide Good Quality Educated Profile At #K09 Very Regnebal Price 100% Safe And Original.We Are Provide Escorts Service All OYO Hotels ,3*,4*,5* Star Hotel And Home Flat, Apartment. Guest-House. Services In -Call And Out – Call Both Are Services Available. 24Hrs. Any Time Any Where. In All Over Noida Gurgaon Ghaziabad Faridabad.More Information And Contact Profile Real Pic Visit Our Website City Wise Escorts Service Agency.Good Looking Cheap And Best Models Girls U Can Get Best Click On Link……Night Call Girls Now In Hotel Le Meridien Gurgaon Near Female Escort One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —7737669865 We are available 24*7 all days of the year. Call us — 7737669865 Thank you for Visiting.

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore Booking Contact Details :- WhatsApp Chat :- +91-7737669865 2-May-2024(SMW) Call Girls In Model Towh Bangalore +91-7737669865 !! Best Woman Seeking Man Call Girls Service, Escorts Service in Home Hotel in Bangalore NCR 24 Hours Available Service Call Girls, Contact Us +91-7737669865 (Any Time. Any Where) Call Girls in Bangalore, Noida, Gurgaon, Ghaziabad,Sexy Indian Female Escorts Service Bangalore NCRWelcome To Bangalore Escorts Service – An All Over New Bangalore Very Sexy Hot Call Girls Agency Service Escorts In South BangaloreNCRBangalore’s No. 1 High Profile Independent Female Escorts Service. We Provide Good Quality Educated Profile At Very Regnebal Price 100% Safe And Original.We Are Provide Escorts Service All OYO Hotels ,3*,4*,5* Star Hotel And Home Flat, Apartment. Guest-House. Services In -Call And Out – Call Both Are Services Available. 24Hrs. Any Time Any Where. In All Over Bangalore Noida Gurgaon Ghaziabad Faridabad.More Information And Contact Profile Real Pic Visit Our Website City Wise Escorts Service Agency.Good Looking Cheap And Best Models Girls U Can Get Best Click On Link……Night Call Girls Now In Hotel Le Meridien Gurgaon Near Female Escort One Shot — 5000/in call (time 1 hour), 6000/out call Two shot with one girl — 8000/in call (time 2 hour), 10000/out call Body to body massage with sex- 8000/in call (time 1 hour) Full night Service for one person– 12000/in call, 13000/out call (shot limit 3-4 shots) Full night Service for more than 1 person — please contact Us —7737669865 We are available 24*7 all days of the year. Call us — 7737669865 Thank you for Visiting.

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...

Kürzlich hochgeladen (20)

Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...

Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...

Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...

➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...

Abortion pills in Jeddah | +966572737505 | Get Cytotec

Abortion pills in Jeddah | +966572737505 | Get Cytotec

Abortion pills in Jeddah | +966572737505 | Get Cytotec

➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...

➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...

➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...

Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...

Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand

Call Girls In Shivaji Nagar ☎ 7737669865 🥵 Book Your One night Stand

Anomaly detection and data imputation within time series

Anomaly detection and data imputation within time series

Anomaly detection and data imputation within time series

Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...

Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...

Just Call Vip call girls roorkee Escorts ☎️9352988975 Two shot with one girl ...

Aspirational Block Program Block Syaldey District - Almora

Aspirational Block Program Block Syaldey District - Almora

Aspirational Block Program Block Syaldey District - Almora

Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...

Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...

Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...

Predicting Loan Approval: A Data Science Project

Predicting Loan Approval: A Data Science Project

Predicting Loan Approval: A Data Science Project

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...

Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...

Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...

Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...

Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -

Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -

Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...

Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...

Data intensive applications with Apache Flink - Simone Robutti, Radicalbit

1. Milan – July 13 2016 Data Intensive Applications with Apache Flink Simone Robutti Machine Learning Engineer at Radicalbit @SimoneRobutti

2. Agenda 1. Brief Introduction to Apache Flink ○ Why ○ What ○ How 2. Machine Learning on Flink ○ Present landscape ○ Future of the Ecosystem 3. Closing notes on Radicalbit (shameless plug ahead)

3. 100% Buzzword-free guaranteed Big Data Machine Intelligence Web-scale 400x It’s like the human brain Exactly-once Exactly-once

4. Why Flink (and not Spark/Storm/Samza...) Because it’s production-ready streaming-first low-latency fault-tolerant high-throughput processing engine

5. Flink: what is it? From Flink’s Documentation

6. Connectors and integrations

7. Flink’s Runtime From Flink’s Documentation

8. Flink’s DataFlow From Flink’s Documentation Written by the user through DataSet/DataStream API Compiled and optimized in the client

9. Flink’s DataFlow From Flink’s Documentation The compiled job is translated to distributed tasks by the master and executed by workers

10. Machine Learning on Flink

11. Ready and awesome for parallel ML Work in progress for distributed ML ML on Flink

12. Flink for Model Evaluation Pipelines Source Data Preparation Evaluation Sink Source Post process -ing Composable, modular Flink Operator

13. Evaluation with Flink-JPMML Source Operator Flink - JPMML Operator Sink Operator Source Operator model.pmml Small library that implements basic model eval. Data Preparation

14. “I have seen people insisting on using Hadoop for datasets that could easily fit on a flash drive and could easily be processed on a laptop.” - Yann LeCun - ML on Flink

15.

16. FlinkML What: Out-of-the-box workhorse algorithms (ALS, SVM, LinReg, LogReg …) Status: early phase, slow development

17. FlinkML Pro: available out of the box, written with Flink API Cons: reinvents the wheel, only a few algorithms, no model persistence

18. Samsara What: Linear algebra framework Status: mature

19. Samsara Pro: generic algorithms with platform-specific bindings, skilled community Cons: covers only a few use cases

20. SAMOA What: Online learning algorithm framework (VHT, AMR, …) Status: early phase, complicated relationship with the industry

21. SAMOA Pro: many powerful generic online learning algorithms, backed by academics (MOA, Weka) Cons: not production ready, academic focus

22. ML on Flink: the future of the ecosystem

23. Apache Beam Programming model for data processing pipelines ● Streaming first, batch as a bounded stream ● Layered API: What, Where, When, How ● Platform agnostic: same program, different runners

24. Apache Beam - Runners ● Flink ● Spark (Partial) ● Google Cloud Dataflow ● Plain Java ● Gearpump (WIP) ● Apex (WIP)

25. BeamML: a runner-agnostic ML library

26. FlinkML Roadmap ● More algorithms! ● Evaluation framework ● Persistence/export ● Online Learning Framework

27. Proteus Online Learning Platform - based on Flink Source: Proteus’ website

28. The role of Radicalbit

29. Contributions ● Cassandra Connector ● Scala API extensions ● FlinkML (Linear Algebra Framework, MinHash) ● Akka Connector

30. Our vision Flink can become the ideal choice to build real-time decision- heavy applications with high data-throughput To achieve this: ● Ambitious applications (aim for real-time services) ● Reliable distributed online learning (Proteus?) ● A Pipelining Framework (experiment fast, increase testability and modularity)

32. THANKS! Simone Robutti Mail: simone.robutti@radicalbit.io Medium: @simone.robutti Twitter: @SimoneRobutti — @weareradicalbit