SlideShare a Scribd company logo
1 of 66
Machine learning:
          Keywords + Applications
1) Applications of machine learning
  - wind power forecasting (important e.g. for PengHu island!)
  - rainfalls estimation
2) Some key words (you must know what they mean):
  - black box / white box
  - shrinking horizon
  - objective function
  - “what you get is what you have”
  - model complexity
  - cross-validation
  - generative model
  - quantile, value-at-risk
What you will see
                  in these slides
1) Applications of machine learning
  - wind power forecasting (important e.g. for PengHu island!)
  - rainfalls estimation
2) Some key words (you must know what they mean):
  - black box / white box
  - shrinking horizon
  - objective function
  - “what you get is what you have”
  - model complexity
  - cross-validation
  - generative model
  - quantile, value-at-risk
I want to produce
    electricity
I want to produce
               electricity


I have:
- water for hydroelectricity
- a nuclear power plant
- wind farms

- gas turbines
I want to produce
                electricity

I must ensure, for each time step:

        Production of electricity
                    =
         Demand of electricity

 Demand(t0), Demand(t1), Demand(t2), Demand(t3) known.
I want to produce
                electricity

We get four equations:
Production(t0) = Demand(t0)
Production(t1) = Demand(t1)
Production(t2) = Demandt(2)
Production(t3) = Demand(t3)


                      Other equation:
    Production = hydro-production + nuclear-production
              + wind-farm production + gas production
I want to produce
               electricity

We get four equations:
H(t0)+W(t0)+N(t0)+G(t0) = Demand(t0)
H(t1)+W(t1)+N(t1)+G(t1) = Demand(t1)
H(t2)+W(t2)+N(t2)+G(t2) = Demandt(2)
H(t3)+W(t3)+N(t3)+G(t3) = Demand(t3)

Stock level for Hydro depends on production
  x(1) = x(0)-H(0) x(2) = x(1)-H(1)
  x(3) = x(2)-H(2) x(4) = x(3)-H(3)
Also depends on inflows

We get four equations:
H(t0)+W(t0)+N(t0)+G(t0) = Demand(t0)
H(t1)+W(t1)+N(t1)+G(t1) = Demand(t1)
H(t2)+W(t2)+N(t2)+G(t2) = Demandt(2)
H(t3)+W(t3)+N(t3)+G(t3) = Demand(t3)

Stock level for Hydro: x(0); constraint: x(i) >= 0
  x(1) = x(0)+I(0)-H(0) x(2) = x(1)+I(1)-H(1)
  x(3) = x(2)+I(2)-H(2) x(4) = x(3)+I(3)-H(3)
8 equations
           (yes, it increases...)

H(t0)+W(t0)+N(t0)+G(t0) = Demand(t0)
H(t1)+W(t1)+N(t1)+G(t1) = Demand(t1)
H(t2)+W(t2)+N(t2)+G(t2) = Demandt(2)
H(t3)+W(t3)+N(t3)+G(t3) = Demand(t3)

X(0) + I(0) – H(0) >=0
X(0) + I(0) – H(0) + I(1) – H(1) >=0
X(0) + I(0) – H(0) + I(1) – H(1) + I(2) – H(2) >=0
X(0) + I(0)–H(0) +I(1)– H(1) +I(2)–H(2) +I(3)-H(3)>=0
8 equations
           (yes, it increases...)


Nuclear has constraints as well:

- N(1) in f(N(0))
- N(2) in f(N(1))
- N(3) in f(N(2))

  (very simplified; in fact there are stocks, refills...)
Ok! Summary ?
W(0), W(1), W(2), W(3) wind farms production
   = can not be chosen and
       W(1), W(2), W(3) unknown!

To be chosen:
G(0), G(1), G(2), G(3) gas turbines production

H(0), H(1), H(2), H(3) hydroelectric production
                       (can be somehow negative)

N(0), N(1), N(2), N(3) nuclear power
Ok! Summary ?

To be chosen:
G(0), G(1), G(2), G(3) gas turbines production
H(0), H(1), H(2), H(3) hydroelectric production
                       (can be somehow negative)
N(0), N(1), N(2), N(3) nuclear power


Constraints: production plans must satisfy constraints.

E.g.: if unlimited gas turbines production, we might decide
      G(0)=demand(0)-W(0), G(1)=demand(1)-W(1),
        G(2)=demand(2)-W(2), G(3)=demand(3)-W(3)
                 ==> it is a feasible solution
Ok! Summary ?

To be chosen:
G(0), G(1), G(2), G(3) gas production
H(0), H(1), H(2), H(3) hydroelectric production
                       (can be somehow negative)
N(0), N(1), N(2), N(3) nuclear power


Constraints: production plans must satisfy constraints.

E.g.: if unlimited gas production, we might decide
      G(0)=demand(0)-W(0), G(1)=demand(1)-W(1),
        G(2)=demand(2)-W(2), G(3)=demand(3)-W(3)
                 ==> it is a feasible solution
                 ==> it is a bad feasible solution
Ok! Summary ?

To be chosen:
G(0), G(1), G(2), G(3) gas production
H(0), H(1), H(2), H(3) hydroelectric production
                       (can be somehow negative)
N(0), N(1), N(2), N(3) nuclear power


Constraints: production plans must satisfy constraints.

E.g.: if unlimited gas production, we might decide
      G(0)=demand(0)-W(0), G(1)=demand(1)-W(1),
        G(2)=demand(2)-W(2), G(3)=demand(3)-W(3)
                 ==> it is a feasible solution
                 ==> it is a bad feasible solution

Objective function: not all solutions are equivalent!
Ok! Summary ?


Production cost:

  Hcost * (H0+H1+H2+H3)
 + Ncost * (N0+N1+N2+N3)
 + Gcost * (G0+G1+G2+G3)
 + Wcost* (W0+W1+W2+W3)

Nb: Cost does not only mean $.
   Cost means ecological & environmental costs as well.
Quizz !

So we have:
x0,x1,x2,x3: states at time t0, t1, t2, t3.
x0 is given, x1, x2, x3 depend on our decisions.

Some decisions are chosen at time t0.
Some decisions are chosen at time t1.
Some decisions are chosen at time t2.
Some decisions are chosen at time t3.

The cost depends on all decisions.

Is this a supervised learning problem ?
Is this a reinforcement learning problem ?
Is this a boring problem ?
Ok! Summary ?


So we have equations.
If we know W(1),W(2),W(3),
    we can evaluate the production cost.
We want to:
   - solve equations
   - minimize production cost

Problem: we don't know W(1), W(2), W(3).
How to know ?
Ok! Summary ?

        We want to know W(1), W(2), W(3).

                      Steps:

    (1) Weather simulation: we predict the wind
        at time steps t1 t2 t3 (as in classical
                  weather forecast)

             (2) From the wind forecast,
     predict the power (e.g. “black box” model):
      Based on data
      E.g. mean-square error

     Predicting W(1), W(2), W(3):
     Boring problem ?
     Supervised learning problem ?
     Reinforcement learning problem ?
Ok! Summary ?

        We want to know W(1), W(2), W(3).

                       Steps:

    (1) Weather simulation: we predict What does
                                          the wind
        at time steps t1 t2 t3 (as in classical box”
                                         “black
                  weather forecast)        mean ?

             (2) From the wind forecast,
     predict the power (e.g. “black box” model):
      Based on data
      E.g. mean-square error

     Predicting W(1), W(2), W(3):
     Boring problem ?
     Supervised learning problem ?
     Reinforcement learning problem ?
Difficulties ?


In many cases, you will see in your life as an engineer that:

- collecting datas and models is a big
        part of the work

- solving the problem exactly is impossible

- what really matters in an application is to
       find where the current codes are
       not satisfactory, and not to spend time on
       other aspects
Typical questions for
  this application
      Many constraints /
           effects
        are missing !

         (for the real
         application,
       we must have far
             more
        constraints...)
Typical questions for
                       this application
                           Many constraints /
                                effects
                             are missing !

Mean square                   (for the real
 error in the                 application,
 supervised                 we must have far
 learning for                     more
W1,W2,W3 ?                   constraints...)

 But ..........
  ................
 .................
Typical questions for
                       this application
                           Many constraints /
                                effects
                             are missing !

                              (for the real      How many time
Mean square
                              application,      steps in the future
 error in the
                            we must have far         should
 supervised
                                  more            we consider ?
 learning for
W1,W2,W3 ?                   constraints...)

 But ..........
  ................
 .................
Typical questions for
                        this application
                            Many constraints /
                                 effects
                              are missing !

                               (for the real      How many time
 Mean square
                               application,      steps in the future
  error in the
                             we must have far         should
  supervised
                                   more            we consider ?
  learning for
 W1,W2,W3 ?                   constraints...)

  But ..........
   ................
  .................


  We should
   penalize
cases with W4
   small !
Typical questions for
                        this application
                            Many constraints /
                                 effects
                              are missing !

                               (for the real      How many time
 Mean square
                               application,      steps in the future
  error in the
                             we must have far         should
  supervised
                                   more            we consider ?
  learning for
 W1,W2,W3 ?                   constraints...)

  But ..........
   ................                                   In case of long
  .................                                        term:
                                                         should we
  We should                                               consider
                                                     “climate change”
   penalize
cases with W4                                              bias ?
   small !
Some of these points
   Typical questions for                   are important, some
                                              are negligible,
     this application                       depending on the
                                                 system
                      Many constraints /     under analysis.
                           effects
                        are missing !

                        (for the real       How many time
 Mean square
                        application,       steps in the future
  error in the
                      we must have far          should
  supervised
                            more             we consider ?
  learning for
 W1,W2,W3 ?            constraints...)

  But ..........
   ................                             In case of long
  .................                                  term:
                                                   should we
  We should                                         consider
                                               “climate change”
   penalize
cases with W4                                        bias ?
   small !
Another beautiful application

This is Paris.
Beautiful town.
With plenty of people
(10 millions in IDF).
Another beautiful application

This is Paris.
Beautiful town.
With plenty of people
(10 millions in IDF).
Producing plenty of fecal
matter ==> dirty water.
Our river in Paris
 is the “Seine”.

    A French
  politician said
 he would soon
 swim across it.

After all, he never
       did it.

 For your health,
   don't do it.

  Nevertheless,
      we try
    to keep it
     as clean
   as possible.
Dirty water should be separated from the Seine.
And usually it is.
Something like this:




                                      Seine

              Dirty
              water
Problem: if big rainfalls reach dirty water,
    then dirty water might pollute the Seine


                        Seine

Dirty
water
No typhoon in France.
But we can have heavy rains/winds in Paris:
- 0.96 dm in 24 hours happened in 1987.
- gusts at 169 km/h in 1999 (very unusual in France)

                Problem: if big rainfalls reach dirty water,
                   then dirty water might pollute the Seine


                                       Seine

              Dirty
              water
                     (yes, in Taiwan it is more impressive,
              sometimes it is 16.7 dm in 24 hours and gusts
                                     can reach 250 km/h...)
No typhoon in France.
But we can have heavy rains/winds in Paris:
- 0.96 dm in 24 hours happened in 1987.
- gusts at 169 km/h in 1999 (very unusual in France)

                Problem: if big rainfalls reach dirty water,
                   then dirty water might pollute the Seine


                                       Seine

              Dirty
              water
                     (yes, in Taiwan it is more impressive,
              sometimes it is 16.7 dm in 24 hours and gusts
                                     can reach 250 km/h...)
No typhoon in France.
But we can have heavy rains/winds in Paris:
- 0.96 dm in 24 hours happened in 1987.
- gusts at 169 km/h in 1999 (very unusual in France)

                Problem: if big rainfalls reach dirty water,
                   then dirty water might pollute the Seine


                                       Seine
                Dirty
               water
              → Seine!(yes, in Taiwan it is more impressive,
              sometimes it is 16.7 dm in 24 hours and gusts
                                     can reach 250 km/h...)
Another beautiful application

Three water networks:

- dirty water: should go to cleaning stations



- clean water: can go to the Seine, but can't be drunk



- drinkable water (France: tap water = drinkable)
Big water network




Dirty   Dirty   Dirty    Dirty
water   water   water    water




Clean   Clean   Clean    Clean
water   water   water    water
Water vs dirty water

Challenge:
Summer storms.
Not comparable to a Taiwanese typhoon.
But a lot of water.
Can make dirty water become very big.
Can invade clean water.

Your mission:
- Get read of dirty water
- Protect clean water
Water vs dirty water

State: level in each stock,
     valves' status
     (open or closed)

At each time step,
   rainfalls(i) liters of water reach stock i.
   you can open or close valves
   ==> get a new state.

Your mission:
  - Get read of dirty water
  - Protect clean water
Water vs dirty water

   Typically:
(0, 1, 0, 0, 0, 1, 0, 1,   0.42, 0.2, 0.0, 0.8, 0.3)
    (valves)                  (stock levels)

Plenty of rules:
- if (valve 4 opens, then water from stock 1
      goes to stock 2 at rate 0.02m3/s)
- if (stock[2]>0.3) then dirty water ==> Seine,
            3
       0.1m /s

==> Miminize the quantity of dirty water in clean
    stocks at the end of the storm
Water vs dirty water


                                          D-dimensional
                                              vector
Equations:

Stocks(t+1) = complicatedFunction(Stocks(t),
                        rainfalls(t), valves(t))
   D-dimensional
       vector
(D=number of stocks)      D-dimensional          V-dimensional
                              vector                 vector
                       (D=number of stocks)   (V=number of valves)
Water vs dirty water

  To be decided:
valves(t) for each t

                       If there are 240 times steps,
                       we get 240 x V decision
   V-dimensional
       vector
                       variables
(V=number of valves)



Criterion = objective function = quantity of dirty
water reaching the clean network + quantity of
dirty water in the river
Shrinking horizon



Too many time steps!

At each time step, make a decision
using only 30 time steps.

Move this window of 30 time steps.
Shrinking horizon



Too many time steps!

At each time step, make a decision
using only 30 time steps.

Move this window of 30 time steps.
Shrinking horizon



Too many time steps!

At each time step, make a decision
using only 30 time steps.

Move this window of 30 time steps.
Shrinking horizon


Too many time steps!

At each time step, make a decision
using only 30 time steps.

Move this window of 30 time steps.
Shrinking horizon




moving window of
 30 time steps
Summary ?



    Is this:
- an optimization problem ?
- a reinforcement learning problem ?
- a supervised machine learning problem ?
Summary ?



    Is this:
- an optimization problem ?
- a reinforcement learning problem ?
- a supervised machine learning problem ?

Problem: rainfalls are unknown.
How to predict rainfalls ?

In fact, there are distinct rainfalls:
 - R1: a spatial distribution of rainfalls
         (one number per time step
        per point of the map)
 - R2:
    a underground list of rainfall arrivals (inflows),
    per stocks (D-dimensional)

Input data:
 - weather forecast of archive ( R1(t) for each t)
 - archives of weather forecast R1(t)
 - archives of inflows R2(t)
If your life was depending on it, what
would you do ?
If your life was depending on it, what
  would you do ?


We are at time t.
We need a forecaster:
- which takes available data as input
- and outputs R2(t') for t'>=t (why not for t' < t ?)
If your life was depending on it, what
  would you do ?


We are at time t.
We need a forecaster:
- which takes available data as input
- and outputs R2(t') for t'>=t (why not for t' < t ?)

(R2(t+1),R2(t+2),R2(t+3), .... , R2(t+30))

   =?
If your life was depending on it, what
  would you do ?


We are at time t.
We need a forecaster:
- which takes available data as input
- and outputs R2(t') for t'>=t (why not for t' < t ?)

(R2(t+1),R2(t+2),R2(t+3), .... , R2(t+30))

   = f( R1(t) ) ?
If your life was depending on it, what
  would you do ?


We are at time t.
We need a forecaster:
- which takes available data as input
- and outputs R2(t') for t'>=t (why not for t' < t ?)

(R2(t+1),R2(t+2),R2(t+3), .... , R2(t+30))

   = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50) )

                    (because there are delays)
If your life was depending on it, what
  would you do ?


We are at time t.
We need a forecaster:
- which takes available data as input
- and outputs R2(t') for t'>=t (why not for t' < t ?)

(R2(t+1),R2(t+2),R2(t+3), .... , R2(t+30))

   = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) )

                    (because “what you get is what you have”)
If your life was depending on it, what
  would you do ?


   = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) )

and then agregation:
   = f( R1(t),
        R1(t-1)+R1(t-2),
        R1(t-3)+R1(t-4)+R1(t-5)+R1(t-6),
        +...,
        R2(t) )

Why ?
If your life was depending on it, what
  would you do ?


   = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) )

and then agregation:
   = f( R1(t),
        R1(t-1)+R1(t-2),
        R1(t-3)+R1(t-4)+R1(t-5)+R1(t-6),
        +...,
        R2(t) )

Because less parameters.
If your life was depending on it, what
  would you do ?

   = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) )

and then agregation:
   = f( R1(t),
        R1(t-1)+R1(t-2),
        R1(t-3)+R1(t-4)+R1(t-5)+R1(t-6),
        +...,
        R2(t) )

Because less parameters.
Rule of thumb: number of parameters
      less than number of data points / 20 <=== why ?
If your life was depending on it, what
  would you do ?

   = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) )

and then agregation:
   = f( R1(t),
        R1(t-1)+R1(t-2),
        R1(t-3)+R1(t-4)+R1(t-5)+R1(t-6),
        +...,
        R2(t) )

Because less parameters.
Rule of thumb: number of parameters
      less than number of data points / 20 <=== why ?

How to choose between all these models ?
If your life was depending on it, what
  would you do ?

   = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) )

and then agregation:
   = f( R1(t),
        R1(t-1)+R1(t-2),
        R1(t-3)+R1(t-4)+R1(t-5)+R1(t-6),
        +...,
        R2(t) )

Because less parameters.
Rule of thumb: number of parameters
      less than number of data points / 20 <=== why ?

How to choose between all these models ?          Cross-validation.
Main weakness of this analysis ?



The same as in the previous application.

We predicted R2(t), R2(t+1), ....

Then we maximize cleanness based on these forecasts.

But there are huge uncertainties.
Main weakness of this analysis ?

This is often done in real world.
                                       No change on the
Often, we do not spend          time optimization algorithm
                                      on checking that         the consequences
are minor.                          (we are just pessimistic
                                        in the forecasts)
“Cheap” solutions (do not take too much time):

   - predicting a quantile (do you know how ?)
      instead of a conditional expectation
      and check on simulations

   - predicting a conditional expectation +
         moments (do you know how ?)

       Then, optimize on average
(slight change in the objective function)
What about an exact solution ?

The exact solution is much harder to implement.

We can use forecasts with moments.

Then, we get a MDP.

Then, this is reinforcement learning.

- simple: forecasting + optimizing
- a bit more complex: pessimistic forecasting + optimizing
- more complex: forecasting with moments + optimizing on average
           or optimizing a quantile (“value at risk”)
- advanced: full reinforcement learning model
What about an exact solution ?
The best choice depends on the precision of your model,
the budget you have.

Some problems involve billions of US $ and have precise models.
Then, each percent of improvement represents more money than
  all your professional life. Then, you can (must)
  implement something very advanced.

Sometimes, model are very imprecise.
Then, optimizing at 0.001% is meaningless. Improving the model
is more important.

- simple: forecasting + optimizing
- a bit more complex: pessimistic forecasting + optimizing
- more complex: forecasting with moments + optimizing on average
            or optimizing a quantile (“value at risk”)
- advanced: full reinforcement learning model
What do you think ?
Did you understand ?

1) Applications of machine learning
  - wind power forecasting (important e.g. for PengHu island!)
  - rainfalls estimation
2) Some key words (you must know what they mean):
  - black box / white box
  - shrinking horizon
  - objective function
  - “what you get is what you have”
  - model complexity
  - cross-validation
  - generative model
  - quantile, value-at-risk

                ===> olivier.teytaud@inria.fr

More Related Content

Viewers also liked

Viewers also liked (8)

Combining games artificial intelligences & improving random seeds
Combining games artificial intelligences & improving random seedsCombining games artificial intelligences & improving random seeds
Combining games artificial intelligences & improving random seeds
 
Bias correction, and other uncertainty management techniques
Bias correction, and other uncertainty management techniquesBias correction, and other uncertainty management techniques
Bias correction, and other uncertainty management techniques
 
Monte Carlo Tree Search in 2014 (MCMC days in Marseille)
Monte Carlo Tree Search in 2014 (MCMC days in Marseille)Monte Carlo Tree Search in 2014 (MCMC days in Marseille)
Monte Carlo Tree Search in 2014 (MCMC days in Marseille)
 
Disappointing results & open problems in Monte-Carlo Tree Search
Disappointing results & open problems in Monte-Carlo Tree SearchDisappointing results & open problems in Monte-Carlo Tree Search
Disappointing results & open problems in Monte-Carlo Tree Search
 
Simulation-based optimization: Upper Confidence Tree and Direct Policy Search
Simulation-based optimization: Upper Confidence Tree and Direct Policy SearchSimulation-based optimization: Upper Confidence Tree and Direct Policy Search
Simulation-based optimization: Upper Confidence Tree and Direct Policy Search
 
Examples of operational research
Examples of operational researchExamples of operational research
Examples of operational research
 
Planning for power systems
Planning for power systemsPlanning for power systems
Planning for power systems
 
Direct policy search
Direct policy searchDirect policy search
Direct policy search
 

Similar to Keywords and examples of machine learning

Tools for artificial intelligence: EXP3, Zermelo algorithm, Alpha-Beta, and s...
Tools for artificial intelligence: EXP3, Zermelo algorithm, Alpha-Beta, and s...Tools for artificial intelligence: EXP3, Zermelo algorithm, Alpha-Beta, and s...
Tools for artificial intelligence: EXP3, Zermelo algorithm, Alpha-Beta, and s...
Olivier Teytaud
 
2013-1 Machine Learning Lecture 05 - Andrew Moore - Support Vector Machines
2013-1 Machine Learning Lecture 05 - Andrew Moore - Support Vector Machines2013-1 Machine Learning Lecture 05 - Andrew Moore - Support Vector Machines
2013-1 Machine Learning Lecture 05 - Andrew Moore - Support Vector Machines
Dongseo University
 

Similar to Keywords and examples of machine learning (20)

Tools for artificial intelligence: EXP3, Zermelo algorithm, Alpha-Beta, and s...
Tools for artificial intelligence: EXP3, Zermelo algorithm, Alpha-Beta, and s...Tools for artificial intelligence: EXP3, Zermelo algorithm, Alpha-Beta, and s...
Tools for artificial intelligence: EXP3, Zermelo algorithm, Alpha-Beta, and s...
 
MongoDB World 2019: Event Horizon: Meet Albert Einstein As You Move To The Cloud
MongoDB World 2019: Event Horizon: Meet Albert Einstein As You Move To The CloudMongoDB World 2019: Event Horizon: Meet Albert Einstein As You Move To The Cloud
MongoDB World 2019: Event Horizon: Meet Albert Einstein As You Move To The Cloud
 
Bayesian Autoencoders (BAE) & Honest Thoughts on research
Bayesian Autoencoders (BAE) & Honest Thoughts on research Bayesian Autoencoders (BAE) & Honest Thoughts on research
Bayesian Autoencoders (BAE) & Honest Thoughts on research
 
Tools for artificial intelligence
Tools for artificial intelligenceTools for artificial intelligence
Tools for artificial intelligence
 
Microchip Mfg. problem
Microchip Mfg. problemMicrochip Mfg. problem
Microchip Mfg. problem
 
Acm aleppo cpc training fifth session
Acm aleppo cpc training fifth sessionAcm aleppo cpc training fifth session
Acm aleppo cpc training fifth session
 
Security of Artificial Intelligence
Security of Artificial IntelligenceSecurity of Artificial Intelligence
Security of Artificial Intelligence
 
Stochastic modelling and quasi-random numbers
Stochastic modelling and quasi-random numbersStochastic modelling and quasi-random numbers
Stochastic modelling and quasi-random numbers
 
WINSEM2016-17_CSE1002_LO_1336_24-JAN-2017_RM003_session 10.pptx
WINSEM2016-17_CSE1002_LO_1336_24-JAN-2017_RM003_session 10.pptxWINSEM2016-17_CSE1002_LO_1336_24-JAN-2017_RM003_session 10.pptx
WINSEM2016-17_CSE1002_LO_1336_24-JAN-2017_RM003_session 10.pptx
 
Introduction to Deep Neural Network
Introduction to Deep Neural NetworkIntroduction to Deep Neural Network
Introduction to Deep Neural Network
 
Mm chap08 -_lossy_compression_algorithms
Mm chap08 -_lossy_compression_algorithmsMm chap08 -_lossy_compression_algorithms
Mm chap08 -_lossy_compression_algorithms
 
Applied numerical methods lec1
Applied numerical methods lec1Applied numerical methods lec1
Applied numerical methods lec1
 
Introduction to computing Processing and performance.pdf
Introduction to computing Processing and performance.pdfIntroduction to computing Processing and performance.pdf
Introduction to computing Processing and performance.pdf
 
Stephens-L
Stephens-LStephens-L
Stephens-L
 
Lecture4 xing
Lecture4 xingLecture4 xing
Lecture4 xing
 
Betrayal, Distrust, and Rationality: Smart Counter-Collusion Contracts for Ve...
Betrayal, Distrust, and Rationality: Smart Counter-Collusion Contracts for Ve...Betrayal, Distrust, and Rationality: Smart Counter-Collusion Contracts for Ve...
Betrayal, Distrust, and Rationality: Smart Counter-Collusion Contracts for Ve...
 
2013-1 Machine Learning Lecture 05 - Andrew Moore - Support Vector Machines
2013-1 Machine Learning Lecture 05 - Andrew Moore - Support Vector Machines2013-1 Machine Learning Lecture 05 - Andrew Moore - Support Vector Machines
2013-1 Machine Learning Lecture 05 - Andrew Moore - Support Vector Machines
 
Yoyak ScalaDays 2015
Yoyak ScalaDays 2015Yoyak ScalaDays 2015
Yoyak ScalaDays 2015
 
Support Vector Machines
Support Vector MachinesSupport Vector Machines
Support Vector Machines
 
Class 30: Sex, Religion, and Politics
Class 30: Sex, Religion, and PoliticsClass 30: Sex, Religion, and Politics
Class 30: Sex, Religion, and Politics
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Keywords and examples of machine learning

  • 1. Machine learning: Keywords + Applications 1) Applications of machine learning - wind power forecasting (important e.g. for PengHu island!) - rainfalls estimation 2) Some key words (you must know what they mean): - black box / white box - shrinking horizon - objective function - “what you get is what you have” - model complexity - cross-validation - generative model - quantile, value-at-risk
  • 2. What you will see in these slides 1) Applications of machine learning - wind power forecasting (important e.g. for PengHu island!) - rainfalls estimation 2) Some key words (you must know what they mean): - black box / white box - shrinking horizon - objective function - “what you get is what you have” - model complexity - cross-validation - generative model - quantile, value-at-risk
  • 3. I want to produce electricity
  • 4. I want to produce electricity I have: - water for hydroelectricity - a nuclear power plant - wind farms - gas turbines
  • 5. I want to produce electricity I must ensure, for each time step: Production of electricity = Demand of electricity Demand(t0), Demand(t1), Demand(t2), Demand(t3) known.
  • 6. I want to produce electricity We get four equations: Production(t0) = Demand(t0) Production(t1) = Demand(t1) Production(t2) = Demandt(2) Production(t3) = Demand(t3) Other equation: Production = hydro-production + nuclear-production + wind-farm production + gas production
  • 7. I want to produce electricity We get four equations: H(t0)+W(t0)+N(t0)+G(t0) = Demand(t0) H(t1)+W(t1)+N(t1)+G(t1) = Demand(t1) H(t2)+W(t2)+N(t2)+G(t2) = Demandt(2) H(t3)+W(t3)+N(t3)+G(t3) = Demand(t3) Stock level for Hydro depends on production x(1) = x(0)-H(0) x(2) = x(1)-H(1) x(3) = x(2)-H(2) x(4) = x(3)-H(3)
  • 8. Also depends on inflows We get four equations: H(t0)+W(t0)+N(t0)+G(t0) = Demand(t0) H(t1)+W(t1)+N(t1)+G(t1) = Demand(t1) H(t2)+W(t2)+N(t2)+G(t2) = Demandt(2) H(t3)+W(t3)+N(t3)+G(t3) = Demand(t3) Stock level for Hydro: x(0); constraint: x(i) >= 0 x(1) = x(0)+I(0)-H(0) x(2) = x(1)+I(1)-H(1) x(3) = x(2)+I(2)-H(2) x(4) = x(3)+I(3)-H(3)
  • 9. 8 equations (yes, it increases...) H(t0)+W(t0)+N(t0)+G(t0) = Demand(t0) H(t1)+W(t1)+N(t1)+G(t1) = Demand(t1) H(t2)+W(t2)+N(t2)+G(t2) = Demandt(2) H(t3)+W(t3)+N(t3)+G(t3) = Demand(t3) X(0) + I(0) – H(0) >=0 X(0) + I(0) – H(0) + I(1) – H(1) >=0 X(0) + I(0) – H(0) + I(1) – H(1) + I(2) – H(2) >=0 X(0) + I(0)–H(0) +I(1)– H(1) +I(2)–H(2) +I(3)-H(3)>=0
  • 10. 8 equations (yes, it increases...) Nuclear has constraints as well: - N(1) in f(N(0)) - N(2) in f(N(1)) - N(3) in f(N(2)) (very simplified; in fact there are stocks, refills...)
  • 11. Ok! Summary ? W(0), W(1), W(2), W(3) wind farms production = can not be chosen and W(1), W(2), W(3) unknown! To be chosen: G(0), G(1), G(2), G(3) gas turbines production H(0), H(1), H(2), H(3) hydroelectric production (can be somehow negative) N(0), N(1), N(2), N(3) nuclear power
  • 12. Ok! Summary ? To be chosen: G(0), G(1), G(2), G(3) gas turbines production H(0), H(1), H(2), H(3) hydroelectric production (can be somehow negative) N(0), N(1), N(2), N(3) nuclear power Constraints: production plans must satisfy constraints. E.g.: if unlimited gas turbines production, we might decide G(0)=demand(0)-W(0), G(1)=demand(1)-W(1), G(2)=demand(2)-W(2), G(3)=demand(3)-W(3) ==> it is a feasible solution
  • 13. Ok! Summary ? To be chosen: G(0), G(1), G(2), G(3) gas production H(0), H(1), H(2), H(3) hydroelectric production (can be somehow negative) N(0), N(1), N(2), N(3) nuclear power Constraints: production plans must satisfy constraints. E.g.: if unlimited gas production, we might decide G(0)=demand(0)-W(0), G(1)=demand(1)-W(1), G(2)=demand(2)-W(2), G(3)=demand(3)-W(3) ==> it is a feasible solution ==> it is a bad feasible solution
  • 14. Ok! Summary ? To be chosen: G(0), G(1), G(2), G(3) gas production H(0), H(1), H(2), H(3) hydroelectric production (can be somehow negative) N(0), N(1), N(2), N(3) nuclear power Constraints: production plans must satisfy constraints. E.g.: if unlimited gas production, we might decide G(0)=demand(0)-W(0), G(1)=demand(1)-W(1), G(2)=demand(2)-W(2), G(3)=demand(3)-W(3) ==> it is a feasible solution ==> it is a bad feasible solution Objective function: not all solutions are equivalent!
  • 15. Ok! Summary ? Production cost: Hcost * (H0+H1+H2+H3) + Ncost * (N0+N1+N2+N3) + Gcost * (G0+G1+G2+G3) + Wcost* (W0+W1+W2+W3) Nb: Cost does not only mean $. Cost means ecological & environmental costs as well.
  • 16. Quizz ! So we have: x0,x1,x2,x3: states at time t0, t1, t2, t3. x0 is given, x1, x2, x3 depend on our decisions. Some decisions are chosen at time t0. Some decisions are chosen at time t1. Some decisions are chosen at time t2. Some decisions are chosen at time t3. The cost depends on all decisions. Is this a supervised learning problem ? Is this a reinforcement learning problem ? Is this a boring problem ?
  • 17. Ok! Summary ? So we have equations. If we know W(1),W(2),W(3), we can evaluate the production cost. We want to: - solve equations - minimize production cost Problem: we don't know W(1), W(2), W(3). How to know ?
  • 18. Ok! Summary ? We want to know W(1), W(2), W(3). Steps: (1) Weather simulation: we predict the wind at time steps t1 t2 t3 (as in classical weather forecast) (2) From the wind forecast, predict the power (e.g. “black box” model): Based on data E.g. mean-square error Predicting W(1), W(2), W(3): Boring problem ? Supervised learning problem ? Reinforcement learning problem ?
  • 19. Ok! Summary ? We want to know W(1), W(2), W(3). Steps: (1) Weather simulation: we predict What does the wind at time steps t1 t2 t3 (as in classical box” “black weather forecast) mean ? (2) From the wind forecast, predict the power (e.g. “black box” model): Based on data E.g. mean-square error Predicting W(1), W(2), W(3): Boring problem ? Supervised learning problem ? Reinforcement learning problem ?
  • 20. Difficulties ? In many cases, you will see in your life as an engineer that: - collecting datas and models is a big part of the work - solving the problem exactly is impossible - what really matters in an application is to find where the current codes are not satisfactory, and not to spend time on other aspects
  • 21. Typical questions for this application Many constraints / effects are missing ! (for the real application, we must have far more constraints...)
  • 22. Typical questions for this application Many constraints / effects are missing ! Mean square (for the real error in the application, supervised we must have far learning for more W1,W2,W3 ? constraints...) But .......... ................ .................
  • 23. Typical questions for this application Many constraints / effects are missing ! (for the real How many time Mean square application, steps in the future error in the we must have far should supervised more we consider ? learning for W1,W2,W3 ? constraints...) But .......... ................ .................
  • 24. Typical questions for this application Many constraints / effects are missing ! (for the real How many time Mean square application, steps in the future error in the we must have far should supervised more we consider ? learning for W1,W2,W3 ? constraints...) But .......... ................ ................. We should penalize cases with W4 small !
  • 25. Typical questions for this application Many constraints / effects are missing ! (for the real How many time Mean square application, steps in the future error in the we must have far should supervised more we consider ? learning for W1,W2,W3 ? constraints...) But .......... ................ In case of long ................. term: should we We should consider “climate change” penalize cases with W4 bias ? small !
  • 26. Some of these points Typical questions for are important, some are negligible, this application depending on the system Many constraints / under analysis. effects are missing ! (for the real How many time Mean square application, steps in the future error in the we must have far should supervised more we consider ? learning for W1,W2,W3 ? constraints...) But .......... ................ In case of long ................. term: should we We should consider “climate change” penalize cases with W4 bias ? small !
  • 27.
  • 28. Another beautiful application This is Paris. Beautiful town. With plenty of people (10 millions in IDF).
  • 29. Another beautiful application This is Paris. Beautiful town. With plenty of people (10 millions in IDF). Producing plenty of fecal matter ==> dirty water.
  • 30. Our river in Paris is the “Seine”. A French politician said he would soon swim across it. After all, he never did it. For your health, don't do it. Nevertheless, we try to keep it as clean as possible.
  • 31. Dirty water should be separated from the Seine. And usually it is. Something like this: Seine Dirty water
  • 32. Problem: if big rainfalls reach dirty water, then dirty water might pollute the Seine Seine Dirty water
  • 33. No typhoon in France. But we can have heavy rains/winds in Paris: - 0.96 dm in 24 hours happened in 1987. - gusts at 169 km/h in 1999 (very unusual in France) Problem: if big rainfalls reach dirty water, then dirty water might pollute the Seine Seine Dirty water (yes, in Taiwan it is more impressive, sometimes it is 16.7 dm in 24 hours and gusts can reach 250 km/h...)
  • 34. No typhoon in France. But we can have heavy rains/winds in Paris: - 0.96 dm in 24 hours happened in 1987. - gusts at 169 km/h in 1999 (very unusual in France) Problem: if big rainfalls reach dirty water, then dirty water might pollute the Seine Seine Dirty water (yes, in Taiwan it is more impressive, sometimes it is 16.7 dm in 24 hours and gusts can reach 250 km/h...)
  • 35. No typhoon in France. But we can have heavy rains/winds in Paris: - 0.96 dm in 24 hours happened in 1987. - gusts at 169 km/h in 1999 (very unusual in France) Problem: if big rainfalls reach dirty water, then dirty water might pollute the Seine Seine Dirty water → Seine!(yes, in Taiwan it is more impressive, sometimes it is 16.7 dm in 24 hours and gusts can reach 250 km/h...)
  • 36. Another beautiful application Three water networks: - dirty water: should go to cleaning stations - clean water: can go to the Seine, but can't be drunk - drinkable water (France: tap water = drinkable)
  • 37. Big water network Dirty Dirty Dirty Dirty water water water water Clean Clean Clean Clean water water water water
  • 38. Water vs dirty water Challenge: Summer storms. Not comparable to a Taiwanese typhoon. But a lot of water. Can make dirty water become very big. Can invade clean water. Your mission: - Get read of dirty water - Protect clean water
  • 39. Water vs dirty water State: level in each stock, valves' status (open or closed) At each time step, rainfalls(i) liters of water reach stock i. you can open or close valves ==> get a new state. Your mission: - Get read of dirty water - Protect clean water
  • 40. Water vs dirty water Typically: (0, 1, 0, 0, 0, 1, 0, 1, 0.42, 0.2, 0.0, 0.8, 0.3) (valves) (stock levels) Plenty of rules: - if (valve 4 opens, then water from stock 1 goes to stock 2 at rate 0.02m3/s) - if (stock[2]>0.3) then dirty water ==> Seine, 3 0.1m /s ==> Miminize the quantity of dirty water in clean stocks at the end of the storm
  • 41. Water vs dirty water D-dimensional vector Equations: Stocks(t+1) = complicatedFunction(Stocks(t), rainfalls(t), valves(t)) D-dimensional vector (D=number of stocks) D-dimensional V-dimensional vector vector (D=number of stocks) (V=number of valves)
  • 42. Water vs dirty water To be decided: valves(t) for each t If there are 240 times steps, we get 240 x V decision V-dimensional vector variables (V=number of valves) Criterion = objective function = quantity of dirty water reaching the clean network + quantity of dirty water in the river
  • 43. Shrinking horizon Too many time steps! At each time step, make a decision using only 30 time steps. Move this window of 30 time steps.
  • 44. Shrinking horizon Too many time steps! At each time step, make a decision using only 30 time steps. Move this window of 30 time steps.
  • 45. Shrinking horizon Too many time steps! At each time step, make a decision using only 30 time steps. Move this window of 30 time steps.
  • 46. Shrinking horizon Too many time steps! At each time step, make a decision using only 30 time steps. Move this window of 30 time steps.
  • 48. Summary ? Is this: - an optimization problem ? - a reinforcement learning problem ? - a supervised machine learning problem ?
  • 49. Summary ? Is this: - an optimization problem ? - a reinforcement learning problem ? - a supervised machine learning problem ? Problem: rainfalls are unknown.
  • 50. How to predict rainfalls ? In fact, there are distinct rainfalls: - R1: a spatial distribution of rainfalls (one number per time step per point of the map) - R2: a underground list of rainfall arrivals (inflows), per stocks (D-dimensional) Input data: - weather forecast of archive ( R1(t) for each t) - archives of weather forecast R1(t) - archives of inflows R2(t)
  • 51. If your life was depending on it, what would you do ?
  • 52. If your life was depending on it, what would you do ? We are at time t. We need a forecaster: - which takes available data as input - and outputs R2(t') for t'>=t (why not for t' < t ?)
  • 53. If your life was depending on it, what would you do ? We are at time t. We need a forecaster: - which takes available data as input - and outputs R2(t') for t'>=t (why not for t' < t ?) (R2(t+1),R2(t+2),R2(t+3), .... , R2(t+30)) =?
  • 54. If your life was depending on it, what would you do ? We are at time t. We need a forecaster: - which takes available data as input - and outputs R2(t') for t'>=t (why not for t' < t ?) (R2(t+1),R2(t+2),R2(t+3), .... , R2(t+30)) = f( R1(t) ) ?
  • 55. If your life was depending on it, what would you do ? We are at time t. We need a forecaster: - which takes available data as input - and outputs R2(t') for t'>=t (why not for t' < t ?) (R2(t+1),R2(t+2),R2(t+3), .... , R2(t+30)) = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50) ) (because there are delays)
  • 56. If your life was depending on it, what would you do ? We are at time t. We need a forecaster: - which takes available data as input - and outputs R2(t') for t'>=t (why not for t' < t ?) (R2(t+1),R2(t+2),R2(t+3), .... , R2(t+30)) = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) ) (because “what you get is what you have”)
  • 57. If your life was depending on it, what would you do ? = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) ) and then agregation: = f( R1(t), R1(t-1)+R1(t-2), R1(t-3)+R1(t-4)+R1(t-5)+R1(t-6), +..., R2(t) ) Why ?
  • 58. If your life was depending on it, what would you do ? = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) ) and then agregation: = f( R1(t), R1(t-1)+R1(t-2), R1(t-3)+R1(t-4)+R1(t-5)+R1(t-6), +..., R2(t) ) Because less parameters.
  • 59. If your life was depending on it, what would you do ? = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) ) and then agregation: = f( R1(t), R1(t-1)+R1(t-2), R1(t-3)+R1(t-4)+R1(t-5)+R1(t-6), +..., R2(t) ) Because less parameters. Rule of thumb: number of parameters less than number of data points / 20 <=== why ?
  • 60. If your life was depending on it, what would you do ? = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) ) and then agregation: = f( R1(t), R1(t-1)+R1(t-2), R1(t-3)+R1(t-4)+R1(t-5)+R1(t-6), +..., R2(t) ) Because less parameters. Rule of thumb: number of parameters less than number of data points / 20 <=== why ? How to choose between all these models ?
  • 61. If your life was depending on it, what would you do ? = f( R1(t), R1(t-1), R1(t-2), R1(t-3), R1(t-4), ..., R1(t-50), R2(t) ) and then agregation: = f( R1(t), R1(t-1)+R1(t-2), R1(t-3)+R1(t-4)+R1(t-5)+R1(t-6), +..., R2(t) ) Because less parameters. Rule of thumb: number of parameters less than number of data points / 20 <=== why ? How to choose between all these models ? Cross-validation.
  • 62. Main weakness of this analysis ? The same as in the previous application. We predicted R2(t), R2(t+1), .... Then we maximize cleanness based on these forecasts. But there are huge uncertainties.
  • 63. Main weakness of this analysis ? This is often done in real world. No change on the Often, we do not spend time optimization algorithm on checking that the consequences are minor. (we are just pessimistic in the forecasts) “Cheap” solutions (do not take too much time): - predicting a quantile (do you know how ?) instead of a conditional expectation and check on simulations - predicting a conditional expectation + moments (do you know how ?) Then, optimize on average (slight change in the objective function)
  • 64. What about an exact solution ? The exact solution is much harder to implement. We can use forecasts with moments. Then, we get a MDP. Then, this is reinforcement learning. - simple: forecasting + optimizing - a bit more complex: pessimistic forecasting + optimizing - more complex: forecasting with moments + optimizing on average or optimizing a quantile (“value at risk”) - advanced: full reinforcement learning model
  • 65. What about an exact solution ? The best choice depends on the precision of your model, the budget you have. Some problems involve billions of US $ and have precise models. Then, each percent of improvement represents more money than all your professional life. Then, you can (must) implement something very advanced. Sometimes, model are very imprecise. Then, optimizing at 0.001% is meaningless. Improving the model is more important. - simple: forecasting + optimizing - a bit more complex: pessimistic forecasting + optimizing - more complex: forecasting with moments + optimizing on average or optimizing a quantile (“value at risk”) - advanced: full reinforcement learning model
  • 66. What do you think ? Did you understand ? 1) Applications of machine learning - wind power forecasting (important e.g. for PengHu island!) - rainfalls estimation 2) Some key words (you must know what they mean): - black box / white box - shrinking horizon - objective function - “what you get is what you have” - model complexity - cross-validation - generative model - quantile, value-at-risk ===> olivier.teytaud@inria.fr