SlideShare a Scribd company logo
1 of 1
Download to read offline
Joint
        Research
                                                                              Comparing Goodness-of-fit Measures for Calibration
                                                                              Comparing Goodness-of-fit Measures for Calibration                                                                                                                                                                                                                                                                  of
                                                                                                                                                                                                                                                                                                                                                                                                  of
         Centre
     EGU2012-11549
     Session: HS1.3
                                                                                      Models Focused on Extreme Events
                                                                                      Models Focused on Extreme Events
      Apr 23th, 2012                                                                                                                                    Mauricio Zambrano-Bigiarini and Alberto Bellin
                                                                                                                                                        Mauricio Zambrano-Bigiarini and Alberto Bellin
  1) Motivation                                                                                                                                                                                                                                                                                                                                             4) Results
  Despite serious and well-known limitations ( e.g., Legates and
  McCabe, 1999), many single objective goodness-of-fit measures
  are still of widespread use. As an example, the Nash-Sutcliffe                                     3.1) Nash-Sutcliffe efficiency : 3.6) Relative Nash-Sutcliffe efficiency :                                                Nomenclature
  efficiency (NSE) has been highly criticised as an inappropriate                                          (Nash and Sutcliffe, 1970)
                                                                                                                             N
                                                                                                                                                              (Krausse et al., 2005)                                           ●   Si : i-th simulated value
                                                                                                                                                                                                               2
  benchmark for comparing modelling results to observations                                                                                                                              N
                                                                                                                                                                                                     Oi−S i
                                                                                                                                                                                                 (            )
                                                                                                                                            2
                                                                                                                         ∑ ( O i − Si )                                              ∑                                         ●   Oi : i-th observed value
  (e.g., Schaefli and Gupta 2007), nonetheless, it is still one of                                        NSE =1−        i =1
                                                                                                                                                                                     i =1             Oi
                                                                                                                           N
                                                                                                                                            2                        rNSE =1−                                                  ●   j : arbitrary power, i.e, positive integer
  the most common performance measures used by both                                                                      ∑ ( O i− O )                                                    N
                                                                                                                                                                                                     Oi− O     2


  environmental scientists and practitioners.
                                                                                                                         i =1                                                        ∑
                                                                                                                                                                                     i =1
                                                                                                                                                                                                 (     O      )                ●


                                                                                                                                                                                                                               ●
                                                                                                                                                                                                                                   Ō : mean observed value
                                                                                                                                                                                                                                   Ôi : median of the observed values in
                                                                                                     3.2) Index of Agreement:                             3.7) Relative Index of Agreement:                                            the same month than Oi
  2) Aim                                                                                                   (Willmot, 1981)
                                                                                                                                 N
                                                                                                                                                               (Krausse et al., 2005)
                                                                                                                                                                                             N                     2
                                                                                                                                                                                                                               ●     r     : Pearson's product-moment
                                                                                                                                                                                                         Oi− S i
                                                                                                                                                                                                     (             )
                                                                                                                                                2
                                                                                                                             ∑ ( O i − Si )                                                                                              correlation coefficient
  To provide practical guidance about how different goodness-                                                                                                                            ∑
                                                                                                          d =1−               i =1                                                                        Oi                        α : ratio between the standard
  of-fit measures reported in literature perform when used within                                                                                                                         i =1                                 ●
                                                                                                                   N
                                                                                                                                                    2
                                                                                                                                                                     rd =1−    N                                           2             deviation of simulations (σs) and
                                                                                                                  ∑ (∣Si− O∣+∣O i− O∣)                                                   ∣Si − O∣+ ∣Oi −O∣
  a single-objective optimisation procedure, both for the
  identification of model parameters and in the reproduction of
                                                                                                                  i =1                                                        ∑
                                                                                                                                                                              i =1
                                                                                                                                                                                     (           O                     )                 observations (σo)
                                                                                                                                                                                                                               ●   β : ratio between the mean of the
  high- and low-flow events.                                                                         3.3) Coefficient of Persistence: 3.8) Modified Nash-Sutcliffe efficiency :                                                       simulations (μs) and observations
                                                                                                           (Kitanidis & Bras, 1980)                           (Legates and McCabe, 1999)                                              (μo)
                                                                                                                         N                                                                   N
                                                                                                                                        2

  3) Methodology                                                                                                       ∑ ( S i− O i )                                                    ∑ ∣Oi −Si∣j                           ●   ωi : weight in [0,1] applied to both
                                                                                                                    i =2
                                                                                                          cp =1−    N                                                mNSE =1− i =1                                                    observed and simulated values at                                                                                                                            Fig 08. Boxplots summarizing the 5th and 50th
  3.1) Study Area
                                                                                                                                                                                N
                                                                                                                                                                                                                                      time step i
                                                                                                                   ∑ ( O i− Oi−1 )2                                                      ∑ ∣Oi −O∣j                                                                                                                                      Fig 07. Boxplots summarizing the parameter values
                                                                                                                                                                                                                                                                                                                                                                                                  percentiles of daily discharge obtained with calibrations
                                                                                                                   i =2                                                                  i =1                                  ●   λ : number in [0, 1] representing the                                                                                                                          focused on low flows. Only the best half of the total
                                                                                                                                                                                                                                       weight given to the high-flow part                                                                obtained with calibrations focused on low flows. Only
                                                                                                                                                                                                                                                                                                                                                                                                  parameter sets obtained during calibration are
                                                                                                     3.4) Volumetric Efficiency:                          3.9) Modified Index of Agreement :                                           of the signal, with λ close to 1                                                                  the best half of the total parameter sets obtained
                                                                                                                                                                                                                                                                                                                                                                                                  considered for each calibration exercise. Grey vertical
                                                                                                           (Criss and Winston, 2008)                          (Legates and McCabe, 1999)                                               when focusing on high-flow                                                                        during calibration are considered for each calibration
                                                                                                                                                                                                                                                                                Fig 05. Boxplots summarizing the parameter values                                                                 lines indicate the value of the observed 5th and 50th
                                                                                                                                                                                          N                                            events, and λ close to zero when                                                                  exercise.
                                                                                                                         N                                                                                                                                                      obtained with calibrations focused on high flows. Only                                                            percentiles.
                                                                                                                       ∑ ∣Oi −Si∣                                                        ∑ ( O i− S i ) j                              focusing in low-flow conditions
                                                                                                                                                                                                                                                                                the best half of the total parameter sets obtained
                                                                                                                                                                                         i =1
                                                                                                          VE =1− i=1 N                                               d j=1−                                                        OL ,OH : user-defined thresholds used        during calibration are considered for each calibration
                                                                                                                                                                                                                                                                                                                                                    8) Conclusions
                                                                                                                                                                                                                               ●
                                                                                                                                                                               N
                                                                                                                                                                                                                       j                                                        exercise.
                                                                                                                             ∑ Oi                                             ∑ (∣Si −O∣+ ∣Oi −O∣)                                   to separate low and high values,
                                                                                                                                                                                                                                     respectively. In order to avoid
                                                                                                                             i =1                                             i =1
                                                                                                                                                                                                                                     subjectivity in the selection of OL                                                                        ●
                                                                                                                                                                                                                                                                                                                                                    Large underestimation of observed low flows (~40%) were
                                                                                                     3.5) Kling-Gupta efficiency:                         3.10) Weighted Seasonal Nash-Sutcliffe:                                    and OH , we use the flow duration                                                                              obtained with simulated values calibrated by using NSE and the
                                                                                                                                                                (this work)                  N
                                                                                                            (Gupta et al., 2009)                                                                                                     curve criterion proposed by Yilmaz
                                                                                                                                                                                         ∑ ∣ω i ( Oi −Si )∣j                         et al., 2008                                                                                                   KGE. Such underestimation is commonly masked out by the
                                                                                                       KGE =1− √( r −1)2 + (α −1)2+ (β−1)2                           wsNSE =1− i=1
                                                                                                                                                                                N                                                                                                                                                                   good overall fit in terms of NSE, KGE and other statistics.
                                                                                                                                                                                         ∑ ∣ω i (O i− Oi )∣j
                                                                                                                                                                                                      ̂
                                                                                                                 Cov s, o σs     μs                                                      i =1

                                                                                                              r = σ σ α = σ o β= μ o
                                                                                                                                                                                                                                                                                                                                                    Low-flows calibrations:

                                                                                                                                                                 {                                                             }
                                                                                                                   s o                                                          λ             ,    O i⩾OH

                                                                                                     3.6) Seasonal Nash-Sutcliffe:                           ω i = (1−λ )+
                                                                                                                                                                           (2 λ −1)( Oi− OL )
                                                                                                                                                                                               , OL < Oi < O H
                                                                                                                                                                                                                                                                                                                                                ●
                                                                                                                                                                                                                                                                                                                                                    rNSE and wsNSE (j=1, λ=0) perform the best (in terms of 5th
                                                                                                                                                                                O H −OL
                                                                                                           (Adapted from Garrick et al., 1978)
                                                                                                                                                                              1−λ             ,    O i⩾OL
                                                                                                                                                                                                                                                                                                                                                    percentile), and both of them provide also a good
                                                                                                                                 N

                                                                                                                             ∑ ∣Oi −Si∣j                                                                                                                                                                                                            representation of medium flows (50th percentile) with all the
                                                                                                          sNSE =1− iN
                                                                                                                    =1
                                                                                                                                                                                                                                                                                                                                                    other measures overestimating them.
                                                                                                                             ∑ ∣O i−O i∣j
                                                                                                                                    ̂                                                                                                                                                                                                           ●
                                                                                                                                                                                                                                                                                                                                                    NSE, KGE and d tend to underestimate GW_DELAY in
                                                                                                                             i =1

                                                                                                                                                                                                                                                                                                                                                    comparison to to the other goodness-of-fit measures.
Fig 01. Location of the Ega River Basin, meteorological stations, and discharge station used for
the calibration of the upper catchment.
                                                                                                                                                                                                                                                                                                                                                    High-flows calibrations:
  3.2) Calibration Procedure                                                                                                                                                                                                                                                                                                                    ●
                                                                                                                                                                                                                                                                                                                                                    wsNSE (j=2, λ=0.95), d and KGE perform the best (in terms of
  The Soil and Water Assessment Tool (SWAT) version 2005                                                                                                                                                                                                                                                                                            95th percentile). At the same time, only wsNSE provides a good
  was calibrated for the period Jan/1961-Dec/1970, using the                                                                                                                                                                                                                                                                                        representation of medium flows (50th percentile), with all the
  first year as warming up period.                                                                                                                                                                                                                                                                                                                  other goodness-of-fit measures overestimating them.
  A set of 9 parameters was selected for calibration:                                                                                                                                                                                                                                                                                           ●
                                                                                                                                                                                                                                                                                                                                                    The optimal value and variability of parameters related to the
                         Parameter                                         Min       Max             Fig 02. Discharge time series corresponding to the outlet of the upper part of the Ega River Basin (Ega en Estella stream gauge, Q071).                                                                                                        slow response of the catchment (GW_DELAY and ALPHA_BF)
 Base flow alpha factor [days]                            ALPHA_BF 1.00E-1        9.90E-1            Horizontal blue and red lines show the discharge values used to separate high and low flows, respectively (see Fig 03).
 Manning's “n” value for the main channel [-]             CH_N2        1.60E-2    1.50E-1
                                                                                                                                                                                                                                                                                                                                                    was very similar among all the goodness-of-fit measures tested
                                                                                                                                                                                                                                                                                   Fig 06. Boxplots summarizing the 50th and 95th
 Initial SCS CN II value [-]                              CN2          4.00E+1    9.50E+1                                                                                                                                                                                          percentiles of daily discharge obtained with                     (both close to zero), with KGE and wsNSE presenting the
 Saturated hydraulic conductivity [mm/hr]                 SOL_K        1.00E-3    1.00E+3                                                                                                                                                                                          calibrations focused on high flows. Only the best half           largest spread. However, those values are very different from
                                                                                                                                                                                                                                                                                   of the total parameter sets obtained during calibration
 Available water capacity, [mmH2O/mm soil]                SOL_AWC      1.00E-2    3.50E-1
                                                                                                                                                                                                                                                                                   are considered for each calibration exercise. Grey               the ones obtained during the calibration focused on low flows.
 Effective hydraulic conductivity in main channel [mm/hr] CH_K2        0.00E+0    2.00E+2                                                                                                                                                                                          vertical lines indicate the value of the observed 5th
 Soil evaporation compensation factor [-]                 ESCO         1.00E-2    1.00E+0                                                                                                                                                                                          and 50th percentiles.
 Surface runoff lag time [days]                           SURLAG       1.00E+0    1.20E+1                                                                                                                                                                                            References:
 Snowfall temperature [°C]                                SFTMP        -5.00E+0 5.00E+0                                                                                                                                                                                              ●
                                                                                                                                                                                                                                                                                         Criss, R., Winston, W., 2008. Do Nash values have value? Discussion and alternate proposals. Hydrological Processes 22, 2723–2725
  Calibration was carried out using Particle Swarm Optimisation                                                                                                                                                                                                                      ●
                                                                                                                                                                                                                                                                                         Garrick, M., Cunnane, C., Nash, J.E., 1978. A criterion of efficiency for rainfall-runoff models. Journal of Hydrology 36, 375–381
                                                                                                   Fig 03. Daily flow duration curve corresponding to                                                                                                                                    Kennedy, J., and R. Eberhart (1995), Particle swarm optimization, in Proceedings IEEE International Conference on Neural Networks, 1995, vol. 4, pp. 1942–1948,
  (PSO, Kennedy and Eberhart, 1995),    with 20 particles, 300
                                                                                                                                                                                                                                                                                     ●


                                                                                                   the outlet of the upper part of the Ega River Basin           Fig 04. Weighting values used in wsNSE, which take into account the skewness of the
                                                                                                                                                                                                                                                                                     ●
                                                                                                                                                                                                                                                                                         Kitanidis, P.K., Bras, R.L., 1980. Real-time forecasting with a conceptual hydrologic model 2. applications and results. Water Resources Research 16, 1034–1044.
  iterations, ω=1/(2log2), c1=c2=0.5+log2, linearly decreasing                                     (Q071). Verticl black lines show the discharge                observed daily discharges. Red and blue lines correspond to the weights used when                                   ●
                                                                                                                                                                                                                                                                                         Krause, P., Boyle, D., Bäse, F., 2005. Comparison of different efficiency criteria for hydrological model assessment. Advances in Geosciences 5, 89–97..
  Vmax from 1.0 to 0.5, and random topology (see EGU2012-10950                                     values used to separate high and low flows                    focusing on low- and high-flow events, respectively. On the left panel, the horizontal axis                         ●
                                                                                                                                                                                                                                                                                         Legates, D., McCabe Jr., G., 1999. Evaluating the use of “goodness-of-fit” measures in hydrologic and hydroclimatic model validation. Water Resources Research 35,
                                                                                                   following Yilmaz et. al., 2008)                               represent discharges values of the Ega River Basin (Q071), while on the right panel the                                 233–241
  for further details).
                                                                                                                                                                                                                                                                                         Schaefli, B., Gupta, H., 2007. Do Nash values have value?. Hydrological Processes 21, 2075–2080.


www.jrc.europa.eu
                                                                                               Mauricio Zambrano-Bigiarini                                       horizontal axis represent the empirical CDF of the observed discharge values.                                       ●

                                                                                                                                                                                                                                                                                     ●
                                                                                                                                                                                                                                                                                         Willmott, C., 1981. On the validation of models. Physical Geography 2, 184–194.
                                                                                               European Commission • Joint Research Centre • Institute for Environment and Sustainability                                                                                            ●
                                                                                                                                                                                                                                                                                         Yilmaz, K., Gupta, H., Wagener, T., 2008. A process-based diagnostic approach to model evaluation: Application to the NWS distributed hydrologic model. Water
                                                                                               Tel. +39 0332 789588 • Email: mauricio.zambrano@jrc.ec.europa.eu                                                                                                                          Resources Research 44, W09417.

More Related Content

Recently uploaded

Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...RKavithamani
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 

Recently uploaded (20)

Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 

Featured

AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 

Featured (20)

AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 

Comparing Goodness-of-fit Measures for Calibration of Models Focused on Extreme Events (EGU 2012)

  • 1. Joint Research Comparing Goodness-of-fit Measures for Calibration Comparing Goodness-of-fit Measures for Calibration of of Centre EGU2012-11549 Session: HS1.3 Models Focused on Extreme Events Models Focused on Extreme Events Apr 23th, 2012 Mauricio Zambrano-Bigiarini and Alberto Bellin Mauricio Zambrano-Bigiarini and Alberto Bellin 1) Motivation 4) Results Despite serious and well-known limitations ( e.g., Legates and McCabe, 1999), many single objective goodness-of-fit measures are still of widespread use. As an example, the Nash-Sutcliffe 3.1) Nash-Sutcliffe efficiency : 3.6) Relative Nash-Sutcliffe efficiency : Nomenclature efficiency (NSE) has been highly criticised as an inappropriate (Nash and Sutcliffe, 1970) N (Krausse et al., 2005) ● Si : i-th simulated value 2 benchmark for comparing modelling results to observations N Oi−S i ( ) 2 ∑ ( O i − Si ) ∑ ● Oi : i-th observed value (e.g., Schaefli and Gupta 2007), nonetheless, it is still one of NSE =1− i =1 i =1 Oi N 2 rNSE =1− ● j : arbitrary power, i.e, positive integer the most common performance measures used by both ∑ ( O i− O ) N Oi− O 2 environmental scientists and practitioners. i =1 ∑ i =1 ( O ) ● ● Ō : mean observed value Ôi : median of the observed values in 3.2) Index of Agreement: 3.7) Relative Index of Agreement: the same month than Oi 2) Aim (Willmot, 1981) N (Krausse et al., 2005) N 2 ● r : Pearson's product-moment Oi− S i ( ) 2 ∑ ( O i − Si ) correlation coefficient To provide practical guidance about how different goodness- ∑ d =1− i =1 Oi α : ratio between the standard of-fit measures reported in literature perform when used within i =1 ● N 2 rd =1− N 2 deviation of simulations (σs) and ∑ (∣Si− O∣+∣O i− O∣) ∣Si − O∣+ ∣Oi −O∣ a single-objective optimisation procedure, both for the identification of model parameters and in the reproduction of i =1 ∑ i =1 ( O ) observations (σo) ● β : ratio between the mean of the high- and low-flow events. 3.3) Coefficient of Persistence: 3.8) Modified Nash-Sutcliffe efficiency : simulations (μs) and observations (Kitanidis & Bras, 1980) (Legates and McCabe, 1999) (μo) N N 2 3) Methodology ∑ ( S i− O i ) ∑ ∣Oi −Si∣j ● ωi : weight in [0,1] applied to both i =2 cp =1− N mNSE =1− i =1 observed and simulated values at Fig 08. Boxplots summarizing the 5th and 50th 3.1) Study Area N time step i ∑ ( O i− Oi−1 )2 ∑ ∣Oi −O∣j Fig 07. Boxplots summarizing the parameter values percentiles of daily discharge obtained with calibrations i =2 i =1 ● λ : number in [0, 1] representing the focused on low flows. Only the best half of the total weight given to the high-flow part obtained with calibrations focused on low flows. Only parameter sets obtained during calibration are 3.4) Volumetric Efficiency: 3.9) Modified Index of Agreement : of the signal, with λ close to 1 the best half of the total parameter sets obtained considered for each calibration exercise. Grey vertical (Criss and Winston, 2008) (Legates and McCabe, 1999) when focusing on high-flow during calibration are considered for each calibration Fig 05. Boxplots summarizing the parameter values lines indicate the value of the observed 5th and 50th N events, and λ close to zero when exercise. N obtained with calibrations focused on high flows. Only percentiles. ∑ ∣Oi −Si∣ ∑ ( O i− S i ) j focusing in low-flow conditions the best half of the total parameter sets obtained i =1 VE =1− i=1 N d j=1− OL ,OH : user-defined thresholds used during calibration are considered for each calibration 8) Conclusions ● N j exercise. ∑ Oi ∑ (∣Si −O∣+ ∣Oi −O∣) to separate low and high values, respectively. In order to avoid i =1 i =1 subjectivity in the selection of OL ● Large underestimation of observed low flows (~40%) were 3.5) Kling-Gupta efficiency: 3.10) Weighted Seasonal Nash-Sutcliffe: and OH , we use the flow duration obtained with simulated values calibrated by using NSE and the (this work) N (Gupta et al., 2009) curve criterion proposed by Yilmaz ∑ ∣ω i ( Oi −Si )∣j et al., 2008 KGE. Such underestimation is commonly masked out by the KGE =1− √( r −1)2 + (α −1)2+ (β−1)2 wsNSE =1− i=1 N good overall fit in terms of NSE, KGE and other statistics. ∑ ∣ω i (O i− Oi )∣j ̂ Cov s, o σs μs i =1 r = σ σ α = σ o β= μ o Low-flows calibrations: { } s o λ , O i⩾OH 3.6) Seasonal Nash-Sutcliffe: ω i = (1−λ )+ (2 λ −1)( Oi− OL ) , OL < Oi < O H ● rNSE and wsNSE (j=1, λ=0) perform the best (in terms of 5th O H −OL (Adapted from Garrick et al., 1978) 1−λ , O i⩾OL percentile), and both of them provide also a good N ∑ ∣Oi −Si∣j representation of medium flows (50th percentile) with all the sNSE =1− iN =1 other measures overestimating them. ∑ ∣O i−O i∣j ̂ ● NSE, KGE and d tend to underestimate GW_DELAY in i =1 comparison to to the other goodness-of-fit measures. Fig 01. Location of the Ega River Basin, meteorological stations, and discharge station used for the calibration of the upper catchment. High-flows calibrations: 3.2) Calibration Procedure ● wsNSE (j=2, λ=0.95), d and KGE perform the best (in terms of The Soil and Water Assessment Tool (SWAT) version 2005 95th percentile). At the same time, only wsNSE provides a good was calibrated for the period Jan/1961-Dec/1970, using the representation of medium flows (50th percentile), with all the first year as warming up period. other goodness-of-fit measures overestimating them. A set of 9 parameters was selected for calibration: ● The optimal value and variability of parameters related to the Parameter Min Max Fig 02. Discharge time series corresponding to the outlet of the upper part of the Ega River Basin (Ega en Estella stream gauge, Q071). slow response of the catchment (GW_DELAY and ALPHA_BF) Base flow alpha factor [days] ALPHA_BF 1.00E-1 9.90E-1 Horizontal blue and red lines show the discharge values used to separate high and low flows, respectively (see Fig 03). Manning's “n” value for the main channel [-] CH_N2 1.60E-2 1.50E-1 was very similar among all the goodness-of-fit measures tested Fig 06. Boxplots summarizing the 50th and 95th Initial SCS CN II value [-] CN2 4.00E+1 9.50E+1 percentiles of daily discharge obtained with (both close to zero), with KGE and wsNSE presenting the Saturated hydraulic conductivity [mm/hr] SOL_K 1.00E-3 1.00E+3 calibrations focused on high flows. Only the best half largest spread. However, those values are very different from of the total parameter sets obtained during calibration Available water capacity, [mmH2O/mm soil] SOL_AWC 1.00E-2 3.50E-1 are considered for each calibration exercise. Grey the ones obtained during the calibration focused on low flows. Effective hydraulic conductivity in main channel [mm/hr] CH_K2 0.00E+0 2.00E+2 vertical lines indicate the value of the observed 5th Soil evaporation compensation factor [-] ESCO 1.00E-2 1.00E+0 and 50th percentiles. Surface runoff lag time [days] SURLAG 1.00E+0 1.20E+1 References: Snowfall temperature [°C] SFTMP -5.00E+0 5.00E+0 ● Criss, R., Winston, W., 2008. Do Nash values have value? Discussion and alternate proposals. Hydrological Processes 22, 2723–2725 Calibration was carried out using Particle Swarm Optimisation ● Garrick, M., Cunnane, C., Nash, J.E., 1978. A criterion of efficiency for rainfall-runoff models. Journal of Hydrology 36, 375–381 Fig 03. Daily flow duration curve corresponding to Kennedy, J., and R. Eberhart (1995), Particle swarm optimization, in Proceedings IEEE International Conference on Neural Networks, 1995, vol. 4, pp. 1942–1948, (PSO, Kennedy and Eberhart, 1995), with 20 particles, 300 ● the outlet of the upper part of the Ega River Basin Fig 04. Weighting values used in wsNSE, which take into account the skewness of the ● Kitanidis, P.K., Bras, R.L., 1980. Real-time forecasting with a conceptual hydrologic model 2. applications and results. Water Resources Research 16, 1034–1044. iterations, ω=1/(2log2), c1=c2=0.5+log2, linearly decreasing (Q071). Verticl black lines show the discharge observed daily discharges. Red and blue lines correspond to the weights used when ● Krause, P., Boyle, D., Bäse, F., 2005. Comparison of different efficiency criteria for hydrological model assessment. Advances in Geosciences 5, 89–97.. Vmax from 1.0 to 0.5, and random topology (see EGU2012-10950 values used to separate high and low flows focusing on low- and high-flow events, respectively. On the left panel, the horizontal axis ● Legates, D., McCabe Jr., G., 1999. Evaluating the use of “goodness-of-fit” measures in hydrologic and hydroclimatic model validation. Water Resources Research 35, following Yilmaz et. al., 2008) represent discharges values of the Ega River Basin (Q071), while on the right panel the 233–241 for further details). Schaefli, B., Gupta, H., 2007. Do Nash values have value?. Hydrological Processes 21, 2075–2080. www.jrc.europa.eu Mauricio Zambrano-Bigiarini horizontal axis represent the empirical CDF of the observed discharge values. ● ● Willmott, C., 1981. On the validation of models. Physical Geography 2, 184–194. European Commission • Joint Research Centre • Institute for Environment and Sustainability ● Yilmaz, K., Gupta, H., Wagener, T., 2008. A process-based diagnostic approach to model evaluation: Application to the NWS distributed hydrologic model. Water Tel. +39 0332 789588 • Email: mauricio.zambrano@jrc.ec.europa.eu Resources Research 44, W09417.