SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Improving the Accuracy of Surrogate Models 
Using Inverse Transform Sampling 
Junqiang Zhang*, Achille Messac#, Jie Zhang*, and Souma Chowdhury* 
* Rensselaer Polytechnic Institute, Department of Mechanical, Aerospace, and Nuclear Engineering 
# Syracuse University, Department of Mechanical and Aerospace Engineering 
53rd AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics 
and Materials Conference 
8th AIAA Multidisciplinary Design Optimization Specialist Conference 
April 23 - 26, 2012 
Honolulu, Hawaii
Introduction 
• Sampling is an important component of optimization, numerical 
simulations, design of experiments and uncertainty analysis. 
• Surrogate modeling is concerned with the construction of 
approximation models to estimate the system performance, and to 
develop relationships between specific system inputs and outputs. 
• It is expected that an intelligent selection of sample points can 
increase the accuracy of surrogate models. 
2 
Surrogate
3 
Sampling Based on Probability Distribution 
• Observations of inputs often follow a distribution. 
• A set of sample points representative of the naturally 
occurring distribution of inputs is often desirable. 
Distribution of a population Sample points 
-20 
0 
20 
-20 
0 
6 
4 
2 
0 
20 
x 10-3 
x2 x1 
PDF 
x1 
x2 
20 
10 
0 
-10 
-20 
-20 -10 0 10 20
Presentation Outline 
4 
Research Objectives and Motivation 
Probability-based sampling methods overview 
Inverse transform sampling 
Surrogate model development 
• Surrogate model performance comparison 
• Performance in increasing sample space 
Concluding remarks
 Certain inputs occur more frequently, comprising regions of 
high interests in the condition space. 
 It is desirable to have higher accuracy in the system response 
(surrogate) in the regions of higher interest. 
5 
Motivation and Research Objectives 
Motivation: 
Objectives: 
 Develop a sampling strategy for surrogate model 
development, which promotes higher accuracy in regions of 
high interest (of the observed input).
Existing Probabilistic Sampling Strategies 
6 
• Rejection sampling 
• Importance sampling 
• Markov Chain Monte Carlo 
• Metropolis-Hastings Sampling 
• Gibbs Sampling
7 
Inverse Transform Sampling: Key Features 
 Inverse transform can 
• Sample more points in the regions where random variables 
have higher probability densities; and 
• Sample fewer points in the regions where random variables 
have low probability densities. 
 The probability of random variables is used as the metric of 
distance instead of the Euclidean distance. 
 Sample points are uniform in terms of the probability 
differences.
8 
Procedure: Step 1 
Random Variable Observations 
Distribution Function Fitting 
Generating the Sequence of CDFs 
Coordinates Evaluation 
Step 1 
Step 2 
Step 3 
Step 4 
The occurrence of sampling 
variables should be sufficiently 
observed. 
10 
5 
0 
-5 
-10 
-15 
-20 
-5 0 5 10 15 
x1 
x2
9 
Procedure : Step 2 
Approaches 
• The least squares method 
• The least absolute deviations method 
• The generalized method of moments 
• The Maximum Likelihood Estimation 
Random Variable Observations 
Distribution Function Fitting 
Generating the Sequence of CDFs 
Coordinates Evaluation 
Step 1 
Step 2 
Step 3 
Step 4 
-20 
0 
20 
-20 
0 
0.015 
0.01 
0.005 
0 
20 
x2 x1 
PDF
10 
Procedure : Step 3 
• CDF increases from 0 to 1. 
• Low-discrepancy sampling methods 
generate uniformly distributed 
sequences between 0 and 1 in all 
dimensions of a sample space. 
• Van der Corput sequence 
• Halton/Hammersley sequence 
• Sobol sequence 
• Faure sequence 
Random Variable Observations 
Distribution Function Fitting 
Generate the Sequence of CDFs 
Coordinates Evaluation 
Step 1 
Step 2 
Step 3 
Step 4 
1 
0.8 
0.6 
0.4 
0.2 
0 
0 0.2 0.4 0.6 0.8 1 
CDF(x1) 
CDF(x2)
11 
Procedure : Step 4 
• Coordinates are evaluated using the 
inverse function of CDF. 
• Analytical expressions 
• Numerical approaches 
• The Newton’s method 
• The Levenberg-Marquardt algorithm 
• The trust region methods 
Random Variable Observations 
Distribution Function Fitting 
Generating the Sequence of CDFs 
Coordinates Evaluation 
Step 1 
Step 2 
Step 3 
Step 4 
x1 
x2 
20 
10 
0 
-10 
-20 
-20 -10 0 10 20
12 
More Applications 
• More points 
-20 
0 
20 
-20 
0 
0.015 
0.01 
0.005 
0 
20 
x2 x1 
PDF 
x1 
x2 
20 
10 
0 
-10 
-20 
-20 -10 0 10 20 
x1 
x2 
20 
10 
0 
-10 
-20 
-20 -10 0 10 20 
-20 
0 
20 
-20 
0 
0.015 
0.01 
0.005 
0 
0.015 
0.01 
0.005 
0 
20 
x2 x1 
PDF 
x1 
x2 
20 
10 
0 
-10 
-20 
20 
10 
0 
-10 
-20 
-20 -10 0 10 20 
-20 
0 
20 
-20 
0 
20 
x2 x1 
PDF 
x1 
x2 
-20 -10 0 10 20 
31 
127 
• Multimodal functions 
Bi-modal 
Quad-modal
13 
Window Performance Evaluation 
• The heat transfer rate through a triple pane window is 
evaluated under varying climatic conditions. 
• A CFD model of the triple pane window is created. 
• Sample climatic conditions are boundary conditions of the 
window CFD model. 
Cross Section
14 
Window Performance Evaluation 
Step 1 Random Variable Observations 
Three climatic conditions: 
• Air temperature 
• Wind speed 
• Solar radiation 
. 
Michigan, ND. 
3720 hourly observations for either 
January or August from 2006 to 2010
15 
Window Performance Evaluation 
Step 2 Distribution Function Fitting 
Three climatic conditions: 
• Air temperature: Gaussian 
• Wind speed: Weibull 
• Solar radiation: Gamma 
Parameters are fitted using the Maximum Likelihood Estimation. 
Michigan, ND. 
3720 hourly observations for either 
January or August from 2006 to 2010
Three climatic conditions: 
• Air temperature: Gaussian 
• Wind speed: Weibull 
• Solar radiation: Gamma 
Parameters are fitted using the Maximum Likelihood Estimation. 
Michigan, ND. 
3720 hourly observations for either 
January or August from 2006 to 2010 
16 
Window Performance Evaluation 
Step 3 Generate the Sequence of CDFs 
1 
0.8 
0.6 
0.4 
0.2 
0 
270 280 290 300 310 
Temperature 
CDF 
1 
0.8 
0.6 
0.4 
0.2 
0 
0 5 10 15 
Wind Speed 
CDF 
1 
0.8 
0.6 
0.4 
0.2 
0 
0 500 1000 1500 
Solar radiation 
CDF 
Sobol sequence
Three climatic conditions: 
• Air temperature: Gaussian 
• Wind speed: Weibull 
• Solar radiation: Gamma 
Parameters are fitted using the Maximum Likelihood Estimation. 
Michigan, ND. 
3720 hourly observations for either 
January or August from 2006 to 2010 
17 
Window Performance Evaluation 
Step 4 Coordinates Evaluation 
1 
0.8 
0.6 
0.4 
0.2 
0 
270 280 290 300 310 
Temperature 
CDF 
1 
0.8 
0.6 
0.4 
0.2 
0 
0 5 10 15 
Wind Speed 
CDF 
1 
0.8 
0.6 
0.4 
0.2 
0 
0 500 1000 1500 
Solar radiation 
CDF
18 
Distribution of Sample Points 
• Sample climatic conditions for January 
• Sample climatic conditions for August 
Sample points crowd in the region where PDF is high.
19 
Surrogate Model Development 
• The heat transfer rate through the window is evaluated using 
31 sample climatic conditions for either January or August, 
respectively. 
• Two surrogate models are developed for January and August 
using Kriging, respectively. 
Outdoor temperature 
Wind speed 
Solar radiation 
Heat flux 
Kriging 
Inputs 
Output 
 In this paper, we use a Matlab Kriging 
toolbox DACE (Design and Analysis 
of Computer Experiments), developed 
by Dr. Nielsen.
20 
Surrogate Model Performance Criteria 
For January and August, 3720 climatic conditions are used to 
evaluate errors of each surrogate. 
The performance of the surrogate can be evaluated using: 
• Root Mean Squared Error (RMSE) 
• Root Mean Squared Percentage Error (RMSPE) 
• Maximum Absolute Error (MAE) 
• Maximum Percentage Error (MPE)
21 
Surrogate Model Performance Comparison 
Month Method RMSE MAE RMSPE MPE 
January Inverse 0.047 0.49 0.64% 7.2% 
Sobol 0.054 0.30 0.68% 9.3% 
August Inverse 0.079 0.54 11% 318% 
Sobol 0.094 0.32 85% 4373% 
• RMSE, RMSPE, and MPE: Inverse transform sampling 
performs better than the Sobol sequence. 
• MAE: Inverse transform sampling has a larger MAE values.
22 
Performance in Increasing Sample Space 
• All the hourly climatic conditions are classified into regions 
with increasing PDF values in the sample space. 
• The performance of the surrogate models is evaluated in 
increasing sample space. 
280 
285 
290 
295 
300 
305 
2 
4 
6 
8 
800 
600 
400 
200 
0 
10 
Wind speed (m/s) Temperature (K) 
Solar radiation (W/m2) 
100% 
… 
… 
0.8% 
0.1% 
3720 climatic conditions
Root Mean Squared Percentage Error 
23 
Performance in Increasing Sample Space 
The surrogate model for January 
Root Mean Squared Error 
Increasing percentage of sample space 
Increasing percentage of sample space
The surrogate model for January 
Maximum Percentage Error 
24 
Performance in Increasing Sample Space 
Maximum Absolute Error 
Increasing percentage of sample space 
Increasing percentage of sample space
Root Mean Squared Percentage Error 
25 
Performance in Increasing Sample Space 
The surrogate model for August 
Root Mean Squared Error 
Increasing percentage of sample space 
Increasing percentage of sample space
The surrogate model for August 
Maximum Percentage Error 
26 
Performance in Increasing Sample Space 
Maximum Absolute Error 
Increasing percentage of sample space 
Increasing percentage of sample space
27 
Conclusions 
• Inverse transform sampling is uniquely helpful for surrogate 
development where the system inputs follow a certain distribution. 
• The CDF of the inputs are made to follow a pseudorandom 
sequence (such as Sobol). 
• For window performance evaluation, the surrogate models 
developed using inverse transform sampling have lower root mean 
squared error than those developed using the Sobol sequence. 
• For window performance evaluation, the surrogate models 
developed using inverse transform sampling have higher maximum 
absolute error than those developed using the Sobol sequence.
28 
Future Work 
• Extend the applicability of inverse transform sampling to 
correlated multi-variate/multi-input systems.
Acknowledgement 
• I would like to acknowledge my research adviser 
Prof. Achille Messac, for his immense help and 
support in this research. 
• Support from the NSF Awards is also 
acknowledged. 
29
30 
Selected References 
• Husslage, B. G., Rennen, G., van Dam, E. R., and den Hertog, D., “Space-filling Latin Hypercube Designs for Computer 
Experiments,” Optimization and Engineering, Vol. 12, 2011, pp. 611–632. 
• Clarkson, K. L. and Shor, P. W., “Applications of Random Sampling in Computational Geometry, II,” Discrete and Computational 
Geometry, Vol. 4, 1989, pp. 387–421. 
• Goldreich, O., Computational Complexity: A Conceptual Perspective, Cambridge University Press, 1st ed., 2008. 
• LaValle, S. M., Planning Algorithms, Cambridge University Press, 2006. 
• Niederreiter, H., “Point Sets and Sequences with Small Discrepancy,” Monatshefte fr Mathematik, Vol. 104, December 1987, pp. 
273–337. 
• van der Corput, J. G., “Verteilungsfunktionen,” Nederl. Akad. Wetensch. Proc., Vol. 38, 1935, pp. 813–821. 
• Diaconis, P., “The Distribution of Leading Digits and Uniform Distribution Mod 1,” The Annals of Probability, Vol. 5, No. 1, Feb 
1977, pp. 72–81. 
• Sobol, I. M., “Uniformly Distributed Sequences with an Additional Uniform Property,” USSR Computational Mathematics and 
Mathematical Physics, Vol. 16, 1976, pp. 236–242. 
• Faure, H., “Discrpances de suites associes un systme de numration en dimension s,” Acta Arithmetica, Vol. 41, 1982, pp. 337–351. 
• Miller, F., Vandome, A., and John, M., Inverse Transform Sampling, VDM Verlag Dr. Mueller e.K., 2010. 
• von Neumann, J., “Various Techniques Used in Connection with Random Digits,” Nat. Bureau Stand. Appl. Math. Ser., Vol. 12, 
1951, pp. 3638. 
• Marshall, A. W., “The Use of Multi-stage Sampling Schemes in Monte Carlo Computations,” H. A. Meyer (ed.), Symposium on 
Monte Carlo Methods, edited by N. Y. John Wiley & Sons, Inc., 1956, p. 123140. 
• Gilks, W., Gilks, W., Richardson, S., and Spiegelhalter, D., Markov Chain Monte Carlo in Practice, Interdisciplinary Statistics, 
Chapman & Hall, 1996.
31 
Performance in Increasing Sample Space 
• All the hourly climatic conditions are classified into regions 
with increasing PDF values in the sample space. 
• For each variable, the probability is the integral of the fitted 
PDF in the shortest interval. 
• The performance of the surrogate models is evaluated in 
increasing sample space.
32 
Review 
• Sampling sequences 
• Latin hypercube 
• Random 
• Pseudorandom 
• Low-dispersion 
• Low-discrepancy 
• Generating sample points from a probability distribution 
• Inverse transform sampling 
• rejection sampling 
• importance sampling 
• Markov Chain Monte Carlo 
• Metropolis-Hastings Sampling 
• Gibbs Sampling
Comparisons and Analyses 
33 
Sobol sequence Inverse transform 
• A Voronoi diagram is a special kind of decomposition of a metric space 
determined by distances to a specified discrete set of points in the space. 
• Each point has a cell that includes the region closer to the point than to 
any others. 
• The lines are equidistant to the two nearest points.

Weitere ähnliche Inhalte

Was ist angesagt?

EFFECTS OF MET DATA PROCESSING IN AERMOD CONCENTRATIONS
EFFECTS OF MET DATA PROCESSING IN AERMOD CONCENTRATIONSEFFECTS OF MET DATA PROCESSING IN AERMOD CONCENTRATIONS
EFFECTS OF MET DATA PROCESSING IN AERMOD CONCENTRATIONS
Sergio A. Guerra
 
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
Sergio A. Guerra
 
Multisensor data fusion in object tracking applications
Multisensor data fusion in object tracking applicationsMultisensor data fusion in object tracking applications
Multisensor data fusion in object tracking applications
Sayed Abulhasan Quadri
 
Effects of Wind Direction on Trace Metal Concentration in Southeast Kansas
Effects of Wind Direction on  Trace Metal Concentration in Southeast KansasEffects of Wind Direction on  Trace Metal Concentration in Southeast Kansas
Effects of Wind Direction on Trace Metal Concentration in Southeast Kansas
Sergio A. Guerra
 

Was ist angesagt? (20)

Innovative Dispersion Modeling Practices to Achieve a Reasonable Level of Con...
Innovative Dispersion Modeling Practices to Achieve a Reasonable Level of Con...Innovative Dispersion Modeling Practices to Achieve a Reasonable Level of Con...
Innovative Dispersion Modeling Practices to Achieve a Reasonable Level of Con...
 
INNOVATIVE DISPERSION MODELING PRACTICES TO ACHIEVE A REASONABLE LEVEL OF CON...
INNOVATIVE DISPERSION MODELING PRACTICES TO ACHIEVE A REASONABLE LEVEL OF CON...INNOVATIVE DISPERSION MODELING PRACTICES TO ACHIEVE A REASONABLE LEVEL OF CON...
INNOVATIVE DISPERSION MODELING PRACTICES TO ACHIEVE A REASONABLE LEVEL OF CON...
 
New generation of high sensitivity airborne potassium magnetometers
New generation of high sensitivity airborne potassium magnetometersNew generation of high sensitivity airborne potassium magnetometers
New generation of high sensitivity airborne potassium magnetometers
 
Advanced Modeling Techniques for Permit Modeling - Turning challenges into o...
Advanced Modeling Techniques for Permit Modeling - Turning challenges into o...Advanced Modeling Techniques for Permit Modeling - Turning challenges into o...
Advanced Modeling Techniques for Permit Modeling - Turning challenges into o...
 
AIR DISPERSION MODELING HIGHLIGHTS FROM 2012 ACE
AIR DISPERSION MODELING HIGHLIGHTS FROM 2012 ACEAIR DISPERSION MODELING HIGHLIGHTS FROM 2012 ACE
AIR DISPERSION MODELING HIGHLIGHTS FROM 2012 ACE
 
CPP's Advanced Dispersion Modeling Services
CPP's Advanced Dispersion Modeling ServicesCPP's Advanced Dispersion Modeling Services
CPP's Advanced Dispersion Modeling Services
 
The PuffR R Package for Conducting Air Quality Dispersion Analyses
The PuffR R Package for Conducting Air Quality Dispersion AnalysesThe PuffR R Package for Conducting Air Quality Dispersion Analyses
The PuffR R Package for Conducting Air Quality Dispersion Analyses
 
EFFECTS OF MET DATA PROCESSING IN AERMOD CONCENTRATIONS
EFFECTS OF MET DATA PROCESSING IN AERMOD CONCENTRATIONSEFFECTS OF MET DATA PROCESSING IN AERMOD CONCENTRATIONS
EFFECTS OF MET DATA PROCESSING IN AERMOD CONCENTRATIONS
 
AERMOD CHANGES AND UPDATES
AERMOD CHANGES AND UPDATESAERMOD CHANGES AND UPDATES
AERMOD CHANGES AND UPDATES
 
CALPUFF versus AERMOD comparison
CALPUFF versus AERMOD comparisonCALPUFF versus AERMOD comparison
CALPUFF versus AERMOD comparison
 
Pairing aermod concentrations with the 50th percentile monitored value
Pairing aermod concentrations with the 50th percentile monitored valuePairing aermod concentrations with the 50th percentile monitored value
Pairing aermod concentrations with the 50th percentile monitored value
 
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
NOVEL DATA ANALYSIS TECHNIQUE USED TO EVALUATE NOX AND CO2 CONTINUOUS EMISSIO...
 
Background Concentrations and the Need for a New System to Update AERMOD
Background Concentrations and the Need for a New System to Update AERMODBackground Concentrations and the Need for a New System to Update AERMOD
Background Concentrations and the Need for a New System to Update AERMOD
 
Case Studies in Air Dispersion Modeling for Young Professionals
Case Studies in Air Dispersion Modeling for Young ProfessionalsCase Studies in Air Dispersion Modeling for Young Professionals
Case Studies in Air Dispersion Modeling for Young Professionals
 
Multisensor data fusion in object tracking applications
Multisensor data fusion in object tracking applicationsMultisensor data fusion in object tracking applications
Multisensor data fusion in object tracking applications
 
Using Physical Modeling to Evaluate Re-entrainment of Stack Emissions
Using Physical Modeling to Evaluate Re-entrainment of Stack EmissionsUsing Physical Modeling to Evaluate Re-entrainment of Stack Emissions
Using Physical Modeling to Evaluate Re-entrainment of Stack Emissions
 
Effects of Wind Direction on Trace Metal Concentration in Southeast Kansas
Effects of Wind Direction on  Trace Metal Concentration in Southeast KansasEffects of Wind Direction on  Trace Metal Concentration in Southeast Kansas
Effects of Wind Direction on Trace Metal Concentration in Southeast Kansas
 
GDRR Opening Workshop - PANEL SESSION: Probabilistic Fault Tree Analysis, wit...
GDRR Opening Workshop - PANEL SESSION: Probabilistic Fault Tree Analysis, wit...GDRR Opening Workshop - PANEL SESSION: Probabilistic Fault Tree Analysis, wit...
GDRR Opening Workshop - PANEL SESSION: Probabilistic Fault Tree Analysis, wit...
 
Highlights from the 2016 Guideline on Air Quality Models Conference
Highlights from the 2016 Guideline on Air Quality Models ConferenceHighlights from the 2016 Guideline on Air Quality Models Conference
Highlights from the 2016 Guideline on Air Quality Models Conference
 
Forecasting fine grained air quality based on big data
Forecasting fine grained air quality based on big dataForecasting fine grained air quality based on big data
Forecasting fine grained air quality based on big data
 

Ähnlich wie Sampling-SDM2012_Jun

AIAA-SDM-SequentialSampling-2012
AIAA-SDM-SequentialSampling-2012AIAA-SDM-SequentialSampling-2012
AIAA-SDM-SequentialSampling-2012
OptiModel
 
ASS_SDM2012_Ali
ASS_SDM2012_AliASS_SDM2012_Ali
ASS_SDM2012_Ali
MDO_Lab
 
Science Coffee - Algorithms to Monitor Telemetry for Subtle Indications of De...
Science Coffee - Algorithms to Monitor Telemetry for Subtle Indications of De...Science Coffee - Algorithms to Monitor Telemetry for Subtle Indications of De...
Science Coffee - Algorithms to Monitor Telemetry for Subtle Indications of De...
Advanced-Concepts-Team
 
Testing Dynamic Behavior in Executable Software Models - Making Cyber-physica...
Testing Dynamic Behavior in Executable Software Models - Making Cyber-physica...Testing Dynamic Behavior in Executable Software Models - Making Cyber-physica...
Testing Dynamic Behavior in Executable Software Models - Making Cyber-physica...
Lionel Briand
 
AIAA-MAO-RegionalError-2012
AIAA-MAO-RegionalError-2012AIAA-MAO-RegionalError-2012
AIAA-MAO-RegionalError-2012
OptiModel
 
COSMOS1_Scitech_2014_Ali
COSMOS1_Scitech_2014_AliCOSMOS1_Scitech_2014_Ali
COSMOS1_Scitech_2014_Ali
MDO_Lab
 

Ähnlich wie Sampling-SDM2012_Jun (20)

Grey-box modeling: systems approach to water management
Grey-box modeling: systems approach to water managementGrey-box modeling: systems approach to water management
Grey-box modeling: systems approach to water management
 
DSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi Martinelli
DSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi MartinelliDSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi Martinelli
DSD-NL 2018 Inverse Analysis for Workshop Anura3D MPM - Ghasemi Martinelli
 
AIAA-SDM-SequentialSampling-2012
AIAA-SDM-SequentialSampling-2012AIAA-SDM-SequentialSampling-2012
AIAA-SDM-SequentialSampling-2012
 
ASS_SDM2012_Ali
ASS_SDM2012_AliASS_SDM2012_Ali
ASS_SDM2012_Ali
 
variableselectionmodelBuilding.ppt
variableselectionmodelBuilding.pptvariableselectionmodelBuilding.ppt
variableselectionmodelBuilding.ppt
 
DSUS_MAO_2012_Jie
DSUS_MAO_2012_JieDSUS_MAO_2012_Jie
DSUS_MAO_2012_Jie
 
Science Coffee - Algorithms to Monitor Telemetry for Subtle Indications of De...
Science Coffee - Algorithms to Monitor Telemetry for Subtle Indications of De...Science Coffee - Algorithms to Monitor Telemetry for Subtle Indications of De...
Science Coffee - Algorithms to Monitor Telemetry for Subtle Indications of De...
 
Testing Dynamic Behavior in Executable Software Models - Making Cyber-physica...
Testing Dynamic Behavior in Executable Software Models - Making Cyber-physica...Testing Dynamic Behavior in Executable Software Models - Making Cyber-physica...
Testing Dynamic Behavior in Executable Software Models - Making Cyber-physica...
 
Generation of one minute data
Generation of one minute dataGeneration of one minute data
Generation of one minute data
 
Cross-validation aggregation for forecasting
Cross-validation aggregation for forecastingCross-validation aggregation for forecasting
Cross-validation aggregation for forecasting
 
AIAA-MAO-RegionalError-2012
AIAA-MAO-RegionalError-2012AIAA-MAO-RegionalError-2012
AIAA-MAO-RegionalError-2012
 
COSMOS1_Scitech_2014_Ali
COSMOS1_Scitech_2014_AliCOSMOS1_Scitech_2014_Ali
COSMOS1_Scitech_2014_Ali
 
The Treatment of Uncertainty in Models
The Treatment of Uncertainty in ModelsThe Treatment of Uncertainty in Models
The Treatment of Uncertainty in Models
 
Attribute MSA
Attribute MSAAttribute MSA
Attribute MSA
 
Attribute MSA
Attribute MSA Attribute MSA
Attribute MSA
 
ACCESS-Opt_Overview
ACCESS-Opt_OverviewACCESS-Opt_Overview
ACCESS-Opt_Overview
 
Aplication of on line data analytics to a continuous process polybetene unit
Aplication of on line data analytics to a continuous process polybetene unitAplication of on line data analytics to a continuous process polybetene unit
Aplication of on line data analytics to a continuous process polybetene unit
 
111020_TOPL_Review
111020_TOPL_Review111020_TOPL_Review
111020_TOPL_Review
 
62 friesen field_data_requirements_for_the_validation_of_pv_module_performanc...
62 friesen field_data_requirements_for_the_validation_of_pv_module_performanc...62 friesen field_data_requirements_for_the_validation_of_pv_module_performanc...
62 friesen field_data_requirements_for_the_validation_of_pv_module_performanc...
 
Automating Speed: A Proven Approach to Preventing Performance Regressions in ...
Automating Speed: A Proven Approach to Preventing Performance Regressions in ...Automating Speed: A Proven Approach to Preventing Performance Regressions in ...
Automating Speed: A Proven Approach to Preventing Performance Regressions in ...
 

Mehr von MDO_Lab

ModelSelection1_WCSMO_2013_Ali
ModelSelection1_WCSMO_2013_AliModelSelection1_WCSMO_2013_Ali
ModelSelection1_WCSMO_2013_Ali
MDO_Lab
 
MOWF_SchiTech_2015_Weiyang
MOWF_SchiTech_2015_WeiyangMOWF_SchiTech_2015_Weiyang
MOWF_SchiTech_2015_Weiyang
MDO_Lab
 
SA_SciTech_2014_Weiyang
SA_SciTech_2014_WeiyangSA_SciTech_2014_Weiyang
SA_SciTech_2014_Weiyang
MDO_Lab
 
MOMDPSO_IDETC_2014_Weiyang
MOMDPSO_IDETC_2014_WeiyangMOMDPSO_IDETC_2014_Weiyang
MOMDPSO_IDETC_2014_Weiyang
MDO_Lab
 
MOWF_WCSMO_2013_Weiyang
MOWF_WCSMO_2013_WeiyangMOWF_WCSMO_2013_Weiyang
MOWF_WCSMO_2013_Weiyang
MDO_Lab
 
WFSA_IDETC_2013_Weiyang
WFSA_IDETC_2013_WeiyangWFSA_IDETC_2013_Weiyang
WFSA_IDETC_2013_Weiyang
MDO_Lab
 
ATI_SDM_2010_Jun
ATI_SDM_2010_JunATI_SDM_2010_Jun
ATI_SDM_2010_Jun
MDO_Lab
 
PF_MAO_2010_Souam
PF_MAO_2010_SouamPF_MAO_2010_Souam
PF_MAO_2010_Souam
MDO_Lab
 
WFO_SDM_2011_Souma
WFO_SDM_2011_SoumaWFO_SDM_2011_Souma
WFO_SDM_2011_Souma
MDO_Lab
 
RBHF_SDM_2011_Jie
RBHF_SDM_2011_JieRBHF_SDM_2011_Jie
RBHF_SDM_2011_Jie
MDO_Lab
 
AHF_IDETC_2011_Jie
AHF_IDETC_2011_JieAHF_IDETC_2011_Jie
AHF_IDETC_2011_Jie
MDO_Lab
 

Mehr von MDO_Lab (20)

ModelSelection1_WCSMO_2013_Ali
ModelSelection1_WCSMO_2013_AliModelSelection1_WCSMO_2013_Ali
ModelSelection1_WCSMO_2013_Ali
 
MOWF_SchiTech_2015_Weiyang
MOWF_SchiTech_2015_WeiyangMOWF_SchiTech_2015_Weiyang
MOWF_SchiTech_2015_Weiyang
 
SA_SciTech_2014_Weiyang
SA_SciTech_2014_WeiyangSA_SciTech_2014_Weiyang
SA_SciTech_2014_Weiyang
 
MOMDPSO_IDETC_2014_Weiyang
MOMDPSO_IDETC_2014_WeiyangMOMDPSO_IDETC_2014_Weiyang
MOMDPSO_IDETC_2014_Weiyang
 
MOWF_WCSMO_2013_Weiyang
MOWF_WCSMO_2013_WeiyangMOWF_WCSMO_2013_Weiyang
MOWF_WCSMO_2013_Weiyang
 
WFSA_IDETC_2013_Weiyang
WFSA_IDETC_2013_WeiyangWFSA_IDETC_2013_Weiyang
WFSA_IDETC_2013_Weiyang
 
WM_MAO_2012_Weiyang
WM_MAO_2012_WeiyangWM_MAO_2012_Weiyang
WM_MAO_2012_Weiyang
 
CP3_SDM_2010_Souma
CP3_SDM_2010_SoumaCP3_SDM_2010_Souma
CP3_SDM_2010_Souma
 
ATI_SDM_2010_Jun
ATI_SDM_2010_JunATI_SDM_2010_Jun
ATI_SDM_2010_Jun
 
PF_MAO_2010_Souam
PF_MAO_2010_SouamPF_MAO_2010_Souam
PF_MAO_2010_Souam
 
WFO_MAO_2010_Souma
WFO_MAO_2010_SoumaWFO_MAO_2010_Souma
WFO_MAO_2010_Souma
 
ATE_MAO_2010_Jun
ATE_MAO_2010_JunATE_MAO_2010_Jun
ATE_MAO_2010_Jun
 
COST_MAO_2010_Jie
COST_MAO_2010_JieCOST_MAO_2010_Jie
COST_MAO_2010_Jie
 
WFO_IDETC_2011_Souma
WFO_IDETC_2011_SoumaWFO_IDETC_2011_Souma
WFO_IDETC_2011_Souma
 
COSTMODEL_IDETC_2010_Jie
COSTMODEL_IDETC_2010_JieCOSTMODEL_IDETC_2010_Jie
COSTMODEL_IDETC_2010_Jie
 
WFO_TIERF_2011_Messac
WFO_TIERF_2011_MessacWFO_TIERF_2011_Messac
WFO_TIERF_2011_Messac
 
WFO_SDM_2011_Souma
WFO_SDM_2011_SoumaWFO_SDM_2011_Souma
WFO_SDM_2011_Souma
 
RBHF_SDM_2011_Jie
RBHF_SDM_2011_JieRBHF_SDM_2011_Jie
RBHF_SDM_2011_Jie
 
WFO_IDETC_2011_Souma
WFO_IDETC_2011_SoumaWFO_IDETC_2011_Souma
WFO_IDETC_2011_Souma
 
AHF_IDETC_2011_Jie
AHF_IDETC_2011_JieAHF_IDETC_2011_Jie
AHF_IDETC_2011_Jie
 

Kürzlich hochgeladen

Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 

Kürzlich hochgeladen (20)

This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 

Sampling-SDM2012_Jun

  • 1. Improving the Accuracy of Surrogate Models Using Inverse Transform Sampling Junqiang Zhang*, Achille Messac#, Jie Zhang*, and Souma Chowdhury* * Rensselaer Polytechnic Institute, Department of Mechanical, Aerospace, and Nuclear Engineering # Syracuse University, Department of Mechanical and Aerospace Engineering 53rd AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics and Materials Conference 8th AIAA Multidisciplinary Design Optimization Specialist Conference April 23 - 26, 2012 Honolulu, Hawaii
  • 2. Introduction • Sampling is an important component of optimization, numerical simulations, design of experiments and uncertainty analysis. • Surrogate modeling is concerned with the construction of approximation models to estimate the system performance, and to develop relationships between specific system inputs and outputs. • It is expected that an intelligent selection of sample points can increase the accuracy of surrogate models. 2 Surrogate
  • 3. 3 Sampling Based on Probability Distribution • Observations of inputs often follow a distribution. • A set of sample points representative of the naturally occurring distribution of inputs is often desirable. Distribution of a population Sample points -20 0 20 -20 0 6 4 2 0 20 x 10-3 x2 x1 PDF x1 x2 20 10 0 -10 -20 -20 -10 0 10 20
  • 4. Presentation Outline 4 Research Objectives and Motivation Probability-based sampling methods overview Inverse transform sampling Surrogate model development • Surrogate model performance comparison • Performance in increasing sample space Concluding remarks
  • 5.  Certain inputs occur more frequently, comprising regions of high interests in the condition space.  It is desirable to have higher accuracy in the system response (surrogate) in the regions of higher interest. 5 Motivation and Research Objectives Motivation: Objectives:  Develop a sampling strategy for surrogate model development, which promotes higher accuracy in regions of high interest (of the observed input).
  • 6. Existing Probabilistic Sampling Strategies 6 • Rejection sampling • Importance sampling • Markov Chain Monte Carlo • Metropolis-Hastings Sampling • Gibbs Sampling
  • 7. 7 Inverse Transform Sampling: Key Features  Inverse transform can • Sample more points in the regions where random variables have higher probability densities; and • Sample fewer points in the regions where random variables have low probability densities.  The probability of random variables is used as the metric of distance instead of the Euclidean distance.  Sample points are uniform in terms of the probability differences.
  • 8. 8 Procedure: Step 1 Random Variable Observations Distribution Function Fitting Generating the Sequence of CDFs Coordinates Evaluation Step 1 Step 2 Step 3 Step 4 The occurrence of sampling variables should be sufficiently observed. 10 5 0 -5 -10 -15 -20 -5 0 5 10 15 x1 x2
  • 9. 9 Procedure : Step 2 Approaches • The least squares method • The least absolute deviations method • The generalized method of moments • The Maximum Likelihood Estimation Random Variable Observations Distribution Function Fitting Generating the Sequence of CDFs Coordinates Evaluation Step 1 Step 2 Step 3 Step 4 -20 0 20 -20 0 0.015 0.01 0.005 0 20 x2 x1 PDF
  • 10. 10 Procedure : Step 3 • CDF increases from 0 to 1. • Low-discrepancy sampling methods generate uniformly distributed sequences between 0 and 1 in all dimensions of a sample space. • Van der Corput sequence • Halton/Hammersley sequence • Sobol sequence • Faure sequence Random Variable Observations Distribution Function Fitting Generate the Sequence of CDFs Coordinates Evaluation Step 1 Step 2 Step 3 Step 4 1 0.8 0.6 0.4 0.2 0 0 0.2 0.4 0.6 0.8 1 CDF(x1) CDF(x2)
  • 11. 11 Procedure : Step 4 • Coordinates are evaluated using the inverse function of CDF. • Analytical expressions • Numerical approaches • The Newton’s method • The Levenberg-Marquardt algorithm • The trust region methods Random Variable Observations Distribution Function Fitting Generating the Sequence of CDFs Coordinates Evaluation Step 1 Step 2 Step 3 Step 4 x1 x2 20 10 0 -10 -20 -20 -10 0 10 20
  • 12. 12 More Applications • More points -20 0 20 -20 0 0.015 0.01 0.005 0 20 x2 x1 PDF x1 x2 20 10 0 -10 -20 -20 -10 0 10 20 x1 x2 20 10 0 -10 -20 -20 -10 0 10 20 -20 0 20 -20 0 0.015 0.01 0.005 0 0.015 0.01 0.005 0 20 x2 x1 PDF x1 x2 20 10 0 -10 -20 20 10 0 -10 -20 -20 -10 0 10 20 -20 0 20 -20 0 20 x2 x1 PDF x1 x2 -20 -10 0 10 20 31 127 • Multimodal functions Bi-modal Quad-modal
  • 13. 13 Window Performance Evaluation • The heat transfer rate through a triple pane window is evaluated under varying climatic conditions. • A CFD model of the triple pane window is created. • Sample climatic conditions are boundary conditions of the window CFD model. Cross Section
  • 14. 14 Window Performance Evaluation Step 1 Random Variable Observations Three climatic conditions: • Air temperature • Wind speed • Solar radiation . Michigan, ND. 3720 hourly observations for either January or August from 2006 to 2010
  • 15. 15 Window Performance Evaluation Step 2 Distribution Function Fitting Three climatic conditions: • Air temperature: Gaussian • Wind speed: Weibull • Solar radiation: Gamma Parameters are fitted using the Maximum Likelihood Estimation. Michigan, ND. 3720 hourly observations for either January or August from 2006 to 2010
  • 16. Three climatic conditions: • Air temperature: Gaussian • Wind speed: Weibull • Solar radiation: Gamma Parameters are fitted using the Maximum Likelihood Estimation. Michigan, ND. 3720 hourly observations for either January or August from 2006 to 2010 16 Window Performance Evaluation Step 3 Generate the Sequence of CDFs 1 0.8 0.6 0.4 0.2 0 270 280 290 300 310 Temperature CDF 1 0.8 0.6 0.4 0.2 0 0 5 10 15 Wind Speed CDF 1 0.8 0.6 0.4 0.2 0 0 500 1000 1500 Solar radiation CDF Sobol sequence
  • 17. Three climatic conditions: • Air temperature: Gaussian • Wind speed: Weibull • Solar radiation: Gamma Parameters are fitted using the Maximum Likelihood Estimation. Michigan, ND. 3720 hourly observations for either January or August from 2006 to 2010 17 Window Performance Evaluation Step 4 Coordinates Evaluation 1 0.8 0.6 0.4 0.2 0 270 280 290 300 310 Temperature CDF 1 0.8 0.6 0.4 0.2 0 0 5 10 15 Wind Speed CDF 1 0.8 0.6 0.4 0.2 0 0 500 1000 1500 Solar radiation CDF
  • 18. 18 Distribution of Sample Points • Sample climatic conditions for January • Sample climatic conditions for August Sample points crowd in the region where PDF is high.
  • 19. 19 Surrogate Model Development • The heat transfer rate through the window is evaluated using 31 sample climatic conditions for either January or August, respectively. • Two surrogate models are developed for January and August using Kriging, respectively. Outdoor temperature Wind speed Solar radiation Heat flux Kriging Inputs Output  In this paper, we use a Matlab Kriging toolbox DACE (Design and Analysis of Computer Experiments), developed by Dr. Nielsen.
  • 20. 20 Surrogate Model Performance Criteria For January and August, 3720 climatic conditions are used to evaluate errors of each surrogate. The performance of the surrogate can be evaluated using: • Root Mean Squared Error (RMSE) • Root Mean Squared Percentage Error (RMSPE) • Maximum Absolute Error (MAE) • Maximum Percentage Error (MPE)
  • 21. 21 Surrogate Model Performance Comparison Month Method RMSE MAE RMSPE MPE January Inverse 0.047 0.49 0.64% 7.2% Sobol 0.054 0.30 0.68% 9.3% August Inverse 0.079 0.54 11% 318% Sobol 0.094 0.32 85% 4373% • RMSE, RMSPE, and MPE: Inverse transform sampling performs better than the Sobol sequence. • MAE: Inverse transform sampling has a larger MAE values.
  • 22. 22 Performance in Increasing Sample Space • All the hourly climatic conditions are classified into regions with increasing PDF values in the sample space. • The performance of the surrogate models is evaluated in increasing sample space. 280 285 290 295 300 305 2 4 6 8 800 600 400 200 0 10 Wind speed (m/s) Temperature (K) Solar radiation (W/m2) 100% … … 0.8% 0.1% 3720 climatic conditions
  • 23. Root Mean Squared Percentage Error 23 Performance in Increasing Sample Space The surrogate model for January Root Mean Squared Error Increasing percentage of sample space Increasing percentage of sample space
  • 24. The surrogate model for January Maximum Percentage Error 24 Performance in Increasing Sample Space Maximum Absolute Error Increasing percentage of sample space Increasing percentage of sample space
  • 25. Root Mean Squared Percentage Error 25 Performance in Increasing Sample Space The surrogate model for August Root Mean Squared Error Increasing percentage of sample space Increasing percentage of sample space
  • 26. The surrogate model for August Maximum Percentage Error 26 Performance in Increasing Sample Space Maximum Absolute Error Increasing percentage of sample space Increasing percentage of sample space
  • 27. 27 Conclusions • Inverse transform sampling is uniquely helpful for surrogate development where the system inputs follow a certain distribution. • The CDF of the inputs are made to follow a pseudorandom sequence (such as Sobol). • For window performance evaluation, the surrogate models developed using inverse transform sampling have lower root mean squared error than those developed using the Sobol sequence. • For window performance evaluation, the surrogate models developed using inverse transform sampling have higher maximum absolute error than those developed using the Sobol sequence.
  • 28. 28 Future Work • Extend the applicability of inverse transform sampling to correlated multi-variate/multi-input systems.
  • 29. Acknowledgement • I would like to acknowledge my research adviser Prof. Achille Messac, for his immense help and support in this research. • Support from the NSF Awards is also acknowledged. 29
  • 30. 30 Selected References • Husslage, B. G., Rennen, G., van Dam, E. R., and den Hertog, D., “Space-filling Latin Hypercube Designs for Computer Experiments,” Optimization and Engineering, Vol. 12, 2011, pp. 611–632. • Clarkson, K. L. and Shor, P. W., “Applications of Random Sampling in Computational Geometry, II,” Discrete and Computational Geometry, Vol. 4, 1989, pp. 387–421. • Goldreich, O., Computational Complexity: A Conceptual Perspective, Cambridge University Press, 1st ed., 2008. • LaValle, S. M., Planning Algorithms, Cambridge University Press, 2006. • Niederreiter, H., “Point Sets and Sequences with Small Discrepancy,” Monatshefte fr Mathematik, Vol. 104, December 1987, pp. 273–337. • van der Corput, J. G., “Verteilungsfunktionen,” Nederl. Akad. Wetensch. Proc., Vol. 38, 1935, pp. 813–821. • Diaconis, P., “The Distribution of Leading Digits and Uniform Distribution Mod 1,” The Annals of Probability, Vol. 5, No. 1, Feb 1977, pp. 72–81. • Sobol, I. M., “Uniformly Distributed Sequences with an Additional Uniform Property,” USSR Computational Mathematics and Mathematical Physics, Vol. 16, 1976, pp. 236–242. • Faure, H., “Discrpances de suites associes un systme de numration en dimension s,” Acta Arithmetica, Vol. 41, 1982, pp. 337–351. • Miller, F., Vandome, A., and John, M., Inverse Transform Sampling, VDM Verlag Dr. Mueller e.K., 2010. • von Neumann, J., “Various Techniques Used in Connection with Random Digits,” Nat. Bureau Stand. Appl. Math. Ser., Vol. 12, 1951, pp. 3638. • Marshall, A. W., “The Use of Multi-stage Sampling Schemes in Monte Carlo Computations,” H. A. Meyer (ed.), Symposium on Monte Carlo Methods, edited by N. Y. John Wiley & Sons, Inc., 1956, p. 123140. • Gilks, W., Gilks, W., Richardson, S., and Spiegelhalter, D., Markov Chain Monte Carlo in Practice, Interdisciplinary Statistics, Chapman & Hall, 1996.
  • 31. 31 Performance in Increasing Sample Space • All the hourly climatic conditions are classified into regions with increasing PDF values in the sample space. • For each variable, the probability is the integral of the fitted PDF in the shortest interval. • The performance of the surrogate models is evaluated in increasing sample space.
  • 32. 32 Review • Sampling sequences • Latin hypercube • Random • Pseudorandom • Low-dispersion • Low-discrepancy • Generating sample points from a probability distribution • Inverse transform sampling • rejection sampling • importance sampling • Markov Chain Monte Carlo • Metropolis-Hastings Sampling • Gibbs Sampling
  • 33. Comparisons and Analyses 33 Sobol sequence Inverse transform • A Voronoi diagram is a special kind of decomposition of a metric space determined by distances to a specified discrete set of points in the space. • Each point has a cell that includes the region closer to the point than to any others. • The lines are equidistant to the two nearest points.

Hinweis der Redaktion

  1. Inverse transform sampling was applied to a unimodal distribution to sample 31 points in previous slides to show how the sampling approach works. It can also be used to sample more points. The figure on the left shows that, when the number of sample points is increased from 31 to 127, the sample points obtained from this approach still crowd in the region where the probability density is high. Although the number of points in the regions with low probability density also increase, it is increasing at a lower rate than that of points in the regions with high probability density. Inverse transform sampling can also be used to sample multi-modal distributions. The two figures show the sample points for a bimodal distribution and a quad-modal distribution. The sample points crowd in the regions where the probability density is high.
  2. Higher MAE: It is because in the region with very low distribution density, the sample points are far from each other. The absolute errors at some points in this region are higher for the inverse than the sobol. August high MPE at 4373%: At some points, their values are very close to zero. These values are used as denominators.
  3. For the sobol sequence, Root Mean Squared Error and Root Mean Squared Percentage Error are similar in different percentages of sample space. For the inverse transform sampling, Root Mean Squared Error and Root Mean Squared Percentage Error are increasing as the percentage increases. Their overall performance is still better than the sobol sequence.
  4. As the percentage of sample space increases, maximum absolute error and maximum percentage error are both increasing. When the whole space is reached, the density of sample points obtained by inverse transform sampling in some space becomes lower than those for sobol. The maximum absolute error for inverse becomes higher than that of sobol. However, the maximum percentage error for inverse is still lower than that for sobol
  5. The spike for the root mean squared percentage error is because in the 12.5% region but not in the 10.0% region, the actual heat flux through the window is very close to zero. It is used as the denominator for percentage error evaluation.
  6. The spike for the maximum percentage error is because in the 12.5% region but not in the 10.0% region, the actual heat flux through the window is very close to zero. It is used as the denominator for percentage error evaluation.