SlideShare ist ein Scribd-Unternehmen logo
1 von 51
Downloaden Sie, um offline zu lesen
Karsten Peters
peters@dkrz.de
Applying machine learning
to address pressing issues
of fundamental weather
and climate research
Maria Moreno de Castro
moreno@dkrz.de
the presentation follow this perspective article
Organism
Biome
Region
Landscape
ECOSYSTEM
Organ
Cell
Molecule
GLOBE
Complex
Biology + Chemistry + Physics
Unique
The Earth System
Slide courtesy from the author Markus Reichstein
THE EARTH SYSTEM
THE EARTH SYSTEM
The behavior is dominated by spatial and temporal relations
Main research focus:
○ seasonal meteorological predictions
○ forecasting extreme events: floods, fires,...
○ long term climate predictions
It’s not like we
haven’t got
enough data at
our hands…
There’s
observational
data…
The A-Train
- 7 satellites flying in formation
- Operating since ~18 years
- Aqua collects about 89 GB of data/day
Example of observational data collection by remote sensing
It’s not like we
haven’t got
enough data at
our hands…
There’s
observational
data …
...and model
data.
Model data are the result of simulations generated by numerically solved
differential equations derived from physical models by discretizing the Earth and
representing key processes with parameterizations
Repeat for every model timestep and for every point of the globe
Model data Calculate physical
processes
Apply boundary
conditions
Multimodel analysis
©DKRZ/MPI-M
Last report:
3.5PBytes
Next report:
~ 30PBytes
Climate Models at the 1km scale are coming up
~650 GB of data per output time step
<<
Mistral High
Performace Computer
- 6 years old (new in 2020)
- 3.9 PFlops (#80 in Top500)
- 52 PiBytes disk (#4 in IO500)
Tape archive
- >200 PiBytes
- 5PiBytes disk cache
<<
example of spatio-temporal relations:
the prediction of fire occurrence, the stimation of burnt
area, and the trace of gas emissions depend on:
● instantaneous climatic drivers: temperature, humidity,...
● sources of ignition: humans, lightning,...
● state variables: available fuel,..
● moisture, terrain, wind speed and direction,..
Machine learning applications often do not directly and
exhaustively account for spatio-temporal correlations
Deep learning is a promising approach
Example: convolutional networks (spatial) + recurrent networks (memory, sequence learning)
Examples of Deep Learning applications in
Earth System Science
Slide courtesy from the author Markus Reichstein
Deep learning challenges
in Earth System science
● Diverse sources of noise → poor signal-to-noise ratio
Deep learning challenges
in Earth System science
● Diverse sources of noise → poor signal-to-noise ratio
● Inconsistencies
Deep learning challenges
in Earth System science
Fundamental
laws
of physics
energy and mass conservations,.... and we
must assure the deep learning models do not
allow for negative densities, precipitations,...
Noether's theorem explains why
conservation laws exists (wikipedia)
● Diverse sources of noise→ poor signal-to-noise ratio
● Inconsistencies → energy and mass conservations, density must be positive,...
● Extrapolation problem
Deep learning challenges
in Earth System science
Extrapolation
problem:
classification
Extrapolation
problem:
classification
the model should
show is not certain
about predicting in
undersampled
regions...
input data
Extrapolation
problem:
regression
Non-
stationary
system
Data shift or concept drift
● training data are not longer representative if the system has changed
● the accuracy of the trained model definitely decreased under data shift/concept drift
● Diverse sources of noise→ poor signal-to-noise ratio
● Inconsistencies → energy and mass conservations, density must be positive,...
● Extrapolation problem → system changes in time: data shift or concept drift
Deep learning challenges
in Earth System science
● Beyond visible spectrum → different statistical properties, no i.i.d. sets
● 40 000 x 20 000 pixels for a regular 1 km global resolution
● Multiple scales
Images
Deep learning challenges
in Earth System science
what is the scale of this? →
● Beyond visible spectrum → different statistical properties, no i.i.d. sets
● 40 000 x 20 000 pixels for a regular 1 km global resolution
● Multiple scales
● Scale invariant features
Images
Deep learning challenges
in Earth System science
Scale invariant!
● Beyond visible spectrum → different statistical properties, no i.i.d. sets
● 40 000 x 20 000 pixels for a regular 1 km global resolution
● Multiple scales
● Scale invariant features
● No ImageNet → and difficult to have, example: labelling clouds
Images
Deep learning challenges
in Earth System science
● Beyond visible spectrum → different statistical properties, no i.i.d. sets
● 40 000 x 20 000 pixels for a regular 1 km global resolution
● Multiple scales
● Scale invariant features
● No ImageNet → and difficult to have, example: labelling clouds
● Missing data → a solution Christopher Kadow, the leader of DKRZ
machine learning research group, adapted the Nvidia Technology for
image inpainting
Deep learning challenges
in Earth System science
Images
‘ground-truth’
original data
masked data
missing values
reconstruction
by Deep Convolutional NN
Image inpainting to reconstruct temperature missing observations
Hybrid models
Physical
models
ML and DL
models
Physical
models
ML and DL
models
Lightweighting/simplifying/speeding up physical models
● improve parametrizations
● analysis of model-observations mismatch
● emulation
Physical
models
ML and DL
models
Domain knowledge can guide/optimize the pure data-driven methods
● design the architecture
● constrain the cost (or reward) function
● physically based data augmentation: expansion of the data set
for undersampled regions
Depth
(m)
Temp
(°C)
feature prediction
Example: lakes simulations to predict temperature from depth measurements
Physical model
example: Tempd+1
= Temp d
+ sun - wind - upwelling
given that we measured Td=surface
= 15°C
Depth
(m)
Temp
(°C)
feature prediction
Physical model
example: Tempd+1
= Temp d
+ sun - wind - upwelling
given that we measured Td=surface
= 15°C
Moderate model skills
and of course zero
inconsistency
Depth
(m)
Temp
(°C)
feature prediction
Neural Network
might allow negative densities!
_
Better model skills and
but the inconsistency
spreads
Depth
(m)
Density
(g/L)
Temp
(°C)
features prediction
DATA AUGMENTATION: to include
new features that comes from physical
knowledge and then NN
Depth
(m)
Density
(g/L)
Temp
(°C)
features prediction
DATA AUGMENTATION: to include
new features that comes from physical
knowledge and then NN
Even better model skills
and a bit less
inconsistency but it still
spreads
Depth
(m)
Density
(g/L)
Temp
(°C)
✓
X
features prediction
physical model + NN + constrain loss
function: denser water must be deeper
Depth
(m)
Density
(g/L)
Temp
(°C)
✓
X
features prediction
physical model + NN + constrain loss
function: denser water must be deeper
Totally consistent and
high model skills!
Great model
performance (~1°C less
error) and totally
consistent
References
● Earth System figure: https://karenbakker.org/the-climate-system/
● Data cube image: Earth Syst. Dynam., 11, 201–234 (2020) https://doi.org/10.5194/esd-11-201-2020
● Climate model image: A. Gettelman and R.B. Rood, Demystifying Climate Models, Earth Systems
Data and Models 2, doi: 10.1007/978-3-662-48959-8_5
● Multimodel figure: Michael Böttinger (DKRZ) and Joachem Marotzke (Max Planck Institute
Meteorology)
● Mistral picture: Michael Böttinger (DKRZ) .
● Wildfire picture: https://pixnio.com/miscellaneous/fire-flames-pictures/aerial-ignition-interior-high
-rates-of-spread-in-open-savannas
● Hockey stick IPCC https://www.ipcc.ch/report/ar3/wg1/ (Chapter 2)
● Scale invariant issue with chiguagua and dingo: Christian Staudt http://clstaudt.me
● Scale free image http://paulbourke.net/fractals/googleearth/
● Image inpainting:
○ Nvidia Technology https://www.nvidia.com/research/inpainting/
○ Kadow et al. (2020), Artificial Intelligence reconstructs missing Climate Information (in review)
● Physics-guided neural networks : https://arxiv.org/pdf/1710.11431.pdf and
https://towardsdatascience.com/physics-guided-neural-networks-pgnns-8fe9dbad9414
DKRZ Unit: Machine Learning as a Service
● Provide a knowledge base
● Bring prototypes to production
● Train, educate, and exchange
machinelearning-join@lists.dkrz.de
Summary of the main topics discussed in the kick-off workshop
● Applying machine learning to Earth System modelling
○ Hybrid approaches to (i) improve parametrizations and (ii) validate physical models
○ Increase the availability of training data via (i) data augmentation and (ii) labelling
○ Infer causality of the patterns found in observational data
● Technology
○ Support for Python, portation to GPUs, and larger memory
○ Machine learning libraries for NetDCF data handling
○ Adaptive learning integrated with physical models during running time on the HPC
○ Distributed training and execution
○ Portability between HPC centres
● Uncertainty and reproducibility
○ Performance metrics for (i) unsupervised learning and (ii) data shift/concept drift
○ Adoption of interpretable models and uncertainty quantification and explainability methods
○ Sharing of training scripts and training data or trained model
● Community activities and capacity building: workshops, summer schools,...
Artificial intelligence and machine learning
activities at DKRZ
EXTRASLIDE
Climate models
● physical models derived from first principles (mechanistic)
● used to simulate how the Earth’s climate changes in time (dynamical)
● written in the form of coupled differential equations
● solution depends on boundary and initial conditions
● run with different conditions allows to create scenarios (see for instance, RCP)
● solved numerically with long lasting parallel runs
● calibrated and validated against observational data
● results are called model data
EXTRASLIDE
year
RCP 2.6: the best
case scenario
historical
RPC 8.5:
business
as usualExample of a basic climate model
including time (t) and space (x):
Tempt+1,x+1
= Temp t,x
+ warmingt,x
- coolingt,x
given that Tempt=0
= 25°C
High Performance Computing Data Center
Ongoing efforts to reduce our carbon footprint:
● Power Usage Effectiveness ~1 (PUE = 1.19)
● Cold aisle containment reduce CO2
emissions by 20%
● Hot air is recycled for heating nearby facilities
● Cooling water is recycled in our toilets
● Greener energy supplier possible
BEFORE AFTER
EXTRASLIDE

Weitere ähnliche Inhalte

Was ist angesagt?

The 7 steps of Machine Learning
The 7 steps of Machine LearningThe 7 steps of Machine Learning
The 7 steps of Machine LearningWaziri Shebogholo
 
A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...aimsnist
 
IRJET-Performance Analysis of Liver Disease Prediction using Machine Learning...
IRJET-Performance Analysis of Liver Disease Prediction using Machine Learning...IRJET-Performance Analysis of Liver Disease Prediction using Machine Learning...
IRJET-Performance Analysis of Liver Disease Prediction using Machine Learning...IRJET Journal
 
Image classification with Deep Neural Networks
Image classification with Deep Neural NetworksImage classification with Deep Neural Networks
Image classification with Deep Neural NetworksYogendra Tamang
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Usama Fayyaz
 
Federated learning
Federated learningFederated learning
Federated learningMindos Cheng
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...Simplilearn
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRaveen Perera
 
House Price Prediction An AI Approach.
House Price Prediction An AI Approach.House Price Prediction An AI Approach.
House Price Prediction An AI Approach.Nahian Ahmed
 
Machine Learning - Dataset Preparation
Machine Learning - Dataset PreparationMachine Learning - Dataset Preparation
Machine Learning - Dataset PreparationAndrew Ferlitsch
 
Machine learning overview
Machine learning overviewMachine learning overview
Machine learning overviewprih_yah
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSivagowry Shathesh
 
Neural networks and deep learning
Neural networks and deep learningNeural networks and deep learning
Neural networks and deep learningJörgen Sandig
 
Deep Learning for Artificial Intelligence (AI)
Deep Learning for Artificial Intelligence (AI)Deep Learning for Artificial Intelligence (AI)
Deep Learning for Artificial Intelligence (AI)Er. Shiva K. Shrestha
 
What is Deep Learning?
What is Deep Learning?What is Deep Learning?
What is Deep Learning?NVIDIA
 
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...Camera Culture Group, MIT Media Lab
 
Decision trees for machine learning
Decision trees for machine learningDecision trees for machine learning
Decision trees for machine learningAmr BARAKAT
 
Deep learning health care
Deep learning health care  Deep learning health care
Deep learning health care Meenakshi Sood
 

Was ist angesagt? (20)

The 7 steps of Machine Learning
The 7 steps of Machine LearningThe 7 steps of Machine Learning
The 7 steps of Machine Learning
 
A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...
 
IRJET-Performance Analysis of Liver Disease Prediction using Machine Learning...
IRJET-Performance Analysis of Liver Disease Prediction using Machine Learning...IRJET-Performance Analysis of Liver Disease Prediction using Machine Learning...
IRJET-Performance Analysis of Liver Disease Prediction using Machine Learning...
 
Image classification with Deep Neural Networks
Image classification with Deep Neural NetworksImage classification with Deep Neural Networks
Image classification with Deep Neural Networks
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning
 
Federated learning
Federated learningFederated learning
Federated learning
 
Tutorial on Deep Learning
Tutorial on Deep LearningTutorial on Deep Learning
Tutorial on Deep Learning
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Algori...
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
House Price Prediction An AI Approach.
House Price Prediction An AI Approach.House Price Prediction An AI Approach.
House Price Prediction An AI Approach.
 
Machine Learning - Dataset Preparation
Machine Learning - Dataset PreparationMachine Learning - Dataset Preparation
Machine Learning - Dataset Preparation
 
Deep learning ppt
Deep learning pptDeep learning ppt
Deep learning ppt
 
Machine learning overview
Machine learning overviewMachine learning overview
Machine learning overview
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease prediction
 
Neural networks and deep learning
Neural networks and deep learningNeural networks and deep learning
Neural networks and deep learning
 
Deep Learning for Artificial Intelligence (AI)
Deep Learning for Artificial Intelligence (AI)Deep Learning for Artificial Intelligence (AI)
Deep Learning for Artificial Intelligence (AI)
 
What is Deep Learning?
What is Deep Learning?What is Deep Learning?
What is Deep Learning?
 
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
 
Decision trees for machine learning
Decision trees for machine learningDecision trees for machine learning
Decision trees for machine learning
 
Deep learning health care
Deep learning health care  Deep learning health care
Deep learning health care
 

Ähnlich wie Machine learning and climate and weather research

TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...Brad Evans
 
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...PyData
 
Deep Learning and Automatic Differentiation from Theano to PyTorch
Deep Learning and Automatic Differentiation from Theano to PyTorchDeep Learning and Automatic Differentiation from Theano to PyTorch
Deep Learning and Automatic Differentiation from Theano to PyTorchinside-BigData.com
 
Improving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine LearningImproving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine LearningNoah Brenowitz
 
Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Frank Nielsen
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401butest
 
Learning new climate science by thinking creatively with machine learning
Learning new climate science by thinking creatively with machine learningLearning new climate science by thinking creatively with machine learning
Learning new climate science by thinking creatively with machine learningZachary Labe
 
Slide 1
Slide 1Slide 1
Slide 1butest
 
Slide 1
Slide 1Slide 1
Slide 1butest
 
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017SERC at Carleton College
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...BrianDeCost
 
How to Prepare Weather and Climate Models for Future HPC Hardware
How to Prepare Weather and Climate Models for Future HPC HardwareHow to Prepare Weather and Climate Models for Future HPC Hardware
How to Prepare Weather and Climate Models for Future HPC Hardwareinside-BigData.com
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Igor Sfiligoi
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Frank Wuerthwein
 
Near Exascale Computing in the Cloud
Near Exascale Computing in the CloudNear Exascale Computing in the Cloud
Near Exascale Computing in the CloudFrank Wuerthwein
 

Ähnlich wie Machine learning and climate and weather research (20)

CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
CLIM Program: Remote Sensing Workshop, Optimization Methods in Remote Sensing...
 
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
 
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
 
MAXSS & NVIDIA
MAXSS & NVIDIAMAXSS & NVIDIA
MAXSS & NVIDIA
 
Complex Models for Big Data
Complex Models for Big DataComplex Models for Big Data
Complex Models for Big Data
 
Deep Learning and Automatic Differentiation from Theano to PyTorch
Deep Learning and Automatic Differentiation from Theano to PyTorchDeep Learning and Automatic Differentiation from Theano to PyTorch
Deep Learning and Automatic Differentiation from Theano to PyTorch
 
Improving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine LearningImproving Physical Parametrizations in Climate Models using Machine Learning
Improving Physical Parametrizations in Climate Models using Machine Learning
 
Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Learning new climate science by thinking creatively with machine learning
Learning new climate science by thinking creatively with machine learningLearning new climate science by thinking creatively with machine learning
Learning new climate science by thinking creatively with machine learning
 
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...CLIM: Transition Workshop - Optimization Methods in Remote Sensing  - Jessica...
CLIM: Transition Workshop - Optimization Methods in Remote Sensing - Jessica...
 
Slide 1
Slide 1Slide 1
Slide 1
 
Slide 1
Slide 1Slide 1
Slide 1
 
Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017Developing Computational Skills in the Sciences with Matlab Webinar 2017
Developing Computational Skills in the Sciences with Matlab Webinar 2017
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...
 
How to Prepare Weather and Climate Models for Future HPC Hardware
How to Prepare Weather and Climate Models for Future HPC HardwareHow to Prepare Weather and Climate Models for Future HPC Hardware
How to Prepare Weather and Climate Models for Future HPC Hardware
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
 
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
Running a GPU burst for Multi-Messenger Astrophysics with IceCube across all ...
 
Near Exascale Computing in the Cloud
Near Exascale Computing in the CloudNear Exascale Computing in the Cloud
Near Exascale Computing in the Cloud
 

Kürzlich hochgeladen

Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfPratikPatil591646
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfnikeshsingh56
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformationAnnie Melnic
 
Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are successPratikSingh115843
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfNicoChristianSunaryo
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etclalithasri22
 

Kürzlich hochgeladen (17)

Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdf
 
2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdf
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformation
 
Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are success
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdf
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etc
 

Machine learning and climate and weather research

  • 1. Karsten Peters peters@dkrz.de Applying machine learning to address pressing issues of fundamental weather and climate research Maria Moreno de Castro moreno@dkrz.de
  • 2. the presentation follow this perspective article
  • 3. Organism Biome Region Landscape ECOSYSTEM Organ Cell Molecule GLOBE Complex Biology + Chemistry + Physics Unique The Earth System Slide courtesy from the author Markus Reichstein
  • 5. THE EARTH SYSTEM The behavior is dominated by spatial and temporal relations Main research focus: ○ seasonal meteorological predictions ○ forecasting extreme events: floods, fires,... ○ long term climate predictions
  • 6. It’s not like we haven’t got enough data at our hands… There’s observational data…
  • 7. The A-Train - 7 satellites flying in formation - Operating since ~18 years - Aqua collects about 89 GB of data/day Example of observational data collection by remote sensing
  • 8. It’s not like we haven’t got enough data at our hands… There’s observational data … ...and model data.
  • 9. Model data are the result of simulations generated by numerically solved differential equations derived from physical models by discretizing the Earth and representing key processes with parameterizations
  • 10. Repeat for every model timestep and for every point of the globe Model data Calculate physical processes Apply boundary conditions
  • 12. Climate Models at the 1km scale are coming up ~650 GB of data per output time step
  • 13. << Mistral High Performace Computer - 6 years old (new in 2020) - 3.9 PFlops (#80 in Top500) - 52 PiBytes disk (#4 in IO500) Tape archive - >200 PiBytes - 5PiBytes disk cache
  • 14. <<
  • 15. example of spatio-temporal relations: the prediction of fire occurrence, the stimation of burnt area, and the trace of gas emissions depend on: ● instantaneous climatic drivers: temperature, humidity,... ● sources of ignition: humans, lightning,... ● state variables: available fuel,.. ● moisture, terrain, wind speed and direction,..
  • 16. Machine learning applications often do not directly and exhaustively account for spatio-temporal correlations Deep learning is a promising approach Example: convolutional networks (spatial) + recurrent networks (memory, sequence learning)
  • 17. Examples of Deep Learning applications in Earth System Science Slide courtesy from the author Markus Reichstein
  • 18. Deep learning challenges in Earth System science
  • 19. ● Diverse sources of noise → poor signal-to-noise ratio Deep learning challenges in Earth System science
  • 20. ● Diverse sources of noise → poor signal-to-noise ratio ● Inconsistencies Deep learning challenges in Earth System science
  • 21. Fundamental laws of physics energy and mass conservations,.... and we must assure the deep learning models do not allow for negative densities, precipitations,... Noether's theorem explains why conservation laws exists (wikipedia)
  • 22. ● Diverse sources of noise→ poor signal-to-noise ratio ● Inconsistencies → energy and mass conservations, density must be positive,... ● Extrapolation problem Deep learning challenges in Earth System science
  • 24. Extrapolation problem: classification the model should show is not certain about predicting in undersampled regions...
  • 26.
  • 27. Non- stationary system Data shift or concept drift ● training data are not longer representative if the system has changed ● the accuracy of the trained model definitely decreased under data shift/concept drift
  • 28. ● Diverse sources of noise→ poor signal-to-noise ratio ● Inconsistencies → energy and mass conservations, density must be positive,... ● Extrapolation problem → system changes in time: data shift or concept drift Deep learning challenges in Earth System science
  • 29. ● Beyond visible spectrum → different statistical properties, no i.i.d. sets ● 40 000 x 20 000 pixels for a regular 1 km global resolution ● Multiple scales Images Deep learning challenges in Earth System science
  • 30. what is the scale of this? → ● Beyond visible spectrum → different statistical properties, no i.i.d. sets ● 40 000 x 20 000 pixels for a regular 1 km global resolution ● Multiple scales ● Scale invariant features Images Deep learning challenges in Earth System science
  • 32. ● Beyond visible spectrum → different statistical properties, no i.i.d. sets ● 40 000 x 20 000 pixels for a regular 1 km global resolution ● Multiple scales ● Scale invariant features ● No ImageNet → and difficult to have, example: labelling clouds Images Deep learning challenges in Earth System science
  • 33. ● Beyond visible spectrum → different statistical properties, no i.i.d. sets ● 40 000 x 20 000 pixels for a regular 1 km global resolution ● Multiple scales ● Scale invariant features ● No ImageNet → and difficult to have, example: labelling clouds ● Missing data → a solution Christopher Kadow, the leader of DKRZ machine learning research group, adapted the Nvidia Technology for image inpainting Deep learning challenges in Earth System science Images
  • 34.
  • 35. ‘ground-truth’ original data masked data missing values reconstruction by Deep Convolutional NN Image inpainting to reconstruct temperature missing observations
  • 37. Physical models ML and DL models Lightweighting/simplifying/speeding up physical models ● improve parametrizations ● analysis of model-observations mismatch ● emulation
  • 38. Physical models ML and DL models Domain knowledge can guide/optimize the pure data-driven methods ● design the architecture ● constrain the cost (or reward) function ● physically based data augmentation: expansion of the data set for undersampled regions
  • 39. Depth (m) Temp (°C) feature prediction Example: lakes simulations to predict temperature from depth measurements Physical model example: Tempd+1 = Temp d + sun - wind - upwelling given that we measured Td=surface = 15°C
  • 40. Depth (m) Temp (°C) feature prediction Physical model example: Tempd+1 = Temp d + sun - wind - upwelling given that we measured Td=surface = 15°C Moderate model skills and of course zero inconsistency
  • 41. Depth (m) Temp (°C) feature prediction Neural Network might allow negative densities! _ Better model skills and but the inconsistency spreads
  • 42. Depth (m) Density (g/L) Temp (°C) features prediction DATA AUGMENTATION: to include new features that comes from physical knowledge and then NN
  • 43. Depth (m) Density (g/L) Temp (°C) features prediction DATA AUGMENTATION: to include new features that comes from physical knowledge and then NN Even better model skills and a bit less inconsistency but it still spreads
  • 44. Depth (m) Density (g/L) Temp (°C) ✓ X features prediction physical model + NN + constrain loss function: denser water must be deeper
  • 45. Depth (m) Density (g/L) Temp (°C) ✓ X features prediction physical model + NN + constrain loss function: denser water must be deeper Totally consistent and high model skills! Great model performance (~1°C less error) and totally consistent
  • 46. References ● Earth System figure: https://karenbakker.org/the-climate-system/ ● Data cube image: Earth Syst. Dynam., 11, 201–234 (2020) https://doi.org/10.5194/esd-11-201-2020 ● Climate model image: A. Gettelman and R.B. Rood, Demystifying Climate Models, Earth Systems Data and Models 2, doi: 10.1007/978-3-662-48959-8_5 ● Multimodel figure: Michael Böttinger (DKRZ) and Joachem Marotzke (Max Planck Institute Meteorology) ● Mistral picture: Michael Böttinger (DKRZ) . ● Wildfire picture: https://pixnio.com/miscellaneous/fire-flames-pictures/aerial-ignition-interior-high -rates-of-spread-in-open-savannas ● Hockey stick IPCC https://www.ipcc.ch/report/ar3/wg1/ (Chapter 2) ● Scale invariant issue with chiguagua and dingo: Christian Staudt http://clstaudt.me ● Scale free image http://paulbourke.net/fractals/googleearth/ ● Image inpainting: ○ Nvidia Technology https://www.nvidia.com/research/inpainting/ ○ Kadow et al. (2020), Artificial Intelligence reconstructs missing Climate Information (in review) ● Physics-guided neural networks : https://arxiv.org/pdf/1710.11431.pdf and https://towardsdatascience.com/physics-guided-neural-networks-pgnns-8fe9dbad9414
  • 47. DKRZ Unit: Machine Learning as a Service ● Provide a knowledge base ● Bring prototypes to production ● Train, educate, and exchange
  • 49. Summary of the main topics discussed in the kick-off workshop ● Applying machine learning to Earth System modelling ○ Hybrid approaches to (i) improve parametrizations and (ii) validate physical models ○ Increase the availability of training data via (i) data augmentation and (ii) labelling ○ Infer causality of the patterns found in observational data ● Technology ○ Support for Python, portation to GPUs, and larger memory ○ Machine learning libraries for NetDCF data handling ○ Adaptive learning integrated with physical models during running time on the HPC ○ Distributed training and execution ○ Portability between HPC centres ● Uncertainty and reproducibility ○ Performance metrics for (i) unsupervised learning and (ii) data shift/concept drift ○ Adoption of interpretable models and uncertainty quantification and explainability methods ○ Sharing of training scripts and training data or trained model ● Community activities and capacity building: workshops, summer schools,... Artificial intelligence and machine learning activities at DKRZ EXTRASLIDE
  • 50. Climate models ● physical models derived from first principles (mechanistic) ● used to simulate how the Earth’s climate changes in time (dynamical) ● written in the form of coupled differential equations ● solution depends on boundary and initial conditions ● run with different conditions allows to create scenarios (see for instance, RCP) ● solved numerically with long lasting parallel runs ● calibrated and validated against observational data ● results are called model data EXTRASLIDE year RCP 2.6: the best case scenario historical RPC 8.5: business as usualExample of a basic climate model including time (t) and space (x): Tempt+1,x+1 = Temp t,x + warmingt,x - coolingt,x given that Tempt=0 = 25°C
  • 51. High Performance Computing Data Center Ongoing efforts to reduce our carbon footprint: ● Power Usage Effectiveness ~1 (PUE = 1.19) ● Cold aisle containment reduce CO2 emissions by 20% ● Hot air is recycled for heating nearby facilities ● Cooling water is recycled in our toilets ● Greener energy supplier possible BEFORE AFTER EXTRASLIDE