SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
“High-throughput computing for
materials databases and materials
design”
Dane Morgan, Henry Wu, Tam Mayeshiba
University of Wisconsin
Open Science Grid User School
July 29, 2016
Univ. of Wisconsin – Madison, WI
High-throughput computing for materials
databases and materials design
Dane Morgan
Tam Mayeshiba, Henry Wu, and
many others …
Open Science Grid User School
July 29, 2016
Univ. of Wisconsin – Madison, WI
Funding Acknowledgements
Software Infrastructure
for Sustained Innovation
(SI2) award No. 1148011
National Energy Research
Scientific Computing Center
Outline
High-throughput Molecular Simulation
Alloy Diffusion Database
4
Outline
High-throughput Molecular Simulation
Alloy Diffusion Database
5
The Dream of Molecular
Computational Materials Science
6
Understand, Optimize, Discover Materials
Atomic Understanding Computation
http://www.teslasociety.com/nyautoshow.htm
+Experiment
http://foreverfreshindoorgarden.wordpress.com/tag/compost-experiment/
This dream is being realized
This is a major transformation
Ab Initio Methods
a
b
c
, ,
1 12
2
I
i
i I i i i
i ii I
Z
H
r rr R  
 
      
  
H 
Electronic Structure
Band structure,
magnetism, …
Energies, Forces
Atomic positions,
phase stability, …
Composition
Structure
Materials
Properties
Drivers for Transformation
8
Fundamental theory
ˆH =
-
1
2
ÑI
2
I
å +
Z j
Z ¢j
Rj
- R ¢j
j, ¢j
å +
Z j
ri
- Rj
i, j
å
-
1
2
Ñi
2
i
å +
1
ri
- r¢i
i, ¢i
å
é
ë
ê
ê
ê
ê
ê
ê
ù
û
ú
ú
ú
ú
ú
ú
Modeling methods
Computational power
Unprecedented
transformation in
Understanding
Design
Data
A Simple but Powerful Message
Computation Is Scalable
A Simple but Powerful Message
If you can compute it once
Then with some automation
You can compute it a lot
Outline
High-throughput Molecular Simulation
Alloy Diffusion Database
11
Ab Initio Methods and Diffusion in
Solids
• Diffusion typically occurs by jumps
between stable sites
• Jump rates depends on attempt rates
and migration barriers, which can be
calculated ab initio
Rate =n exp -Em kT( )
• Diffusion coefficients (D) can be
calculated from jump rates analytically
• D’s are critical for design of Li ion
batteries, solid oxide fuel cells,
semiconductor devices, steels, …
Brett, et al, Chem. Soc. Rev. ‘08
0
1
2
3
4
0
2
4
6
8
10
12
0 500 1000 1500 2000
Totalcost(millions$)
Timetocomplete(years)
Cost for computers ($1000)
High-Throughput for Dilute Alloys
Resource needs
• ~50 viable pure elemental
systems in each structure 
~10,000 dilute B in A alloy-
structure systems (maybe ~5%
known)
• 1 system takes ~20k/core-hours
(~9 days on 100 cores (= $20k))
• So need 2×108 core-hours or
~23k core-years, 1 postdoc.
Determine Dv* for A1-xBx (x<<1) for all elements A,B in the
common (FCC, BCC, HCP, Diamond) crystal structures
6 years/$2m
MAterials Simulation Toolkit (MAST)
The MAterials Simulation Toolkit (MAST) is an automated workflow manager and
post-processing tool primarily designed to perform atomic simulation calculations
for diffusion and defect workflows, especially using density functional theory as
implemented by the Vienna Ab-initio Simulation Package (VASP).
https://pypi.python.org/pypi/MAST
MAST Workflow Management
overview
15
Crontab or user command
Jobs on queue
Workflow
directories with
input files and
metadata
New jobs to
queue
Finished workflows
to archive
Evaluate workflow
logic
(DAG)
Actual diffusion workflow schematic
16
32 steps (not all steps are shown)
Using Open Science Grid - Problems
• Our typical unit of one diffusion coefficient is
~20k CPU hours – clearly needs to be broken
up for OSG
• Single ab initio calculations tend to be
significantly parallel (~16-128 cores) and long
(10-100h) – poor match for OSG
• MAST workflow manager not initially
compatible with OSG (MAST runs from a
managing shared home directory)
Using Open Science Grid - Solutions
• Consider the smallest steps in our ~20k CPU hour workflow and
build on those (single step calculations).
• Restrict to specific types of nodes with 16-20 parallel cores available
on one node.
• Chose materials carefully to be fast (few electrons) so jobs can
usually finish within 24h soft limit on OSG machines.
• Manage workflow differently
– Idea 1: Adapted MAST to CHTC by sending all tools needed on the
home directory (MAST, related directories, python language, etc.) to
compute node with job. Worked, but had to send a lot back and forth
and managing the directories to avoid workflow errors (e.g.,
overwriting) was very hard.
– Idea 2: Used MAST to set up workflow DAG and then transcribed to
use DAGMAN workflow manager in CONDOR on OSG. Better! But
reduces error checking ability.
Open Science Grid Usage
• Used about 2.6m CPU hours over 15m
– About 1.5m CPU hours dedicated to production
runs for diffusion project.
• Ran about 80 diffusion coefficients.
• Integrated with traditional HPC (XSEDE,
NERSC) for larger runs.
Diffusion Database
http://diffusiondata.materialshub.org/
https://www.engr.wisc.edu/making-massive-materials-data-sets-tools-accessible/
• Impurity diffusion of X in host H
for over 350 systems.
• Largest diffusion database from
a single group in the world.
New science and critical design
data.
• Data disseminated through web
– Web application for plotting
and exploring data
– All data available from figshare
with permanent DOI.
Wu, et al., Scientific Data, ‘16
Diffusion Database
http://diffusiondata.materialshub.org/
https://www.engr.wisc.edu/making-massive-materials-data-sets-tools-accessible/
Wu, et al., Scientific Data, ‘16
BCC FCC
Very different trends between FCC and BCC – need large database to discover this.
Excess Formation Volume Computation
• Disordered binary mixtures of elements A and B for BCC and FCC crystal
structures at various compositions (A, A0.75B0.25, A0. 5B0.5, A0.25B0.75, and B)
• A, B = Al, Co, Cu, Fe, Mg, Mo, Nb, Ni, and Ti
9 pure elements, and a combined 36 unique elemental pairs.
• Use 3 different special quasi-random structures (SQS) for each
crystal structure.
• Each SQS is optimized for all three mixtures (25%, 50%, and 75%).
• Calculate formation volume with DFT, spin-polarized:
• Iterative relaxation between ionic relaxation and volume relaxation.
• At least 3 repeats of the above iteration.
• Total number of calculations:
• (2 structures)×(36 pairs)×(3 SQS)×(3 compositions)×(6 DFT) =
• = 3888 DFT calculations.
Excess Formation Volume Results
Al-Cu Al-Ni Al-Co
A0 -1.8600 -4.0792 -5.5367
A1 1.4111 0.0389 -1.9511
A2 0.6178 -1.6611 0.2356
We are able to generate a
large amount of accurate
data and can extract valuable
thermodynamic parameters.
Excess volume - fit to 2nd order Redlich-Kister Polynomial
Vexcess = A0XAXB + A1XAXB(XA - XB )+ A2XAXB(XA - XB )2
0 0.25 0.5 0.75 1
XB
, atomic fraction of B
-2
-1.5
-1
-0.5
0
ExcessVolume[Å
3
/atom]
A=Al B=Cu
A=Al B=Ni
A=Al B=Co
FCC
Summary
• We have developed an approach
to successfully run large sets of
high-throughput ab initio
calculations for materials design
using OSG.
• We have used over 2.6m CPU
hours over the last ~2years to
develop the world’s largest
diffusion database from a single
research group.
• Enables many other materials
properties calculations which we
are exploring, e.g., alloy
volumes, oxide defects, etc. …
Thank You
Any Questions?

Weitere ähnliche Inhalte

Was ist angesagt?

Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Designaimsnist
 
Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...aimsnist
 
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...aimsnist
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...KAMAL CHOUDHARY
 
Applications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NRELApplications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NRELaimsnist
 
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...KAMAL CHOUDHARY
 
High-throughput discovery of low-dimensional and topologically non-trivial ma...
High-throughput discovery of low-dimensional and topologically non-trivial ma...High-throughput discovery of low-dimensional and topologically non-trivial ma...
High-throughput discovery of low-dimensional and topologically non-trivial ma...KAMAL CHOUDHARY
 
Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationKAMAL CHOUDHARY
 
Automated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design ProblemsAutomated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design ProblemsAnubhav Jain
 
Database of Topological Materials and Spin-orbit Spillage
Database of Topological Materials and Spin-orbit SpillageDatabase of Topological Materials and Spin-orbit Spillage
Database of Topological Materials and Spin-orbit SpillageKAMAL CHOUDHARY
 
2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML modelaimsnist
 
Conducting and Enabling Data-Driven Research Through the Materials Project
Conducting and Enabling Data-Driven Research Through the Materials ProjectConducting and Enabling Data-Driven Research Through the Materials Project
Conducting and Enabling Data-Driven Research Through the Materials ProjectAnubhav Jain
 
Graphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials ScienceGraphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials Scienceaimsnist
 
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...aimsnist
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Anubhav Jain
 
Quantum Computation for Predicting Electron and Phonon Properties of Solids
Quantum Computation for Predicting Electron and Phonon Properties of SolidsQuantum Computation for Predicting Electron and Phonon Properties of Solids
Quantum Computation for Predicting Electron and Phonon Properties of SolidsKAMAL CHOUDHARY
 
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...Anubhav Jain
 
Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...Anubhav Jain
 
Machine Learning Platform for Catalyst Design
Machine Learning Platform for Catalyst DesignMachine Learning Platform for Catalyst Design
Machine Learning Platform for Catalyst DesignAnubhav Jain
 
Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Anubhav Jain
 

Was ist angesagt? (20)

Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
 
Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...
 
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
 
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
Computational Discovery of Two-Dimensional Materials, Evaluation of Force-Fie...
 
Applications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NRELApplications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NREL
 
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
 
High-throughput discovery of low-dimensional and topologically non-trivial ma...
High-throughput discovery of low-dimensional and topologically non-trivial ma...High-throughput discovery of low-dimensional and topologically non-trivial ma...
High-throughput discovery of low-dimensional and topologically non-trivial ma...
 
Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum Computation
 
Automated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design ProblemsAutomated Machine Learning Applied to Diverse Materials Design Problems
Automated Machine Learning Applied to Diverse Materials Design Problems
 
Database of Topological Materials and Spin-orbit Spillage
Database of Topological Materials and Spin-orbit SpillageDatabase of Topological Materials and Spin-orbit Spillage
Database of Topological Materials and Spin-orbit Spillage
 
2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model
 
Conducting and Enabling Data-Driven Research Through the Materials Project
Conducting and Enabling Data-Driven Research Through the Materials ProjectConducting and Enabling Data-Driven Research Through the Materials Project
Conducting and Enabling Data-Driven Research Through the Materials Project
 
Graphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials ScienceGraphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials Science
 
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...
 
Quantum Computation for Predicting Electron and Phonon Properties of Solids
Quantum Computation for Predicting Electron and Phonon Properties of SolidsQuantum Computation for Predicting Electron and Phonon Properties of Solids
Quantum Computation for Predicting Electron and Phonon Properties of Solids
 
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
Targeted Band Structure Design and Thermoelectric Materials Discovery Using H...
 
Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...Density functional theory calculations and data mining for new thermoelectric...
Density functional theory calculations and data mining for new thermoelectric...
 
Machine Learning Platform for Catalyst Design
Machine Learning Platform for Catalyst DesignMachine Learning Platform for Catalyst Design
Machine Learning Platform for Catalyst Design
 
Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...Software tools for data-driven research and their application to thermoelectr...
Software tools for data-driven research and their application to thermoelectr...
 

Ähnlich wie Morgan osg user school 2016 07-29 dist

Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsAnubhav Jain
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Anubhav Jain
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignKAMAL CHOUDHARY
 
Overview of the Exascale Additive Manufacturing Project
Overview of the Exascale Additive Manufacturing ProjectOverview of the Exascale Additive Manufacturing Project
Overview of the Exascale Additive Manufacturing Projectinside-BigData.com
 
Kobeworkshop pubchemqc project
Kobeworkshop pubchemqc projectKobeworkshop pubchemqc project
Kobeworkshop pubchemqc projectMaho Nakata
 
A Machine Learning Framework for Materials Knowledge Systems
A Machine Learning Framework for Materials Knowledge SystemsA Machine Learning Framework for Materials Knowledge Systems
A Machine Learning Framework for Materials Knowledge Systemsaimsnist
 
Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...Anubhav Jain
 
Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Aritra Sarkar
 
Colloborative computing
Colloborative computing Colloborative computing
Colloborative computing Cisco
 
Materials Project computation and database infrastructure
Materials Project computation and database infrastructureMaterials Project computation and database infrastructure
Materials Project computation and database infrastructureAnubhav Jain
 
How HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceHow HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceinside-BigData.com
 
PhD_10_2011_Abhijeet_Paul
PhD_10_2011_Abhijeet_PaulPhD_10_2011_Abhijeet_Paul
PhD_10_2011_Abhijeet_PaulAbhijeet Paul
 
Available methods for predicting materials synthesizability using computation...
Available methods for predicting materials synthesizability using computation...Available methods for predicting materials synthesizability using computation...
Available methods for predicting materials synthesizability using computation...Anubhav Jain
 
Big Fast Data in High-Energy Particle Physics
Big Fast Data in High-Energy Particle PhysicsBig Fast Data in High-Energy Particle Physics
Big Fast Data in High-Energy Particle PhysicsAndrew Lowe
 
Cycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC RunCycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC Runinside-BigData.com
 
The PubChemQC Project
The PubChemQC ProjectThe PubChemQC Project
The PubChemQC ProjectMaho Nakata
 
Modern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High PerformanceModern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High Performanceinside-BigData.com
 
TeraGrid and Physics Research
TeraGrid and Physics ResearchTeraGrid and Physics Research
TeraGrid and Physics Researchshandra_psc
 

Ähnlich wie Morgan osg user school 2016 07-29 dist (20)

Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...
 
Is 20TB really Big Data?
Is 20TB really Big Data?Is 20TB really Big Data?
Is 20TB really Big Data?
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
 
Overview of the Exascale Additive Manufacturing Project
Overview of the Exascale Additive Manufacturing ProjectOverview of the Exascale Additive Manufacturing Project
Overview of the Exascale Additive Manufacturing Project
 
Kobeworkshop pubchemqc project
Kobeworkshop pubchemqc projectKobeworkshop pubchemqc project
Kobeworkshop pubchemqc project
 
A Machine Learning Framework for Materials Knowledge Systems
A Machine Learning Framework for Materials Knowledge SystemsA Machine Learning Framework for Materials Knowledge Systems
A Machine Learning Framework for Materials Knowledge Systems
 
Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...Discovering new functional materials for clean energy and beyond using high-t...
Discovering new functional materials for clean energy and beyond using high-t...
 
Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18
 
Colloborative computing
Colloborative computing Colloborative computing
Colloborative computing
 
Materials Project computation and database infrastructure
Materials Project computation and database infrastructureMaterials Project computation and database infrastructure
Materials Project computation and database infrastructure
 
How HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceHow HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental science
 
PhD_10_2011_Abhijeet_Paul
PhD_10_2011_Abhijeet_PaulPhD_10_2011_Abhijeet_Paul
PhD_10_2011_Abhijeet_Paul
 
Available methods for predicting materials synthesizability using computation...
Available methods for predicting materials synthesizability using computation...Available methods for predicting materials synthesizability using computation...
Available methods for predicting materials synthesizability using computation...
 
Big Fast Data in High-Energy Particle Physics
Big Fast Data in High-Energy Particle PhysicsBig Fast Data in High-Energy Particle Physics
Big Fast Data in High-Energy Particle Physics
 
Cycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC RunCycle Computing Record-breaking Petascale HPC Run
Cycle Computing Record-breaking Petascale HPC Run
 
The PubChemQC Project
The PubChemQC ProjectThe PubChemQC Project
The PubChemQC Project
 
01-10 Exploring new high potential 2D materials - Angioni.pdf
01-10 Exploring new high potential 2D materials - Angioni.pdf01-10 Exploring new high potential 2D materials - Angioni.pdf
01-10 Exploring new high potential 2D materials - Angioni.pdf
 
Modern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High PerformanceModern Computing: Cloud, Distributed, & High Performance
Modern Computing: Cloud, Distributed, & High Performance
 
TeraGrid and Physics Research
TeraGrid and Physics ResearchTeraGrid and Physics Research
TeraGrid and Physics Research
 

Mehr von ddm314

Morgan uw mse900 2020 040-25 v2.0
Morgan uw mse900 2020 040-25 v2.0Morgan uw mse900 2020 040-25 v2.0
Morgan uw mse900 2020 040-25 v2.0ddm314
 
2019 09-06 skunkworks q&amp;a information session v2.1 dist
2019 09-06 skunkworks q&amp;a information session v2.1 dist2019 09-06 skunkworks q&amp;a information session v2.1 dist
2019 09-06 skunkworks q&amp;a information session v2.1 distddm314
 
Ema 20190124 v1.4_dist
Ema 20190124 v1.4_distEma 20190124 v1.4_dist
Ema 20190124 v1.4_distddm314
 
2018 09-07 skunkworks q&amp;a information session v1.2
2018 09-07 skunkworks q&amp;a information session v1.22018 09-07 skunkworks q&amp;a information session v1.2
2018 09-07 skunkworks q&amp;a information session v1.2ddm314
 
2017-09-08 skunkworks q&amp;a information session v1.0 distr
2017-09-08 skunkworks q&amp;a information session v1.0 distr2017-09-08 skunkworks q&amp;a information session v1.0 distr
2017-09-08 skunkworks q&amp;a information session v1.0 distrddm314
 
2016 09-06v3 skunkworks q&amp;a information session public
2016 09-06v3 skunkworks q&amp;a information session public2016 09-06v3 skunkworks q&amp;a information session public
2016 09-06v3 skunkworks q&amp;a information session publicddm314
 
Zach and Aren talk on Materials Informatics at UW WIMSEA 2016-02-12
Zach and Aren talk on Materials Informatics at UW WIMSEA 2016-02-12Zach and Aren talk on Materials Informatics at UW WIMSEA 2016-02-12
Zach and Aren talk on Materials Informatics at UW WIMSEA 2016-02-12ddm314
 
Mat informatics opportunties fisherbarton 2015 12-07 1.1
Mat informatics opportunties fisherbarton 2015 12-07 1.1Mat informatics opportunties fisherbarton 2015 12-07 1.1
Mat informatics opportunties fisherbarton 2015 12-07 1.1ddm314
 
Skunworks Final poster 2015-09-21_eau_claire
Skunworks Final poster 2015-09-21_eau_claireSkunworks Final poster 2015-09-21_eau_claire
Skunworks Final poster 2015-09-21_eau_claireddm314
 
UW Materials Informatics 2015-09-21 v2.0 dist
UW Materials Informatics 2015-09-21 v2.0 distUW Materials Informatics 2015-09-21 v2.0 dist
UW Materials Informatics 2015-09-21 v2.0 distddm314
 
Morgan mgi meeting 2015 01-11 v2.0 distribution
Morgan mgi meeting 2015 01-11 v2.0 distributionMorgan mgi meeting 2015 01-11 v2.0 distribution
Morgan mgi meeting 2015 01-11 v2.0 distributionddm314
 
Morgan tms ods 2015 02-29 v4.4 dist
Morgan tms ods 2015 02-29 v4.4 distMorgan tms ods 2015 02-29 v4.4 dist
Morgan tms ods 2015 02-29 v4.4 distddm314
 

Mehr von ddm314 (12)

Morgan uw mse900 2020 040-25 v2.0
Morgan uw mse900 2020 040-25 v2.0Morgan uw mse900 2020 040-25 v2.0
Morgan uw mse900 2020 040-25 v2.0
 
2019 09-06 skunkworks q&amp;a information session v2.1 dist
2019 09-06 skunkworks q&amp;a information session v2.1 dist2019 09-06 skunkworks q&amp;a information session v2.1 dist
2019 09-06 skunkworks q&amp;a information session v2.1 dist
 
Ema 20190124 v1.4_dist
Ema 20190124 v1.4_distEma 20190124 v1.4_dist
Ema 20190124 v1.4_dist
 
2018 09-07 skunkworks q&amp;a information session v1.2
2018 09-07 skunkworks q&amp;a information session v1.22018 09-07 skunkworks q&amp;a information session v1.2
2018 09-07 skunkworks q&amp;a information session v1.2
 
2017-09-08 skunkworks q&amp;a information session v1.0 distr
2017-09-08 skunkworks q&amp;a information session v1.0 distr2017-09-08 skunkworks q&amp;a information session v1.0 distr
2017-09-08 skunkworks q&amp;a information session v1.0 distr
 
2016 09-06v3 skunkworks q&amp;a information session public
2016 09-06v3 skunkworks q&amp;a information session public2016 09-06v3 skunkworks q&amp;a information session public
2016 09-06v3 skunkworks q&amp;a information session public
 
Zach and Aren talk on Materials Informatics at UW WIMSEA 2016-02-12
Zach and Aren talk on Materials Informatics at UW WIMSEA 2016-02-12Zach and Aren talk on Materials Informatics at UW WIMSEA 2016-02-12
Zach and Aren talk on Materials Informatics at UW WIMSEA 2016-02-12
 
Mat informatics opportunties fisherbarton 2015 12-07 1.1
Mat informatics opportunties fisherbarton 2015 12-07 1.1Mat informatics opportunties fisherbarton 2015 12-07 1.1
Mat informatics opportunties fisherbarton 2015 12-07 1.1
 
Skunworks Final poster 2015-09-21_eau_claire
Skunworks Final poster 2015-09-21_eau_claireSkunworks Final poster 2015-09-21_eau_claire
Skunworks Final poster 2015-09-21_eau_claire
 
UW Materials Informatics 2015-09-21 v2.0 dist
UW Materials Informatics 2015-09-21 v2.0 distUW Materials Informatics 2015-09-21 v2.0 dist
UW Materials Informatics 2015-09-21 v2.0 dist
 
Morgan mgi meeting 2015 01-11 v2.0 distribution
Morgan mgi meeting 2015 01-11 v2.0 distributionMorgan mgi meeting 2015 01-11 v2.0 distribution
Morgan mgi meeting 2015 01-11 v2.0 distribution
 
Morgan tms ods 2015 02-29 v4.4 dist
Morgan tms ods 2015 02-29 v4.4 distMorgan tms ods 2015 02-29 v4.4 dist
Morgan tms ods 2015 02-29 v4.4 dist
 

Kürzlich hochgeladen

sdfsadopkjpiosufoiasdoifjasldkjfl a asldkjflaskdjflkjsdsdf
sdfsadopkjpiosufoiasdoifjasldkjfl a asldkjflaskdjflkjsdsdfsdfsadopkjpiosufoiasdoifjasldkjfl a asldkjflaskdjflkjsdsdf
sdfsadopkjpiosufoiasdoifjasldkjfl a asldkjflaskdjflkjsdsdfJulia Kaye
 
Dev.bg DevOps March 2024 Monitoring & Logging
Dev.bg DevOps March 2024 Monitoring & LoggingDev.bg DevOps March 2024 Monitoring & Logging
Dev.bg DevOps March 2024 Monitoring & LoggingMarian Marinov
 
GENERAL CONDITIONS FOR CONTRACTS OF CIVIL ENGINEERING WORKS
GENERAL CONDITIONS  FOR  CONTRACTS OF CIVIL ENGINEERING WORKS GENERAL CONDITIONS  FOR  CONTRACTS OF CIVIL ENGINEERING WORKS
GENERAL CONDITIONS FOR CONTRACTS OF CIVIL ENGINEERING WORKS Bahzad5
 
Guardians and Glitches: Navigating the Duality of Gen AI in AppSec
Guardians and Glitches: Navigating the Duality of Gen AI in AppSecGuardians and Glitches: Navigating the Duality of Gen AI in AppSec
Guardians and Glitches: Navigating the Duality of Gen AI in AppSecTrupti Shiralkar, CISSP
 
nvidia AI-gtc 2024 partial slide deck.pptx
nvidia AI-gtc 2024 partial slide deck.pptxnvidia AI-gtc 2024 partial slide deck.pptx
nvidia AI-gtc 2024 partial slide deck.pptxjasonsedano2
 
Renewable Energy & Entrepreneurship Workshop_21Feb2024.pdf
Renewable Energy & Entrepreneurship Workshop_21Feb2024.pdfRenewable Energy & Entrepreneurship Workshop_21Feb2024.pdf
Renewable Energy & Entrepreneurship Workshop_21Feb2024.pdfodunowoeminence2019
 
Power System electrical and electronics .pptx
Power System electrical and electronics .pptxPower System electrical and electronics .pptx
Power System electrical and electronics .pptxMUKULKUMAR210
 
How to Write a Good Scientific Paper.pdf
How to Write a Good Scientific Paper.pdfHow to Write a Good Scientific Paper.pdf
How to Write a Good Scientific Paper.pdfRedhwan Qasem Shaddad
 
Multicomponent Spiral Wound Membrane Separation Model.pdf
Multicomponent Spiral Wound Membrane Separation Model.pdfMulticomponent Spiral Wound Membrane Separation Model.pdf
Multicomponent Spiral Wound Membrane Separation Model.pdfGiovanaGhasary1
 
A Seminar on Electric Vehicle Software Simulation
A Seminar on Electric Vehicle Software SimulationA Seminar on Electric Vehicle Software Simulation
A Seminar on Electric Vehicle Software SimulationMohsinKhanA
 
Clutches and brkesSelect any 3 position random motion out of real world and d...
Clutches and brkesSelect any 3 position random motion out of real world and d...Clutches and brkesSelect any 3 position random motion out of real world and d...
Clutches and brkesSelect any 3 position random motion out of real world and d...sahb78428
 
Technology Features of Apollo HDD Machine, Its Technical Specification with C...
Technology Features of Apollo HDD Machine, Its Technical Specification with C...Technology Features of Apollo HDD Machine, Its Technical Specification with C...
Technology Features of Apollo HDD Machine, Its Technical Specification with C...Apollo Techno Industries Pvt Ltd
 
Mohs Scale of Hardness, Hardness Scale.pptx
Mohs Scale of Hardness, Hardness Scale.pptxMohs Scale of Hardness, Hardness Scale.pptx
Mohs Scale of Hardness, Hardness Scale.pptxKISHAN KUMAR
 
Best-NO1 Best Rohani Amil In Lahore Kala Ilam In Lahore Kala Jadu Amil In Lah...
Best-NO1 Best Rohani Amil In Lahore Kala Ilam In Lahore Kala Jadu Amil In Lah...Best-NO1 Best Rohani Amil In Lahore Kala Ilam In Lahore Kala Jadu Amil In Lah...
Best-NO1 Best Rohani Amil In Lahore Kala Ilam In Lahore Kala Jadu Amil In Lah...Amil baba
 
Transforming Process Safety Management: Challenges, Benefits, and Transition ...
Transforming Process Safety Management: Challenges, Benefits, and Transition ...Transforming Process Safety Management: Challenges, Benefits, and Transition ...
Transforming Process Safety Management: Challenges, Benefits, and Transition ...soginsider
 
ASME BPVC 2023 Section I para leer y entender
ASME BPVC 2023 Section I para leer y entenderASME BPVC 2023 Section I para leer y entender
ASME BPVC 2023 Section I para leer y entenderjuancarlos286641
 

Kürzlich hochgeladen (20)

sdfsadopkjpiosufoiasdoifjasldkjfl a asldkjflaskdjflkjsdsdf
sdfsadopkjpiosufoiasdoifjasldkjfl a asldkjflaskdjflkjsdsdfsdfsadopkjpiosufoiasdoifjasldkjfl a asldkjflaskdjflkjsdsdf
sdfsadopkjpiosufoiasdoifjasldkjfl a asldkjflaskdjflkjsdsdf
 
Dev.bg DevOps March 2024 Monitoring & Logging
Dev.bg DevOps March 2024 Monitoring & LoggingDev.bg DevOps March 2024 Monitoring & Logging
Dev.bg DevOps March 2024 Monitoring & Logging
 
GENERAL CONDITIONS FOR CONTRACTS OF CIVIL ENGINEERING WORKS
GENERAL CONDITIONS  FOR  CONTRACTS OF CIVIL ENGINEERING WORKS GENERAL CONDITIONS  FOR  CONTRACTS OF CIVIL ENGINEERING WORKS
GENERAL CONDITIONS FOR CONTRACTS OF CIVIL ENGINEERING WORKS
 
Guardians and Glitches: Navigating the Duality of Gen AI in AppSec
Guardians and Glitches: Navigating the Duality of Gen AI in AppSecGuardians and Glitches: Navigating the Duality of Gen AI in AppSec
Guardians and Glitches: Navigating the Duality of Gen AI in AppSec
 
Lecture 2 .pptx
Lecture 2                            .pptxLecture 2                            .pptx
Lecture 2 .pptx
 
nvidia AI-gtc 2024 partial slide deck.pptx
nvidia AI-gtc 2024 partial slide deck.pptxnvidia AI-gtc 2024 partial slide deck.pptx
nvidia AI-gtc 2024 partial slide deck.pptx
 
Renewable Energy & Entrepreneurship Workshop_21Feb2024.pdf
Renewable Energy & Entrepreneurship Workshop_21Feb2024.pdfRenewable Energy & Entrepreneurship Workshop_21Feb2024.pdf
Renewable Energy & Entrepreneurship Workshop_21Feb2024.pdf
 
Litature Review: Research Paper work for Engineering
Litature Review: Research Paper work for EngineeringLitature Review: Research Paper work for Engineering
Litature Review: Research Paper work for Engineering
 
計劃趕得上變化
計劃趕得上變化計劃趕得上變化
計劃趕得上變化
 
Lecture 4 .pdf
Lecture 4                              .pdfLecture 4                              .pdf
Lecture 4 .pdf
 
Power System electrical and electronics .pptx
Power System electrical and electronics .pptxPower System electrical and electronics .pptx
Power System electrical and electronics .pptx
 
How to Write a Good Scientific Paper.pdf
How to Write a Good Scientific Paper.pdfHow to Write a Good Scientific Paper.pdf
How to Write a Good Scientific Paper.pdf
 
Multicomponent Spiral Wound Membrane Separation Model.pdf
Multicomponent Spiral Wound Membrane Separation Model.pdfMulticomponent Spiral Wound Membrane Separation Model.pdf
Multicomponent Spiral Wound Membrane Separation Model.pdf
 
A Seminar on Electric Vehicle Software Simulation
A Seminar on Electric Vehicle Software SimulationA Seminar on Electric Vehicle Software Simulation
A Seminar on Electric Vehicle Software Simulation
 
Clutches and brkesSelect any 3 position random motion out of real world and d...
Clutches and brkesSelect any 3 position random motion out of real world and d...Clutches and brkesSelect any 3 position random motion out of real world and d...
Clutches and brkesSelect any 3 position random motion out of real world and d...
 
Technology Features of Apollo HDD Machine, Its Technical Specification with C...
Technology Features of Apollo HDD Machine, Its Technical Specification with C...Technology Features of Apollo HDD Machine, Its Technical Specification with C...
Technology Features of Apollo HDD Machine, Its Technical Specification with C...
 
Mohs Scale of Hardness, Hardness Scale.pptx
Mohs Scale of Hardness, Hardness Scale.pptxMohs Scale of Hardness, Hardness Scale.pptx
Mohs Scale of Hardness, Hardness Scale.pptx
 
Best-NO1 Best Rohani Amil In Lahore Kala Ilam In Lahore Kala Jadu Amil In Lah...
Best-NO1 Best Rohani Amil In Lahore Kala Ilam In Lahore Kala Jadu Amil In Lah...Best-NO1 Best Rohani Amil In Lahore Kala Ilam In Lahore Kala Jadu Amil In Lah...
Best-NO1 Best Rohani Amil In Lahore Kala Ilam In Lahore Kala Jadu Amil In Lah...
 
Transforming Process Safety Management: Challenges, Benefits, and Transition ...
Transforming Process Safety Management: Challenges, Benefits, and Transition ...Transforming Process Safety Management: Challenges, Benefits, and Transition ...
Transforming Process Safety Management: Challenges, Benefits, and Transition ...
 
ASME BPVC 2023 Section I para leer y entender
ASME BPVC 2023 Section I para leer y entenderASME BPVC 2023 Section I para leer y entender
ASME BPVC 2023 Section I para leer y entender
 

Morgan osg user school 2016 07-29 dist

  • 1. “High-throughput computing for materials databases and materials design” Dane Morgan, Henry Wu, Tam Mayeshiba University of Wisconsin Open Science Grid User School July 29, 2016 Univ. of Wisconsin – Madison, WI
  • 2. High-throughput computing for materials databases and materials design Dane Morgan Tam Mayeshiba, Henry Wu, and many others … Open Science Grid User School July 29, 2016 Univ. of Wisconsin – Madison, WI
  • 3. Funding Acknowledgements Software Infrastructure for Sustained Innovation (SI2) award No. 1148011 National Energy Research Scientific Computing Center
  • 6. The Dream of Molecular Computational Materials Science 6 Understand, Optimize, Discover Materials Atomic Understanding Computation http://www.teslasociety.com/nyautoshow.htm +Experiment http://foreverfreshindoorgarden.wordpress.com/tag/compost-experiment/ This dream is being realized This is a major transformation
  • 7. Ab Initio Methods a b c , , 1 12 2 I i i I i i i i ii I Z H r rr R               H  Electronic Structure Band structure, magnetism, … Energies, Forces Atomic positions, phase stability, … Composition Structure Materials Properties
  • 8. Drivers for Transformation 8 Fundamental theory ˆH = - 1 2 ÑI 2 I å + Z j Z ¢j Rj - R ¢j j, ¢j å + Z j ri - Rj i, j å - 1 2 Ñi 2 i å + 1 ri - r¢i i, ¢i å é ë ê ê ê ê ê ê ù û ú ú ú ú ú ú Modeling methods Computational power Unprecedented transformation in Understanding Design Data
  • 9. A Simple but Powerful Message Computation Is Scalable
  • 10. A Simple but Powerful Message If you can compute it once Then with some automation You can compute it a lot
  • 12. Ab Initio Methods and Diffusion in Solids • Diffusion typically occurs by jumps between stable sites • Jump rates depends on attempt rates and migration barriers, which can be calculated ab initio Rate =n exp -Em kT( ) • Diffusion coefficients (D) can be calculated from jump rates analytically • D’s are critical for design of Li ion batteries, solid oxide fuel cells, semiconductor devices, steels, … Brett, et al, Chem. Soc. Rev. ‘08
  • 13. 0 1 2 3 4 0 2 4 6 8 10 12 0 500 1000 1500 2000 Totalcost(millions$) Timetocomplete(years) Cost for computers ($1000) High-Throughput for Dilute Alloys Resource needs • ~50 viable pure elemental systems in each structure  ~10,000 dilute B in A alloy- structure systems (maybe ~5% known) • 1 system takes ~20k/core-hours (~9 days on 100 cores (= $20k)) • So need 2×108 core-hours or ~23k core-years, 1 postdoc. Determine Dv* for A1-xBx (x<<1) for all elements A,B in the common (FCC, BCC, HCP, Diamond) crystal structures 6 years/$2m
  • 14. MAterials Simulation Toolkit (MAST) The MAterials Simulation Toolkit (MAST) is an automated workflow manager and post-processing tool primarily designed to perform atomic simulation calculations for diffusion and defect workflows, especially using density functional theory as implemented by the Vienna Ab-initio Simulation Package (VASP). https://pypi.python.org/pypi/MAST
  • 15. MAST Workflow Management overview 15 Crontab or user command Jobs on queue Workflow directories with input files and metadata New jobs to queue Finished workflows to archive Evaluate workflow logic (DAG)
  • 16. Actual diffusion workflow schematic 16 32 steps (not all steps are shown)
  • 17. Using Open Science Grid - Problems • Our typical unit of one diffusion coefficient is ~20k CPU hours – clearly needs to be broken up for OSG • Single ab initio calculations tend to be significantly parallel (~16-128 cores) and long (10-100h) – poor match for OSG • MAST workflow manager not initially compatible with OSG (MAST runs from a managing shared home directory)
  • 18. Using Open Science Grid - Solutions • Consider the smallest steps in our ~20k CPU hour workflow and build on those (single step calculations). • Restrict to specific types of nodes with 16-20 parallel cores available on one node. • Chose materials carefully to be fast (few electrons) so jobs can usually finish within 24h soft limit on OSG machines. • Manage workflow differently – Idea 1: Adapted MAST to CHTC by sending all tools needed on the home directory (MAST, related directories, python language, etc.) to compute node with job. Worked, but had to send a lot back and forth and managing the directories to avoid workflow errors (e.g., overwriting) was very hard. – Idea 2: Used MAST to set up workflow DAG and then transcribed to use DAGMAN workflow manager in CONDOR on OSG. Better! But reduces error checking ability.
  • 19. Open Science Grid Usage • Used about 2.6m CPU hours over 15m – About 1.5m CPU hours dedicated to production runs for diffusion project. • Ran about 80 diffusion coefficients. • Integrated with traditional HPC (XSEDE, NERSC) for larger runs.
  • 20. Diffusion Database http://diffusiondata.materialshub.org/ https://www.engr.wisc.edu/making-massive-materials-data-sets-tools-accessible/ • Impurity diffusion of X in host H for over 350 systems. • Largest diffusion database from a single group in the world. New science and critical design data. • Data disseminated through web – Web application for plotting and exploring data – All data available from figshare with permanent DOI. Wu, et al., Scientific Data, ‘16
  • 21. Diffusion Database http://diffusiondata.materialshub.org/ https://www.engr.wisc.edu/making-massive-materials-data-sets-tools-accessible/ Wu, et al., Scientific Data, ‘16 BCC FCC Very different trends between FCC and BCC – need large database to discover this.
  • 22. Excess Formation Volume Computation • Disordered binary mixtures of elements A and B for BCC and FCC crystal structures at various compositions (A, A0.75B0.25, A0. 5B0.5, A0.25B0.75, and B) • A, B = Al, Co, Cu, Fe, Mg, Mo, Nb, Ni, and Ti 9 pure elements, and a combined 36 unique elemental pairs. • Use 3 different special quasi-random structures (SQS) for each crystal structure. • Each SQS is optimized for all three mixtures (25%, 50%, and 75%). • Calculate formation volume with DFT, spin-polarized: • Iterative relaxation between ionic relaxation and volume relaxation. • At least 3 repeats of the above iteration. • Total number of calculations: • (2 structures)×(36 pairs)×(3 SQS)×(3 compositions)×(6 DFT) = • = 3888 DFT calculations.
  • 23. Excess Formation Volume Results Al-Cu Al-Ni Al-Co A0 -1.8600 -4.0792 -5.5367 A1 1.4111 0.0389 -1.9511 A2 0.6178 -1.6611 0.2356 We are able to generate a large amount of accurate data and can extract valuable thermodynamic parameters. Excess volume - fit to 2nd order Redlich-Kister Polynomial Vexcess = A0XAXB + A1XAXB(XA - XB )+ A2XAXB(XA - XB )2 0 0.25 0.5 0.75 1 XB , atomic fraction of B -2 -1.5 -1 -0.5 0 ExcessVolume[Å 3 /atom] A=Al B=Cu A=Al B=Ni A=Al B=Co FCC
  • 24. Summary • We have developed an approach to successfully run large sets of high-throughput ab initio calculations for materials design using OSG. • We have used over 2.6m CPU hours over the last ~2years to develop the world’s largest diffusion database from a single research group. • Enables many other materials properties calculations which we are exploring, e.g., alloy volumes, oxide defects, etc. …