SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
Evgeny Blokhin
Chelyabinsk SUSU’2013 summer workshop
Max-Planck Institute for Solid State Research
Stuttgart, Germany
Materials informatics
Outlook
1. Data-mining in materials science
2. Blue Obelisk
3. Python programming language
What is data-mining?
statistics
databases
information theory machine learning
artificial intelligence
optimization
Data
mining
Tasks of data-mining
1. Classification
2. Prognosing
3. Visualization
4. Reasoning
5. Analysis
6. Expert systems
Big data in materials science
EXAMPLE: nearly for the last 4 years
with my colleagues-theoreticians we produced:
over 9000 simulation output files
over 50 articles
1. Accelrys Pipeline Pilot and Materials Studio, http://accelrys.com/products
2. AFLOW framework and Aflowlib.org repository, http://www.aflowlib.org
3. AIDA, Bosch LLC
4. Blue Obelisk Data Repository (XSLT, XML), http://bodr.sourceforge.net
5. CCLib (Python), http://cclib.sf.net
6. CDF (Python), http://kitchingroup.cheme.cmu.edu/cdf
7. CMR (Python), https://wiki.fysik.dtu.dk/cmr
8. Comp. Chem. Comparison and Benchmark Database, http://cccbdb.nist.gov
9. cctbx: Computational Crystallography Toolbox, http://cctbx.sourceforge.net
10. ESTEST (Python, XQuery), http://estest.ucdavis.edu
11. J-ICE online viewer (based on Jmol, Java), http://j-ice.sourceforge.net
12. Materials Project (Python), http://www.materialsproject.org
13. PAULING FILE world largest database for inorganic compounds, http://paulingfile.com
14. Quixote, http://quixote.wikispot.org
15. Scipio (Java), https://scipio.iciq.es
16. WebMO: Web-based interface to computational chemistry packages (Java,
Perl), http://webmo.net
New type of modeling software
…and smart codes
ENCUT = 500
IBRION = 2
ISIF = 3
NSW = 20
IDIOT = 3
NELMIN = 5
EDIFF = 1.0e-08
EDIFFG = -1.0e-08
IALGO = 38
ISMEAR = 0
LREAL = .FALSE.
LWAVE = .FALSE.
*** VASP MASTER: I AM SURE YOU KNOW WHAT
YOU ARE DOING ***
d-metal oxides
band gap problem
standard DFT GGA
approach
Hartree-Fock
admixing
LCAO
approximation
Usage of Gaussian
basis sets
good atomization
energy
Example of inference over an ontology
Open data, open standards, open source in
chemistry
Open data, open standards, open source in
chemistry
1.Elsevier, Wiley, Springer publishers are “evil”
2.“The right to read is right to mine”
3.“Jailbreaking” the scientific data from PDFs:
access, reuse, integrity
4.Why the level of collaboration is so low?
Materials Project
Prof. G. Ceder,
MIT, Boston
Guido van Rossum,
Google, Dropbox
http://goo.gl/FtFS7h
Python programming language
Advantages of Python
Syntax: tabulation, syntactic sugar, speech-
like, flexibility, expression
VERY fast prototyping
Great popularity in scientific community
100% cross-platform and portable
Disadvantages of Python
Relatively slow speed comparing to compiled
languages like C++ or Fortran
Global Interpreter Lock (GIL)
Historically not popular in some narrow
scientific areas (“reigns” of Java)
Two examples
list = [x**2 for x in range(10)]
numbers = [10, 4, 2, -1, 6]
filter(lambda x: x < 5, numbers)
1. Multi-dimensional array manipulation (fast!)
2. Discrete fourier transform
3. Linear Algebra
4. Mathematical functions
5. Matrix library
6. Polynomials
7. Set routines
8. Sorting, searching and counting
9. Statistics
eigvals, eigvecs = numpy.linalg.eigh(dynmat)
Solving eigenvalue problem for a
dynamical matrix (phonopy code):

Weitere ähnliche Inhalte

Was ist angesagt?

Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...GigaScience, BGI Hong Kong
 
What is DataCite-screenshots
What is DataCite-screenshotsWhat is DataCite-screenshots
What is DataCite-screenshotsdatacite
 
MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataHerbert Van de Sompel
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceRaul Palma
 
Proposal for Text Mining PubAg
Proposal for Text Mining PubAgProposal for Text Mining PubAg
Proposal for Text Mining PubAgJake Lever
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
 
Workshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningWorkshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningRoss Mounce
 
BioHackathon 2010 Intro
BioHackathon 2010 IntroBioHackathon 2010 Intro
BioHackathon 2010 IntroBrad Chapman
 
2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurgh2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurghJun Zhao
 
Using Neo4j for exploring the research graph connections made by RD-Switchboard
Using Neo4j for exploring the research graph connections made by RD-SwitchboardUsing Neo4j for exploring the research graph connections made by RD-Switchboard
Using Neo4j for exploring the research graph connections made by RD-Switchboardamiraryani
 
Science Commons Open Notebook Science Talk
Science Commons Open Notebook Science TalkScience Commons Open Notebook Science Talk
Science Commons Open Notebook Science TalkJean-Claude Bradley
 
Research Objects: more than the sum of the parts
Research Objects: more than the sum of the partsResearch Objects: more than the sum of the parts
Research Objects: more than the sum of the partsCarole Goble
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Alejandra Gonzalez-Beltran
 
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014Dag Endresen
 

Was ist angesagt? (20)

Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
 
Peer Review and Science2.0
Peer Review and Science2.0Peer Review and Science2.0
Peer Review and Science2.0
 
What is DataCite-screenshots
What is DataCite-screenshotsWhat is DataCite-screenshots
What is DataCite-screenshots
 
MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage data
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth Science
 
Proposal for Text Mining PubAg
Proposal for Text Mining PubAgProposal for Text Mining PubAg
Proposal for Text Mining PubAg
 
Columbia ONS Archiving May09
Columbia ONS Archiving May09Columbia ONS Archiving May09
Columbia ONS Archiving May09
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
 
FAIRy Stories
FAIRy StoriesFAIRy Stories
FAIRy Stories
 
Workshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningWorkshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data mining
 
BioHackathon 2010 Intro
BioHackathon 2010 IntroBioHackathon 2010 Intro
BioHackathon 2010 Intro
 
2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurgh2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurgh
 
UPennONS
UPennONSUPennONS
UPennONS
 
The Chemtools LaBLog
The Chemtools LaBLogThe Chemtools LaBLog
The Chemtools LaBLog
 
Using Neo4j for exploring the research graph connections made by RD-Switchboard
Using Neo4j for exploring the research graph connections made by RD-SwitchboardUsing Neo4j for exploring the research graph connections made by RD-Switchboard
Using Neo4j for exploring the research graph connections made by RD-Switchboard
 
Science Commons Open Notebook Science Talk
Science Commons Open Notebook Science TalkScience Commons Open Notebook Science Talk
Science Commons Open Notebook Science Talk
 
Research Objects: more than the sum of the parts
Research Objects: more than the sum of the partsResearch Objects: more than the sum of the parts
Research Objects: more than the sum of the parts
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
 
NISO May 8 Webinar: An astronomy library's quest for greater access to litera...
NISO May 8 Webinar: An astronomy library's quest for greater access to litera...NISO May 8 Webinar: An astronomy library's quest for greater access to litera...
NISO May 8 Webinar: An astronomy library's quest for greater access to litera...
 
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
 

Andere mochten auch

Ab initio temperature phonons group theory
Ab initio temperature phonons group theoryAb initio temperature phonons group theory
Ab initio temperature phonons group theorySergey Sozykin
 
Electrochemistry perovskites defects
Electrochemistry perovskites defectsElectrochemistry perovskites defects
Electrochemistry perovskites defectsSergey Sozykin
 
Application of Al alloys
Application of Al alloysApplication of Al alloys
Application of Al alloysSergey Sozykin
 
Misfit layered compounds PbTa2
Misfit layered compounds PbTa2Misfit layered compounds PbTa2
Misfit layered compounds PbTa2Sergey Sozykin
 

Andere mochten auch (6)

Vaulin pohang 2010
Vaulin pohang 2010Vaulin pohang 2010
Vaulin pohang 2010
 
Ab initio temperature phonons group theory
Ab initio temperature phonons group theoryAb initio temperature phonons group theory
Ab initio temperature phonons group theory
 
Binary sigma phases
Binary sigma phasesBinary sigma phases
Binary sigma phases
 
Electrochemistry perovskites defects
Electrochemistry perovskites defectsElectrochemistry perovskites defects
Electrochemistry perovskites defects
 
Application of Al alloys
Application of Al alloysApplication of Al alloys
Application of Al alloys
 
Misfit layered compounds PbTa2
Misfit layered compounds PbTa2Misfit layered compounds PbTa2
Misfit layered compounds PbTa2
 

Ähnlich wie Materials informatics

UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...UKSG: connecting the knowledge community
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureRoss Mounce
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynoteCarole Goble
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliersaimsnist
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open scienceSarah Jones
 
NITLE Open Notebook Science Talk
NITLE Open Notebook Science TalkNITLE Open Notebook Science Talk
NITLE Open Notebook Science TalkJean-Claude Bradley
 
'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versa'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versaNathan Shammah
 
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Ross Mounce
 
re3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositoriesre3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data RepositoriesHeinz Pampel
 
Presentation for agINFRA Hackathon in Athens 12th December 2013
Presentation for agINFRA Hackathon in Athens 12th December 2013Presentation for agINFRA Hackathon in Athens 12th December 2013
Presentation for agINFRA Hackathon in Athens 12th December 2013Jane Bromley
 
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenForschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenHeinz Pampel
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals FederationManjulaPatel
 
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemHerbert Van de Sompel
 
An Open Context for Archaeology
An Open Context for ArchaeologyAn Open Context for Archaeology
An Open Context for Archaeologyguest756e05
 
Blogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart LabsBlogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart LabsJeremy Frey
 

Ähnlich wie Materials informatics (20)

UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | Future
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
 
NITLE Open Notebook Science Talk
NITLE Open Notebook Science TalkNITLE Open Notebook Science Talk
NITLE Open Notebook Science Talk
 
'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versa'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versa
 
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
 
re3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositoriesre3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositories
 
OpenSciNY Open Notebook Science
OpenSciNY Open Notebook ScienceOpenSciNY Open Notebook Science
OpenSciNY Open Notebook Science
 
Presentation for agINFRA Hackathon in Athens 12th December 2013
Presentation for agINFRA Hackathon in Athens 12th December 2013Presentation for agINFRA Hackathon in Athens 12th December 2013
Presentation for agINFRA Hackathon in Athens 12th December 2013
 
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenForschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
 
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication System
 
Reproducible Research and the Cloud
Reproducible Research and the CloudReproducible Research and the Cloud
Reproducible Research and the Cloud
 
An Open Context for Archaeology
An Open Context for ArchaeologyAn Open Context for Archaeology
An Open Context for Archaeology
 
Blogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart LabsBlogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart Labs
 
Open science platforms
Open science platformsOpen science platforms
Open science platforms
 

Mehr von Sergey Sozykin

Susu seminar summer_2012
Susu seminar summer_2012Susu seminar summer_2012
Susu seminar summer_2012Sergey Sozykin
 
лекция 5 graphen
лекция 5 graphenлекция 5 graphen
лекция 5 graphenSergey Sozykin
 
лекция 3 дефекты в полупроводниках ga n alsb
лекция 3 дефекты в полупроводниках ga n alsbлекция 3 дефекты в полупроводниках ga n alsb
лекция 3 дефекты в полупроводниках ga n alsbSergey Sozykin
 
лекция 2 атомные смещения в бинарных сплавах
лекция 2 атомные смещения в бинарных сплавах лекция 2 атомные смещения в бинарных сплавах
лекция 2 атомные смещения в бинарных сплавах Sergey Sozykin
 
лекция 5 memristor
лекция 5 memristorлекция 5 memristor
лекция 5 memristorSergey Sozykin
 
лекция 1 обзор методов вычислительной физики
лекция 1 обзор методов вычислительной физикилекция 1 обзор методов вычислительной физики
лекция 1 обзор методов вычислительной физикиSergey Sozykin
 

Mehr von Sergey Sozykin (6)

Susu seminar summer_2012
Susu seminar summer_2012Susu seminar summer_2012
Susu seminar summer_2012
 
лекция 5 graphen
лекция 5 graphenлекция 5 graphen
лекция 5 graphen
 
лекция 3 дефекты в полупроводниках ga n alsb
лекция 3 дефекты в полупроводниках ga n alsbлекция 3 дефекты в полупроводниках ga n alsb
лекция 3 дефекты в полупроводниках ga n alsb
 
лекция 2 атомные смещения в бинарных сплавах
лекция 2 атомные смещения в бинарных сплавах лекция 2 атомные смещения в бинарных сплавах
лекция 2 атомные смещения в бинарных сплавах
 
лекция 5 memristor
лекция 5 memristorлекция 5 memristor
лекция 5 memristor
 
лекция 1 обзор методов вычислительной физики
лекция 1 обзор методов вычислительной физикилекция 1 обзор методов вычислительной физики
лекция 1 обзор методов вычислительной физики
 

Kürzlich hochgeladen

Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 

Kürzlich hochgeladen (20)

Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

Materials informatics

  • 1. Evgeny Blokhin Chelyabinsk SUSU’2013 summer workshop Max-Planck Institute for Solid State Research Stuttgart, Germany Materials informatics
  • 2. Outlook 1. Data-mining in materials science 2. Blue Obelisk 3. Python programming language
  • 3. What is data-mining? statistics databases information theory machine learning artificial intelligence optimization Data mining
  • 4. Tasks of data-mining 1. Classification 2. Prognosing 3. Visualization 4. Reasoning 5. Analysis 6. Expert systems
  • 5. Big data in materials science EXAMPLE: nearly for the last 4 years with my colleagues-theoreticians we produced: over 9000 simulation output files over 50 articles
  • 6.
  • 7. 1. Accelrys Pipeline Pilot and Materials Studio, http://accelrys.com/products 2. AFLOW framework and Aflowlib.org repository, http://www.aflowlib.org 3. AIDA, Bosch LLC 4. Blue Obelisk Data Repository (XSLT, XML), http://bodr.sourceforge.net 5. CCLib (Python), http://cclib.sf.net 6. CDF (Python), http://kitchingroup.cheme.cmu.edu/cdf 7. CMR (Python), https://wiki.fysik.dtu.dk/cmr 8. Comp. Chem. Comparison and Benchmark Database, http://cccbdb.nist.gov 9. cctbx: Computational Crystallography Toolbox, http://cctbx.sourceforge.net 10. ESTEST (Python, XQuery), http://estest.ucdavis.edu 11. J-ICE online viewer (based on Jmol, Java), http://j-ice.sourceforge.net 12. Materials Project (Python), http://www.materialsproject.org 13. PAULING FILE world largest database for inorganic compounds, http://paulingfile.com 14. Quixote, http://quixote.wikispot.org 15. Scipio (Java), https://scipio.iciq.es 16. WebMO: Web-based interface to computational chemistry packages (Java, Perl), http://webmo.net New type of modeling software
  • 8. …and smart codes ENCUT = 500 IBRION = 2 ISIF = 3 NSW = 20 IDIOT = 3 NELMIN = 5 EDIFF = 1.0e-08 EDIFFG = -1.0e-08 IALGO = 38 ISMEAR = 0 LREAL = .FALSE. LWAVE = .FALSE. *** VASP MASTER: I AM SURE YOU KNOW WHAT YOU ARE DOING ***
  • 9. d-metal oxides band gap problem standard DFT GGA approach Hartree-Fock admixing LCAO approximation Usage of Gaussian basis sets good atomization energy Example of inference over an ontology
  • 10.
  • 11. Open data, open standards, open source in chemistry
  • 12. Open data, open standards, open source in chemistry 1.Elsevier, Wiley, Springer publishers are “evil” 2.“The right to read is right to mine” 3.“Jailbreaking” the scientific data from PDFs: access, reuse, integrity 4.Why the level of collaboration is so low?
  • 13. Materials Project Prof. G. Ceder, MIT, Boston
  • 14. Guido van Rossum, Google, Dropbox http://goo.gl/FtFS7h Python programming language
  • 15. Advantages of Python Syntax: tabulation, syntactic sugar, speech- like, flexibility, expression VERY fast prototyping Great popularity in scientific community 100% cross-platform and portable
  • 16. Disadvantages of Python Relatively slow speed comparing to compiled languages like C++ or Fortran Global Interpreter Lock (GIL) Historically not popular in some narrow scientific areas (“reigns” of Java)
  • 17. Two examples list = [x**2 for x in range(10)] numbers = [10, 4, 2, -1, 6] filter(lambda x: x < 5, numbers)
  • 18. 1. Multi-dimensional array manipulation (fast!) 2. Discrete fourier transform 3. Linear Algebra 4. Mathematical functions 5. Matrix library 6. Polynomials 7. Set routines 8. Sorting, searching and counting 9. Statistics
  • 19. eigvals, eigvecs = numpy.linalg.eigh(dynmat) Solving eigenvalue problem for a dynamical matrix (phonopy code):