Citrine Informatics
The data analytics platform for the physical world
Citrination Tutorial
Eric Lundberg
Dec. 2017
Outline
Objective: Familiarize new users with the key functionality of the Citrination platform
• Search
• Upload Data
• Create and Apply ML Models:
  • Create a Data View
  • Plot data
  • Predict properties of unknown materials
  • Design materials to meet parameters
  • Assess model quality
NOTE: Your experience may vary because each Citrination site is different, and we update our platform regularly.
• All example images are from the base Citrination.com website. Each private Citrination site may look different.
• We add functionality, fix bugs, and update the user interface frequently. Revisit the Citrination website for an updated version of these instructions.
• If the site doesn’t seem to work or you see an error message, closely check that you followed these instructions and contact us at training@citrine.io.
E. Lundberg, elundberg@citrine.io
Search Function
• What is it? Explore the world’s largest database of materials and chemicals information, including: polymers, alloys, semiconductors, and many more. This database is broken up into material cards, where the properties of a known material are consolidated into a single view.
• How does this help you? Want to know the properties of a material? Want to find materials with certain properties? Use Advanced Search to apply more filters to narrow down your options.
• How it works: You can search in 2 ways: all datasets or a specific dataset. Your search will return a number of materials cards that meet the requirements you set. Click on one to explore more!
Search – All Datasets (1/3)
Explore materials data across all available datasets with the Search tab.
1. Click Citrination’s Search tab
2. Click “Advanced Search Options”
3. Type material OR property of interest
4. Set constraints
5. See number of results returned
6. Click on a material card under “results”
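The constrained search in steps 3 to 5 can be pictured as filtering records by a property name and a value range. The following is a minimal sketch in plain Python, assuming an in-memory list of records. The `MaterialRecord` class, the `search` helper, and the sample values are hypothetical illustrations and are not part of the Citrination site or any Citrination API.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class MaterialRecord:
    """Hypothetical stand-in for one material card."""
    formula: str
    prop: str
    value: float


def search(records: List[MaterialRecord], prop: str,
           minimum: Optional[float] = None,
           maximum: Optional[float] = None) -> List[MaterialRecord]:
    """Return records for `prop` whose value lies within [minimum, maximum]."""
    hits = []
    for record in records:
        if record.prop != prop:
            continue
        if minimum is not None and record.value < minimum:
            continue
        if maximum is not None and record.value > maximum:
            continue
        hits.append(record)
    return hits


# Placeholder values for illustration only, not taken from any dataset.
RECORDS = [
    MaterialRecord("A", "Band gap", 1.1),
    MaterialRecord("B", "Band gap", 3.4),
    MaterialRecord("C", "Density", 2.7),
]

results = search(RECORDS, "Band gap", minimum=1.0, maximum=2.0)
print(len(results), [r.formula for r in results])
```

Tightening or loosening `minimum` and `maximum` plays the same role as adjusting the constraints in step 4 and watching the result count in step 5 change.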
Search – Specific Dataset (2/3)
Learn more about a specific research project’s collection of records with the Search-Dataset tab.
1. Click Datasets tab
2. Choose type of dataset
   • Public: Visible to everyone on the site
   • Private: Uploaded by you
   • Shared: Shared with you by others
   • Purchased: Purchased by your organization
3. Click on dataset
4. Type material name or property of interest
5. See results
6. Click on a material card
Search – Material Card (3/3)
Once you click on a search result, you can see this card. It displays the properties of a known material.
1. Chemical formula/composition
2. Type of data (e.g. property, composition, preparation, method, or references)
3. Data
See http://help.citrination.com/ for more information on what can go into a material card or what types of data Citrination recognizes.
Upload Data
• What is it? This feature allows you to upload data (or any type of file) to the Citrination platform, organize it, and keep it private or share it with the rest of the users on the site. If you have a private Citrination site, that information is kept secure.
• How does this help you? Uploading your data to our site makes it searchable, shareable, and accessible for machine learning (ML).
• How it works: The Citrination platform will turn your data into a series of materials cards. We have dozens of ingesters to upload structured data files (e.g. CSV files, XRD files, and VASP DFT files).
Help on Ingesters
Learn how to upload your data so Citrination can process it. One common data format is the CSV Template. See the help.citrination.com page for more details.
1. Open Citrination help site (help.citrination.com)
2. Browse or search key words for a concept
3. Template CSV page
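Before uploading, it can help to generate the CSV file programmatically so that every row has the same columns. The sketch below uses Python's standard `csv` module. The column headers in `FIELDNAMES` and the sample row are hypothetical placeholders; the actual Template CSV layout is the one documented on help.citrination.com and should be used instead.

```python
import csv
import io

# Hypothetical column headers for illustration only; replace them with the
# headers required by the Template CSV page on help.citrination.com.
FIELDNAMES = ["formula", "property_name", "property_value", "property_units"]


def rows_to_csv(rows):
    """Serialize a list of dicts to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDNAMES, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Placeholder row, not real measurement data.
rows = [
    {"formula": "A2B", "property_name": "Density",
     "property_value": 1.0, "property_units": "g/cm^3"},
]
csv_text = rows_to_csv(rows)
print(csv_text)
```

`csv.DictWriter` raises a `ValueError` if a row contains a key that is not in `FIELDNAMES`, which catches misspelled column names before the file is uploaded.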
Add Data (1/2)
Upload your data to the secure Citrination site with the Add Data tab. You can create a new dataset or update an existing dataset (e.g. after experimentation).
Any file can be uploaded, but only some formats are parsed into materials records.
1. Click Add Data tab
2. Select new/existing dataset
3. Type Title (each dataset for a given user must have a unique title)
4. Type Description
5. Choose appropriate ingester (important)
6. Upload the file
7. Submit!
Add Data (2/2)
8. Review uploaded file
9. Refresh to see the file’s progress (may take up to 10 minutes):
   a. Initializing
   b. Processing
   c. Finished or Failed
10. Check the log file if the data upload fails
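The refresh-and-check cycle in step 9 is a polling loop over the three states listed above. A minimal sketch of that loop follows. The `wait_for_ingest` helper and the `get_status` callable are hypothetical: the tutorial only describes refreshing the page by hand, and nothing here is a Citrination API call. The 600-second default mirrors the 10-minute figure from the slide.

```python
import time
from typing import Callable

TERMINAL_STATES = {"Finished", "Failed"}


def wait_for_ingest(get_status: Callable[[], str],
                    poll_seconds: float = 30.0,
                    timeout_seconds: float = 600.0,
                    sleep: Callable[[float], None] = time.sleep) -> str:
    """Poll `get_status` until it reports Finished or Failed, or time runs out."""
    waited = 0.0
    while True:
        status = get_status()
        if status in TERMINAL_STATES:
            return status
        if waited >= timeout_seconds:
            raise TimeoutError(f"still {status!r} after {waited:.0f} s")
        sleep(poll_seconds)
        waited += poll_seconds


# Simulated status sequence standing in for repeated page refreshes.
_states = iter(["Initializing", "Processing", "Processing", "Finished"])
final = wait_for_ingest(lambda: next(_states), sleep=lambda seconds: None)
print(final)
```

A result of "Failed" is the cue to open the log file mentioned in step 10.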
Share Dataset
If you’re a public Citrination user, this shares your data with everyone.
If you’re a private Citrination user (you have your own site), only Citrine employees and users at your company can view it.
The process for sharing Data Views is similar: just click “Access”.
1. Click Access tab
2. Review current status
3. Click to Share
You can also share with Groups (short-term, can be added by your members) or Teams (long-term, managed by Citrine).
Create Models: Data Views
• What is it? This feature allows you to use Citrination’s machine learning (ML) software to visualize and model data. You can predict the properties of untested materials or design new materials that meet your specifications. It also gives you a report on the ML model quality.
• How does this help you?
  • Use predict to assess material candidates of interest to you
  • Use design to suggest promising new candidates based on your target specifications
• How it works: The Citrination platform will build AI models to identify trends and make predictions.
Data Views – Create (1/4)
Analyze your data in the Citrination platform by creating a Data View.
1. Click Data Views tab
2. Click Create New Data View
Next, identify which datasets to use to train your model.
3. Search based on properties contained in the dataset of interest
4. Select 1 or more dataset(s)
5. Click Next
Data Views – Create (2/4)
6. Search/click properties you want to include as inputs or outputs to the Machine Learning (ML) model
7. Selected properties
8. Click Next
Data Views – Create (3/4)
9. Type Data View Name
10. Type Description
11. Click Save
12. Click to Configure ML (once ML is configured, Citrination can analyze the data and start to make predictions)
Data Views – Create (4/4)
13. Click to view instructions
14. Check data closely for accuracy
    • Column name should be the property
    • Descriptor type should be the property’s format: formula, composition, real, or categorical
    • Parameter type is the type of variable (inputs are controllable degrees of freedom, outputs are target properties)
    • Value(s) are the valid values for the property
15. Click Edit (if inaccurate)
16. Select material/variable types/value range as required
17. Click Okay
18. Click Save (scroll up to find it); this (re)trains your machine learning model
19. Click Search to watch the view train
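The accuracy check in step 14 amounts to validating four fields per column. The sketch below writes that checklist as code. The descriptor types come from the list above; everything else is an assumption made for illustration: the `ColumnConfig` class, the "input"/"output" labels, and the rules that a real column carries a (low, high) range and a categorical column lists its categories. It does not reflect how Citrination stores a Data View.

```python
from dataclasses import dataclass
from typing import List, Sequence

DESCRIPTOR_TYPES = {"formula", "composition", "real", "categorical"}
PARAMETER_TYPES = {"input", "output"}


@dataclass
class ColumnConfig:
    """Hypothetical record of one column's settings in a Data View."""
    name: str
    descriptor_type: str
    parameter_type: str
    values: Sequence = ()


def check_column(column: ColumnConfig) -> List[str]:
    """Return readable problems; an empty list means the column looks consistent."""
    problems = []
    if column.descriptor_type not in DESCRIPTOR_TYPES:
        problems.append(f"{column.name}: unknown descriptor type {column.descriptor_type!r}")
    if column.parameter_type not in PARAMETER_TYPES:
        problems.append(f"{column.name}: unknown parameter type {column.parameter_type!r}")
    # Assumption: a real-valued column carries a (low, high) range.
    if column.descriptor_type == "real":
        if len(column.values) != 2 or column.values[0] > column.values[1]:
            problems.append(f"{column.name}: expected a (low, high) range")
    # Assumption: a categorical column lists its allowed categories.
    if column.descriptor_type == "categorical" and not column.values:
        problems.append(f"{column.name}: expected at least one category")
    return problems


# Placeholder columns for illustration only.
columns = [
    ColumnConfig("Chemical formula", "formula", "input"),
    ColumnConfig("Hardness", "real", "output", (0.0, 100.0)),
    ColumnConfig("Process", "categorical", "input"),
]
for column in columns:
    print(column.name, check_column(column))
```

Any column that returns a non-empty list corresponds to a row you would fix with the Edit button in step 15.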
Data Views – Summary
Review the important information on the Summary tab once you configure machine learning.
1. Data View – Summary tab
2. Summary information
Data Views – Populate (1/2)
Fill in your data set’s gaps with predicted values and uncertainty with the Populate button.
1. Click Data Views-Search tab
2. Training and Testing Models
Citrination will begin training the models, and will display a purple "Training and Testing Data" button during this process. When the progress bars go away and the purple button is replaced by "Populate with Data", proceed to the next step.
Wait times are highly variable based on the amount and complexity of data. Most views train in 2-60 minutes. If it takes more than 12 hours, please contact Citrine.
Data Views – Populate (2/2)
3. Click Populate with Data when it becomes visible
4. Predicted Values from the model are in GREEN with uncertainty
5. Recorded Values from your data source are in BLACK
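The distinction between recorded and predicted values can be sketched as a gap-filling pass over a table. In the example below, cells that already hold a value are kept and tagged as recorded, and empty cells are filled by a model and tagged as predicted together with an uncertainty. The `populate` helper, the `toy_model` function, and the sample rows are all hypothetical; the real predictions come from the trained Data View, not from this code.

```python
from typing import Callable, Dict, List, Optional, Tuple

Row = Dict[str, Optional[float]]


def populate(rows: List[Row], column: str,
             predict: Callable[[Row], Tuple[float, float]]) -> List[dict]:
    """Keep recorded values; fill missing ones with (prediction, uncertainty)."""
    filled = []
    for row in rows:
        recorded = row.get(column)
        if recorded is not None:
            filled.append({"value": recorded, "uncertainty": None, "source": "recorded"})
        else:
            value, sigma = predict(row)
            filled.append({"value": value, "uncertainty": sigma, "source": "predicted"})
    return filled


def toy_model(row: Row) -> Tuple[float, float]:
    """Stand-in predictor: doubles the input and reports a fixed uncertainty."""
    return 2.0 * row["x"], 0.5


# Placeholder rows; None marks a gap in the dataset.
rows = [{"x": 1.0, "y": 2.1}, {"x": 3.0, "y": None}]
table = populate(rows, "y", toy_model)
for cell in table:
    print(cell)
```

The `source` field plays the role of the color coding on the site: "predicted" cells correspond to the green values and "recorded" cells to the black ones.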
Data Views – Export
Manipulate and view the data in Excel by using the Export function.
1. Click Data View-Search tab
2. Click Export
3. Click Confirm
4. Check email
5. Click Link
6. Download & Open File
Data Views – Material Cards
View the properties of a material in the data view from the Data Views-Search tab.
1. Click Data Views – Search tab
2. Click General
3. View material card
Data Views – Plots
Use the Plots tab to visualize the data on a variety of plot types.
1. Click Data Views/Search tab
2. Click Plots
3. Select Plot Type
4. Select Point Hover values (if you put your cursor over the point, you see this data)
5. Type # responses
6. Select X Axis property
7. Select Y Axis property
8. Click Generate Plot
Data Views – Predict
Use the Predict tab to predict the properties of a specific input (e.g. chemical formula).
1. Click Data Views – Predict tab
2. Type your input
3. Click Predict
4. Numerical prediction with uncertainty range of ± 1 standard deviation
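To make the "± 1 standard deviation" range in step 4 concrete, the sketch below turns a set of individual model predictions into a central value and an interval. Treating the uncertainty as the spread of several predictions is an assumption chosen for illustration; the slides do not say how Citrination computes its uncertainty. The `summarize` helper and the sample numbers are hypothetical.

```python
import statistics
from typing import Sequence, Tuple


def summarize(predictions: Sequence[float]) -> Tuple[float, float, Tuple[float, float]]:
    """Return the mean, the sample standard deviation, and the mean ± 1 sigma range."""
    mean = statistics.mean(predictions)
    sigma = statistics.stdev(predictions)
    return mean, sigma, (mean - sigma, mean + sigma)


# Placeholder predictions, e.g. from several models trained on the same data.
mean, sigma, (low, high) = summarize([1.0, 2.0, 3.0])
print(f"{mean:.2f} ± {sigma:.2f}  (range {low:.2f} to {high:.2f})")
```

A wide range relative to the predicted value is a sign that the model has seen little training data near that input.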
Data Views – Design (1/2)
Use the Design tab to generate candidate materials based on targets and constraints.
1. Click Data Views – Design tab
2. Select Maximum time for computer to explore options
3. Select Number of Candidates to return
4. Select space over which design will search for promising candidates
5. Select Optimized property and target
6. Select Constraints on the target properties
7. Click Run
(Scroll down as needed.)
Data Views – Design (2/2)
8. Click Export to CSV to download results
9. Best Materials (short term success)
10. Suggested Experiments (long term success)
11. Click Save Design Results to save these candidates or Export to CSV to download results (Citrination deletes unsaved design results when you leave the page)
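One way to picture the two result lists is as two rankings of the same feasible candidates. The sketch below filters candidates by a constraint, then ranks them once by predicted value alone and once by predicted value plus uncertainty. This interpretation is an assumption made for illustration: the slides do not describe how Citrination selects Best Materials or Suggested Experiments, and the `Candidate` class, the `design` helper, the `cost` constraint, and the sample numbers are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Candidate:
    name: str
    predicted: float  # predicted value of the property being maximized
    sigma: float      # model uncertainty (one standard deviation)
    cost: float       # hypothetical constrained property


def design(candidates: List[Candidate], max_cost: float,
           n: int) -> Tuple[List[Candidate], List[Candidate]]:
    """Return (top n by prediction, top n by prediction plus uncertainty)."""
    feasible = [c for c in candidates if c.cost <= max_cost]
    best = sorted(feasible, key=lambda c: c.predicted, reverse=True)[:n]
    explore = sorted(feasible, key=lambda c: c.predicted + c.sigma, reverse=True)[:n]
    return best, explore


# Placeholder candidates, not real design results.
CANDIDATES = [
    Candidate("A", predicted=10.0, sigma=0.5, cost=1.0),
    Candidate("B", predicted=9.0, sigma=3.0, cost=1.0),
    Candidate("C", predicted=12.0, sigma=0.1, cost=5.0),
    Candidate("D", predicted=5.0, sigma=1.0, cost=1.0),
]

best, explore = design(CANDIDATES, max_cost=2.0, n=1)
print([c.name for c in best], [c.name for c in explore])
```

In this toy data, candidate C has the highest prediction but violates the constraint, A is the safest feasible pick, and B is the feasible candidate whose high uncertainty leaves the most room for a better-than-predicted result.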
Data Views – Reports
You can use the Reports tab to understand your model quality.
1. Click Data Views-Reports tab
2. Click Model Report tab
3. Review ML settings
4. Review Features and their impact on the model
5. Review ML Model performance
6. Review predicted vs actual plot
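A predicted vs actual plot is often summarized with a few numbers. The sketch below computes two common ones, root-mean-square error (RMSE) and the coefficient of determination (R²), from paired values. These metrics are assumptions chosen for illustration; the slides do not state which statistics the Model Report shows. The sample values are placeholders.

```python
import math
from typing import Sequence


def rmse(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Root-mean-square error between paired actual and predicted values."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def r_squared(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Coefficient of determination: 1 - (residual sum of squares / total sum of squares)."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


# Placeholder values for illustration only.
actual = [1.0, 2.0, 3.0, 4.0]
predicted = [1.0, 2.0, 3.0, 5.0]
print(f"RMSE = {rmse(actual, predicted):.3f}, R^2 = {r_squared(actual, predicted):.3f}")
```

Points lying on the diagonal of a predicted vs actual plot correspond to an RMSE of 0 and an R² of 1; the further the points scatter from the diagonal, the larger the RMSE and the lower the R².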
Conclusion
We’ve learned how to:
• Search
• Upload Data
• Use Data Views to…
  • Create a view
  • Plot data
  • Predict properties of unknown materials
  • Design materials to meet parameters
  • Assess model quality
elundberg@citrine.io
Thank you
Appendix A: Zoom Setup
1. Go to https://zoom.us
2. Sign up for an account
3. Start your test meeting
4. Test audio – select your speaker and mic and make sure that your sound works
5. Check “Automatically join audio by computer when joining a meeting”
6. Test video – make sure you can see yourself
Appendix A: Join Zoom Meeting
Follow the invitation link from your Citrine point of contact.
1. Click Join Audio (if required)
2. Click Join Audio Conference by Computer (if required)
3. Click Share Screen
4. Click your browser (or desktop if you don't see it)
5. Click Share Screen
6. Go to the Citrination site and make sure you see the GREEN and RED boxes at the top of your screen (if these aren't visible, make sure a Citrine member can view your screen)
