SlideShare a Scribd company logo
1 of 28
Download to read offline
Programming
Languages for
Biological Modeling
Jean Yang
Carnegie Mellon University
WeCode, February 4, 2017
When People Think of
Programming Languages…
Do you study PHP? JavaScript?
What language
should I learn?
I’m not a programmer,
so your work probably
doesn’t apply to me.
What Programming Languages
Research Is Really About
Programming languages provide
logical frameworks for reasoning
about complex systems.
• Reasoning about software systems
helps us make them faster and more
correct.
• Reasoning about other systems can
help us with understanding.
Complex System of Interest:
Cellular Signaling
• Important:
Understanding
cell signaling
would help with
curing disease!
• Difficult:
Existing models
are difficult to
scale and
understand.
State of the Art Modelling
Approaches
Systems of
differential
equations
Boolean networks
and Petri nets
Problems with Existing
Approaches
• Limited scaling. With differential
equations, for instance, need a different
equation for each interaction.
• Need the whole picture. People don’t
just sit down and tweak differential
equations until they are right.
• Gap between model and intuition.
Equations look nothing like what is in
biologists’ heads. This is an obstacle in
creating models!
Insight! Surprising Connection
between Logic and Chemistry
Programming
language semantics
Chemical reaction
semantics
+  @ (0.8)
+  @ (0.5)
Solution: Model Mechanism
Using Rule-Based Programs
Mechanistic Kappa rules
+  @ (0.2)
Initial mixture One step Final state
Kappa Advantages
• Rules are patterns. Model does not need
to enumerate all possible interactions, and
we can build up the model piece by piece.
• Programs have structure. Can use
structure to analyze programs for
properties such as reachability, and for
reducing models.
• Programs have precise semantics.
Can get sound guarantees from model
analyses.
Two Ways to “Run” Kappa
Programs
Simulation to
approximate
differential semantics
Analyzing rule
structure via program
analysis
Continuous-time
stochastic Monte Carlo
simulation allows
investigation of runtime
behavior.
Determine properties
such as reachability, rule
symmetries, and model
coarse-graining.
Problems: Kappa Models
Difficult to Generate
'EGF.EGFR' EGF(r), EGFR(L,CR) -> EGF(r!1), EGFR(L!1,CR) @ 'k_on'
'EGF/EGFR' EGF(r!1), EGFR(L!1,CR) -> EGF(r), EGFR(L,CR) @ 'k_off'
'Shc.Grb2' Shc(Y~p), Grb2(SH2) -> Shc(Y~p!1), Grb2(SH2!1) @ 5*'k_on'
'Shc/Grb2' Shc(Y~p!1), Grb2(SH2!1) -> Shc(Y~p), Grb2(SH2) @ 'k_off'
'EGFR.Grb2' EGFR(Y1092~p), Grb2(SH2) -> EGFR(Y1092~p!1), Grb2(SH2!1) @ 'k_on'
'EGFR/Grb2' EGFR(Y1092~p!1), Grb2(SH2!1) -> EGFR(Y1092~p), Grb2(SH2) @ 'k_off'
'EGFR.Shc' EGFR(Y1172~p), Shc(PTB) -> EGFR(Y1172~p!1), Shc(PTB!1) @ 'k_on'
'EGFR/Shc' EGFR(Y1172~p!1), Shc(PTB!1) -> EGFR(Y1172~p), Shc(PTB) @ 'k_off'
'Grb2.SoS' Grb2(SH3n), SoS(PR,S~u) -> Grb2(SH3n!1), SoS(PR!1,S~u) @ 'k_on'
'Grb2/SoS' Grb2(SH3n!1), SoS(PR!1) -> Grb2(SH3n), SoS(PR) @ 'k_off'
'EGFR.int' EGFR(CR!1,N,C), EGFR(CR!1,N,C) -> EGFR(CR!1,N!2,C), EGFR(CR!1,N,C!2) @ 'k_on'
'EGFR/int' EGFR(CR!1,N!2,C), EGFR(CR!1,N,C!2) -> EGFR(CR!1,N,C), EGFR(CR!1,N,C) @ 'k_off'
'pY1092@EGFR' EGFR(N!1), EGFR(C!1,Y1092~u) -> EGFR(N!1), EGFR(C!1,Y1092~p) @ 'k_cat'
'pY1172@EGFR' EGFR(N!1), EGFR(C!1,Y1172~u) -> EGFR(N!1), EGFR(C!1,Y1172~p) @ 'k_cat'
'uY1092@EGFR' EGFR(Y1092~p) -> EGFR(Y1092~u) @ 'k_cat'
'uY1172@EGFR' EGFR(Y1172~p) -> EGFR(Y1172~u) @ 'k_cat'
From the SOS model
The Dream of Big Mechanism
+  @ (0.8)
+  @ (0.5)
+  @ (0.2)
︙
Requires precise
mechanistic
reasoning!
Obstacle: Natural Language is
Ambiguous
NLP
Hypothesis: Can use logical
deduction to navigate this space.
Scientific
literature
and
databases
Imprecision of
natural language
+  @ (0.8)
+  @ (0.5)
+  @ (0.2)
︙
Running Non-Biology
Example
Billy has a
sibling.
Billy has a
sister.
Billy’s Family Tree
Billy’s
father
Billy’s
mother
Billy
Billy’s
sibling
Billy’s
sister
Using Domain Facts to Clean
up Model
Billy’s Family Tree
Billy’s
father
Billy’s
mother
Billy
Billy’s
sister
Domain fact: A
sister is a kind
of sibling.
Specificity
Hierarchy
sibling
sister
Family
member type
Gendered
family member
type
Things Get Harder with
Implication
Billy’s Family Tree, if
we trust the neighbor
Billy’s
father
Billy’s
mother
Billy
Billy’s
sibling
Billy’s
sister
If we trust the
neighbor, Billy
has a sibling.
If we trust the
neighbor, Billy
has a sister.
Deducible Facts
Facts from
literature
∀𝑎. ℎ𝑎𝑠(𝑎, 𝑠𝑖𝑠𝑡𝑒𝑟) ⇒ ℎ𝑎𝑠(𝑎, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔)
𝑡𝑟𝑢𝑠𝑡 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 ⇒ ℎ𝑎𝑠(𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔)
Domain
fact
𝑡𝑟𝑢𝑠𝑡 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 ⇒ ℎ𝑎𝑠(𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔)
Deducible
fact
Fact 𝑓 is deducible (and thus redundant) if
¬𝑓 is inconsistent with previous facts.
What About Specificity and
Implication?
If we trust the
neighbor, Billy has a
sibling.
If we trust anybody,
Billy has a sibling.
Relative Fact Specificity as
Subtyping
Subtyping Atomic Statements
∀𝑎1, 𝑎2. 𝑔𝑒𝑛𝑑𝑒𝑟𝑒𝑑𝐹𝑎𝑚𝑖𝑙𝑦𝑀𝑒𝑚𝑏𝑒𝑟(𝑎1) <: 𝑓𝑎𝑚𝑖𝑙𝑦𝑀𝑒𝑚𝑏𝑒𝑟 𝑎2
Subtyping Implications
𝑝𝑠 ⇒ 𝑞 𝑠 <: 𝑝𝑡 ⇒ 𝑞𝑡
Subtyping symbol
𝑡𝑟𝑢𝑠𝑡 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 ⇒ ℎ𝑎𝑠 𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑠𝑡𝑒𝑟 <: 𝑡𝑟𝑢𝑠𝑡 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 ⇒ ℎ𝑎𝑠(𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔)
𝑡𝑟𝑢𝑠𝑡 𝑎𝑛𝑦𝑏𝑜𝑑𝑦 ⇒ ℎ𝑎𝑠 𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔 <: 𝑡𝑟𝑢𝑠𝑡 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 ⇒ ℎ𝑎𝑠(𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔)
Want 𝑝𝑠 ⇒ 𝑞 𝑠 to be
consistent with any
set of formulas 𝑝𝑡 ⇒ 𝑞𝑡
is consistent with.
𝑝𝑡 <: 𝑝𝑠
𝑝𝑡 ⊢ 𝑞 𝑠 <: 𝑞𝑡
Domain Facts in Biology
Phosphorylation
implies activity
A activates B
A phosphorylates B
Can determine
these are
redundant, where
“A phosphorylates
B” is the more
specific
statement.
Hierarchy of Specificity
If C or D are
present, A
phosphorylates B
If C or D are
present, A
activates B
If C is present, A
phosphorylates B
Translating Statements to
Graph Logic
A phosphorylates B
Phosphorylation
implies activity
A activates B
〈𝐺, 𝛼〉
Set of allowable graphs
Iota rules [Husson & Krivine]
Actions over graphs
preconditions postconditions
The Syndra Tool
JeanYang/September2016
Syndra
Domain
knowledge
NLP
output
Consistency
checking,
synthesis of
new
statements
using first-
order logic
Currently: “If LRP5/6 and Axin are in a
mixture, then it is consistent with
phosphorylation by CK1-alpha and GSK3-
beta.”
Reasoning Goals
JeanYang/September2016
Nuggets Processed
• LRP5/6 is phosphorylated by CK1-alpha and GSK3-
beta.
• When phosphorylated, LRP5/6 can bind Axin.
Biological Intuition to Check Against
If LRP5/6 is next to Axin, then it is likely to have been
phosphorylated by CK1-alpha and GSK3-beta.
Currently working on:
1. Ways to represent facts about underlying biochemistry.
2. Ways to represent causal relationships.
3. Layer of probabilistic reasoning.
Causality in Kappa
+  @ (0.5)
+  @ (0.2)
1
2
1
2
In Kappa, the rule structure allows us to
talk about causal relationships called
stories.
Ongoing work: story generation; relative
frequencies of different stories.
The Grand Plan
+  @ (0.8)
+  @ (0.5)
+  @ (0.2)
︙
Space of
possible models
Rule-based models
Experiments,
diagnoses,
and
discoveries
Analyses
involving rule
structures and
rates
Reasoning is Useful to
Domains Other Than Biology!
Other fun application domains:
• Interactive games and digital
humanities
• Strange hardware architectures
• 3D knitting
Main Takeaways
• Programming languages research is
not so much about programming as it
is reasoning in principled ways
about complex systems.
• The structure of rule-based models
allows us to ask questions we could not
ask before.
• Using a rule-based modelling language
with precise operational semantics
allows us to do all sorts of analyses.

More Related Content

Viewers also liked

Cybersecurity: How to Use What We Already Know
Cybersecurity: How to Use What We Already KnowCybersecurity: How to Use What We Already Know
Cybersecurity: How to Use What We Already Knowjxyz
 
Introduction to Business Analysis
Introduction to Business AnalysisIntroduction to Business Analysis
Introduction to Business AnalysisAMJAD SHAIKH
 
Business analyst interview questions and answers
Business analyst interview questions and answersBusiness analyst interview questions and answers
Business analyst interview questions and answersRobin G
 
Business Analyst Interview Questions with Answers
Business Analyst Interview Questions with AnswersBusiness Analyst Interview Questions with Answers
Business Analyst Interview Questions with AnswersMaria FutureThoughts
 
Business Analyst Training
Business  Analyst  TrainingBusiness  Analyst  Training
Business Analyst TrainingCraig Brown
 
85 business analyst interview questions and answers
85 business analyst interview questions and answers85 business analyst interview questions and answers
85 business analyst interview questions and answersBusinessAnalyst247
 
Business Analysis Techniques
Business Analysis TechniquesBusiness Analysis Techniques
Business Analysis TechniquesIIBA UK Chapter
 
Business Analysis Fundamentals
Business Analysis FundamentalsBusiness Analysis Fundamentals
Business Analysis Fundamentalswaelsaid75
 

Viewers also liked (10)

Cybersecurity: How to Use What We Already Know
Cybersecurity: How to Use What We Already KnowCybersecurity: How to Use What We Already Know
Cybersecurity: How to Use What We Already Know
 
Plc ppt
Plc pptPlc ppt
Plc ppt
 
Introduction to Business Analysis
Introduction to Business AnalysisIntroduction to Business Analysis
Introduction to Business Analysis
 
Business analyst interview questions and answers
Business analyst interview questions and answersBusiness analyst interview questions and answers
Business analyst interview questions and answers
 
Business analyst ppt
Business analyst pptBusiness analyst ppt
Business analyst ppt
 
Business Analyst Interview Questions with Answers
Business Analyst Interview Questions with AnswersBusiness Analyst Interview Questions with Answers
Business Analyst Interview Questions with Answers
 
Business Analyst Training
Business  Analyst  TrainingBusiness  Analyst  Training
Business Analyst Training
 
85 business analyst interview questions and answers
85 business analyst interview questions and answers85 business analyst interview questions and answers
85 business analyst interview questions and answers
 
Business Analysis Techniques
Business Analysis TechniquesBusiness Analysis Techniques
Business Analysis Techniques
 
Business Analysis Fundamentals
Business Analysis FundamentalsBusiness Analysis Fundamentals
Business Analysis Fundamentals
 

Similar to Programming Languages for Biological Modeling

Tweeting beyond Facts – The Need for a Linguistic Perspective
Tweeting beyond Facts – The Need for a Linguistic PerspectiveTweeting beyond Facts – The Need for a Linguistic Perspective
Tweeting beyond Facts – The Need for a Linguistic PerspectiveData Science Society
 
Formal languages to map Genotype to Phenotype in Natural Genomes
Formal languages to map Genotype to Phenotype in Natural GenomesFormal languages to map Genotype to Phenotype in Natural Genomes
Formal languages to map Genotype to Phenotype in Natural Genomesmadalladam
 
ACL読み会2014@PFI "Less Grammar, More Features"
ACL読み会2014@PFI "Less Grammar, More Features"ACL読み会2014@PFI "Less Grammar, More Features"
ACL読み会2014@PFI "Less Grammar, More Features"nozyh
 
Use of SNP-HapMaps in plant breeding
Use of SNP-HapMaps in plant breeding Use of SNP-HapMaps in plant breeding
Use of SNP-HapMaps in plant breeding Anilkumar C
 
Large-scale data and text mining - Linking proteins, chemicals, and side effects
Large-scale data and text mining - Linking proteins, chemicals, and side effectsLarge-scale data and text mining - Linking proteins, chemicals, and side effects
Large-scale data and text mining - Linking proteins, chemicals, and side effectsLars Juhl Jensen
 
The Language of the Gene Ontology
The Language of the Gene OntologyThe Language of the Gene Ontology
The Language of the Gene Ontologyrobertstevens65
 
Drosophila Three-Point Test Cross Lab Write-Up Instructions.docx
Drosophila Three-Point Test Cross Lab Write-Up Instructions.docxDrosophila Three-Point Test Cross Lab Write-Up Instructions.docx
Drosophila Three-Point Test Cross Lab Write-Up Instructions.docxharold7fisher61282
 
Bioinfomatics Presentation
Bioinfomatics PresentationBioinfomatics Presentation
Bioinfomatics PresentationZhenhong Bao
 
Publish Your Papers In The Top Scientific Journals
Publish Your Papers In The Top Scientific JournalsPublish Your Papers In The Top Scientific Journals
Publish Your Papers In The Top Scientific Journalsguestd02fab
 
Individual Brain Charting, a high-resolution fMRI dataset for cognitive mappi...
Individual Brain Charting, a high-resolution fMRI dataset for cognitive mappi...Individual Brain Charting, a high-resolution fMRI dataset for cognitive mappi...
Individual Brain Charting, a high-resolution fMRI dataset for cognitive mappi...Ana Luísa Pinho
 
Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2Leighton Pritchard
 
Rna lecture
Rna lectureRna lecture
Rna lecturenishulpu
 
Presentación plegable1
Presentación plegable1Presentación plegable1
Presentación plegable1Leslie M.
 
Presentación plegable 1
Presentación plegable 1Presentación plegable 1
Presentación plegable 1Leslie M.
 
Essay On Starvation Based Modulation
Essay On Starvation Based ModulationEssay On Starvation Based Modulation
Essay On Starvation Based ModulationTiffany Graham
 
Lecture 3 l dand_haplotypes_full
Lecture 3 l dand_haplotypes_fullLecture 3 l dand_haplotypes_full
Lecture 3 l dand_haplotypes_fullLekki Frazier-Wood
 
SHSARP paper final
SHSARP paper finalSHSARP paper final
SHSARP paper finalKaylee Racs
 

Similar to Programming Languages for Biological Modeling (20)

Tweeting beyond Facts – The Need for a Linguistic Perspective
Tweeting beyond Facts – The Need for a Linguistic PerspectiveTweeting beyond Facts – The Need for a Linguistic Perspective
Tweeting beyond Facts – The Need for a Linguistic Perspective
 
Formal languages to map Genotype to Phenotype in Natural Genomes
Formal languages to map Genotype to Phenotype in Natural GenomesFormal languages to map Genotype to Phenotype in Natural Genomes
Formal languages to map Genotype to Phenotype in Natural Genomes
 
ACL読み会2014@PFI "Less Grammar, More Features"
ACL読み会2014@PFI "Less Grammar, More Features"ACL読み会2014@PFI "Less Grammar, More Features"
ACL読み会2014@PFI "Less Grammar, More Features"
 
Use of SNP-HapMaps in plant breeding
Use of SNP-HapMaps in plant breeding Use of SNP-HapMaps in plant breeding
Use of SNP-HapMaps in plant breeding
 
Large-scale data and text mining - Linking proteins, chemicals, and side effects
Large-scale data and text mining - Linking proteins, chemicals, and side effectsLarge-scale data and text mining - Linking proteins, chemicals, and side effects
Large-scale data and text mining - Linking proteins, chemicals, and side effects
 
AI Lesson 17
AI Lesson 17AI Lesson 17
AI Lesson 17
 
The Language of the Gene Ontology
The Language of the Gene OntologyThe Language of the Gene Ontology
The Language of the Gene Ontology
 
Drosophila Three-Point Test Cross Lab Write-Up Instructions.docx
Drosophila Three-Point Test Cross Lab Write-Up Instructions.docxDrosophila Three-Point Test Cross Lab Write-Up Instructions.docx
Drosophila Three-Point Test Cross Lab Write-Up Instructions.docx
 
Bioinfomatics Presentation
Bioinfomatics PresentationBioinfomatics Presentation
Bioinfomatics Presentation
 
Thesis biobix
Thesis biobixThesis biobix
Thesis biobix
 
Publish Your Papers In The Top Scientific Journals
Publish Your Papers In The Top Scientific JournalsPublish Your Papers In The Top Scientific Journals
Publish Your Papers In The Top Scientific Journals
 
Individual Brain Charting, a high-resolution fMRI dataset for cognitive mappi...
Individual Brain Charting, a high-resolution fMRI dataset for cognitive mappi...Individual Brain Charting, a high-resolution fMRI dataset for cognitive mappi...
Individual Brain Charting, a high-resolution fMRI dataset for cognitive mappi...
 
Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2
 
Lucas...Cowell 2014
Lucas...Cowell 2014Lucas...Cowell 2014
Lucas...Cowell 2014
 
Rna lecture
Rna lectureRna lecture
Rna lecture
 
Presentación plegable1
Presentación plegable1Presentación plegable1
Presentación plegable1
 
Presentación plegable 1
Presentación plegable 1Presentación plegable 1
Presentación plegable 1
 
Essay On Starvation Based Modulation
Essay On Starvation Based ModulationEssay On Starvation Based Modulation
Essay On Starvation Based Modulation
 
Lecture 3 l dand_haplotypes_full
Lecture 3 l dand_haplotypes_fullLecture 3 l dand_haplotypes_full
Lecture 3 l dand_haplotypes_full
 
SHSARP paper final
SHSARP paper finalSHSARP paper final
SHSARP paper final
 

Recently uploaded

Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 

Recently uploaded (20)

Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 

Programming Languages for Biological Modeling

  • 1. Programming Languages for Biological Modeling Jean Yang Carnegie Mellon University WeCode, February 4, 2017
  • 2. When People Think of Programming Languages… Do you study PHP? JavaScript? What language should I learn? I’m not a programmer, so your work probably doesn’t apply to me.
  • 3. What Programming Languages Research Is Really About Programming languages provide logical frameworks for reasoning about complex systems. • Reasoning about software systems helps us make them faster and more correct. • Reasoning about other systems can help us with understanding.
  • 4. Complex System of Interest: Cellular Signaling • Important: Understanding cell signaling would help with curing disease! • Difficult: Existing models are difficult to scale and understand.
  • 5. State of the Art Modelling Approaches Systems of differential equations Boolean networks and Petri nets
  • 6. Problems with Existing Approaches • Limited scaling. With differential equations, for instance, need a different equation for each interaction. • Need the whole picture. People don’t just sit down and tweak differential equations until they are right. • Gap between model and intuition. Equations look nothing like what is in biologists’ heads. This is an obstacle in creating models!
  • 7. Insight! Surprising Connection between Logic and Chemistry Programming language semantics Chemical reaction semantics
  • 8. +  @ (0.8) +  @ (0.5) Solution: Model Mechanism Using Rule-Based Programs Mechanistic Kappa rules +  @ (0.2) Initial mixture One step Final state
  • 9. Kappa Advantages • Rules are patterns. Model does not need to enumerate all possible interactions, and we can build up the model piece by piece. • Programs have structure. Can use structure to analyze programs for properties such as reachability, and for reducing models. • Programs have precise semantics. Can get sound guarantees from model analyses.
  • 10. Two Ways to “Run” Kappa Programs Simulation to approximate differential semantics Analyzing rule structure via program analysis Continuous-time stochastic Monte Carlo simulation allows investigation of runtime behavior. Determine properties such as reachability, rule symmetries, and model coarse-graining.
  • 11. Problems: Kappa Models Difficult to Generate 'EGF.EGFR' EGF(r), EGFR(L,CR) -> EGF(r!1), EGFR(L!1,CR) @ 'k_on' 'EGF/EGFR' EGF(r!1), EGFR(L!1,CR) -> EGF(r), EGFR(L,CR) @ 'k_off' 'Shc.Grb2' Shc(Y~p), Grb2(SH2) -> Shc(Y~p!1), Grb2(SH2!1) @ 5*'k_on' 'Shc/Grb2' Shc(Y~p!1), Grb2(SH2!1) -> Shc(Y~p), Grb2(SH2) @ 'k_off' 'EGFR.Grb2' EGFR(Y1092~p), Grb2(SH2) -> EGFR(Y1092~p!1), Grb2(SH2!1) @ 'k_on' 'EGFR/Grb2' EGFR(Y1092~p!1), Grb2(SH2!1) -> EGFR(Y1092~p), Grb2(SH2) @ 'k_off' 'EGFR.Shc' EGFR(Y1172~p), Shc(PTB) -> EGFR(Y1172~p!1), Shc(PTB!1) @ 'k_on' 'EGFR/Shc' EGFR(Y1172~p!1), Shc(PTB!1) -> EGFR(Y1172~p), Shc(PTB) @ 'k_off' 'Grb2.SoS' Grb2(SH3n), SoS(PR,S~u) -> Grb2(SH3n!1), SoS(PR!1,S~u) @ 'k_on' 'Grb2/SoS' Grb2(SH3n!1), SoS(PR!1) -> Grb2(SH3n), SoS(PR) @ 'k_off' 'EGFR.int' EGFR(CR!1,N,C), EGFR(CR!1,N,C) -> EGFR(CR!1,N!2,C), EGFR(CR!1,N,C!2) @ 'k_on' 'EGFR/int' EGFR(CR!1,N!2,C), EGFR(CR!1,N,C!2) -> EGFR(CR!1,N,C), EGFR(CR!1,N,C) @ 'k_off' 'pY1092@EGFR' EGFR(N!1), EGFR(C!1,Y1092~u) -> EGFR(N!1), EGFR(C!1,Y1092~p) @ 'k_cat' 'pY1172@EGFR' EGFR(N!1), EGFR(C!1,Y1172~u) -> EGFR(N!1), EGFR(C!1,Y1172~p) @ 'k_cat' 'uY1092@EGFR' EGFR(Y1092~p) -> EGFR(Y1092~u) @ 'k_cat' 'uY1172@EGFR' EGFR(Y1172~p) -> EGFR(Y1172~u) @ 'k_cat' From the SOS model
  • 12. The Dream of Big Mechanism +  @ (0.8) +  @ (0.5) +  @ (0.2) ︙ Requires precise mechanistic reasoning!
  • 13. Obstacle: Natural Language is Ambiguous NLP Hypothesis: Can use logical deduction to navigate this space. Scientific literature and databases Imprecision of natural language +  @ (0.8) +  @ (0.5) +  @ (0.2) ︙
  • 14. Running Non-Biology Example Billy has a sibling. Billy has a sister. Billy’s Family Tree Billy’s father Billy’s mother Billy Billy’s sibling Billy’s sister
  • 15. Using Domain Facts to Clean up Model Billy’s Family Tree Billy’s father Billy’s mother Billy Billy’s sister Domain fact: A sister is a kind of sibling. Specificity Hierarchy sibling sister Family member type Gendered family member type
  • 16. Things Get Harder with Implication Billy’s Family Tree, if we trust the neighbor Billy’s father Billy’s mother Billy Billy’s sibling Billy’s sister If we trust the neighbor, Billy has a sibling. If we trust the neighbor, Billy has a sister.
  • 17. Deducible Facts Facts from literature ∀𝑎. ℎ𝑎𝑠(𝑎, 𝑠𝑖𝑠𝑡𝑒𝑟) ⇒ ℎ𝑎𝑠(𝑎, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔) 𝑡𝑟𝑢𝑠𝑡 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 ⇒ ℎ𝑎𝑠(𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔) Domain fact 𝑡𝑟𝑢𝑠𝑡 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 ⇒ ℎ𝑎𝑠(𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔) Deducible fact Fact 𝑓 is deducible (and thus redundant) if ¬𝑓 is inconsistent with previous facts.
  • 18. What About Specificity and Implication? If we trust the neighbor, Billy has a sibling. If we trust anybody, Billy has a sibling.
  • 19. Relative Fact Specificity as Subtyping Subtyping Atomic Statements ∀𝑎1, 𝑎2. 𝑔𝑒𝑛𝑑𝑒𝑟𝑒𝑑𝐹𝑎𝑚𝑖𝑙𝑦𝑀𝑒𝑚𝑏𝑒𝑟(𝑎1) <: 𝑓𝑎𝑚𝑖𝑙𝑦𝑀𝑒𝑚𝑏𝑒𝑟 𝑎2 Subtyping Implications 𝑝𝑠 ⇒ 𝑞 𝑠 <: 𝑝𝑡 ⇒ 𝑞𝑡 Subtyping symbol 𝑡𝑟𝑢𝑠𝑡 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 ⇒ ℎ𝑎𝑠 𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑠𝑡𝑒𝑟 <: 𝑡𝑟𝑢𝑠𝑡 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 ⇒ ℎ𝑎𝑠(𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔) 𝑡𝑟𝑢𝑠𝑡 𝑎𝑛𝑦𝑏𝑜𝑑𝑦 ⇒ ℎ𝑎𝑠 𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔 <: 𝑡𝑟𝑢𝑠𝑡 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 ⇒ ℎ𝑎𝑠(𝐵𝑖𝑙𝑙𝑦, 𝑠𝑖𝑏𝑙𝑖𝑛𝑔) Want 𝑝𝑠 ⇒ 𝑞 𝑠 to be consistent with any set of formulas 𝑝𝑡 ⇒ 𝑞𝑡 is consistent with. 𝑝𝑡 <: 𝑝𝑠 𝑝𝑡 ⊢ 𝑞 𝑠 <: 𝑞𝑡
  • 20. Domain Facts in Biology Phosphorylation implies activity A activates B A phosphorylates B Can determine these are redundant, where “A phosphorylates B” is the more specific statement.
  • 21. Hierarchy of Specificity If C or D are present, A phosphorylates B If C or D are present, A activates B If C is present, A phosphorylates B
  • 22. Translating Statements to Graph Logic A phosphorylates B Phosphorylation implies activity A activates B 〈𝐺, 𝛼〉 Set of allowable graphs Iota rules [Husson & Krivine] Actions over graphs preconditions postconditions
  • 23. The Syndra Tool JeanYang/September2016 Syndra Domain knowledge NLP output Consistency checking, synthesis of new statements using first- order logic Currently: “If LRP5/6 and Axin are in a mixture, then it is consistent with phosphorylation by CK1-alpha and GSK3- beta.”
  • 24. Reasoning Goals JeanYang/September2016 Nuggets Processed • LRP5/6 is phosphorylated by CK1-alpha and GSK3- beta. • When phosphorylated, LRP5/6 can bind Axin. Biological Intuition to Check Against If LRP5/6 is next to Axin, then it is likely to have been phosphorylated by CK1-alpha and GSK3-beta. Currently working on: 1. Ways to represent facts about underlying biochemistry. 2. Ways to represent causal relationships. 3. Layer of probabilistic reasoning.
  • 25. Causality in Kappa +  @ (0.5) +  @ (0.2) 1 2 1 2 In Kappa, the rule structure allows us to talk about causal relationships called stories. Ongoing work: story generation; relative frequencies of different stories.
  • 26. The Grand Plan +  @ (0.8) +  @ (0.5) +  @ (0.2) ︙ Space of possible models Rule-based models Experiments, diagnoses, and discoveries Analyses involving rule structures and rates
  • 27. Reasoning is Useful to Domains Other Than Biology! Other fun application domains: • Interactive games and digital humanities • Strange hardware architectures • 3D knitting
  • 28. Main Takeaways • Programming languages research is not so much about programming as it is reasoning in principled ways about complex systems. • The structure of rule-based models allows us to ask questions we could not ask before. • Using a rule-based modelling language with precise operational semantics allows us to do all sorts of analyses.