SlideShare ist ein Scribd-Unternehmen logo
1 von 120
MAPPING GENOTYPE TO
PHENOTYPE USING ATTRIBUTE
GRAMMAR
LAURA ADAM
Genetics, Bioinformatics and Computational Biology Program
ADVISORY COMMITTEE:
Jean Peccoud (Chair)
David Bevan, Harold Garner, François Képès, Naren Ramakrishnan, John Tyson
Doctoral Dissertation Defense
Thursday, July 25th, 2013
Outline
Introduction:
– Synthetic Biology
– Computer Assisted Design
Design:
– GenoCAD: formal language to design biology
– Domain Specific Language: Grammar editor
Simulation:
– Attribute Grammars to compile a mathematical model
– Design Genotype to Phenotype languages
Build:
– Biosecurity questions
Conclusion
2
SYNTHETIC BIOLOGY
Designing Biology
3
Synthetic Biology is:
• The design and construction of new
biological parts, devices, and systems
• Engineering standard biological parts, circuits
• Chemical synthesis of DNA
• The re-design of existing, natural
biological systems for useful purposes or
understanding their properties
• Minimal genome/minimal life
• Synthetic cells/protocells from scratch
• Orthogonal biological systems
4
Applications of Synthetic Biology
Khalil, A. S., & Collins, J. J. (2010). Synthetic biology: applications come of age. Nature reviews. Genetics, 11(5), 367–79.
doi:10.1038/nrg2775
http://syntheticbiology.org/
“[…] change our lives over the coming years, leading to cheaper drugs, 'green' means to fuel our
cars and targeted therapies for attacking 'superbugs' and diseases, such as cancer.”
5
Designing Biology
• No established design
methodology
• Attempts to mimic others
engineering disciplines
(electrical engineering –
logic gates; assembly
language, etc.)
Andrianantoandro, E., Basu, S., Karig, D. K., & Weiss, R.
(2006). Synthetic biology : new engineering rules for an
emerging discipline. Molecular Systems Biology, 1–14.
doi:10.1038/msb4100073
Creating and analyzing new biological systems
Gardner, T. S., Cantor, C. R., & Collins, J. J. (2000).
Construction of a genetic toggle switch in Escherichia
coli. Nature, 403(6767), 339–42. doi:10.1038/35002131
Elowitz, M. B., & Leibler, S. (2000). A synthetic
oscillatory network of transcriptional regulators.
Nature, 403(6767), 335–338. doi:10.1038/35002125
Gibson, D. G. D. G., Glass, J. I. J. I. J. I., Lartigue, C.,
Noskov, V. N. N. V. N., Chuang, R.-Y. Y., Algire, M. A. M.
a., … Venter, J. C. (2010). Creation of a Bacterial Cell
Controlled by a Chemically Synthesized Genome.
Science, 329(5987), 52–6. doi:10.1126/science.1190719
Ro, D. K. et al. Production of the antimalarial drug
precursor artemisinic acid in engineered yeast. Nature
440, 940–943 (2006).
How to
Scale up the complexity
of design in Synthetic Biology?6
Designing Biology?
• Ad hoc, getting too
complex to be done by
hand, error prone,
lengthy (hence, costly)
H. Koeppl et al. (eds.), Design and Analysis of Biomolecular Circuits: Engineering
Approaches to Systems and Synthetic Biology, DOI 10.1007/978-1-4419-6766-4 10,
7
Purnick, P. E. M., & Weiss, R. (2009). The second wave of synthetic
biology: from modules to systems. Nature Reviews Molecular Cell
Biology, 10(6), 410–422. doi:10.1038/nrm2698
Need for a Design Methodology
Assist in the process of:
• designing a system with a DESIRED BEHAVIOR
• constructing its physical realization
Design process in
Synthetic Biology
Design
• Databases
• CAD tools
Analyze
• Computational
tools
Build
• Assembly of parts
• DNA synthesis
8
Design
Design
• Databases
• CAD tools
Analyze
• Computational
tools
Build
• Assembly of parts
• DNA synthesis
9
COMPUTER-ASSISTED DESIGN
(CAD)
FOR SYNTHETIC BIOLOGY
Design
10
CAD for Synthetic Biology
• Top-down: specify a behavior (high level)  constructs
– Lack of quantitative parameters data for parts behavior (see
BioFAB or Matt Lux’s thesis)
• Bottom-up: design a construct from parts  predict its behavior
• Assembly of biological parts, modules, circuits
– Databases, Biobricks™ repository
– aggregated parts sequences  physical DNA of construct
11
CAD tools – design a sequence
• Graphic interface (drag and drop / point and click parts)
– Clotho
– GenoCAD
– SynbioSS
• Programming language-like / text-based
– GEC
• Network diagram
– TinkerCell
– ProMoT
• Chemical equations
– Antimony
12
How to
Represent a genetic construct?
SYNTHETIC BIOLOGY OPEN
LANGUAGE (SBOL)
13
Laura Adam (Virginia Bioinformatics Institute), Aaron Adler (BBN Technologies), J.
Christopher Anderson (Dept of Bioengineering, University of California Berkeley), Jacob
Beal (BBN Technologies), Matthieu Bultelle (Bioengineering, Imperial College London),
Kevin Clancy (Life Technologies), Kendall G. Clark (Clark & Parsia, LLC.), Douglas
Densmore (Electrical and Computer Engineering, Boston University), Omri Drory
(Genome Compiler), Drew Endy (BIOFAB and Dept of Bioengineering, Stanford
University), John H. Gennari (Biomedical and Health Informatics, University of
Washington), Raik Gruenberg (EMBL-CRG Systems Biology program, CRG), Jennifer
Hallinan (School of Computing Science, Newcastle University), Timothy Ham (Joint
BioEnergy Institute), Allan Kuchinsky (Agilent Technologies), Matthew W. Lux (Virginia
Bioinformatics Institute), Curtis Madsen (Electrical and Computer Engineering,
University of Utah), Akshay Maheshwari (UCSD), Barry Moore (Human Genetics,
University of Utah), Chris J. Myers (Electrical and Computer Engineering, University of
Utah), Carlos Olguin (Autodesk Research), Jean Peccoud (Virginia Bioinformatics
Institute), Hector Plahar (Joint BioEnergy Institute), Matthew Pocock (School of
Computing Science, Newcastle University), Cesar A. Rodriguez (BIOFAB), Nicholas
Roehner (Electrical and Computer Engineering, University of Utah), Vincent Rouilly
(Biozentrum, University of Basel), Trevor F. Smith (Agilent Technologies), Guy-Bart
Stan (Bioengineering, Imperial College London), Vinod Tek (Bioengineering, Imperial
College London), Alan Villalobos (DNA 2.0, Inc.), Mandy Wilson (Virginia Bioinformatics
Institute), Chris Winstead (Electrical and Computer Engineering Utah State University),
Anil Wipat (School of Computing Science, Newcastle University), and Fusun Yaman
Sirin (BBN Technologies).
Need for standards in Synthetic
Biology
14
• Core Data Model
• SBOL visual
• libSBOL
– (java, C, python)
Synthetic Biology Open Language (SBOL):
a data exchange standard for descriptions of genetic parts,
devices, modules, and systems
Participation in SBOL
• Workshops I attended:
– Blacksburg, Virginia on January 7-10, 2011
– San Diego, California on June 8, 2011
 Manuscript in submission to Nature Biotechnology
• Won SBOL logo competition
15
GENOCAD, RULE-BASED
DESIGN OF DNA MOLECULES
16
DNA as a Language
What insights can we get from
computational linguistics? 17
Example of formal grammar
A grammar is a:
 Set of rules describing how to form sentences from a language’s vocabulary
18
R1: Sentence → Subject + Verb + Object
R2: Subject → NounPhrase
R3: Object → NounPhrase
R4: NounPhrase → NounPhrase + Modifier
R5: Modifier → PrepositionalPhrase
How to write a sentence? Build a
tree!
19
Sentence
Subject
NounPhrase
Verb Object
NounPhrase
NounPhrase Modifier
Prepositional
Phrase
R1: Sentence → Subject + Verb + Object
R2: Subject → NounPhrase
R3: Object → NounPhrase
R4: NounPhrase → NounPhrase + Modifier
R5: Modifier → PrepositionalPhrase
R1
R3R2
R4
R5
NounPhrase
Verb
NounPhrase
Prepositional
Phrase
Noun
Phrase
People
Verb
are
Noun
Phrase
DNA’s
way
Prepositional
phrase
of making
more DNA
20
~ Edward O. Wilson, 1975
Words make the sentence
Rules make the structure
Rule-based design of DNA
sequences
21
Construct  Promoter Cistron Terminator
Cistron  Cistron Cistron
Cistron  RBS Gene
Terminator  Terminator Terminator
Grammar rules indicates how categories can be arranged to form a ‘valid’ design, parts
from the library implement it.
Rule-based design of DNA
sequences
22
Construct
Promoter
BBa_R0040
Cistron
Cistron
RBS
BBa_B0034
Gene
BBa_E002
0
Cistron
RBS
BBa_B0034
Gene
BBa_E0030
Terminator
Terminator
BBa_B0010
Terminator
BBa_B0012
Construct  Promoter Cistron Terminator
Cistron  Cistron Cistron
Cistron  RBS Gene
Terminator  Terminator Terminator
Context-Free Grammar (CFG)
Design and Validate
23
Construct  Promoter Cistron Terminator
Cistron  Cistron Cistron
Cistron  RBS Gene
Terminator  Terminator Terminator
Construct
Prom
oter
BBa_R
0040
Cistron
Cistron
RBS
BBa_B
0034
Gene
BBa_E
0020
Cistron
RBS
BBa_B
0034
Gene
BBa_E
0030
Terminator
Termin
ator
BBa_B0
010
Termin
ator
BBa_B
0012
Design
Tree
Design
by
Derivation
CFG
Validate
by
Parsing
2007 ~ GenoCAD v1
24Cai, Y., Hartnett, B., Gustafsson, C., & Peccoud, J. (2007). A syntactic model to design and verify synthetic genetic constructs
derived from standard biological parts. Bioinformatics (Oxford, England), 23(20), 2760–7. doi:10.1093/bioinformatics/btm446
Domain Specific Languages
• No universal language
– C, SQL, HTML, Flash
• Languages express design strategies
– Domain-specific
• bacterial, yeast, mammalian
– Project-specific
• gene therapy, synthetic biology
– Organization-specific
• intellectual property, teaching
 Empower end-users to develop their own DSL
25
Bacterial
Yeast
Mammalian
GenoCAD v2 :
Language engineering
26
o GenoCAD.org
o Open Source
o Tutorials
GenoCAD
Context Free Grammar Editor
27
A language for the chloroplast of
Chlamydomonas reinhardtii
• Project to design nitrogen fixating algae
• Gene expression in the chloroplast of
microalgae
– genomic sequences for targeting the
insertion of construct in the chloroplast
28
Chlamydomonas reinhardtii
Identify and Define Categories
Category Definition
5FLR / 3FLR 5’ / 3’ Flanking region for homologous recombination
SIS Short Interval Sequences used to make polycistronic cassettes
STP Stop codon
ATG Start codon
GEN Gene or protein domain. By convention does not include start and stop codons.
CDS Open reading frame composed of several protein domains. Does not include start and
stop codons.
TAG Epitope tags. By convention does not include Start or Stop codons.
PBS Sequence associated with the initiation of transcription and translation.
TCS Targeted expression cassette. Expression cassette flanked with two adjacent
genomic sequences for homologous recombination.
CAS Expression cassette delimited by a promoter in 5’ and a transcription terminator in 3’.
29
Category Definition
[ and ] Negative orientation delimiters
( and ) Plasmid delimiters
{ and } Chromosome delimiters
Grammar Editor – Add/Edit Categories
30
Grammar Editor – Add/Edit Categories
31
Define rewriting rules
Code Rule Comment
CAS S -> TCS This rule is used to design only one expression cassette
1PLAS S ->. ( VEC TCS ) This rule is used to specify the expression cassette along with the vector where it is
inserted. The output is the entire plasmid sequence.
2PLAS S ->. ( VEC TCS ) ( VEC TCS ) This rule is for designs that involve two plasmids.
TGS TCS -> 5FLR CAS 3FLR Specifies the flanking regions for homologous recombination.
PRCT CAS-> PBS CDS TER A gene expression cassette is composed of a promoter, open reading frame, and a
transcription terminator.
2CAS CAS -> CAS CAS This rule makes it possible to have more than one expression cassette on a
construct.
rCAS CAS -> [ CAS ] This rule is used to specify that the cassette is coded on the negative strand.
2CDS CDS -> CDS SIS CDS This rule makes it possible to design polycistronic constructs.
SGEN CDS -> ATG GEN STP The open reading frame is composed of a single gene flanked by a start and stop
codon.
TGEN GEN  GEN TAG This rule is used to add a tag to a coding sequence. It can be used iteratively to
add more than one tag.
2GEN GEN-> GEN GEN This rule can be used to fuse two coding sequences that are not tags.
32
Grammar Editor – Add/Edit Rules
33
Add parts in libraries
34
GenoCAD
Design with your language
35
Point-and-click design tool
36
A dynamic language
• You can change the model …
– Add or Change a Rule in a Grammar
– Delete a Rule from a Grammar
– Remove a Part from a library
– Change a Part’s sequence
– Change a Part’s category
… but how does this affect the dependent
designs?
37
ATGGTGAGCAAGGGCGAGGAGAATAACA
TGGCCATCATCAAGGAGTTCATGCGCTTC
AAGGTGCGCATGGAGGGCTCCGTGAAC
GGCCACGAGTTCGAGATCGAGGGCGAG
GGCGAGGGCCGCCCCTACGAGGGCTTT
CAGACCGCTAAGCTGAAGGTGACC
ATGGTGAGCAAGGGCGAGGAGAATAACA
TGGCCATCATCAAGGAGTTCATGCGCTTC
AAGGTGCGCATGGAGGGCTCCGTGAAC
GGCCACGAGTTCGAGATCGAGGGCGAG
GGCGAGGGCCGCCCCTACGAGGGCTTT
CAGACCGCTAAGCTGAAGGTGACC
CAAGGGCGAGGAGAATAACATGGCCATC
ATCAAGGAGTTCATGCGCTTCAAGGTGC
GCATGGAGGGCTCCGTGAACGGCCACG
AGTTCGAGATCGAGGGCGAGGGCGAGG
GCCGCCCCTACGAGGGCTTTCAGACCGC
TAAGCTGAAGGTGACC
Available Designs
38
Different design statuses
39
Valid – the sequence could be decomposed into its
parts, and the parts’ categories make up a grammar-
sanctioned framework.
Needs validation – either grammar, part, or library has
changed, and the sequence has not been validated
since
Under construction – design is unfinished, so cannot
be compiled.
Out of Date – although design is still valid with respect
to grammars and libraries, the parts have changed.
Invalid – the sequence cannot be resolved.
Left recursion (CIS
 CIS CIS)
Remove orphan
rules
We need to generate
CFG compilers
40
Design - Recap
Design
• Databases
• CAD tools
Analyze
• Computational
tools
Build
• Assembly of parts
• DNA synthesis
 Participate in SBOL, community
effort for standard in Synthetic
Biology
 Context-Free Grammar to design
DNA molecules
 GenoCAD
 Domain specific languages
 Edit libraries of parts and grammars
rules
 CFG compiler generation
41
Analyze
Design
• Databases
• CAD tools
Analyze
• Computational
tools
Build
• Assembly of parts
• DNA synthesis
42
43
44
ATTRIBUTE GRAMMAR FOR
SYNTHETIC BIOLOGY
45
Attribute Grammars
DNA compilation
46
Compiler
Semantic DNA Compilation
Genetic
Design A
Get Chemical
Equations for
A
Attribute
Grammar
47
Semantic DNA Compilation
Genetic
Design B
Get Chemical
Equations for
B
Attribute
Grammar
48
Attribute Grammars (AG)
• AG = a CFG plus:
– Categories and Parts have attributes
– Rules have semantic actions to compute
attributes values
While going through the parse tree, we
now also evaluate the semantics
(meaning)
49
Example - Target output
– Transcription:
• dna  dna + mrna
– Translation:
• mrna  mrna + protein
– Degradation mrna:
• mrna  []
– Degradation protein:
• protein  []
– Interaction promoter protein:
• dna + repressor <-> dna_repressor_x
50
Example – Parts Attributes
• Promoter: transcription rate, repressor
– Promoter(transcription_rate, repressor)  ptetr (50, tetr)
– Promoter(transcription_rate, repressor)  placi (10, laci)
• RBS: translation rate
– RBS(translation_rate)  rbsA (25)
– RBS(translation_rate)  rbsB (50)
• CDS: degradation rates for the protein and the mRNA
– CDS(protein_deg,mrna_deg)  laci(1,1)
– CDS(protein_deg,mrna_deg)  tetr(1,1)
• Terminator
– Terminator  t1
51
Example: Rules Semantic Actions
• CAS  PROMOTER(transcription_rate,
repressor), CIS, TERMINATOR
– Transcription: dna  dna + mrna, [transcription_rate]
– Interaction: if repressor in construct then dna +
repressor <-> dna_repressor_X
• CISTRON  RBS(translation_rate),
CDS(protein_deg,mrna_deg)
– Translation: mrna  mrna + protein, [translation_rate]
– Degradation_mrna: mrna  ϕ, [mrna_deg]
– Degradation_protein: protein  ϕ, [protein_deg]
52
CONSTRUCT
CAS
PROMOTER
ptetr
CIX
CISTRON
RBS
rbsA
CDS
laci
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
[ CAS
PROMOTER
placi
CIX
CISTRON
RBS
rbsB
CDS
tetr
CISTRON
RBS
rbsB
CDS
gfp
TERMINATOR
t1
]
Example: Toggle switch
53
CAS
PROMOTER
ptetr
CIX
CISTRON
RBS
rbsA
CDS
laci
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
54
CAS
PROMOTER
Transcription rate
Repressor
ptetr
CIX
CISTRON
RBS
rbsA
CDS
laci
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
55
CAS
PROMOTER
Transcription rate
Repressor
ptetr
50
tetr
CIX
CISTRON
RBS
rbsA
CDS
laci
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
56
CAS
PROMOTER: ptetr
ptetr
CIX
CISTRON
RBS
rbsA
CDS
laci
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
PROMOTER:
ptetr
50
tetr
ptetr
50
tetr
57
CAS
PROMOTER: ptetr
ptetr
CIX
CISTRON
RBS
rbsA
CDS
laci
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
ptetr
50
tetr
PROMOTER:
ptetr
50
tetr
58
CAS
PROMOTER: ptetr
ptetr
CIX
CISTRON
RBS
Translation rate
rbsA
CDS
mRNA deg rate
Protein deg rate
laci
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
ptetr
50
tetr
PROMOTER:
ptetr
50
tetr
• Translation: mrna  mrna +
protein, [translation_rate]
• Degradation_mrna: mrna  ø,
[mrna_deg]
• Degradation_protein: protein 
ø, [protein_deg] 59
CAS
PROMOTER: ptetr
ptetr
CIX
CISTRON
RBS: rbsA
25
rbsA
25
CDS
mRNA deg rate
Protein deg rate
laci
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
ptetr
50
tetr
PROMOTER:
ptetr
50
tetr
• Translation: mrna  mrna +
protein, [translation_rate]
• Degradation_mrna: mrna  ø,
[mrna_deg]
• Degradation_protein: protein 
ø, [protein_deg] 60
CAS
PROMOTER: ptetr
50
tetr
ptetr
50
tetr
CIX
CISTRON
RBS: rbsA
25
rbsA
25
CDS: laci
1
1
laci
1
1
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
• Translation: mrna  mrna +
protein, [translation_rate]
• Degradation_mrna: mrna  ø,
[mrna_deg]
• Degradation_protein: protein 
ø, [protein_deg] 61
CAS
PROMOTER: ptetr
50
tetr
ptetr
50
tetr
CIX
CISTRON
RBS: rbsA
25
rbsA
25
CDS: laci
1
1
laci
1
1
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
• Translation: mrna_rbsA_laci 
mrna_rbsA_laci + protein_laci,
[25]
• Degradation_mrna:
mrna_rbsA_laci  ø, [1]
• Degradation_protein:
protein_laci  ø, [1] 62
CAS
PROMOTER: ptetr
50
tetr
ptetr
50
tetr
CIX
CISTRON
RBS: rbsA
25
rbsA
25
CDS: laci
1
1
laci
1
1
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
• Translation: mrna_rbsA_laci 
mrna_rbsA_laci + protein_laci,
[25]
• Degradation_mrna:
mrna_rbsA_laci  ø, [1]
• Degradation_protein:
protein_laci  ø, [1] 63
CAS
PROMOTER: ptetr
50
tetr
ptetr
50
tetr
CIX
CISTRON
RBS: rbsA
25
rbsA
25
CDS: laci
1
1
laci
1
1
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
• Translation: mrna_rbsA_laci 
mrna_rbsA_laci + protein_laci,
[25]
• Degradation_mrna:
mrna_rbsA_laci  ø, [1]
• Degradation_protein:
protein_laci  ø, [1] 64
CAS
PROMOTER: ptetr
50
tetr
ptetr
50
tetr
CIX
CISTRON
RBS: rbsA
25
rbsA
25
CDS: laci
1
1
laci
1
1
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
Example: Toggle switch
• Translation: mrna_rbsA_laci 
mrna_rbsA_laci + protein_laci,
[25]
• Degradation_mrna:
mrna_rbsA_laci  ø, [1]
• Degradation_protein:
protein_laci  ø, [1] 65
Example: Toggle switch
• Transcription: dna  dna +
mrna, [transcription_rate]
• Interaction: if repressor in
construct then dna + repressor
 dna_repressor_X
CAS
PROMOTER: ptetr
50
tetr
ptetr
50
tetr
CIX
CISTRON
RBS: rbsA
25
rbsA
25
CDS: laci
1
1
laci
1
1
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
• Translation: mrna_rbsA_laci 
mrna_rbsA_laci + protein_laci,
[25]
• Degradation_mrna:
mrna_rbsA_laci  ø, [1]
• Degradation_protein:
protein_laci  ø, [1] 66
Example: Toggle switch
CAS
PROMOTER: ptetr
50
tetr
ptetr
50
tetr
CIX
CISTRON
RBS: rbsA
25
rbsA
25
CDS: laci
1
1
laci
1
1
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
• Translation: mrna_rbsA_laci 
mrna_rbsA_laci + protein_laci,
[25]
• Degradation_mrna:
mrna_rbsA_laci  ø, [1]
• Degradation_protein:
protein_laci  ø, [1]
Transcription: dna_ptetr_rbsA_laci
 dna_ptetr_rbsA_laci +
mrna_rbsA_laci , [50]
Interaction: if tetr in construct
dna_ptetr_rbsA_laci + protein_tetr
 dna_ptetr_rbsA_laci _tetr_X
67
CONSTRUCT
CAS
PROMOTER
ptetr
CIX
CISTRON
RBS
rbsA
CDS
laci
TERMINATOR
TERMINATOR
t1
TERMINATOR
t2
[ CAS
PROMOTER
placi
CIX
CISTRON
RBS
rbsB
CDS
tetr
CISTRON
RBS
rbsB
CDS
gfp
TERMINATOR
t1
]
Example: Toggle switch
68
Toggle switch
laci/tetr
Get Chemical
Equations for
A
Example: Toggle switch
Transcription: dna_ptetr_rbsA_laci 
dna_ptetr_rbsA_laci + mrna_rbsA_laci ,
[50]
Interaction: if tetr in construct
dna_ptetr_rbsA_laci + protein_tetr 
dna_ptetr_rbsA_laci _tetr_X
Translation: mrna_rbsA_laci 
mrna_rbsA_laci + protein_laci, [25]
Degradation_mrna: mrna_rbsA_laci  φ,
[1]
Degradation_protein: protein_laci  φ, [1]
Transcription: dna_placi_rbsB_tetr 
dna_placi_rbsB_tetr + mrna_rbsB_tetr , [10]
Interaction: if laci in construct
dna_placi_rbsB_tetr + protein_tetr 
dna_placi_rbsB_tetr_laci_X
Translation: mrna_rbsB_tetr 
mrna_rbsB_tetr + protein_tetr, [50]
Degradation_mrna: mrna_rbsB_tetr  φ,
[1]
Degradation_protein: protein_tetr  φ, [1]
Attribute
Grammar
CONS
TRUC
T
CAS
PROM
OTER
ptetr
CIX
CISTR
ON
RBS
rbsA
CDS
laci
TERM
INAT
ORTERM
INAT
OR
t1
TERM
INAT
OR
t2
[ CAS
PROM
OTER
placi
CIX
CISTR
ON
RBS
rbsB
CDS
tetr
CISTR
ON
RBS
rbsB
CDS
gfp
TERM
INAT
OR
t1
]
69
DESIGN OF LANGUAGES FOR
SYSTEMS AND SYNTHETIC
BIOLOGY
Translate genetic designs into mathematical models
70
Attribute Grammar for Systems and
Synthetic Biology
71
Use an API for the Grammar’s
semantic actions
SBML API (use java libSBML)
• species(Name, InitConc)
• parameter(Name, Value)
• reaction(Name, Modifiers, Reactants, Products, Math)
• event(Name, Event_assignments, Trigger_math)
• Etc.
TRANS API
Use keywords to match declared Species, use naming
convention
– TYPE = “DNA”, “PROT”….
– KEY = “LAC”, “TET”,…
72
Parts Attributes
73
Category PartsID Parameters
PTE pTetR parameter('k_transcription_pTetR',10)
PLA placI parameter('k_transcription_placI',25)
PCI pcI parameter('k_transcription_pcI',50)
TER B0010
RBS Strong_RBS
parameter('k_translation_Strong_RBS’,25),
parameter('k_degradation_Strong_RBS',1)
CDS
lacI parameter('k_degradation_LAC_lacI',0.1)
tetR parameter('k_degradation_TET_tetR',0.1)
cIts parameter('k_degradation_CI_cIts',0.1)
Rules Semantic Actions
74
Reaction
Rules Name Modifiers Reactants Products Math
CAS -->
PRO CIS TER Transcription_<CAS.Construct> DNA_<PRO.Construct> mRNA_<CIS.Construct> k_transcription_<PRO.Construct>
CIS --> RBS
CDS
Translation_<CIS.Construct> mRNA_<CIS.Construct> PROT_<CDS.Construct> k_translation_<RBS.Construct>
Degradation_mRNA_<CIS.Construct> mRNA_<CIS.Construct>
-k_degradation_<RBS.Construct>*
mRNA_<CIS.Construct>
Degradation_PROT_<CDS.Construct> PROT_<CDS.Construct>
-k_degradation_<CDS.Construct>*
PROT_<CDS.Construct>
Rules Species Reactions
CAS --> PRO CIS TER species_amount(DNA_<PRO.Construct>,1)
reaction(Transcription_<CAS.Construct>,
[DNA_<PRO.Construct>], [], [mRNA_<CIS.Construct>],
"k_transcription_<PRO.Construct>")
CIS --> RBS CDS
species(mRNA_<CIS.Construct>,0),
species(PROT_<CDS.Construct>,
init_<CDS.Construct>.getValue())
reaction(Translation_<CIS.Construct>,
[mRNA_<CIS.Construct>], [], [PROT_<CDS.Construct>],
"k_translation_<RBS.Construct>"),
reaction(Degradation_mRNA_<CIS.Construct>, [],
[mRNA_<CIS.Construct>], [], ”-
k_degradation_<RBS.Construct>* mRNA_<CIS.Construct>"),
reaction(Degradation_PROT_<CDS.Construct>, [],
[PROT_<CDS.Construct>], [], "-
k_degradation_<CDS.Construct>* PROT_<CDS.Construct>")
Rules Semantic Actions - TRANS
75
PRO  PLA: semantic action for the POSSIBLE interaction (trans) reaction
Name interaction_LAC_<PLA.Construct>
Modifiers
Reactants TRANSspecies{PROT-LAC}
Products TRANSspecies_and_declare{[PROT-LAC],PROT-
LAC_DNA_<PLA.Construct>_x,0}
Math TRANS{[PROT-LAC],+ k_binding_PROT-
LAC_<PLA.Construct> - k_release_PROT-
LAC_<PLA.Construct> * PROT-
LAC_DNA_<PLA.Construct>_x,0}
GenoCAD supports AG
Design to Simulation
76
Add it to GenoCAD
77
Database model for
Attribute Grammars
78
Compiler generation
79
Synthetic Biology designs
Toggle switch (Garner, 2000) Oscillator (Elowitz, 2000)
Same libraries of parts but different layouts.
Used SBML API to design an attribute grammar using Wilson-Cowan rate laws 80
The genetic toggle switch
81
The genetic toggle switch
82
The oscillator
83
A G2P language for Natural
Genome
• Scale-up to handle natural genome
– Illustrate Attribute Grammar as a formalism to
map Genotype to Phenotype
• Systems Biology
– Regulation of the cell cycle of the budding
yeast
84
Cell Cycle AG Syntax
Genome --> InitialCondition Model ( ChrI ) ( ChrII ) ( ChrIII ) ( ChrIV ) ( ChrV ) ( ChrVI
) ( ChrVII ) ( ChrVIII ) ( ChrIX ) ( ChrX ) ( ChrXI ) ( ChrXII ) ( ChrXIII ) ( ChrXIV ) (
ChrXV ) ( ChrXVI )
ChrI --> ChrI_L [Cln3 ] ChrI_M1 [Lte1 ] ChrI_M2 [Cdc15 ] ChrI_R
ChrIV --> ChrIV_L [Pds1 ] ChrIV_M [Swi5 ] ChrIV_R
ChrV --> ChrV_L [SBF] CHrV_M Bck2 ChrV_R
ChrVI --> ChrVI_L [Cdc14 ] ChrVI_R
ChrVII --> ChrVII_L Cdc20 ChrVII_M1 [Cdh1 ] ChrVII_M2 [Esp1 ] ChrVII_R
ChrX --> ChrX_L Cdc6 ChrX_M1 Net1 ChrX_M2 Mad2 ChrX_R
ChrXI --> ChrXI_L [APC] ChrXI_R
ChrXII --> ChrXII_L Sic1 ChrXII_R
ChrXIII --> ChrXIII_L [Tem1 ] ChrXIII_M1 Mcm1 ChrXIII_M2 [Bub2 ] ChrXIII_R
ChrXVI --> ChrXVI_L [Cln2 ] ChrXVI_M1 Clb2 ChrXVI_M2 [Clb5 ] ChrXVI_R
85Chen, K. C., Calzone, L., Csikasz-Nagy, A., Cross, F. R., Novak, B., & Tyson, J. J. (2004). Integrative analysis of cell cycle control in
budding yeast. Molecular biology of the cell, 15(8), 3841-62. doi:10.1091/mbc.E03-11-0794
Cell Cycle AG semantics
86
Cell Cycle Wild-Type
Design and Simulation
87
Mutant: cln2Δ
88
Discrete Boolean Network
• Design variety of language!
• Qualitative
• API GinML: edges and nodes
Thieffry, D., & Thomas, R. (1995). Dynamical behaviour of biological regulatory networks—II. Immunity control in
bacteriophage lambda. Bulletin of Mathematical Biology. Retrieved from
http://link.springer.com/article/10.1007/BF02460619
89
Analyze - Recap
Design
• Databases
• CAD tools
Analyze
• Computational
tools
Build
• Assembly of parts
• DNA synthesis
 Attribute Grammar to map genotype and phenotype
 Designing G2P languages, use SBML or GinML API
 Model natural genome, the cell cycle example
 Implementation in GenoCAD: design to simulation
 Generation of AG compilers from database 90
Build
Design
• Databases
• CAD tools
Analyze
• Computational
tools
Build
• Assembly of parts
• DNA synthesis
91
BIOSECURITY IMPLICATIONS
DNA synthesis
92
The Risk of Deliberate Misuse
93
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G
A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T
C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C
94
Sections:
Customer screening
Sequence screening
Record retention
Government contact
“[…] to minimize the risk that unauthorized individuals or
individuals with malicious intent will obtain “toxins
and agents of concern” through the use of nucleic
acid synthesis technologies, and to simultaneously
minimize any negative impacts on the conduct of
research and business operations.” 95
96
97
98
99
iGEM--International Genetically
Engineered Machine
• Summer project for teams of undergrads in Synthetic Biology
– Projects range from a rainbow of pigmented bacteria, to banana
smelling bacteria, an arsenic biosensor, etc.
– 165 teams in 2011
• Judge
– iGEM 2012 Americas East Jamboree (Information Processing &
Fundamental Advances & Software track) in Pittsburgh, PA
– aGEM 2012 in Edmonton, Canada
– iGEM 2011 World Championship (Software track)
in MIT Cambridge, MA
– iGEM 2011 Americas Jamboree (Information Processing &
Software track) in Indianapolis, IN
– iGEM 2010 World Championship (Poster) in MIT Cambridge, MA
• Advisor of teams
– Virginia Tech iGEM 2011 team
– VT-ENSIMAG Biosecurity software team for iGEM 2010 100
Biological weapons nonproliferation
• 1 year postdoc fellowship
• Center for Nonproliferation Studies at the
Monterey Institute for International Studies
101
Build - Recap
Design
• Databases
• CAD tools
Analyze
• Computational
tools
Build
• Assembly of parts
• DNA synthesis
 Screening a DNA
sequence for Select Agent
and Toxins
 Relations our science and
policy
 Engaging the students
(iGEM)
102
CONCLUSIONS
103
Conclusions
 Design: GenoCAD – Rule-based genetic design tools user-
friendly and Domain Specific Language
 Galdzicki, M., Wilson, M. L., Rodriguez, C. A., Pocock, M. R., Oberortner,
E., Adam, L., … Sauro, H. M. (2012). Synthetic Biology Open Language
(SBOL) Version 1.1.0, 1–26. (and NBT paper in submission)
 Wilson, M. L., Hertzberg, R., Adam, L., & Peccoud, J. (2011). A step-by-
step introduction to rule-based design of synthetic genetic constructs
using GenoCAD. (C. Voigt, Ed.)Methods in enzymology, 498, 173–88.
doi:10.1016/B978-0-12-385120-8.00008-5
 Mandy L Wilson, Sakiko Okumoto, Laura Adam and Jean Peccoud.
Development of a domain-specific genetic language to design
Chlamydomonas reinhardtii expression vectors.(Manuscript in
preparation)
– Talk: Adam, L. & Peccoud, J. Formal grammars to protect intellectual
properties in synthetic biology. International Conference on Synthetic
Biology at Evry, France, December 15-16, 2010.
– GenoCAD tutorials
104
Conclusions
 Design: GenoCAD – Rule-based genetic design tools user-
friendly and Domain Specific Language
 Analyze: Semantic models of DNA sequences
 Cai, Y., Lux, M. W., Adam, L., & Peccoud, J. (2009). Modeling structure-
function relationships in synthetic DNA sequences using attribute
grammars. PLoS computational biology, 5(10)
 Laura Adam, Matthew W. Lux, Mandy L. Wilson, Tian Hong, Jean Peccoud.
Design of Languages for Systems and Synthetic Biology to translate
genetic designs into mathematical models. (Manuscript in preparation)
– Poster: Adam, L. & Peccoud, J. (2011). Using user defined semantic
languages in synthetic biology: generating DNA compilers. Third
International Workshop on BioDesign Automation (IWBDA) at 48th
ACM/EDAC/IEEE Design Automation Conference (DAC) in San Diego, CA.
– Talk:. Formal languages to map Genotype to Phenotype in Natural
Genomes. GBCB seminar, 2012.
105
Conclusions
 Design: GenoCAD – Rule-based genetic design tools user-
friendly and Domain Specific Language
 Analyze: Semantic models of DNA sequences
 Build: Biosecurity issues and DNA synthesis
 Adam, L., et al. (2011). Strengths and limitations of the federal guidance
on synthetic DNA. Nature Biotechnology, 29(3), 208–210.
doi:10.1038/nbt.1802
– Talk: GenoTHREAT: A biosecurity software to screen DNA synthesis
orders against Pathogens. GBCB seminar, 2011.
– Adam, L.(2011). Scientists need to be proactive to foment international
biosecurity. Runner up essay for the “Young Scientists” essay contest
organized by the Implementation and Support Unit of the Biological Weapon
Convention at the United Nations
106
Conclusions
 Design: GenoCAD – Rule-based genetic design tools user-
friendly and Domain Specific Language
 Analyze: Semantic models of DNA sequences
 Build: Biosecurity issues and DNA synthesis
 Define your Domain Specific G2P languages, Design
mutants and Analyze them in GenoCAD in minutes!
107
Acknowledgements
VBI SynBio Group
• J. Peccoud (P.I.)
• N. Adames
• D. Ball
• M. Lux
• C. Overend
• M. Wilson
• and Patrick (Yizhi) Cai
• and R. Hertzberg
PhD committee:
 Dr. Bevan
 Dr. Garner
 Dr. Kepes
 Dr. Peccoud
 Dr. Ramakrishnan
 Dr. Tyson
• And Dennie Munson!
108
Collaborators
• SBOL: H. Sauro, C.Myers, D. Densmore, C. Rodriguez, M.
Galdzicki and many more
• Language: Eric Van Wyck
• GenoGUARD: Ed You (FBI)
Questions?
109
ADDITIONAL INFORMATION
Resources
110
SBOL
111
What if we could…
• Compile circular DNA
• Read the different messages on both
strands
• Lexical analysis of natural sequences
• Customize (GenoCAD) and standardize?
(SBOL)
• Handle ambiguity
112
Natural
language
Natural
genome
Formal
language
Synthetic
biology
Syntactic Limitation
113
The Chomsky hierachy
Searls, D.B. “Linguistic approaches to biological sequences.” Bioinformatics 13, no. 4 (1997): 333.
http://bioinformatics.oxfordjournals.org/cgi/content/abstract/13/4/333.
Parsing
114
Left to Right
Top-Down
Parse
The Parse Tree of the Sentence
"The boy went home“
Right to Left
Top-Down
Parse
Left to Right
Bottom-Up
Parse
Right to Left
Bottom-Up
Parse
Use of attribute grammar in
synthetic biology
115
Formal definition Semantic In the synthetic
biology context
V, a finite set of non-
terminals
Attributes Parts categories
Σ, a finite set of
terminals
Attributes values Genetic Parts
R, a finite relation from
V to (VUΣ)*
Semantic actions Design Rules
S∈V, the start symbol Hard-coded
declarations
Start
http://www.daisyginsberg.com
Engineering Life
The Synthetic Kingdom
116
117
Insights from
Genotype-to-Phenotype (G2P)
mapping
G2P map
The
Phenotype
is X
Genotype
• genetic makeup of a cell, an
organism, or an individual
• specific alleles
• inherited
Phenotype
• observable characteristics or
traits
118
Traditional G2P mapping is linear
• Sui Huang, Rational drug discovery: what can we learn from regulatory networks?, Drug Discovery Today, Volume 7, Issue 20, 15
October 2002
• Peccoud, J., Velden, K. V., Podlich, D., Winkler, C., Arthur, L., & Cooper, M. (2004). The selective values of alleles in a molecular
network model are context dependent. Genetics, 166(4), 1715–25.
Phenotypes
Central dogma
119
Current Formalisms
Databases:
genetic mapping, genome annotation,
genotype, mutant, transcriptome,
proteome and metabolome data.
Ontologies:
Controlled vocabulary for annotation of
genes and their products (cellular
component, molecular function,
biological process)
Actually, G2P maps are nonlinear:
Gene Networks
• Priest, N. K., Rudkin, J. K., Feil, E. J., van den Elsen, J. M. H., Cheung, A., Peacock, S. J., Laabei, M., et al. (2012). From
genotype to phenotype: can systems biology be used to predict Staphylococcus aureus virulence? Nature reviews. Microbiology,
10(11), 791–7. doi:10.1038/nrmicro2880
• Benfey, P. N., & Mitchell-Olds, T. (2008). From genotype to phenotype: systems biology meets natural variation. Science.
“replacing the linear pathways with interconnected networks.”
120

Weitere ähnliche Inhalte

Andere mochten auch

Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informaticsDaniela Rotariu
 
Bioinformatics A Biased Overview
Bioinformatics A Biased OverviewBioinformatics A Biased Overview
Bioinformatics A Biased OverviewPhilip Bourne
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataPhilip Bourne
 
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...Torsten Seemann
 
Flow Cytometry Training : Introduction day 1 session 1
Flow Cytometry Training : Introduction day 1 session 1Flow Cytometry Training : Introduction day 1 session 1
Flow Cytometry Training : Introduction day 1 session 1Robert (Rob) Salomon
 
DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification Senthil Natesan
 
Ap Chapter 21
Ap Chapter 21Ap Chapter 21
Ap Chapter 21smithbio
 
Molecular Markers: Major Applications in Insects
Molecular Markers: Major Applications in InsectsMolecular Markers: Major Applications in Insects
Molecular Markers: Major Applications in InsectsSaramita De Chakravarti
 
How to be a bioinformatician
How to be a bioinformaticianHow to be a bioinformatician
How to be a bioinformaticianChristian Frech
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformaticsAbhishek Vatsa
 

Andere mochten auch (11)

Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informatics
 
Bioinformatics A Biased Overview
Bioinformatics A Biased OverviewBioinformatics A Biased Overview
Bioinformatics A Biased Overview
 
Bioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big DataBioinformatics in the Era of Open Science and Big Data
Bioinformatics in the Era of Open Science and Big Data
 
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...De novo genome assembly  - T.Seemann - IMB winter school 2016 - brisbane, au ...
De novo genome assembly - T.Seemann - IMB winter school 2016 - brisbane, au ...
 
Flow Cytometry Training : Introduction day 1 session 1
Flow Cytometry Training : Introduction day 1 session 1Flow Cytometry Training : Introduction day 1 session 1
Flow Cytometry Training : Introduction day 1 session 1
 
DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification
 
Ap Chapter 21
Ap Chapter 21Ap Chapter 21
Ap Chapter 21
 
Molecular Markers: Major Applications in Insects
Molecular Markers: Major Applications in InsectsMolecular Markers: Major Applications in Insects
Molecular Markers: Major Applications in Insects
 
How to be a bioinformatician
How to be a bioinformaticianHow to be a bioinformatician
How to be a bioinformatician
 
Gene concept
Gene conceptGene concept
Gene concept
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
 

Ähnlich wie Mapping Genotype to Phenotype using Attribute Grammar, Laura Adam

What Synthetic Biology Can Do For You
What Synthetic Biology Can Do For YouWhat Synthetic Biology Can Do For You
What Synthetic Biology Can Do For YouEric Ma
 
Cyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in BiocomputingCyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in BiocomputingJeremy Yang
 
Advanced computationalsyntbio
Advanced computationalsyntbioAdvanced computationalsyntbio
Advanced computationalsyntbioNatalio Krasnogor
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersLarry Smarr
 
Biological Science Collections Tagging and Tracking presented at SPNHC
Biological Science Collections Tagging and Tracking presented at SPNHCBiological Science Collections Tagging and Tracking presented at SPNHC
Biological Science Collections Tagging and Tracking presented at SPNHCRob Guralnick
 
2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible research2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible researchYannick Wurm
 
Bioengineering: making life from scratch
Bioengineering: making life from scratchBioengineering: making life from scratch
Bioengineering: making life from scratchMelanie Swan
 
Collaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeCollaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeChris Mungall
 
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...Nathan Dunn
 
DNA Barcoding and its application in species identification
DNA Barcoding and its application in species identificationDNA Barcoding and its application in species identification
DNA Barcoding and its application in species identificationsupriya k
 
Pipeline development for analysis of solid mutants of Solanum tuberosum Gr. ...
Pipeline development  for analysis of solid mutants of Solanum tuberosum Gr. ...Pipeline development  for analysis of solid mutants of Solanum tuberosum Gr. ...
Pipeline development for analysis of solid mutants of Solanum tuberosum Gr. ...AlexisQuintero28
 
Synthetic Biology-Engaging Biology with Engineering
Synthetic Biology-Engaging Biology with EngineeringSynthetic Biology-Engaging Biology with Engineering
Synthetic Biology-Engaging Biology with EngineeringNavaneetha Krishnan J
 
Enabling Patient-Driven Medicine Using Graph Database
Enabling Patient-Driven Medicine Using Graph DatabaseEnabling Patient-Driven Medicine Using Graph Database
Enabling Patient-Driven Medicine Using Graph DatabaseNeo4j
 
Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsProf. Wim Van Criekinge
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyBarry Smith
 

Ähnlich wie Mapping Genotype to Phenotype using Attribute Grammar, Laura Adam (20)

What Synthetic Biology Can Do For You
What Synthetic Biology Can Do For YouWhat Synthetic Biology Can Do For You
What Synthetic Biology Can Do For You
 
Cyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in BiocomputingCyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in Biocomputing
 
NRNB EAC Report 2011
NRNB EAC Report 2011NRNB EAC Report 2011
NRNB EAC Report 2011
 
CV_10/17
CV_10/17CV_10/17
CV_10/17
 
Cv long
Cv longCv long
Cv long
 
Advanced computationalsyntbio
Advanced computationalsyntbioAdvanced computationalsyntbio
Advanced computationalsyntbio
 
MoM2010: Bioinformatics
MoM2010: BioinformaticsMoM2010: Bioinformatics
MoM2010: Bioinformatics
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics Researchers
 
Biological Science Collections Tagging and Tracking presented at SPNHC
Biological Science Collections Tagging and Tracking presented at SPNHCBiological Science Collections Tagging and Tracking presented at SPNHC
Biological Science Collections Tagging and Tracking presented at SPNHC
 
2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible research2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible research
 
Bioengineering: making life from scratch
Bioengineering: making life from scratchBioengineering: making life from scratch
Bioengineering: making life from scratch
 
Collaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of LifeCollaboratively Creating the Knowledge Graph of Life
Collaboratively Creating the Knowledge Graph of Life
 
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
 
DNA Barcoding and its application in species identification
DNA Barcoding and its application in species identificationDNA Barcoding and its application in species identification
DNA Barcoding and its application in species identification
 
NRNB EAC Meeting 2012
NRNB EAC Meeting 2012NRNB EAC Meeting 2012
NRNB EAC Meeting 2012
 
Pipeline development for analysis of solid mutants of Solanum tuberosum Gr. ...
Pipeline development  for analysis of solid mutants of Solanum tuberosum Gr. ...Pipeline development  for analysis of solid mutants of Solanum tuberosum Gr. ...
Pipeline development for analysis of solid mutants of Solanum tuberosum Gr. ...
 
Synthetic Biology-Engaging Biology with Engineering
Synthetic Biology-Engaging Biology with EngineeringSynthetic Biology-Engaging Biology with Engineering
Synthetic Biology-Engaging Biology with Engineering
 
Enabling Patient-Driven Medicine Using Graph Database
Enabling Patient-Driven Medicine Using Graph DatabaseEnabling Patient-Driven Medicine Using Graph Database
Enabling Patient-Driven Medicine Using Graph Database
 
Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformatics
 
Introduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental BiologyIntroduction to Ontologies for Environmental Biology
Introduction to Ontologies for Environmental Biology
 

Kürzlich hochgeladen

Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxjana861314
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 

Kürzlich hochgeladen (20)

Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 

Mapping Genotype to Phenotype using Attribute Grammar, Laura Adam

  • 1. MAPPING GENOTYPE TO PHENOTYPE USING ATTRIBUTE GRAMMAR LAURA ADAM Genetics, Bioinformatics and Computational Biology Program ADVISORY COMMITTEE: Jean Peccoud (Chair) David Bevan, Harold Garner, François Képès, Naren Ramakrishnan, John Tyson Doctoral Dissertation Defense Thursday, July 25th, 2013
  • 2. Outline Introduction: – Synthetic Biology – Computer Assisted Design Design: – GenoCAD: formal language to design biology – Domain Specific Language: Grammar editor Simulation: – Attribute Grammars to compile a mathematical model – Design Genotype to Phenotype languages Build: – Biosecurity questions Conclusion 2
  • 4. Synthetic Biology is: • The design and construction of new biological parts, devices, and systems • Engineering standard biological parts, circuits • Chemical synthesis of DNA • The re-design of existing, natural biological systems for useful purposes or understanding their properties • Minimal genome/minimal life • Synthetic cells/protocells from scratch • Orthogonal biological systems 4
  • 5. Applications of Synthetic Biology Khalil, A. S., & Collins, J. J. (2010). Synthetic biology: applications come of age. Nature reviews. Genetics, 11(5), 367–79. doi:10.1038/nrg2775 http://syntheticbiology.org/ “[…] change our lives over the coming years, leading to cheaper drugs, 'green' means to fuel our cars and targeted therapies for attacking 'superbugs' and diseases, such as cancer.” 5
  • 6. Designing Biology • No established design methodology • Attempts to mimic others engineering disciplines (electrical engineering – logic gates; assembly language, etc.) Andrianantoandro, E., Basu, S., Karig, D. K., & Weiss, R. (2006). Synthetic biology : new engineering rules for an emerging discipline. Molecular Systems Biology, 1–14. doi:10.1038/msb4100073 Creating and analyzing new biological systems Gardner, T. S., Cantor, C. R., & Collins, J. J. (2000). Construction of a genetic toggle switch in Escherichia coli. Nature, 403(6767), 339–42. doi:10.1038/35002131 Elowitz, M. B., & Leibler, S. (2000). A synthetic oscillatory network of transcriptional regulators. Nature, 403(6767), 335–338. doi:10.1038/35002125 Gibson, D. G. D. G., Glass, J. I. J. I. J. I., Lartigue, C., Noskov, V. N. N. V. N., Chuang, R.-Y. Y., Algire, M. A. M. a., … Venter, J. C. (2010). Creation of a Bacterial Cell Controlled by a Chemically Synthesized Genome. Science, 329(5987), 52–6. doi:10.1126/science.1190719 Ro, D. K. et al. Production of the antimalarial drug precursor artemisinic acid in engineered yeast. Nature 440, 940–943 (2006). How to Scale up the complexity of design in Synthetic Biology?6
  • 7. Designing Biology? • Ad hoc, getting too complex to be done by hand, error prone, lengthy (hence, costly) H. Koeppl et al. (eds.), Design and Analysis of Biomolecular Circuits: Engineering Approaches to Systems and Synthetic Biology, DOI 10.1007/978-1-4419-6766-4 10, 7 Purnick, P. E. M., & Weiss, R. (2009). The second wave of synthetic biology: from modules to systems. Nature Reviews Molecular Cell Biology, 10(6), 410–422. doi:10.1038/nrm2698 Need for a Design Methodology Assist in the process of: • designing a system with a DESIRED BEHAVIOR • constructing its physical realization
  • 8. Design process in Synthetic Biology Design • Databases • CAD tools Analyze • Computational tools Build • Assembly of parts • DNA synthesis 8
  • 9. Design Design • Databases • CAD tools Analyze • Computational tools Build • Assembly of parts • DNA synthesis 9
  • 11. CAD for Synthetic Biology • Top-down: specify a behavior (high level)  constructs – Lack of quantitative parameters data for parts behavior (see BioFAB or Matt Lux’s thesis) • Bottom-up: design a construct from parts  predict its behavior • Assembly of biological parts, modules, circuits – Databases, Biobricks™ repository – aggregated parts sequences  physical DNA of construct 11
  • 12. CAD tools – design a sequence • Graphic interface (drag and drop / point and click parts) – Clotho – GenoCAD – SynbioSS • Programming language-like / text-based – GEC • Network diagram – TinkerCell – ProMoT • Chemical equations – Antimony 12 How to Represent a genetic construct?
  • 13. SYNTHETIC BIOLOGY OPEN LANGUAGE (SBOL) 13 Laura Adam (Virginia Bioinformatics Institute), Aaron Adler (BBN Technologies), J. Christopher Anderson (Dept of Bioengineering, University of California Berkeley), Jacob Beal (BBN Technologies), Matthieu Bultelle (Bioengineering, Imperial College London), Kevin Clancy (Life Technologies), Kendall G. Clark (Clark & Parsia, LLC.), Douglas Densmore (Electrical and Computer Engineering, Boston University), Omri Drory (Genome Compiler), Drew Endy (BIOFAB and Dept of Bioengineering, Stanford University), John H. Gennari (Biomedical and Health Informatics, University of Washington), Raik Gruenberg (EMBL-CRG Systems Biology program, CRG), Jennifer Hallinan (School of Computing Science, Newcastle University), Timothy Ham (Joint BioEnergy Institute), Allan Kuchinsky (Agilent Technologies), Matthew W. Lux (Virginia Bioinformatics Institute), Curtis Madsen (Electrical and Computer Engineering, University of Utah), Akshay Maheshwari (UCSD), Barry Moore (Human Genetics, University of Utah), Chris J. Myers (Electrical and Computer Engineering, University of Utah), Carlos Olguin (Autodesk Research), Jean Peccoud (Virginia Bioinformatics Institute), Hector Plahar (Joint BioEnergy Institute), Matthew Pocock (School of Computing Science, Newcastle University), Cesar A. Rodriguez (BIOFAB), Nicholas Roehner (Electrical and Computer Engineering, University of Utah), Vincent Rouilly (Biozentrum, University of Basel), Trevor F. Smith (Agilent Technologies), Guy-Bart Stan (Bioengineering, Imperial College London), Vinod Tek (Bioengineering, Imperial College London), Alan Villalobos (DNA 2.0, Inc.), Mandy Wilson (Virginia Bioinformatics Institute), Chris Winstead (Electrical and Computer Engineering Utah State University), Anil Wipat (School of Computing Science, Newcastle University), and Fusun Yaman Sirin (BBN Technologies).
  • 14. Need for standards in Synthetic Biology 14 • Core Data Model • SBOL visual • libSBOL – (java, C, python) Synthetic Biology Open Language (SBOL): a data exchange standard for descriptions of genetic parts, devices, modules, and systems
  • 15. Participation in SBOL • Workshops I attended: – Blacksburg, Virginia on January 7-10, 2011 – San Diego, California on June 8, 2011  Manuscript in submission to Nature Biotechnology • Won SBOL logo competition 15
  • 16. GENOCAD, RULE-BASED DESIGN OF DNA MOLECULES 16
  • 17. DNA as a Language What insights can we get from computational linguistics? 17
  • 18. Example of formal grammar A grammar is a:  Set of rules describing how to form sentences from a language’s vocabulary 18 R1: Sentence → Subject + Verb + Object R2: Subject → NounPhrase R3: Object → NounPhrase R4: NounPhrase → NounPhrase + Modifier R5: Modifier → PrepositionalPhrase
  • 19. How to write a sentence? Build a tree! 19 Sentence Subject NounPhrase Verb Object NounPhrase NounPhrase Modifier Prepositional Phrase R1: Sentence → Subject + Verb + Object R2: Subject → NounPhrase R3: Object → NounPhrase R4: NounPhrase → NounPhrase + Modifier R5: Modifier → PrepositionalPhrase R1 R3R2 R4 R5 NounPhrase Verb NounPhrase Prepositional Phrase
  • 20. Noun Phrase People Verb are Noun Phrase DNA’s way Prepositional phrase of making more DNA 20 ~ Edward O. Wilson, 1975 Words make the sentence Rules make the structure
  • 21. Rule-based design of DNA sequences 21 Construct  Promoter Cistron Terminator Cistron  Cistron Cistron Cistron  RBS Gene Terminator  Terminator Terminator Grammar rules indicates how categories can be arranged to form a ‘valid’ design, parts from the library implement it.
  • 22. Rule-based design of DNA sequences 22 Construct Promoter BBa_R0040 Cistron Cistron RBS BBa_B0034 Gene BBa_E002 0 Cistron RBS BBa_B0034 Gene BBa_E0030 Terminator Terminator BBa_B0010 Terminator BBa_B0012 Construct  Promoter Cistron Terminator Cistron  Cistron Cistron Cistron  RBS Gene Terminator  Terminator Terminator
  • 23. Context-Free Grammar (CFG) Design and Validate 23 Construct  Promoter Cistron Terminator Cistron  Cistron Cistron Cistron  RBS Gene Terminator  Terminator Terminator Construct Prom oter BBa_R 0040 Cistron Cistron RBS BBa_B 0034 Gene BBa_E 0020 Cistron RBS BBa_B 0034 Gene BBa_E 0030 Terminator Termin ator BBa_B0 010 Termin ator BBa_B 0012 Design Tree Design by Derivation CFG Validate by Parsing
  • 24. 2007 ~ GenoCAD v1 24Cai, Y., Hartnett, B., Gustafsson, C., & Peccoud, J. (2007). A syntactic model to design and verify synthetic genetic constructs derived from standard biological parts. Bioinformatics (Oxford, England), 23(20), 2760–7. doi:10.1093/bioinformatics/btm446
  • 25. Domain Specific Languages • No universal language – C, SQL, HTML, Flash • Languages express design strategies – Domain-specific • bacterial, yeast, mammalian – Project-specific • gene therapy, synthetic biology – Organization-specific • intellectual property, teaching  Empower end-users to develop their own DSL 25 Bacterial Yeast Mammalian
  • 26. GenoCAD v2 : Language engineering 26 o GenoCAD.org o Open Source o Tutorials
  • 28. A language for the chloroplast of Chlamydomonas reinhardtii • Project to design nitrogen fixating algae • Gene expression in the chloroplast of microalgae – genomic sequences for targeting the insertion of construct in the chloroplast 28 Chlamydomonas reinhardtii
  • 29. Identify and Define Categories Category Definition 5FLR / 3FLR 5’ / 3’ Flanking region for homologous recombination SIS Short Interval Sequences used to make polycistronic cassettes STP Stop codon ATG Start codon GEN Gene or protein domain. By convention does not include start and stop codons. CDS Open reading frame composed of several protein domains. Does not include start and stop codons. TAG Epitope tags. By convention does not include Start or Stop codons. PBS Sequence associated with the initiation of transcription and translation. TCS Targeted expression cassette. Expression cassette flanked with two adjacent genomic sequences for homologous recombination. CAS Expression cassette delimited by a promoter in 5’ and a transcription terminator in 3’. 29 Category Definition [ and ] Negative orientation delimiters ( and ) Plasmid delimiters { and } Chromosome delimiters
  • 30. Grammar Editor – Add/Edit Categories 30
  • 31. Grammar Editor – Add/Edit Categories 31
  • 32. Define rewriting rules Code Rule Comment CAS S -> TCS This rule is used to design only one expression cassette 1PLAS S ->. ( VEC TCS ) This rule is used to specify the expression cassette along with the vector where it is inserted. The output is the entire plasmid sequence. 2PLAS S ->. ( VEC TCS ) ( VEC TCS ) This rule is for designs that involve two plasmids. TGS TCS -> 5FLR CAS 3FLR Specifies the flanking regions for homologous recombination. PRCT CAS-> PBS CDS TER A gene expression cassette is composed of a promoter, open reading frame, and a transcription terminator. 2CAS CAS -> CAS CAS This rule makes it possible to have more than one expression cassette on a construct. rCAS CAS -> [ CAS ] This rule is used to specify that the cassette is coded on the negative strand. 2CDS CDS -> CDS SIS CDS This rule makes it possible to design polycistronic constructs. SGEN CDS -> ATG GEN STP The open reading frame is composed of a single gene flanked by a start and stop codon. TGEN GEN  GEN TAG This rule is used to add a tag to a coding sequence. It can be used iteratively to add more than one tag. 2GEN GEN-> GEN GEN This rule can be used to fuse two coding sequences that are not tags. 32
  • 33. Grammar Editor – Add/Edit Rules 33
  • 34. Add parts in libraries 34
  • 37. A dynamic language • You can change the model … – Add or Change a Rule in a Grammar – Delete a Rule from a Grammar – Remove a Part from a library – Change a Part’s sequence – Change a Part’s category … but how does this affect the dependent designs? 37 ATGGTGAGCAAGGGCGAGGAGAATAACA TGGCCATCATCAAGGAGTTCATGCGCTTC AAGGTGCGCATGGAGGGCTCCGTGAAC GGCCACGAGTTCGAGATCGAGGGCGAG GGCGAGGGCCGCCCCTACGAGGGCTTT CAGACCGCTAAGCTGAAGGTGACC ATGGTGAGCAAGGGCGAGGAGAATAACA TGGCCATCATCAAGGAGTTCATGCGCTTC AAGGTGCGCATGGAGGGCTCCGTGAAC GGCCACGAGTTCGAGATCGAGGGCGAG GGCGAGGGCCGCCCCTACGAGGGCTTT CAGACCGCTAAGCTGAAGGTGACC CAAGGGCGAGGAGAATAACATGGCCATC ATCAAGGAGTTCATGCGCTTCAAGGTGC GCATGGAGGGCTCCGTGAACGGCCACG AGTTCGAGATCGAGGGCGAGGGCGAGG GCCGCCCCTACGAGGGCTTTCAGACCGC TAAGCTGAAGGTGACC
  • 39. Different design statuses 39 Valid – the sequence could be decomposed into its parts, and the parts’ categories make up a grammar- sanctioned framework. Needs validation – either grammar, part, or library has changed, and the sequence has not been validated since Under construction – design is unfinished, so cannot be compiled. Out of Date – although design is still valid with respect to grammars and libraries, the parts have changed. Invalid – the sequence cannot be resolved.
  • 40. Left recursion (CIS  CIS CIS) Remove orphan rules We need to generate CFG compilers 40
  • 41. Design - Recap Design • Databases • CAD tools Analyze • Computational tools Build • Assembly of parts • DNA synthesis  Participate in SBOL, community effort for standard in Synthetic Biology  Context-Free Grammar to design DNA molecules  GenoCAD  Domain specific languages  Edit libraries of parts and grammars rules  CFG compiler generation 41
  • 42. Analyze Design • Databases • CAD tools Analyze • Computational tools Build • Assembly of parts • DNA synthesis 42
  • 43. 43
  • 44. 44
  • 47. Semantic DNA Compilation Genetic Design A Get Chemical Equations for A Attribute Grammar 47
  • 48. Semantic DNA Compilation Genetic Design B Get Chemical Equations for B Attribute Grammar 48
  • 49. Attribute Grammars (AG) • AG = a CFG plus: – Categories and Parts have attributes – Rules have semantic actions to compute attributes values While going through the parse tree, we now also evaluate the semantics (meaning) 49
  • 50. Example - Target output – Transcription: • dna  dna + mrna – Translation: • mrna  mrna + protein – Degradation mrna: • mrna  [] – Degradation protein: • protein  [] – Interaction promoter protein: • dna + repressor <-> dna_repressor_x 50
  • 51. Example – Parts Attributes • Promoter: transcription rate, repressor – Promoter(transcription_rate, repressor)  ptetr (50, tetr) – Promoter(transcription_rate, repressor)  placi (10, laci) • RBS: translation rate – RBS(translation_rate)  rbsA (25) – RBS(translation_rate)  rbsB (50) • CDS: degradation rates for the protein and the mRNA – CDS(protein_deg,mrna_deg)  laci(1,1) – CDS(protein_deg,mrna_deg)  tetr(1,1) • Terminator – Terminator  t1 51
  • 52. Example: Rules Semantic Actions • CAS  PROMOTER(transcription_rate, repressor), CIS, TERMINATOR – Transcription: dna  dna + mrna, [transcription_rate] – Interaction: if repressor in construct then dna + repressor <-> dna_repressor_X • CISTRON  RBS(translation_rate), CDS(protein_deg,mrna_deg) – Translation: mrna  mrna + protein, [translation_rate] – Degradation_mrna: mrna  ϕ, [mrna_deg] – Degradation_protein: protein  ϕ, [protein_deg] 52
  • 59. CAS PROMOTER: ptetr ptetr CIX CISTRON RBS Translation rate rbsA CDS mRNA deg rate Protein deg rate laci TERMINATOR TERMINATOR t1 TERMINATOR t2 Example: Toggle switch ptetr 50 tetr PROMOTER: ptetr 50 tetr • Translation: mrna  mrna + protein, [translation_rate] • Degradation_mrna: mrna  ø, [mrna_deg] • Degradation_protein: protein  ø, [protein_deg] 59
  • 60. CAS PROMOTER: ptetr ptetr CIX CISTRON RBS: rbsA 25 rbsA 25 CDS mRNA deg rate Protein deg rate laci TERMINATOR TERMINATOR t1 TERMINATOR t2 Example: Toggle switch ptetr 50 tetr PROMOTER: ptetr 50 tetr • Translation: mrna  mrna + protein, [translation_rate] • Degradation_mrna: mrna  ø, [mrna_deg] • Degradation_protein: protein  ø, [protein_deg] 60
  • 61. CAS PROMOTER: ptetr 50 tetr ptetr 50 tetr CIX CISTRON RBS: rbsA 25 rbsA 25 CDS: laci 1 1 laci 1 1 TERMINATOR TERMINATOR t1 TERMINATOR t2 Example: Toggle switch • Translation: mrna  mrna + protein, [translation_rate] • Degradation_mrna: mrna  ø, [mrna_deg] • Degradation_protein: protein  ø, [protein_deg] 61
  • 62. CAS PROMOTER: ptetr 50 tetr ptetr 50 tetr CIX CISTRON RBS: rbsA 25 rbsA 25 CDS: laci 1 1 laci 1 1 TERMINATOR TERMINATOR t1 TERMINATOR t2 Example: Toggle switch • Translation: mrna_rbsA_laci  mrna_rbsA_laci + protein_laci, [25] • Degradation_mrna: mrna_rbsA_laci  ø, [1] • Degradation_protein: protein_laci  ø, [1] 62
  • 63. CAS PROMOTER: ptetr 50 tetr ptetr 50 tetr CIX CISTRON RBS: rbsA 25 rbsA 25 CDS: laci 1 1 laci 1 1 TERMINATOR TERMINATOR t1 TERMINATOR t2 Example: Toggle switch • Translation: mrna_rbsA_laci  mrna_rbsA_laci + protein_laci, [25] • Degradation_mrna: mrna_rbsA_laci  ø, [1] • Degradation_protein: protein_laci  ø, [1] 63
  • 64. CAS PROMOTER: ptetr 50 tetr ptetr 50 tetr CIX CISTRON RBS: rbsA 25 rbsA 25 CDS: laci 1 1 laci 1 1 TERMINATOR TERMINATOR t1 TERMINATOR t2 Example: Toggle switch • Translation: mrna_rbsA_laci  mrna_rbsA_laci + protein_laci, [25] • Degradation_mrna: mrna_rbsA_laci  ø, [1] • Degradation_protein: protein_laci  ø, [1] 64
  • 65. CAS PROMOTER: ptetr 50 tetr ptetr 50 tetr CIX CISTRON RBS: rbsA 25 rbsA 25 CDS: laci 1 1 laci 1 1 TERMINATOR TERMINATOR t1 TERMINATOR t2 Example: Toggle switch • Translation: mrna_rbsA_laci  mrna_rbsA_laci + protein_laci, [25] • Degradation_mrna: mrna_rbsA_laci  ø, [1] • Degradation_protein: protein_laci  ø, [1] 65
  • 66. Example: Toggle switch • Transcription: dna  dna + mrna, [transcription_rate] • Interaction: if repressor in construct then dna + repressor  dna_repressor_X CAS PROMOTER: ptetr 50 tetr ptetr 50 tetr CIX CISTRON RBS: rbsA 25 rbsA 25 CDS: laci 1 1 laci 1 1 TERMINATOR TERMINATOR t1 TERMINATOR t2 • Translation: mrna_rbsA_laci  mrna_rbsA_laci + protein_laci, [25] • Degradation_mrna: mrna_rbsA_laci  ø, [1] • Degradation_protein: protein_laci  ø, [1] 66
  • 67. Example: Toggle switch CAS PROMOTER: ptetr 50 tetr ptetr 50 tetr CIX CISTRON RBS: rbsA 25 rbsA 25 CDS: laci 1 1 laci 1 1 TERMINATOR TERMINATOR t1 TERMINATOR t2 • Translation: mrna_rbsA_laci  mrna_rbsA_laci + protein_laci, [25] • Degradation_mrna: mrna_rbsA_laci  ø, [1] • Degradation_protein: protein_laci  ø, [1] Transcription: dna_ptetr_rbsA_laci  dna_ptetr_rbsA_laci + mrna_rbsA_laci , [50] Interaction: if tetr in construct dna_ptetr_rbsA_laci + protein_tetr  dna_ptetr_rbsA_laci _tetr_X 67
  • 69. Toggle switch laci/tetr Get Chemical Equations for A Example: Toggle switch Transcription: dna_ptetr_rbsA_laci  dna_ptetr_rbsA_laci + mrna_rbsA_laci , [50] Interaction: if tetr in construct dna_ptetr_rbsA_laci + protein_tetr  dna_ptetr_rbsA_laci _tetr_X Translation: mrna_rbsA_laci  mrna_rbsA_laci + protein_laci, [25] Degradation_mrna: mrna_rbsA_laci  φ, [1] Degradation_protein: protein_laci  φ, [1] Transcription: dna_placi_rbsB_tetr  dna_placi_rbsB_tetr + mrna_rbsB_tetr , [10] Interaction: if laci in construct dna_placi_rbsB_tetr + protein_tetr  dna_placi_rbsB_tetr_laci_X Translation: mrna_rbsB_tetr  mrna_rbsB_tetr + protein_tetr, [50] Degradation_mrna: mrna_rbsB_tetr  φ, [1] Degradation_protein: protein_tetr  φ, [1] Attribute Grammar CONS TRUC T CAS PROM OTER ptetr CIX CISTR ON RBS rbsA CDS laci TERM INAT ORTERM INAT OR t1 TERM INAT OR t2 [ CAS PROM OTER placi CIX CISTR ON RBS rbsB CDS tetr CISTR ON RBS rbsB CDS gfp TERM INAT OR t1 ] 69
  • 70. DESIGN OF LANGUAGES FOR SYSTEMS AND SYNTHETIC BIOLOGY Translate genetic designs into mathematical models 70
  • 71. Attribute Grammar for Systems and Synthetic Biology 71
  • 72. Use an API for the Grammar’s semantic actions SBML API (use java libSBML) • species(Name, InitConc) • parameter(Name, Value) • reaction(Name, Modifiers, Reactants, Products, Math) • event(Name, Event_assignments, Trigger_math) • Etc. TRANS API Use keywords to match declared Species, use naming convention – TYPE = “DNA”, “PROT”…. – KEY = “LAC”, “TET”,… 72
  • 73. Parts Attributes 73 Category PartsID Parameters PTE pTetR parameter('k_transcription_pTetR',10) PLA placI parameter('k_transcription_placI',25) PCI pcI parameter('k_transcription_pcI',50) TER B0010 RBS Strong_RBS parameter('k_translation_Strong_RBS’,25), parameter('k_degradation_Strong_RBS',1) CDS lacI parameter('k_degradation_LAC_lacI',0.1) tetR parameter('k_degradation_TET_tetR',0.1) cIts parameter('k_degradation_CI_cIts',0.1)
  • 74. Rules Semantic Actions 74 Reaction Rules Name Modifiers Reactants Products Math CAS --> PRO CIS TER Transcription_<CAS.Construct> DNA_<PRO.Construct> mRNA_<CIS.Construct> k_transcription_<PRO.Construct> CIS --> RBS CDS Translation_<CIS.Construct> mRNA_<CIS.Construct> PROT_<CDS.Construct> k_translation_<RBS.Construct> Degradation_mRNA_<CIS.Construct> mRNA_<CIS.Construct> -k_degradation_<RBS.Construct>* mRNA_<CIS.Construct> Degradation_PROT_<CDS.Construct> PROT_<CDS.Construct> -k_degradation_<CDS.Construct>* PROT_<CDS.Construct> Rules Species Reactions CAS --> PRO CIS TER species_amount(DNA_<PRO.Construct>,1) reaction(Transcription_<CAS.Construct>, [DNA_<PRO.Construct>], [], [mRNA_<CIS.Construct>], "k_transcription_<PRO.Construct>") CIS --> RBS CDS species(mRNA_<CIS.Construct>,0), species(PROT_<CDS.Construct>, init_<CDS.Construct>.getValue()) reaction(Translation_<CIS.Construct>, [mRNA_<CIS.Construct>], [], [PROT_<CDS.Construct>], "k_translation_<RBS.Construct>"), reaction(Degradation_mRNA_<CIS.Construct>, [], [mRNA_<CIS.Construct>], [], ”- k_degradation_<RBS.Construct>* mRNA_<CIS.Construct>"), reaction(Degradation_PROT_<CDS.Construct>, [], [PROT_<CDS.Construct>], [], "- k_degradation_<CDS.Construct>* PROT_<CDS.Construct>")
  • 75. Rules Semantic Actions - TRANS 75 PRO  PLA: semantic action for the POSSIBLE interaction (trans) reaction Name interaction_LAC_<PLA.Construct> Modifiers Reactants TRANSspecies{PROT-LAC} Products TRANSspecies_and_declare{[PROT-LAC],PROT- LAC_DNA_<PLA.Construct>_x,0} Math TRANS{[PROT-LAC],+ k_binding_PROT- LAC_<PLA.Construct> - k_release_PROT- LAC_<PLA.Construct> * PROT- LAC_DNA_<PLA.Construct>_x,0}
  • 76. GenoCAD supports AG Design to Simulation 76
  • 77. Add it to GenoCAD 77
  • 80. Synthetic Biology designs Toggle switch (Garner, 2000) Oscillator (Elowitz, 2000) Same libraries of parts but different layouts. Used SBML API to design an attribute grammar using Wilson-Cowan rate laws 80
  • 81. The genetic toggle switch 81
  • 82. The genetic toggle switch 82
  • 84. A G2P language for Natural Genome • Scale-up to handle natural genome – Illustrate Attribute Grammar as a formalism to map Genotype to Phenotype • Systems Biology – Regulation of the cell cycle of the budding yeast 84
  • 85. Cell Cycle AG Syntax Genome --> InitialCondition Model ( ChrI ) ( ChrII ) ( ChrIII ) ( ChrIV ) ( ChrV ) ( ChrVI ) ( ChrVII ) ( ChrVIII ) ( ChrIX ) ( ChrX ) ( ChrXI ) ( ChrXII ) ( ChrXIII ) ( ChrXIV ) ( ChrXV ) ( ChrXVI ) ChrI --> ChrI_L [Cln3 ] ChrI_M1 [Lte1 ] ChrI_M2 [Cdc15 ] ChrI_R ChrIV --> ChrIV_L [Pds1 ] ChrIV_M [Swi5 ] ChrIV_R ChrV --> ChrV_L [SBF] CHrV_M Bck2 ChrV_R ChrVI --> ChrVI_L [Cdc14 ] ChrVI_R ChrVII --> ChrVII_L Cdc20 ChrVII_M1 [Cdh1 ] ChrVII_M2 [Esp1 ] ChrVII_R ChrX --> ChrX_L Cdc6 ChrX_M1 Net1 ChrX_M2 Mad2 ChrX_R ChrXI --> ChrXI_L [APC] ChrXI_R ChrXII --> ChrXII_L Sic1 ChrXII_R ChrXIII --> ChrXIII_L [Tem1 ] ChrXIII_M1 Mcm1 ChrXIII_M2 [Bub2 ] ChrXIII_R ChrXVI --> ChrXVI_L [Cln2 ] ChrXVI_M1 Clb2 ChrXVI_M2 [Clb5 ] ChrXVI_R 85Chen, K. C., Calzone, L., Csikasz-Nagy, A., Cross, F. R., Novak, B., & Tyson, J. J. (2004). Integrative analysis of cell cycle control in budding yeast. Molecular biology of the cell, 15(8), 3841-62. doi:10.1091/mbc.E03-11-0794
  • 86. Cell Cycle AG semantics 86
  • 87. Cell Cycle Wild-Type Design and Simulation 87
  • 89. Discrete Boolean Network • Design variety of language! • Qualitative • API GinML: edges and nodes Thieffry, D., & Thomas, R. (1995). Dynamical behaviour of biological regulatory networks—II. Immunity control in bacteriophage lambda. Bulletin of Mathematical Biology. Retrieved from http://link.springer.com/article/10.1007/BF02460619 89
  • 90. Analyze - Recap Design • Databases • CAD tools Analyze • Computational tools Build • Assembly of parts • DNA synthesis  Attribute Grammar to map genotype and phenotype  Designing G2P languages, use SBML or GinML API  Model natural genome, the cell cycle example  Implementation in GenoCAD: design to simulation  Generation of AG compilers from database 90
  • 91. Build Design • Databases • CAD tools Analyze • Computational tools Build • Assembly of parts • DNA synthesis 91
  • 93. The Risk of Deliberate Misuse 93
  • 94. A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C G A T C 94
  • 95. Sections: Customer screening Sequence screening Record retention Government contact “[…] to minimize the risk that unauthorized individuals or individuals with malicious intent will obtain “toxins and agents of concern” through the use of nucleic acid synthesis technologies, and to simultaneously minimize any negative impacts on the conduct of research and business operations.” 95
  • 96. 96
  • 97. 97
  • 98. 98
  • 99. 99
  • 100. iGEM--International Genetically Engineered Machine • Summer project for teams of undergrads in Synthetic Biology – Projects range from a rainbow of pigmented bacteria, to banana smelling bacteria, an arsenic biosensor, etc. – 165 teams in 2011 • Judge – iGEM 2012 Americas East Jamboree (Information Processing & Fundamental Advances & Software track) in Pittsburgh, PA – aGEM 2012 in Edmonton, Canada – iGEM 2011 World Championship (Software track) in MIT Cambridge, MA – iGEM 2011 Americas Jamboree (Information Processing & Software track) in Indianapolis, IN – iGEM 2010 World Championship (Poster) in MIT Cambridge, MA • Advisor of teams – Virginia Tech iGEM 2011 team – VT-ENSIMAG Biosecurity software team for iGEM 2010 100
  • 101. Biological weapons nonproliferation • 1 year postdoc fellowship • Center for Nonproliferation Studies at the Monterey Institute for International Studies 101
  • 102. Build - Recap Design • Databases • CAD tools Analyze • Computational tools Build • Assembly of parts • DNA synthesis  Screening a DNA sequence for Select Agent and Toxins  Relations our science and policy  Engaging the students (iGEM) 102
  • 104. Conclusions  Design: GenoCAD – Rule-based genetic design tools user- friendly and Domain Specific Language  Galdzicki, M., Wilson, M. L., Rodriguez, C. A., Pocock, M. R., Oberortner, E., Adam, L., … Sauro, H. M. (2012). Synthetic Biology Open Language (SBOL) Version 1.1.0, 1–26. (and NBT paper in submission)  Wilson, M. L., Hertzberg, R., Adam, L., & Peccoud, J. (2011). A step-by- step introduction to rule-based design of synthetic genetic constructs using GenoCAD. (C. Voigt, Ed.)Methods in enzymology, 498, 173–88. doi:10.1016/B978-0-12-385120-8.00008-5  Mandy L Wilson, Sakiko Okumoto, Laura Adam and Jean Peccoud. Development of a domain-specific genetic language to design Chlamydomonas reinhardtii expression vectors.(Manuscript in preparation) – Talk: Adam, L. & Peccoud, J. Formal grammars to protect intellectual properties in synthetic biology. International Conference on Synthetic Biology at Evry, France, December 15-16, 2010. – GenoCAD tutorials 104
  • 105. Conclusions  Design: GenoCAD – Rule-based genetic design tools user- friendly and Domain Specific Language  Analyze: Semantic models of DNA sequences  Cai, Y., Lux, M. W., Adam, L., & Peccoud, J. (2009). Modeling structure- function relationships in synthetic DNA sequences using attribute grammars. PLoS computational biology, 5(10)  Laura Adam, Matthew W. Lux, Mandy L. Wilson, Tian Hong, Jean Peccoud. Design of Languages for Systems and Synthetic Biology to translate genetic designs into mathematical models. (Manuscript in preparation) – Poster: Adam, L. & Peccoud, J. (2011). Using user defined semantic languages in synthetic biology: generating DNA compilers. Third International Workshop on BioDesign Automation (IWBDA) at 48th ACM/EDAC/IEEE Design Automation Conference (DAC) in San Diego, CA. – Talk:. Formal languages to map Genotype to Phenotype in Natural Genomes. GBCB seminar, 2012. 105
  • 106. Conclusions  Design: GenoCAD – Rule-based genetic design tools user- friendly and Domain Specific Language  Analyze: Semantic models of DNA sequences  Build: Biosecurity issues and DNA synthesis  Adam, L., et al. (2011). Strengths and limitations of the federal guidance on synthetic DNA. Nature Biotechnology, 29(3), 208–210. doi:10.1038/nbt.1802 – Talk: GenoTHREAT: A biosecurity software to screen DNA synthesis orders against Pathogens. GBCB seminar, 2011. – Adam, L.(2011). Scientists need to be proactive to foment international biosecurity. Runner up essay for the “Young Scientists” essay contest organized by the Implementation and Support Unit of the Biological Weapon Convention at the United Nations 106
  • 107. Conclusions  Design: GenoCAD – Rule-based genetic design tools user- friendly and Domain Specific Language  Analyze: Semantic models of DNA sequences  Build: Biosecurity issues and DNA synthesis  Define your Domain Specific G2P languages, Design mutants and Analyze them in GenoCAD in minutes! 107
  • 108. Acknowledgements VBI SynBio Group • J. Peccoud (P.I.) • N. Adames • D. Ball • M. Lux • C. Overend • M. Wilson • and Patrick (Yizhi) Cai • and R. Hertzberg PhD committee:  Dr. Bevan  Dr. Garner  Dr. Kepes  Dr. Peccoud  Dr. Ramakrishnan  Dr. Tyson • And Dennie Munson! 108 Collaborators • SBOL: H. Sauro, C.Myers, D. Densmore, C. Rodriguez, M. Galdzicki and many more • Language: Eric Van Wyck • GenoGUARD: Ed You (FBI)
  • 112. What if we could… • Compile circular DNA • Read the different messages on both strands • Lexical analysis of natural sequences • Customize (GenoCAD) and standardize? (SBOL) • Handle ambiguity 112 Natural language Natural genome Formal language Synthetic biology
  • 113. Syntactic Limitation 113 The Chomsky hierachy Searls, D.B. “Linguistic approaches to biological sequences.” Bioinformatics 13, no. 4 (1997): 333. http://bioinformatics.oxfordjournals.org/cgi/content/abstract/13/4/333.
  • 114. Parsing 114 Left to Right Top-Down Parse The Parse Tree of the Sentence "The boy went home“ Right to Left Top-Down Parse Left to Right Bottom-Up Parse Right to Left Bottom-Up Parse
  • 115. Use of attribute grammar in synthetic biology 115 Formal definition Semantic In the synthetic biology context V, a finite set of non- terminals Attributes Parts categories Σ, a finite set of terminals Attributes values Genetic Parts R, a finite relation from V to (VUΣ)* Semantic actions Design Rules S∈V, the start symbol Hard-coded declarations Start
  • 117. 117
  • 118. Insights from Genotype-to-Phenotype (G2P) mapping G2P map The Phenotype is X Genotype • genetic makeup of a cell, an organism, or an individual • specific alleles • inherited Phenotype • observable characteristics or traits 118
  • 119. Traditional G2P mapping is linear • Sui Huang, Rational drug discovery: what can we learn from regulatory networks?, Drug Discovery Today, Volume 7, Issue 20, 15 October 2002 • Peccoud, J., Velden, K. V., Podlich, D., Winkler, C., Arthur, L., & Cooper, M. (2004). The selective values of alleles in a molecular network model are context dependent. Genetics, 166(4), 1715–25. Phenotypes Central dogma 119
  • 120. Current Formalisms Databases: genetic mapping, genome annotation, genotype, mutant, transcriptome, proteome and metabolome data. Ontologies: Controlled vocabulary for annotation of genes and their products (cellular component, molecular function, biological process) Actually, G2P maps are nonlinear: Gene Networks • Priest, N. K., Rudkin, J. K., Feil, E. J., van den Elsen, J. M. H., Cheung, A., Peacock, S. J., Laabei, M., et al. (2012). From genotype to phenotype: can systems biology be used to predict Staphylococcus aureus virulence? Nature reviews. Microbiology, 10(11), 791–7. doi:10.1038/nrmicro2880 • Benfey, P. N., & Mitchell-Olds, T. (2008). From genotype to phenotype: systems biology meets natural variation. Science. “replacing the linear pathways with interconnected networks.” 120