SlideShare ist ein Scribd-Unternehmen logo
1 von 58
Bioinformatics:
What, Why and
Where?
Mohamed El-Hadidi
Assistant Professor of Bioinformatics
Biomedical Informatics Program Director
School of Information Technology and Computer Science
Nile University
Where DNA is Located in our Body?
6/3/2020 Bioinformatics: What, Why and Where? 2
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
How many cells
in the Human
Body?
10 Trillion Cells!
6/3/2020 Bioinformatics: What, Why and Where? 3
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
How many
chromosomes in one
cell?
46
Chromosomes!
6/3/2020 4Bioinformatics: What, Why and Where?
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
What is the length of
all chromosomes in
one cell?
2 m in one cell!
1500 times from Earth to
moon (all cells)
6/3/2020 5Bioinformatics: What, Why and Where?
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
What are in
these files?
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
6/3/2020 6Bioinformatics: What, Why and Where?
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
What are in
these files?
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
6/3/2020 7Bioinformatics: What, Why and Where?
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
How many
nucleotides in the
Human body?
3 Billion
Nucleotides!
6/3/2020 8Bioinformatics: What, Why and Where?
From Human Body to DNA Sequences
DNA Sequencers
Sequence Files
What is the size
of data?
150 GB/person
6/3/2020 9Bioinformatics: What, Why and Where?
How These Files were Generated?
6/3/2020 Bioinformatics: What, Why and Where? 10
How These Files were Generated?
6/3/2020 Bioinformatics: What, Why and Where? 11
Bioinformatics Data is
Increasing Rapidly!
• Speed of sequencing?
 10,000 bp/day/machine ->
billions bp/day/machine.
• Computing cost and time?
 Sequencing cost is falling 5X
faster than computing
• Price / genome?
 Dropped to $1000!
• Storage cost?
 150 GB/genome
Bioinformatics: What, Why and Where? 12
How These Files were Generated?
6/3/2020 13
How These Files were Generated?
Bioinformatics: What, Why and Where?
6/3/2020 Bioinformatics: What, Why and Where? 14
What to Do with These Files?
Making sense of this BIG DATA!
How to Make Sense of This BIG DATA?
Through Bioinformatics!
What is Bioinformatics??!
6/3/2020 Bioinformatics: What, Why and Where? 15
What Do You Need to Learn Bioinformatics?
6/3/2020 Bioinformatics: What, Why and Where? 16
Statistics
Computer
Science
Biology
Bioinformatics
Data
Science
Biostatistics Computational
Biology
What is
Bioinformatics?
https://qph.fs.quoracdn.net/main-qimg-73f348d1d5ee87af6955de6c53a444cf
6/3/2020 Bioinformatics: What, Why and Where? 17
What is
Bioinformatics?
https://bioinformaticsonline.com/mod/file/thumbnail.php?file_guid=4482&size=large&icontime=1379016276
6/3/2020 Bioinformatics: What, Why and Where? 18
What is Bioinformatics?
6/3/2020 Bioinformatics: What, Why and Where? 19
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
What is Bioinformatics?
6/3/2020 Bioinformatics: What, Why and Where? 20
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
Use Existing tools to build
analysis workflows
• Linux
• Command Line
• Scripting
Develop your own tools
• Programming
• Algorithm Design
• Machine Learning
What is Bioinformatics?
6/3/2020 Bioinformatics: What, Why and Where? 21
GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC
AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT
CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT
AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC
TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC
TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA
TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT
AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT
ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA
GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT
TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT
AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA
ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT
TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT
TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT
CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG
AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC
TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA
ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA
CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA
GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC
AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC
AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC
AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA
GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA
ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT
AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT
ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG
TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG
AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC
AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT
GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT
TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG
AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC
TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
Use Existing tools to build
analysis workflows
Develop your own tools
• Linux
• Command Line
• Scripting
• Programming
• Algorithm Design
• Machine Learning
A = 1765 G = 3561
C = 2677 T = 1121
What is Bioinformatics?
6/3/2020 Bioinformatics: What, Why and Where? 22
Use Existing tools to build
analysis workflows
Develop your own tools
• Linux
• Command Line
• Scripting
• Programming
• Algorithm Design
• Machine Learning
Biologist (Biology Background)
Use existing bioinformatics tools
Computer Scientist (CS Background)
Develops bioinformatics tools
Basic User
Windows OS
Web-based Tools
GUI Standalone tools
No Programming skills
Advanced User
Linux OS
Command line Standalone
tools
Basic Programming Skills
Developer
Basic Biology Knowledge
Advanced Programming Skills
Advanced Mathematics
Advanced Statistics
Who Can Be a Bioinformatician?
6/3/2020 Bioinformatics: What, Why and Where? 23
How can I Learn Bioinformatics?
Tons of free courses are available online!
More than 26 million
results when searching
without comma!
6/3/2020 Bioinformatics: What, Why and Where? 24
How can I Learn Bioinformatics?
Tons of free courses are available online!
More than 46 million
results when searching
without comma!
6/3/2020 Bioinformatics: What, Why and Where? 25
Examples of Free Online Bioinformatics MOOCs
Websites
6/3/2020 Bioinformatics: What, Why and Where? 26
6/3/2020 Bioinformatics: What, Why and Where? 27
Milestones of
Bioinformatics
28
• OMICS Sciences
• Programming and Data
Structure
•Algorithm Design
• LINUX
• Statistics
•Basic Mathematics
• AI and Data Science
•Data Visualization
• Results Interpretation
Milestones of
Bioinformatics
29
Shall I learn Everything?
Next Step?
30
Read Papers and Reproduce
Results!
Compare
Modify
Explain
Seek Internships Options!
Real Life Problems!
Advice…
31
Perceive Biology as CS and
Perceive CS as Biology!
The Link!
No Need for a Supercomputer!
6/3/2020 Bioinformatics: What, Why and Where? 32
Where to Find a Job (Egypt and Abroad)?
6/3/2020 Bioinformatics: What, Why and Where? 33
Research
Academia
Companies
Startup
Freelancer
Salaries
6/3/2020 Bioinformatics: What, Why and Where? 34
Salaries
6/3/2020 Bioinformatics: What, Why and Where? 35
Salaries
6/3/2020 Bioinformatics: What, Why and Where? 36
Salaries
6/3/2020 Bioinformatics: What, Why and Where? 37
6/3/2020 Bioinformatics: What, Why and Where? 38
Institute/Company Department Sequencer
American University in Cairo (AUC) Biology Ion S5
American University in Cairo (AUC)
Global Health and Human
Ecology MiSeq
National Research Center (NRC) Genetics MiSeq
Zewail City of Science and Technology Center for Genomics
MiSeq and
NextSeq 500
Kasr Alainy School of Medicine Clinical Oncology 3 MiSeq
CCHE 57357 Genomics program
MiSeq and
NextSeq 500
Ahram Canadian University Central Research Lab
Agilent
Bioanalyzer 2100
National Research Center (NRC) Genetics Ion torrent
National Research Center (NRC) Environmental department Ion torrent PGM
MASRI ain shams University Center
Ion S5 and Ion
shef
Air forces specialised hospital Labs Miseq
Maadi military hospital Labs Ion S5
Mansoura University Stem cells center Ion torrent
National Cancer Institute (NCI) Molecular biology Ion S5
Abo Alraish Hospital Microbiology Labs MiSeq
Alexandria Regional Center for Women's Health
and Development Ion S5
Tanta University - Faculty of Medicine - Center of
Exellence Genomic Signature Center MiSeq
Magdi Yacoub Foundation
MiSeq and
NextSeq
Generations Genetics Labs MiSeq
Sequencers in Egypt
(Sample)
Source: Prof. Ahmed Moustafa, AUC.
What Bioinformatics Can Do for Life Sciences?
6/3/2020 Bioinformatics: What, Why and Where? 39
Genome Assembly
6/3/2020 Bioinformatics: What, Why and Where? 40
Gene Prediction
• Gene structure
• Open Reading Frames (ORFs).
• Start and stop of the gene
• Locations of exons and introns
• Splice variants
• Gene prediction is one of the first and
most important steps in understanding
any genome after being sequenced.
6/3/2020 Bioinformatics: What, Why and Where? 41
Sequence Comparison
• Compare unknown gene or protein
sequences against known sequences to
identify their origin or function.
• Finding Signatures that can be used in
diagnostics
6/3/2020 Bioinformatics: What, Why and Where? 42
Phylogenetic Analyses
• Evolutionary relationship among a
group of related molecules or
organisms
• Track gene flow based on sequence
similarity
6/3/2020 Bioinformatics: What, Why and Where? 43
Understand the Functions of Genes (Pathway
Analysis)
6/3/2020 Bioinformatics: What, Why and Where? 44
Predicting Protein Structure and Function
• Protein’s 3D structure Prediction
• Understand how biomolecules
interact with other molecules
• Predict functions based on
interactions
6/3/2020 Bioinformatics: What, Why and Where? 45
Drug Design
• It is faster to analyze molecules on
computer as compared to
experimental approaches.
• Helps in identifying drug
targets easily
• Simulating drug effects on computers
6/3/2020 Bioinformatics: What, Why and Where? 46
Applications of
Bioinformatics
6/3/2020 Bioinformatics: What, Why and Where? 47
Applications of
Bioinformatics in Medicine
• The Human Genome Project (HGP) helps scientists to
search for genes directly associated with diseases and
understand the molecular basis of those identified
diseases.
• This new Information will help in better understanding
of the mechanisms of diseases and hence develop
better treatment and preventive methods.
6/3/2020 Bioinformatics: What, Why and Where? 48
Applications of
Bioinformatics in Pharmacy
• Identification and validating new drugs through
Computer Aided Drug Design (CADD).
• Helps to develop specific drugs with less side
effect
6/3/2020 Bioinformatics: What, Why and Where? 49
Applications of Bioinformatics
in Food Security
• Large amount of genomics data is available from plants and
animals
• Bioinformatic analysis of plant and animal genomes will
help scientists to improve crops
• Resistant to drought
• Resistant to insects and pests
• More nutritional value
• Animals with higher meat quality and productivity
6/3/2020 Bioinformatics: What, Why and Where? 50
Applications of Bioinformatics
in the Environment
• Sequencing and analysis of microbial genomes and search
for genes expressing enzymes for
• Bioremediation and biodegradation
• Climate change studies (Microbes that use CO2 as their
sole source of enegy)
• Alternative energy sources (energy from light)
• Microbes with industrial benefits
• Generation of Biogas
6/3/2020 Bioinformatics: What, Why and Where? 51
Bioinformatics Tools…
https://www.omicsonline.org/articles-images/data-mining-genomics-Application-bioinformatics-tools-5-158-g001.png
6/3/2020 Bioinformatics: What, Why and Where? 52
Take Home Messages
• Understand the biological background first (in details)!
• For writing a software
• For using a software
• Which tool/software to use?
• Understand the algorithm behind each software/tool
• Test different parameters
• Select the best tool
• Free software are everywhere
• Read about benchmarking studies first
• Before Writing your own software
• Check if it is exist (don’t work from scratch)
• Modify existing tools
6/3/2020 Bioinformatics: What, Why and Where? 53
Biologists and Computer Scientitst Should
Communicate!
6/3/2020 Bioinformatics: What, Why and Where? 54
6/3/2020 Bioinformatics: What, Why and Where? 55
6/3/2020 Bioinformatics: What, Why and Where? 56
6/3/2020 Bioinformatics: What, Why and Where? 57
Thank You for Your Attention!
Open Discussion…
melhadidi@nu.edu.eg
hadidi.bioinfo@gmail.com
6/3/2020 Bioinformatics: What, Why and Where? 58

Weitere ähnliche Inhalte

Was ist angesagt?

Scoring schemes in bioinformatics
Scoring schemes in bioinformaticsScoring schemes in bioinformatics
Scoring schemes in bioinformaticsSumatiHajela
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database bhargvi sharma
 
New insights into the human genome by ENCODE project
New insights into the human genome by ENCODE project New insights into the human genome by ENCODE project
New insights into the human genome by ENCODE project Senthil Natesan
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchAnshika Bansal
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission ToolsRishikaMaji
 
Embryonic stem cells
 Embryonic  stem cells Embryonic  stem cells
Embryonic stem cellssunitafeme
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput SequencingMark Pallen
 
Techniques used for separation in proteomics
Techniques used for separation in proteomicsTechniques used for separation in proteomics
Techniques used for separation in proteomicsNilesh Chandra
 
Introduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbjIntroduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbjKAUSHAL SAHU
 
Co and post translationational modification of proteins
Co and post translationational modification of proteinsCo and post translationational modification of proteins
Co and post translationational modification of proteinsSukirti Vedula
 
Protein databases
Protein databasesProtein databases
Protein databasessarumalay
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSsandeshGM
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsAmna Jalil
 

Was ist angesagt? (20)

PROTEIN DATABASE
PROTEIN DATABASEPROTEIN DATABASE
PROTEIN DATABASE
 
Scoring schemes in bioinformatics
Scoring schemes in bioinformaticsScoring schemes in bioinformatics
Scoring schemes in bioinformatics
 
Nucleic acid database
Nucleic acid database Nucleic acid database
Nucleic acid database
 
EMBL
EMBLEMBL
EMBL
 
New insights into the human genome by ENCODE project
New insights into the human genome by ENCODE project New insights into the human genome by ENCODE project
New insights into the human genome by ENCODE project
 
Role of bioinformatics in life sciences research
Role of bioinformatics in life sciences researchRole of bioinformatics in life sciences research
Role of bioinformatics in life sciences research
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission Tools
 
Embryonic stem cells
 Embryonic  stem cells Embryonic  stem cells
Embryonic stem cells
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput Sequencing
 
Techniques used for separation in proteomics
Techniques used for separation in proteomicsTechniques used for separation in proteomics
Techniques used for separation in proteomics
 
Microsatellite
MicrosatelliteMicrosatellite
Microsatellite
 
Introduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbjIntroduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbj
 
Co and post translationational modification of proteins
Co and post translationational modification of proteinsCo and post translationational modification of proteins
Co and post translationational modification of proteins
 
Swiss prot
Swiss protSwiss prot
Swiss prot
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
TrEMBL
TrEMBLTrEMBL
TrEMBL
 
Protein databases
Protein databasesProtein databases
Protein databases
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
Ion Torrent Sequencing
Ion Torrent SequencingIon Torrent Sequencing
Ion Torrent Sequencing
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 

Ähnlich wie Bioinformatics: What, Why and Where?

Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.Elena SĂźgis
 
Sequencing Genomics: The New Big Data Driver
Sequencing Genomics:The New Big Data DriverSequencing Genomics:The New Big Data Driver
Sequencing Genomics: The New Big Data DriverLarry Smarr
 
Next generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesNext generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesGuy Coates
 
01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptxHussainTaqi1
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talkc.titus.brown
 
Life sciences big data use cases
Life sciences big data use casesLife sciences big data use cases
Life sciences big data use casesGuy Coates
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p collegeSKUASTKashmir
 
2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbioc.titus.brown
 
2012 hpcuserforum talk
2012 hpcuserforum talk2012 hpcuserforum talk
2012 hpcuserforum talkc.titus.brown
 
Centralized Model Organism Database (Biocuration 2014 poster)
Centralized Model Organism Database (Biocuration 2014 poster)Centralized Model Organism Database (Biocuration 2014 poster)
Centralized Model Organism Database (Biocuration 2014 poster)Andrew Su
 
Bauhina Genome slides for school visit
Bauhina Genome slides for school visitBauhina Genome slides for school visit
Bauhina Genome slides for school visitScott Edmunds
 
Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsProf. Wim Van Criekinge
 
Emerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsEmerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsmikaelhuss
 
Bioinformatics relevance with biotechnology
Bioinformatics relevance with biotechnologyBioinformatics relevance with biotechnology
Bioinformatics relevance with biotechnologyKAUSHAL SAHU
 
B.sc biochem i bobi u 2 database
B.sc biochem i bobi u 2 databaseB.sc biochem i bobi u 2 database
B.sc biochem i bobi u 2 databaseRai University
 
Deep learning for biomedicine
Deep learning for biomedicineDeep learning for biomedicine
Deep learning for biomedicineDeakin University
 

Ähnlich wie Bioinformatics: What, Why and Where? (20)

Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
 
Sequencing Genomics: The New Big Data Driver
Sequencing Genomics:The New Big Data DriverSequencing Genomics:The New Big Data Driver
Sequencing Genomics: The New Big Data Driver
 
Next generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesNext generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciences
 
01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx01. Introduction to Bioinformatics.pptx
01. Introduction to Bioinformatics.pptx
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
 
2015 03 13_puurs_v_public
2015 03 13_puurs_v_public2015 03 13_puurs_v_public
2015 03 13_puurs_v_public
 
Life sciences big data use cases
Life sciences big data use casesLife sciences big data use cases
Life sciences big data use cases
 
Bioinformatics issues and challanges presentation at s p college
Bioinformatics  issues and challanges  presentation at s p collegeBioinformatics  issues and challanges  presentation at s p college
Bioinformatics issues and challanges presentation at s p college
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbio
 
2012 hpcuserforum talk
2012 hpcuserforum talk2012 hpcuserforum talk
2012 hpcuserforum talk
 
Centralized Model Organism Database (Biocuration 2014 poster)
Centralized Model Organism Database (Biocuration 2014 poster)Centralized Model Organism Database (Biocuration 2014 poster)
Centralized Model Organism Database (Biocuration 2014 poster)
 
Bauhina Genome slides for school visit
Bauhina Genome slides for school visitBauhina Genome slides for school visit
Bauhina Genome slides for school visit
 
Bioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformaticsBioinformatica 29-09-2011-t1-bioinformatics
Bioinformatica 29-09-2011-t1-bioinformatics
 
Emerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomicsEmerging challenges in data-intensive genomics
Emerging challenges in data-intensive genomics
 
Big data nebraska
Big data nebraskaBig data nebraska
Big data nebraska
 
Bioinformatics relevance with biotechnology
Bioinformatics relevance with biotechnologyBioinformatics relevance with biotechnology
Bioinformatics relevance with biotechnology
 
B.sc biochem i bobi u 2 database
B.sc biochem i bobi u 2 databaseB.sc biochem i bobi u 2 database
B.sc biochem i bobi u 2 database
 
Deep learning for biomedicine
Deep learning for biomedicineDeep learning for biomedicine
Deep learning for biomedicine
 
Big data nebraska
Big data nebraskaBig data nebraska
Big data nebraska
 

KĂźrzlich hochgeladen

Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...navyadasi1992
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -INandakishor Bhaurao Deshmukh
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxnoordubaliya2003
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)riyaescorts54
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 

KĂźrzlich hochgeladen (20)

Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptx
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 

Bioinformatics: What, Why and Where?

  • 1. Bioinformatics: What, Why and Where? Mohamed El-Hadidi Assistant Professor of Bioinformatics Biomedical Informatics Program Director School of Information Technology and Computer Science Nile University
  • 2. Where DNA is Located in our Body? 6/3/2020 Bioinformatics: What, Why and Where? 2
  • 3. From Human Body to DNA Sequences DNA Sequencers Sequence Files How many cells in the Human Body? 10 Trillion Cells! 6/3/2020 Bioinformatics: What, Why and Where? 3
  • 4. From Human Body to DNA Sequences DNA Sequencers Sequence Files How many chromosomes in one cell? 46 Chromosomes! 6/3/2020 4Bioinformatics: What, Why and Where?
  • 5. From Human Body to DNA Sequences DNA Sequencers Sequence Files What is the length of all chromosomes in one cell? 2 m in one cell! 1500 times from Earth to moon (all cells) 6/3/2020 5Bioinformatics: What, Why and Where?
  • 6. From Human Body to DNA Sequences DNA Sequencers Sequence Files What are in these files? GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA 6/3/2020 6Bioinformatics: What, Why and Where?
  • 7. From Human Body to DNA Sequences DNA Sequencers Sequence Files What are in these files? GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA 6/3/2020 7Bioinformatics: What, Why and Where?
  • 8. From Human Body to DNA Sequences DNA Sequencers Sequence Files GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA How many nucleotides in the Human body? 3 Billion Nucleotides! 6/3/2020 8Bioinformatics: What, Why and Where?
  • 9. From Human Body to DNA Sequences DNA Sequencers Sequence Files What is the size of data? 150 GB/person 6/3/2020 9Bioinformatics: What, Why and Where?
  • 10. How These Files were Generated? 6/3/2020 Bioinformatics: What, Why and Where? 10
  • 11. How These Files were Generated? 6/3/2020 Bioinformatics: What, Why and Where? 11
  • 12. Bioinformatics Data is Increasing Rapidly! • Speed of sequencing?  10,000 bp/day/machine -> billions bp/day/machine. • Computing cost and time?  Sequencing cost is falling 5X faster than computing • Price / genome?  Dropped to $1000! • Storage cost?  150 GB/genome Bioinformatics: What, Why and Where? 12 How These Files were Generated?
  • 13. 6/3/2020 13 How These Files were Generated? Bioinformatics: What, Why and Where?
  • 14. 6/3/2020 Bioinformatics: What, Why and Where? 14 What to Do with These Files? Making sense of this BIG DATA!
  • 15. How to Make Sense of This BIG DATA? Through Bioinformatics! What is Bioinformatics??! 6/3/2020 Bioinformatics: What, Why and Where? 15
  • 16. What Do You Need to Learn Bioinformatics? 6/3/2020 Bioinformatics: What, Why and Where? 16 Statistics Computer Science Biology Bioinformatics Data Science Biostatistics Computational Biology
  • 19. What is Bioinformatics? 6/3/2020 Bioinformatics: What, Why and Where? 19 GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA
  • 20. What is Bioinformatics? 6/3/2020 Bioinformatics: What, Why and Where? 20 GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA Use Existing tools to build analysis workflows • Linux • Command Line • Scripting Develop your own tools • Programming • Algorithm Design • Machine Learning
  • 21. What is Bioinformatics? 6/3/2020 Bioinformatics: What, Why and Where? 21 GAATTTGGGCAAGAATCCAGGCATTGGAACTTATTCAAATAACTAGTTTGCCTGTAATTTTCACTTTTTC AGAGTCATCTGATAAAGCTTTCTTGCTACACATTTAGATAGATACACTCAATCCAGTTGTCTAGAAAGTT CCCTGAGCCAGCTGGGAGCAGGAGGGGTAGTTGGGGCCAGGAATATTGGGGGTGTGTTTACTGAGCCCCT AGAAAGTAAGTGCTAGATTTGACATTTCAATCCCTGAAGGCCCTGAAGTTCAGTATCAAATGACTGGTCC TGTGGACTGAGCATCTGTGAATTGCATATGCTTAGAGTAAATTTTACTCCTACCAGTTTCAGCAGCTTGC TTTAGCAAGCAGTATGGAAACACTAACATGGGGGAGTAGAATTTCTCTCTCTGATCCAAGTTTTATCTCA TTCTGGTGGGTTTTCAAGGAGAGACTCGGAGTCCAAGTGTCCTTTCTGAATATATCTGGAACTTCTCATT AACAAAAGACTCAAGTTATAATTTAGGGGACAAGGCACCCAATGAGAATGCCTTGCAGGCAGCCCTAAGT ACACCTGCAATTACACCATTACTAGCGCGGCAGCACACATGGCCCTGACTTAGTTTAAATAATTACGTAA GTCAACCATGATTGTTTGCCCTTTGCATAGAAGGGCAAGTATTGGTACCTGTTACAACTTAGGCTTTTTT TTCTTTATGTTTGAGCCATGATGAGTGATTTACACTGTTGCATCCATATGTTGAGATGTAAGAATAAATT AGACTTGGTAATTGCCCTTAAGTGTCTGGAAGTCAACTGGGGAAAGAGAGCTAGAGATAATAAGTGTGAA ACAATGTCACAGAATCAATGACGGAACTCTTCCCAGGACAAAGGATGACTTTTGAGTTCAGTCTTTGCCT TTAATTCTACATGGGGAGGAGAGCACGTTTAGCCACAAATGGAAGGGATTACTCATTTGAGCTATTTGGT TATATGATTATTTCCCCAGAGAATAGGATGTGCAGGGCATTACACAAGCAGTGCCAATAGCAGCAAAGTT CTTGAGAGTGCTAGTAATTCAAATGGCAGGAAGAGAAGGAATAAATGGTAAGGCTACCTACAGTTCACAG AGAGCTCCATCCTCACTGTGGCTTTGGATTTTGTCCTGTGTGAAAGAGAAGTGACTGTGAACTGACATGC TGTGTTTGGTGTTTTAGAAAGATGGCTGCAGCAGCGGTTTGGGGAATGGACTGCAGGAGTGGCATTGGAA ACAGGAAGGTTCATGACTATTGCCAGAGACAGAGGATGAAGCAGGAGCAAGGAAGATTCAGGACAGGGGA CTCCGGGGCTGATCAGGAGGCAGAACTGGTTGATAAGTATATGTAGCAGCATAAGAAAGAAAGAATCCCA GATTGACACCCAGGCTTCTCACTTGGAAGCCTGGATAGATACTGAATGCAATCACAAAGGCTGGGAAGTC AATGGGACTGCAGGGAAGGGAAGGGAAGGGAGGAGAAGAGGAAGGGCAGGAGGGTCCAATATCAATATTC AGCTTTTAGATGTGTTGAGCTTGAAGTGCTCAGATGGAGAAGTCCAGGAGGCAGTAGAATACGGTGGTCC AGAGCACAGGAGAGCAATGTGGCTTGAGTTGTCATTTGCTCACATATTTCCGTGTCAGTTACTTGTCTTA GATCACAGAACAAGTTCTCCTCTCACAGTTTCCTGGCTCCACCTGTCTCATGCTCACCGTCAGCATCGAA ATTGAGCCACACCAGGGGTTCTGGATACCAGCTTCTCTCTAGGTGAGGCTGCTATAGTCAGCAGCTGATT AGTTGCAGTTATCAGCAACTGGTAATATAATATATTGTGCATATAAGTGTACCAGAAGTCATGTTTATAT ATTGCTGCAAATACTCGGAATGGGGATCTCTTGTTCCCTGCTTAAGACCACATCACATTACTTGGTTTTG TACGCTAGTGGCTGAACCAAAAAAAGTAGGAGATGATTTTTTTTCTTTTTTCTTAAAGCAGTAGCTTTTG AACCTTGACCATGCTTTCTAACCAGCTGAGGGGCTTTTGAAAAAGAGGGTGCCTTACTGTGCCCCAGACC AGGACAATCAGTATTTCTGGGGAATGGAGCCTGGCACACACACATTTCTTAAAGCTCCCTTGGCAATTCT GAGGAGTGGATTACATGTTGTATGTAGCTCGTAACGAAAGAAATCTTGTCTTTGCTCTCAGACCCCCATT TCTTACTCATCTCATGAGCTCCTTCGAGATCCAGAAACAGTTGCATATTTCATTAGTAAATCAGTTCCAG AGTCACATTTTATTTCACAAGTTAGTCCATTAAAAGTTTCCTGCAGTGAGGAAATAGCCAGAAAGAACAC TCCACCCCTCCTCCTTTTTATAACTATAGGGTCTGGCTCGACAGAGCAGGAGCATCGCCATCTTGGACAA Use Existing tools to build analysis workflows Develop your own tools • Linux • Command Line • Scripting • Programming • Algorithm Design • Machine Learning A = 1765 G = 3561 C = 2677 T = 1121
  • 22. What is Bioinformatics? 6/3/2020 Bioinformatics: What, Why and Where? 22 Use Existing tools to build analysis workflows Develop your own tools • Linux • Command Line • Scripting • Programming • Algorithm Design • Machine Learning
  • 23. Biologist (Biology Background) Use existing bioinformatics tools Computer Scientist (CS Background) Develops bioinformatics tools Basic User Windows OS Web-based Tools GUI Standalone tools No Programming skills Advanced User Linux OS Command line Standalone tools Basic Programming Skills Developer Basic Biology Knowledge Advanced Programming Skills Advanced Mathematics Advanced Statistics Who Can Be a Bioinformatician? 6/3/2020 Bioinformatics: What, Why and Where? 23
  • 24. How can I Learn Bioinformatics? Tons of free courses are available online! More than 26 million results when searching without comma! 6/3/2020 Bioinformatics: What, Why and Where? 24
  • 25. How can I Learn Bioinformatics? Tons of free courses are available online! More than 46 million results when searching without comma! 6/3/2020 Bioinformatics: What, Why and Where? 25
  • 26. Examples of Free Online Bioinformatics MOOCs Websites 6/3/2020 Bioinformatics: What, Why and Where? 26
  • 27. 6/3/2020 Bioinformatics: What, Why and Where? 27
  • 28. Milestones of Bioinformatics 28 • OMICS Sciences • Programming and Data Structure •Algorithm Design • LINUX • Statistics •Basic Mathematics • AI and Data Science •Data Visualization • Results Interpretation
  • 30. Next Step? 30 Read Papers and Reproduce Results! Compare Modify Explain Seek Internships Options! Real Life Problems!
  • 31. Advice… 31 Perceive Biology as CS and Perceive CS as Biology! The Link!
  • 32. No Need for a Supercomputer! 6/3/2020 Bioinformatics: What, Why and Where? 32
  • 33. Where to Find a Job (Egypt and Abroad)? 6/3/2020 Bioinformatics: What, Why and Where? 33 Research Academia Companies Startup Freelancer
  • 38. 6/3/2020 Bioinformatics: What, Why and Where? 38 Institute/Company Department Sequencer American University in Cairo (AUC) Biology Ion S5 American University in Cairo (AUC) Global Health and Human Ecology MiSeq National Research Center (NRC) Genetics MiSeq Zewail City of Science and Technology Center for Genomics MiSeq and NextSeq 500 Kasr Alainy School of Medicine Clinical Oncology 3 MiSeq CCHE 57357 Genomics program MiSeq and NextSeq 500 Ahram Canadian University Central Research Lab Agilent Bioanalyzer 2100 National Research Center (NRC) Genetics Ion torrent National Research Center (NRC) Environmental department Ion torrent PGM MASRI ain shams University Center Ion S5 and Ion shef Air forces specialised hospital Labs Miseq Maadi military hospital Labs Ion S5 Mansoura University Stem cells center Ion torrent National Cancer Institute (NCI) Molecular biology Ion S5 Abo Alraish Hospital Microbiology Labs MiSeq Alexandria Regional Center for Women's Health and Development Ion S5 Tanta University - Faculty of Medicine - Center of Exellence Genomic Signature Center MiSeq Magdi Yacoub Foundation MiSeq and NextSeq Generations Genetics Labs MiSeq Sequencers in Egypt (Sample) Source: Prof. Ahmed Moustafa, AUC.
  • 39. What Bioinformatics Can Do for Life Sciences? 6/3/2020 Bioinformatics: What, Why and Where? 39
  • 40. Genome Assembly 6/3/2020 Bioinformatics: What, Why and Where? 40
  • 41. Gene Prediction • Gene structure • Open Reading Frames (ORFs). • Start and stop of the gene • Locations of exons and introns • Splice variants • Gene prediction is one of the first and most important steps in understanding any genome after being sequenced. 6/3/2020 Bioinformatics: What, Why and Where? 41
  • 42. Sequence Comparison • Compare unknown gene or protein sequences against known sequences to identify their origin or function. • Finding Signatures that can be used in diagnostics 6/3/2020 Bioinformatics: What, Why and Where? 42
  • 43. Phylogenetic Analyses • Evolutionary relationship among a group of related molecules or organisms • Track gene flow based on sequence similarity 6/3/2020 Bioinformatics: What, Why and Where? 43
  • 44. Understand the Functions of Genes (Pathway Analysis) 6/3/2020 Bioinformatics: What, Why and Where? 44
  • 45. Predicting Protein Structure and Function • Protein’s 3D structure Prediction • Understand how biomolecules interact with other molecules • Predict functions based on interactions 6/3/2020 Bioinformatics: What, Why and Where? 45
  • 46. Drug Design • It is faster to analyze molecules on computer as compared to experimental approaches. • Helps in identifying drug targets easily • Simulating drug effects on computers 6/3/2020 Bioinformatics: What, Why and Where? 46
  • 48. Applications of Bioinformatics in Medicine • The Human Genome Project (HGP) helps scientists to search for genes directly associated with diseases and understand the molecular basis of those identified diseases. • This new Information will help in better understanding of the mechanisms of diseases and hence develop better treatment and preventive methods. 6/3/2020 Bioinformatics: What, Why and Where? 48
  • 49. Applications of Bioinformatics in Pharmacy • Identification and validating new drugs through Computer Aided Drug Design (CADD). • Helps to develop specific drugs with less side effect 6/3/2020 Bioinformatics: What, Why and Where? 49
  • 50. Applications of Bioinformatics in Food Security • Large amount of genomics data is available from plants and animals • Bioinformatic analysis of plant and animal genomes will help scientists to improve crops • Resistant to drought • Resistant to insects and pests • More nutritional value • Animals with higher meat quality and productivity 6/3/2020 Bioinformatics: What, Why and Where? 50
  • 51. Applications of Bioinformatics in the Environment • Sequencing and analysis of microbial genomes and search for genes expressing enzymes for • Bioremediation and biodegradation • Climate change studies (Microbes that use CO2 as their sole source of enegy) • Alternative energy sources (energy from light) • Microbes with industrial benefits • Generation of Biogas 6/3/2020 Bioinformatics: What, Why and Where? 51
  • 53. Take Home Messages • Understand the biological background first (in details)! • For writing a software • For using a software • Which tool/software to use? • Understand the algorithm behind each software/tool • Test different parameters • Select the best tool • Free software are everywhere • Read about benchmarking studies first • Before Writing your own software • Check if it is exist (don’t work from scratch) • Modify existing tools 6/3/2020 Bioinformatics: What, Why and Where? 53
  • 54. Biologists and Computer Scientitst Should Communicate! 6/3/2020 Bioinformatics: What, Why and Where? 54
  • 55. 6/3/2020 Bioinformatics: What, Why and Where? 55
  • 56. 6/3/2020 Bioinformatics: What, Why and Where? 56
  • 57. 6/3/2020 Bioinformatics: What, Why and Where? 57 Thank You for Your Attention!

Hinweis der Redaktion

  1. Each letter of letter (A,G,C or T) are called nucleotide