SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Human genome diversity: Frequently asked questions Guido Barbujani Dipartimento di Biologia ed Evoluzione, Università di Ferrara [email_address]
Total size   3 272 480 987(haploid) N of protein-coding genes   22 320 N of RNA-coding genes    9 922 N of gene exons   530 906 N of transcripts  142 707 N of segregating sites   15 040 632 Nucleotide differences with chimp   1.23% Chimp orthologue genes   13 454 Human genes missing in chimp 36 totally, 17 largely Classes of genes with max. differences  immune response,  reproduction, olfaction A few human genome statistics From www.ensembl.org version 57.37b (Jan. 2010)
Human-mouse alignment
Human-chimp alignment Chimp chromosomes 2 and 2a The human genome is very similar to the chimpanzee genome
Phylogenetic tree of human (n=70), chimpanzee (n=30), bonobo (n=5), gorilla (n=11) and orang-utan (n=14), based on 10,000 bp sequences of a noncoding Xq13.3 region. Kaessmann et al. (2001). Individual genetic diversity among humans is the lowest  of all primates
Genomic estimates of  F ST  for the global human population are    0.12  Human populations display    12% of the maximum possible diversity, given their allele frequencies   N of markers Samples F ST Reference 599,356 SNPs 209 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.13 Weir et al. 2005 1,034,741 SNPs 71 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.10 Weir et al. 2005 1,007,329 SNPs 269 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.12 International HapMap Consortium 2005 443,434 SNPs 3845 worldwide distributed individuals 0.052 Auton et al. 2009 2,841,354 SNPs 210 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.11 Barreiro et al. 2008 243,855 SNPs 554 individuals from 27 worldwide populations 0.123 Xing et al. 2009 100 Alu insertions 710 individuals from 23 worldwide populations 0.095 Watkins et al. 2008 67 CNVs  270 individuals from 4 populations with ancestry in Europe, Africa or Asia 0.11 Redon et al. 2006
0.38 0.32 0.12 Genetic diversity among human populations is the lowest  of all primates F ST Geographically-variable selection Small population sizes Little gene flow Isolation Stabilizing selection Large population sizes Extensive gene flow Admixture
Li et al. (2009) Clinal variation in the geographical space is the rule for human populations Cavalli-Sfdorza et al. (1994)
Methods 1 : Estimating variances from sequence comparisons - TA C GAACATC A GGC - - TA T GAACATC A GGC - - TA T GAACATC G GGC - Polymorphic DNA sites
Genetic variances within and between populations Population 1  Population 2  variance between pops. 100% 19% 0%
Independent studies of genetic variances yield very similar results: 85, 5, 10 Lewontin (1972)    17 loci 85% 8% 6% Latter (1973)  18 86% 5% 9% Barbujani et al. (1997) 109 85% 5%  10% Jorde et al. (2000) 100 85% 2%  13% Romualdi et al. (2002)    32 83% 8% 9% Rosenberg et al. (2002)  377 93%   3%  4% Excoffier & Hamilton (2003) 377  88% 3% 9% Ramachandran et al. (2005)   17 90%   5%   5% Bastos-Rodriguez et al. (2006)  40 86%   2%   12% Li et al. (2008)  650 000   89%   2%   9% MEDIANA within populations between populations between races or continents 85%   5%   10%
What does it mean, in practice? 100% 100% 100% Members of our community are only slightly less different from us than members of distant populations 85% 85% 85%
Mind the numbers Humans and chimps share >98% of their genomes Among the 1.8% differences, 1.7% are fixed differences within species The remaining fraction, 0.1%, contains all human genomic variation The differences among the main continental groups represent 10% of 0.1% of the total, that is, 0.01% But 0.1% of >3 billion DNA sites means >3 million polymorphic DNA sites (3,213,401 according to Levy et al. 2007)
Methods 2 : Clustering genotypes or haplotypes K=3 K=4 Rosenberg et al., 2002
SNPs Haplotypes CNV Jakobsson et al. (2008) Structure inferred from SNPs and haplotypes differs from that inferred from Copy Number Variation
Genes, as well as morphology, suggest inconsistent clusterings of genotypes Y chromosome: Romualdi et al. 2002 X chromosome: Wilson et al. 2001 377 STR loci: Barbujani and Belle 2006 377 STR loci: Rosenberg et al. 2002 Europe, Ethiopia S. Africa   N. Guinea Asia Africa Asia, Europe, Australia, Americas Americas Melanesia Eurasia N Africa N America Maya S. Africa E Africa C Africa Piapoco Suruì Karitiana Kalash W. Eurasia E. Asia Africa Americas Oceania
Sampling assumptions have a large effect on the apparent structuring
Sampling assumptions have a large effect on the apparent structuring Serre and P ääbo (2004)
ASIP A8818G MATP C374G Genetic variation is discordant across loci
Similar skin colors are due to different combinations of alleles MATP C374G ASIP A8818G
16 completely sequenced genomes  (as of May 1st, 2010)   And 5 more published on May 6th:
Two persons from the same continent may share fewer SNPs than persons of different continents
81% of SNPs cosmopolitan. Alleles present in one continent only: 0.91% in Africa, 0.75% in Asia, practically 0 elsewhere. Jakobsson et al. 2008 (525910 SNPs, 396 CNVs)
In  the 117 megabases (Mb) of sequenced exome-containing intervals, the average rate of nucleotide difference between a pair of the Bushmen was 1.2 per kb, compared to an average of 1.0 per kb between a European and Asian individual. Schuster et al. (2010) Greater differences between Africans than between European and Asians
Genetic diversity out of Africa is often a subset ot the African genetic diversity   Tishkoff et al. (1998)
LD decreasing with physical distance between loci and with geographic distance from East Africa Jakobsson et al. 2008
Gene diversity declines as a function of distance from Africa Best fit of the model for an African exit 56,000 years ago Liu et al. (2006)
Patterns of  morphological  and  genetic  variation are compatible with the effects of dispersal from Africa Manica et al. (2007)
Models with an African population replacing previous human continental groups explain the data better than any alternative models Fagundes et al. (2007)
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],To make a long story short: So, what happened?
100,000 years ago
70,000 years ago
60,000 years ago
30,000 years ago
10,000 years ago
 
[object Object],[object Object],[object Object],[object Object],Preliminary analyses
Divergence from modern humans Neandertals  fall inside the variation of present-day humans. Overall divergence greater for the three Neandertal genomes (modes ~11%), whereas the San mode is ~9% and for the other present-day humans ~8%. For the Neandertals, 13% of windows have a divergence above 20%, whereas this is the case for 2.5% to 3.7% of windows in the current humans
Segments of Neandertal genome in non-African genomes ,[object Object],[object Object],[object Object],[object Object],[object Object],?
1. Comparison with the HapMap sequences and 5 newly sequenced individuals 2. No comparison within Eurasia  (Papuan-French-Han) or within Africa (Yoruba- San) shows significant skews in D  3. All comparisons of non-Africans and Africans show that the Neandertal is closer to the  non-Africans 4. All or almost all the gene flow detected was from Neandertals into modern humans 5. Some old haplotypes most likely owe their presence in non Africans to gene flow from Neandertals
Four processes potentially accounting for the data  Between 1 and 4% of the genomes of people in Eurasia are derived from Neandertals 1. From erectus to Neandertal 2. From late Neandertals into the first Europeans 3. From early Neandertals into the first Eurasians 4. Ancient population structure, preserved from before the Neandertal – modern sapiens separation
Enza Colonna And if you want to read more about all this  Trends in Genetics , July 2010

Weitere ähnliche Inhalte

Was ist angesagt? (20)

Molecular basis of mutations
Molecular basis of mutationsMolecular basis of mutations
Molecular basis of mutations
 
Arabidopsis thaliana genome project
Arabidopsis thaliana genome projectArabidopsis thaliana genome project
Arabidopsis thaliana genome project
 
TRANSPOSON TAGGING
TRANSPOSON TAGGINGTRANSPOSON TAGGING
TRANSPOSON TAGGING
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Modern gene concept
Modern gene conceptModern gene concept
Modern gene concept
 
Genetics
Genetics Genetics
Genetics
 
Phylogenetic analysis
Phylogenetic analysisPhylogenetic analysis
Phylogenetic analysis
 
repetitive and non repetitive dna.pptx
repetitive and non repetitive dna.pptxrepetitive and non repetitive dna.pptx
repetitive and non repetitive dna.pptx
 
Expressivity
ExpressivityExpressivity
Expressivity
 
DNA Libraries
DNA LibrariesDNA Libraries
DNA Libraries
 
Yeast Genome
Yeast Genome Yeast Genome
Yeast Genome
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Prokaryotic and eukaryotic genome
Prokaryotic and eukaryotic genomeProkaryotic and eukaryotic genome
Prokaryotic and eukaryotic genome
 
Dna extraction
Dna extractionDna extraction
Dna extraction
 
Organellar genome
Organellar genomeOrganellar genome
Organellar genome
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
PHYLOGENETICS WITH MEGA
PHYLOGENETICS WITH MEGAPHYLOGENETICS WITH MEGA
PHYLOGENETICS WITH MEGA
 
Phylogenetic tree
Phylogenetic treePhylogenetic tree
Phylogenetic tree
 
Single nucleotide polymorphism
Single nucleotide polymorphismSingle nucleotide polymorphism
Single nucleotide polymorphism
 
Mutation & its detection
Mutation & its detectionMutation & its detection
Mutation & its detection
 

Andere mochten auch (16)

Gen pop7geneflow
Gen pop7geneflowGen pop7geneflow
Gen pop7geneflow
 
Barbujani leicester
Barbujani leicesterBarbujani leicester
Barbujani leicester
 
Comparing genes across linguistic families
Comparing genes across linguistic familiesComparing genes across linguistic families
Comparing genes across linguistic families
 
Gen pop1var
Gen pop1varGen pop1var
Gen pop1var
 
Genpop10coal e abc
Genpop10coal e abcGenpop10coal e abc
Genpop10coal e abc
 
Gen pop4ld
Gen pop4ldGen pop4ld
Gen pop4ld
 
Gen pop5mut
Gen pop5mutGen pop5mut
Gen pop5mut
 
Gen pop9mantpol
Gen pop9mantpolGen pop9mantpol
Gen pop9mantpol
 
Genetica di popolazioni 3
Genetica di popolazioni 3Genetica di popolazioni 3
Genetica di popolazioni 3
 
Milano darwinday
Milano darwindayMilano darwinday
Milano darwinday
 
Gen pop8selezione
Gen pop8selezioneGen pop8selezione
Gen pop8selezione
 
Genpop11a dna
Genpop11a dnaGenpop11a dna
Genpop11a dna
 
Genetica di Popolazioni 2
Genetica di Popolazioni 2Genetica di Popolazioni 2
Genetica di Popolazioni 2
 
Bari1 Darwin
Bari1 DarwinBari1 Darwin
Bari1 Darwin
 
Barbujani abt lecture
Barbujani abt lectureBarbujani abt lecture
Barbujani abt lecture
 
Gen pop6drift
Gen pop6driftGen pop6drift
Gen pop6drift
 

Ähnlich wie Lisbon genome diversity

Colloquium Presentation 2009 Fall Bongsoo
Colloquium Presentation 2009 Fall BongsooColloquium Presentation 2009 Fall Bongsoo
Colloquium Presentation 2009 Fall BongsooBongsoo Park
 
Insights into the genetic diversity and structure of indigenous ovi-caprine p...
Insights into the genetic diversity and structure of indigenous ovi-caprine p...Insights into the genetic diversity and structure of indigenous ovi-caprine p...
Insights into the genetic diversity and structure of indigenous ovi-caprine p...ILRI
 
Honors ~ Evolution 1011
Honors ~ Evolution 1011Honors ~ Evolution 1011
Honors ~ Evolution 1011Michael Edgar
 
Biodiverse - Rosauer talk @ iEvoBio conference June 2010
Biodiverse - Rosauer talk @ iEvoBio conference June 2010Biodiverse - Rosauer talk @ iEvoBio conference June 2010
Biodiverse - Rosauer talk @ iEvoBio conference June 2010Dan Rosauer
 
An integrated map of genetic variation from 1,092
An integrated map of genetic variation from 1,092An integrated map of genetic variation from 1,092
An integrated map of genetic variation from 1,092Grigory Sapunov
 
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using MacrosatilltesUnveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using MacrosatilltesMSIMUKO ELLISON
 
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014Carolina Ruivo Pereira
 
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...Pasteur_Tunis
 
Talk at Institut Jean Nicod on 6 October 2010
Talk at Institut Jean Nicod on 6 October 2010Talk at Institut Jean Nicod on 6 October 2010
Talk at Institut Jean Nicod on 6 October 2010Robin Ryder
 
Human genetic diversity and origin of major human groups
Human genetic diversity and origin of major human groupsHuman genetic diversity and origin of major human groups
Human genetic diversity and origin of major human groupsMayank Sagar
 
Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...jembrown
 
Genetic variation and its role in health pharmacology
Genetic variation and its role in health pharmacologyGenetic variation and its role in health pharmacology
Genetic variation and its role in health pharmacologyDeepak Kumar
 
Human genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traitsHuman genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traitsgroovescience
 
Early Human Origins
Early Human OriginsEarly Human Origins
Early Human Originsbdrydyk
 
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Kate Hertweck
 

Ähnlich wie Lisbon genome diversity (20)

Colloquium Presentation 2009 Fall Bongsoo
Colloquium Presentation 2009 Fall BongsooColloquium Presentation 2009 Fall Bongsoo
Colloquium Presentation 2009 Fall Bongsoo
 
Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...
Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...
Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...
 
Insights into the genetic diversity and structure of indigenous ovi-caprine p...
Insights into the genetic diversity and structure of indigenous ovi-caprine p...Insights into the genetic diversity and structure of indigenous ovi-caprine p...
Insights into the genetic diversity and structure of indigenous ovi-caprine p...
 
Honors ~ Evolution 1011
Honors ~ Evolution 1011Honors ~ Evolution 1011
Honors ~ Evolution 1011
 
Biodiverse - Rosauer talk @ iEvoBio conference June 2010
Biodiverse - Rosauer talk @ iEvoBio conference June 2010Biodiverse - Rosauer talk @ iEvoBio conference June 2010
Biodiverse - Rosauer talk @ iEvoBio conference June 2010
 
Rym kefi (1)
Rym kefi (1)Rym kefi (1)
Rym kefi (1)
 
Human Evolution Talk
Human Evolution TalkHuman Evolution Talk
Human Evolution Talk
 
Rice
RiceRice
Rice
 
An integrated map of genetic variation from 1,092
An integrated map of genetic variation from 1,092An integrated map of genetic variation from 1,092
An integrated map of genetic variation from 1,092
 
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using MacrosatilltesUnveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
 
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
 
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
 
Lise Sandenbergh poster
Lise Sandenbergh poster Lise Sandenbergh poster
Lise Sandenbergh poster
 
Talk at Institut Jean Nicod on 6 October 2010
Talk at Institut Jean Nicod on 6 October 2010Talk at Institut Jean Nicod on 6 October 2010
Talk at Institut Jean Nicod on 6 October 2010
 
Human genetic diversity and origin of major human groups
Human genetic diversity and origin of major human groupsHuman genetic diversity and origin of major human groups
Human genetic diversity and origin of major human groups
 
Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...
 
Genetic variation and its role in health pharmacology
Genetic variation and its role in health pharmacologyGenetic variation and its role in health pharmacology
Genetic variation and its role in health pharmacology
 
Human genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traitsHuman genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traits
 
Early Human Origins
Early Human OriginsEarly Human Origins
Early Human Origins
 
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
 

Mehr von Genetica, Ferrara University, Italy (17)

13 estensioni mendel
13 estensioni mendel13 estensioni mendel
13 estensioni mendel
 
08 genomica
08 genomica08 genomica
08 genomica
 
06 traduzione
06 traduzione06 traduzione
06 traduzione
 
04 funzione del gene
04 funzione del gene04 funzione del gene
04 funzione del gene
 
Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...
Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...
Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...
 
Rovereto
RoveretoRovereto
Rovereto
 
Genpop9coal e abc
Genpop9coal e abcGenpop9coal e abc
Genpop9coal e abc
 
21 genetica di popolazioni
21 genetica di popolazioni21 genetica di popolazioni
21 genetica di popolazioni
 
20 genetica del cancro
20 genetica del cancro20 genetica del cancro
20 genetica del cancro
 
18 regolazione eucarioti
18 regolazione eucarioti18 regolazione eucarioti
18 regolazione eucarioti
 
17 regolazione procarioti
17 regolazione procarioti17 regolazione procarioti
17 regolazione procarioti
 
16 variazione cromosomi
16 variazione cromosomi16 variazione cromosomi
16 variazione cromosomi
 
15 mappe genetiche procarioti
15 mappe genetiche procarioti15 mappe genetiche procarioti
15 mappe genetiche procarioti
 
14 mappe genetiche eucarioti
14 mappe genetiche eucarioti14 mappe genetiche eucarioti
14 mappe genetiche eucarioti
 
12 basi cromosomiche
12 basi cromosomiche12 basi cromosomiche
12 basi cromosomiche
 
11 genetica mendeliana
11 genetica mendeliana11 genetica mendeliana
11 genetica mendeliana
 
05 trascrizione
05 trascrizione05 trascrizione
05 trascrizione
 

Kürzlich hochgeladen

The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 

Kürzlich hochgeladen (20)

The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 

Lisbon genome diversity

  • 1. Human genome diversity: Frequently asked questions Guido Barbujani Dipartimento di Biologia ed Evoluzione, Università di Ferrara [email_address]
  • 2. Total size 3 272 480 987(haploid) N of protein-coding genes 22 320 N of RNA-coding genes 9 922 N of gene exons 530 906 N of transcripts 142 707 N of segregating sites 15 040 632 Nucleotide differences with chimp 1.23% Chimp orthologue genes 13 454 Human genes missing in chimp 36 totally, 17 largely Classes of genes with max. differences immune response, reproduction, olfaction A few human genome statistics From www.ensembl.org version 57.37b (Jan. 2010)
  • 4. Human-chimp alignment Chimp chromosomes 2 and 2a The human genome is very similar to the chimpanzee genome
  • 5. Phylogenetic tree of human (n=70), chimpanzee (n=30), bonobo (n=5), gorilla (n=11) and orang-utan (n=14), based on 10,000 bp sequences of a noncoding Xq13.3 region. Kaessmann et al. (2001). Individual genetic diversity among humans is the lowest of all primates
  • 6. Genomic estimates of F ST for the global human population are  0.12 Human populations display  12% of the maximum possible diversity, given their allele frequencies N of markers Samples F ST Reference 599,356 SNPs 209 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.13 Weir et al. 2005 1,034,741 SNPs 71 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.10 Weir et al. 2005 1,007,329 SNPs 269 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.12 International HapMap Consortium 2005 443,434 SNPs 3845 worldwide distributed individuals 0.052 Auton et al. 2009 2,841,354 SNPs 210 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.11 Barreiro et al. 2008 243,855 SNPs 554 individuals from 27 worldwide populations 0.123 Xing et al. 2009 100 Alu insertions 710 individuals from 23 worldwide populations 0.095 Watkins et al. 2008 67 CNVs 270 individuals from 4 populations with ancestry in Europe, Africa or Asia 0.11 Redon et al. 2006
  • 7. 0.38 0.32 0.12 Genetic diversity among human populations is the lowest of all primates F ST Geographically-variable selection Small population sizes Little gene flow Isolation Stabilizing selection Large population sizes Extensive gene flow Admixture
  • 8. Li et al. (2009) Clinal variation in the geographical space is the rule for human populations Cavalli-Sfdorza et al. (1994)
  • 9. Methods 1 : Estimating variances from sequence comparisons - TA C GAACATC A GGC - - TA T GAACATC A GGC - - TA T GAACATC G GGC - Polymorphic DNA sites
  • 10. Genetic variances within and between populations Population 1 Population 2 variance between pops. 100% 19% 0%
  • 11. Independent studies of genetic variances yield very similar results: 85, 5, 10 Lewontin (1972) 17 loci 85% 8% 6% Latter (1973) 18 86% 5% 9% Barbujani et al. (1997) 109 85% 5% 10% Jorde et al. (2000) 100 85% 2% 13% Romualdi et al. (2002) 32 83% 8% 9% Rosenberg et al. (2002) 377 93% 3% 4% Excoffier & Hamilton (2003) 377 88% 3% 9% Ramachandran et al. (2005) 17 90% 5% 5% Bastos-Rodriguez et al. (2006) 40 86% 2% 12% Li et al. (2008) 650 000 89% 2% 9% MEDIANA within populations between populations between races or continents 85% 5% 10%
  • 12. What does it mean, in practice? 100% 100% 100% Members of our community are only slightly less different from us than members of distant populations 85% 85% 85%
  • 13. Mind the numbers Humans and chimps share >98% of their genomes Among the 1.8% differences, 1.7% are fixed differences within species The remaining fraction, 0.1%, contains all human genomic variation The differences among the main continental groups represent 10% of 0.1% of the total, that is, 0.01% But 0.1% of >3 billion DNA sites means >3 million polymorphic DNA sites (3,213,401 according to Levy et al. 2007)
  • 14. Methods 2 : Clustering genotypes or haplotypes K=3 K=4 Rosenberg et al., 2002
  • 15. SNPs Haplotypes CNV Jakobsson et al. (2008) Structure inferred from SNPs and haplotypes differs from that inferred from Copy Number Variation
  • 16. Genes, as well as morphology, suggest inconsistent clusterings of genotypes Y chromosome: Romualdi et al. 2002 X chromosome: Wilson et al. 2001 377 STR loci: Barbujani and Belle 2006 377 STR loci: Rosenberg et al. 2002 Europe, Ethiopia S. Africa N. Guinea Asia Africa Asia, Europe, Australia, Americas Americas Melanesia Eurasia N Africa N America Maya S. Africa E Africa C Africa Piapoco Suruì Karitiana Kalash W. Eurasia E. Asia Africa Americas Oceania
  • 17. Sampling assumptions have a large effect on the apparent structuring
  • 18. Sampling assumptions have a large effect on the apparent structuring Serre and P ääbo (2004)
  • 19. ASIP A8818G MATP C374G Genetic variation is discordant across loci
  • 20. Similar skin colors are due to different combinations of alleles MATP C374G ASIP A8818G
  • 21. 16 completely sequenced genomes (as of May 1st, 2010) And 5 more published on May 6th:
  • 22. Two persons from the same continent may share fewer SNPs than persons of different continents
  • 23. 81% of SNPs cosmopolitan. Alleles present in one continent only: 0.91% in Africa, 0.75% in Asia, practically 0 elsewhere. Jakobsson et al. 2008 (525910 SNPs, 396 CNVs)
  • 24. In the 117 megabases (Mb) of sequenced exome-containing intervals, the average rate of nucleotide difference between a pair of the Bushmen was 1.2 per kb, compared to an average of 1.0 per kb between a European and Asian individual. Schuster et al. (2010) Greater differences between Africans than between European and Asians
  • 25. Genetic diversity out of Africa is often a subset ot the African genetic diversity Tishkoff et al. (1998)
  • 26. LD decreasing with physical distance between loci and with geographic distance from East Africa Jakobsson et al. 2008
  • 27. Gene diversity declines as a function of distance from Africa Best fit of the model for an African exit 56,000 years ago Liu et al. (2006)
  • 28. Patterns of morphological and genetic variation are compatible with the effects of dispersal from Africa Manica et al. (2007)
  • 29. Models with an African population replacing previous human continental groups explain the data better than any alternative models Fagundes et al. (2007)
  • 30.
  • 36.  
  • 37.
  • 38. Divergence from modern humans Neandertals fall inside the variation of present-day humans. Overall divergence greater for the three Neandertal genomes (modes ~11%), whereas the San mode is ~9% and for the other present-day humans ~8%. For the Neandertals, 13% of windows have a divergence above 20%, whereas this is the case for 2.5% to 3.7% of windows in the current humans
  • 39.
  • 40. 1. Comparison with the HapMap sequences and 5 newly sequenced individuals 2. No comparison within Eurasia (Papuan-French-Han) or within Africa (Yoruba- San) shows significant skews in D 3. All comparisons of non-Africans and Africans show that the Neandertal is closer to the non-Africans 4. All or almost all the gene flow detected was from Neandertals into modern humans 5. Some old haplotypes most likely owe their presence in non Africans to gene flow from Neandertals
  • 41. Four processes potentially accounting for the data Between 1 and 4% of the genomes of people in Eurasia are derived from Neandertals 1. From erectus to Neandertal 2. From late Neandertals into the first Europeans 3. From early Neandertals into the first Eurasians 4. Ancient population structure, preserved from before the Neandertal – modern sapiens separation
  • 42. Enza Colonna And if you want to read more about all this Trends in Genetics , July 2010