SlideShare a Scribd company logo
1 of 27
Download to read offline
Laure GUILLOU
Station Biologique Roscoff
Diversity and Interactions within the oceanic plankton (DIPO team)
UMR 7144 CNRS, Paris VI
The Syndiniales Amoebophrya
ceratii-complex clade 2
infecting Heterocapsa triquetra
New chytrid (Dinomyces arenysensis )
infecting Alexandrium minutum
The gregarine Ancora
sagittata infecting the
polychaete Capitella capitata
Long term dynamic
of coastal waters
Nathalie Simon
Polar systems and RCC
Daniel Vaulot
Anne-Claire Baudoux
Marine viruses
Parasites in aquatic
systems
Laure Guillou
20
µm
The Roscoff DIPO Team
Fabrice Not
Radiolarians
http://ssu-rrna.org/pr2
Curated taxonomy of unicellular eukaryotes
Small SubUnit rRNA and rDNA sequences
Past of the PR2 database
1997 First Database (Daniel Vaulot)
2000
2003
2009
2013
http://keydnatools.com/
http://ssu-rrna.org/pr2
EU PICODIV project (Daniel Vaulot)
Available online
databases
(Laure Guillou)
EU Biomarks project
(Colomban de Vargas)
French ANR project
(Laure Guillou)
The genesis of PR2
• The first embryonic PR2 was created around 1997 by
D. Vaulot as an Excel file cataloguing the few hundred
algal 18S sequences available at the time
• Unfortunately despite heavy archeological digging,
no trace of this file has been found....
EU project PICODIV (2000-2003)
Coord. Vaulot Daniel
OLIPAC cruise Nov.
1994
Oslo 2003
Roscoff 2000
Bremerhaven 2002 Bremerhaven 2002
France
Spanish
England
Germany
Norway
We miss
Colomban!
Access database
ARB database
Shared between all participants
EU project PICODIV (2000-2003)
Coord. Vaulot Daniel
Important numbers of novel eukaryotic lineages
Formal
taxinomy
Novel lineages
Environmental
sequences
New classification of Eukaryotes
Using fixed framework (8 taxonomical fields)
MALV lineages
MAST lineages
First problem: environmental sequences
100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 1700 1800
A
B
A. Sequence AJ010408 (Micromonas pusilla, prasinophyte)
B. Squence M88521 (Symbiodinium microadriaticum, Dinophyceae)
V4 region V9 region
100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 1700 1800
B/A/B AB B
Detection of chimera
Second problem: chimera
http://keydnatools.com/
AACTGGTTTAAAGCTTGATTCGTAGCTGCGTTTaAGGGGAAATCGATAGCTT
ACTGGTTTAAAGCTT
GGGGAAATCGATAG
SSU rDNA
Small TAGs (Keys)
AACTGGTTTAAAGCTTGccctaGTAGCcgtaaatcTGGGGGAAATCGATAGCTT
Species 1
Species 2
ccctaGTAGCcgtaa
Order (1&2)
Class (1&2)
Species 1 TTCGTAGCTGCGTT
Species 2
…..
…..
…..
Annotation of
environmental
sequences
Automatic generation from referenced database (22501 sequences)
y = 8,7441x - 5558,7
R
2
= 0,8829
80,000
90,000
100,000
110,000
120,000
130,000
140,000
150,000
160,000
170,000
10,000 11,000 12,000 13,000 14,000 15,000 16,000 17,000 18,000 19,000
21 of November 2008
26 of April 2007
Number of sequences in the reference database
Numberofkeysgenerated
Last update: August 2012
Ambient
Elevated
atmospheric CO2
FgAr
Cer
StrM Alv
KeyDNAtools
Different annotation 8%
Chimera 19%
Converging annotation 73%
1936 almost complete sequences of 18S
From soil (not marine…)
Published
500 sequences per submission
This web site was stopped with the use of NGS technology
But was very useful to built a robust, chimera-free, referenced database
http://ssu-rrna.org/pr2
List of
experts
in taxonomy
+ Bioinfo
Curated taxonomy of unicellular eukaryotes
Small SubUnit rRNA and rDNA sequences
57 citations in two years
• PR2 is a database made by biologists for biologists
• This is a simple, fast evolving database, which adapts in size and
application to our own scientific projects
THIS IS A TOOL, opens to everyone, but not the central activity of our
scientific activity (as SILVA)
Updates are time-consuming, requier time and money.
Bacteria, Archaea and Eukaryota
January 2011: same initial database
Silva was not updated using PR2 since 2013 = updates over time are complicated and need a constant
effort from experts. PR2: last update in August 2014.
TOOLS require for the annotation process/validation need to be simplified
The future of PR2
PR2 Database moved to Roscoff - Fall 2015 (Richard Christen will retire soon).
Work in progress now…
Incorporate novel sequences AND
published updates of the taxonomy
(alveolates, radiolarians, Chlorophyta,
diatoms, haptophytes…)
Integration of the EukREF improvment if
possible ?
We are preparing a novel update of
PR2 for 2015
Future PR2 updates…
Biard et al. (in press)
Collodarians
Tragin et al. (in prep)
Green lineages Daniel VaulotFabrice Not
We will also contact different experts
soon (Bente E., Adriana Z. etc..)
Work in progress now… = making our live easier!
2- Upgrade and streamline PR2 web site
 Downloading new functions, simplification of the PR2 website
 NGS pipelines (using R) (in fact the tools we are currently using now for
sequence annotation)
 Metadata (in progress for Prasinophytes)
3- Incorporate NGS database – 2016 (Daniel)
Altran data management company- in progress: 2nd semester 2015
1- New tools to help in database creation and maintenance (functional genes,
ribosomal genes, …)
ALL OF THESE UPDATES ARE LINKED WITH OUR RESPECTIVE RUNNING PROJECTS
This is probably a critical point for the viability of all databases
Future of the PR2 database?
1997 First Database (Daniel Vaulot)
2000
2003
2009
2013
http://keydnatools.com/
http://ssu-rrna.org/pr2
EU PICODIV project (Daniel Vaulot)
Available online
databases (Laure
Guillou) UNIEUK (Colomban)
Diversity; metabarcoding = taxonomy is important BUT how
these organisms interact each other is primordial
AQUASYMBIO: a web site database recording all known
symbiotic (mutualistic symbioses, parasites, …) interactions in
aquatic systems .
French ANR project HAPAR (Guillou Laure and Not Fabrice)
AQUASYMBIO (Laure)
Described Interactions
HOST (Species X) AND SYMBIONT (Species Y)
Where?
When?
Ref
+
Species Z
Diagnosis
Live cycle
Ilustrations
Ref
Species X
Diagnosis
Live cycle
Ilustrations
Ref
Species W
Diagnosis
Live cycle
Ilustrations
Ref
Species Y
Diagnosis
Live cycle
Ilustrations
Ref
Species X
Species Y
Species Z
….
Hosts Symbionts
Interactome
Species description (with Glossary)
In progress
(1rst release in
2016)

More Related Content

Similar to The Protist Ribosomal Database (PR2)

ApplicationNote-Brian-D-Gregory_1008V1
ApplicationNote-Brian-D-Gregory_1008V1ApplicationNote-Brian-D-Gregory_1008V1
ApplicationNote-Brian-D-Gregory_1008V1
Jason Holzman
 
Jeff Grethe: CAMERA
Jeff Grethe: CAMERAJeff Grethe: CAMERA
Jeff Grethe: CAMERA
Iddo
 
wolstencroft-ogf20-astro
wolstencroft-ogf20-astrowolstencroft-ogf20-astro
wolstencroft-ogf20-astro
webuploader
 
EODATASERVICE.ORG - Digital Earth Platform to enable Muti-disciplinary Geospa...
EODATASERVICE.ORG - Digital Earth Platform to enable Muti-disciplinary Geospa...EODATASERVICE.ORG - Digital Earth Platform to enable Muti-disciplinary Geospa...
EODATASERVICE.ORG - Digital Earth Platform to enable Muti-disciplinary Geospa...
EUDAT
 

Similar to The Protist Ribosomal Database (PR2) (20)

Animal Repellent System for Smart Farming Using AI and Deep Learning
Animal Repellent System for Smart Farming Using AI and Deep LearningAnimal Repellent System for Smart Farming Using AI and Deep Learning
Animal Repellent System for Smart Farming Using AI and Deep Learning
 
2013 oct 2 rna sequencing
2013 oct 2 rna sequencing2013 oct 2 rna sequencing
2013 oct 2 rna sequencing
 
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analys...
 
Sensors and Internet of Things: the role of nanostructured semiconductors
Sensors and Internet of Things: the role of nanostructured semiconductorsSensors and Internet of Things: the role of nanostructured semiconductors
Sensors and Internet of Things: the role of nanostructured semiconductors
 
Creating a Cyberinfrastructure for Advanced Marine Microbial Ecology Research...
Creating a Cyberinfrastructure for Advanced Marine Microbial Ecology Research...Creating a Cyberinfrastructure for Advanced Marine Microbial Ecology Research...
Creating a Cyberinfrastructure for Advanced Marine Microbial Ecology Research...
 
Genome Assembly
Genome AssemblyGenome Assembly
Genome Assembly
 
De los rasgos poligénicos a los poligenómicos 250517
De los rasgos poligénicos a los poligenómicos 250517De los rasgos poligénicos a los poligenómicos 250517
De los rasgos poligénicos a los poligenómicos 250517
 
OptIPuter: Metagenomics at Light Speed
OptIPuter: Metagenomics at Light SpeedOptIPuter: Metagenomics at Light Speed
OptIPuter: Metagenomics at Light Speed
 
Building an Information Infrastructure to Support Microbial Metagenomic Sciences
Building an Information Infrastructure to Support Microbial Metagenomic SciencesBuilding an Information Infrastructure to Support Microbial Metagenomic Sciences
Building an Information Infrastructure to Support Microbial Metagenomic Sciences
 
Expression analysis of water stress related genes in tomato plant
Expression analysis of water stress related genes in tomato plantExpression analysis of water stress related genes in tomato plant
Expression analysis of water stress related genes in tomato plant
 
Improving spatial representation easton
Improving spatial representation   eastonImproving spatial representation   easton
Improving spatial representation easton
 
ApplicationNote-Brian-D-Gregory_1008V1
ApplicationNote-Brian-D-Gregory_1008V1ApplicationNote-Brian-D-Gregory_1008V1
ApplicationNote-Brian-D-Gregory_1008V1
 
Jeff Grethe: CAMERA
Jeff Grethe: CAMERAJeff Grethe: CAMERA
Jeff Grethe: CAMERA
 
wolstencroft-ogf20-astro
wolstencroft-ogf20-astrowolstencroft-ogf20-astro
wolstencroft-ogf20-astro
 
EB-eye Back End
EB-eye Back EndEB-eye Back End
EB-eye Back End
 
160620 sole nomics v2
160620 sole nomics v2160620 sole nomics v2
160620 sole nomics v2
 
A review on Implementation Of Integrated System to Avoid Flood Like Situation
A review on  Implementation Of Integrated System to Avoid Flood Like SituationA review on  Implementation Of Integrated System to Avoid Flood Like Situation
A review on Implementation Of Integrated System to Avoid Flood Like Situation
 
Collaborations Between Calit2, SIO, and the Venter Institute-a Beginning
Collaborations Between Calit2, SIO, and the Venter Institute-a BeginningCollaborations Between Calit2, SIO, and the Venter Institute-a Beginning
Collaborations Between Calit2, SIO, and the Venter Institute-a Beginning
 
EODATASERVICE.ORG - Digital Earth Platform to enable Muti-disciplinary Geospa...
EODATASERVICE.ORG - Digital Earth Platform to enable Muti-disciplinary Geospa...EODATASERVICE.ORG - Digital Earth Platform to enable Muti-disciplinary Geospa...
EODATASERVICE.ORG - Digital Earth Platform to enable Muti-disciplinary Geospa...
 
Software Pipelines: The Good, The Bad and The Ugly
Software Pipelines: The Good, The Bad and The UglySoftware Pipelines: The Good, The Bad and The Ugly
Software Pipelines: The Good, The Bad and The Ugly
 

More from EukRef

Aquatic protist communities in the midst of environmental change:
Aquatic protist communities in the midst of environmental change:Aquatic protist communities in the midst of environmental change:
Aquatic protist communities in the midst of environmental change:
EukRef
 
Photosynthetic euglenids
Photosynthetic euglenidsPhotosynthetic euglenids
Photosynthetic euglenids
EukRef
 
Division Haptophyta – phylogeny and diversity
Division Haptophyta – phylogeny and diversityDivision Haptophyta – phylogeny and diversity
Division Haptophyta – phylogeny and diversity
EukRef
 
Recent Contributions to the Eukaryote Tree of Life
Recent Contributions to the Eukaryote Tree of LifeRecent Contributions to the Eukaryote Tree of Life
Recent Contributions to the Eukaryote Tree of Life
EukRef
 

More from EukRef (12)

Towards a Natural Classification of Diatoms (Bacillariophyceae)
Towards a Natural Classification of Diatoms (Bacillariophyceae)Towards a Natural Classification of Diatoms (Bacillariophyceae)
Towards a Natural Classification of Diatoms (Bacillariophyceae)
 
Barcoding heliozoa
Barcoding heliozoaBarcoding heliozoa
Barcoding heliozoa
 
Phylum Ciliophora: Morphology, Phylogeny, and Systematics
Phylum Ciliophora: Morphology, Phylogeny, and SystematicsPhylum Ciliophora: Morphology, Phylogeny, and Systematics
Phylum Ciliophora: Morphology, Phylogeny, and Systematics
 
Aquatic protist communities in the midst of environmental change:
Aquatic protist communities in the midst of environmental change:Aquatic protist communities in the midst of environmental change:
Aquatic protist communities in the midst of environmental change:
 
Parabasalia (Excavata, Metamonada)
Parabasalia (Excavata, Metamonada)Parabasalia (Excavata, Metamonada)
Parabasalia (Excavata, Metamonada)
 
Rhodophyta: A cornucopia of cryptic diversity
Rhodophyta: A cornucopia of cryptic diversityRhodophyta: A cornucopia of cryptic diversity
Rhodophyta: A cornucopia of cryptic diversity
 
The Choanoflagellates
The ChoanoflagellatesThe Choanoflagellates
The Choanoflagellates
 
Trichomonads: free-living and parasitic protists of pets, livestock, wildlife...
Trichomonads: free-living and parasitic protists of pets, livestock, wildlife...Trichomonads: free-living and parasitic protists of pets, livestock, wildlife...
Trichomonads: free-living and parasitic protists of pets, livestock, wildlife...
 
Building the first SSU database of phagotrophic euglenids using single-cell a...
Building the first SSU database of phagotrophic euglenids using single-cell a...Building the first SSU database of phagotrophic euglenids using single-cell a...
Building the first SSU database of phagotrophic euglenids using single-cell a...
 
Photosynthetic euglenids
Photosynthetic euglenidsPhotosynthetic euglenids
Photosynthetic euglenids
 
Division Haptophyta – phylogeny and diversity
Division Haptophyta – phylogeny and diversityDivision Haptophyta – phylogeny and diversity
Division Haptophyta – phylogeny and diversity
 
Recent Contributions to the Eukaryote Tree of Life
Recent Contributions to the Eukaryote Tree of LifeRecent Contributions to the Eukaryote Tree of Life
Recent Contributions to the Eukaryote Tree of Life
 

Recently uploaded

development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
Silpa
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
Silpa
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
Silpa
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
ANSARKHAN96
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
Scintica Instrumentation
 

Recently uploaded (20)

FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.Atp synthase , Atp synthase complex 1 to 4.
Atp synthase , Atp synthase complex 1 to 4.
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 

The Protist Ribosomal Database (PR2)

  • 1. Laure GUILLOU Station Biologique Roscoff Diversity and Interactions within the oceanic plankton (DIPO team) UMR 7144 CNRS, Paris VI The Syndiniales Amoebophrya ceratii-complex clade 2 infecting Heterocapsa triquetra New chytrid (Dinomyces arenysensis ) infecting Alexandrium minutum The gregarine Ancora sagittata infecting the polychaete Capitella capitata
  • 2. Long term dynamic of coastal waters Nathalie Simon Polar systems and RCC Daniel Vaulot Anne-Claire Baudoux Marine viruses Parasites in aquatic systems Laure Guillou 20 µm The Roscoff DIPO Team Fabrice Not Radiolarians
  • 3. http://ssu-rrna.org/pr2 Curated taxonomy of unicellular eukaryotes Small SubUnit rRNA and rDNA sequences
  • 4. Past of the PR2 database 1997 First Database (Daniel Vaulot) 2000 2003 2009 2013 http://keydnatools.com/ http://ssu-rrna.org/pr2 EU PICODIV project (Daniel Vaulot) Available online databases (Laure Guillou) EU Biomarks project (Colomban de Vargas) French ANR project (Laure Guillou)
  • 5. The genesis of PR2 • The first embryonic PR2 was created around 1997 by D. Vaulot as an Excel file cataloguing the few hundred algal 18S sequences available at the time • Unfortunately despite heavy archeological digging, no trace of this file has been found....
  • 6. EU project PICODIV (2000-2003) Coord. Vaulot Daniel OLIPAC cruise Nov. 1994
  • 7. Oslo 2003 Roscoff 2000 Bremerhaven 2002 Bremerhaven 2002 France Spanish England Germany Norway We miss Colomban!
  • 8. Access database ARB database Shared between all participants EU project PICODIV (2000-2003) Coord. Vaulot Daniel
  • 9. Important numbers of novel eukaryotic lineages
  • 10. Formal taxinomy Novel lineages Environmental sequences New classification of Eukaryotes Using fixed framework (8 taxonomical fields) MALV lineages MAST lineages First problem: environmental sequences
  • 11. 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 1700 1800 A B A. Sequence AJ010408 (Micromonas pusilla, prasinophyte) B. Squence M88521 (Symbiodinium microadriaticum, Dinophyceae) V4 region V9 region 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 1700 1800 B/A/B AB B Detection of chimera Second problem: chimera
  • 12. http://keydnatools.com/ AACTGGTTTAAAGCTTGATTCGTAGCTGCGTTTaAGGGGAAATCGATAGCTT ACTGGTTTAAAGCTT GGGGAAATCGATAG SSU rDNA Small TAGs (Keys) AACTGGTTTAAAGCTTGccctaGTAGCcgtaaatcTGGGGGAAATCGATAGCTT Species 1 Species 2 ccctaGTAGCcgtaa Order (1&2) Class (1&2) Species 1 TTCGTAGCTGCGTT Species 2 ….. ….. ….. Annotation of environmental sequences Automatic generation from referenced database (22501 sequences)
  • 13. y = 8,7441x - 5558,7 R 2 = 0,8829 80,000 90,000 100,000 110,000 120,000 130,000 140,000 150,000 160,000 170,000 10,000 11,000 12,000 13,000 14,000 15,000 16,000 17,000 18,000 19,000 21 of November 2008 26 of April 2007 Number of sequences in the reference database Numberofkeysgenerated
  • 15. Ambient Elevated atmospheric CO2 FgAr Cer StrM Alv KeyDNAtools Different annotation 8% Chimera 19% Converging annotation 73% 1936 almost complete sequences of 18S From soil (not marine…) Published
  • 16.
  • 17. 500 sequences per submission This web site was stopped with the use of NGS technology But was very useful to built a robust, chimera-free, referenced database
  • 18. http://ssu-rrna.org/pr2 List of experts in taxonomy + Bioinfo Curated taxonomy of unicellular eukaryotes Small SubUnit rRNA and rDNA sequences
  • 19. 57 citations in two years
  • 20. • PR2 is a database made by biologists for biologists • This is a simple, fast evolving database, which adapts in size and application to our own scientific projects THIS IS A TOOL, opens to everyone, but not the central activity of our scientific activity (as SILVA) Updates are time-consuming, requier time and money.
  • 21. Bacteria, Archaea and Eukaryota January 2011: same initial database
  • 22. Silva was not updated using PR2 since 2013 = updates over time are complicated and need a constant effort from experts. PR2: last update in August 2014. TOOLS require for the annotation process/validation need to be simplified
  • 23. The future of PR2 PR2 Database moved to Roscoff - Fall 2015 (Richard Christen will retire soon). Work in progress now… Incorporate novel sequences AND published updates of the taxonomy (alveolates, radiolarians, Chlorophyta, diatoms, haptophytes…) Integration of the EukREF improvment if possible ? We are preparing a novel update of PR2 for 2015
  • 24. Future PR2 updates… Biard et al. (in press) Collodarians Tragin et al. (in prep) Green lineages Daniel VaulotFabrice Not We will also contact different experts soon (Bente E., Adriana Z. etc..)
  • 25. Work in progress now… = making our live easier! 2- Upgrade and streamline PR2 web site  Downloading new functions, simplification of the PR2 website  NGS pipelines (using R) (in fact the tools we are currently using now for sequence annotation)  Metadata (in progress for Prasinophytes) 3- Incorporate NGS database – 2016 (Daniel) Altran data management company- in progress: 2nd semester 2015 1- New tools to help in database creation and maintenance (functional genes, ribosomal genes, …) ALL OF THESE UPDATES ARE LINKED WITH OUR RESPECTIVE RUNNING PROJECTS This is probably a critical point for the viability of all databases
  • 26. Future of the PR2 database? 1997 First Database (Daniel Vaulot) 2000 2003 2009 2013 http://keydnatools.com/ http://ssu-rrna.org/pr2 EU PICODIV project (Daniel Vaulot) Available online databases (Laure Guillou) UNIEUK (Colomban) Diversity; metabarcoding = taxonomy is important BUT how these organisms interact each other is primordial AQUASYMBIO: a web site database recording all known symbiotic (mutualistic symbioses, parasites, …) interactions in aquatic systems . French ANR project HAPAR (Guillou Laure and Not Fabrice) AQUASYMBIO (Laure)
  • 27. Described Interactions HOST (Species X) AND SYMBIONT (Species Y) Where? When? Ref + Species Z Diagnosis Live cycle Ilustrations Ref Species X Diagnosis Live cycle Ilustrations Ref Species W Diagnosis Live cycle Ilustrations Ref Species Y Diagnosis Live cycle Ilustrations Ref Species X Species Y Species Z …. Hosts Symbionts Interactome Species description (with Glossary) In progress (1rst release in 2016)