during L. Guillou thesis • 2000-2003 • Access/ARB database maintained by D. Vaulot during PICODIV • 2006-2012 • KeyDNAtools developed by L. Guillou • 2013- mid-2016 • PR2 creation • Coordination: L. Guillou • Database maintained by R. Christen • Web site active until mid-2016 : ssu-rrna.org • Sept 2016 ... • Database maintained by D. Vaulot • raw data put on Figshare • Database included in FROGS • April 2017... • Database transferred to MySQL on SCROL server • Development of R scripts to clean and update the database • November 2017 • Repository on GitHub (versioning)
Contains 47 000 lines • Each name is unique (i.e. does not appear in different columns or different lines). • Any daughter taxon has a unique mother taxon. • Still needs some cleaning
some gb fields have been manually edited (e.g. gb_strain and gb_clone) • manual curate annotations (eg. sample_ fields) • fields computed from gb fields such as longitude and latitude
Biard 2015 Done Chlorophyta M. Tragin 2015 Done Haptophyta B. Edvardsen 2015 Done Stramenopiles Bolidophyceae D. Vaulot 2017 Done Stramenopiles Pelagophyceae D. Vaulot 2017 Done Alveolata Ciliates W. T. Chen, C. Bachy Done Alveolata Dinoflagellates S. Mordret, D. Sarno Done Radiolaria Nasselaria Spumellaria Miguel Mendez In progress • Taxonomy structure cleaned • no redundancy (same taxo name in at 2 different levels) • no taxo name with 2 different parents • Metadata from GenBank incorporated (e.g. environmental / cultures etc...)
Metabarcode assignment : Mothur / Qiime / Dada2 / Usearch • BLAST • Metadata • Note : PhytoRef (Plastid 16S rRNA) has also been transfered to Figshare : https://doi.org/10.6084/m9.figshare.4689826. An easy way to make data widely available
sequences, taxonomy • Update taxonomy: Green algae, Ciliates • Import new GenBank sequences with annotation • Incorporate new metadata (mixotrophs) • Analyze primers (Geisen) Web site • GitHub • Provide new formats • SQLite database • R data set (.rds) • Develop tutorial and example scripts to access PR2 • Develop small shiny apps
• Detect chimeras • Define reference sequences • Provide alignments for specific groups • Incorporate • 16S plastid • Functionalities for web site • BLAST • Visualisation of metadata (position etc...) • Download Chimeras • KeyDNA tools ? Keep informed on Research Gate : https://www.researchgate.net/project/Protist- Ribosomal-Reference-database-PR2