Journal_seminar-popgen.pdf

Slide 1

Slide 1 text

Traceability of mussel (Mytilus chilensis) in Southern Chile using microsatellite molecular markers and assignment algorithms: Exploratory survey Presented by Carmen, Recheta, and Soliven

Slide 2

Slide 2 text

Outline ● Background ○ Marine mussels ○ Genetic traceability and geographic origins ● Methods ● Results ● Discussion ● Conclusion

Slide 3

Slide 3 text

Marine Mussels - One of the most cultivated and marketed bivalves with the Mytilus genus used widely in prepared food products In Southern Chile, mussel culture is an important economic activity yielding 14.9% of the world’s Mytilidae aquaculture production in 2010 (FAO,2012). The international seafood trade has adopted the food chain or “from farm to fork” concept in terms of standards and regulations regarding food quality

Slide 4

Slide 4 text

3 Levels of Seafood Traceability: ● Identiﬁcation of the species ● Geographical origin determination ● Supply chain tracking and tracing

Slide 5

Slide 5 text

3 Levels of Seafood Traceability: ● Identiﬁcation of the species ● Geographical origin determination ● Supply chain tracking and tracing DNA-based methods

Slide 6

Slide 6 text

Genetic Traceability - DNA-based methods can verify the accuracy of traditional identiﬁcation methods such as product labeling - Determine species present in the product (DNA barcoding) - Identiﬁcation of breeds, local populations - Employing different DNA markers - Coding regions - Non-coding regions (more commonly used)

Slide 7

Slide 7 text

First level traceability: genetic species identification ● Employed to investigate commercial fraud by species substitution ● PCR-based assays allows for identification of various Mytilus species ○ RAPD (randomly amplified polymorphic DNA) ○ FINS (forensically relevant nucleotide sequencing) ○ AFLP (amplified fragment length polymorphisms

Slide 8

Slide 8 text

Second level of traceability: Geographical Origin ● Cannot be established merely by identifying the species due to worldwide distribution of Mytilus ● Equivalent to identifying its biological population (Ogden, 2008) ● Various chemical markers have been used (trace elements coupled with volatile compound analysis) ● DNA-based methods to study genetic diversity and population structure ○ Allozymes ○ Mitochondrial COI gene sequence ○ SNPs ○ Microsatellites

Slide 9

Slide 9 text

Assignment methods ● Assignment methods use genetic information to ascertain population (predeﬁned) or cluster (non-predeﬁned) membership of individuals ● Frequentist or likelihood methods ○ Frequentist - statistical hypothesis testing, giving a p value derived from a frequency distribution ○ Likelihood - assumes that observed data arises from a probabilistic model with unknown parameters ● Bayesian approach ○ Derives the posterior distribution, taking into account prior probability and a likelihood function derived from the observed data, allowing for incorporation of existing knowledge into data analysis

Slide 10

Slide 10 text

Objective To evaluate the potential of microsatellite markers combined with allocation algorithms for assigning Mytilus individuals from southern Chile to their geographical origin

Slide 11

Slide 11 text

MATERIALS & METHODS

Slide 12

Slide 12 text

Mussel Sample Collection and SSR Genotyping - Samples of mussels (n=50 by location) were collected in Southern Chile in six sites (inc. 1 wild population and 5 seed collection centers) - DNA extraction by phenol-chloroform method - Ampliﬁcation of 9 SSR loci - Genotyping: polyacrylamide gels (6%) with silver staining

Slide 13

Slide 13 text

Data Analysis ● Micro-Checker - used to test for the presence of null alleles, stuttering and large allele dropout ● GENEPOP 4.0.10 - used to test for Hardy Weinberg equilibrium and estimate Wright’s ﬁxation index (FST )

Slide 14

Slide 14 text

Data Analysis Four assignment methods to assign reference population as possible origin of individuals: ● Genetic distance-based criterion ● Frequency-based method ● Bayesian-based method - operating in an “unsupervised” mode ● Bayesian-based method - operating in a “supervised” mode

Slide 15

Slide 15 text

Data Analysis ● GeneClass2 - used for genetic distance and allele frequency - assigning a population of origin for each individual ● Structure 2.3.3 - used for Bayesian method - assignment of individuals were conducted under the mixture model - 50,000 initial burn-in, 100,000 Markov chain iterations - K = 6

Slide 16

Slide 16 text

Data Analysis Evaluation of the performance of all four assignment methods: ● Sensitivity (S) = # of individuals correctly assigned to their original location/ total # of individuals sampled from that location - Reflects how good a test is at correctly assigning individuals to a location ● Specificity (E) = # of individuals properly excluded from the population/total # of individuals who do not belong to the population - Reflects how good the test is at correctly excluding individuals who do not belong to that location ● Average probability assignment score (AP) - average of the likelihood of each successful re-assignment to the respective location ● Likelihood ratio (LR +) = s/(1-E) - How many times more (or less) likely it is that individuals belonging to a location will be assigned to this location as compared to individuals belonging to another location

Slide 17

Slide 17 text

RESULTS & DISCUSSION

Slide 18

Slide 18 text

Results and Discussion ● All 9 loci are polymorphic and a total of 68 alleles are observed ● Global FST - genetic differentiation between the 6 sites was 0.042 - Only 4.2% of total allele frequency variance lies among sample sites and is highly signiﬁcant (P<0.001) - 95.8% of the variance between allele frequencies is explained by variation within sites - In other words, the groups (as deﬁned by sampling sites) are very similar to one another

Slide 19

Slide 19 text

Results and Discussion: Assignment performance

Slide 20

Slide 20 text

35% Accuracy for Bayesian method without prior information

Slide 21

Slide 21 text

50.7% Accuracy for distance based method

Slide 22

Slide 22 text

50.7% Accuracy for frequency based method

Slide 23

Slide 23 text

95.3% Accuracy for Bayesian method with prior information

Slide 24

Slide 24 text

Results and Discussion Power of assignment depends closely on the level of population structure In this study, global Fst is low at 0.042

Slide 25

Slide 25 text

Results and Discussion The study performed better than: The study performed poorly compared to: Blue marlin between more separated locations (41% accuracy) Golden humped tench (64-92% accuracy) Based on literature, assignment accuracy for farmed populations is better than those found in wild populations (likely due to artiﬁcial selection and gene ﬂow being less likely between farms)

Slide 26

Slide 26 text

CONCLUSION & RECOMMENDATION

Slide 27

Slide 27 text

Conclusion ● The frequency-based algorithm showed the best performance in matching Mytilus individuals from Southern Chile to their geographical origin - 50% of correct assignment in a challenging scenario with low genetic differentiation among locations (Global Fst = 0.042)

Slide 28

Slide 28 text

Recommendation ● Increase assignment performance by increasing the number of microsatellite loci and/or adding SNPs

Slide 29

Slide 29 text

THANK YOU!