Prediction is very hard, Prediction is very hard, especially about the future especially about the future -Niels Bohr, Danish physicist (1885-1962) -Niels Bohr, Danish physicist (1885-1962)
Outcome A Outcome B Outcome C Outcome E Outcome F Do A Do B Do C Do D Do Nothing t = 0 t = 1 Time In a changing world, not making a decision has consequences, intended or otherwise.
How do we make the best use of the data and theory that we have to make predictions which take into account (potentially large) inherent uncertainties?
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Data The Process
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Data The Process
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes Data The Process
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes Data The Process
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes How well can we recapture patterns and processes? (parameter estimation, model discrimination, & derived variables) Data The Process pseudo-data
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes How well can we recapture patterns and processes? (parameter estimation, model discrimination, & derived variables) Data The Process pseudo-data
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes How well can we recapture patterns and processes? (parameter estimation, model discrimination, & derived variables) Data The Process pseudo-data
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes How well can we recapture patterns and processes? (parameter estimation, model discrimination, & derived variables) Data The Process pseudo-data
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes How well can we recapture patterns and processes? (parameter estimation, model discrimination, & derived variables) Does it fit the real data? Data The Process pseudo-data
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes How well can we recapture patterns and processes? (parameter estimation, model discrimination, & derived variables) Does it fit the real data? Data The Process pseudo-data
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes How well can we recapture patterns and processes? (parameter estimation, model discrimination, & derived variables) Does it fit the real data? Test Hypotheses Make forecasts (Forward Simulation) Data The Process pseudo-data
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes How well can we recapture patterns and processes? (parameter estimation, model discrimination, & derived variables) Does it fit the real data? Test Hypotheses Make forecasts (Forward Simulation) Optimize Decisions Scenario Analysis Data The Process pseudo-data
Invasive forest insects 1. International trade has many externalities 2. Total damages of existing pests 3. Estimate the probability of new high impact pest A. Guilds: which pathways? B. Economic sectors: who pays the costs?
• Base line information lacking – Compile all known non-indigenous forest pests – Identify short list of intermediate damaging pests – • National economic estimates lacking – In depth analysis of the most damaging pests – 3 guilds (borers, sap suckers, foliage feeders) – 3 economic cost sectors (government, households, market) Emerald Ash Borer Hemlock Woolly Adelgid Gypsy Moth
( ) ( ) ) ( ) | ( ) ( | | Pr 1 ϑ ϑ ϑ ϑ ϑ P c f P P M = m m ∝ ∝ ∏ c c If we knew the cost of each pest, we can fit our models using the simple likelihood function. 0 1 2 3 4 5 6 7 8 0 2 4 6 8 Cost ($) Frequency of pests
0 1 2 3 4 5 6 7 8 0 2 4 6 8 Cost ($) Frequency of pests 78 13 1 Pr (ϑ∣d)∝ [∏ i=1 I P(low∣ϑ) x∏ j=1 J P(intermediate∣ϑ) x∏ k=1 K P(high∣ϑ) ]P(ϑ) What we have are frequencies of species in different impact ranges.
The Framework: A) Species Frequencies in 3 categories. B) Which Model? C) Model estimation D) Probability distribution of derived variable of interest (total cost, probability of new high impact pest) Aukema JE, Leung B, Kovacs K, Chivers C, Britton KO, et al. (2011) Economic Impacts of Non-Native Forest Insects in the Continental United States. PLoS ONE 6(9): e24587
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes How well can we recapture patterns and processes? (parameter estimation, model discrimination, & derived variables) Does it fit the real data? Test Hypotheses Make forecasts (Forward Simulation) Optimize Decisions Scenario Analysis Data The Process pseudo-data
Results Highest impact are the Borer guild. Costs are born primarily by local governments. ~1.7 Billion USD per annum Single most damaging pests in each guild accounts for 25-50% of the total impacts. At current establishment rates ~32% chance of another high impact pest in the next ten years. Aukema JE, Leung B, Kovacs K, Chivers C, Britton KO, et al. (2011) Economic Impacts of Non-Native Forest Insects in the Continental United States. PLoS ONE 6(9): e24587
More imports, this time of the fishy sort. ● Given a proxy measure of propagule pressure, how well can we estimate the risk of establishment? Bradie, J., Chivers, C. & Leung, B. (2013) Importing risk: quantifying the propagule-pressure establishment relationship at the pathway level. in press Diversity and Distributions.
● Not all species are equally likely to establish ● In the absence of species specific lifehistory information, how well can we estimate overal pathway risk?
Bradie, J., Chivers, C. & Leung, B. (2013) Importing risk: quantifying the propagule-pressure establishment relationship at the pathway level. in press Diversity and Distributions. Effects of unaccounted variability
Results At an import level of 100,000 individuals, establishment risk of 19% Importing 1 million individuals leads to just under a 1 in 2 chance of establishment. Bradie, J., Chivers, C. & Leung, B. (2013) Importing risk: quantifying the propagule-pressure establishment relationship at the pathway level. in press Diversity and Distributions.
Alternative models of human behaviour ● Gravity Model – 'Pull' of attractive lakes ● Random Utility Model – Rational utility maximizers (Schneider et al. 1998, Leung et al. 2004, 2006) (Moore et al. 2005, Timar and Phaneuf 2009)
Alternative models of human behaviour ● Gravity Model – 'Pull' of attractive lakes ● Random Utility Model – Rational utility maximizers PGM T nj =A n W j e D nj −d , n=1,... ,n , j=1,..., J. A n =1/∑ k =1 L W k e D nk −d . U nj =V nj +ϵ nj , n=1,... , N , j=1,... J V nj = X nj PRUM T nj = expV nj ∑ k=1 J expV nk , n=1,... , N , j=1,... , J (Schneider et al. 1998, Leung et al. 2004, 2006) (Moore et al. 2005, Timar and Phaneuf 2009)
Figure A1: Simulated trip outcomes in a landscape of lakes with induced spatial auto-correlation. Size of circle is proportional to the size of the simulated lake.
Figure A3: Generating vs maximum likelihood estimates for the four parameters (panels) of the random utility model. The 1:1 line is also plotted for comparison. Figure A2: Generating vs maximum likelihood estimates for the four parameters (panels) of the gravity model. The 1:1 line is also plotted for comparison. Re-capture the parameter values?
What information do I have? What can I go out and observe? What are the hypothesized processes which generated the data? Theory/models Simulate Hypothesized Biological Processes How well can we recapture patterns and processes? (parameter estimation, model discrimination, & derived variables) Does it fit the real data? Test Hypotheses Make forecasts (Forward Simulation) Optimize Decisions Scenario Analysis Data Methodology for decision support pseudo-data
Thank you Supervisors: Dr. Brian Leung Dr. Elena Bennett Dr. Claire De Mazancourt Dr. Gregor Fussman Lab Mates: Johanna Bradie Paul Edwards Kristina Marie Enciso Andrew Sellers Lidia Della Venezia Erin Gertzen