DSRD.pdf

Bayesian Methods for Detecting and Characterizing Planets Around Other Stars
Benjamin Nelson (@exobenelson)
NU Data Science Scholar, Insight Data Science Fellow
June 25, 2018, Data Science Research Day

How Do We Find Exoplanets?
Transit
Radial velocity

Statistical Challenges in Characterizing Exoplanets
Is there evidence for a planet in my data? (frequentist vs. Bayesian model comparison)
For one system, what are the planet properties (orbital period, eccentricity, mass, etc.)? (sampling in ~10s of parameters)
What can be inferred from populations of exoplanets? (hierarchical Bayesian modeling; sampling in ~100s to 1000s of parameters)

What does it mean to “discover” a planet?
Frequentist approach: reject the null hypothesis that a model without a planet could reasonably explain the data.
Bayesian approach: the evidence (i.e., marginalized likelihood) for a model with the planet is much greater than for alternative models without the planet.

Computing the “evidence” for an n-planet model
$p(d \mid M) = \int p(\theta \mid M)\, p(d \mid \theta, M)\, d\theta$
Here $p(\theta \mid M)$ is the prior probability distribution, $p(d \mid \theta, M)$ is the likelihood function (i.e., sampling distribution), and $p(d \mid M)$ is the fully marginalized likelihood (i.e., Bayesian “evidence”).
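For a model with a single parameter, the evidence integral can be evaluated directly on a grid, which makes the definition concrete. The sketch below assumes a toy problem that is not from the talk: measurements with known Gaussian noise `sigma` around an unknown constant `theta`, with a zero-mean Gaussian prior of width `tau`. All function names and numbers are illustrative. This conjugate case has a closed-form evidence, so the quadrature can be checked against it.

```python
import math


def log_normal_pdf(x, mean, sd):
    """Log density of a normal distribution."""
    return -0.5 * math.log(2.0 * math.pi * sd * sd) - 0.5 * ((x - mean) / sd) ** 2


def log_evidence_grid(data, sigma, tau, half_width=12.0, n_grid=20001):
    """Trapezoid-rule estimate of log p(d|M) = log of integral prior * likelihood."""
    lo, hi = -half_width * tau, half_width * tau
    step = (hi - lo) / (n_grid - 1)
    log_terms = []
    for i in range(n_grid):
        theta = lo + i * step
        log_prior = log_normal_pdf(theta, 0.0, tau)
        log_like = sum(log_normal_pdf(d, theta, sigma) for d in data)
        weight = 0.5 if i in (0, n_grid - 1) else 1.0  # trapezoid end points
        log_terms.append(math.log(weight * step) + log_prior + log_like)
    peak = max(log_terms)  # log-sum-exp for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in log_terms))


def log_evidence_exact(data, sigma, tau):
    """Closed-form log evidence for this conjugate Gaussian toy model."""
    n = len(data)
    s1 = sum(data)
    s2 = sum(d * d for d in data)
    var_n = sigma ** 2 + n * tau ** 2
    return (-0.5 * n * math.log(2.0 * math.pi * sigma ** 2)
            + 0.5 * math.log(sigma ** 2 / var_n)
            - s2 / (2.0 * sigma ** 2)
            + tau ** 2 * s1 ** 2 / (2.0 * sigma ** 2 * var_n))


data = [1.2, 0.7, 1.9, 1.1, 0.4]  # made-up measurements
numeric = log_evidence_grid(data, sigma=0.5, tau=2.0)
exact = log_evidence_exact(data, sigma=0.5, tau=2.0)
print(numeric, exact)
```

A grid needs exponentially more points as the number of parameters grows, which is why multi-planet models call for sampling-based estimators instead.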

Computing the “evidence” for an n-planet model
$p(d \mid M) = \int p(\theta \mid M)\, p(d \mid \theta, M)\, d\theta$
Thermodynamic integration (HD208487, Gregory 2007)
Nested sampling / MultiNest (GJ667C, Feroz & Hobson 2014)
Geometric path Monte Carlo (GJ581, Hou+ 2014)
Transdimensional MCMC with nested sampling (ν Oph, Brewer & Donovan 2015)
Importance sampling (GJ876, Nelson+ 2016; HD9174, Jenkins+ 2017)
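Of the estimators listed, importance sampling is the simplest to sketch: draw parameters from a proposal density q and average the ratio of prior times likelihood to q. The example below is a minimal illustration on an assumed one-datum Gaussian toy model, not the method of any cited paper. The proposal is a widened copy of the analytically known posterior; in practice it would more likely be fit to posterior samples. The names `log_evidence_importance` and `widen` are hypothetical.

```python
import math
import random


def log_normal_pdf(x, mean, sd):
    """Log density of a normal distribution."""
    return -0.5 * math.log(2.0 * math.pi * sd * sd) - 0.5 * ((x - mean) / sd) ** 2


def log_evidence_importance(d, sigma, tau, n_draws=5000, widen=1.5, seed=1):
    """Importance-sampling estimate of log p(d|M) for d ~ N(theta, sigma), theta ~ N(0, tau)."""
    rng = random.Random(seed)
    # Posterior of the toy model (assumed known here), used to build the proposal.
    post_mean = tau ** 2 * d / (sigma ** 2 + tau ** 2)
    post_sd = math.sqrt(sigma ** 2 * tau ** 2 / (sigma ** 2 + tau ** 2))
    q_sd = widen * post_sd  # heavier than the posterior so weights stay bounded
    log_weights = []
    for _ in range(n_draws):
        theta = rng.gauss(post_mean, q_sd)
        log_weights.append(log_normal_pdf(theta, 0.0, tau)       # prior
                           + log_normal_pdf(d, theta, sigma)     # likelihood
                           - log_normal_pdf(theta, post_mean, q_sd))  # proposal
    peak = max(log_weights)
    mean_weight = sum(math.exp(w - peak) for w in log_weights) / n_draws
    return peak + math.log(mean_weight)


d, sigma, tau = 1.3, 0.5, 2.0  # made-up values
estimate = log_evidence_importance(d, sigma, tau)
exact = log_normal_pdf(d, 0.0, math.sqrt(sigma ** 2 + tau ** 2))  # closed form
print(estimate, exact)
```

The estimate is only as good as the overlap between the proposal and the posterior, so a proposal that is too narrow can silently underestimate the evidence.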

Evidence Challenge

Evidence Challenge
How accurately/precisely can one compute the “evidence” for {0, 1, 2, 3} planets in RV data, given a set of priors and likelihood function?
$p(d \mid M) = \int p(\theta \mid M)\, p(d \mid \theta, M)\, d\theta$

EPRV3 Evidence Challenge
More details and results at: github.com/EPRV3EvidenceChallenge/
Methods teams submitted:
Frequentist: BIC; leave-one-out cross-validation; time-series cross-validation
Bayesian: Chib’s approximation; Laplace approximation; Laplace approximation + l1 periodogram; Perrakis estimator; importance sampling + MCMC; importance sampling + variational Bayes; nested sampling (MultiNest); nested sampling + MCMC; diffusive nested sampling (DNest4)
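As an illustration of the simplest entry in the list, BIC trades goodness of fit against parameter count: BIC = k ln(n) - 2 ln(L_max), with the lower value preferred. The sketch below compares a 0-planet (constant) model with a 1-planet circular-orbit model on synthetic radial velocities. Everything in it is an assumption for illustration: the period is treated as known, the noise level is known, and the evenly sampled times span whole periods so the least-squares fit reduces to simple projections.

```python
import math
import random


def gaussian_log_like(residuals, sigma):
    """Log-likelihood of residuals under independent Gaussian noise."""
    n = len(residuals)
    return (-0.5 * n * math.log(2.0 * math.pi * sigma ** 2)
            - 0.5 * sum(r * r for r in residuals) / sigma ** 2)


def bic(max_log_like, n_params, n_data):
    """Bayesian information criterion (lower is better)."""
    return n_params * math.log(n_data) - 2.0 * max_log_like


# Synthetic data: one circular-orbit signal plus noise (all values made up).
rng = random.Random(42)
period, amplitude, sigma = 10.0, 5.0, 2.0
n_obs = 40
times = [float(i) for i in range(n_obs)]  # exactly 4 periods, evenly sampled
rv = [amplitude * math.sin(2.0 * math.pi * t / period) + rng.gauss(0.0, sigma)
      for t in times]

# 0-planet model: constant offset only (1 parameter).
offset = sum(rv) / n_obs
res_zero = [v - offset for v in rv]

# 1-planet model: offset + a*sin + b*cos at the known period (3 parameters).
# With this sampling the three basis columns are orthogonal, so the
# least-squares coefficients are plain projections.
sines = [math.sin(2.0 * math.pi * t / period) for t in times]
cosines = [math.cos(2.0 * math.pi * t / period) for t in times]
a = 2.0 / n_obs * sum(v * s for v, s in zip(rv, sines))
b = 2.0 / n_obs * sum(v * c for v, c in zip(rv, cosines))
res_one = [v - offset - a * s - b * c for v, s, c in zip(rv, sines, cosines)]

bic_zero = bic(gaussian_log_like(res_zero, sigma), 1, n_obs)
bic_one = bic(gaussian_log_like(res_one, sigma), 3, n_obs)
print(bic_zero, bic_one)
```

A real analysis would also fit the period, eccentricity, and noise terms, which changes both the maximum likelihood and the parameter count.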

What different methods say about n vs n+1 planets
[Figure: log odds ratio versus dataset number; labels: Broad, Narrow]
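The quantity plotted, the log odds ratio between an (n+1)-planet and an n-planet model, follows directly from the two log evidences plus the log of the prior odds. A minimal sketch, with a hypothetical helper name and made-up log-evidence values:

```python
import math


def log_odds_ratio(log_z_more, log_z_fewer, prior_odds=1.0):
    """Log posterior odds of the larger model over the smaller one."""
    return (log_z_more - log_z_fewer) + math.log(prior_odds)


# Hypothetical log evidences keyed by number of planets (not from the talk).
log_z = {0: -410.2, 1: -385.7, 2: -384.9}
for n in (0, 1):
    print(n, "vs", n + 1, log_odds_ratio(log_z[n + 1], log_z[n]))
```

With equal prior odds the log odds ratio is just the log Bayes factor, so disagreement between methods on this number traces back to disagreement on the evidences themselves.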

Fitting a simple 1-planet model
corner.py (pip install corner)

Fitting a complex 5-planet model
corner.py (pip install corner)

Different ways to do MCMC
Nelson, Ford, & Payne (2014): Radial velocity Using N-body Differential evolution Markov Chain Monte Carlo
Goodman & Weare (2010), Foreman-Mackey+ (2013): affine-invariant ensemble sampler
Salvatier, Wiecki, & Fonnesbeck (2016): wide variety of samplers
Hoffman & Gelman (2011), Carpenter+ (2017): Hamiltonian Monte Carlo, No U-Turn Sampler
Comparing the performance of these Python packages:
jakevdp.github.io/blog/2014/06/14/frequentism-and-bayesianism-4-bayesian-in-python/
andrewgelman.com/2015/10/15/whats-the-one-thing-you-have-to-know-about-pystan-and-pymc-click-here-to-find-out/
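To show what an affine-invariant ensemble sampler does, here is a minimal pure-Python sketch of the stretch move described by Goodman & Weare (2010), updating walkers one at a time. It is a teaching sketch under stated assumptions, not the implementation of any package: the target is a made-up two-dimensional Gaussian with very different widths per axis, and `stretch_move_sampler` is a hypothetical name.

```python
import math
import random


def log_target(x):
    """Toy log density: independent Gaussians with widths 1.0 and 0.1."""
    return -0.5 * ((x[0] - 1.0) / 1.0) ** 2 - 0.5 * ((x[1] + 2.0) / 0.1) ** 2


def stretch_move_sampler(log_prob, walkers, n_steps, a=2.0, seed=0):
    """Serial stretch-move ensemble sampler; returns walker positions per step."""
    rng = random.Random(seed)
    n_walkers, n_dim = len(walkers), len(walkers[0])
    walkers = [list(w) for w in walkers]
    log_p = [log_prob(w) for w in walkers]
    chain = []
    for _ in range(n_steps):
        for k in range(n_walkers):
            # Pick a different walker j uniformly at random.
            j = rng.randrange(n_walkers - 1)
            if j >= k:
                j += 1
            # Draw z with density proportional to 1/sqrt(z) on [1/a, a].
            z = ((a - 1.0) * rng.random() + 1.0) ** 2 / a
            proposal = [walkers[j][i] + z * (walkers[k][i] - walkers[j][i])
                        for i in range(n_dim)]
            log_p_new = log_prob(proposal)
            log_accept = (n_dim - 1) * math.log(z) + log_p_new - log_p[k]
            if math.log(1.0 - rng.random()) < log_accept:
                walkers[k], log_p[k] = proposal, log_p_new
        chain.append([list(w) for w in walkers])
    return chain


init_rng = random.Random(123)
start = [[init_rng.gauss(0.0, 1.0), init_rng.gauss(0.0, 1.0)] for _ in range(20)]
chain = stretch_move_sampler(log_target, start, n_steps=2000)
kept = [w for step in chain[500:] for w in step]  # discard burn-in
mean_x0 = sum(w[0] for w in kept) / len(kept)
mean_x1 = sum(w[1] for w in kept) / len(kept)
print(mean_x0, mean_x1)
```

Because proposals are built from the positions of other walkers, the sampler needs no per-axis step sizes, which is the practical appeal for parameters with very different scales.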

Hot Jupiter Formation: How do they form?
Disk migration
Eccentric migration
[Figure: before and after panels for each pathway. Source: http://jila.colorado.edu/~pja/planet_migration.html]

Modeling Two Overlapping Populations
probabilistic graphical models with daft (pip install daft)
Levels in the graph: population-level parameters, individual-level parameters, data
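The three levels of such a model (population-level parameters, individual-level parameters, data) can be made concrete with a small simulation. The sketch below is purely illustrative: the two Gaussian populations, the measured quantity, and every numerical value are assumptions, not the model from the talk. It also maximizes a marginal likelihood over the mixture fraction on a grid instead of sampling a full posterior.

```python
import math
import random

# Population-level parameters (all hypothetical values).
FRACTION_A = 0.85
MEAN_A, SD_A = 2.0, 0.5
MEAN_B, SD_B = 5.0, 0.8
NOISE_SD = 0.3  # measurement uncertainty on each object


def normal_pdf(x, mean, sd):
    return math.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))


def simulate(n_objects, seed=3):
    """Draw individual-level values from the mixture, then noisy data."""
    rng = random.Random(seed)
    observed = []
    for _ in range(n_objects):
        if rng.random() < FRACTION_A:
            true_value = rng.gauss(MEAN_A, SD_A)  # individual-level parameter
        else:
            true_value = rng.gauss(MEAN_B, SD_B)
        observed.append(rng.gauss(true_value, NOISE_SD))  # data
    return observed


def log_like_fraction(fraction_a, observed):
    """Log-likelihood of the mixture fraction, with each object's true value
    and population membership marginalized analytically."""
    sd_a = math.hypot(SD_A, NOISE_SD)
    sd_b = math.hypot(SD_B, NOISE_SD)
    return sum(math.log(fraction_a * normal_pdf(y, MEAN_A, sd_a)
                        + (1.0 - fraction_a) * normal_pdf(y, MEAN_B, sd_b))
               for y in observed)


observed = simulate(1000)
grid = [i / 100.0 for i in range(1, 100)]
best_fraction = max(grid, key=lambda f: log_like_fraction(f, observed))
print(best_fraction)
```

In a full hierarchical analysis the population means and widths would be unknown too, and the individual-level parameters would be sampled alongside them, which is what pushes the dimension into the hundreds.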

Sampling a 300+ Dimensional Space
Hamiltonian Monte Carlo (arXiv: 1701.02434) + No U-Turn Sampler (Hoffman & Gelman 2011)
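The core of Hamiltonian Monte Carlo is a leapfrog integration of position and momentum followed by an accept/reject step on the total energy. The sketch below implements plain HMC with a fixed number of leapfrog steps on an assumed three-dimensional Gaussian target. It is not the No U-Turn Sampler, which chooses the trajectory length adaptively, and the step size and step count here are hand-picked for this toy problem.

```python
import math
import random

MEANS = [0.0, 3.0, -1.0]  # toy target, made-up values
SDS = [1.0, 0.5, 2.0]


def log_prob_and_grad(x):
    """Log density (up to a constant) and its gradient for the toy Gaussian."""
    logp, grad = 0.0, []
    for xi, mean, sd in zip(x, MEANS, SDS):
        z = (xi - mean) / sd
        logp -= 0.5 * z * z
        grad.append(-z / sd)
    return logp, grad


def hmc(target, x0, n_samples, step_size=0.2, n_leapfrog=20, seed=0):
    """Plain HMC with unit mass matrix and fixed trajectory length."""
    rng = random.Random(seed)
    x = list(x0)
    logp, grad = target(x)
    samples = []
    for _ in range(n_samples):
        p = [rng.gauss(0.0, 1.0) for _unused in x]  # fresh momentum
        h_old = -logp + 0.5 * sum(pi * pi for pi in p)
        x_new, logp_new, grad_new = list(x), logp, list(grad)
        for _step in range(n_leapfrog):
            p = [pi + 0.5 * step_size * g for pi, g in zip(p, grad_new)]  # half kick
            x_new = [xi + step_size * pi for xi, pi in zip(x_new, p)]     # drift
            logp_new, grad_new = target(x_new)
            p = [pi + 0.5 * step_size * g for pi, g in zip(p, grad_new)]  # half kick
        h_new = -logp_new + 0.5 * sum(pi * pi for pi in p)
        # Metropolis correction for integration error.
        if math.log(1.0 - rng.random()) < h_old - h_new:
            x, logp, grad = x_new, logp_new, grad_new
        samples.append(list(x))
    return samples


samples = hmc(log_prob_and_grad, [0.0, 0.0, 0.0], n_samples=3000)
kept = samples[500:]  # discard burn-in
sample_means = [sum(s[i] for s in kept) / len(kept) for i in range(3)]
print(sample_means)
```

Gradient information lets each proposal travel far across the posterior while keeping a high acceptance rate, which is what makes spaces with hundreds of dimensions tractable.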

Building a Stan model

Within the limitations of our chosen models…
Multiple HJ populations can be inferred from current data.
RV+Kepler data are well explained as a single population with xl ≈ 2.
For HAT+WASP data: 85% consistent with high-e migration history; 15% consistent with disk migration history.

Want to play around with different sampling methods?
chi-feng.github.io/mcmc-demo/
