Typical Sets: What They Are and How to (Hopefully) Find Them

Typical Sets: What They Are and How to (Hopefully) Find
Them Josh Speagle [email protected] Based on this talk by Michael Betancourt at StanCon.

Intended Audience • Some experience with the basics of Bayesian
statistics.

statistics. • Some experience using MCMC for research.

statistics. • Some experience using MCMC for research. • Have heard of ensemble sampling methods such as emcee.

Bayesian Inference

Bayesian Inference Pr , M = Pr , M Pr
|M Pr M Bayes’ Theorem

|M Pr M Bayes’ Theorem Parameters

|M Pr M Bayes’ Theorem Data Parameters

|M Pr M Bayes’ Theorem Data Parameters Model

|M Pr M Bayes’ Theorem

|M Pr M Bayes’ Theorem Prior

|M Pr M Bayes’ Theorem Prior Likelihood

|M Pr M Bayes’ Theorem Prior Likelihood Posterior

|M Pr M Bayes’ Theorem Prior Likelihood Posterior Evidence

Bayesian Inference = ℒ Bayes’ Theorem ≡ Ω ℒ Posterior
Likelihood Prior Evidence

Bayesian Inference = ℒ Bayes’ Theorem Posterior Likelihood Prior Evidence
≡ Ω ℒ

Where is the posterior? ≡ Ω

Where is the posterior? ≡ {: =}

Where is the posterior? ≡ 0 ∞

Where is the posterior? ≡ 0 ∞ =

Where is the posterior? ≡ 0 ∞ “Amplitude” “Volume” =

= Where is the posterior? ≡ 0 ∞ “Typical Set”

Typical Sets: Gaussian Example

Typical Sets: Gaussian Example ∝ 0 ∞ − 2 2

Typical Sets: Gaussian Example ∝ 0 ∞ − 2 2
∝ 0 ∞ − 2 2 −1

Typical Distance Typical Sets: Gaussian Example

MCMC wants to draw samples from this “shell”

Tension in the Metropolis Update ′ = min 1, ′
′ ′

′ ′ Proposal

′ ′ “Volume”

′ ′ “Volume” “Amplitude”

Metropolis-Hastings

Metropolis-Hastings ′ = Normal ′ = , =

Metropolis-Hastings ′ = Normal ′ = , = Typical Distance

Metropolis-Hastings ′ = Normal ′ = , =

Ideal Metropolis-Hastings ′ = Normal ′ = , = Typical
Separation

Ideal Metropolis-Hastings ′ = Normal ′ = , = Typical
Separation M-H

Ideal Metropolis-Hastings ′ = Normal ′ = , = s
Typical Separation Adaptive M-H

Ensemble Sampling

emcee ′ = min 1, ′ −1 ~ = 1
from 1 , 0 otherwise “Stretch” factor

Ideal Typical Separation emcee M-H

Ideal Typical Separation emcee M-H emcee

Ideal Typical Separation emcee M-H emcee After weighting by acceptance
probability

emcee ′ = min 1, ′ −1 ~ = 1
from 1 , 0 otherwise “Stretch” factor

emcee ′ = min 1, ′ −1 ~ = 1
from 1 , 0 otherwise “Stretch” factor 

Summary • Volume scales as . • The posterior density
depends on both volume and amplitude. • Most of the posterior is concentrated in a “shell” around the best solution called the typical set. • MCMC draws samples from the typical set.

But what about corner plots?

But what about corner plots? 2-dimensional projection of D-dimensional shell

Hamiltonian Monte Carlo

Hamiltonian Monte Carlo Treat the particle at position q as
a point mass with mass matrix M and momentum p. Pr , ∝ , = − −1 2 Hamiltonian

Hamiltonian Monte Carlo Pr , ∝ , = − −1
2 Treat the particle at position q as a point mass with mass matrix M and momentum p. = = −1 = − = ln Hamiltonian Hamilton’s Equations

Hamiltonian Monte Carlo ′, −′ , = min 1, Pr
′, −′ Pr , ∼ Normal = , =

Typical Distance Hamiltonian Monte Carlo ∼ Normal = , =

Ideal Typical Separation M-H emcee Hamiltonian Monte Carlo ∼ Normal
= , =

Ideal Typical Separation M-H emcee Hamiltonian Monte Carlo ∼ Normal
= , = HMC

Typical Sets: What They Are and How to (Hopeful...

Typical Sets: What They Are and How to (Hopefully) Find Them

More Decks by Josh Speagle

Other Decks in Research

Featured

Transcript