Open Software for Astrophysics, AAS241

OPEN SOFTWARE FOR ASTROPHYSICS Dan Foreman-Mackey

case study: Gaussian Processes

AAS 225 / 2015 / Seattle AAS 231 / 2018
/ National Harbor

°0.6 °0.3 0.0 0.3 0.6 raw [ppt] 0 5 10
15 20 25 time [days] °0.30 °0.15 0.00 de-trended [ppt] N = 1000 reference: DFM+ (2017)

reference: Aigrain & DFM (2022)

reference: Aigrain & DFM (2022) ignoring correlated noise accounting for
correlated noise

reference: Aigrain & DFM (2022)

a Gaussian Process is a drop - in replacement for
chi - squared

more details: Aigrain & Foreman-Mackey (2023) arXiv:2209.08940

7 [1] model building [2] computational cost

k(tn , tm ; θ) “kernel” or “covariance”

import george import celerite import tinygp

my f i rst try: george 1

import numpy as np def log_likelihood(params, x, diag, r) :
K = build_kernel_matrix(params, x, diag) gof = r.T @ np.linalg.solve(K, r) norm = np.linalg.slogdet(K)[1] return -0.5 * (gof + norm)

k(tn , tm ; θ) “kernel” or “covariance”

from george.kernels import * k1 = 1.5 * ExpSquaredKernel(2.3) k2
= 5.5 * Matern32Kernel(0.1) kernel = 0.5 * (k1 + k2)

from george import GP gp = GP(kernel) gp.compute(x, yerr) gp.log_likelihood(y)

from george import GP gp = GP(kernel) gp.compute(x, yerr) gp.log_likelihood(y)
gp.f i t(y) ???

the astronomical Python ecosystem + MANY MORE!

* API design (library vs scripts) * don’t reinvent the
wheel

faster: celerite* 2 * yes, that truly is how you
pronounce it…

import numpy as np def log_likelihood(params, x, diag, r) :
K = build_kernel_matrix(params, x, diag) gof = r.T @ np.linalg.solve(K, r) norm = np.linalg.slogdet(K)[1] return -0.5 * (gof + norm)

“semi/quasi - separable” matrices

102 103 104 105 number of data points [N] 10
5 10 4 10 3 10 2 10 1 100 computational cost [seconds] 1 2 4 8 16 32 64 128 256 direct O(N) 100 101 number o reference: DFM, Agol, Ambikasaran, Angus (2017)

102 103 104 105 number of data points [N] 10
4 10 3 10 2 10 1 100 computational cost [seconds] 1 2 4 8 16 32 64 128 256 O(N) 100 101 number o reference: DFM, Agol, Ambikasaran, Angus (2017)

+ + vs

* interdisciplinary collaboration * importance of implementation

7 [1] 1 (ish) dimensional input [2] specif i c
type of kernel restrictions:

modern infrastructure: tinygp 3

what’s missing from the astronomical Python ecosystem?

7 [1] differentiable programming [2] hardware acceleration

the broader numerical computing Python ecosystem + SO MANY MORE!

jax.readthedocs.io

import numpy as np def linear_least_squares(x, y) : A =
np.vander(x, 2) return np.linalg.lstsq(A, y)[0]

import jax.numpy as jnp def linear_least_squares(x, y) : A =
jnp.vander(x, 2) return jnp.linalg.lstsq(A, y)[0]

import jax.numpy as jnp @jax.jit def linear_least_squares(x, y) : A
= jnp.vander(x, 2) return jnp.linalg.lstsq(A, y)[0]

tinygp.readthedocs.io

the broader numerical computing Python ecosystem + SO MANY MORE!

* I <3 JAX * don’t reinvent the wheel

the why & how of open software in astrophysics

credit: Adrian Price-Whelan / / data: SAO/NASA ADS

takeaways

open software is foundational to astrophysics research let’s consider &
discuss interface design and user interaction leverage existing infrastructure & learn when to start fresh

get in touch! dfm.io github.com/dfm

Open Software for Astrophysics, AAS241

Open Software for Astrophysics, AAS241

More Decks by Dan Foreman-Mackey

Other Decks in Science

Featured

Transcript