Open software for Astronomical Data Analysis
Dan Foreman-Mackey
February 28, 2023
Open software for Astronomical Data Analysis
@ NASA Goddard
Transcript
OPEN SOFTWARE FOR ASTRONOMICAL DATA ANALYSIS by Dan Foreman-Mackey
0: open software for astrophysics
credit: Adrian Price-Whelan / data: SAO/NASA ADS
many fundamental software packages have a shockingly small number of
maintainers.
credit: Adrian Price-Whelan
* astronomical software can be very high impact
* we should think about career trajectories & mechanisms for supporting this work
1: case study: gaussian processes
[figure: raw and de-trended flux [ppt] vs. time [days]; N = 1000; reference: DFM+ (2017)]
[figure: ignoring correlated noise vs. accounting for correlated noise; reference: Aigrain & DFM (2022)]
a Gaussian Process is a drop-in replacement for chi-squared
more details: Aigrain & Foreman-Mackey (2023) arXiv:2209.08940
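One way to read the "drop-in replacement" claim is sketched below, under stated assumptions: the usual chi-squared likelihood assumes independent Gaussian noise, while a Gaussian Process likelihood is the same Gaussian with a full covariance matrix. The function names and the squared-exponential kernel are illustrative choices, not taken from the talk.

```python
import numpy as np

def chi_squared_loglike(resid, yerr):
    # Gaussian log-likelihood for independent noise: -0.5 * chi^2 plus normalization
    return -0.5 * np.sum((resid / yerr) ** 2 + np.log(2 * np.pi * yerr**2))

def gp_loglike(resid, cov):
    # Same Gaussian log-likelihood, but with a full covariance matrix
    chol = np.linalg.cholesky(cov)            # cov = L L^T
    alpha = np.linalg.solve(chol, resid)      # L^{-1} r, so alpha @ alpha = r^T cov^{-1} r
    logdet = 2 * np.sum(np.log(np.diag(chol)))
    return -0.5 * (alpha @ alpha + logdet + len(resid) * np.log(2 * np.pi))

def squared_exp_kernel(t, amp, scale):
    # Illustrative kernel choice (an assumption, not the talk's model)
    dt = t[:, None] - t[None, :]
    return amp**2 * np.exp(-0.5 * (dt / scale) ** 2)

rng = np.random.default_rng(0)
t = np.linspace(0.0, 25.0, 100)               # time [days]
yerr = np.full_like(t, 0.1)
resid = yerr * rng.normal(size=t.size)        # data minus mean model

# With a purely diagonal covariance the two likelihoods agree
ll_chi2 = chi_squared_loglike(resid, yerr)
ll_diag = gp_loglike(resid, np.diag(yerr**2))

# Adding a kernel accounts for correlated noise with the same call
cov = squared_exp_kernel(t, amp=0.3, scale=2.0) + np.diag(yerr**2)
ll_gp = gp_loglike(resid, cov)
```

Swapping `chi_squared_loglike` for `gp_loglike` leaves the rest of a fitting pipeline unchanged; only the noise model differs.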
[1] model building
[2] computational cost
reference: Luger, DFM, Hedges (2021)
[2] computational cost
[1] bigger/better computers
[2] exploit matrix structure
[3] approximate linear algebra
[4] etc.
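As a hedged illustration of option [2], the sketch below solves a linear system whose matrix is diagonal plus low rank, using the Woodbury identity. This structure is chosen only because it is short to demonstrate; it is not claimed to be the structure exploited in the talk's references. The function name `woodbury_solve` is hypothetical.

```python
import numpy as np

def woodbury_solve(diag, U, y):
    # Solve (diag(diag) + U U^T) x = y via the Woodbury identity.
    # Cost scales as O(n k^2) for U of shape (n, k), versus O(n^3) for a dense solve.
    Dinv_y = y / diag
    Dinv_U = U / diag[:, None]
    small = np.eye(U.shape[1]) + U.T @ Dinv_U      # (k, k) system
    return Dinv_y - Dinv_U @ np.linalg.solve(small, U.T @ Dinv_y)

rng = np.random.default_rng(0)
n, k = 500, 3
diag = rng.uniform(0.5, 1.5, n)
U = rng.normal(size=(n, k))
y = rng.normal(size=n)

x_fast = woodbury_solve(diag, U, y)
x_dense = np.linalg.solve(np.diag(diag) + U @ U.T, y)   # dense reference solve
```

Both solves give the same answer to numerical precision; only the structured version avoids forming and factorizing the full n-by-n matrix.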
[figure: raw and de-trended flux [ppt] vs. time [days]; N = 1000; reference: DFM+ (2017)]
reference: Gordon, Agol, DFM (2020) / tinygp.readthedocs.io
* a Gaussian Process is a drop-in replacement for chi-squared
* model building & computational cost are (solvable!) challenges
* you should check out tinygp!
2: case study: probabilistic inference
have: physics => data
want: data => physics
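A minimal sketch of going from "physics => data" to "data => physics", assuming a toy straight-line forward model, Gaussian noise, and a flat prior. The sampler is a plain random-walk Metropolis loop written for illustration; it is not the ensemble algorithm used by emcee, and all names here are hypothetical.

```python
import numpy as np

def forward_model(params, x):
    # hypothetical "physics => data" model: a straight line
    slope, intercept = params
    return slope * x + intercept

def log_prob(params, x, y, yerr):
    # Gaussian log-likelihood with a flat prior, up to an additive constant
    resid = (y - forward_model(params, x)) / yerr
    return -0.5 * np.sum(resid**2)

def metropolis(log_prob_fn, init, n_steps, step_size, rng):
    # random-walk Metropolis: propose a Gaussian step, accept or reject
    chain = np.empty((n_steps, len(init)))
    current = np.asarray(init, dtype=float)
    current_lp = log_prob_fn(current)
    for i in range(n_steps):
        proposal = current + step_size * rng.normal(size=current.shape)
        proposal_lp = log_prob_fn(proposal)
        # accept with probability min(1, p(proposal) / p(current))
        if np.log(rng.uniform()) < proposal_lp - current_lp:
            current, current_lp = proposal, proposal_lp
        chain[i] = current
    return chain

rng = np.random.default_rng(42)
true_params = np.array([2.0, 1.0])
x = np.linspace(-1.0, 1.0, 50)
yerr = 0.1
y = forward_model(true_params, x) + yerr * rng.normal(size=x.size)  # simulated data

chain = metropolis(lambda p: log_prob(p, x, y, yerr), [0.0, 0.0], 20000, 0.02, rng)
samples = chain[5000:]                 # discard burn-in
posterior_mean = samples.mean(axis=0)
posterior_std = samples.std(axis=0)
```

The forward model is only ever called in the "physics => data" direction; the sampler turns those evaluations into constraints on the parameters.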
[1] physical models
[2] legacy code
[figure: patience required vs. number of parameters (a few / tenish / not outrageously many), annotated with "emcee" and "how things should be"; reference: DFM (priv. comm.)]
[figure: WASP-39b / NIRSpec transit depth [%] vs. wavelength [micron]; Alderson et al. 2023 and joint fit (N = 50); reference: Soichiro Hattori, Ruth Angus, DFM, . . . (in prep)]
reference: Soichiro Hattori, Ruth Angus, DFM, . . . (in prep)
showing 23 of the 404 parameters (8 per channel + 4 shared)
how?
d(physics => data) / d(physics)
automatic differentiation aka “backpropagation”
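The derivative above can be sketched with JAX, assuming a toy straight-line forward model and a chi-squared objective (both hypothetical stand-ins for a real physical model). `jax.grad` differentiates with respect to the first argument by default, so the model code itself needs no hand-written derivatives.

```python
import jax
import jax.numpy as jnp

def forward_model(params, x):
    # hypothetical "physics => data" model: a straight line
    slope, intercept = params
    return slope * x + intercept

def chi_squared(params, x, y, yerr):
    resid = (y - forward_model(params, x)) / yerr
    return jnp.sum(resid**2)

# gradient of chi-squared with respect to the model parameters
grad_chi_squared = jax.grad(chi_squared)

x = jnp.linspace(0.0, 1.0, 5)
y = 2.0 * x + 1.0
yerr = jnp.ones_like(x)
params = jnp.array([1.0, 0.0])

g = grad_chi_squared(params, x, y, yerr)

# analytic gradient for comparison: -2 * sum(resid * d(model)/d(param))
resid = y - forward_model(params, x)
g_analytic = jnp.array([-2.0 * jnp.sum(resid * x), -2.0 * jnp.sum(resid)])
```

Gradient-based samplers and optimizers can consume `grad_chi_squared` directly, which is what makes models with hundreds of parameters tractable.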
[1] physical models
[2] legacy code
[1] domain-specific libraries
[2] emulation
* gradient-based inference using autodiff can improve efficiency
* there are practical challenges with these methods in astro
* of interest: domain-specific libraries & emulation
3: aside: JAX
import numpy as np

def linear_least_squares(x, y):
    # design matrix with columns [x, 1] for a straight-line fit
    A = np.vander(x, 2)
    # least-squares solution: [slope, intercept]
    return np.linalg.lstsq(A, y)[0]
import jax.numpy as jnp

def linear_least_squares(x, y):
    # same code as the NumPy version; only the import changes
    A = jnp.vander(x, 2)
    return jnp.linalg.lstsq(A, y)[0]
4: open research practices
open software is foundational to astrophysics research
there are opportunities at the interface of astro & applied fields
there are ways you can participate & benefit right away
I want to chat about…
[1] your data analysis problems
[2] building astronomical software
[3] writing documentation & tutorials
get in touch! dfm.io github.com/dfm