blasé: An interpretable transfer learning approach to cool star échelle spectroscopy

blasé A transfer learning approach to échelle spectra Michael Gully-Santiago
The University of Texas at Austin Machine Learning for Cool Stars Splinter Session Cool Stars 21 July 5, 2022

blasé is a general purpose experimentation framework aimed primary at
high bandwidth, high-resolution vis/near-IR spectra X-shooter CRIRES+ APOGEE IGRINS HPF UVES iSHELL NEID HARPS MAROON-X CARMENES VELOCE Keck NIRSPEC HRS etc…

blasé is a general purpose experimentation framework aimed primary at
high bandwidth, high-resolution vis/near-IR spectra IGRINS HPF (this talk) Park et al. 2014 Mahadevan 2012

blasé

The problem with échelle spectra: The mere act of comparing
an échelle spectrum to a model becomes a computational statistics challenge. • Voluminous data • Imperfect models • Confounding factors (tellurics) + • In f lexible models (~4 parameters in a typical grid) • Expensive models

Stellar spectral models are imperfect. Missing lines; under-/over- predicted depths,
widths, shapes, locations. λ (Å)

The model and data are voluminous. A typical HPF/IGRINS spectrum
of a K/M dwarf contains >10,000 stellar lines.

(In some places worse than others.) Telluric absorption confounds the
stellar spectrum.

Four modern solutions to (aspects of) these problems: github.com/Star f
ish-develop/Star f ish github.com/HajimeKawahara/exojax Czekala et al. 2015 Starfish wobble Bedell et al. 2019 github.com/megbedell/wobble exojax Kawahara et al. 2022 FAL Cargile et al. (in prep) github.com/pacargile/FAL (See also ThePayne by Phill Cargile)

wobble Bedell et al. 2019 github.com/megbedell/wobble github.com/HajimeKawahara/exojax exojax Kawahara et
al. 2022

al. 2022 more data-driven more model-driven more parameters (~105) fewer parameters (~15) more precise better out-of-band prediction accuracy

al. 2022 more data-driven more model-driven “Hybrid” github.com/gully/blase blasé Gully-Santiago & Morley (in prep)

al. 2022 github.com/gully/blase blasé Gully-Santiago & Morley (in prep) underlying machine learning frameworks:

underlying machine learning frameworks: machine learning frameworks provide GPU acceleration
and automatic differentiation* For a tutorial of autodi ff , Jacobians, and Hessians applied to spectroscopy, see: github.com/gully/TgiF *c.f. Gunes-Baydin et al. 2015 for a review ArXiV: 1502.05767

Key idea A typical echelle spectrum has: Nlines } (
𝛌 c , A, 𝛔 , 𝛄 ) Line center Amplitude Gaussian width Lorentzian width Four parameters for all 10,000 spectral lines

Key idea A typical echelle spectrum has: Nlines } (
𝛌 c , A, 𝛔 , 𝛄 ) Line center Amplitude Gaussian width Lorentzian width Four parameters for all 10,000 spectral lines blasé f its all 40,000 parameters simultaneously with autodi ff . = 40,000 total parameters

Step 0 Pick closest stellar model template to your target
of interest: Te = 4700 K logg = 4.5 [Fe/H] = 0

Step 1 Place a coarsely-tuned Voigt pro f ile at
the location of all lines.

Step 1 Optimize all 40,000 line parameters simultaneously.

Step 0.5 (optional) Pick closest telluric model template to your
time of observation: Tobs = 17° C Humidity = 40% Gullikson et al. 2014: TelFit

Step 1.5 (optional) Place a coarsely-tuned Voigt pro f ile
at the location of all lines. Gullikson et al. 2014: TelFit

Step 1.5 (optional) Optimize all 12,000 telluric line parameters simultaneously.
Gullikson et al. 2014: TelFit

Clone the spectra Step 1 Apply vsini & RV Step
2 Step 0 Choose Templates Multiply by telluric Step 3 Instrument convolve & resample Step 4

Clone the spectra Step 1 Apply vsini & RV Step
2 Step 0 Choose Templates Multiply by telluric Step 3 Instrument convolve & resample Step 4 Compare to data Step 5 RV, vsini, and all stellar and telluric line properties are tuned simultaneously.

Before λ (Å)

λ (Å) After

blasé learns a super-resolution template. before λ (Å)

blasé learns a super-resolution template. after λ (Å)

56 s End-to-end HPF whole-spectrum f itting on NVIDIA RTX
2070 GPU. GPUs, sparse matrices, and approximations yield huge speedups. blasé is fast. End-to-end HPF whole-spectrum f itting on M1 MacBook Air (CPU only). 1h15m

Key idea: Sparsity We use sparse matrices* in PyTorch to
achieve 100x speedups. *c.f. Saad 2003

Main challenge: Regularization We face a bias/variance tradeo ff .
c.f. Bedell et al. 2019 In fi nite regularization No regularization Laissez-faire: Surrealist over fi tting No learning: recover the input template

Main challenge: Regularization We face a bias/variance tradeo ff .
c.f. Bedell et al. 2019 In fi nite regularization No regularization Laissez-faire: Surrealist over fi tting No learning: recover the input template “Sweet spot” Regularization encodes our intuition: Line properties should match PHOENIX unless there’s enough data to say otherwise

What’s possible? blasé serves as a foundation for many new
strategies. 1. Doppler Imaging with imperfect templates (Luger et al. 2021) • Zonal bands (Cross f ield et al. 2014) • Polar spots (Roettenbacher et al. 2017) 2. Starspot spectral decomposition (Gully-Santiago et al. 2017) 3. Spectroscopic binary joint modeling (Czekala et al. 2017) 4. Automatic equivalent width tabulation 5. Discover “true” lineshapes with hidden neural network layers 6. Spectral calibration across the grid dimensions (c.f. previous talk by Bello-Garcia) 7. Brown Dwarfs with non-parametric pseudo-continuum (e.g. GPyTorch) 8. Adapt the templates to magnetic f ields / Zeeman effect 9. Provide quantitative feedback to modelers at native resolution

blasé is open source. We have tutorials, deep dives, FAQs,
and API documentation. blase.readthedocs.io

blasé is user-friendly. It serves as a gentle introduction to
key concepts in ML for newcomers. blase.readthedocs.io

blasé is a work in progress, we invite your help:
Engage with us on GitHub Issues and more. github.com/gully/blase

Typical spectral analysis work f low with blasé blasé 2D
echellogram reduction pipeline 1D spectrum post-processing Initial spectral template selection Spectral f itting or wobble or starfish or … muler 🆕 gollum 🆕 Gully-Santiago et al. 2022; JOSS github.com/OttoStruve/muler github.com/BrownDwarf/gollum

blasé • blasé offers an interpretable model for echelle spectra,
including tellurics. • It leverages the key enabling technology autodiff to achieve excellent precision. • By using Sparse Matrices and GPUs we achieve excellent computational performance. • It has tons of conceivable extensions. • We have also made ancillary tools muler and gollum for lowering the barrier to entry. • Altogether these codes form an easy-to-use ecosystem for end-to-end spectral analysis.

The end.

blasé: An interpretable transfer learning appro...

blasé: An interpretable transfer learning approach to cool star échelle spectroscopy

More Decks by gully

Other Decks in Science

Featured

Transcript