Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Spectral Timing, Bayesian Inference and You

Spectral Timing, Bayesian Inference and You

Spectral timing has emerged as a key way to study energetic sources: simultaneously considering both time and energy information allows us to understand relationships between the different physical components in a system, such as different regions in an accretion flow.
These developments are coming at a time where technology is enabling rapid improvements in our ability to infer knowledge from data. In this talk, I’m aiming to build a bridge between spectral timing in astronomy on one side, and recent, exciting developments in statistics and computer science on the other. I will present these in the context of current work on mitigating dead time effects on timing studies with neural network-based density estimation. I will also introduce the spectral timing package stingray, and give a sneak peak of what’s in store for current and near-future developments of the software.

Daniela Huppenkothen

June 14, 2021
Tweet

Transcript

  1. Spectral Timing, Bayesian Inference and You Daniela Huppenkothen SRON Netherlands

    Institute for Space Research ! Tiana_Athriel " dhuppenkothen ! [email protected]
  2. Time [s] Frequenc Time Malzac 2008 Photon Energy [keV] Photon

    Energy * Flux [detector units] “soft state” “hard state” high-frequency/low-frequency X-ray brightness total brightness Spectral States Credit: Sera Markoff
  3. Yamaoka et al (2010); Huppenkothen et al. (2019) Variability: GX

    339-4 Fast Fourier Transform stochastic variability (red noise) quasi-periodic oscillation (QPO) white noise
  4. X-ray reverberation around accreting black holes 5 Fig. 3 Time

    lag (8–13 keV relative to 2–4 keV) versus frequency for a hard state obser- vation of Cyg X-1 obtained by RXTE in December 1996. The trend can be very roughly approximated with a power-law of slope −0.7, but note the clear step-like features, which correspond roughly to different Lorentzian features in the power spectrum (Nowak 2000). Cygnus X-1: Nowak, 2000 1 10 0.5 2 5 0.5 Energy (keV) Fig. 9 The ratio spectrum of 1H0707-495 to a continuum model (Fabian et al. 2009). T broad iron K and iron L band are clearly evident in the data. The origin of the soft exce below 1 keV in this source had been debatable, but in this work was found to be dominat by relativistically broadened emission lines. 10−5 10−4 10−3 0.01 −50 0 50 100 150 200 Lag (s) Temporal Frequency (Hz) Fig. 10 The frequency-dependent lags in 1H0707-495 between the continuum dominat hard band at 1–4 keV and the reflection dominated soft band at 0.3–1 keV. and found significant high-frequency soft lags in 15 sources. Plotting the am 1H0707-495: Uttley et al, 2014 Time Lags Temporal variations and energy spectra are intricately linked
  5. • Limited functionality • Last updated 2009 (?) • Designed

    for spectroscopy • has some timing capabilities • https://space.mit.edu/CXC/isis/ + closed-source packages maintained by individual groups
  6. https://stingray.science • 3 lead developers/maintainers (Huppenkothen, Bachetti, Stevens) • ~10

    contributors • 5 completed Google Summer of Code Projects • astropy-affiliated project • connecting functionality to astropy.TimeSeries and lightkurve Huppenkothen et al (2019)
  7. stingray.science Huppenkothen et al (2019) • spectral timing classes +

    functions • building blocks • simulation, pulsars, modeling • command-line interface built on stingray • quick-look (spectral) timing analysis • graphical user interface • interactive data analysis
  8. Top-level functionality Events Lightcurve Powerspectrum / Crossspectrum AveragedPowerspectrum / AveragedCrossspectrum

    Crosscorrelation / Autocorrelation Bispectrum Lag-Energy Spectra supporting functionality: statistics, good time intervals, I/O Sub-modules pulse modeling simulator deadtime
  9. Current + Future Work • rework the modeling interface +

    autodiff: current GSoC project • multi-taper periodograms: current GSoC project • update to DAVE GUI: current GSOC project • fix bugs (there are always bugs) • improve API (aka: what were we thinking?!) • improve documentation (there is never enough documentation) • performance + memory optimization • better integration with current X-ray missions • better integration with spectral modeling packages • better integration with astropy.timeseries and lightkurve • higher-order Fourier products
  10. Get Involved! Stingray is a project for the community, but

    there’s only so much we as maintainers can do …
  11. How to get involved (please do!) • find bugs (and

    report them as a GitHub Issue) • fix bugs (as a GitHub Pull Request) • make feature requests (also via GitHub Issue) • implement new features (also via GitHub Pull Request) • test documentation/tutorials (and report mistakes/fix bugs etc) • … Don’t know where to start? • “Good First Issue” tag on GitHub • join the slack + ask us! We’ll help :)
  12. Maximum Likelihood Estimation “How probable is it that my data

    D came from a model M with parameters ?”
  13. log(p(D|θ)) = log ( n ∏ i=1 p(d i |θ)

    ) = − n 2 log(2πσ2) − 1 2σ2 n ∑ i=1 (d i − f(x i , θ))2
  14. log(p(D|θ)) = log ( n ∏ i=1 p(d i |θ)

    ) = − n 2 log(2πσ2) − 1 2σ2 n ∑ i=1 (d i − f(x i , θ))2 “chi-square fitting” caution: this is not a probability of the parameters
  15. When this might not be enough … • your likelihood

    is not unimodal • your likelihood might be skewed or otherwise complex • you might have useful prior information • you might be interested in the properties of a population • you might have a complex model/data collection that you can simulate, but not write down an analytical function for
  16. Useful Prior Information data parameters θ = D = p(D|θ)

    = n ∏ i=1 p(d i |θ) p(θ|D) ∝ p(θ) n ∏ i=1 p(d i |θ) p(θ)
  17. Useful Prior Information data parameters θ = D = p(D|θ)

    = n ∏ i=1 p(d i |θ) p(θ|D) ∝ p(θ) n ∏ i=1 p(d i |θ) p(θ)
  18. Useful Prior Information data parameters θ = D = p(D|θ)

    = n ∏ i=1 p(d i |θ) p(θ|D) ∝ p(θ) n ∏ i=1 p(d i |θ) p(θ) You can do this with stingray.modeling
  19. The Chandra ABC Guide to Pile-Up There are many effects

    or models that you can (and should) simulate, but that are hard to take into account in model fitting.
  20. Yamaoka et al (2010); Huppenkothen et al. (2019) Variability: GX

    339-4 Fast Fourier Transform We understand how to do this, unless the sources is very bright
  21. Huppenkothen et al (2017) Figure 5. Left panel: averaged periodogram

    of the part of Chandra observation 17696 containing the QPOs at 73 mHz and 1.03 H periodogram of the two Fermi/GBM triggers simultaneous with the Chandra data (right panel). In blue, we show the logarithmically data sets, we show the MAP model with four (Chandra) or two ( Fermi/GBM) Lorentzian components in purple and the combined m The Astrophysical Journal, 834:90 (17pp), 2017 January 1 Problem: (Bayesian) parameter inference
  22. p(θ|D) ∝ p(D|θ)p(θ) data D D θ θ θ parameters

    θ = D = likelihood prior posterior intractable! ☹
  23. Sadegh + Vrugt, 2014, see also: Brehmer et al (2018a,

    2018b), Cranmer et al (2020) Step 1: draw parameters from prior Step 2: simulate data sets Step 3: compare simulated to observed data Step 4: keep parameters that produce simulations similar to the data simulator
  24. What are the properties of the evolution of the QPO

    centroid frequency? p(θ|D) ∝ n ∏ i=1 p(D i |θ)p(θ) p({θ}n i=1 , α|D) ∝ n ∏ i=1 [p(D i |θ i )p(θ i |α)] p(α)
  25. What are the properties of the evolution of the QPO

    centroid frequency? p(θ|D) ∝ n ∏ i=1 p(D i |θ)p(θ) p({θ}n i=1 , α|D) ∝ n ∏ i=1 [p(D i |θ i )p(θ i |α)] p(α) Hierarchical Bayesian Modeling is ideal for population-level problems
  26. Things I’m curious about • Where else should be be

    using simulation-based inference? • What do you think are the next big challenges in spectral timing? • What population-level problems should we be addressing with hierarchical inference?
  27. • Use stingray! It’s fun! Conclusions • Also, help us

    build stingray! That’s also fun! • Bayesian statistics allows you to go beyond standard fitting problems to take into account • SBI enables principled inference when models are complex, numerical and/or generate data • New statistical methods and software tools will set us up for answering complex questions with complex models, using current and future instrumentation