
A data scientist’s guide to direct imaging of exoplanets

Invited talk at the RADA big data workshop (https://as595.github.io/RADABigData/) in Medellín, Colombia. The workshop is part of the Radio Astronomy for Development in the Americas (RADA) Big Data program, which is jointly funded by the UK Newton Fund and the Ministerio de Tecnologías de la Información y las Comunicaciones de Colombia. Feb 12, 2019.

Carlos Alberto Gomez Gonzalez

February 12, 2019

Transcript

  1. A data scientist’s guide to direct imaging of exoplanets. Carlos Alberto Gomez Gonzalez. RADA big data workshop (Medellín), Feb 12, 2019.
  2. A data scientist’s guide to direct imaging of exoplanets. “Exo” from extrasolar (outside of the Solar System).
  3. http://exoplanetarchive.ipac.caltech.edu, 25 Jan 2018. Task: try to spot this color and estimate the total contribution of direct-imaging detections… about 1%!
  4. Why direct imaging? [Figure: HR 8799 system, Marois et al. 2010; 20 AU, 0.5″; Konopacky et al. 2013; Bowler 2016; Milli et al. 2016]
  5. From raw astronomical images to a sequence of calibrated images (raw → clean):
     • Basic calibration and “cosmetics”: dark/bias subtraction, flat fielding, sky/thermal background subtraction, bad pixel correction
     • Image recentering
     • Bad frame removal
     • Image combination
     • PSF modeling: median; pairwise, ANDROMEDA; LOCI; PCA/KLIP, NMF; LLSG
     • Model PSF subtraction
     • Detection on the residual frame or on a detection map
     • Characterization of “detected” companions: (R1, θ1, F1), (R2, θ2, F2), (R3, θ3, F3), (R4, θ4, F4)
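The classical reduction sketched above (build a PSF model, subtract it, derotate, combine) can be illustrated with the simplest case, median subtraction. This is a minimal sketch for illustration: `cube`, `angles`, and the use of SciPy's `rotate` for derotation are assumptions, not the pipeline's actual code.

```python
import numpy as np
from scipy.ndimage import rotate  # assumed available for derotation

def median_psf_subtraction(cube, angles):
    """Classical ADI median subtraction: model the stellar PSF as the
    per-pixel median of the image sequence, subtract it, rotate the
    residual frames back to a common sky orientation, and combine them.

    cube: (n_frames, ny, nx) sequence of calibrated images (hypothetical)
    angles: parallactic angle of each frame, in degrees (hypothetical)
    """
    psf_model = np.median(cube, axis=0)       # M: quasi-static speckle model
    residuals = cube - psf_model              # R = D - M, per frame
    derotated = np.array([rotate(frame, -ang, reshape=False)
                          for frame, ang in zip(residuals, angles)])
    return np.median(derotated, axis=0)       # final residual frame

# Random noise standing in for a real ADI sequence
cube = np.random.normal(size=(10, 32, 32))
angles = np.linspace(0.0, 40.0, 10)
final = median_psf_subtraction(cube, angles)
print(final.shape)  # (32, 32)
```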
  6. Algorithms, ten years of research:
     • Median frame subtraction
     • Pairwise subtraction
     • Least-squares image combination
     • PCA (with forward modeling), NMF
     • Low-rank plus sparse decompositions
     • Matched filtering
     • Maximum likelihood estimation
  7. Building blocks… D − M = R: a noise reduction algorithm produces the model M, and R is the residual image containing the exoplanet signal.
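A minimal sketch of the D − M = R idea with a low-rank model M built via PCA, in the spirit of PCA/KLIP. This is an illustrative simplification (no reference library selection, no forward modeling); the function name and shapes are hypothetical.

```python
import numpy as np

def pca_psf_residuals(cube, ncomp=5):
    """Build a low-rank PSF model M by projecting each frame onto the
    first `ncomp` principal components of the image sequence, then
    return R = D - M, which keeps signal not captured by the model."""
    n, ny, nx = cube.shape
    X = cube.reshape(n, ny * nx)          # D: frames as row vectors
    X_mean = X.mean(axis=0)
    Xc = X - X_mean
    # SVD of the centered sequence gives the principal components
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    V = Vt[:ncomp]                        # top `ncomp` components
    model = Xc @ V.T @ V + X_mean         # M: low-rank projection
    return (X - model).reshape(n, ny, nx) # R: residual cube

cube = np.random.normal(size=(20, 16, 16))
R = pca_psf_residuals(cube, ncomp=3)
print(R.shape)  # (20, 16, 16)
```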
  8. Building blocks… D − M = R again, but now an observer (or a classifier) inspects R, the residuals produced by the noise reduction algorithm that contain the exoplanet signal.
  9. Detection: from the image sequence to the final residual image. Which of the blobs are speckles, which one is a real planet, and which are synthetic (injected) planets?
  10. Supervised learning. Training data (x_i, y_i), i = 1, …, n (e.g. the “chihuahua or muffin” images). Learn f : X → Y with
      f = argmin_{f_θ, θ ∈ Θ} Σ_{i=1}^{n} L(y_i, f_θ(x_i)) + g(θ)
  11. Supervised learning, same objective f = argmin_{f_θ, θ ∈ Θ} Σ_{i=1}^{n} L(y_i, f_θ(x_i)) + g(θ), highlighting the model architecture: the choice of the function family f_θ, θ ∈ Θ.
  12. Supervised learning, same objective, highlighting the loss function L(y_i, f_θ(x_i)) and the regularization term g(θ).
  13. Supervised learning, same objective, highlighting the optimization: the argmin over θ ∈ Θ.
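The objective on these slides, empirical risk plus a regularizer, can be made concrete with a toy linear sigmoid classifier. The quadratic regularizer g(θ) = λ‖θ‖² and all names here are illustrative choices, not from the talk.

```python
import numpy as np

def objective(theta, X, y, lam=0.1):
    """Sum over training pairs (x_i, y_i) of a loss L(y_i, f_theta(x_i))
    plus a regularizer g(theta) = lam * ||theta||^2.  Here f_theta is a
    linear model with a sigmoid output and L is binary cross-entropy."""
    preds = 1.0 / (1.0 + np.exp(-X @ theta))   # f_theta(x_i)
    eps = 1e-12                                # numerical safety only
    loss = -np.sum(y * np.log(preds + eps)
                   + (1 - y) * np.log(1 - preds + eps))
    return loss + lam * np.dot(theta, theta)   # + g(theta)

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))
y = (X[:, 0] > 0).astype(float)
print(objective(np.zeros(3), X, y))  # risk of the all-zero parameters
```

Training means minimizing this quantity over θ, e.g. with (stochastic) gradient descent, which is exactly what the "optimization" slide highlights.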
  14. A network as a stack of transformations: input X → 1st layer (data transformation) → 2nd layer (data transformation) → … → Nth layer (data transformation). ▸ Activation function
  15. Same stack: input X → 1st layer → 2nd layer → … → Nth layer, now with common building blocks. ▸ Max pooling ▸ Dropout ▸ BatchNorm
  16. The training loop: input X flows through the layers (1st, 2nd, …, Nth, each with its own weights) to produce predictions Ŷ; the loss function compares Ŷ against the labels Y to give a loss score, and the optimizer uses that score to update the weights.
  17. Reframing the problem: from unsupervised to supervised learning.
      • Sequences of images without labels
      • Not enough archival data (observed stars)
      • We can generate semi-synthetic data by injecting a planet (PSF) template!
      • We grab patches: signal/noise, using the PSF template
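The semi-synthetic labeling idea above (inject a scaled PSF template into noise patches to create positive samples) might look like the sketch below. The Gaussian stand-in for the instrumental PSF, the patch shapes, and the flux value are all assumptions for illustration.

```python
import numpy as np

def inject_template(patch_cube, psf_template, flux):
    """Create a positive (planet-containing) training sample by adding a
    scaled PSF template to a sequence of noise patches.  The negative
    class is the noise patches alone."""
    return patch_cube + flux * psf_template

# Negative class: patches of residual speckle noise (random stand-in here)
noise_patch = np.random.normal(scale=1.0, size=(10, 11, 11))

# Simple 2D Gaussian standing in for the instrumental PSF template
yy, xx = np.mgrid[-5:6, -5:6]
psf = np.exp(-(xx**2 + yy**2) / (2 * 1.5**2))

positive_sample = inject_template(noise_patch, psf, flux=5.0)
print(positive_sample.shape)  # (10, 11, 11)
```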
  18. SODINN library:
      • DataLabeler: flux vs. S/N sampling; flux/contrast estimation; training data generation; data augmentation; data persistence (load/save with HDF5)
      • Model: network creation (Keras and TensorFlow); model training; model persistence (load/save with HDF5)
      • Predictor: target sample generation; predictions (based on the trained model); probability map inspection; results to HDF5
      The design enables reproducible results, hyper-parameter and network architecture tuning, and comparison of labeling strategies.
  19. Learning a mapping function with a discriminative model: a neural network. Goal: make correct predictions on new samples, f : X → Y, ŷ = p(c+ | MLAR sample). SGD with a binary cross-entropy loss:
      L = −Σ_n [ y_n ln(ŷ_n) + (1 − y_n) ln(1 − ŷ_n) ]
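The binary cross-entropy loss from the slide, written out directly; the clipping by `eps` is a standard numerical-safety addition, not part of the formula.

```python
import numpy as np

def binary_cross_entropy(y, y_hat, eps=1e-12):
    """L = -sum_n [ y_n ln(y_hat_n) + (1 - y_n) ln(1 - y_hat_n) ],
    with y_hat clipped away from 0 and 1 so the logs stay finite."""
    y_hat = np.clip(y_hat, eps, 1 - eps)
    return -np.sum(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))

y = np.array([1.0, 0.0, 1.0])        # true labels (planet / no planet)
y_hat = np.array([0.9, 0.1, 0.8])    # predicted probabilities p(c+ | sample)
print(round(binary_cross_entropy(y, y_hat), 4))  # 0.4339
```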
  20. Candidate architectures (X and y split into train/test/validation sets):
      • 3D branch: 3D convolutional layer (kernel 3x3x3, 40 filters) → 3D max pooling (2x2x2) → 3D convolutional layer (kernel 2x2x2, 80 filters) → 3D max pooling (2x2x2) → dense layer (128 units, ReLU activation + dropout) → output dense layer (1 unit, sigmoid activation)
      • 2D branch: stacked 2D convolutional layers (kernel 3x3, 40 filters; then kernel 2x2, 80 filters), each followed by 2D max pooling (2x2) → dense layer (128 units, ReLU activation + dropout) → output dense layer (1 unit, sigmoid activation)
      • Recurrent variant: B-LSTM / B-GRU layer over the image sequence
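A sketch of the 2D branch above in Keras: two convolution + max-pooling stages with the slide's filter counts and kernel sizes, a 128-unit dense layer with ReLU and dropout, and a single sigmoid output unit for the binary planet / no-planet decision. The input size, padding, and dropout rate are assumptions; the actual SODINN models may differ.

```python
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    keras.Input(shape=(32, 32, 1)),                 # assumed patch size
    layers.Conv2D(40, (3, 3), activation="relu", padding="same"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(80, (2, 2), activation="relu", padding="same"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dropout(0.5),                            # assumed rate
    layers.Dense(1, activation="sigmoid"),          # p(c+ | sample)
])
model.compile(optimizer="adam", loss="binary_crossentropy")
print(model.output_shape)  # (None, 1)
```

Training would then be `model.fit(X_train, y_train, ...)` on the labeled patch samples described earlier.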
  21. Exoplanet imaging data challenge:
      • Data from the most representative instruments
      • Metrics
      • Phases
      • Will run on CodaLab
      https://carlgogo.github.io/exoimaging_challenge/
      https://github.com/carlgogo/exoimaging_challenge_extras
  22. Connections with other fields: “Convolutional LSTMs for Cloud-Robust Segmentation of Remote Sensing Imagery” (https://arxiv.org/pdf/1811.02471.pdf), a two-component convolutional long short-term memory (LSTM) network.