Slide 1

Slide 1 text

Flare Transformer: Solar Flare Prediction using Magnetograms and Sunspot Physical Features Kanta Kaneda1, Yuiga Wada1, Tsumugi Iida1, Naoto Nishizuka2, Yûki Kubo2, Komei Sugiura1 1 Keio University, Japan 2 National Institute of Information and Communications Technology, Japan

Slide 2

Slide 2 text

Introduction: Large solar flares can cause extensive damage - 2 - Solar Flares:Explosion occurring near sunspots on the solar surface ⇒ Monitored and forecasted by national agencies (USA : NOAA) If solar flare can be predicted in advance, damage can be minimized Disasters caused by Solar Flares 1989 Massive blackout at Quebec 2003 Damaged spacecraft HAYABUSA 2022 Damaged SpaceX’s satellites NASA, https://svs.gsfc.nasa.gov/4491 Estimated damage $160 billion at North America

Slide 3

Slide 3 text

Related Works: Existing solar flare prediction models are insufficient - 3 - Task Typical Models Overview Time-series Forecasting DeepAR [Salinias+, IJF20] Time-series forecasting model using an autoregressive RNN Informer [Zhou+, AAAI21] Prediction model using ProbSparse self-attention Solar Flare Forecasting [Park+, ApJ18] CNN-based model using full-disk images DeFN [Nishizuka+, ApJ18] Prediction model with sunspot physical features [Zhou+, AAAI21] [Park+, ApJ18] [Nishizuka+, ApJ18]

Slide 4

Slide 4 text

Problem Statement: Evaluation methods are important because of the imbalance in the class - 4 - ■ Solar Flare Prediction ■ Predicting the class of largest solar flare that will occur within 24 hours from time t ■ Input 1. Magnetograms ■ Magnetic images of the sun taken hourly 2. Sunspot Physical Features ■ Physical features including time-series features extracted from solar images ■ Output ■ Solar flare classes (X, M, C, O) Class Frequency Damage X Low High Large Small M C O

Slide 5

Slide 5 text

Problem Statement: Solar Flare Prediction using Magnetograms and Sunspot Physical Features - 5 - Characteristics of Solar Flare Prediction Task 1. It is important to predict X/M class solar flares that can cause significant damage 2. X/M class solar flares occur rarely ■ Evaluate using standard metrics ■ Gandin–Murphy–Gerrity score (GMGS) ■ True skill statistics (TSS) ■ Brier skill score (BSS) 1% 8% 32% 59% X-class M-class C-class O-class Breakdown of solar flare classes that occurred from 2010 to 2017

Slide 6

Slide 6 text

Proposed Method: Flare Transformer - 6 - ■ Introduced Magnetogram Module which handles magnetograms ■ Sunspot Feature Module which handles physical features ■ Introduced transformer attention mechanism to both modules ■ Introduced GMGS loss and BSS loss to balance the major metrics Novelty

Slide 7

Slide 7 text

Proposed Method: Predicts flare classes from magnetograms and physical features - 7 - Input ■ Magnetograms and 90 physical features from time to time Output ■ A 4 dimensional vector that denotes the predicted probabilities for each solar flare class

Slide 8

Slide 8 text

Proposed Method: Introduced MM and SFM to handle both magnetograms and physical features - 8 - Magnetogram Module / Sunspot Feature Module ■ Extract features from magnetograms/physical features ■ Model the temporal relationship by / transformer layers

Slide 9

Slide 9 text

Loss Function: Weighted sum of CE loss, GMGS loss and BSS loss - 9 - ■ Loss function: Trained to minimize Cross Entropy Loss BSS Loss GMGS Loss Cross Entropy Loss BSS Loss GMGS Loss BSS Loss GMGS Loss GMGS Loss Number of samples Number of classes Predicted probability Ground truth label Cross entropy between predicted labels and ground truth labels

Slide 10

Slide 10 text

Loss Function: Weighted sum of CE loss, GMGS loss and BSS loss - 10 - ■ Loss function: Trained to minimize Cross Entropy Loss BSS Loss GMGS Loss Cross Entropy Loss BSS Loss GMGS Loss BSS Loss GMGS Loss GMGS Loss Number of samples Number of classes Predicted probability Ground truth label BSS is a differentiable metric Introduce a loss function that maximizes BSS

Slide 11

Slide 11 text

Loss Function: Weighted sum of CE loss, GMGS loss and BSS loss - 11 - ■ Loss function: Trained to minimize Cross Entropy Loss BSS Loss GMGS Loss Cross Entropy Loss BSS Loss GMGS Loss BSS Loss GMGS Loss GMGS Loss Number of samples Number of classes Predicted probability Ground truth label Introduce weights that directly maximizes GMGS

Slide 12

Slide 12 text

Experiment Setting: Dataset containing magnetograms and physical features - 12 - ■ Total of 61315 samples from 2010-2017 ■ Contains magnetograms and a set of 90 physical features ■ Division based on time-series cross-validation [Tashman+, 00] Magnetogram Physical Features Training Set Test Set Period Num of Samples Period Num of Samples 2010-2013 29247 2014 8127 2010-2014 37374 2015 8155 2010-2015 45529 2016 7795 2010-2016 53324 2017 7991

Slide 13

Slide 13 text

Quantitative Results: Outperformed human experts in terms of GMGS and TSS - 13 - GMGS ↑ TSS≧M ↑ BSS≧M ↑ DeFN [Nishizuka+, AsJ18] 0.38±0.14 0.41±0.15 -0.02±0.78 DeFN-R [Nishizuka+, AsJ20] 0.30±0.06 0.28±0.16 0.04±0.98 Proposed Method 0.50±0.06 0.53±0.11 0.08±0.97 Human Experts [Kubo+, AsJ18][Murray, SW17] 0.48 0.50 0.16 ■ Our method outperformed the baseline methods (DeFN, DeFN-R) in terms of GMGS, BSS, and TSS +0.20 +0.02 +0.03 +0.25 +0.04 ■ Flare Transformer gives better predictions than can be achieved by human experts in terms of GMGS and TSS

Slide 14

Slide 14 text

Ablation Studies: Scores improved by handling both magnetograms and physical features as input - 14 - ■ Ablation (a) : Removed physical features from input Scores improved by handling both magnetograms and physical features as input ■ Ablation (b) : Reduced a transformer layer in SFM ■ Ablation (c) : Added a transformer layer in MM GMGS ↑ TSS≧M ↑ BSS≧M ↑ (a) w/o Ft-k+1:t 0.22±0.12 0.19±0.37 -1.77±0.23 (b) NV = 1, Nf = 1 0.52±0.09 0.49±0.08 0.05±1.05 (c) NV = 2, Nf = 2 0.56±0.07 0.55±0.12 0.01±0.97 (d) NV = 1, Nf = 2 0.50±0.06 0.53±0.11 0.08±0.97 Scores fluctuated slightly +0.28 +0.34 +1.85

Slide 15

Slide 15 text

■ Investigate the impact on GMGS from confusion matrices by GMGSInfluence ■ The main bottleneck is the misprediction of X-class flares for M-class flares Error Analysis: Main bottleneck is the misprediction of X-class flares - 15 - (Observed Class➙Predicted Class) 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 X➙M M➙C C➙O M➙O X➙O Element from the confusion matrix Element from the GMGS score matrix

Slide 16

Slide 16 text

■ We proposed the Flare Transformer, a solar flare prediction model which handles heterogenous data (magnetograms and physical features) ■ Flare Transformer gives better predictions than can be achieved by human experts in terms of GMGS and TSS (standard metrics in the field of solar flare prediction) Conclusion - 16 -