The Cannon: Data-Driven Spectral Modeling in the Era of Large Stellar Surveys

Gemini Observatory Workshop March 17, 2016 The Cannon: Data-Driven Spectral
Modeling in the Era of Large Stellar Surveys Anna Ho, Caltech with Melissa Ness (MPIA), David W. Hogg (NYU), and Hans-Walter Rix (MPIA) [email protected]

We have a large set of uniformly-observed stellar spectra. 2
Anna Ho, Caltech /

We know labels* for some subset of those spectra. We
have a large set of uniformly-observed stellar spectra. *parameters and abundances, e.g. Teff, logg, [Fe/H], … 3 2 Anna Ho, Caltech /

How can we infer labels for all the spectra? We
have a large set of uniformly-observed stellar spectra. We know labels* for some subset of those spectra. *parameters and abundances, e.g. Teff, logg, [Fe/H], … 2 Anna Ho, Caltech /

have a large set of uniformly-observed stellar spectra. We know labels* for some subset of those spectra. The Cannon* is a data-driven method for transferring labels from one dataset to another. 2 Anna Ho, Caltech /

have a large set of uniformly-observed stellar spectra. * We know labels* for some subset of those spectra. 2 Anna Ho, Caltech / The Cannon* is a data-driven method for transferring labels from one dataset to another.

Anna Ho, Caltech / 3 A suite of large-scale surveys
are systematically measuring spectra for stars in the Milky Way.

Anna Ho, Caltech / 3 RA (degrees) Dec (degrees) 100
50 150 200 250 300 350 0 -100 -50 0 50 100 APOGEE Survey: 150,000 near-IR spectra, R ~ 22,500 (Figure adapted from Melissa Ness) A suite of large-scale surveys are systematically measuring spectra for stars in the Milky Way.

Anna Ho, Caltech / RA (degrees) Dec (degrees) 100 50
150 200 250 300 350 0 -100 -50 0 50 100 APOGEE-2: 300,000 near-IR spectra, R ~ 22,500 (Figure adapted from Melissa Ness) A suite of large-scale surveys are systematically measuring spectra for stars in the Milky Way. 3

150 200 250 300 350 0 -100 -50 0 50 100 APOGEE Survey: 150,000 near-IR spectra, R ~ 22,500 GALAH: 1 million spectra, R ~ 28,000, 4710-7900 Å (Figure adapted from Melissa Ness) A suite of large-scale surveys are systematically measuring spectra for stars in the Milky Way. 3

150 200 250 300 350 0 -100 -50 0 50 100 APOGEE Survey: 150,000 near-IR spectra, R ~ 22,500 Gaia-ESO: 100,000 spectra, R > 20,000, 5800-8700 Å (Figure adapted from Melissa Ness) A suite of large-scale surveys are systematically measuring spectra for stars in the Milky Way. 3

Anna Ho, Caltech / LAMOST A suite of large-scale surveys
are systematically measuring spectra for stars in the Milky Way. 4

Anna Ho, Caltech / LAMOST A suite of large-scale surveys
are systematically measuring spectra for stars in the Milky Way. • Goal of these surveys: By dissecting the Milky Way into individual stars, we learn about the galaxy as a whole. 4

Anna Ho, Caltech / The Milky Way is a very
“ordinary” galaxy. 5

“ordinary” galaxy. (Figure adapted from Hans-Walter Rix) 5

“ordinary” galaxy. (Figure adapted from Hans-Walter Rix) The MW can be a “Rosetta Stone” for galaxy formation and evolution. 5

Anna Ho, Caltech / The diversity of surveys is also
an obstacle. 6

Anna Ho, Caltech / Survey A Survey B The diversity
of surveys is also an obstacle. 6

Anna Ho, Caltech / “Labels”: {M, L, Teff, logg, [Fe/H],
[α/Fe], [X/H], …} Survey A Survey B The diversity of surveys is also an obstacle. 6

Anna Ho, Caltech / Survey A Survey B Labels from
the Survey A pipeline Labels from the Survey B pipeline The diversity of surveys is also an obstacle. 6

Anna Ho, Caltech / Survey A Survey B The diversity
of surveys is also an obstacle. Labels from the Survey A pipeline Labels from the Survey B pipeline The Cannon can bring surveys onto the same footing. 6

Anna Ho, Caltech / Overview of the remainder of the
talk • Assumptions & methodology 7

talk • Assumptions & methodology • Highlights 7

talk • Assumptions & methodology • Highlights – Proof of concept: reproduce APOGEE DR10 (Ness et al. 2015) A 7

talk • Assumptions & methodology • Highlights – Proof of concept: reproduce APOGEE DR10 (Ness et al. 2015) – Cross-calibrate APOGEE and LAMOST (Ho et al. 2016) A L A 7

talk • Assumptions & methodology • Highlights – Proof of concept: reproduce APOGEE DR10 (Ness et al. 2015) – Cross-calibrate APOGEE and LAMOST (Ho et al. 2016) – The largest global age map of the Milky Way (Ness et al. 2016, Martig et al. 2016) A L A K A 7

talk • Assumptions & methodology • Highlights – Proof of concept: reproduce APOGEE DR10 (Ness et al. 2015) – Cross-calibrate APOGEE and LAMOST (Ho et al. 2016) – The largest global age map of the Milky Way (Ness et al. 2016, Martig et al. 2016) • Limitations & advantages A A L K A 7

Anna Ho, Caltech / There are two key assumptions underlying
The Cannon. 8

Anna Ho, Caltech / There are two key assumptions underlying
The Cannon. • Assumptions: – Stars with near-identical labels have near-identical spectra 8

Anna Ho, Caltech / • Assumptions: – Stars with near-identical
labels have near-identical spectra – A spectrum is a smooth function of the star’s labels There are two key assumptions underlying The Cannon. 8

labels have near-identical spectra – A spectrum is a smooth function of the star’s labels fn = g ( `n |✓ ) + noise There are two key assumptions underlying The Cannon. 8

labels have near-identical spectra – A spectrum is a smooth function of the star’s labels There are two key assumptions underlying The Cannon. 8 fn = ✓T · `n + noise

Anna Ho, Caltech / Overview of The Cannon methodology •
Assumptions: – Stars with near-identical labels have near-identical spectra – A spectrum is a smooth function of the star’s labels Survey A B 8

labels have near-identical spectra – A spectrum is a smooth function of the star’s labels 1. Training Step: fit a model 2. Test Step: infer labels Overview of The Cannon methodology Survey A B 8

labels have near-identical spectra – A spectrum is a smooth function of the star’s labels 1. Training Step: fit a model Overview of The Cannon methodology Survey A B 2. Test Step: infer labels fn = ✓T · `n + noise 8

labels have near-identical spectra – A spectrum is a smooth function of the star’s labels 1. Training Step: fit a model 2. Test Step: infer labels Overview of The Cannon methodology Survey A B fn = ✓T · `n + noise fn = ✓T · `n + noise 8

Anna Ho, Caltech / Proof of Concept: Reproduce APOGEE DR10
APOGEE DR10 (55,000 stars) Ness, Rix, Hogg, Ho, Zasowski (2015) 9

APOGEE DR10 (55,000 stars) Ness, Rix, Hogg, Ho, Zasowski (2015) Calibration clusters (542 stars) 9

Labels from high-resolution optical spectroscopy Spectra from APOGEE APOGEE DR10 (55,000 stars) Ness, Rix, Hogg, Ho, Zasowski (2015) 9 Calibration clusters (542 stars)

APOGEE DR10 (55,000 stars) Training Step: fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + (quadratic terms) + scatter Ness, Rix, Hogg, Ho, Zasowski (2015) 9 Calibration clusters (542 stars)

APOGEE DR10 (55,000 stars) Training Step: fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + (quadratic terms) + scatter Test Step: fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + (quadratic terms) + scatter Ness, Rix, Hogg, Ho, Zasowski (2015) 9 Calibration clusters (542 stars)

Anna Ho, Caltech / Cross-validation, for a field not included
in training Cannon values ASPCAP values Teff (K) logg (dex) [Fe/H] (dex) 4000 4500 5000 5500 3500 4000 4500 5000 5500 4 5 3 2 1 0 -1 1 3 5 RMS < 63 K RMS < 0.16 dex RMS < 0.06 dex Field: 4255 (l,b) = (60.0, -8.0) 1.0 0.5 0.0 -0.5 -1.0 -1.5 -2.0 -2.5 -3.0 -2.0 -1.0 0.0 1.0 Ness, Rix, Hogg, Ho, Zasowski (2015) 10

Anna Ho, Caltech / The spectral model matches the data.
Ness, Rix, Hogg, Ho, Zasowski (2015) Normalized Flux Wavelength (Å) Wavelength (Å) 1.0 0.6 1.0 0.6 0.8 0.8 1.0 0.8 0.6 1.0 0.8 0.6 15660 15700 15740 16160 15780 16200 16240 16280 11

Anna Ho, Caltech / Ness, Rix, Hogg, Ho, Zasowski (2015)
Teff (K) from The Cannon logg (dex) from The Cannon 4000 3500 5000 4500 5500 5 4 3 2 1 0 -1 12 The Cannon model can reproduce APOGEE DR10.

Anna Ho, Caltech / The Cannon model can reproduce APOGEE
DR10. Ness, Rix, Hogg, Ho, Zasowski (2015) Teff (K) from The Cannon logg (dex) from The Cannon 4000 3500 5000 4500 5500 5 4 3 2 1 0 -1 12 • Far faster than physical modeling • Labels lie near sensible isochrones, despite there being no priors on isochrones except the values of the original training set • Can do this at lower S/N

Anna Ho, Caltech / Cross-Calibration: APOGEE & LAMOST Ho, Ness,
Hogg, Rix, Liu, Yang (2016) Overlap: ~10,000 giants APOGEE LAMOST 14

Hogg, Rix, Liu, Yang (2016) Overlap: ~10,000 giants APOGEE LAMOST Labels: {Teff, logg, [Fe/H], [α/Fe]} 14

Hogg, Rix, Liu, Yang (2016) Overlap: ~10,000 giants APOGEE LAMOST APOGEE DR12: R ~ 22,500 15200-16900 Å 4 parameters (Teff, logg, [M/H], [alpha/M]), radial velocity, 15 element abundances ~100,000 giants Labels: {Teff, logg, [Fe/H], [α/Fe]} 14

Hogg, Rix, Liu, Yang (2016) Overlap: ~10,000 giants APOGEE LAMOST LAMOST DR2: R ~ 1,800 3700-9000 Å 3 parameters (Teff, logg, [Fe/H]), radial velocity 2.2 million stars Labels: {Teff, logg, [Fe/H], [α/Fe]} 14 APOGEE DR12: R ~ 22,500 15200-16900 Å 4 parameters (Teff, logg, [M/H], [alpha/M]), radial velocity, 15 element abundances ~100,000 giants

Hogg, Rix, Liu, Yang (2016) Overlap: ~10,000 giants APOGEE LAMOST Labels: {Teff, logg, [Fe/H], [α/Fe]} 14 APOGEE DR12: R ~ 22,500 15200-16900 Å 4 parameters (Teff, logg, [M/H], [alpha/M]), radial velocity, 15 element abundances ~100,000 giants LAMOST DR2: R ~ 1,800 3700-9000 Å 3 parameters (Teff, logg, [Fe/H]), radial velocity 2.2 million stars

Hogg, Rix, Liu, Yang (2016) Overlap: ~10,000 giants APOGEE LAMOST A B Labels: {Teff, logg, [Fe/H], [α/Fe]} 14 APOGEE DR12: R ~ 22,500 15200-16900 Å 4 parameters (Teff, logg, [M/H], [alpha/M]), radial velocity, 15 element abundances ~100,000 giants LAMOST DR2: R ~ 1,800 3700-9000 Å 3 parameters (Teff, logg, [Fe/H]), radial velocity 2.2 million stars

Hogg, Rix, Liu, Yang (2016) Overlap: ~10,000 giants APOGEE LAMOST Spectra Labels A B Labels: {Teff, logg, [Fe/H], [α/Fe]} 14

Hogg, Rix, Liu, Yang (2016) Overlap: ~10,000 giants APOGEE LAMOST Spectra Labels fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + (quadratic terms) + scatter Training Step: A B Labels: {Teff, logg, [Fe/H], [α/Fe]} 14

Hogg, Rix, Liu, Yang (2016) Overlap: ~10,000 giants APOGEE LAMOST Spectra Labels fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + (quadratic terms) + scatter fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + (quadratic terms) + scatter Training Step: Test Step: A B Labels: {Teff, logg, [Fe/H], [α/Fe]} 14

Anna Ho, Caltech / APOGEE and LAMOST measure inconsistent labels.
Ho, Ness, Hogg, Rix, Liu, Yang (2016) Values from APOGEE DR12 Values from LAMOST, S/N > 60 logg (dex) [Fe/H] (dex) 4000 4400 4800 5200 1.0 1.5 2.0 2.5 3.0 3.5 1.0 1.5 2.0 2.5 3.0 3.5 4000 4400 4800 5200 -1.2 -0.8 -0.4 0.0 0.4 -1.0 -0.5 0.0 0.5 Teff (K) 15 APOGEE 0.05 dex LAMOST 0.15 dex

Anna Ho, Caltech / The Cannon can bring surveys onto
the same scale. Ho, Ness, Hogg, Rix, Liu, Yang (2016) 4000 4000 -1.0 -0.5 0.0 0.5 4500 5000 5500 4500 5000 1.0 2.0 3.0 4.0 1.0 2.0 3.0 4.0 -1.5 -2.0 -2.0 -1.0 0.0 Values from APOGEE DR12 Values from The Cannon, S/N > 60 Teff (K) logg (dex) [Fe/H] (dex) 16

the same scale. Ho, Ness, Hogg, Rix, Liu, Yang (2016) Teff (K) logg (dex) Teff (K) 4000 4000 5000 5000 1 2 3 4 LAMOST APOGEE 17

the same scale. Ho, Ness, Hogg, Rix, Liu, Yang (2016) Teff (K) logg (dex) Teff (K) Teff (K) 4000 4000 4000 5000 5000 5000 1 2 3 4 LAMOST LAMOST + Cannon APOGEE 17

Anna Ho, Caltech / The Cannon can transfer labels from
one survey to another. Ho, Ness, Hogg, Rix, Liu, Yang (2016) 0.3 0.2 0.1 0.0 -0.1 0.0 0.1 0.2 0.3 Values from The Cannon, S/N > 60 [α/M] (dex) 18 4000 4000 -1.0 -0.5 0.0 0.5 4500 5000 5500 4500 5000 1.0 2.0 3.0 4.0 1.0 2.0 3.0 4.0 -1.5 -2.0 -2.0 -1.0 0.0 Values from APOGEE DR12 Teff (K) logg (dex) [Fe/H] (dex)

Anna Ho, Caltech / The Cannon can transfer labels from
one survey to another. Ho, Ness, Hogg, Rix, Liu, Yang (2016) [Fe/H] (dex) from Cannon/LAMOST [α/Fe] (dex) 0.3 0.2 0.4 0.5 0.1 0.0 -0.1 -0.2 -2.0 -1.5 -1.0 -0.5 0.0 0.5 450,000 LAMOST giants 19

APOGEE [α/M] 100,000 giants

APOGEE [α/M] LAMOST [α/M] 100,000 giants + 450,000 giants

Anna Ho, Caltech / The largest age map of the
Milky Way Ness, Hogg, Rix, Martig, Pinsonneault, Ho (2016) Overlap: ~1600 giants Kepler APOGEE 21

Milky Way Overlap: ~1600 giants Kepler APOGEE Spectra Labels 21 Ness, Hogg, Rix, Martig, Pinsonneault, Ho (2016)

Milky Way Overlap: ~1600 giants Kepler APOGEE Spectra Labels 21 Labels: {Teff, logg, [Fe/H], [α/Fe], M} Ness, Hogg, Rix, Martig, Pinsonneault, Ho (2016)

Milky Way Overlap: ~1600 giants Kepler APOGEE Spectra Labels fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + f ( M )n + (quadratic terms) + sca fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + f ( M )n + (quadratic terms) + scatter Training Step: Labels: {Teff, logg, [Fe/H], [α/Fe], M} 21 Ness, Hogg, Rix, Martig, Pinsonneault, Ho (2016)

Milky Way Overlap: ~1600 giants Kepler APOGEE Spectra Labels fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + f ( M )n + (quadratic terms) + sca fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + f ( M )n + (quadratic terms) + scatter Training Step: Test Step: fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + g ( M )n + (quadratic terms) + sca fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + g ( M )n + (quadratic terms) + scatter 21 Labels: {Teff, logg, [Fe/H], [α/Fe], M} Ness, Hogg, Rix, Martig, Pinsonneault, Ho (2016)

Milky Way Overlap: ~1600 giants Kepler APOGEE fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + f ( M )n + (quadratic terms) + sca fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + f ( M )n + (quadratic terms) + scatter Training Step: Test Step: fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + g ( M )n + (quadratic terms) + sca fn = a + b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + g ( M )n + (quadratic terms) + scatter 21 Labels: {Teff, logg, [Fe/H], [α/Fe], M} Ness, Hogg, Rix, Martig, Pinsonneault, Ho (2016)

Anna Ho, Caltech / 4000 4500 5000 TeffASPCAP 4000 4500
5000 The Cannon Labels Teff (K) 1.5 2.0 2.5 3.0 3.5 loggKEPLER 1.5 2.0 2.5 3.0 3.5 log g (dex) 1.0 0.5 0.0 0.5 [Fe/H]ASPCAP 1.0 0.5 0.0 0.5 [Fe/H] (dex) 0.1 0.0 0.1 0.2 0.3 [↵/Fe]ASPCAP 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) 0. log10 0.2 0.0 0.2 0.4 0.6 log 200 100 0 100 200 Teff 0 50 100 150 200 250 300 350 Number of Stars bias = 0.5 rms = 26.6 0.5 0.0 0.5 logg 0 50 100 150 200 250 300 350 400 bias = 0.007 rms = 0.06 0.2 0.1 0.0 0.1 0.2 [Fe/H] 0 50 100 150 200 250 300 350 bias = -0.0 rms = 0.02 0.1 0.0 0.1 ↵/Fe 0 50 100 150 200 250 300 350 bias = 0.0 rms = 0.02 0.4 0 50 100 150 200 250 300 bias rms 4000 4500 5000 TeffASPCAP 4000 4500 5000 The Cannon Labels Teff (K) 1.5 2.0 2.5 3.0 3.5 loggKEPLER 1.5 2.0 2.5 3.0 3.5 log g (dex) 1.0 0.5 0.0 0.5 [Fe/H]ASPCAP 1.0 0.5 0.0 0.5 [Fe/H] (dex) 0.1 0.0 0.1 0.2 0.3 [↵/Fe]ASPCAP 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) lo 0.2 0.0 0.2 0.4 0.6 200 100 0 100 200 Teff 0 50 100 150 200 250 300 350 Number of Stars bias = 0.5 rms = 26.6 0.5 0.0 0.5 logg 0 50 100 150 200 250 300 350 400 bias = 0.007 rms = 0.06 0.2 0.1 0.0 0.1 0.2 [Fe/H] 0 50 100 150 200 250 300 350 bias = -0.0 rms = 0.02 0.1 0.0 0.1 ↵/Fe 0 50 100 150 200 250 300 350 bias = 0.0 rms = 0.02 0 50 100 150 200 250 300 4000 4500 5000 TeffASPCAP 4000 4500 5000 The Cannon Labels Teff (K) 1.5 2.0 2.5 3.0 3.5 loggKEPLER 1.5 2.0 2.5 3.0 3.5 log g (dex) 1.0 0.5 0.0 0.5 [Fe/H]ASPCAP 1.0 0.5 0.0 0.5 [Fe/H] (dex) 0.1 0.0 0.1 0.2 0 [↵/Fe]ASPCA 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) 200 100 0 100 200 Teff 0 50 100 150 200 250 300 350 Number of Stars bias = 0.5 rms = 26.6 0.5 0.0 0.5 logg 0 50 100 150 200 250 300 350 400 bias = 0.007 rms = 0.06 0.2 0.1 0.0 0.1 0.2 [Fe/H] 0 50 100 150 200 250 300 350 bias = -0.0 rms = 0.02 0.1 0.0 0.1 ↵/Fe 0 50 100 150 200 250 300 350 bias = 0.0 rms = 0.02 0.5 0.0 0.5 Fe/H]ASPCAP [Fe/H] (dex) 0.1 0.0 0.1 0.2 0.3 [↵/Fe]ASPCAP 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) 0.2 0.0 0.2 0.4 0.6 log10 massKEPLER 0.2 0.0 0.2 0.4 0.6 log10 mass (M ) 0.5 0.0 0.5 1.0 1.5 log10 ageKEPLER 0.5 0.0 0.5 1.0 1.5 Derived log10 age (Gyr) as = -0.0 ms = 0.02 300 350 bias = 0.0 rms = 0.02 250 300 bias = 0.0 rms = 0.07 300 350 bias = 0.02 rms = 0.21 10 2 10 1 100 Number of Stars 1.0 0.5 0.0 0.5 [Fe/H]ASPCAP 1.0 0.5 0.0 0.5 [Fe/H] (dex) 0.1 0.0 0.1 0.2 0.3 [↵/Fe]ASPCAP 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) 0.2 0.0 0.2 0.4 0.6 log10 massKEPLER 0.2 0.0 0.2 0.4 0.6 log10 mass (M ) 0.5 0.0 0.5 1.0 1.5 log10 ageKEPLER 0.5 0.0 0.5 1.0 1.5 Derived log10 age (Gyr) 300 350 bias = -0.0 rms = 0.02 300 350 bias = 0.0 rms = 0.02 250 300 bias = 0.0 rms = 0.07 300 350 bias = 0.02 rms = 0.21 10 2 10 1 100 Number of Stars Values from The Cannon Teff (K) logg (dex) [Fe/H] (dex) [α/Fe] (dex) log10 mass (M⊙ ) ASPCAP ASPCAP Kepler Kepler ASPCAP ±27 K ±0.06 dex ±0.02 dex ±0.02 dex ±0.07 dex (20%) 22 -0.1 0.0 0.1 0.2 0.3 4000 -1.0 -0.5 0.0 0.5 4500 5000 2.0 3.0 2.5 1.5 3.5 Ness, Hogg, Rix, Martig, Pinsonneault, Ho (2016) 4000 4500 5000 1.5 2.5 3.5 -0.5 0.0 0.5 0.2 0.0 -0.2 0.4 0.6 0.1 0.3 -0.2 0.2 0.6 Cross validation with the APOKASC sample

Anna Ho, Caltech / 4000 4500 5000 TeffASPCAP 4000 4500
5000 The Cannon Labels Teff (K) 1.5 2.0 2.5 3.0 3.5 loggKEPLER 1.5 2.0 2.5 3.0 3.5 log g (dex) 1.0 0.5 0.0 0.5 [Fe/H]ASPCAP 1.0 0.5 0.0 0.5 [Fe/H] (dex) 0.1 0.0 0.1 0.2 0.3 [↵/Fe]ASPCAP 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) 0. log10 0.2 0.0 0.2 0.4 0.6 log 200 100 0 100 200 Teff 0 50 100 150 200 250 300 350 Number of Stars bias = 0.5 rms = 26.6 0.5 0.0 0.5 logg 0 50 100 150 200 250 300 350 400 bias = 0.007 rms = 0.06 0.2 0.1 0.0 0.1 0.2 [Fe/H] 0 50 100 150 200 250 300 350 bias = -0.0 rms = 0.02 0.1 0.0 0.1 ↵/Fe 0 50 100 150 200 250 300 350 bias = 0.0 rms = 0.02 0.4 0 50 100 150 200 250 300 bias rms 4000 4500 5000 TeffASPCAP 4000 4500 5000 The Cannon Labels Teff (K) 1.5 2.0 2.5 3.0 3.5 loggKEPLER 1.5 2.0 2.5 3.0 3.5 log g (dex) 1.0 0.5 0.0 0.5 [Fe/H]ASPCAP 1.0 0.5 0.0 0.5 [Fe/H] (dex) 0.1 0.0 0.1 0.2 0.3 [↵/Fe]ASPCAP 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) lo 0.2 0.0 0.2 0.4 0.6 200 100 0 100 200 Teff 0 50 100 150 200 250 300 350 Number of Stars bias = 0.5 rms = 26.6 0.5 0.0 0.5 logg 0 50 100 150 200 250 300 350 400 bias = 0.007 rms = 0.06 0.2 0.1 0.0 0.1 0.2 [Fe/H] 0 50 100 150 200 250 300 350 bias = -0.0 rms = 0.02 0.1 0.0 0.1 ↵/Fe 0 50 100 150 200 250 300 350 bias = 0.0 rms = 0.02 0 50 100 150 200 250 300 4000 4500 5000 TeffASPCAP 4000 4500 5000 The Cannon Labels Teff (K) 1.5 2.0 2.5 3.0 3.5 loggKEPLER 1.5 2.0 2.5 3.0 3.5 log g (dex) 1.0 0.5 0.0 0.5 [Fe/H]ASPCAP 1.0 0.5 0.0 0.5 [Fe/H] (dex) 0.1 0.0 0.1 0.2 0 [↵/Fe]ASPCA 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) 200 100 0 100 200 Teff 0 50 100 150 200 250 300 350 Number of Stars bias = 0.5 rms = 26.6 0.5 0.0 0.5 logg 0 50 100 150 200 250 300 350 400 bias = 0.007 rms = 0.06 0.2 0.1 0.0 0.1 0.2 [Fe/H] 0 50 100 150 200 250 300 350 bias = -0.0 rms = 0.02 0.1 0.0 0.1 ↵/Fe 0 50 100 150 200 250 300 350 bias = 0.0 rms = 0.02 0.5 0.0 0.5 Fe/H]ASPCAP [Fe/H] (dex) 0.1 0.0 0.1 0.2 0.3 [↵/Fe]ASPCAP 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) 0.2 0.0 0.2 0.4 0.6 log10 massKEPLER 0.2 0.0 0.2 0.4 0.6 log10 mass (M ) 0.5 0.0 0.5 1.0 1.5 log10 ageKEPLER 0.5 0.0 0.5 1.0 1.5 Derived log10 age (Gyr) as = -0.0 ms = 0.02 300 350 bias = 0.0 rms = 0.02 250 300 bias = 0.0 rms = 0.07 300 350 bias = 0.02 rms = 0.21 10 2 10 1 100 Number of Stars 1.0 0.5 0.0 0.5 [Fe/H]ASPCAP 1.0 0.5 0.0 0.5 [Fe/H] (dex) 0.1 0.0 0.1 0.2 0.3 [↵/Fe]ASPCAP 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) 0.2 0.0 0.2 0.4 0.6 log10 massKEPLER 0.2 0.0 0.2 0.4 0.6 log10 mass (M ) 0.5 0.0 0.5 1.0 1.5 log10 ageKEPLER 0.5 0.0 0.5 1.0 1.5 Derived log10 age (Gyr) 300 350 bias = -0.0 rms = 0.02 300 350 bias = 0.0 rms = 0.02 250 300 bias = 0.0 rms = 0.07 300 350 bias = 0.02 rms = 0.21 10 2 10 1 100 Number of Stars 5 3.0 3.5 PLER dex) 1.0 0.5 0.0 0.5 [Fe/H]ASPCAP 1.0 0.5 0.0 0.5 [Fe/H] (dex) 0.1 0.0 0.1 0.2 0.3 [↵/Fe]ASPCAP 0.1 0.0 0.1 0.2 0.3 [↵/Fe] (dex) 0.2 0.0 0.2 0.4 0.6 log10 massKEPLER 0.2 0.0 0.2 0.4 0.6 log10 mass (M ) 0.5 0.0 0.5 1.0 1.5 log10 ageKEPLER 0.5 0.0 0.5 1.0 1.5 Derived log10 age (Gyr) 300 350 bias = -0.0 rms = 0.02 300 350 bias = 0.0 rms = 0.02 250 300 bias = 0.0 rms = 0.07 300 350 bias = 0.02 rms = 0.21 10 2 10 1 100 Number of Stars Cross validation with the APOKASC sample Values from The Cannon Teff (K) logg (dex) [Fe/H] (dex) [α/Fe] (dex) log10 mass (M⊙ ) log10 age (Gyr) ASPCAP ASPCAP Kepler Kepler Kepler ASPCAP derived ±27 K ±0.06 dex ±0.02 dex ±0.02 dex ±0.07 dex (20%) ±0.21 dex (40%) 22 -0.1 0.0 0.1 0.2 0.3 4000 -1.0 -0.5 0.0 0.5 4500 5000 2.0 3.0 2.5 1.5 3.5 Ness, Hogg, Rix, Martig, Pinsonneault, Ho (2016) 4000 4500 5000 1.5 2.5 3.5 -0.5 0.0 0.5 0.2 0.0 -0.2 0.4 0.6 0.1 0.3 -0.2 0.2 0.6 0.0 -0.5 0.5 1.5 1.0 0.5 1.5

Anna Ho, Caltech / Galactic Centre 73 Galactic Radius (kpc)
Galactic Height (kpc) -5 0 5 10 15 20 -6 -4 -2 0 2 4 6 Mean Age (Gyr) 9.6 3.2 6.4 4.8 8.0 The largest age map of the Milky Way Anna Ho, Caltech / 23 Ness, Hogg, Rix, Martig, Pinsonneault, Ho (2016)

Anna Ho, Caltech / Limitations & advantages • Limitations –
The training set – Treating the pixels as independent – No partial or noisy labels – Quadratic order polynomial 24 • Strengths – Data-driven (no physical models required) – Fast – Performs well at low-SNR – Can handle noise and missing data in spectra – Model is interpretable

Anna Ho, Caltech / Limitations & advantages fn = a
+ b ( Te↵)n + c (log g )n + d ([ Fe/H ])n + e ([ ↵/Fe ])n + (quadratic terms) + scatter 24 • Strengths – Data-driven (no physical models required) – Fast – Performs well at low-SNR – Can handle noise and missing data in spectra – Model is interpretable

The code is open source and available: pip install TheCannon
Documentation & Tutorial: https://annayqho.github.io/TheCannon/ 25 Anna Ho, Caltech / Read more: Ness et al. 2015, arXiv: 1501.07604 Ness et al. 2016, arXiv:1511.08204 Hogg et al. 2016, arXiv:1601.05413 Ho et al. 2016, arXiv:1602.00303 Casey et al. 2016, arXiv:1603.03040 Original paper on The Cannon and APOGEE Spectroscopic masses & ages Chemical tagging APOGEE/LAMOST cross-calibration The Cannon 2: compressed sensing & detailed element abundances [email protected]

Anna Ho, Caltech / Where is logg encoded in an
APOGEE spectrum? 25 Figure from Melissa Ness

Anna Ho, Caltech / Where is mass encoded in an
APOGEE spectrum? 25 Figure from Melissa Ness

Anna Ho, Caltech / Where is info encoded in a
LAMOST spectrum? 25 4000 Å 5000 Å 6000 Å 7000 Å 8000 Å 9000 Å

The Cannon: Data-Driven Spectral Modeling in th...

The Cannon: Data-Driven Spectral Modeling in the Era of Large Stellar Surveys

More Decks by Anna Ho

Other Decks in Science

Featured

Transcript