A Typical User's Ground-Level Perspective on Machine Learning in Astronomy

A Typical User's Ground-Level Perspective on Machine Learning in Astronomy
James R. A. Davenport jradavenport UNIVERSITY OF WASHINGTON !1

A Typical User's Ground-Level Perspective on Machine Learning in Astronomy
jradavenport !2 Number of ADS abstracts including “machine learning”

jradavenport !3 Machine Learning is Boring*

jradavenport !4 *normal, lots of people doing it. And this
is a good thing! (i.e. don’t be scared!) Machine Learning is Boring*

jradavenport !5 (i.e. can run most algorithms on your laptop)
Yet sample size is big enough that interesting/rare things can be found Most of our work isn’t “big data”

jradavenport !6 Our data is becoming better suited for ML
• big datasets (Mario’s talk), especially Gaia! • easier than ever to get data (Vizier/Xmatch, ADS, journals, Github, Zenodo…) • value-added datasets for surveys (e.g. stellar parameters from SDSS)

jradavenport !7 ML is easier than ever to use •
robust, open source libraries available • many programming languages • many domain (astro) experts & workshops available

jradavenport !8

jradavenport !9 also astroML!   (see Brigitta’s talk later)

jradavenport !10 Problems that ML is good for:

jradavenport !11 Each algorithm has speciﬁc use cases (sometimes: just
try them all!) Clustering Classiﬁcation

jradavenport !12 Goal: demonstrate 3 real problems that ML can
be used for

jradavenport !13 Example 1: Clustering starspot evolution tracks Kepler 17

jradavenport !14 Kepler 17 Example 1: Clustering starspot evolution tracks

jradavenport !15 Kepler 17 Example 1: Clustering starspot evolution tracks

jradavenport !16 Question: Which tracks are starspots, how do they
emerge/decay? Kepler 17 Example 1: Clustering starspot evolution tracks

emerge/decay? Manual clustering? Kepler 17 Example 1: Clustering starspot evolution tracks

emerge/decay? Manual clustering? Kepler 17 Example 1: Clustering starspot evolution tracks Aside: “training data” is super   important for many problems!

Example 1: Clustering starspot evolution tracks jradavenport !19 DBSCAN Gaussian
Mixture

Example 1: Clustering starspot evolution tracks jradavenport !20 DBSCAN Gaussian
Mixture need to predeﬁne Nclusters?

jradavenport !21 DBSCAN: Density-based spatial clustering of applications with noise
Kepler 17 Example 1: Clustering starspot evolution tracks

jradavenport !22 DBSCAN: Density-based spatial clustering of applications with noise
One starspot’s emergence/decay! Kepler 17 See more upcoming work by Kosuke Namekata Example 1: Clustering starspot evolution tracks

Time Flux jradavenport !23 Example 2: Modeling a complex stellar
ﬂare Flare! Question: Is there (quasi-) sinusoidal behavior in the ﬂare decay? https://github.com/RileyWClarke/QPP-GP

Time Flux jradavenport !24 Flare! Question: Is there (quasi-) sinusoidal
behavior in the ﬂare decay? https://github.com/RileyWClarke/QPP-GP versus Example 2: Modeling a complex stellar ﬂare

jradavenport !25 Time Flux Only study decay Residual Flux Time
Could fit with a damped   harmonic oscillator Question: Is there (quasi-) sinusoidal behavior in the flare decay? Difficult to classify sinusoidal vs. stochastic,  & strict vs quasi sinusoid https://github.com/RileyWClarke/QPP-GP Example 2: Modeling a complex stellar flare

jradavenport !26 Residual Flux https://github.com/RileyWClarke/QPP-GP Celerite Gaussian Process Example 2:
Modeling a complex stellar ﬂare

jradavenport !27 Residual Flux Gaussian Process https://github.com/RileyWClarke/QPP-GP Celerite Use an
exponential +   simple-harmonic-oscillator kernel Example 2: Modeling a complex stellar ﬂare

jradavenport !28 https://github.com/RileyWClarke/QPP-GP No Period Candidate Periods Objective search Robust
uncertainties!  (with MCMC) Gaussian Process Example 2: Modeling a complex stellar ﬂare

jradavenport !29 Example 3: Modeling photometric metallicities From APOGEE [3.4]
- [4.6] Observation: [Fe/H] gradient in stars https://github.com/jradavenport/ingot/ Gaia DR2 + (WISE + 2MASS) + APOGEE

jradavenport !30 From APOGEE [3.4] - [4.6] Observation: [Fe/H] gradient
in stars We could build a complex   polynomial or spline model https://github.com/jradavenport/ingot/ Example 3: Modeling photometric metallicities

jradavenport !31 Observation: [Fe/H] gradient in stars From APOGEE [3.4]
- [4.6] We could build a complex   polynomial or spline model Tedious, and diﬃcult to add  additional dimensions! https://github.com/jradavenport/ingot/ Example 3: Modeling photometric metallicities

jradavenport !32 Observation: [Fe/H] gradient in stars From APOGEE [3.4]
- [4.6] We could build a complex   polynomial or spline model Tedious, and diﬃcult to add  additional dimensions! Or use a simple, ﬂexible   ML model! https://github.com/jradavenport/ingot/ Example 3: Modeling photometric metallicities

jradavenport !33 KNearestNeighbors Xdata = (G-J, W1-W2)  Ydata = [Fe/H]
Data Fit https://github.com/jradavenport/ingot/ Example 3: Modeling photometric metallicities

jradavenport !34 KNearestNeighbors Xdata = (G-J, W1-W2)  Ydata = [Fe/H]
Data Fit https://github.com/jradavenport/ingot/ Example 3: Modeling photometric metallicities

jradavenport !35 KNearestNeighbors Result: a simple to use “surface”,  no
tweaking for shape/order,   extend to addtional dimensions easily 1 Million new stars with no spectra https://github.com/jradavenport/ingot/ Example 3: Modeling photometric metallicities

jradavenport !36 KNearestNeighbors Result: a simple to use “surface”,  no
tweaking for shape/order,   extend to addtional dimensions easily 1 Million new stars with no spectra https://github.com/jradavenport/ingot/ Example 3: Modeling photometric metallicities

!37 Conclusions Clustering Regression GP’s ML is easier and more
“boring” than ever! jradavenport

A Typical User's Ground-Level Perspective on Ma...

A Typical User's Ground-Level Perspective on Machine Learning in Astronomy

More Decks by James Davenport

Other Decks in Science

Featured

Transcript