PromptWorks Talk Tuesdays: Dustin Ingram 8/23/16 "Detecting Asteroids with Neural Networks in TensorFlow"

Slide 1

Slide 1 text

Detecting Asteroids with Neural Networks in TensorFlow

Slide 2

Slide 2 text

Outline • What's the goal? • What's the data? • Getting started • Building a feature set • Building the neural network • Training the network • Results

Slide 3

Slide 3 text

Goal Build and train a neural network to correctly identify asteroids in astrophotography data.

Slide 4

Slide 4 text

So... How do we do it?

Slide 5

Slide 5 text

No content

Slide 6

Slide 6 text

So... How do we really do it?

Slide 7

Slide 7 text

The data The Sloan Digital Sky Survey: One of the most ambitious and inﬂuential surveys in the history of astronomy."

Slide 8

Slide 8 text

The data • Approx 35% of sky • Largest uniform survey of the sky yet accomplished • Data is freely available online • Each image is 922x680 pixels

Slide 9

Slide 9 text

No content

Slide 10

Slide 10 text

No content

Slide 11

Slide 11 text

An example asteroid

Slide 12

Slide 12 text

No content

Slide 13

Slide 13 text

How does this work? This exploits a property of CCDs: • SDSS telescopes use ﬁve different ﬁlters • They are not simultaneous • Moving objects appear in different locations • Always the same order

Slide 14

Slide 14 text

No content

Slide 15

Slide 15 text

Getting started Getting the initial training data: * Small tool to extract potential candidates from full-scale images * Extremely naive, approx 100:5 false positives to actual positives * Very low false negatives (approx 1:1000) * Incredibly slow (complex scan of 100Ks of potentials) * Manual classiﬁcation, somewhat slow * Yields approx 250 valid items, 500 invalid items

Slide 16

Slide 16 text

The feature set Good ideas for features: • Ratio valid hues to non-valid hues • Best possible cluster collinearity • Best possible average cluster distance

Slide 17

Slide 17 text

Feature: Ratio valid hues to non- valid hues The goal here is to match the colors, a.k.a. "hues": • First step: convert to HSV space • For pixels in the valid value-spectrum (0.25 < v < 0.90) • How many are within 2 standard deviations from an optimal value? • What's the ratio to ones that aren't?

Slide 18

Slide 18 text

An example HSV plot

Slide 19

Slide 19 text

No content

Slide 20

Slide 20 text

Feature: Best possible cluster collinearity k-means clustering • Using the valid hues from the previous feature • Attempts to cluster n points into k groups • Here, k=3 • Produces three centroids

Slide 21

Slide 21 text

No content

Slide 22

Slide 22 text

k-means clustering of the same asteroid

Slide 23

Slide 23 text

No content

Slide 24

Slide 24 text

Feature: Best possible cluster collinearity • The property of a set of points which lie on the same line • Iterate the k-means clustering approx. 20 times • The resulting metric is the ratio between the actual collinearity and the maximum potential colinearity

Slide 25

Slide 25 text

Feature: Best possible cluster collinearity • Given points a, b, and c: colin = (c.x - a.x) * (b.y - a.y) + (c.y - a.y) * (a.x - b.x)

Slide 26

Slide 26 text

No content

Slide 27

Slide 27 text

k-means clustering of the same asteroid

Slide 28

Slide 28 text

No content

Slide 29

Slide 29 text

An example non- asteroid

Slide 30

Slide 30 text

No content

Slide 31

Slide 31 text

k-means clustering of the same non- asteroid

Slide 32

Slide 32 text

No content

Slide 33

Slide 33 text

Feature: Best possible average cluster distance • Using the same k-means clusters from the previous features • What is the average distance from any point in a cluster to the center of the cluster?

Slide 34

Slide 34 text

No content

Slide 35

Slide 35 text

k-means clustering of the same asteroid

Slide 36

Slide 36 text

An example non- asteroid

Slide 37

Slide 37 text

No content

Slide 38

Slide 38 text

k-means clustering of the same non- asteroid

Slide 39

Slide 39 text

No content

Slide 40

Slide 40 text

A comparison of all three features | Hue Ratio | Collinearity | Cluster distance Asteroid | 0.687 | 0.046 | 0.432 Non-asteroid | 0.376 | 0.388 | 0.557

Slide 41

Slide 41 text

A comparison of all three features We see that the for a valid asteroid: • The hue ratio is much higher • The colinearity metric is much lower • The mean cluster disance is smaller

Slide 42

Slide 42 text

Ok... where's the AI? This type of classiﬁcation is extrememly well suited for a neural network:

Slide 43

Slide 43 text

Ok... where's the AI? • We have a clear set of training data • The output is either afﬁrmative (1) or negative (0) • Each of the input features can be resolved to a 0 -> 1 metric • There is a small amount of input features which can accurately deﬁne an item

Slide 44

Slide 44 text

Building the neural network The resulting neural network: • Performs binary classiﬁcation; • Use supervised learning; • Uses a backpropagation trainer; • Is deep;

Slide 45

Slide 45 text

Building the neural network Four layers: • Input layer; • Two hidden layers; • Output layer

Slide 46

Slide 46 text

Building the neural network Total of 154 "neurons": • 3 input neurons (hue ratio, collinearity metric, distance metric) • 150 hidden neurons (100 in ﬁrst layer, 50 in second) • 1 output neuron (1 if valid asteroid, 0 if invalid)

Slide 47

Slide 47 text

The resulting neural network

Slide 48

Slide 48 text

No content

Slide 49

Slide 49 text

$ head astro.train 0.72, 0.0230326797386, 0.265427036314, 1.0 0.223404255319, 0.453424956758, 0.620237280488, 0.0 0.625954198473, 0.282509136048, 0.489543705893, 0.0 0.297297297297, 0.217278447678, 0.456831265365, 0.0 0.526315789474, 0.125389748718, 0.52048369676, 1.0 0.4, 0.430241745731, 0.597850990407, 0.0 0.0797872340426, 0.375153031291, 0.601415832623, 0.0 0.403361344538, 0.268341944886, 0.485098390444, 0.0 0.592356687898, 0.34747482416, 0.559112235938, 0.0 0.0976744186047, 0.0213790202131, 0.586822094967, 0.0

Slide 50

Slide 50 text

Load training data import pandas as pd df_train = pd.read_csv( tf.gfile.Open('./astro.train'), names=[ 'hue_rat', 'col_min', 'dis_min', 'label' ], skipinitialspace=True)

Slide 51

Slide 51 text

Turn label into boolean df_train['label'] = ( df_train["astro"].apply( lambda x: x == 1.0 ) ).astype(int)

Slide 52

Slide 52 text

Build an estimator import tensorflow as tf def build_estimator(model_dir): hue_rat = tf.contrib.layers.real_valued_column("hue_rat") col_min = tf.contrib.layers.real_valued_column("col_min") dis_min = tf.contrib.layers.real_valued_column("dis_min") return tf.contrib.learn.DNNClassifier( model_dir=model_dir, feature_columns=[hue_rat, col_min, dis_min], hidden_units=[100, 50] )

Slide 53

Slide 53 text

Input function def input_fn(df): feature_cols = { k: tf.constant(df[k].values) for k in [ 'hue_rat', 'col_min', 'dis_min' ] } label = tf.constant(df['label'].values) return feature_cols, label

Slide 54

Slide 54 text

Build & Fit the model m = build_estimator('./model') m.fit(input_fn=lambda: input_fn(df_train), steps=200)

Slide 55

Slide 55 text

Training the network • Approx 250 valid items; • Approx 500 invalid items; • Trained for 200 steps; • Took < 1 minute;

Slide 56

Slide 56 text

Evaluating the model results = m.evaluate( input_fn=lambda: input_fn(df_test_1), steps=1 )

Slide 57

Slide 57 text

Results • Trial 1: 99.2973% accuracy • Trial 2: 94.4056% accuracy • Trial 3: 94.7644% accuracy

Slide 58

Slide 58 text

Why use features? MNIST is a database of handwritten digits:

Slide 59

Slide 59 text

Why use features? 28x28 pixels = 784 numbers between 0 and 1: df_train = [0.0, 0.1, 0.9, ..., 1.0]

Slide 60

Slide 60 text

Why use features? Astro data: * 40x40 pixels * RGB space 40*40*3 = 4800

Slide 61

Slide 61 text

Conclusion • Using a neural network allows us to do it faster, and more accurately • Need to spend time coming up with good features for the data • TensorFlow is really nice (and fast!)

Slide 62

Slide 62 text

References • https://www.tensorﬂow.org/ • http://www.sdss.org/ • http://en.wikipedia.org/wiki/Sloan_Digital_Sky_Survey • http://en.wikipedia.org/wiki/Collinearity • http://en.wikipedia.org/wiki/K-means_clustering