Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Convnets + Galaxies

Ryan Keisler
January 15, 2015
220

Convnets + Galaxies

from Computing the Universe, January 2015, UC Berkeley. More info at http://stanford.edu/~rkeisler/gz/ .

Ryan Keisler

January 15, 2015
Tweet

Transcript

  1. Galaxy Zoo 2: detailed morphological classifications for ! 304,122 galaxies

    from the Sloan Digital Sky Survey! ! K. Willett et al 2013 • ~300e3 galaxies (mr > 17) • ~16e6 classifications • ~50 classifications / galaxy 100s of 1000s of volunteers!
  2. 0 B B B B B B B B B

    B @ n1 . . . . . . . . . . . . n37 1 C C C C C C C C C C A 3 x 96 x 96 = 27648 numbers 37 numbers People (even “pro astronomers”) disagree, so there is no single classification per galaxy.
 
 The distribution of votes is the “classification”.
  3. 0 B B B B B B B B B

    B @ n1 . . . . . . . . . . . . n37 1 C C C C C C C C C C A 3 x 96 x 96 = 27648 numbers 37 numbers Thanks for the labels, volunteers! ! Now time for some supervised machine learning. some algorithm
  4. 0 B B B B B B B B B

    B @ n1 . . . . . . . . . . . . n37 1 C C C C C C C C C C A 3 x 96 x 96 = 27648 numbers 37 numbers What algorithm? some algorithm
  5. 0 B B B B B B B B B

    B @ n1 . . . . . . . . . . . . n37 1 C C C C C C C C C C A 3 x 96 x 96 = 27648 numbers 37 numbers An early try: PCA + RF PCA Random Forest Regressor
  6. 0 B B B B B B B B B

    B @ n1 . . . . . . . . . . . . n37 1 C C C C C C C C C C A 3 x 96 x 96 = 27648 numbers 37 numbers What algorithm? some algorithm
  7. - “Analyzing complex image data? 
 Use a deep, convolutional

    neural network.” etc. lots of buzz over the past 2 years
  8. What can they do? classification localization! ! …building block for

    
 machine vision pipelines Russakovsky et al 2014 Karpathy et al 2014
  9. What is a 
 deep convolutional neural net? Self-learn a

    set of multi-channel convolutional kernels.
 (tiles, filter bank) Take advantage of approximate translational invariance. Fewer parameters to learn.
  10. • number of layers • width of layers • size

    of conv. kernels • pooling stride • choice of non-linear fn’s. • dropout • learning rates • … Lots of hyper-parameters ! ! ! ! ! ! ! ! Convnets might be overkill for a domain as “limited” as astronomical images. Computationally expensive to train (GPU-days or CPU-weeks) Cons
  11. neuron 0 neuron 1 neuron 2 neuron 3 neuron 4

    neuron 5 neuron 6 neuron 7 neuron 8 neuron 9 …
  12. Astro. Applications? • objective (non-human) morphology • finding strong gravitational

    lenses • image-based redshifts (as opposed to photo-z.) • spectrograms or other 2d data • extending any of above to LSST-like scale • ?
  13. “Smooth Galaxies” movie! 1000 nearby spirals 15 frames (galaxies) per

    second Subsequent frames chosen to be close in 
 morphology space and image space. http://vimeo.com/86254924