hybrid-vocal-classifier SciPy 2017

github.com/NickleDave/hybrid-vocal-classifier @nicholdav David Nicholson graduate student, neuroscience Emory University

• Introduction • • • Methods (of methods talk) •
Results • spoiler alerts: neural nets win

• a library that automates labeling vocalizations • what it
is not: • Shazam for songbirds • what it is: • like speech to text but for songbirds

• Songbirds: a model system for understanding how the brain
learns speech and similar motor skills • learn their vocalizations by social interaction, from a tutor Photo: Jon Sakata. Spectrogram: Dooling lab

• Songbirds: a model system for studying how the brain
learns speech and similar motor skills • learn their vocalizations by social interaction, from a tutor • must acquire their song during a sensitive period in development Photo: Jon Sakata. Spectrogram: Dooling lab

• Songbirds: a model system for studying how the brain
learns speech and similar motor skills • learn their vocalizations by social interaction, from a tutor • must acquire their song during a sensitive period in development • perfect their song through a process of sensorimotor learning Photo: Jon Sakata. Spectrogram: Dooling lab

• Birdsong • consists of elements called syllables • segment
sound file into syllables by threshold crossings of amplitude

• Each bird’s song similar to song of its tutor
• But each individual of a species will have a unique song • So syllable “c” for Bengalese finch #1 is not syllable “c” for Bengalese finch #2

adapted from Tumer Brainard 2007

• Adult song is actively maintained by error correction Sober
Brainard 2009

• Problem: • Birds sing 100s of songs a day
• Many more than can be labeled by hand

• Many more than can be labeled by hand • Previous work: • Sound Analysis Pro (soundanalysispro.com): • Open source

• Many more than can be labeled by hand • Previous work: • Sound Analysis Pro (soundanalysispro.com): • Open source • Avoids labeling, produces similarity scores similarity spectrogram 1 spectrogram 2 100% 0%

• Many more than can be labeled by hand • Previous work: • Sound Analysis Pro (soundanalysispro.com): • Open source • Avoids labeling, produces similarity scores • Can use output to label by clustering • Clustering doesn’t work very well for some bird species similarity spectrogram 1 spectrogram 2 100% 0%

• Many more than can be labeled by hand • Previous work: • Sound Analysis Pro • Other machine learning methods applied to Bengalese finch song • K nearest neighbors (k-NN) • Support Vector Machines (SVM) • Convolutional Neural Network (CNN)

• Many more than can be labeled by hand • Previous work: • Sound Analysis Pro • Other machine learning methods applied to Bengalese finch song • Hard to compare different machine learning methods • not all open-source • not all well-documented software • very little in the way of publicly available repository of song

• • open source, Pure Python • built on numpy/scipy
stack • implements previously proposed approaches: • SVM and k-NN via scikit-learn • neural nets using Keras • easy to use, run on YAML config file • released with a large data set • hand-labeled data • well-segmented song • days, ~20k data points/day

• • goals of library: • make it easy to
label song in an automated way import hvc hvc.extract(‘extract.config.yml’) hvc.select(‘select.config.yml’) hvc.predict(‘predict.config.yml’)

• • goals of library: • make it easy to
label song in an automated way • make it easy to: • compare different previously-proposed machine learning methods • make it easy to test new machine learning methods

• Comparing previous models: • k-Nearest Neighbors: http://www.utsa.edu/troyerlab/software.html https://en.wikipedia.org/wiki/K-nearest_neighbors_algorithm#/media/File:KnnClassification.svg Antti
Ajanki, Wikipedia, CC 3.0

• Comparing previous models: • k-Nearest Neighbors: http://www.utsa.edu/troyerlab/software.html 50 ms

• Comparing previous models: • Support Vector Machine Tachibana et
al. 2014 https://en.wikipedia.org/wiki/Support_vector_machine#/media/File:Svm_max_sep_hyperplane_with_margin.png

• Comparing previous models: • Support Vector Machine Tachibana et
al. 2014 • features: • average spectra, cepstra plus • many from CUIDADO feature set (Peeters 2004): • spectral centroid • spectral spread • etc. https://en.wikipedia.org/wiki/Support_vector_machine#/media/File:Svm_max_sep_hyperplane_with_margin.png

• Comparing previous models: • Convolutional neural net • Koumura
Okanoya 2016

• Comparing previous models: • Convolutional neural net • architecture
• convolutional layer • max-pooling • “window” layer • goal: segmentation + classification

• Comparing previous models: • Convolutional neural net • open
code + data!!! ty!

• For comparing models • Plot learning curves • accuracy
v. training data (# of hand-labeled songs) • I want the best model for the least data

v. training data (# of hand-labeled songs) • I want the best model for the least data • 5-fold cross validation

v. training data (# of hand-labeled songs) • I want the best model for the least data • 5-fold cross validation • For each fold: random grab of n songs from training set, measure average accuracy across syllables

v. training data (# of hand-labeled songs) • I want the best model for the least data • 5-fold cross validation • For each fold: random grab of n songs from training set, measure average accuracy across syllables • Fit learning curves: • c = accuracy at asymptote • = time to asymptote = (1 − −/)

• Using hybrid-vocal- classifier on our data set, I find:
• SVM outperforms k- NN if a radial basis function is used github.com/NickleDave/ML-comparison-birdsong

• Using hybrid-vocal- classifier on our data set, I find:
• A simple convolutional neural net with minimal training data outperforms SVM github.com/NickleDave/ML-comparison-birdsong

• Across birds, convolutional neural net outperforms both SVM and
k-NN github.com/NickleDave/ML-comparison-birdsong

• Segmentation: • Convolutional nets for image segmentation • hybrid
RNN-CNN • segmental RNN

SciPy, Sober lab, BioCompNS • Jonah Queen NickleDave/hybrid-vocal-classifier @nicholdav

hybrid-vocal-classifier SciPy 2017

hybrid-vocal-classifier SciPy 2017

More Decks by David Nicholson

Other Decks in How-to & DIY

Featured

Transcript