Error Analysis in Machine Learning

ERROR ANALYSIS WEEK 3 COURSE 2 30 MAY 2019

OUTLINE • ERROR ANALYSIS • TRANSFER LEARNING • MULTI TASK
LEARNING • END-TO-END LEARNING

WHAT IS ERROR ANALYSIS? • Is the process of manually
examining dev set examples that your algorithm misclassified. • Will help you understand the cause of the problem. • This will give you a better insight on what to do next.

CONT. Example. • Look at your dev example of your
cat classifier App • So, the question is should you try make your cat classifier do better on dogs?

CONT. ERROR ANALYSIS: • Get a sample of 100 mislabeled
dev set examples. • Look at these examples manually, and count what fraction of them are dog images. OUTPUT: • 5 % of images are dogs. • 50 % of images are dogs.

EVALUATE MULTIPLE IDEAS IN PARALLEL • Your team may have
several ideas for improving the cat detector: • Fix picture of dogs being recognized as cats • Fix great cats (lion, Panther, etc. . . ) being misrecognized • Improve performance on blurry images

CLEANING UP MISLABELED DATA • During error analysis, you might
notice that some examples in your dev set are mislabeled ( the pictures were already mislabeled by a human ). • Should you correct the labels in your dev set ?

CONT. Over all error: 10 % 2% Error due to
incorrect Label: 0.6% 0.6% Error due to other causes: 9.4% 1.4%

CONT. Additional guideline to consider. • Make sure you dev
and test sets come from same distribution • Consider examining examples your algorithm got right as well as once it got wrong.

BUILD YOUR FIRS SYSTEM QUICKLY & ITERATE • If you
are tackling a brand Machine learning problem, it is advised that to build your first system quickly and then iterate. • Speech recognition example. • Noisy background (Café noise, car noise) •Accented speech •Far from microphone •Stuttering, etc.

CONT. Build your system quickly step 1: set up dev/test
set matric (set up target). step 2: Build your initial system quickly. step 3: Use bias/Variance analysis and error analysis to prioritize next steps.

TRAINING AND TESTING ON DIFFERENT DIST. • Cat App Example
•What can you do ? 10,000 images 200,000 images You care about this data

CONT. Option 1 : •Put all data sets together •Randomly
shuffle them and divide to train, dev and test • Advantage is all your sets come from the same distribution. • Disadvantage is most of your dev set images come from the web data Train 205,000 images dev 2.5k test 2.5k

CONT. Option 2: • put all the data that came
from the web to train sets or you could add some from mobile app. •Make your dev and test sets come from the mobile app • Disadvantage is all your train set now come from the web Train 205k dev 2.5k test 2.5k 205k from web, 5k from mobile app All from mobile app

BIAS AND VARIANCE WITH MISMATCHED DATA DIST. • Estimating bias
and variance of your learning algorithm will also help you prioritize on what to next. • Analyzing bias and variance change when your training sets comes from different distribution than dev/test sets will help you identify your error. • Cat classifier App Example •Training error – 1% •Dev error – 10%

CONT. • You can conclude two problems. 1. May be
you have large variance problem 2. Distribution problem and your algorithm is just doing right • To identify the problem introduce Training-Dev set. •Training-Dev set : Same distribution as training set, but not used for training Test set Dev set Training/Dev set Training set

CONT. • Lets say : •Training error : 1 %
1% •Train/Dev error : 9% 1.5% •Dev set error : 10% 10% Dev set Training/Dev set Training set This shows high variance problem This shows high data mismatch problem Test set

ADDRESSING DATA MISMATCH PROBLEM • Carryout manual error analysis to
try to understand difference between training and dev/test sets. • Make training data more similar; or collect more data similar to dev/test sets.

TRANSFER LEARNING • Sometimes you can take knowledge that some
neural network has learned from one task and apply that knowledge to tackle separate task. This is called Transfer Learning. • Image recognition Example (X,Y) Y ˆ (w,b) Radiology diagnosis Image recognition Then, Swap new input (X,Y) from radiology and train you NN

WHEN TO USE TL? • Task A and Task B
has the same input X. • You have a lot more data for Task A than task B.

MULTI TASK LEARNING • Is to make your single neural
network to do several things simultaneously at the same time. • Autonomous driving Example •Features you need to identify: •Pedestrian •Cars •Stop sign •Traffic Lights

CONT. This makes sense usually if data you have for
each task is quite similar [4,1] vector

END-TO-END DEEP LEARNING • Most learning system require multiple stages
of processing, In End- to-End deep learning; it takes all those multiple stages and replace it with single neural network. • Speech recognition Example Audio Features Phonemes Words Transcript X Y MFCC ˆ Traditional DL: Audio Transcript Y ˆ X E2E DL:

CONT. Challenges of E2E DL: • Requires lots of data
before it works well. If you have smaller data traditional pipelines works better. • If your data is not big enough, you could take some stages to your E2E DL pipeline. (i.e. audio > features > transcript) ADV: • Less hand designing components required

THANK YOU

Error Analysis in Machine Learning

Error Analysis in Machine Learning

Yohannis K Telila

More Decks by Yohannis K Telila

Other Decks in Education

Featured

Transcript

ERROR ANALYSIS WEEK 3 COURSE 2 30 MAY 2019

OUTLINE • ERROR ANALYSIS • TRANSFER LEARNING • MULTI TASK

WHAT IS ERROR ANALYSIS? • Is the process of manually

CONT. Example. • Look at your dev example of your

CONT. ERROR ANALYSIS: • Get a sample of 100 mislabeled

EVALUATE MULTIPLE IDEAS IN PARALLEL • Your team may have

CONT.

CLEANING UP MISLABELED DATA • During error analysis, you might

CONT. Over all error: 10 % 2% Error due to

CONT. Additional guideline to consider. • Make sure you dev

BUILD YOUR FIRS SYSTEM QUICKLY & ITERATE • If you

CONT. Build your system quickly step 1: set up dev/test

TRAINING AND TESTING ON DIFFERENT DIST. • Cat App Example

CONT. Option 1 : •Put all data sets together •Randomly

CONT. Option 2: • put all the data that came

BIAS AND VARIANCE WITH MISMATCHED DATA DIST. • Estimating bias

CONT. • You can conclude two problems. 1. May be

CONT. • Lets say : •Training error : 1 %

ADDRESSING DATA MISMATCH PROBLEM • Carryout manual error analysis to

TRANSFER LEARNING • Sometimes you can take knowledge that some

WHEN TO USE TL? • Task A and Task B

MULTI TASK LEARNING • Is to make your single neural

CONT. This makes sense usually if data you have for

END-TO-END DEEP LEARNING • Most learning system require multiple stages

CONT. Challenges of E2E DL: • Requires lots of data

THANK YOU