Interpreting Machine Learning Models: Why and How!

OmaymaS
April 06, 2019


Invited talk at SatRday Johannesburg #satRdayJoburg
https://joburg2019.satrdays.org/


Transcript

  1. 2.
  2. 10.
  3. 13.

    “[In Idaho], the state declined to disclose the formula it was using, saying that its math qualified as a TRADE SECRET.”
  4. 14.
  5. 16.

    WHAT ELSE? I'm In to Connect and Serve* “AUTOMATED REDACTION, TRANSCRIPTION, REPORTING”
  6. 17.
  7. 18.

    “Amazon’s system TAUGHT ITSELF that male candidates were preferable. It penalized resumes that included the word ‘women’s,’ as in ‘women’s chess club captain.’ And it downgraded graduates of two all-women’s colleges, according to people familiar with the matter. They did not specify the names of the schools.”
  8. 19.

    “Amazon’s system TAUGHT ITSELF that male candidates were preferable. It penalized resumes that included the word ‘women’s,’ as in ‘women’s chess club captain.’ And it downgraded graduates of two all-women’s colleges, according to people familiar with the matter. They did not specify the names of the schools.” LEARNED FROM HUMANS
  9. 21.

    IT IS HUMANS WHO COLLECT/LABEL DATA, WRITE ALGORITHMS, DEFINE METRICS. BIAS IN: - REPRESENTATION - DISTRIBUTION - LABELS AND MORE…
  10. 22.

    IT IS HUMANS WHO DEFINE METRICS, WRITE ALGORITHMS, COLLECT/LABEL DATA: - TRAIN/TEST SPLIT - FEATURES/PROXIES - BLACK-BOX MODELS AND MORE…
  11. 23.

    IT IS HUMANS WHO COLLECT/LABEL DATA, DEFINE METRICS, WRITE ALGORITHMS: - WHAT IS THE IMPACT OF DIFFERENT ERROR TYPES ON DIFFERENT GROUPS? - WHAT DO YOU OPTIMIZE FOR?
  12. 24.

    “Practitioners consistently overestimate their model’s accuracy, propagate feedback loops, or fail to notice data leaks.” “Why Should I Trust You?”: Explaining the Predictions of Any Classifier, https://arxiv.org/pdf/1602.04938.pdf
  13. 29.
  14. 32.

    1- Select a point to explain (red). LIME (Tabular Data). Based on an example in the “Interpretable Machine Learning” book by Christoph Molnar.
  15. 33.

    2- Sample data points. LIME (Tabular Data). Based on an example in the “Interpretable Machine Learning” book by Christoph Molnar.
  16. 34.

    3- Weight points according to their proximity to the selected point. LIME (Tabular Data). Based on an example in the “Interpretable Machine Learning” book by Christoph Molnar.
  17. 35.

    4- Train a weighted, interpretable local model. LIME (Tabular Data). Based on an example in the “Interpretable Machine Learning” book by Christoph Molnar.
  18. 36.

    5- Explain the black-box model prediction using the local model. LIME (Tabular Data). Based on an example in the “Interpretable Machine Learning” book by Christoph Molnar.
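The five steps above can be sketched in a few lines of plain R. This is a toy illustration, not the lime package's actual implementation: `black_box()`, the sampling distribution, and the Gaussian kernel width are all made-up stand-ins.

```r
## Toy LIME sketch for tabular data (base R only).
## `black_box` is a hypothetical stand-in for a fitted model's predict().
set.seed(42)
black_box <- function(d) d$a^2 + sin(d$b)

## 1- select a point to explain
point <- data.frame(a = 1, b = 2)

## 2- sample data points
n <- 500
samples <- data.frame(a = rnorm(n, sd = 2), b = rnorm(n, sd = 2))

## 3- weight points by proximity to the selected point (Gaussian kernel)
dist2 <- (samples$a - point$a)^2 + (samples$b - point$b)^2
w <- exp(-dist2 / 2)

## 4- train a weighted, interpretable local model
samples$y <- black_box(samples)
local_model <- lm(y ~ a + b, data = samples, weights = w)

## 5- the local coefficients explain the black-box prediction near `point`
coef(local_model)
```

The coefficients of `local_model` then play the role of the per-feature weights that lime plots for each explained observation.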
  19. 40.

    set.seed(5658)

    ## load libraries
    library(caret)
    library(lime)

    ## partition the data
    intrain <- createDataPartition(y = iris$Species, p = 0.8, list = F)

    ## create train and test data
    train_data <- iris[intrain, ]
    test_data <- iris[-intrain, ]

    ## train Random Forest model on train_data
    model <- train(x = train_data[, 1:4], y = train_data[, 5], method = 'rf')

    TRAIN
  20. 41.

    set.seed(5658)

    ## load libraries
    library(caret)
    library(lime)

    ## partition the data
    intrain <- createDataPartition(y = iris$Species, p = 0.8, list = F)

    ## create train and test data
    train_data <- iris[intrain, ]
    test_data <- iris[-intrain, ]

    ## train Random Forest model on train_data
    model <- train(x = train_data[, 1:4], y = train_data[, 5], method = 'rf')

    ## create an explainer object using train_data
    explainer <- lime(train_data, model)

    EXPLAIN
  21. 42.

    set.seed(5658)

    ## load libraries
    library(caret)
    library(lime)

    ## partition the data
    intrain <- createDataPartition(y = iris$Species, p = 0.8, list = F)

    ## create train and test data
    train_data <- iris[intrain, ]
    test_data <- iris[-intrain, ]

    ## train Random Forest model on train_data
    model <- train(x = train_data[, 1:4], y = train_data[, 5], method = 'rf')

    ## create an explainer object using train_data
    explainer <- lime(train_data, model)

    ## explain new observations in test data
    explanation <- explain(test_data[, 1:4], explainer, n_labels = 1, n_features = 4)

    EXPLAIN
  22. 43.

    set.seed(5658)

    ## load libraries
    library(caret)
    library(lime)

    ## partition the data
    intrain <- createDataPartition(y = iris$Species, p = 0.8, list = F)

    ## create train and test data
    train_data <- iris[intrain, ]
    test_data <- iris[-intrain, ]

    ## train Random Forest model on train_data
    model <- train(x = train_data[, 1:4], y = train_data[, 5], method = 'rf')

    ## create an explainer object using train_data
    explainer <- lime(train_data, model)

    ## explain new observations in test data
    explanation <- explain(test_data[, 1:4], explainer, n_labels = 1, n_features = 4)

    https://github.com/OmaymaS/satRday2019_talk_scripts/blob/master/R/lime_tabular.R
  23. 49.

    LIME (Images), pre-trained ImageNet model. Label: tabby, tabby cat; Probability: 0.29; Explanation Fit: 0.77. Label: Egyptian Cat; Probability: 0.28; Explanation Fit: 0.69.
  24. 50.

    LIME (Images), pre-trained ImageNet model. Label: tabby, tabby cat; Probability: 0.29; Explanation Fit: 0.77. Label: Egyptian Cat; Probability: 0.28; Explanation Fit: 0.69. Type: Supports / Type: Contradicts.
  25. 51.

    LIME (Images). “Why Should I Trust You?”: Explaining the Predictions of Any Classifier, https://arxiv.org/pdf/1602.04938.pdf
  26. 52.

    LIME (Images). “Why Should I Trust You?”: Explaining the Predictions of Any Classifier, https://arxiv.org/pdf/1602.04938.pdf SNOW
  27. 55.

    LIME Pros: - Provides human-friendly explanations. - Gives a fidelity measure. - Can use features other than those used by the black-box model.
  28. 56.

    LIME Pros: - Provides human-friendly explanations. - Gives a fidelity measure. - Can use features other than those used by the original model. Cons: - The definition of proximity is not fully resolved for tabular data. - Instability of explanations.
  29. 57.

    LIME Pros: - Provides human-friendly explanations. - Gives a fidelity measure. - Can use features other than those used by the original model. Cons: - Instability of explanations. - The definition of proximity is not fully resolved for tabular data.
  30. 59.

    SHAPLEY VALUES (coalitional game theory): Explain the difference between the actual prediction and the average prediction of the black-box model.
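This definition can be made concrete with a toy exact computation. Everything here is invented for illustration (the model `f`, the background data `X`, the instance `x0`); real workflows would use something like the iml package, which estimates these values by sampling instead of enumerating coalitions.

```r
## Toy exact Shapley values for a 3-feature model (base R only).
set.seed(1)
f  <- function(x) 2 * x[1] + 3 * x[2] + x[1] * x[3]  # hypothetical black-box
X  <- matrix(rnorm(300), ncol = 3)                   # background data
x0 <- c(1, 2, 3)                                     # instance to explain

## value of a coalition S: average prediction with the features in S
## fixed at the instance's values
v <- function(S) {
  Xs <- X
  for (j in S) Xs[, j] <- x0[j]
  mean(apply(Xs, 1, f))
}

## Shapley value of feature j: weighted marginal contributions of j
## over all coalitions of the remaining two features
phi <- sapply(1:3, function(j) {
  others  <- setdiff(1:3, j)
  subsets <- list(integer(0), others[1], others[2], others)
  sum(sapply(subsets, function(S) {
    s <- length(S)
    factorial(s) * factorial(3 - s - 1) / factorial(3) * (v(c(S, j)) - v(S))
  }))
})

## by construction the values sum to: actual prediction - average prediction
phi
```

Summing `phi` recovers exactly `f(x0)` minus the mean prediction over `X`, which is the fair-distribution property the pros slide below refers to.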
  31. 61.

    ## load libraries
    library(tidyverse)
    library(caret)
    library(iml)

    ## partition the data
    intrain <- createDataPartition(y = bike$cnt, p = 0.9, list = F)

    ## create train and test data
    train_data <- bike[intrain, ]
    test_data <- bike[-intrain, ]
    train_x <- select(train_data, -cnt)
    test_x <- select(test_data, -cnt)

    ## train model
    model <- train(x = train_x, y = train_data$cnt, method = 'rf', ntree = 30, maximise = FALSE)

    TRAIN
  32. 62.

    ## load libraries
    library(tidyverse)
    library(caret)
    library(iml)

    ## partition the data
    intrain <- createDataPartition(y = bike$cnt, p = 0.9, list = F)

    ## create train and test data
    train_data <- bike[intrain, ]
    test_data <- bike[-intrain, ]
    train_x <- select(train_data, -cnt)
    test_x <- select(test_data, -cnt)

    ## train model
    model <- train(x = train_x, y = train_data$cnt, method = 'rf', ntree = 30, maximise = FALSE)

    ## create predictor
    predictor <- Predictor$new(model, data = train_x)

    EXPLAIN
  33. 63.

    ## load libraries
    library(tidyverse)
    library(caret)
    library(iml)

    ## partition the data
    intrain <- createDataPartition(y = bike$cnt, p = 0.9, list = F)

    ## create train and test data
    train_data <- bike[intrain, ]
    test_data <- bike[-intrain, ]
    train_x <- select(train_data, -cnt)
    test_x <- select(test_data, -cnt)

    ## train model
    model <- train(x = train_x, y = train_data$cnt, method = 'rf', ntree = 30, maximise = FALSE)

    ## create predictor
    predictor <- Predictor$new(model, data = train_x)

    ## calculate shapley values for a new instance
    shapley_values <- Shapley$new(predictor, x.interest = test_x[10, ])

    EXPLAIN
  34. 64.

    ## load libraries
    library(tidyverse)
    library(caret)
    library(iml)

    ## partition the data
    intrain <- createDataPartition(y = bike$cnt, p = 0.9, list = F)

    ## create train and test data
    train_data <- bike[intrain, ]
    test_data <- bike[-intrain, ]
    train_x <- select(train_data, -cnt)
    test_x <- select(test_data, -cnt)

    ## train model
    model <- train(x = train_x, y = train_data$cnt, method = 'rf', ntree = 30, maximise = FALSE)

    ## create predictor
    predictor <- Predictor$new(model, data = train_x)

    ## calculate shapley values for a new instance
    shapley_values <- Shapley$new(predictor, x.interest = new_instance)

    https://github.com/OmaymaS/satRday2019_talk_scripts/blob/master/R/shapley_tabular.R
  35. 66.

    SHAPLEY VALUES: The contribution of the temp value (4.416) to the difference between the actual prediction and the mean prediction is the estimated Shapley value (~ -1000).
  36. 68.

    SHAPLEY VALUES Pros: - Solid theory. - The difference between the prediction and the average prediction is fairly distributed among the feature values of the instance.
  37. 69.

    SHAPLEY VALUES Pros: - Solid theory. - The difference between the prediction and the average prediction is fairly distributed among the feature values of the instance. Cons: - Computationally expensive. - Can be misinterpreted. - Uses all features (not ideal for explanations that contain few features).
  38. 70.

    SHAPLEY VALUES Pros: - Solid theory. - The difference between the prediction and the average prediction is fairly distributed among the feature values of the instance. Cons: - Computationally expensive. - Can be misinterpreted. - Uses all features (not ideal for explanations that contain few features).
  39. 73.