Preventing churn like a bandit

Preventing churn like a bandit Gerben Oostra – Machine Learning
Engineer @ BigData Republic

Churn is a revenue drain Preventing churn like a bandit
* 25% & 75% quantiles, depends on business size. https://www.profitwell.com/blog/average-revenue-churn-rate-benchmarks Companies lose between 2% and 16%* of revenue every month due to churn.

3 issues within churn prevention Preventing churn like a bandit
1: Models predicting churn 2: Assuming correlation is causation 3: Select the best predicted treatment

Running example: Telco (internet) Preventing churn like a bandit Telco
Inc Provides subscriptions with TV & Internet & Mobile Has a lot of diverse customers

Preventing churn like a bandit Issue 1: Models predict churn
Churn Classification Model Do nothing Direct mail Telemarketing Door 2 door Choose between Features Predicted churn propensity Policy Train Historical data

Preventing churn like a bandit Issue 1: It can’t be
that bad, right? Retrain Churn Classification Model Low scores Do nothing Churners High scores We learn from non-actions Non-churners Low scores

Preventing churn like a bandit Issue 1: It can’t be
that bad, right? Retrain Churn Classification Model High scores Telemarketing Persuadable Low scores Lost causes High scores We learn to keep calling lost causes We learn to stop calling the persuadable

Action P(churn | A) P(retention | A) Uplift 0.6 0.4
0 @ 0.65 0.35 -0.05 TM 0.4 0.6 0.2 D2D 0.35 0.65 0.25 Classification Model Direct mail Telemarketing Door 2 door Features Predictions Solution 1a: Predict uplift Policy Do nothing Preventing churn like a bandit

0 @ 0.65 0.35 -0.05 TM 0.4 0.6 0.2 D2D 0.35 0.65 0.25 Classification Model Direct mail Telemarketing Door 2 door Features Predictions Solution 1a: Predict uplift Policy The default Preventing churn like a bandit

0 @ 0.65 0.35 -0.05 TM 0.4 0.6 0.2 D2D 0.35 0.65 0.25 Classification Model Direct mail Telemarketing Door 2 door Features Predictions Solution 1a: Predict uplift Policy Uplift(A,x) = P(retention | A, x) – P(retention | , x) The default Preventing churn like a bandit

0 @ 0.65 0.35 -0.05 TM 0.4 0.6 0.2 D2D 0.35 0.65 0.25 Classification Model Direct mail Telemarketing Door 2 door Features Predictions Solution 1a: Predict uplift Policy Uplift(A,x) = P(retention | A, x) – P(retention | , x) Transformed Outcome Trick Regression model with labels (0, -2, 2) Athey, S., & Imbens, G. W. (2015). Machine learning methods for estimating heterogeneous causal effects. stat, 1050(5). Uplift Regression Model The default Preventing churn like a bandit

Preventing churn like a bandit CLV € 300 Cost €
0.- € 0.25 € 5.- € 20.- Solution 1b : Base policy on economic result Action Uplift 0 @ -0.05 TM 0.20 D2D 0.25 × Result € 0.- € -15.25 € 55,- € 55,- − = Model Policy

Preventing churn like a bandit How to do predict treatment
effect (uplift) 1. Transformed outcome 2. Dealing with counterfactuals

Preventing churn like a bandit Predicting uplift: The Transformed Outcome
Retained? Y Treated W==1 Control W==0 Yes 2 -2 No 0 0 ∗ = − 1 − Y = retained? (1 or 0) W = treated? (1 or 0) p = P(W==1) treatment policy When: p = 0.5 Athey, S., & Imbens, G. W. (2015). Machine learning methods for estimating heterogeneous causal effects. stat, 1050(5).

Preventing churn like a bandit Minimizing RMSE results in Uplift
Retained? Treated Control Yes 2 -2 No 0 0 2 RMSE Lost causes -2 0 0 Uplift:

Retained? Treated Control Yes 2 -2 No 0 0 RMSE Persuadable -2 0 1 Uplift: 2

Retained? Treated Control Yes 2 -2 No 0 0 RMSE Sleeping dogs -2 0 -1 Uplift: 2

Retained? Treated Control Yes 2 -2 No 0 0 RMSE Sure causes -2 0 0 Uplift: 2

Preventing churn like a bandit Transformed outcome for multiple treatments
Retained? @ TM DM Yes -2 2 2 2 No 0 0 0 0 Churn prevention Features @ TM D2D 1 1 - - - 2 0 - - - 3 - - - 1 4 - 0 - - 5 - - 1 - Retained? Features @ TM D2D 1 -2 -2 -2 2 0 0 0 3 - - 2 4 0 - - 5 - 2 - Transformed outcome = control @, TM, D2D = Treated Retained? Control Treated Yes -2 2 No 0 0

Preventing churn like a bandit Model setup for Uplift Churn
prevention Action Uplift @ -0.15 TM 0.2 D2D 0.26 Regression Model (Re)Train Predictions Features Features labels Features @ TM D2D 1 -2 -2 -2 2 0 0 0 3 - - 2 4 0 - - 5 - 2 -

Preventing churn like a bandit Handling counterfactuals with masked feedback
Features 1 2 3 4 5 Features @ TM D2D ✕ ✕ ✕ : Loss: RMSE TM? @? D2D? TM @ D2D @ TM D2D -2 -2 -2 0 0 0 0 0 2 0 0 0 0 2 0 labels

Features 1 2 3 4 5 @ TM D2D 1 1 1 1 1 1 0 0 1 1 0 0 0 1 0 Features @ TM D2D ✕ ✕ ✕ : Loss: RMSE TM? @? D2D? TM @ D2D @ TM D2D -2 -2 -2 0 0 0 0 0 2 0 0 0 0 2 0 labels mask

Features 1 2 3 4 5 @ TM D2D 1 1 1 1 1 1 0 0 1 1 0 0 0 1 0 Features @ TM D2D ✕ ✕ ✕ : Loss: RMSE TM? @? D2D? TM @ D2D Predicted uplifts @ TM D2D -0.15 0.2 0.25 @ TM D2D -2 -2 -2 0 0 0 0 0 2 0 0 0 0 2 0 labels mask

Preventing churn like a bandit Inference with core network Features
1 2 3 4 5 Features @ TM D2D Action Uplift @ -0.15 TM 0.2 D2D 0.25 Predictions

Preventing churn like a bandit Issue 1: Predicting churn Churn
Classification Model Policy (Re)Train Features Historical data Labels Features Predicted churn propensity Choose Action

Preventing churn like a bandit Solution 1: Contextual bandit maximizing
revenue Uplift regression Model x CLV – Cost (Re)Train Predicted uplift Action based on value Feedback (churned/retained) Features Historical data Transformed outcome Labels Features

Preventing churn like a bandit x CLV – Cost Predicted
uplift Action based on value Feedback (churned/retained) Features Predict @ TM D2D → (Re)Train @ TM D2D → ✕ ✕ ✕ TM? @? D2D? TM @ D2D Transformed Outcome Labels Masks Features Solution 1: Contextual bandit maximizing revenue

Preventing churn like a bandit Issue 2: Assuming correlation is
causation T R Retention Treatment Features Causal graph Correlation has predictive power We need causation for prescriptive power We only observe correlations Here correlation is causation

Preventing churn like a bandit Issue 2: Assuming correlation is
causation T R Retention Treatment Features Historically: Past retention campaigns Future: Our model Causal graph Correlation has predictive power We need causation for prescriptive power We only observe correlations Correlation is not causation

Preventing churn like a bandit Issue 2: Removing `feature –
treatment` bias T R Retention Treatment Features To be removed Causal graph 1. Random trials 2. Change features • Feature selection • Encoding 3. Causal inference + Simple & straightforward - Not using historical data - Expensive experiments - No feedback loop - Only possible once: Created model recreates bias - Restricting predictive power - Harder to model - Models actual situation

Preventing churn like a bandit Solution 2: Causal inference (Inverse
propensity weighting) T R Retained? Treatment Features Propensity model to learn the correlation Propensity Model Age / location / .. P(T | x) @ TM - = 1 | Age / location / .. @ TM 1 2 - Propensity Model | → Weight samples inverse to propensity

Preventing churn like a bandit How to do propensity weighting
1. Calibrate propensities 2. Propensity clipping

Preventing churn like a bandit Propensity weighting: calibration

Preventing churn like a bandit Propensity weighting : clipping -
= 1 | lim 6 7|8 →9 - → ∞ 1. Clip value 2. Trim dataset ; 7|8 = 0.95, 0.05, | ′- = F | GH, 0.05 ≤ ≤ 0.95 0, > 0.95 0, < 0.05

Preventing churn like a bandit Solution 2: Causal model with
inverse propensity weighting x CLV – Cost Predicted uplift Best Action Feedback (churned/retained) Features Predict @ TM D2D → (Re)Train @ TM D2D → ✕ ✕ ✕ TM? @? D2D? TM @ D2D Transformed Outcome Labels Masks - = 1 | Propensity Model | → Weights Features

Preventing churn like a bandit Issue 3: Select the best
predicted treatment Action Result € 0.- @ € -15.25 TM € 55,- D2D € 55,- Greedy Policy 100% Exploit limits feedback • We learn if selection worked • We never learn alternatives Feedback (Future) training data

Preventing churn like a bandit Solution 3: Balance exploration &
exploitation 1. Determine optimal treatment for each customer 2. Determine which customers to treat

Preventing churn like a bandit Solution 3a: Thompson sampling for
treatment assignment Action Uplift @ -0.05 TM 0.2 D2D 0.25 Uplift P(Uplift | T) @ TM D2D Most likely Uplift Underlying distributions of Uplift Use sample as prediction Action Uplift Result @ 0.5 € 149.75 TM 0.1 € 25 D2D 0.2 € 40 Action Uplift Result @ -0.5 €-150.25 TM 0.3 € 85 D2D 0.15 € 25 Action Uplift Result @ -0.5 €-150.25 TM 0.3 € 85 D2D 0.15 € 25 Action Uplift Result @ -0.5 €-150.25 TM 0.3 € 85 D2D 0.15 € 25 Bayesian modelling

Preventing churn like a bandit Solution 3b: Perturbed Results for
customer selection # customers Desc Result Change x% to random value from all results X % Selected Budget Every bin has random selection of full range

Preventing churn like a bandit Solution 3: Balance exploration and
exploitation x CLV – Cost Predicted uplift Best Action Feedback (churned/retained) Features Predict @ TM D2D → (Re)Train @ TM D2D → ✕ ✕ ✕ TM? @? D2D? TM @ D2D Transformed Outcome Labels Masks - = 1 | Propensity Model | → Weights Features

Preventing churn like a bandit Solution 3: Balance exploration and
exploitation Thompson Sampling Feedback (churned/retained) Features x CLV – Cost - = 1 | Propensity Model | → Action Uplift @ -0.05 TM 0.2 D2D 0.25 @ TM D2D Sampled prediction Perturbate Results (Re)Train @ TM D2D → ✕ ✕ ✕ TM? @? D2D? TM @ D2D Predict @ TM D2D → Transformed Outcome Weights Labels Masks Predicted uplift Best Action Features

Preventing churn like a bandit Contextual bandit with Uplift modelling,
causal inference & Thompson sampling Thompson Sampling Feedback (churned/retained) Features x CLV – Cost - = 1 | Propensity Model | → Action Uplift @ -0.05 TM 0.2 D2D 0.25 @ TM D2D Sampled prediction Perturbate Results (Re)Train @ TM D2D → ✕ ✕ ✕ TM? @? D2D? TM @ D2D Predict @ TM D2D → Transformed Outcome Weights Labels Masks Predicted uplift Best Action Features

Phone +31 (0)168 479294 Email [email protected] Coltbaan 4C, 3439 NG
Nieuwegein, The Netherlands Address Questions?

Preventing churn like a bandit

Preventing churn like a bandit

More Decks by Gerben Oostra

Other Decks in Technology

Featured

Transcript