Predicting the effectiveness of churn prevention measures

Predicting the Effectiveness of Churn Prevention Measures Gerben Oostra –
Machine Learning Engineer @ BigData Republic

Churn is a revenue drain Churn prevention * 25% &
75% quantiles, depends on business size. https://www.profitwell.com/blog/average-revenue-churn-rate-benchmarks Companies lose between 2% and 16%* of revenue every month due to churn.

3 issues within churn prevention Churn prevention 1: Models predicting
churn 2: Assuming correlation is causation 3: Select the best predicted treatment

Running example: Telco (internet) Churn prevention Telco Inc Provides subscriptions
with TV & Internet & Mobile Has a lot of diverse customers

Churn prevention Issue 1: Models predict churn Churn Classification Model
Do nothing Direct mail Telemarketing Door 2 door Choose between Features Predicted churn propensity Policy Train Historical data

Action P(churn | A) P(retention | A) Uplift 0.6 0.4
0 @ 0.65 0.35 -0.05 TM 0.4 0.6 0.2 D2D 0.35 0.65 0.25 Classification Model Direct mail Telemarketing Door 2 door Features Predictions Solution 1: Predict uplift Policy Uplift(A,x) = P(retention | A, x) – P(retention | , x) Transformed Outcome Trick Regression model with labels (0, -2, 2) Athey, S., & Imbens, G. W. (2015). Machine learning methods for estimating heterogeneous causal effects. stat, 1050(5). Do nothing Uplift Regression Model The default

Churn prevention CLV € 300 Cost € 0.- € 0.25
€ 5.- € 20.- Solution 1 : Base policy on economic result Action Uplift 0 @ -0.05 TM 0.20 D2D 0.25 × Result € 0.- € -15.25 € 55,- € 55,- − = Model Policy

Churn prevention Issue 2: Assuming correlation is causation T R
Retention Treatment Features Causal graph Correlation has predictive power We need causation for prescriptive power We only observe correlations Here correlation is causation

Churn prevention Issue 2: Assuming correlation is causation T R
Retention Treatment Features Historically: Past retention campaigns Future: Our model Causal graph Correlation has predictive power We need causation for prescriptive power We only observe correlations Correlation is not causation

Churn prevention Solution 2: Causal model (Inverse propensity weighting) T
R Retained? Treatment Features Propensity model to learn the correlation Propensity Model Age / location / .. P(T | x) @ TM $ = 1 | Age / location / .. @ TM * + $ Propensity Model | → Weight samples inverse to propensity

Churn prevention Issue 3: Select the best predicted treatment Action
Result € 0.- @ € -15.25 TM € 55,- D2D € 55,- Greedy Policy 100% Exploit limits feedback • We learn if selection worked • We never learn alternatives Feedback (Future) training data

Churn prevention Solution 3: Balance exploration & exploitation (Thompson sampling)
Action Uplift @ -0.05 TM 0.2 D2D 0.25 Uplift P(Uplift | T) @ TM D2D Most likely Uplift Underlying distributions of Uplift Use sample as prediction Action Uplift Result @ 0.5 € 149.75 TM 0.1 € 25 D2D 0.2 € 40 Action Uplift Result @ -0.5 €-150.25 TM 0.3 € 85 D2D 0.15 € 25 Action Uplift Result @ -0.5 €-150.25 TM 0.3 € 85 D2D 0.15 € 25 Action Uplift Result @ -0.5 €-150.25 TM 0.3 € 85 D2D 0.15 € 25 Bayesian modelling

Churn prevention Preventing churn with 3 complementary techniques Model Policy
(Re)Train 1: Actionable predictions: • Predict Uplift • Policy on business value 2: Causal model: • Inverse propensity weighting 3: Balance exploration & exploitation: • Thompson sampling • Bayesian modelling Predictions Action Feedback (churned/retained) Features Historical data Propensity Model Weights Transformed outcome Labels Features

Phone +31 (0)168 479294 Email [email protected] Coltbaan 4C, 3439 NG
Nieuwegein, The Netherlands Address Questions?

Predicting the effectiveness of churn preventio...

Predicting the effectiveness of churn prevention measures

Gerben Oostra

More Decks by Gerben Oostra

Other Decks in Programming

Featured

Transcript

Predicting the Effectiveness of Churn Prevention Measures Gerben Oostra –

Churn is a revenue drain Churn prevention * 25% &

3 issues within churn prevention Churn prevention 1: Models predicting

Running example: Telco (internet) Churn prevention Telco Inc Provides subscriptions

Churn prevention Issue 1: Models predict churn Churn Classification Model

Action P(churn | A) P(retention | A) Uplift 0.6 0.4

Churn prevention CLV € 300 Cost € 0.- € 0.25

Churn prevention Issue 2: Assuming correlation is causation T R

Churn prevention Issue 2: Assuming correlation is causation T R

Churn prevention Solution 2: Causal model (Inverse propensity weighting) T

Churn prevention Issue 3: Select the best predicted treatment Action

Churn prevention Solution 3: Balance exploration & exploitation (Thompson sampling)

Churn prevention Preventing churn with 3 complementary techniques Model Policy

Phone +31 (0)168 479294 Email [email protected] Coltbaan 4C, 3439 NG