at Deliveroo Learnings What we learnt from those experiments Best Practices Best Practices that you should know about before running your own experiments
multiple versions of a feature against each other • Both versions are live at the same time to different users ◦ Compare apples to apples • Analyse results and release best performing variant
to do some important work when you open the app • Fetching the user’s location • Fetching feature flag values • We have a pretty animation to hide that work
of our most successful experiments of 2021 • Only build as much as you need to be able to run the experiment ◦ We only did this initially on iOS ◦ You need to be confident that the feature works, but maybe those UI tests can wait until later?
was one of our worst experiments of 2021 • Negative results are not necessarily bad • Challenge your assumptions ◦ We almost shipped this without experimenting • Knowing the exact impact of your work is engaging
tracked metrics which can be used for customer insight and building future experiments • Solving a known customer problem is a big win! For example, customer will no longer contact our Care teams to ask to switch tiers
the app experience • Can be frustrating when irrelevant or too frequent How do we feel about pop-ups? Our developer perspective: • Pop-ups / dialogs and banners can be difficult to work with if there are lots of them, especially if using a third-party tools to drive in app-messages • Can be hard to test • Can be hard to make accessible • Already handling permissions / errors using dialogs can result in overlap
promotional pop-ups on the basket screen Hypothesis: Removing these pop-ups will provide insight on how valuable they are in increasing Plus subscriptions
of the features that are rolled out • Learnings from one experiment often informs the next one • Data driven decision making • Apples to apples comparison
experiment • Power analysis ◦ How long does our experiment need to run for? • More effort does not equal better results • Avoid overlapping interaction between experiments • Cleanup after experiment finishes
experiments/flags are enabled for a given user • Isolated systems will have the same outcomes for every user, feature flag and experiment. • When increasing and lowering rollout fractions, the same users will be included and excluded at each fraction. • Other tools are available!