Upgrade to Pro — share decks privately, control downloads, hide ads and more …

A/B Testing

Addam Hardy
February 05, 2020

A/B Testing

Addam Hardy

February 05, 2020
Tweet

More Decks by Addam Hardy

Other Decks in Research

Transcript

  1. A B

  2. :: also known as “Split Testing” a method of measuring

    the outcome of two versions of an action against each other to determine which one performs better. What is A/B Testing?
  3. Specifically in the world of websites and applications, A/B testing

    is an experiment where two or more variants of a page are shown to users at random, and statistical analysis is used to determine which variation performs better for a given conversion goal. What is A/B Testing?
  4. To A/B test UI, we randomly send half of the

    users to the control UI and the other half of random users to the variation UI What is A/B Testing?
  5. Key Detail randomly segmenting users to either the control or

    the variation is vitally important. it must be actually random
  6. FAQ no, but you are significantly increasing volume of data

    you will need in order to see a statistically significant change in the data do the population sizes have to be 50/50?
  7. A/B Testing is also a form of statistical analysis called

    hypothesis testing. What is hypothesis testing? What is A/B Testing?
  8. What is A/B Testing? What is hypothesis testing? Hypothesis testing

    is the use of statistics to determine the probability that a given hypothesis is true. The usual process of hypothesis testing consists of four steps. 1. Formulate the null hypothesis 2. Identify a test statistic that can be used to assess the truth of they null hypothesis 3. Compute a p-value (probability that a test statistic at least as signi fi cant as the one observed would be obtained assuming the null hypothesis is true) 4. Compare p-value to acceptable signi fi cance value (a). if p <= a, null hypothesis is ruled out and alternative hypothesis is value
  9. what is a null hypothesis? the hypothesis that given statistical

    significance you would get the same results not changing the test as you would if you changed it and that any variation is from random chance. basically: nothing special is happening here What is A/B Testing?
  10. what is the alternative hypothesis? the contrary to the null

    hypothesis. if proven, the results are from a real effect and not purely from random chance. basically: what we changed made an real difference in the results What is A/B Testing?
  11. What does this even mean? Test is A Control is

    B A/B Testing == Test/Control Testing Test & Control
  12. FAQ Yes. That is called multivariate testing and often referred

    to as A/B/n testing. However you are significantly increasing the complexity of the math if you do this. A/B is two things. Can I test more than two
  13. Control Variation Add to Cart hypothesis: a purchase CTA with

    a green background will have higher engagement than one with a blue background. Add to Cart
  14. Control Variation Add to Cart Add to Cart hypothesis: a

    purchase CTA with a green background will have higher engagement than one with a blue background.
  15. Control Variation Lift Add to Cart Add to Cart hypothesis:

    a purchase CTA with a green background will have higher engagement than one with a blue background.
  16. Control Variation Lift Add to Cart Add to Cart hypothesis:

    a purchase CTA with a green background will have higher engagement than one with a blue background. PROVEN TRUE! 😄
  17. Control Variation Lift Add to Cart Add to Cart hypothesis:

    a purchase CTA with a green background will have higher engagement than one with a blue background. PROVEN TRUE?? or is it? 😰
  18. Key Detail statistical signi fi cance is important If you

    make an assertion with statistically insigni fi cant data, you are not proving anything. In fact, you could be communicating something that makes the situation worse than it is without change.
  19. what makes something statistically significant? Statistical signi fi cance is

    the likelihood that the difference in measurements between a given variation and the baseline is not due to random chance. A result of an experiment is said to have statistical signi fi cance, or be statistically signi fi cant, if it is likely not caused by chance for a given statistical signi fi cance level. Statistical Significance
  20. If you rolled a double six, would you bet $1000

    on rolling it again on the next try?
  21. 1/6 1/6 If you rolled a double six, would you

    bet $1000 on rolling it again on the next try?
  22. 1/6 1/6 1/6 x 1/6 = 1/36 chance If you

    rolled a double six, would you bet $1000 on rolling it again on the next try?
  23. 1/6 1/6 1/6 x 1/6 = 1/36 chance If you

    rolled a double six, would you bet $1000 on rolling it again on the next try?
  24. no, you wouldn’t. so why would you tell a client

    75% of users were female if you only had 20 page views?
  25. that’s great addam… but i’m not teaching statistics at university

    for a reason. what am i supposed to do? Statistical Significance
  26. could you repeat that? make a hypothesis i think X

    will do better than Y test one thing at a time i want to test changing the text and the color, but I’ll start with color. verify the samples are random ensure there’s no confirmation bias watch your sample size use the a/b test calculators online to determine optimal sample size verify statistical significance make sure you are not using stats to tell mathematically accurate lies iterate on the next hypothesis! it doesn’t stop after you’ve proven your first test. Use those results to make a new hypothesis and iterate.