Libra Report: Scaling A/B Tests for Real Services

2019 DevDay Libra Report: Scaling A/B Tests for Real Services
> Kwangyeol Ryu > LINE Data Science Team1 Data Engineer

Who Am I Kwangyeol Ryu | Data Engineer Building a
fast data analytics platform to increase Data Scientists’ productivity Data Science Team1 | Data Labs, LINE corporation

Agenda > How we do AB Tests at LINE >
Libra Report > Summary

AB Tests in One Slide > KDD 2019 Tutorial, Challenges,
Best Practices and Pitfalls in Evaluating Results of Online Controlled Experiments Treatment Collect Control Collect Randomly Assign to Two Variants User Come to the Experiment New Feature X Compare Metrics and Determine the Winner AB tests are the best scientific way to prove causality!

AB Tests at LINE for Real Services Treatment Collect Control
Collect Randomly Assign Into Two Variants User Coming Into Experiment New Feature X Compare Metrics and Determine the Winner Daily 100s Million Active Users Countries, OSs, App Versions Lots of New Features Large-Scale   Log Data Complex Metrics

AB Tests at LINE for Real Services Treatment Collect Control
Collect Randomly Assign Into Two Variants User Coming Into Experiment New Feature X Compare Metrics and Determine the Winner Daily 100s Million Active Users Countries, OSs, App Versions Lots of New Features Large-Scale   Log Data Complex Metrics Human Errors Ad-Hoc Data Processing Ad-Hoc Test Dashboard Ad-Hoc Metrics

Issues From Data Scientists > Test conditions were delivered via
wikis, emails, slack channels, etc. > The codes are hard to reuse and share. > Data scientists put lots of effort into controlling the overall test.

AB Tests Systems at LINE Treatment Collect Control Collect Randomly
Assign Into Two Variants User Coming Into Experiment New Feature X Compare Metrics and Determine the Winner Daily 100s Million Active Users Countries, OSs, App Versions Lots of New Features Large-Scale   Log Data Complex Metrics

AB Tests Systems at LINE Central Dogma > Update user
app’s configurations Libra > Manage the result of AB test design Libra Report > Manage Test Metrics > Generate Test Dashboards > https://line.github.io/centraldogma/ > DevDay 2018, LINE AB Test Standardization with Our Own Toolset

How Libra Report Works Click Stream Service Logs > Conditions
> Logics Dashboard DB > Create Dynamic DAGs API Dashboard User DB > AB Test Configurations > Key Metric Definitions (Hive SQL, Rscript) Data Metadata Orchestration

Libra Report: Metadata

Libra Report: Orchestration Apache Airflow > Airflow Dynamic DAG •
Generated from metadata > Manage Complex Data Dependancies > Ensure Filter-out Privacy data • Check user’s privacy policy agreement > Calculate Various Metrics > Calculate Basic Statistics by default • p-value, lift, etc > Easy to backfill when logic changes

Libra Report: Dashboard

Summary > Using AB Tests Systems we can do more
AB tests! > Through AB Tests, we can ensure LINE services make user value! > Data scientists are expensive to hire. They can do more with proper tools!

Thank You

Libra Report: Scaling A/B Tests for Real Services

Libra Report: Scaling A/B Tests for Real Services

LINE DevDay 2019

More Decks by LINE DevDay 2019

Other Decks in Technology

Featured

Transcript

2019 DevDay Libra Report: Scaling A/B Tests for Real Services

Who Am I Kwangyeol Ryu | Data Engineer Building a

Agenda > How we do AB Tests at LINE >

AB Tests in One Slide > KDD 2019 Tutorial, Challenges,

AB Tests at LINE for Real Services Treatment Collect Control

AB Tests at LINE for Real Services Treatment Collect Control

Issues From Data Scientists > Test conditions were delivered via

AB Tests Systems at LINE Treatment Collect Control Collect Randomly

AB Tests Systems at LINE Central Dogma > Update user

How Libra Report Works Click Stream Service Logs > Conditions

Libra Report: Metadata

Libra Report: Orchestration Apache Airflow > Airflow Dynamic DAG •

Libra Report: Dashboard

Summary > Using AB Tests Systems we can do more

Thank You