[Valera Zakharov] Mobile performance testing at Slack

Presentation from GDG DevFest Ukraine 2018 - the biggest community-driven Google tech conference in the CEE.

Learn more at: https://devfest.gdg.org.ua

There comes a point in a company's evolution when the rush to build all the features as fast as possible subsides and the company realizes that performance should be prioritized too. The CEO publishes a document that says "a Slack client must be as fast as fuck" and the engineering team sets out to fix all the performance bottlenecks. But how does an engineer validate that their improvements actually work? More importantly, how does the team prevent future performance regressions?

Over a year ago, we asked these questions and decided to build a performance testing pipeline that would continuously validate every code change for performance impact. In this talk, I will introduce the basic building blocks of this pipeline and share the lessons learned from building and maintaining this infrastructure.

Google Developers Group Lviv

October 12, 2018

Transcript

  1. How to Build a Performance Testing Pipeline from Scratch (Valera Zakharov)
  2. None
  3. None
  4. None
  5. "A slack client should be fast as fuck!”

  6. Let’s fix some perf bugs

  7. Trends

  8. Alerts

  9. Pre-merge Alerts

  10. Naive Approach: measure the dev version value (execution time, frame metrics, resource usage, anything that can be measured), compare against the baseline (latest master), and alert if they are different.
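
As a rough sketch of the naive check in Kotlin (names are hypothetical, not Slack's actual code):

    import kotlin.math.abs

    // Naive approach: alert whenever the dev build's value differs from the
    // latest-master baseline by more than some tolerance. The metric can be
    // anything measurable: execution time, frame metrics, resource usage, ...
    fun shouldAlertNaive(devValue: Double, masterBaseline: Double, tolerance: Double = 0.0): Boolean =
        abs(devValue - masterBaseline) > tolerance
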
  11. Problem: results are variable, and sometimes VERY variable

  12. Stats to the Rescue: compare dev build values against master build values
  13. Mann-Whitney U Test: p-value → confidence

  14. Statistical Approach: collect a set of N values from the dev version, test against the data set from master, alert if confidence > threshold
  15. Statistical Approach: collect a set of N values from the dev version, test against the data set from master, alert if diff confidence > threshold. WE CONTROL THESE: N and the threshold.
  16. Statistical Approach: higher number of values = better stats (but more device time); higher alert threshold = lower false alert rate (but lower chance of valid detection)
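
A minimal Kotlin sketch of this statistical approach, assuming the Apache Commons Math implementation of the Mann-Whitney U test (the library choice, names, and default threshold are illustrative, not prescribed by the talk):

    import org.apache.commons.math3.stat.inference.MannWhitneyUTest

    // Compare N measurements from the dev (PR) build against measurements from
    // master and alert only when the confidence that the two distributions
    // differ clears a threshold. N and the threshold are the knobs we control.
    fun shouldAlert(
        devValues: DoubleArray,        // N values collected from the dev version
        masterValues: DoubleArray,     // values collected from master builds
        confidenceThreshold: Double = 0.99
    ): Boolean {
        val pValue = MannWhitneyUTest().mannWhitneyUTest(devValues, masterValues)
        val confidence = 1.0 - pValue
        return confidence > confidenceThreshold
    }
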
  17. For whom?

  18. Trust = Valid detections / Noise

  19. Trust = Valid detections / Noise

  20. Pipeline overview: open PR / merge to master → trigger perf run → PerfTest Job (run tests + gather data) → perf data → Backend → Trends + Alert
  21. PerfTest Job run tests + gather data

  22. PerfTest Job, naive approach: Build Node (runner) → test metrics → Build Node (aggregate metrics) → backend
  23. PerfTest Job, naive approach: Build Node (runner) → test metrics, test metrics, test metrics → Build Node (aggregate metrics) → backend
  24. PerfTest Job, naive-ish approach: Build Node (runner) + device provider (get / release) → test metrics, test metrics, test metrics → Build Node (aggregate metrics) → backend
  25. None
  26. Do you have the resources to build this? https://code.fb.com/android/the-mobile-device-lab-at-the-prineville-data-center

  27. PerfTest Job, cloud version: Build Node → cloud device provider (get / release) → run tests + gather data → test metrics → aggregate metrics → backend
  28. Cloud version: stability, scalability, control. PerfTest Job: run tests + gather data.
  29. Pipeline overview: open PR / merge to master → trigger perf run → PerfTest Job (run tests + gather data) → perf data → Backend → Trends + Alert
  30. None
  31. Instrumented Application / Instrumentation Test:

      EventTracker.startPerfTracking(Beacon.CHANNEL_SYNC)
      // code that does channel sync
      EventTracker.endPerfTracking(Beacon.CHANNEL_SYNC)

      persist_rtm_start,44
      process_rtm_start,19
      ms_time_to_connect,703
      channel_sync,381
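
Slack's EventTracker and Beacon types are internal, so the following is a purely hypothetical Kotlin sketch of what such beacon-style timing could look like; every name and detail is an assumption, not the real implementation:

    import android.os.SystemClock

    enum class Beacon(val metricName: String) {
        CHANNEL_SYNC("channel_sync"),
        MS_TIME_TO_CONNECT("ms_time_to_connect")
    }

    object EventTracker {
        private val startTimes = mutableMapOf<Beacon, Long>()
        private val results = mutableListOf<Pair<String, Long>>()

        fun startPerfTracking(beacon: Beacon) {
            startTimes[beacon] = SystemClock.elapsedRealtime()
        }

        fun endPerfTracking(beacon: Beacon) {
            val start = startTimes.remove(beacon) ?: return
            // Recorded as "channel_sync,381"-style lines that the perf run gathers.
            results.add(beacon.metricName to (SystemClock.elapsedRealtime() - start))
        }
    }
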
  32. Focus on the client: the network is highly unstable and variable, and backend regressions should not block client developers. Use Record & Replay: github.com/airbnb/okreplay
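
To illustrate the replay half of record & replay, here is a sketch of the underlying idea as a plain OkHttp interceptor (this is not okreplay's actual API): responses are served from a pre-recorded "tape" so perf tests never depend on the live, variable network.

    import okhttp3.Interceptor
    import okhttp3.MediaType.Companion.toMediaType
    import okhttp3.Protocol
    import okhttp3.Response
    import okhttp3.ResponseBody.Companion.toResponseBody

    // Replay-only interceptor: look up the response body by URL on a recorded
    // tape; fall through to the real network only if nothing was recorded.
    class ReplayInterceptor(private val tape: Map<String, String>) : Interceptor {
        override fun intercept(chain: Interceptor.Chain): Response {
            val request = chain.request()
            val recordedBody = tape[request.url.toString()]
                ?: return chain.proceed(request)
            return Response.Builder()
                .request(request)
                .protocol(Protocol.HTTP_1_1)
                .code(200)
                .message("OK")
                .body(recordedBody.toResponseBody("application/json".toMediaType()))
                .build()
        }
    }
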
  33. Keep it real: we want to catch regressions that represent the real world. Preserve the prod object graph and run against a release-like config (@LargeTest).
  34. Keep it real What about small unit perf tests?

  35. Make it stable: perf tests will be executed a lot and the stability bar is very high. Don't compromise on flakiness; use IdlingResource.
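
A small sketch of the IdlingResource idea using Espresso's stock CountingIdlingResource; the object name and call sites are assumptions:

    import androidx.test.espresso.IdlingRegistry
    import androidx.test.espresso.idling.CountingIdlingResource

    // Expose background work (e.g. channel sync) to Espresso so tests wait for
    // it deterministically instead of sleeping, which adds noise and flakiness.
    object ChannelSyncIdler {
        val resource = CountingIdlingResource("channel_sync")

        fun begin() = resource.increment()   // call when the tracked work starts
        fun end() = resource.decrement()     // call when the tracked work finishes
    }

    // In test setup:
    // IdlingRegistry.getInstance().register(ChannelSyncIdler.resource)
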
  36. Keep it working

  37. Pipeline overview: open PR / merge to master → trigger perf run → PerfTest Job (run tests + gather data) → perf data → Backend → Trends + Alert
  38. Backend: createBuild | completeBuild API; perf data → check sanity → store & analyze data
  39. Backend Perf Data:

      {
        "build_info": {
          "platform": "android",
          "author_slack_id": "W1234567",
          "branch_name": "master",
          "build_cause": "Fixed sort order for starred unreads. (#9838)",
          "id": 8668,
          "jenkins_build_number": "9287",
          "author_name": "Kevin Lai",
          "job_name": "android-master-perf"
        },
        "tests": [
          {
            "status": "complete",
            "name": "com.Slack.ui.perf.SignInPerfTest#firstSignin_medium",
            "metric_results": [
              {"name": "inflate_flannel_start", "value": 263},
              {"name": "quickswitcher_show", "value": 30},
              {"name": "inflate_flannel_start", "value": 314},
              {"name": "quickswitcher_show", "value": 45}
            ]
          }
        ]
      }
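
The payload above maps naturally onto a few data classes; this is just a readability sketch, not Slack's actual schema code:

    data class MetricResult(val name: String, val value: Long)

    data class TestResult(
        val status: String,
        val name: String,
        val metricResults: List<MetricResult>
    )

    data class BuildInfo(
        val platform: String,
        val authorSlackId: String,
        val branchName: String,
        val buildCause: String,
        val id: Long,
        val jenkinsBuildNumber: String,
        val authorName: String,
        val jobName: String
    )

    data class PerfData(val buildInfo: BuildInfo, val tests: List<TestResult>)
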
  40. Backend Stack: new shiny tech is great… but use whatever stack you have in house
  41. Pipeline overview: open PR / merge to master → trigger perf run → PerfTest Job (run tests + gather data) → perf data → Backend → Trends + Alert
  42. " Trends

  43. "

  44. "

  45. "

  46. "

  47. Pipeline overview: open PR / merge to master → trigger perf run → PerfTest Job (run tests + gather data) → perf data → Backend → Trends + Alert
  48. ! Alert

  49. !

  50. !

  51. !

  52. !

  53. !

  54. !

  55. More on debugging: pre-merge alerting is great for experimenting. Detailed trace info would be nice; https://github.com/facebookincubator/profilo looks promising.
  56. !

  57. !

  58. Trust

  59. Pipeline overview: open PR / merge to master → trigger perf run → PerfTest Job (run tests + gather data) → perf data → Backend → Trends + Alert
  60. None
  61. None
  62. Thank you! Questions? @valera_zakharov