Quantifying the Health of End-to-End Test Automation

Quantifying the Health of End-to-End Test Automation Nozomi Ito

• CEO of MagicPod • MagicPod: a no-code, cloud-based test
automation tool • Organizer of the Japan Selenium User Community • Author of a Japanese Selenium book About me Nozomi Ito

Agenda 1. Health Score of End-to-End Test Automation 2. Breakdown
of the Health Score 3. Improving the Health Score

• By reduced cost? • By number of bugs found?
• By DORA Metrics? ⚬ Deployment Frequency, Lead Time, Change Failure Rate, MTTR • By business growth? • By profit margin? How do we quantify the success of End-to-End test automation?

Continue Results justify the cost Cancel For MagicPod Sucess of
the customer’s test automation They continue to use the product without cancellation Results do not justify the cost

Interviewed over 100 MagicPod customers (including those who cancelled) Measuring
Success with the ‘Health Score’ Defined ‘health score’ metrics based on key indicators correlated with automation success Calibrated the logic over 6 months by monitoring scores and customer cancellations Implemented Customer Success actions based on the health score ❶ ❷ ❸ ❹

• Total Score: 100 points • Green(Healthy) for 80 and
above, Orange(Normal) for 50-79 points, Red(Unhealthy) for below 50 points • Calculated daily, weekly, monthly Health Score Details 35 pts Tests are run at least once a day (Num. of tests run in a week) / (Num. of tests x Num. of weekdays) 35 pts Tests are successful Num. of passed tests / Num. of executed tests 20 pts Tests are maintenable Num. of shared steps, percentage of overly long tests, etc. 10 pts Others Approximate breakdown

Does the Health Score measure test automation sucess? If the
score improves from Red to Green, the cancellation rate is reduced to If the score improves from Red to Orange, the cancellation rate is reduced to 50% or less 20% or less

Does the Health Score measure test automation sucess? If the
score improves from Red to Green, the cancellation rate is reduced to If the score improves from Red to Orange, the cancellation rate is reduced to 50% or less 20% or less The score measures cancellation rate (≈success of test automation) well

Users can check their Health Score

Agenda 1. Health score of End-toEnd Test Automation 2. Breakdown

• Health score quantifies ‘whether daily reliable feedbacks from tests
are available and sustainable’ • Not ‘whether cost is cut’ nor ‘whether bugs are found’ 35 pts Tests are run at least once per day → Is the test feedback delivered in faster cycles? 35 pts Tests are successful → Is the test feedback reliable? 20 pts Tests are maintenable → Is the reliability of the test feedback sustainable? 10 pts Others Overview of Health Score Calculation Logic

Details of Health Score Calculation Logic Item Score 1. Sufficient
number of test cases 6 pts 2. Sufficient members in the project 3 pts 3. Utilization of shared steps 8 pts 4. Avoid long tests 5 pts 5. Use stable locators 8 pts 6. Tests are run at least once a day 35 pts 7. Low test failure rate 35 pts

1. Sufficient number of test cases (6 pts) To maximize
the benefits of automated testing, sufficient number of test cases is essential • 20+ test cases (user stories) earn full points

2. Sufficient members in the project (3 pts) • 4+
members earn full points Involvement of other members is important for long- term success

• Creating shared steps for 10-20% of total tests earn
full points Shared steps (common functions) promote standardization and reduce maintenance efforts 3. Utilization of shared steps (8 pts)

Long tests make error isolation difficult and should be avoided
when possible 4. Avoid long tests (5 pts) • Tests with over 200-300 steps are considered long • Keeping long tests within 10% of total tests earn full points

5. Use stable locators (8 pts) • Using stable locators
for at least 80% of total tests earn full points Complex locators are vulnerable to page structure changes

What locators are considered stable? Stable Unstable All other locators
• Attributes or text that uniquely identify elements • ‘contains’, ‘starts-with’ are allowed • The same rules apply for CSS locators ・id and accessibility information ・#id CSS selector ・//xxx[yyy=‘zzz’] XPath For elements used in tests, we recommend assigning a unique id, accessibility information, attribute, or text within the application

Tests that aren't run daily become difficult to maintain 6.
Tests are run at least once a day (35 pts) • Running all tests at least 4.2 days (average number of weekdays) per week earn full points • (Num. of tests run in a week) / (Num. of tests x Num. of weekdays)

7. Low test failure rate (35 pts) High failure rate
indicates lack of maintenance • Batch runs with a success rate of 90% or higher earn full points

Summary Item Score 1. Sufficient number of test cases 6
pts 2. Sufficient members in the project 3 pts 3. Utilization of shared steps 8 pts 4. Avoid long tests 5 pts 5. Use stable locators 8 pts 6. Tests are run at least once a day 35 pts 7. Low test failure rate 35 pts

Agenda 1. Health Score of End-toEnd Test Automation 2. Breakdown

Why is the Health Score low? 1.Project structure 2.Lack of
involvement from developers 3.Over-reliance on End-to-End test automation Main reasons we found through user interviews

Development Test Type Deploy frequency Health score In-house In-house Fully
In-house High Outsourced Outsourced Fully Outsourced Medium In-house Outsourced Third-party Testing Medium Outsourced In-house Automated UAT Low 1.Project structure Generally, fully “in-house” projects tend to achieve a high health score

Solution Development Test Type Health score Solution In-house In-house Fully
In-house Run tests every day Outsourced Outsourced Fully Outsourced Ask the development company to run tests every day In-house Outsourced Third-party Testing Ask the test staffs to involve deeply as members of the team Outsourced In-house Automated UAT Ask the development company to automate and run tests

2.Lack of involvement from developers • CI integration needs developer’s
support • Developers can improve the testability of the product • The benefits of test automation cannot be maximized if it does not contribute to developer productivity Successful End-to-End test automation cannot be realized without the help of developers

Solution • Involve developers in End-to-End test automation • Run
tests every day on the CI server • Improve developer productivity through faster feedbak • Improve testability of the product

3.Over-reliance on End-to-End test automation Some people try to automate
everything in the End-to-End test layer The project fails due to flakiness and slowness Automate all existing manual tests!!

Solution to maintain appropriate test pyramid Unit tests Integration tests
End-to-End tests Involve developers in the planning of test automation

What is important for the success of End-to-End test automation?
• Run tests continually to get faster feedback • Keep tests stable and reliable • Involve developers in End-to-End testing

Quantifying the Health of End-to-End Test Autom...

Quantifying the Health of End-to-End Test Automation

MagicPod

More Decks by MagicPod

Other Decks in Technology

Featured

Transcript