
Are Comprehensive Quality Models Necessary for Evaluating Software Quality?


by Klaus Lochmann, Jasmin Ramadani and Stefan Wagner

PROMISE'13: The 9th International Conference on Predictive Models in Software Engineering


Transcript

  1. www.uni-stuttgart.de Stefan Wagner PROMISE 2013 Baltimore, USA 9 October 2013

     Are Comprehensive Quality Models Necessary for Evaluating Software Quality? @prof_wagnerst, joint work with Klaus Lochmann and Jasmin Ramadani
  2. "We deployed a bug prediction algorithm across Google, and found

    no identifiable change in developer behavior." Lewis et al., ICSE'13
  3. "Quality is a complex and multi-faceted concept... it is also

    the source of great confusion." –David A. Garvin
  4. Example quality-model fragment: Instruments: Gendarme "Avoid Uncalled Private Code" and PMD "Unused Private Method"; Measure: statically unused method; Product factor: usefulness of method; Quality attributes: analyzability, maintainability
  5. RQ 1: What is the performance of focused quality models

    built using machine learning algorithms? RQ 2: What is the performance of the focused quality models including additional expert-based measures?
  6. Predictor Models Used • Random guessing • Linear regression (forward

    selection) • Linear regression (backward elimination) • Linear regression (bidirectional elimination) • Classification tree • Random forest
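The predictor models listed above can be sketched roughly as follows. This is a toy illustration assuming scikit-learn: the data, the model parameters, and the train/test split are placeholders, and the stepwise variable-selection variants (forward/backward/bidirectional) are omitted for brevity.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.tree import DecisionTreeRegressor
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 8))                                 # toy "static-analysis measures"
y = 2 * X[:, 0] + X[:, 3] + rng.normal(scale=0.1, size=200)   # toy quality rating
X_tr, X_te, y_tr, y_te = X[:150], X[150:], y[:150], y[150:]

models = {
    "random guessing": None,  # baseline: no learning at all
    "linear regression": LinearRegression(),
    "regression tree": DecisionTreeRegressor(max_depth=4, random_state=0),
    "random forest": RandomForestRegressor(n_estimators=100, random_state=0),
}

results = {}
for name, model in models.items():
    if model is None:
        # random guessing: predict a randomly drawn training label per test case
        pred = rng.choice(y_tr, size=len(y_te))
    else:
        pred = model.fit(X_tr, y_tr).predict(X_te)
    results[name] = float(np.mean(np.abs(y_te - pred)))  # MAR on the hold-out set

for name, m in sorted(results.items(), key=lambda kv: kv[1]):
    print(f"{name:20s} MAR = {m:.3f}")
```

On this synthetic linear data the learned models beat the random-guessing baseline by a wide margin, which is the comparison the study formalizes with SA.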
  7. Model Comparison • Mean absolute residual: MAR = \frac{1}{n} \sum_{i=1}^{n} |y_i - \hat{y}_i|

     • Standardised accuracy: SA_{P_i} = 1 - \frac{MAR_{P_i}}{MAR_{P_0}} • Effect size: \Delta = \frac{MAR_{P_i} - MAR_{P_0}}{s_{P_0}}, where MAR_{P_0} and s_{P_0} are the mean and standard deviation of the random-guessing baseline
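The comparison measures on this slide (MAR, SA, effect size) are straightforward to compute; a minimal sketch follows, with function names and the toy numbers being my own, not the paper's.

```python
import numpy as np

def mar(y, y_hat):
    """Mean absolute residual: average of |y_i - y_hat_i|."""
    y, y_hat = np.asarray(y, float), np.asarray(y_hat, float)
    return float(np.mean(np.abs(y - y_hat)))

def sa(mar_model, mar_random):
    """Standardised accuracy: relative improvement over random guessing."""
    return 1.0 - mar_model / mar_random

def effect_size(mar_model, mar_random, s_random):
    """Improvement scaled by the standard deviation of the random baseline."""
    return (mar_model - mar_random) / s_random

# toy numbers: four systems, actual vs. predicted maintainability ratings
m = mar([1.0, 2.0, 3.0, 4.0], [1.5, 2.0, 2.5, 4.5])   # -> 0.375
print(m, sa(m, 1.5))   # SA = 0.75: 75% better than a baseline with MAR 1.5
```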
  8. Study Objects • 1994 Java systems from SDS repository •

    15 Java systems for which we have manual measures
  9. Procedure • Collection of all measures and evaluations for maintainability

    • Building of predictors (4-fold cross validation) • Calculation of model comparison measures
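The predictor-building step of the procedure might look like this. Again a sketch assuming scikit-learn, with placeholder data and a random forest standing in for any of the predictors.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 5))                    # placeholder measure matrix
y = X[:, 0] + rng.normal(scale=0.1, size=100)    # placeholder quality ratings

# 4-fold cross-validation; scikit-learn negates MAE so that higher is better
scores = cross_val_score(RandomForestRegressor(random_state=0), X, y,
                         cv=4, scoring="neg_mean_absolute_error")
mar_per_fold = -scores
print(mar_per_fold, mar_per_fold.mean())
```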
  10. [Chart "SA / # of Variables": SA (percentage of improvement over random guessing) plotted against the number of variables for Random Forest (Forward Selection), Classification Tree (Forward Selection), Classification Tree (different complexity param.), Regression (Forward Selection), Regression (Bidirectional Elimination) and Regression (Backward Elimination)]
  11. [Chart "SA / # of Variables": SA plotted against the number of variables for Random Forest (Forward Selection)]
  12. [Chart "With and Without Manual Measures": SA plotted against the number of variables, for systems with and for systems without expert-based measures]
  13. Threats to Validity • Expert measures not included in RQ

    1 • For RQ 2 only 15 systems • Set of predictors and comparison measures • Only maintainability • Only Java systems
  14. • Comprehensive models capture all the different aspects and

     quality factors • More focused models use only a few measures • A focused model reached 61% accuracy with only 10 measures (compared to 378) • Expert-based measures reduced accuracy • So what should we use?