Uncovering Causal Relationships between Software Metrics and Bugs (CSMR 2012)

Uncovering Causal Relationships between Software Metrics and Bugs Nicolas Anquentil
RMoD Team [email protected] Cesar Couto, Marco Tulio Valente, Roberto Bigonha Department of Computer Science {cesarfmc,mtov,bigonha}@dcc.ufmg.br

Introduction  Bug Prediction:  Input: system S + information
on S (changes, bugs, source code metrics)  Output: bug prediction model  Bug Prediction Model:  Input: class C from S + information on C  Output: in the next t months, C will present n bugs 2

Introduction  No questions on the value of this information
 Software Quality  Preventive Maintenance  Project Management  The real question is:  How reliable is this information? 3

Most common statistics behind prediction models  [Linear] regression model
4 Class Independent Variable (any metric) Dependent Variable (# of post-release defects) C1 5 2 C2 6 4 C3 4 1 … … … #defects #metric y = α + βx

Common predictors  D'Ambros; Lanza; MSR 2009 5

Problem: Correlation does not imply causation  Spurious Correlation 
Business Week 2011 6

Our approach for bug prediction  Granger Causality Test 
Clive Granger, Nobel Prize Winner in 2003  "For methods of analyzing economic time series”  Time series X is useful in forecasting Y?  Example:  Do changes in oil prices cause recession?  Does money growth cause inflation? 7

Granger Test  Given two time series X and Y
 “X Granger-cause Y” if  X helps to predict Y in the future  Y = Bugs  Number of defects in a class in a time frame  X = a software metric [that changes with time]  Size (number of methods, lines etc)  CK (coupling, cohesion, inheritance etc) 8

Statistics behind Granger  Univariate:  Bivariate: where p =
auto-regressive lag (parameter) 9 1. Build two autoregressive models: 2. If Bivariate is better than Univariate (F-test) then “X Granger-Cause Y”

Granger Test example 10

Granger vs Regression  Regression Techniques  Do not rely
on past values  For each class, regression correlates:  Current value of the independent variable (metrics)  Current value of the dependent variable (bugs)  Goal is to discover the number of bugs in the future  Granger Causality  Trend analysis technique, time series analysis  Goal is to infer if X helps to predict Y 11

Experiment  Dataset  B = bugs  D =
defects 12

Experiment  Metrics 13

Methodology  success[c,m,lag]:  boolean matrix with results for Granger
 Dimensions represents classes, metrics, lags 14 for each class c! for each metric m! for lag = 1 to 4 do! if granger(tsm[c,m], tsd[c], lag)! then success[c,m,lag] = true! else success[c,m,lag] = false!

Results  How many defects were “predicted” by Granger? 
How many defects were found in the classes where Granger indicated positive result for any metric?  D = defects  DVC = defects in valid classes  DPG = defects predicted by Granger 15

Results  What are the metrics that most contributed to
predict defects? 16

Results  What are the lag values that most led
to positive results for Granger? 17

Conclusions  First work to bug prediction that used Granger
 84% of the defects were predicted by Granger  Not identify a “holy grail” for bug prediction 18

Future work  Implement a tool to alert developers about
future defect 19

Questions?

Uncovering Causal Relationships between Softwar...

Uncovering Causal Relationships between Software Metrics and Bugs (CSMR 2012)

ASERG, DCC, UFMG

More Decks by ASERG, DCC, UFMG

Other Decks in Research

Featured

Transcript

Uncovering Causal Relationships between Software Metrics and Bugs Nicolas Anquentil

Introduction  Bug Prediction:  Input: system S + information

Introduction  No questions on the value of this information

Most common statistics behind prediction models  [Linear] regression model

Common predictors  D'Ambros; Lanza; MSR 2009 5

Problem: Correlation does not imply causation  Spurious Correlation 

Our approach for bug prediction  Granger Causality Test 

Granger Test  Given two time series X and Y

Statistics behind Granger  Univariate:  Bivariate: where p =

Granger Test example 10

Granger vs Regression  Regression Techniques  Do not rely

Experiment  Dataset  B = bugs  D =

Experiment  Metrics 13

Methodology  success[c,m,lag]:  boolean matrix with results for Granger

Results  How many defects were “predicted” by Granger? 

Results  What are the metrics that most contributed to

Results  What are the lag values that most led

Conclusions  First work to bug prediction that used Granger

Future work  Implement a tool to alert developers about

Questions?