Understanding the Impact of Domain Term Explanation on Duplicate Bug Report Detection

preencoded.png Understanding the Impact of Domain Term Explanation on Duplicate
Bug Report Detection Usmi Mukherjee [email protected] Masud Rahman [email protected]

preencoded.png The Duplication Conundrum in Bug Tracking Systems Independent Report
Submission Users submit bugs reports asynchronously. High Duplication Rate Up to 42% of reports are duplicates. (Zou et al, TSE 2018) Significant Overhead Duplicates add maintenance burden. (Jalbert et al, DSN 2008)

Let’s look at a Bug Report 3 78% bug reports
are short, take 121 days longer to resolve (Zhang et al, 2017) Domain-Specific Terms make bug reports are difficult to comprehend, difficult to detect duplicates

The probability of two persons using the same text to
explain the same issue is very low (e.g., 10%–15%) Furnas et al, Communications ACM 1987

Textual Dissimilarity of Bug Reports Different textual descriptions for the
same underlying issue Variation in component descriptions and observed behaviors Missing or differently written components across reports Jahan and Rahman, SANER 2023

The prevalence of domain-specific terms in bug reports could be
crucial for understanding and finding duplicates Motivation 78% of bug reports contain less than 100 words Zhang et al, ICPC 2021 Difficult to understand and find duplicates Zhang et al, ICPC 2021 Designed for textually similar, but 19-23% are textually dissimilar Jahan and Rahman, SANER 2023

preencoded.png Workflow Step 1: Construction of Explanation Module Fine-tune LLM
for explanations of domain-terms Step 2: Enriching Bug Report Add domain term explanations to reports. Step 3: Duplicate Bug Report Report Detection Apply enriched reports to existing techniques.

preencoded.png Step 1 : Construction of Explanation Module

preencoded.png Step 2 : Enriching Bug Report

preencoded.png Step 3 : Duplicate Bug Report Detection Classification Based
1. Siamese-CNN 2. DC-CNN 3. CTEDB Ranking Based 1. BM25 2. LDA+GloVe 3. SBERT 4. CUPID Ranking Based Recall Rate @ K Classification Based AUC, Precision, Recall, F1Score

RQ1 : Does enrichment help improve existing techniques in detecting
duplicate bug reports ? RQ2 : Does enrichment help in detecting textually dissimilar duplicate bug reports? Research Questions

RQ 1 - Does enrichment help improve existing duplicate bug
report detection techniques ? Ranking Based Techniques 66.88% SBERT Recall@1 Highest gain in recall. 41.39% LDA+GloVe Recall@5 Best recall improvement.

RQ 1 - Does enrichment help improve existing duplicate bug
report detection? Classification Based Techniques 5.29% DC-CNN AUC Significant AUC increase. 5.70% CTEDB Precision Notable performance boost.

Impact of the number of domain term explanations

preencoded.png RQ1: Key Insights Better representation, bridges vocabulary gap Negligible
computational overhead (0.11 sec per report)

RQ 2 - Does enrichment help in detecting textually dissimilar
duplicate bug reports? Textually Similar Textually Dissimilar Ranking Based Techniques LDA+GloVe Impact 63% recall improvement for similar LDA+GloVe Impact 137% recall improvement for dissimilar

RQ 2 - Does enrichment help in detecting textually dissimilar
duplicate bug reports? Classification Based Techniques Textually Similar Textually Dissimilar DC-CNN Benefits 4-8% gain for textually similar DC-CNN Benefits 5-9% gain for textually dissimilar

preencoded.png RQ2: Key Insights Enhances semantic representation Increases keyword overlap
Improved domain-specific context

preencoded.png Actionable Insights – Software Practitioners Improved Management Better bug
report writing Enhanced Search Better search Bug Report Comprehension Better understandability Other Improved SE Artifacts Improve software artifacts like requirements documentation

preencoded.png Actionable Insights – Researchers Extraction Optimization Improve term extraction
Project Ecosystem Analysis Variation of terms from open source and industrial projects Impact on other Software Engineering tasks Influence on downstream tasks like bug localization

THANK YOU Replication Package QUESTIONS ? Pre-print

Understanding the Impact of Domain Term Explana...

Understanding the Impact of Domain Term Explanation on Duplicate Bug Report Detection

Masud Rahman

More Decks by Masud Rahman

Other Decks in Research

Featured

Transcript

preencoded.png Understanding the Impact of Domain Term Explanation on Duplicate

preencoded.png The Duplication Conundrum in Bug Tracking Systems Independent Report

Let’s look at a Bug Report 3 78% bug reports

The probability of two persons using the same text to

Textual Dissimilarity of Bug Reports Different textual descriptions for the

The prevalence of domain-specific terms in bug reports could be

preencoded.png Workflow Step 1: Construction of Explanation Module Fine-tune LLM

preencoded.png Step 1 : Construction of Explanation Module

preencoded.png Step 2 : Enriching Bug Report

preencoded.png Step 3 : Duplicate Bug Report Detection Classification Based

RQ1 : Does enrichment help improve existing techniques in detecting

RQ 1 - Does enrichment help improve existing duplicate bug

RQ 1 - Does enrichment help improve existing duplicate bug

Impact of the number of domain term explanations

preencoded.png RQ1: Key Insights Better representation, bridges vocabulary gap Negligible

RQ 2 - Does enrichment help in detecting textually dissimilar

RQ 2 - Does enrichment help in detecting textually dissimilar

preencoded.png RQ2: Key Insights Enhances semantic representation Increases keyword overlap

preencoded.png Actionable Insights – Software Practitioners Improved Management Better bug

preencoded.png Actionable Insights – Researchers Extraction Optimization Improve term extraction

THANK YOU Replication Package QUESTIONS ? Pre-print