Why Did your PR Get Rejected? Defining Guidelines for Avoiding PR Rejection in Open Source Projects

Why Did your PR Get Rejected? Deﬁning Guidelines for Avoiding
PR Rejection in Open Source Projects Nick Papadakis, Ayan Patel, Tanay Gottigundala, Alexandra Garro, Xavier Graham, Bruno da Silva [email protected] CHASE Workshop, June 2020

Reviewers’ perspective

Focus on quantitative analysis only or limited by PRs from
the core team

They built a list of PR rejection reasons from surveying
the PR authors. We want to do manual analysis directly on the PRs and spell out concrete guidelines for contributors

We focus on all types of PR authors Rejected PRs
only Qualitative investigation

RQ1: most frequent reasons why PRs are rejected

RQ2: PR rejection vs. sentiment on comments

RQ3: interaction length on PRs vs. rejection reason and sentiment

We manually analyzed 231 rejected PRs A python packet manipulation
program and lib. 5.3k stars, 1.2k forks RQ1: most frequent reasons why PRs are rejected RQ2: PR rejection vs. sentiment on comments RQ3: interaction length on PRs vs. rejection reason and sentiment

5 researchers Manual classiﬁcation of: • Sentiment (pos, neg, neutral)
• Rejection reason

5 researchers Manual classification of: • Sentiment (pos, neg, neutral)
• Rejection reason Calibration session: 25 PRs, everyone present Each of the remaining PRs assigned to 2 different researchers Individual classification sessions Conflict resolution sessions

RESULTS

RQ1: most frequent reasons why PRs are rejected PR conﬂicts
…after excluding PRs closed by author or accidentally, and PRs with no conversation

…after excluding PRs closed by author or accidentally, and PRs with no conversation code rebase We found a lot of => researchers: note these ‘false rejections’ since code rebase is another way to integrate code (accepting the changes fully or partially)

…after excluding PRs closed by author or accidentally, and PRs with no conversation code rebase We found a lot of => researchers: note these ‘false rejections’ since code rebase is another way to integrate code (accepting the changes fully or partially) Unnecessary functionality

…after excluding PRs closed by author or accidentally, and PRs with no conversation code rebase We found a lot of => researchers: note these ‘false rejections’ since code rebase is another way to integrate code (accepting the changes fully or partially) Unnecessary functionality Needs testing

…after excluding PRs closed by author or accidentally, and PRs with no conversation code rebase We found a lot of => researchers: note these ‘false rejections’ since code rebase is another way to integrate code (accepting the changes fully or partially) Unnecessary functionality Needs testing Author unable to ﬁx issues

RQ2: PR rejection vs. sentiment on comments The vast majority
of the PR conversations were neutral in sentiment (~66%)

of the PR conversations were neutral in sentient (~66%) ~30% are positive

of the PR conversations were neutral in sentient (~66%) ~30% are positive ~3% are negative … and this distribution does not vary signiﬁcantly as you navigate through speciﬁc rejection categories

Author issues Version control issues Code issues Side effect issues Unnecessary changes Avg comment count 10 8 7 4 3

Avg comment count 10 10 5 Do people act more on things they’re emotionally motivated? More conversation => more sentiment expressed In many PRs with positive sentiment, reviewers were trying to help the author to get the PR accepted

a) Know well the scope and requirements of your change
and the context around it. b) Make sure that someone else has not already opened a pull request to address the same issue. c) Understand well the project functionalities before creating new ones. Make sure new functionalities are really necessary. d) Always include tests to cover your changes. Make sure your code meets test coverage expectation. e) Make sure to follow up on your pull requests; reviewers may request changes before it is accepted. v0.1

Why Did your PR Get Rejected? Deﬁning Guidelines for Avoiding
PR Rejection in Open Source Projects Nick Papadakis, Ayan Patel, Tanay Gottigundala, Alexandra Garro, Xavier Graham, Bruno da Silva [email protected] CHASE Workshop, June 2020

Why Did your PR Get Rejected? Defining Guidelin...

Why Did your PR Get Rejected? Defining Guidelines for Avoiding PR Rejection in Open Source Projects

Bruno C. da Silva

More Decks by Bruno C. da Silva

Other Decks in Research

Featured

Transcript

Why Did your PR Get Rejected? Deﬁning Guidelines for Avoiding

?

Reviewers’ perspective

Focus on quantitative analysis only or limited by PRs from

They built a list of PR rejection reasons from surveying

We focus on all types of PR authors Rejected PRs

RQ1: most frequent reasons why PRs are rejected

RQ2: PR rejection vs. sentiment on comments

RQ3: interaction length on PRs vs. rejection reason and sentiment

We manually analyzed 231 rejected PRs A python packet manipulation

5 researchers Manual classiﬁcation of: • Sentiment (pos, neg, neutral)

5 researchers Manual classiﬁcation of: • Sentiment (pos, neg, neutral)

RESULTS

RQ1: most frequent reasons why PRs are rejected PR conﬂicts

RQ1: most frequent reasons why PRs are rejected PR conﬂicts

RQ1: most frequent reasons why PRs are rejected PR conﬂicts

RQ1: most frequent reasons why PRs are rejected PR conﬂicts

RQ1: most frequent reasons why PRs are rejected PR conﬂicts

RQ2: PR rejection vs. sentiment on comments The vast majority

RQ2: PR rejection vs. sentiment on comments The vast majority

RQ2: PR rejection vs. sentiment on comments The vast majority

RQ3: interaction length on PRs vs. rejection reason and sentiment

RQ3: interaction length on PRs vs. rejection reason and sentiment

a) Know well the scope and requirements of your change

Why Did your PR Get Rejected? Deﬁning Guidelines for Avoiding