Gender-diversity analysis of technical contributions Daniel Izquierdo Cortázar @dizquierdo dizquierdo at bitergia dot com https://speakerdeck.com/bitergia OpenStack Summit, Barcelona 2016
/me CDO in Bitergia, the software development analytics company Lately involved in understanding the gender diversity in some OSS communities Involved in OPNFV dashboard (opnfv.biterg.io) Disclaimer: not involved in any working group, own analysis and interest, I may have missed some stuff...
Why this study Diversity matters I attended some (Women of OpenStack) talks in the OpenStack Summit (Tokyo and Austin) Produced some numbers that gained some attention: OpenStack and Linux Kernel In the end this is all about transparency and improvement Update the numbers
What we have so far FOSS Survey in 2013: - http://floss2013.libresoft.es/results.en.html - 11% of women answered the survey The Industry Gender Gap by the World Economic Forum. - 5% for CEOs, 21% for Mid-level roles, 32% of Junior roles
OpenStack (Austin) numbers Women activity (all of the history): ~ 10,5% of the population ( ~ 570 developers ) ~ 6,8% of the activity ( >=16k commits )
Summary Conclusions not representative, but: - Women represents around 30%/40% of the workforce in tech companies. - And between 10% and 20% if focused on tech teams. - OpenStack shows a 11% of the population - Linux Kernel shows a 10% of the population
Some Definitions Contributions: commit, patchset, code review, email Other potential metrics: diversity by company, fairness in the code review among organizations and genders, transparency in the process Available but sensitive info: affiliation, countries, time to review
Architecture Mining Tools Perceval ● Produces JSON documents from the usual data sources in OSS ● Part of the GrimoireLab toolchain ● grimoirelab.github.io
Architecture Viz ElasticSearch + Kibana ● ElasticSearch: Schemaless db ● Kibana: works great with ES ● This tandem helps a lot to verify info ● Drill down capabilities ● Extra info available (but not displayed)
Gerrit WOO Activity ● 28,503 changesets sent, 9,4% activity ● 812 women sending changesets, 11,87% of the population ● 9,56% of the activity and 13% of the population during the last year Women sending changesets
Some Answers ● Similar activity in Git: increase in the number of repositories ● WOO lower activity as core reviewers (~ -9%) ○ Activity has increased on the other hand (~ 6%)
Open Questions from Last Talk Question: Is there a specific action for helping you with the data correctness or the name identification? Suggestion: integrate openstack id with gerrit and in the members foundation directory, there's specific information related to gender Video: https://www.youtube.com/watch?v=TQIQCT-Aqpo
Open Questions from Last Talk Comment: the reason why the documentation project is doing so great is because they have great inclusive leaders Comment: Another interesting point is 'retention': how to bring them on board and keep them contributing Video: https://www.youtube.com/watch?v=TQIQCT-Aqpo
Open Questions from Last Talk Suggestion: work on relative numbers and not that much in the net numbers. As projects come and go, it would be interesting to work at this level. Comment: working at the level of high school, works done in the USA/Europe? People are willing to help with this line. Suggestion: address people outside of the gender binary Video: https://www.youtube.com/watch?v=TQIQCT-Aqpo
Further Work Sensitive info: dashboard still private Extra analysis: time to merge fairness, companies women %, Outreachy follow ups, quarterly reports, updated data, specific policies ROI and others. This [hopefully] helps to have a better picture Other minorities analysis could be done
Gender-diversity analysis of technical contributions Daniel Izquierdo Cortázar @dizquierdo dizquierdo at bitergia dot com https://speakerdeck.com/bitergia OpenStack Summit, Barcelona 2016