Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Gender Diversity Analysis in OSS Projects
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
Bitergia
June 29, 2017
Technology
0
34
Gender Diversity Analysis in OSS Projects
PyData Meetup, Madrid
Bitergia
June 29, 2017
Tweet
Share
More Decks by Bitergia
See All by Bitergia
Building and Supporting Open Source Communities through Metrics
bitergia
0
68
Defining the limits of Risk
bitergia
0
87
Present and Future of GrimoireLab
bitergia
0
77
InnerSource Commons
bitergia
0
100
Collaboration as Health Indicator
bitergia
0
130
La estrella de mi comunidad es un bot. ¿Dónde están los humanos?
bitergia
0
110
IoT Projects in FLOSS Foundations, a report based on community data
bitergia
0
130
Contributor Leaderboards to Incentivize Good Community Citizenship
bitergia
0
130
FreeScout: Cómo montar un departamento de soporte/atención al cliente con software libre
bitergia
0
490
Other Decks in Technology
See All in Technology
FinTech SREのAWSサービス活用/Leveraging AWS Services in FinTech SRE
maaaato
0
130
Kiro IDEのドキュメントを全部読んだので地味だけどちょっと嬉しい機能を紹介する
khmoryz
0
200
日本の85%が使う公共SaaSは、どう育ったのか
taketakekaho
1
230
フルカイテン株式会社 エンジニア向け採用資料
fullkaiten
0
10k
データの整合性を保ちたいだけなんだ
shoheimitani
8
3.1k
プロポーザルに込める段取り八分
shoheimitani
1
280
20260208_第66回 コンピュータビジョン勉強会
keiichiito1978
0
150
コミュニティが変えるキャリアの地平線:コロナ禍新卒入社のエンジニアがAWSコミュニティで見つけた成長の羅針盤
kentosuzuki
0
120
Claude_CodeでSEOを最適化する_AI_Ops_Community_Vol.2__マーケティングx_AIはここまで進化した.pdf
riku_423
2
580
AzureでのIaC - Bicep? Terraform? それ早く言ってよ会議
torumakabe
1
570
会社紹介資料 / Sansan Company Profile
sansan33
PRO
15
400k
生成AIを活用した音声文字起こしシステムの2つの構築パターンについて
miu_crescent
PRO
2
210
Featured
See All Featured
Skip the Path - Find Your Career Trail
mkilby
0
57
職位にかかわらず全員がリーダーシップを発揮するチーム作り / Building a team where everyone can demonstrate leadership regardless of position
madoxten
57
50k
世界の人気アプリ100個を分析して見えたペイウォール設計の心得
akihiro_kokubo
PRO
66
37k
Why You Should Never Use an ORM
jnunemaker
PRO
61
9.7k
Making Projects Easy
brettharned
120
6.6k
GraphQLの誤解/rethinking-graphql
sonatard
74
11k
Context Engineering - Making Every Token Count
addyosmani
9
660
Navigating Algorithm Shifts & AI Overviews - #SMXNext
aleyda
0
1.1k
The Mindset for Success: Future Career Progression
greggifford
PRO
0
240
Why Our Code Smells
bkeepers
PRO
340
58k
[SF Ruby Conf 2025] Rails X
palkan
1
760
A Modern Web Designer's Workflow
chriscoyier
698
190k
Transcript
Gender Diversity Analysis in OSS Projects PyData Meetup 29th June,
2017 Madrid Daniel Izquierdo, CDO
[email protected]
@dizquierdo speakerdeck.com/bitergia
None
None
Tweet => 13% of people attending the OpenStack Summit were
women Tweet => How many of them are actually contributing to the source code? Intro
Gender diversity of the technical contributions in the OpenStack Project
Teams Problem
Goal-Question-Metric approach • Contextualize tech gender-diversity groups • Data sources
available for the analysis • Tooling • Results and Further Work How To
Governance -> Goals <- Questions <- Metrics Goal: Increase gender
diversity in the OpenStack Foundation Context
How’s performing the industry with this respect? Context
FOSS Survey in 2013: - 11% of women answered the
survey The Industry Gender Gap by the World Economic Forum. - 5% for CEOs, 21% for Mid-level roles, 32% of Junior roles Context
Some companies https://blog.pinterest.com/en/our -plan-more-diverse-pinterest http://www.google.com/diversity/ http://newsroom.fb.com/news/2015/0 6/driving-diversity-at-facebook/ https://blogs.dropbox.com/dropbox/2014/11/stren gthening-dropbox-through-diversity/
Question: How’s performing OpenStack? We need data!!!! Context
Git: https://git.openstack.org Gerrit: https://review.openstack.org/ Others: Mailing Lists, Launchpad, IRC… >1M
commits and > 1.5M code review votes Data
Git example: commit 61ab0c46b09299c07e86320d612a0fcc281491b1 Author: Daniel Izquierdo <
[email protected]
> Date: Fri
Apr 28 00:04:05 2017 +0200 Add 'by default' values when eventizing Data
Tooling Original Data Sources Mining Tools Perceval @ GrimoireLab Info
Enrich. Genderize.io Ceres/ Pandas Jupyter Notebooks Manual work Viz ElasticSearch + Kibana
Results Original Data Sources • Git and Gerrit repos based
on yaml at Governance • ~ 1M commits • ~ 500K changesets • ~ 1.5M patchset uploads • ~ 1.8M patches code reviews
Results Mining Tools Perceval • At grimoirelab.github.io • Parses API’s,
logs, etc and produces JSON documents • Those are later stored in ElasticSearch
Results Info Enrich. Genderize.io Pandas Jupyter Notebooks Manual work •
Genderize.io: name database • Ceres: data analysis lib. to work with Perceval • Jupyter Notebook: web app. For data analysis • Manual work:
Results Viz ElasticSearch + Kibana • ElasticSearch: Schemaless db •
Kibana: works great with ES • This tandem helps a lot to verify info • Drill down capabilities
Demo OpenStack Diversity Dashboard (private access)
Women activity (last year): ~ 11% of the population (
~ 340 active developers ) ~ 9% of the activity ( >=6k commits ) OpenStack (Austin)
Women activity (last year): ~ 6.8% of the activity (
~ 4k commits ) ~ 9.9% of the population ( ~ 330 active developers ) Linux Kernel
Women activity (last year): ~2K commits (6.5% of the activity)
71 developers (8.5% of the population) Hadoop
Users It’s important to understand your potential users! C-level? Middle
management? Developers? Community? This study aims at understanding the current situation And look for best practices
Decisions based on data!
Bitergia Software Development Analytics for your peace of mind
Thanks! Daniel Izquierdo, CDO
[email protected]
@dizquierdo