Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Gender Diversity Analysis in OSS Projects
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Bitergia
June 29, 2017
Technology
36
0
Share
Gender Diversity Analysis in OSS Projects
PyData Meetup, Madrid
Bitergia
June 29, 2017
More Decks by Bitergia
See All by Bitergia
Building and Supporting Open Source Communities through Metrics
bitergia
0
76
Defining the limits of Risk
bitergia
0
110
Present and Future of GrimoireLab
bitergia
0
97
InnerSource Commons
bitergia
0
110
Collaboration as Health Indicator
bitergia
0
140
La estrella de mi comunidad es un bot. ¿Dónde están los humanos?
bitergia
0
120
IoT Projects in FLOSS Foundations, a report based on community data
bitergia
0
140
Contributor Leaderboards to Incentivize Good Community Citizenship
bitergia
0
140
FreeScout: Cómo montar un departamento de soporte/atención al cliente con software libre
bitergia
0
580
Other Decks in Technology
See All in Technology
How to learn AWS Well-Architected with AWS BuilderCards: Security Edition
coosuke
PRO
0
180
【関西製造業祭り2026春】現場を変える技術はここまで来た〜世界最大の製造業見本市から持って帰ってきたもの〜
tanakaseiya
0
190
ワールドカフェ再び、そしてゴール・ルール・ロール・ツール / World Café Revisited, and the Goals-Rules-Roles-Tools
ks91
PRO
0
180
続 運用改善、不都合な真実 〜 物理制約のない運用改善はほとんど無価値 / 20260518-ssmjp-kaizen-no-value-without-physical-constraints
opelab
2
270
AWS WAFの運用を地道に改善し、自社で運用可能にするプラクティス
andpad
1
580
Cortex(Code) を ML モデルの 精度改善サイクルに組み込む.pdf
oimo23
0
230
PdM・Eng・QAで進めるAI駆動開発の現在地/aidd-with-pdm-eng-qa
shota_kusaba
0
260
LookerとADKで作る社内AIエージェント
chanyou0311
0
270
コーディングエージェントはTypeScriptの 型エラーをどう自己修正しているのか
melonps
2
180
AIのために、AIを使った、Effect-TSからの脱却 〜テストを活用した安全なリファクタリングの進め方〜
bitkey
PRO
0
140
TypeScriptはどのようにどこまで推論できるのか ─ とにかく as は禁止で
ypresto
0
190
Purview Endpoint DLP 動かしてみた
kozakigh
1
450
Featured
See All Featured
Color Theory Basics | Prateek | Gurzu
gurzu
0
310
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
141
35k
Balancing Empowerment & Direction
lara
6
1.1k
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
11
910
The Straight Up "How To Draw Better" Workshop
denniskardys
239
140k
How Fast Is Fast Enough? [PerfNow 2025]
tammyeverts
3
570
Leo the Paperboy
mayatellez
7
1.8k
How to audit for AI Accessibility on your Front & Back End
davetheseo
0
370
A brief & incomplete history of UX Design for the World Wide Web: 1989–2019
jct
2
370
So, you think you're a good person
axbom
PRO
2
2k
Agile Actions for Facilitating Distributed Teams - ADO2019
mkilby
0
190
The Art of Programming - Codeland 2020
erikaheidi
57
14k
Transcript
Gender Diversity Analysis in OSS Projects PyData Meetup 29th June,
2017 Madrid Daniel Izquierdo, CDO
[email protected]
@dizquierdo speakerdeck.com/bitergia
None
None
Tweet => 13% of people attending the OpenStack Summit were
women Tweet => How many of them are actually contributing to the source code? Intro
Gender diversity of the technical contributions in the OpenStack Project
Teams Problem
Goal-Question-Metric approach • Contextualize tech gender-diversity groups • Data sources
available for the analysis • Tooling • Results and Further Work How To
Governance -> Goals <- Questions <- Metrics Goal: Increase gender
diversity in the OpenStack Foundation Context
How’s performing the industry with this respect? Context
FOSS Survey in 2013: - 11% of women answered the
survey The Industry Gender Gap by the World Economic Forum. - 5% for CEOs, 21% for Mid-level roles, 32% of Junior roles Context
Some companies https://blog.pinterest.com/en/our -plan-more-diverse-pinterest http://www.google.com/diversity/ http://newsroom.fb.com/news/2015/0 6/driving-diversity-at-facebook/ https://blogs.dropbox.com/dropbox/2014/11/stren gthening-dropbox-through-diversity/
Question: How’s performing OpenStack? We need data!!!! Context
Git: https://git.openstack.org Gerrit: https://review.openstack.org/ Others: Mailing Lists, Launchpad, IRC… >1M
commits and > 1.5M code review votes Data
Git example: commit 61ab0c46b09299c07e86320d612a0fcc281491b1 Author: Daniel Izquierdo <
[email protected]
> Date: Fri
Apr 28 00:04:05 2017 +0200 Add 'by default' values when eventizing Data
Tooling Original Data Sources Mining Tools Perceval @ GrimoireLab Info
Enrich. Genderize.io Ceres/ Pandas Jupyter Notebooks Manual work Viz ElasticSearch + Kibana
Results Original Data Sources • Git and Gerrit repos based
on yaml at Governance • ~ 1M commits • ~ 500K changesets • ~ 1.5M patchset uploads • ~ 1.8M patches code reviews
Results Mining Tools Perceval • At grimoirelab.github.io • Parses API’s,
logs, etc and produces JSON documents • Those are later stored in ElasticSearch
Results Info Enrich. Genderize.io Pandas Jupyter Notebooks Manual work •
Genderize.io: name database • Ceres: data analysis lib. to work with Perceval • Jupyter Notebook: web app. For data analysis • Manual work:
Results Viz ElasticSearch + Kibana • ElasticSearch: Schemaless db •
Kibana: works great with ES • This tandem helps a lot to verify info • Drill down capabilities
Demo OpenStack Diversity Dashboard (private access)
Women activity (last year): ~ 11% of the population (
~ 340 active developers ) ~ 9% of the activity ( >=6k commits ) OpenStack (Austin)
Women activity (last year): ~ 6.8% of the activity (
~ 4k commits ) ~ 9.9% of the population ( ~ 330 active developers ) Linux Kernel
Women activity (last year): ~2K commits (6.5% of the activity)
71 developers (8.5% of the population) Hadoop
Users It’s important to understand your potential users! C-level? Middle
management? Developers? Community? This study aims at understanding the current situation And look for best practices
Decisions based on data!
Bitergia Software Development Analytics for your peace of mind
Thanks! Daniel Izquierdo, CDO
[email protected]
@dizquierdo