Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Gender Diversity Analysis in OSS Projects
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
Bitergia
June 29, 2017
Technology
40
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Gender Diversity Analysis in OSS Projects
PyData Meetup, Madrid
Bitergia
June 29, 2017
More Decks by Bitergia
See All by Bitergia
Building and Supporting Open Source Communities through Metrics
bitergia
0
82
Defining the limits of Risk
bitergia
0
120
Present and Future of GrimoireLab
bitergia
0
110
InnerSource Commons
bitergia
0
120
Collaboration as Health Indicator
bitergia
0
150
La estrella de mi comunidad es un bot. ¿Dónde están los humanos?
bitergia
0
130
IoT Projects in FLOSS Foundations, a report based on community data
bitergia
0
150
Contributor Leaderboards to Incentivize Good Community Citizenship
bitergia
0
140
FreeScout: Cómo montar un departamento de soporte/atención al cliente con software libre
bitergia
0
600
Other Decks in Technology
See All in Technology
徹底討論!ECS vs EKS!
daitak
3
1.8k
GitHub Copilot運用のリアル ~AI Credit時代にどう向き合うか~
takafumisu2uk1
0
490
Agile and AI Redmine Japan 2026
hiranabe
4
500
AIAU_UMEMOGU_ninomiya_slide
ninomiya_ii
0
280
週末にループ・エンジニアリングの理解を深めるためのスライド
nagatsu
0
590
スタートアップにAmazon EKSは早すぎる? マルチプロダクト戦略を加速する Platform Engineeringの実践 / Is Amazon EKS Too Soon for Startups? Practical Platform Engineering to Accelerate a Multi-Product Strategy
elmodev09
1
1.9k
「勝手に広まる」人気 AI エージェントを爆速で作ろう!(AWS Summit Japan 2026講演資料)
minorun365
PRO
10
2.6k
AWS Security Hub CSPMの成功・失敗体験
cmusudakeisuke
0
590
AWS Summit の片隅で、体育座りしながらコミュニティがにぎわう理由を考えた
k_adachi_01
2
240
脱SaaS!FDEを支えるプロビジョニングと分離設計
knih
0
300
起点・思考・出力で分解する 〜PM業務の自動化設計〜
kazu_kichi_67
2
1.1k
CVE-2026-20833_脆弱性対応とAES 化について
jukishiya
0
140
Featured
See All Featured
The Cost Of JavaScript in 2023
addyosmani
55
10k
Google's AI Overviews - The New Search
badams
0
1k
Leadership Guide Workshop - DevTernity 2021
reverentgeek
1
310
Imperfection Machines: The Place of Print at Facebook
scottboms
270
14k
Building an army of robots
kneath
306
46k
What does AI have to do with Human Rights?
axbom
PRO
1
2.2k
WENDY [Excerpt]
tessaabrams
11
38k
Building a Scalable Design System with Sketch
lauravandoore
463
34k
Music & Morning Musume
bryan
47
7.2k
Taking LLMs out of the black box: A practical guide to human-in-the-loop distillation
inesmontani
PRO
3
2.3k
Speed Design
sergeychernyshev
33
1.9k
Skip the Path - Find Your Career Trail
mkilby
1
150
Transcript
Gender Diversity Analysis in OSS Projects PyData Meetup 29th June,
2017 Madrid Daniel Izquierdo, CDO
[email protected]
@dizquierdo speakerdeck.com/bitergia
None
None
Tweet => 13% of people attending the OpenStack Summit were
women Tweet => How many of them are actually contributing to the source code? Intro
Gender diversity of the technical contributions in the OpenStack Project
Teams Problem
Goal-Question-Metric approach • Contextualize tech gender-diversity groups • Data sources
available for the analysis • Tooling • Results and Further Work How To
Governance -> Goals <- Questions <- Metrics Goal: Increase gender
diversity in the OpenStack Foundation Context
How’s performing the industry with this respect? Context
FOSS Survey in 2013: - 11% of women answered the
survey The Industry Gender Gap by the World Economic Forum. - 5% for CEOs, 21% for Mid-level roles, 32% of Junior roles Context
Some companies https://blog.pinterest.com/en/our -plan-more-diverse-pinterest http://www.google.com/diversity/ http://newsroom.fb.com/news/2015/0 6/driving-diversity-at-facebook/ https://blogs.dropbox.com/dropbox/2014/11/stren gthening-dropbox-through-diversity/
Question: How’s performing OpenStack? We need data!!!! Context
Git: https://git.openstack.org Gerrit: https://review.openstack.org/ Others: Mailing Lists, Launchpad, IRC… >1M
commits and > 1.5M code review votes Data
Git example: commit 61ab0c46b09299c07e86320d612a0fcc281491b1 Author: Daniel Izquierdo <
[email protected]
> Date: Fri
Apr 28 00:04:05 2017 +0200 Add 'by default' values when eventizing Data
Tooling Original Data Sources Mining Tools Perceval @ GrimoireLab Info
Enrich. Genderize.io Ceres/ Pandas Jupyter Notebooks Manual work Viz ElasticSearch + Kibana
Results Original Data Sources • Git and Gerrit repos based
on yaml at Governance • ~ 1M commits • ~ 500K changesets • ~ 1.5M patchset uploads • ~ 1.8M patches code reviews
Results Mining Tools Perceval • At grimoirelab.github.io • Parses API’s,
logs, etc and produces JSON documents • Those are later stored in ElasticSearch
Results Info Enrich. Genderize.io Pandas Jupyter Notebooks Manual work •
Genderize.io: name database • Ceres: data analysis lib. to work with Perceval • Jupyter Notebook: web app. For data analysis • Manual work:
Results Viz ElasticSearch + Kibana • ElasticSearch: Schemaless db •
Kibana: works great with ES • This tandem helps a lot to verify info • Drill down capabilities
Demo OpenStack Diversity Dashboard (private access)
Women activity (last year): ~ 11% of the population (
~ 340 active developers ) ~ 9% of the activity ( >=6k commits ) OpenStack (Austin)
Women activity (last year): ~ 6.8% of the activity (
~ 4k commits ) ~ 9.9% of the population ( ~ 330 active developers ) Linux Kernel
Women activity (last year): ~2K commits (6.5% of the activity)
71 developers (8.5% of the population) Hadoop
Users It’s important to understand your potential users! C-level? Middle
management? Developers? Community? This study aims at understanding the current situation And look for best practices
Decisions based on data!
Bitergia Software Development Analytics for your peace of mind
Thanks! Daniel Izquierdo, CDO
[email protected]
@dizquierdo