Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Gender Diversity Analysis in OSS Projects
Search
Bitergia
June 29, 2017
Technology
0
31
Gender Diversity Analysis in OSS Projects
PyData Meetup, Madrid
Bitergia
June 29, 2017
Tweet
Share
More Decks by Bitergia
See All by Bitergia
Building and Supporting Open Source Communities through Metrics
bitergia
0
31
Defining the limits of Risk
bitergia
0
43
Present and Future of GrimoireLab
bitergia
0
33
InnerSource Commons
bitergia
0
77
Collaboration as Health Indicator
bitergia
0
93
La estrella de mi comunidad es un bot. ¿Dónde están los humanos?
bitergia
0
80
IoT Projects in FLOSS Foundations, a report based on community data
bitergia
0
86
Contributor Leaderboards to Incentivize Good Community Citizenship
bitergia
0
100
FreeScout: Cómo montar un departamento de soporte/atención al cliente con software libre
bitergia
0
290
Other Decks in Technology
See All in Technology
10XにおけるData Contractの導入について: Data Contract事例共有会
10xinc
6
660
Security-JAWS【第35回】勉強会クラウドにおけるマルウェアやコンテンツ改ざんへの対策
4su_para
0
180
100 名超が参加した日経グループ横断の競技型 AWS 学習イベント「Nikkei Group AWS GameDay」の紹介/mediajaws202411
nikkei_engineer_recruiting
1
170
SSMRunbook作成の勘所_20241120
koichiotomo
3
160
iOS/Androidで同じUI体験をネ イティブで作成する際に気をつ けたい落とし穴
fumiyasac0921
1
110
これまでの計測・開発・デプロイ方法全部見せます! / Findy ISUCON 2024-11-14
tohutohu
3
370
Oracle Cloud Infrastructureデータベース・クラウド:各バージョンのサポート期間
oracle4engineer
PRO
28
13k
OTelCol_TailSampling_and_SpanMetrics
gumamon
1
190
iOSチームとAndroidチームでブランチ運用が違ったので整理してます
sansantech
PRO
0
150
初心者向けAWS Securityの勉強会mini Security-JAWSを9ヶ月ぐらい実施してきての近況
cmusudakeisuke
0
130
OCI Vault 概要
oracle4engineer
PRO
0
9.7k
飲食店データの分析事例とそれを支えるデータ基盤
kimujun
0
140
Featured
See All Featured
jQuery: Nuts, Bolts and Bling
dougneiner
61
7.5k
Making the Leap to Tech Lead
cromwellryan
133
8.9k
We Have a Design System, Now What?
morganepeng
50
7.2k
The Illustrated Children's Guide to Kubernetes
chrisshort
48
48k
Learning to Love Humans: Emotional Interface Design
aarron
273
40k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
26
2.1k
A Tale of Four Properties
chriscoyier
156
23k
Rebuilding a faster, lazier Slack
samanthasiow
79
8.7k
How to train your dragon (web standard)
notwaldorf
88
5.7k
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
232
17k
Site-Speed That Sticks
csswizardry
0
27
Optimising Largest Contentful Paint
csswizardry
33
2.9k
Transcript
Gender Diversity Analysis in OSS Projects PyData Meetup 29th June,
2017 Madrid Daniel Izquierdo, CDO
[email protected]
@dizquierdo speakerdeck.com/bitergia
None
None
Tweet => 13% of people attending the OpenStack Summit were
women Tweet => How many of them are actually contributing to the source code? Intro
Gender diversity of the technical contributions in the OpenStack Project
Teams Problem
Goal-Question-Metric approach • Contextualize tech gender-diversity groups • Data sources
available for the analysis • Tooling • Results and Further Work How To
Governance -> Goals <- Questions <- Metrics Goal: Increase gender
diversity in the OpenStack Foundation Context
How’s performing the industry with this respect? Context
FOSS Survey in 2013: - 11% of women answered the
survey The Industry Gender Gap by the World Economic Forum. - 5% for CEOs, 21% for Mid-level roles, 32% of Junior roles Context
Some companies https://blog.pinterest.com/en/our -plan-more-diverse-pinterest http://www.google.com/diversity/ http://newsroom.fb.com/news/2015/0 6/driving-diversity-at-facebook/ https://blogs.dropbox.com/dropbox/2014/11/stren gthening-dropbox-through-diversity/
Question: How’s performing OpenStack? We need data!!!! Context
Git: https://git.openstack.org Gerrit: https://review.openstack.org/ Others: Mailing Lists, Launchpad, IRC… >1M
commits and > 1.5M code review votes Data
Git example: commit 61ab0c46b09299c07e86320d612a0fcc281491b1 Author: Daniel Izquierdo <
[email protected]
> Date: Fri
Apr 28 00:04:05 2017 +0200 Add 'by default' values when eventizing Data
Tooling Original Data Sources Mining Tools Perceval @ GrimoireLab Info
Enrich. Genderize.io Ceres/ Pandas Jupyter Notebooks Manual work Viz ElasticSearch + Kibana
Results Original Data Sources • Git and Gerrit repos based
on yaml at Governance • ~ 1M commits • ~ 500K changesets • ~ 1.5M patchset uploads • ~ 1.8M patches code reviews
Results Mining Tools Perceval • At grimoirelab.github.io • Parses API’s,
logs, etc and produces JSON documents • Those are later stored in ElasticSearch
Results Info Enrich. Genderize.io Pandas Jupyter Notebooks Manual work •
Genderize.io: name database • Ceres: data analysis lib. to work with Perceval • Jupyter Notebook: web app. For data analysis • Manual work:
Results Viz ElasticSearch + Kibana • ElasticSearch: Schemaless db •
Kibana: works great with ES • This tandem helps a lot to verify info • Drill down capabilities
Demo OpenStack Diversity Dashboard (private access)
Women activity (last year): ~ 11% of the population (
~ 340 active developers ) ~ 9% of the activity ( >=6k commits ) OpenStack (Austin)
Women activity (last year): ~ 6.8% of the activity (
~ 4k commits ) ~ 9.9% of the population ( ~ 330 active developers ) Linux Kernel
Women activity (last year): ~2K commits (6.5% of the activity)
71 developers (8.5% of the population) Hadoop
Users It’s important to understand your potential users! C-level? Middle
management? Developers? Community? This study aims at understanding the current situation And look for best practices
Decisions based on data!
Bitergia Software Development Analytics for your peace of mind
Thanks! Daniel Izquierdo, CDO
[email protected]
@dizquierdo