Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Gender Diversity Analysis in OSS Projects
Search
Bitergia
June 29, 2017
Technology
0
32
Gender Diversity Analysis in OSS Projects
PyData Meetup, Madrid
Bitergia
June 29, 2017
Tweet
Share
More Decks by Bitergia
See All by Bitergia
Building and Supporting Open Source Communities through Metrics
bitergia
0
50
Defining the limits of Risk
bitergia
0
63
Present and Future of GrimoireLab
bitergia
0
59
InnerSource Commons
bitergia
0
92
Collaboration as Health Indicator
bitergia
0
120
La estrella de mi comunidad es un bot. ¿Dónde están los humanos?
bitergia
0
96
IoT Projects in FLOSS Foundations, a report based on community data
bitergia
0
110
Contributor Leaderboards to Incentivize Good Community Citizenship
bitergia
0
120
FreeScout: Cómo montar un departamento de soporte/atención al cliente con software libre
bitergia
0
380
Other Decks in Technology
See All in Technology
スマートファクトリーの第一歩 〜AWSマネージドサービスで 実現する予知保全と生成AI活用まで
ganota
0
120
[ JAWS-UG 東京 CommunityBuilders Night #2 ]SlackとAmazon Q Developerで 運用効率化を模索する
sh_fk2
1
220
ヘブンバーンズレッドのレンダリングパイプライン刷新
gree_tech
PRO
0
590
実践!カスタムインストラクション&スラッシュコマンド
puku0x
0
150
【初心者向け】ローカルLLMの色々な動かし方まとめ
aratako
7
3.3k
機械学習を扱うプラットフォーム開発と運用事例
lycorptech_jp
PRO
0
200
オブザーバビリティが広げる AIOps の世界 / The World of AIOps Expanded by Observability
aoto
PRO
0
310
Nstockの一人目エンジニアが 3年間かけて向き合ってきた セキュリティのこととこれから〜あれから半年〜
yo41sawada
0
210
Webブラウザ向け動画配信プレイヤーの 大規模リプレイスから得た知見と学び
yud0uhu
0
210
なぜテストマネージャの視点が 必要なのか? 〜 一歩先へ進むために 〜
moritamasami
0
170
クラウドセキュリティを支える技術と運用の最前線 / Cutting-edge Technologies and Operations Supporting Cloud Security
yuj1osm
2
290
「魔法少女まどか☆マギカ Magia Exedra」のグローバル展開を支える、開発チームと翻訳チームの「意識しない協創」を実現するローカライズシステム
gree_tech
PRO
0
580
Featured
See All Featured
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
285
13k
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
32
1.5k
Building Better People: How to give real-time feedback that sticks.
wjessup
368
19k
Large-scale JavaScript Application Architecture
addyosmani
512
110k
Imperfection Machines: The Place of Print at Facebook
scottboms
268
13k
Build The Right Thing And Hit Your Dates
maggiecrowley
37
2.8k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
How GitHub (no longer) Works
holman
315
140k
Designing Experiences People Love
moore
142
24k
The Cult of Friendly URLs
andyhume
79
6.6k
Building a Scalable Design System with Sketch
lauravandoore
462
33k
What’s in a name? Adding method to the madness
productmarketing
PRO
23
3.6k
Transcript
Gender Diversity Analysis in OSS Projects PyData Meetup 29th June,
2017 Madrid Daniel Izquierdo, CDO
[email protected]
@dizquierdo speakerdeck.com/bitergia
None
None
Tweet => 13% of people attending the OpenStack Summit were
women Tweet => How many of them are actually contributing to the source code? Intro
Gender diversity of the technical contributions in the OpenStack Project
Teams Problem
Goal-Question-Metric approach • Contextualize tech gender-diversity groups • Data sources
available for the analysis • Tooling • Results and Further Work How To
Governance -> Goals <- Questions <- Metrics Goal: Increase gender
diversity in the OpenStack Foundation Context
How’s performing the industry with this respect? Context
FOSS Survey in 2013: - 11% of women answered the
survey The Industry Gender Gap by the World Economic Forum. - 5% for CEOs, 21% for Mid-level roles, 32% of Junior roles Context
Some companies https://blog.pinterest.com/en/our -plan-more-diverse-pinterest http://www.google.com/diversity/ http://newsroom.fb.com/news/2015/0 6/driving-diversity-at-facebook/ https://blogs.dropbox.com/dropbox/2014/11/stren gthening-dropbox-through-diversity/
Question: How’s performing OpenStack? We need data!!!! Context
Git: https://git.openstack.org Gerrit: https://review.openstack.org/ Others: Mailing Lists, Launchpad, IRC… >1M
commits and > 1.5M code review votes Data
Git example: commit 61ab0c46b09299c07e86320d612a0fcc281491b1 Author: Daniel Izquierdo <
[email protected]
> Date: Fri
Apr 28 00:04:05 2017 +0200 Add 'by default' values when eventizing Data
Tooling Original Data Sources Mining Tools Perceval @ GrimoireLab Info
Enrich. Genderize.io Ceres/ Pandas Jupyter Notebooks Manual work Viz ElasticSearch + Kibana
Results Original Data Sources • Git and Gerrit repos based
on yaml at Governance • ~ 1M commits • ~ 500K changesets • ~ 1.5M patchset uploads • ~ 1.8M patches code reviews
Results Mining Tools Perceval • At grimoirelab.github.io • Parses API’s,
logs, etc and produces JSON documents • Those are later stored in ElasticSearch
Results Info Enrich. Genderize.io Pandas Jupyter Notebooks Manual work •
Genderize.io: name database • Ceres: data analysis lib. to work with Perceval • Jupyter Notebook: web app. For data analysis • Manual work:
Results Viz ElasticSearch + Kibana • ElasticSearch: Schemaless db •
Kibana: works great with ES • This tandem helps a lot to verify info • Drill down capabilities
Demo OpenStack Diversity Dashboard (private access)
Women activity (last year): ~ 11% of the population (
~ 340 active developers ) ~ 9% of the activity ( >=6k commits ) OpenStack (Austin)
Women activity (last year): ~ 6.8% of the activity (
~ 4k commits ) ~ 9.9% of the population ( ~ 330 active developers ) Linux Kernel
Women activity (last year): ~2K commits (6.5% of the activity)
71 developers (8.5% of the population) Hadoop
Users It’s important to understand your potential users! C-level? Middle
management? Developers? Community? This study aims at understanding the current situation And look for best practices
Decisions based on data!
Bitergia Software Development Analytics for your peace of mind
Thanks! Daniel Izquierdo, CDO
[email protected]
@dizquierdo