Slide 1

Slide 1 text

Demographics of Linux kernel developers how old are they? (work in progress) Jesus M. Gonzalez-Barahona [email protected] http://identi.ca/jgbarah http://twitter.com/jgbarah Bitergia GSyC/LibreSoft (Universidad Rey Juan Carlos) SOS-Evol, Brussels, February 1st, 2013 Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 1 / 20

Slide 2

Slide 2 text

c 2013 Bitergia Some rights reserved. This presentation is distributed under the “Attribution-ShareAlike 3.0” license, by Creative Commons, available at http://creativecommons.org/licenses/by-sa/3.0/ Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 2 / 20

Slide 3

Slide 3 text

Approach General question: “Can we learn about sustainability by looking at the history?” Specific question: “Can we characterize the evolution of the development community by looking at the git repository?” Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 3 / 20

Slide 4

Slide 4 text

The Linux kernel case large project large community migrated to git on 2005 (about 8 years of history) How much “new blood” is entering the project? How many “old people” are retiring? Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 4 / 20

Slide 5

Slide 5 text

Methodology Get git commit records into a database (CVSAnalY) Find out unique developers (merge developers with several identities) For several moments (yearly) in the history of the project: We consider “time in project” as “age” Calculate and plot demographic pyramid Generations calculated as “entered in the same period” Periods are of three months Analyze results Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 5 / 20

Slide 6

Slide 6 text

Methodology (unique developers) Identities as “Name” (name), “email address” (eaddress) Remove “???”, “” , etc. Duplicates: Name in eaddress (probably name is an email address) Email address in eaddress (same eadress, different names) From 9,952 entries to 9,101 “unique” developers (seems good enough, by manual inspection of random sample) Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 6 / 20

Slide 7

Slide 7 text

Methodology (finding intervals) SELECT upeople.uid as id, people.name as name, MIN(scmlog.date) as firstdatestr, MAX(scmlog.date) as lastdatestr FROM scmlog, people, upeople where scmlog.author_id = upeople.id AND people.id = upeople.id group by upeople.uid" Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 7 / 20

Slide 8

Slide 8 text

October 2007 Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 8 / 20

Slide 9

Slide 9 text

October 2008 Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 9 / 20

Slide 10

Slide 10 text

October 2009 Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 10 / 20

Slide 11

Slide 11 text

October 2010 Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 11 / 20

Slide 12

Slide 12 text

October 2011 Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 12 / 20

Slide 13

Slide 13 text

October 2012 Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 13 / 20

Slide 14

Slide 14 text

All together (dodged) Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 14 / 20

Slide 15

Slide 15 text

All together (faceted) Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 15 / 20

Slide 16

Slide 16 text

All together (superposed) Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 16 / 20

Slide 17

Slide 17 text

Conclusions Generations are smaller and smaller from about 100-150 to 30-50 per quarter Older generations are disappearing Last generations quite smaller now than they were six years ago Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 17 / 20

Slide 18

Slide 18 text

Conclusions Does this mean something? (This is still work in progress) Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 18 / 20

Slide 19

Slide 19 text

Bitergia: a start-up on free software metrics Started operations in July 2012 Builds on the experience of LibreSoft R&D group Offering professional products and services Focused on: Metrics about software developent (including community metrics) Specialized support for development forges (including metrics for projects) http://bitergia.com http://blog.bitergia.com http://libresoft.es Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 19 / 20

Slide 20

Slide 20 text

This is the end Have you learned something useful? [I would love to know what interested you the most] [...and the least] Jesus Gonzalez-Barahona (Bitergia) Demographics of Linux kernel developers SOS-Evol 2013 20 / 20