Slide 1

Slide 1 text

Data Science for Community Managers J. Manrique López de la Fuente @jsmanrique jsmanrique at bitergia dot com https://speakerdeck.com/bitergia Open Source Summit North America, Los Angeles (USA), Sep. 13th 2017

Slide 2

Slide 2 text

Introduction A bit about me

Slide 3

Slide 3 text

/me Hello, my name is Manrique and I am a community junkie Involved in: HPCC, AsturLiNUX, HispaLiNUX, GPE, Maemo, Meego, Gnome, GDG, Mozilla, ... Business, marketing & alliances developer in Bitergia, the software development analytics company

Slide 4

Slide 4 text

/bitergia Software Development Analytics for your peace of mind

Slide 5

Slide 5 text

Open Development Open Development Community Management Software Development Management Everybody to be Open

Slide 6

Slide 6 text

/everybody loves openness Everybody Open Development

Slide 7

Slide 7 text

/open source management Who is contributing* to open source software? How much/what/where are they contributing*? Who is driving successful projects? Who is behind an open source project? * Contributions can be made by code, issues, requests, etc.

Slide 8

Slide 8 text

/everybody loves community Everybody Community

Slide 9

Slide 9 text

/everybody loves community Everybody Community “Developers want to be part of something bigger; they want to contribute and influence where that technology is going” The Power of Community in Open Source, The Newstack

Slide 10

Slide 10 text

/community management Which projects are more community attractive? Are projects retaining contributors? Are projects community-driven or enterprise-driven? How diverse is your community? Who are projects top contributors? Where does my community come from? Where else my community contributes to?

Slide 11

Slide 11 text

/everybody loves collaboration Everybody Collaboration

Slide 12

Slide 12 text

/software development management How openness can be applied to any IT development? Inner Source Managing Inner Source Projects (bit.ly/managing-innersource) DevOps DevOps framework: CALMS (Culture/Collaboration, Automation, Lean, Measurement, and Sharing) Agile Agile Manifesto

Slide 13

Slide 13 text

/software development management How fast are we dealing with feedback (issues, requests, comments)? How long does it take to add new features? Are people engaged with the project/company? Is the project/company allowing innovation?

Slide 14

Slide 14 text

/questions

Slide 15

Slide 15 text

Open Development Analytics Let’s get answers! Why? Collecting Data Sharing Data Be aware Methodology

Slide 16

Slide 16 text

/why? Why do you need metrics / analytics? Transparency Awareness Governance support Motivational

Slide 17

Slide 17 text

/collecting_data

Slide 18

Slide 18 text

/collecting data nightmare Community Manager Nightmares Start

Slide 19

Slide 19 text

/GrimoireLab grimoirelab.github.io “Collecting data is only the first step toward wisdom, …” +30 different data sources Easy to extend

Slide 20

Slide 20 text

/GrimoireLab “Collecting data is only the first step toward wisdom, but sharing data is the first step toward community” Henry Louis Gates Jr., IBM/Linux Commercials grimoirelab.github.io

Slide 21

Slide 21 text

/GrimoireLab “Collecting data is only the first step toward wisdom, but sharing data is the first step toward community” Henry Louis Gates Jr., IBM/Linux Commercials Community Health Analytics Open Source Software chaoss.community grimoirelab.github.io

Slide 22

Slide 22 text

/demo time Let’s play!!

Slide 23

Slide 23 text

/data/activity

Slide 24

Slide 24 text

/data/community

Slide 25

Slide 25 text

/data/performance

Slide 26

Slide 26 text

/attraction-retention -network

Slide 27

Slide 27 text

/community/ dependency_KPIs Onion model ASF Pony factor Bitergia Elephant factor Bitergia Zapata factor Linux Kernel Zapata factor ~ 200 Bitergia United Fruit Company factor Linux Kernel UFCo factor ~ 10 Linux kernel ownership analysis: linux.biterg.io 7 core ~ 40 regular ~ 85 casual Pony factor: 1 Elephant factor: 2

Slide 28

Slide 28 text

/collecting data nightmare More Community Manager Nightmares

Slide 29

Slide 29 text

/be aware Metrics can be very dangerous

Slide 30

Slide 30 text

/danger_zone “Human beings adjust behavior based on the metrics they’re held against. Anything you measure will impel a person to optimize his score on that metric. What you measure is what you’ll get. Period”. You Are What You Measure by Dan Ariely

Slide 31

Slide 31 text

/methodology Governance -> Goals <- Questions <- Metrics 1. Develop a set of corporate, division and project business goals and associated measurement goals for productivity and quality 2. Generate questions (based on models) that define those goals as completely as possible in a quantifiable way 3. Specify the measures needed to be collected to answer those questions and track process and product conformance to the goals 4. Develop mechanisms for data collection 5. Collect, validate and analyze the data in real time to provide feedback to projects for corrective action 6. Analyze the data in a post mortem fashion to assess conformance to the goals and to make recommendations for future improvements

Slide 32

Slide 32 text

/be_nice! It’s about the people!

Slide 33

Slide 33 text

Use cases How people is using open development analytics CNCF & Kubernetes Puppet Inc. OpenStack Foundation Samsung Open Source Group Liferay Inc.

Slide 34

Slide 34 text

/CNCF Increase enterprises involvement in Open Source software development cncf.biterg.io

Slide 35

Slide 35 text

/CNCF/Kubernetes Reduce single company dependency

Slide 36

Slide 36 text

/CNCF/Kubernetes Reduce single company dependency

Slide 37

Slide 37 text

/Puppet_Inc. Increase participation beyond coding puppet.biterg.io

Slide 38

Slide 38 text

/Puppet_Inc. Increase participation beyond coding

Slide 39

Slide 39 text

/OpenStack/ Foundation

Slide 40

Slide 40 text

/Samsung/ Open_Source_Group Samsung OSG video from GrimoireCON NA 2017 http://bit.ly/samsung_osg_video

Slide 41

Slide 41 text

/Liferay_Inc. “Grimoire Lab gives us a well rounded picture of our overall community health to help determine the areas where we are doing good and the areas where we can help improve.” Jamie Sammons - Developer Advocate at Liferay Inc.

Slide 42

Slide 42 text

Join the Community grimoirelab.github.io How to start playing Just ask Cauldron

Slide 43

Slide 43 text

/start playing How to start playing...

Slide 44

Slide 44 text

/how_to_start $ pip install perceval … $ pip install grimoire-elk …

Slide 45

Slide 45 text

/where_to_ask GrimoireLab Training Tutorial www.gitbook.com/book/grimoirelab/grimoirelab-training GrimoireLab Mailing List lists.linuxfoundation.org/mailman/listinfo/grimoirelab-discussions GrimoireLab IRC Channel #GrimoireLab in Freenode.net

Slide 46

Slide 46 text

/bonus The Cauldron GitHub organization or user analysis Latest 30 active repos per organization analysis FREE cauldron.io BETA

Slide 47

Slide 47 text

/data driven community management Your project & community data are already there…

Slide 48

Slide 48 text

Take care of your project! grimoirelab.github.io chaoss.community [email protected]