Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
PyData ATL Lightning Talk
Search
Will McGinnis
June 17, 2016
Programming
0
290
PyData ATL Lightning Talk
https://www.github.com/wdm0006/git-pandas
Will McGinnis
June 17, 2016
Tweet
Share
More Decks by Will McGinnis
See All by Will McGinnis
I made a model, now what?
wdm0006
0
250
Encoding categorical variables with categorical encoders
wdm0006
0
410
Other Decks in Programming
See All in Programming
Behind VS Code Extensions for JavaScript / TypeScript Linnting and Formatting
unvalley
5
1k
From Spring Boot 2 to Spring Boot 3 with Java 21 and Jakarta EE
ivargrimstad
0
190
GitHub Actionsで泣かないためにやっておきたい設定 / Recommended GHA settings to avoid crying
pinkumohikan
3
540
TCAとKMPを用いた新規動画配信アプリ 「ABEMA Live」の設計
tomu28
1
120
単体テストを書かない技術 #phpcon_odawara
o0h
PRO
27
8.3k
Blue/Greenデプロイの導入による 運用フローの改善
kudoas
1
390
PHP8.3の機能を振り返る / Review of PHP 8.3 features
seike460
PRO
1
110
スキーマ駆動開発による品質とスピードの両立 - 私達は何故、スキーマを書くのか
kentaroutakeda
0
170
ゆるい個人開発のススメ
kuroppe1819
10
1k
SIMD Parallel Programming with the Vector API
josepaumard
0
200
Tailwind CSSを本気でカスタマイズする方法
fsubal
14
5.3k
2 週間で Twitter Bot を作ってみた
contour_gara
0
630
Featured
See All Featured
Designing for humans not robots
tammielis
248
25k
Building Better People: How to give real-time feedback that sticks.
wjessup
355
18k
[Rails World 2023 - Day 1 Closing Keynote] - The Magic of Rails
eileencodes
2
1.3k
The Mythical Team-Month
searls
216
42k
Building an army of robots
kneath
300
41k
10 Git Anti Patterns You Should be Aware of
lemiorhan
648
58k
RailsConf 2023
tenderlove
4
540
Ruby is Unlike a Banana
tanoku
96
10k
Building Your Own Lightsaber
phodgson
99
5.7k
Typedesign – Prime Four
hannesfritz
36
2.1k
[RailsConf 2023] Rails as a piece of cake
palkan
23
4k
How to Ace a Technical Interview
jacobian
272
22k
Transcript
Analyzing Git/ Github data with git-pandas (or: vanity metrics at
scale)
Who am I? • Will McGinnis • Write code at
Predikto (we’re hiring) • www.predikto.com • twitter.com/willmcginnis • github.com/wdm0006
What is git-pandas? • Open source library: https://github.com/wdm0006/git-pandas • Represents
git data as pandas dataframes • Abstracts groups of repos into an object called a ProjectDirectory • Does some common analysis tasks for you
Org-wide Punchcards
Cumulative Blame
Estimate Code Quality • “file owner” • metric for refactors
• how long will an owner’s file go without being refactored?
GitNOC
None
None
Other things • Bus factors (for files, repos and orgs)
• Development time estimation • Rate of change metrics (risk) • Basic datasets (commit history, file changes, branches, tags, etc.) • File owners • File details • and more: https://github.com/wdm0006/git- pandas