Speaker Deck

What can you learn about thousands of Groovy source files in Github using BigQuery

by Guillaume Laforge

Published November 22, 2016 in Technology

Github and Google recently released a huge dataset on BigQuery about all the Open Source code contained on Github, thanks to the data from the Github Archive project. What can you learn from such a dataset? What about learning about the most frequently used dependencies and imports? Which version of Gradle is the most widespread? What are the most frequent test libraries used?

Guillaume will show you some concrete examples from the dataset, with examples applied to the Apache Groovy language, to learn more about our programming habits. And figure out on your own about your favorite language, web framework and build tool usage patterns!