source • installed Linux • 2001 • wrote a HOWTO for the Linux Documentation Project • contributor to the KDE 3.x/Kaffeine media player • 2003 • 2006 • leading the development of Alitheia Core • 2008 • Google Summer of Code participant • founding member, Greek OSS society • 2008 • 2010 • work on OSS cloud infrastructures • 2011 • started the GHTorrent project
dump • the most refined software engineering dataset at the time • supported by an EC FP6 project • 6 partners • ~20 publications • 4 PhDs, mine included, funded
mining challenge • 40% of all papers on GitHub (Cosentino et al. 2016) • many best paper awards • used at: Microsoft, Deloitte, Black Duck • received funding from: Microsoft, Google • GHTorrent impact
research target! • true, but so was SourceForge when Alitheia Core analysed it • (Alitheia Core was) not invented here! • true, but GHTorrent was of worse quality when available • I don’t want to invest time in your infrastructure! • true, but you still do it with GHTorrent (ok, less)
MIT for source code • CC-BY-SA for data and other materials • choose a platform • GitHub for src • Zenodo for data, gives a DOI! • SlideShare or Speaker Deck for slides • figshare, pure.tudelft.nl or your site for papers
Viable Product • what is the least possible amount of work that will make sense to somebody else? • work in iterations • open, gather feedback, improve, repeat • Embrace the “Hacker Way”
have created something worth stealing! • if someone invests time in stealing: • what you created is great • you have a head start • if nobody invests time in stealing: • is what you created worth your time/effort? • is your research relevant? good artists copy; great artists steal