Program Properties from “Big Code”’ by 'Veselin Raychev, Martin Vechev, Andreas Krause, POPL’15 • http://www.srl.inf.ethz.ch/papers/jsnice15.pdf • Bridge Software Analysis and Machine Learning
software analysis and big data analytics to improve the way software is built, debugged and verified. Big Code Initiative http://www.datanami.com/2014/05/05/darpa-launches-big-code-initiative/
alien observing earth • You want to learn about the top programming language on earth • Of course that’s JavaScript! • Your task: How is ‘writeFileSync’ used? • Context, Argument names, Argument types • Talking to humans complicated, but <3 analysing data!
math) • Pattern recognition art in Machine Learning • How to represent program “elements” • Idea: • Model program as dependency graph • Find most likely assignments
indexed js files: 324’501 • ~ 7’000’000 features for names • ~ 70’000 features for names ➡ 10 h / 1 h to learn on 32 core Xeon machine • But have to do this only once, reuse results