200 people • Data munging: R 60%, Python 50%, SQL 40%, Hadoop (mostly Hive) 30%, Unix shell 20%, Excel 10% + Perl, Matlab, SAS, Impala, Pig, Shark... • Visualization: R 40%,Python 30%, Tableau 10%, Javascript 10% + Matlab, Excel... • Machine learning/modeling: R 30%, Python 30% + Vowpal Wabbit, Matlab, Mahout, SAS, SPSS... http://bit.ly/datasc-tools-survey many other surveys, but...