raw text is boring, hinders new projects • Help non-programmers investigate dirty data • Keynote “The real unsolved problems in data science” • What tooling exists? • Can't the machine do this for us?
Write-ups: http://ianozsvald.com/ • http://annotate.io/ announce email list & working demo (Python 2.7&3.4) • Tell me • what data do you want to clean? • where have you lost time cleaning before?