group (600 FTE) • 30 machine learning engineers, data scientists, data engineers • Commercial focus on Dutch top 200 or category leaders • Based in Amsterdam
app Dashboard / Reporting Traditional Business app API External API Logs Chat/transcripts Scraping Unstructured data Traditional operational data sources
to be able to participate in actually building production quality systems vs being proficient enough in R or python to hack together a prototype on a very small dataset • Supply of the second group keeps growing while demand is flat or shrinking • especially as executives get burned by “data scientists” who don't know how to help them build things of value
happening • We (GoDataDriven) are definitely only interested in these profiles (people who are already there, or that are getting there) • Many of our clients are in the same position
trouble hiring good technical people • The “IQ” test is not really representative of applied data science • At GoDataDriven we do a “at home, at your convenience” assessment • Real dataset, real business question, real product
hand with a Kaggle-like mentality: delivering the best model in absolute terms, no matter what the practical implications are. • In reality it's not the best model that we implement, but the one that combines quality and practicality. • Netflix competition
difficult parts • Easier to attract people with everchanging challenges • Easier to cross pollinate knowledge • Formalized and efficient process • Backlog out of the business • Prioritization is unclear • Maintenance and continuous improvement is done by different people
• Maintenance and improvement are (mostly) done by the same people • More difficult to have many good teams • Risk of reinventing the wheel everywhere • Sharing knowledge is not as easy • Lots of different standards • Difficult professionalization as the business has different priorities