Slide 1

Slide 1 text

A starter data science process for software engineers @IanOzsvald – ianozsvald.com Ian Ozsvald PyLondinium 2019

Slide 2

Slide 2 text

 Interim Chief Data Scientist  19+ years experience  Quickly build strategic data science plans  Team coaching & public courses Introductions By [ian]@ianozsvald[.com] Ian Ozsvald

Slide 3

Slide 3 text

 Numerate management ask good data-driven questions  You have suitable data  Well defined achievable outcomes are defined  Change is enabled by these projects Data Science shows value when... By [ian]@ianozsvald[.com] Ian Ozsvald

Slide 4

Slide 4 text

 What’s the driver? Is there a fire under it?  Joonatan’s example from PyDataLT – OCR  Cost/benefit estimate accepting uncertainty  Automatable Checking business need By [ian]@ianozsvald[.com] Ian Ozsvald

Slide 5

Slide 5 text

 States a clearly defined problem  Guesses at unknowns (and project torpedoes!)  Proposed milestones and Gold Standard/metrics  Clear “definition of done”  Story from 10 years back You need a Project Specification By [ian]@ianozsvald[.com] Ian Ozsvald

Slide 6

Slide 6 text

 Want to automate “MPG estimates” to help engineers  It only needs to be good enough for ranking, to assist the team in prioritising their investigations  We need to gain the team’s trust in stages  Pandas, sklearn, Yellowbrick, custom estimator A pretend example & live demo By [ian]@ianozsvald[.com] Ian Ozsvald

Slide 7

Slide 7 text

“Software Engineering for Data Scientists” - early July Resources By [ian]@ianozsvald[.com] Ian Ozsvald

Slide 8

Slide 8 text

 Your organisers are volunteers  Thank all volunteers & speakers please  Get a free signed book around 3.30pm Thank your organisers By [ian]@ianozsvald[.com] Ian Ozsvald

Slide 9

Slide 9 text

 Automate parts of a high value problem  Deliver value incrementally  Communicate early & often  Join my thoughts+jobs list for tips and my training list  Lots of past talks on ianozsvald.com Summary By [ian]@ianozsvald[.com] Ian Ozsvald