Battle tested observations on ways to improve the likelihood that your data science project goes smoothly and gets delivered correctly. Given at the inaugural PyDataPrague.
us more [money|signups|...]” - desire for magic • Desire over actual need – vanity projects • Lack of technical leadership – poor specs • Bad data – lies, mistakes, confusion • Lack of client buy-in
clearly defined problem • Where are the unknowns? • Known unknowns • What might kill the project? • Propose milestones • Where’s your Gold Standard data set? • What’s your “definition of done” • Minimal results and great results • Appropriate metrics to communicate results
your data? • Explain your data – what does it say? • What’s good and what’s bad? • What are the relationships? • Where is the signal in the data? • Export your Notebook as html artefact • Data Story proposed by Bertil (Medium)