Inside Darwin Analytics

Jelmer Snoeck
February 21, 2013

A small look into the development side of Darwin Analytics.

Notes: https://gist.github.com/jelmersnoeck/5005575

Transcript

  1. MORE INSIGHTS, LESS METRICS.

  2. The tree has
    all the answers.

  3. data sources

  4. data sources
    magic

  5. data sources
    magic
    profit++

  6. Overview & insights
    Advice
    Future

  7. Overview & insights
    Advice
    Future

  8. Overview & insights
    Advice
    Future

  9. Answers to what?

  10. Which channel is most effective
    in reaching your marketing objectives?
    Effect

  11. Which channel is most cost-efficient
    in reaching your marketing objectives?
    Cost

  12. How am I evolving towards
    my marketing objectives?
    Evolution

  13. Answers for who?

  14. Third party
    connections
    data sources

  15. Business logic layer
    magic

  16. Harvesters
    Business logic layer
    Processors

  17. Business logic layer
    Processors
    Harvesters

  18. Business logic layer
    Processors
    Harvesters

  19. Business logic layer
    Processors
    Harvesters

  20. Business logic layer
    Processors
    Harvesters

  21. Business logic layer
    Processors
    Harvesters

  22. Presentation layer
    profit

  23. Future: API layer

  24. “#TDD is the process of describing what code
    should do before writing it. Saying that you
    can’t do that is saying that you can’t think!”
    ~ @everzet

  25. Fail fast, succeed faster.
    130+ tests, 350+ assertions
    Still too low (no functional tests)

  26. “Ink is better than memory”
    phpDocumentor 2
    16k+ ncloc
    8k+ cloc

  27. Vagrant
    dependencies

  28. Vagrant
    dependencies
    easy configuration

  29. Parallelization

  30. Parallelization
    Multiprocessing

  31. Parallelization
    Multiprocessing
    Streaming

  32. Parallelization
    Multiprocessing
    Streaming
    Message Queue

  33. Machine learning

  34. Build server

  35. Whoops, there’s my exit!
    @jelmersnoeck
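
Editor's sketch: slides 16 to 21 name harvesters and processors, and slide 32 names a message queue, but the slides do not show how the pieces connect. The Python below is one possible reading, not Darwin Analytics' actual code. It assumes that harvesters push raw records onto a queue and that processors consume them and total the cost per channel. Every name (`harvest`, `process`, `run_pipeline`) and all sample data are hypothetical, and an in-process `queue.Queue` with threads stands in for a real message queue and separate worker processes.

```python
import queue
import threading

SENTINEL = None  # tells a processor that no more records will arrive


def harvest(source, out_queue):
    """Hypothetical harvester: push every raw record from one data source."""
    for record in source:
        out_queue.put(record)


def process(in_queue, results, lock):
    """Hypothetical processor: consume records and total the cost per channel."""
    while True:
        record = in_queue.get()
        if record is SENTINEL:
            break
        channel, cost = record
        with lock:  # results is shared between processor threads
            results[channel] = results.get(channel, 0) + cost


def run_pipeline(sources, worker_count=2):
    """Run one harvester per source and worker_count processors."""
    records = queue.Queue()
    results = {}
    lock = threading.Lock()

    workers = [
        threading.Thread(target=process, args=(records, results, lock))
        for _ in range(worker_count)
    ]
    for worker in workers:
        worker.start()

    harvesters = [
        threading.Thread(target=harvest, args=(source, records))
        for source in sources
    ]
    for harvester in harvesters:
        harvester.start()
    for harvester in harvesters:
        harvester.join()

    # All records are queued; send one stop signal per processor.
    for _ in workers:
        records.put(SENTINEL)
    for worker in workers:
        worker.join()
    return results


if __name__ == "__main__":
    # Made-up (channel, cost) records from two imaginary data sources.
    demo_sources = [
        [("search", 10), ("social", 5)],
        [("search", 2), ("email", 1)],
    ]
    print(run_pipeline(demo_sources))
```

Putting a queue between the two stages lets harvesters and processors scale independently, which is the usual motivation for the message queue step named on slide 32.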
