Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Patsy (PyData Berlin)

Patsy (PyData Berlin)

PyData Berlin / Canada Day, 2017 at 11:45am - 12:30pm

Max Humber

July 01, 2017
Tweet

More Decks by Max Humber

Other Decks in Programming

Transcript

  1. @maxhumber
    Canada Day, 2017

    View Slide

  2. patsy

    View Slide

  3. View Slide

  4. r python patsy
    intro
    data
    motivation
    ??
    summary

    View Slide

  5. View Slide

  6. View Slide

  7. View Slide

  8. View Slide

  9. View Slide

  10. View Slide

  11. View Slide

  12. View Slide

  13. View Slide

  14. View Slide

  15. type A type b

    View Slide

  16. View Slide

  17. View Slide

  18. View Slide

  19. patsy

    View Slide

  20. View Slide

  21. View Slide

  22. 1 0

    View Slide

  23. View Slide

  24. create data

    View Slide

  25. View Slide

  26. View Slide

  27. View Slide

  28. View Slide

  29. View Slide

  30. r python patsy

    View Slide

  31. View Slide

  32. View Slide

  33. View Slide

  34. View Slide

  35. wait for it

    View Slide

  36. View Slide

  37. View Slide

  38. View Slide

  39. View Slide

  40. View Slide

  41. View Slide

  42. View Slide

  43. View Slide

  44. View Slide

  45. View Slide

  46. View Slide

  47. let’s try again

    View Slide

  48. View Slide

  49. View Slide

  50. View Slide

  51. View Slide

  52. View Slide

  53. View Slide

  54. View Slide

  55. View Slide

  56. View Slide

  57. View Slide

  58. View Slide

  59. not the only one …

    View Slide

  60. View Slide

  61. View Slide

  62. View Slide

  63. View Slide

  64. View Slide

  65. Patsy is a Python library for describing statistical
    models (especially linear models, or models that have a
    linear component) and building design matrices. Patsy
    brings the convenience of R "formulas" to Python.
    https://github.com/pydata/patsy

    View Slide

  66. pip install patsy

    View Slide

  67. View Slide

  68. View Slide

  69. View Slide

  70. View Slide

  71. View Slide

  72. View Slide

  73. View Slide

  74. View Slide


  75. View Slide

  76. View Slide

  77. View Slide

  78. View Slide

  79. View Slide

  80. View Slide


  81. View Slide

  82. View Slide

  83. View Slide

  84. Beer: 1.5, Warm: -0.5, Family: Yes

    View Slide

  85. Beer: -0.9, Warm: 0.3, Family: No

    View Slide

  86. View Slide

  87. https://patsy.readthedocs.io/en/latest/quickstart.html

    View Slide

  88. https://patsy.readthedocs.io/en/latest/quickstart.html

    View Slide

  89. View Slide

  90. Our goal is to make Patsy the de facto standard for
    describing models in Python, regardless of the
    underlying package in use – just as formulas are the
    standard interface to all R packages. Therefore we’ve
    tried to make it as easy as possible for you to build
    Patsy support into your libraries.
    https://patsy.readthedocs.io/en/latest/library-developers.html

    View Slide

  91. View Slide

  92. View Slide

  93. PyMC3 is a Python package for Bayesian statistical
    modeling and Probabilistic Machine Learning which
    focuses on advanced Markov chain Monte Carlo and
    variational fitting algorithms. Its flexibility
    and extensibility make it applicable to a large
    suite of problems.
    https://pymc-devs.github.io/pymc3/

    View Slide

  94. View Slide

  95. View Slide

  96. View Slide

  97. summary

    View Slide

  98. View Slide

  99. patsy

    View Slide

  100. thanks!
    NJS
    https://vorpus.org/

    View Slide

  101. maxhumber

    View Slide

  102. maxhumber
    let’s talk let’s celebrate

    View Slide