Web scraping for data scientists

Irio Musskopf

May 24, 2016

Transcript

  1. Web scraping for data scientists. Irio Musskopf, Data Science Retreat

  2. Finding data: not always easy

  3. 1. Downloadable dataset

  4. 2. APIs

  5. 3. Scraping

  6. 4. Talk with other companies

  7. 5. Produce it yourself

  8. Doesn’t matter how complex the system is. It is possible.

  9. Doesn’t matter how complex the system is. It is possible.

    Unless there’s a captcha.
  11. DEMO

  12. Selectors, limitations, user agents, proxies

  13. Thanks. Irio Musskopf, iirineu@gmail.com
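
Slide 12 names selectors, user agents and proxies without showing them. The sketch below is one possible illustration using only the Python standard library; it is not the code from the talk's demo. The user agent string, the proxy address, the sample markup and the helper names (`make_opener`, `build_request`, `extract_links`) are all hypothetical. The selector is an XPath-style expression evaluated by `xml.etree.ElementTree`, which only accepts well-formed markup, so a real page would likely need a more forgiving HTML parser.

```python
import urllib.request
import xml.etree.ElementTree as ET

# Hypothetical values, chosen only for illustration.
USER_AGENT = "example-scraper/0.1 (contact: you@example.com)"
PROXY = {"http": "http://127.0.0.1:8080"}  # assumed local proxy address


def make_opener(proxy=None):
    """Return an opener that optionally routes requests through a proxy."""
    handlers = [urllib.request.ProxyHandler(proxy)] if proxy else []
    return urllib.request.build_opener(*handlers)


def build_request(url):
    """Attach an explicit User-Agent header to the request."""
    return urllib.request.Request(url, headers={"User-Agent": USER_AGENT})


def extract_links(markup):
    """Select every <a class="item"> element with an XPath-style selector."""
    root = ET.fromstring(markup)
    return [(a.text, a.get("href")) for a in root.findall(".//a[@class='item']")]


# Well-formed sample markup standing in for a downloaded page.
SAMPLE = """<html><body>
  <a class="item" href="/data/1">First</a>
  <a class="nav" href="/about">About</a>
  <a class="item" href="/data/2">Second</a>
</body></html>"""

request = build_request("http://example.com/")
opener = make_opener(PROXY)
links = extract_links(SAMPLE)
print(links)
```

To fetch a real page, `opener.open(request)` would send the request with the header and proxy configured above; the sketch stops short of that so it runs without network access.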