Upgrade to Pro — share decks privately, control downloads, hide ads and more …

PythonによるSNSクローリングと自然言語処理を用いたデータ分析・可視化

Sponsored · Your Podcast. Everywhere. Effortlessly. Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
Avatar for knsk knsk
September 12, 2018

 PythonによるSNSクローリングと自然言語処理を用いたデータ分析・可視化

Avatar for knsk

knsk

September 12, 2018
Tweet

Other Decks in Technology

Transcript

  1. എܠ • FiNCTI -;>500,LB )%!+ • [PS] ')#42?<Q1: R8(") 

    • SNS=H.9KENO/F 0A 5 M3   • CJG*#G$+ 6@ 7+&D  ໺ࡊιϜϦΤ  ਓ 'J/$ఏܞܖ໿ ΠϯϑϧΤϯαʔ ʢ'J/$Ξϯόαμʔʣ ਓ Ϟσϧ ௕୩઒ ཧܙ ϓϩαοΧʔબख ߳઒ ਅ࢘ ޒྠۚϝμϦετ ๺ౡ ߁հ λϨϯτ ৿Լ ༔ཬ ʛ #FBVUZˍ)FBMUIͷઐ໳Ո  ਓ ΞεϦʔτϑʔυ ϚΠελʔ  ਓ 'J/$ఏܞܖ໿ ϔϧεέΞઐ໳Ո ʢӫཆ࢜ɾ؅ཧӫཆ࢜ɾ τϨʔφʔ౳ʣ ' J / $ Φ ϑ Ο γ ϟ ϧ α ϙ ʔ λ ʔ ঁ༏ ಺ࢁ ཧ໊ ϤΨΠϯετϥΫλʔ .":6,0 ,ϑΝΠλʔ খᖒւే  ສਓ ֤छ4/4ϑΥϩϫʔ ϑΟοτωε ϤΨ౳ʹؔ৺ߴ
  2. 4/4σʔλͷࣗવݴޠॲཧ  • $* &!+,$ ( (")$ ) ' FiNC

    #% MeCab#mecab-python3# NFDBC ͢΋΋΋΋΋΋΋΋ͷ͏ͪ ͢΋΋ ໊ࢺ Ұൠ ͢΋΋ εϞϞ εϞϞ ΋ ॿࢺ ܎ॿࢺ ΋ Ϟ Ϟ ΋΋ ໊ࢺ Ұൠ ΋΋ ϞϞ ϞϞ ΋ ॿࢺ ܎ॿࢺ ΋ Ϟ Ϟ ΋΋ ໊ࢺ Ұൠ ΋΋ ϞϞ ϞϞ ͷ ॿࢺ ࿈ମԽ ͷ ϊ ϊ ͏ͪ ໊ࢺ ඇཱࣗ ෭ࢺՄೳ ͏ͪ ΢ν ΢ν &04
  3. •   •  *+'$) ,,, "- ! •

    stop words %&-  #-   (! • word_cloud: https://github.com/amueller/word_cloud 4/4σʔλͷࣗવݴޠॲཧ  ΰϧϑΞϯόαμʔͷ"͞Μ ϤΨΞϯόαμʔͷ#͞Μ
  4. 4/4σʔλͷࣗવݴޠॲཧ  • 10 • %!' 0 & 0 

    • "*.$#( 10+/) • 2, nodeedge3- ΰϧϑ ϑΟοτωε ঁࢠ உࢠ ΢ΣΞ Πϯελ ΰϧϑ෦ τϨʔχϯά΢ΣΞ τϨʔχϯά HPMG εϙʔπ ίʔσ ΰϧϑ             ϑΟοτωε             ঁࢠ             உࢠ             ΢ΣΞ             Πϯελ             ΰϧϑ෦             τϨʔχϯά΢ΣΞ             τϨʔχϯά             HPMG             εϙʔπ             ίʔσ             ڞىޠߦྻ ڞىޠωοτϫʔΫ
  5. ࣗવݴޠॲཧɾՄࢹԽʹ࢖༻ͨ͠ϥΠϒϥϦ •  • mecab-python3: https://github.com/SamuraiT/mecab-python3 •   •

    word_cloud: https://github.com/amueller/word_cloud •   • scikit-learn: http://scikit-learn.org •  • NetworkX: https://networkx.github.io • matplotlib: https://matplotlib.org
  6. ·ͱΊ • Python ;6=>07< SNS38/15@ *& +'"*29   

      • Python-A:4)+$($.?>%!#)+,    B ΰϧϑΞϯόαμʔͷ"͞Μ ϤΨΞϯόαμʔͷ#͞Μ
  7. / / . /     αʔόʔαΠυΤϯδχΞ ػցֶशɾ"*ΤϯδχΞ

    ΫϥΠΞϯτΤϯδχΞ 4XJGU,PUMJOͰେن໛ͳΫϥΠΞϯτ։ൃΛ ࢼ͍ͨ͠ํ σδλϧϔϧεྖҬͰ%FFQMFBSOJOHΛۦ࢖͠ɺ "*ٕज़Λ࣮ફͰߴΊ͍ͨํʂ 3VCZPO3BJMTΛϝΠϯʹϚΠΫϩαʔϏεͷ ઃܭ͔Β࣮૷·Ͱߦ͍͖͍ͬͯͨํ