Wantedly の Data 系メンバー 5 名で WWW 2020 に参加しました! 中でもソーシャルネットワークにおけるつながりの活用に関する論文をいくつか参加報告会で紹介しました。 https://connpass.com/event/174856/
©2020 Wantedly, Inc.Social Networks ʹ͓͚Δinfluence ͱ fake accounts ʹؔ͢ΔจհThe Web Conference 2020 ࢀՃใࠂձApr 30, 2020 - Naomichi Agata
View Slide
©2018 Wantedly, Inc.Naomichi Agata- 2018/04~ Wantedly, Inc.- Wantedly People / Data Chapter Leader / Engineer- σʔλΛͬͯαʔϏεΛվળ͢ΔνʔϜ- Wantedly Platform ্ͷʮͭͳ͕ΓʯΛվળ͢Δɾ׆༻͢Δ͜ͱ͕࠷ۙͷϛογϣϯ- kaggle expert@agatan_ࣗݾհ
©2020 Wantedly, Inc.հ͢Δจ• Social Network Influence Ranking via Embedding Network Interactionsfor User Recommendation•Hongbo Bo, Ruan McConville, Jun Hong, Weiru Liu, 2020.•https://dl.acm.org/doi/abs/10.1145/3366424.3383299• On Twitter Purge: A Retrospective Analysis of Suspended Users•Farhan Asif Chowdhury , Lawrence Allen , Mohammad Yousuf , Abdullah Mueen, 2020.•https://dl.acm.org/doi/abs/10.1145/3366424.3383298• Friend or Faux: Graph-Based Early Detection of Fake Accountson Social Networks• Adam Breuer, Roee Eilat, Udi Weinsberg, 2020.• https://dl.acm.org/doi/abs/10.1145/3366423.3380204
©2020 Wantedly, Inc.Social Network Influence Rankingvia Embedding Network Interactionsfor User RecommendationհจຊHongbo Bo, Ruan McConville, Jun Hong, Weiru Liu
©2020 Wantedly, Inc.తSNS্ͷӨڹྗͷఆྔԽʹϢʔβؒͷinteractionΛߟྀ͍ͨ͠հจ (1): Social Network Influence Ranking via Embedding Network Interactions for User Recommendation• Өڹྗͷߴ͞ (Influence) ΛదʹఆྔԽͯ͠ਪનʹཱ͍ͯͨ• Ծఆ: ϢʔβӨڹྗͷߴ͍ϢʔβΛϑΥϩʔ͍͢͠• ϑΥϩʔɾϑΥϩϫʔͷ͚ؔͩͰෆे
©2020 Wantedly, Inc.ఏҊख๏ͷ֓ཁ• άϥϑ্ͷϊʔυͷॏཁΛܾఆ͢ΔΞϧΰϦζϜͰ͋Δ PageRank Λϕʔεͱ͢Δ EIRank ΛఏҊ• follow ؔͻͱͭͻͱͭʹॏΈΛ͚ͭΔ• interaction ใΛ༻͍ͯɼΑΓີͳ follow ؔʹߴ͍ॏΈΛׂΓͯΔ (= ॏࢹ͢Δ)• interaction graph ্ͷ node2vec Λར༻͠ɼfollow ؔͷڧ͞ΛఆྔԽհจ (1): Social Network Influence Ranking via Embedding Network Interactions for User Recommendation
©2020 Wantedly, Inc.લఏ: PageRank• άϥϑશମʹରͯ͠ɼ֤ϊʔυ͕Ͳͷ͘Β͍ॏཁ͔Λఆྔతʹද͢• ͨ͘͞ΜϑΥϩʔ͞Ε͍ͯΔͱߴ͘ͳΔ• ॏཁ͕ߴ͍ਓʹϑΥϩʔ͞Ε͍ͯΔͱߴ͘ͳΔ• ͨͩ͠ϑΥϩʔ͕ଟ͍ਓʹϑΥϩʔ͞Ε্ͯঢ෯খ͍͞հจ (1): Social Network Influence Ranking via Embedding Network Interactions for User Recommendation
©2020 Wantedly, Inc.EIRank• interaction ΛΤοδͱΈͳͨ͠ॏΈ͖άϥϑΛߏங͠ɼͦͷάϥϑ্ͰͷϢʔβؒͷڑ (≒ ີ) Λ node2vec ͰఆྔԽ͢Δհจ (1): Social Network Influence Ranking via Embedding Network Interactions for User Recommendation
©2020 Wantedly, Inc.EIRank• node2vec ͷڑΛͱʹϑΥϩʔ͝ͱͷີΛܭࢉ• PageRank ͰҰ༷ʹ 1/N ͱ͍ͯͨ͠෦Λີʹஔ͖͑Δհจ (1): Social Network Influence Ranking via Embedding Network Interactions for User Recommendation
©2020 Wantedly, Inc.࣮ݧɾධՁ• Twitter ͷϑΥϩʔਪનͰධՁ͍ͯ͠Δ• ਖ਼ྫ 1, ෛྫ 10 ͷ 11 ਓΛฒͼସ͑ͯɼਖ਼ྫͷฏۉॱҐΛൺֱ• Influence ͷӨڹ͕େ͖͍ Platform Ͱ͋Δ͜ͱ͕Α͘Θ͔Δ…հจ (1): Social Network Influence Ranking via Embedding Network Interactions for User Recommendation
©2020 Wantedly, Inc.·ͱΊ• Influence ͷఆྔԽʹϢʔβؒͷ interaction ΛՃຯ͢Δ EIRank ΛఏҊ• ϢʔβਪનͷԠ༻ΛධՁ͠ɼطଘख๏Λ͑ΔੑೳΛ֬ೝͨ͠ײ• ͭͳ͕Γ͕ฏͰͳ͍͜ͱΛఆྔԽ͍͓ͯͯ͠͠Ζ͔ͬͨ• ෦తͳΞΠσΟΞ;ͭ͏ʹ׆༻Ͱ͖ͦ͏• Influence ͕ϢʔβਪનʹͲͷ͘Β͍ޮ͔͘ϓϥοτϑΥʔϜͷੑ࣭ʹґଘͦ͠͏• interaction ͷछྨʹΑΔڧऑ͕ߟྀ͞Ε͍ͯͳ͍ͷ͕ؾʹͳͬͨհจ (1): Social Network Influence Ranking via Embedding Network Interactions for User Recommendation
©2020 Wantedly, Inc.On Twitter Purge: A Retrospective Analysis ofSuspended UsersհจຊFarhan Asif Chowdhury , Lawrence Allen , Mohammad Yousuf , Abdullah Mueen
©2020 Wantedly, Inc.Twitter ͕ abuse / span Ϣʔβͱͯ͠ purge ͨ͠ϢʔβͨͪʹযΛͯͯɼߦಈಛΛੳ͍ͯ͠Δհจ (2): On Twitter Purge: A Retrospective Analysis of Suspended Users• author twitter ͷਓ͡Όͳ͍ͷͰσʔλ࡞Γ͔Βݥ͍͠…• Twitter API Λ༻͍ͯ͋Δ࣌ͰͷϢʔβϦετΛ࡞• 1 ϲ݄ޙʹ࠶ͦͷϢʔβͨͪΛऔಘ͢Δ͜ͱͰpurge ͞ΕͨϢʔβϦετΛ࡞͍ͬͯΔ• purge ͞Εͨޙ͔ΒͦͷϢʔβͷ activity Λऔಘ͢Δ͜ͱͰ͖ͳ͍ͷͰsampling Ͱͻͨ͢Β activity ΛूΊ͓͍ͯͯɼۮʑώοτͨ͠ͷΛ͏
©2020 Wantedly, Inc.key findings (ൈਮ)հจ (2): On Twitter Purge: A Retrospective Analysis of Suspended Users• ࣗಈͰ࡞ΒΕΔ spam ͕ओ• త࣏ܥόΠϥϧϚʔέ͕ଟ͍• 60% Ҏ্͕ 2 ؒ Twitter Λ͍ͬͯΔ (!)• ϓϩϑΟʔϧ͕ৗʹಉظ͍ͯ͠ΔΫϥελ͕͋ͬͨΓ͢Δ• purged Ϣʔβಉ͕࢜ RT ͋͠͏͜ͱଟ͍
©2020 Wantedly, Inc.Friend or Faux:Graph-Based Early Detection of Fake Accountson Social NetworksհจຊAdam Breuer, Roee Eilat, Udi Weinsberg
©2020 Wantedly, Inc.తFake account Λૣظʹݕग़͍ͨ͠հจ (3): Friend or Faux: Graph-Based Early Detection of Fake Accounts on Social Networks• Fake account ΛऔΓআ͘͜ͱɼSocial Networks ͷ݈શੑҡ࣋ʹ͔ܽͤͳ͍ॏཁͳλεΫ• ͦͷѱӨڹ͕·Δ͜ͱΛ͙ͨΊʹɼૣظʹݕग़͢Δ͜ͱ͕ॏཁ• طଘͷΞϓϩʔνߏங͞ΕͨωοτϫʔΫߏʹ͢Δͷ͕ଟ͘ɼωοτϫʔΫ͕ेʹߏங͞ΕΔ·Ͱݕग़͢Δ͜ͱ͕͔ͬͨ͠
©2020 Wantedly, Inc.എܠͱલఏ• Facebook ͷݚڀ• 2019 1Q Ͱ 2.2 billion ͷ fake account Λݕग़͍ͯ͠Δ…• طʹ 99.8% Ϣʔβʹใࠂ͞ΕΔલʹݕग़Ͱ͖͍ͯΔհจ (3): Friend or Faux: Graph-Based Early Detection of Fake Accounts on Social Networks
©2020 Wantedly, Inc.՝: fake ϢʔβͷૹΔϦΫΤετͦͦ͜͜ Accept ͞Εͯ͠·͏୯७ͳ Reject ͷΑ͏ͳࢦඪͰݕͰ͖ͳ͍հจ (3): Friend or Faux: Graph-Based Early Detection of Fake Accounts on Social Networks
©2020 Wantedly, Inc.Ұ෦ͷϢʔβɼfake Ϣʔβ͔ΒϦΫΤετΛૹΒΕ͍͢ʮfake Ϣʔβ͔ΒϦΫΤετΛૹΒΕ͍͢ϢʔβʯʹϦΫΤετΛૹ͍ͬͯΔ=fake Β͕͠͞ߴ͍ͱ͍͑ΔͷͰʁհจ (3): Friend or Faux: Graph-Based Early Detection of Fake Accounts on Social Networks
©2020 Wantedly, Inc.Ұ෦ͷϢʔβɼreal or fake Λఆͯ͠ accept ͢Δ͔Λஅ͍ͯ͠Δʮfake Λڋ൱ͯ͠ real Λঝೝ͍͢͠ʯϢʔβ͔Βঝೝ͞Ε͍ͯΔ=real Β͕͠͞ߴ͍ͱ͍͑ΔͷͰʁհจ (3): Friend or Faux: Graph-Based Early Detection of Fake Accounts on Social Networks
©2020 Wantedly, Inc.ఏҊख๏ͷ֓ཁ• ৽نϢʔβ͕୭ʹϦΫΤετΛૹ͍ͬͯΔ͔Λूܭ͢Δ• fake Ϣʔβ͔ΒϦΫΤετΛૹΒΕ͍͢Ϣʔβʹ͢Δ• ৽نϢʔβ͕ૹͬͨϦΫΤετͷฦΛूܭ͢Δ• fake Ϣʔβ͔ΒͷϦΫΤετʹɼ;ͭ͏ͷϦΫΤετͱҧ͏ԠΛࣔ͢Α͏ͳϢʔβʹ͢Δ• ߴ͍֬৴Ͱ real or fake Λ༧ଌͰ͖ΔϢʔβʹ͢ΔݸʑͷϢʔβ͝ͱͷಛΛՃຯͯ͠அ͢Δ͜ͱͰૣظݕΛ࣮ݱ͢Δհจ (3): Friend or Faux: Graph-Based Early Detection of Fake Accounts on Social Networks
©2020 Wantedly, Inc.↑ʹՃ͑ͯ Confidence ͷߴ͍ใʹɼߴ͍ॏΈΛ༩͑ΔΑ͏ͳ re-weight Λߦ͏ʢ࣌ؒͷ߹Ͱলུ…ʣSybilEdgeհจ (3): Friend or Faux: Graph-Based Early Detection of Fake Accounts on Social Networks
©2020 Wantedly, Inc.ධՁ• ϦΫΤετ 5 ݅ͷஈ֊͔Βߴ͍ AUC• ୯७ͳ RejectRate Ͱेʹใ͕͋ͬͯݫ͍͜͠ͱ͕Θ͔Δ• false positive ΛݟͯΈΔͱɼreal ϢʔβͰ͋Δ͕ਪનΛར༻ͯ͠େྔʹϦΫΤετΛૹΔ“spammy” ͳϢʔβͩͬͨհจ (3): Friend or Faux: Graph-Based Early Detection of Fake Accounts on Social Networks
©2020 Wantedly, Inc.·ͱΊ• fake ͔ΒϦΫΤετ͞Ε͍͢Ϣʔβɼfake ͱ real Ͱঝೝ͕ҧ͏Ϣʔβ͕͍Δ͜ͱΛൃݟͨ͠• Ϣʔβ͝ͱͷʹ͢Δ͜ͱͰɼfake account ͷૣظݕग़Λվળͨ͠ײ• ࠷ऴతͳࣜώϡʔϦεςΟοΫͷੵΈ্͛Ͱɼ͜ΕͰ͜͜·Ͱͷੑೳ͕ग़ͤΔͷ͍͢͝• Fake account ଆ͕͔ͳΓຊؾ͡Όͳ͍ͱ͜͜·Ͱͷੳෆཁͳؾ͕͢ΔͷͰɼFacebook ͱ͍͏ϓϥοτϑΥʔϜͷڧ͞Λײͨ͡հจ (3): Friend or Faux: Graph-Based Early Detection of Fake Accounts on Social Networks