Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
try to scraping with linebot
Search
Gin
February 13, 2020
Technology
1
270
try to scraping with linebot
Gin
February 13, 2020
Tweet
Share
More Decks by Gin
See All by Gin
Cで理解するソケットプログラミング
gin2525
0
510
論理と感覚
gin2525
0
240
Other Decks in Technology
See All in Technology
BigQuery Remote FunctionでLooker Studioをインタラクティブ化
cuebic9bic
3
270
Prox Industries株式会社 会社紹介資料
proxindustries
0
280
Кто отправит outbox? Валентин Удальцов, автор канала Пых
lamodatech
0
340
Javaで作る RAGを活用した Q&Aアプリケーション
recruitengineers
PRO
1
110
AWS CDK 実践的アプローチ N選 / aws-cdk-practical-approaches
gotok365
6
730
第9回情シス転職ミートアップ_テックタッチ株式会社
forester3003
0
230
20250625 Snowflake Summit 2025活用事例 レポート / Nowcast Snowflake Summit 2025 Case Study Report
kkuv
1
310
Liquid Glass革新とSwiftUI/UIKit進化
fumiyasac0921
0
200
生成AIでwebアプリケーションを作ってみた
tajimon
2
150
25分で解説する「最小権限の原則」を実現するための AWS「ポリシー」大全 / 20250625-aws-summit-aws-policy
opelab
9
1.1k
データプラットフォーム技術におけるメダリオンアーキテクチャという考え方/DataPlatformWithMedallionArchitecture
smdmts
5
630
初めてのAzure FunctionsをClaude Codeで作ってみた / My first Azure Functions using Claude Code
hideakiaoyagi
1
220
Featured
See All Featured
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
29
9.5k
Building Adaptive Systems
keathley
43
2.6k
The Cost Of JavaScript in 2023
addyosmani
51
8.4k
Evolution of real-time – Irina Nazarova, EuRuKo, 2024
irinanazarova
8
790
Build The Right Thing And Hit Your Dates
maggiecrowley
36
2.8k
Build your cross-platform service in a week with App Engine
jlugia
231
18k
Fashionably flexible responsive web design (full day workshop)
malarkey
407
66k
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
5
210
Building a Modern Day E-commerce SEO Strategy
aleyda
42
7.3k
Balancing Empowerment & Direction
lara
1
370
How To Stay Up To Date on Web Technology
chriscoyier
790
250k
4 Signs Your Business is Dying
shpigford
184
22k
Transcript
LINEBOT With Scraping twitter : @gin2_5 Created at 2020/02
ࣗݾհ ֶੜʢ̎̍ࡀʣ LINE FUKUOKAʹ ɹɹɹɹɹظΠϯλʔϯ AI, IoT(ϥζύΠ) ,Java ɹɹɹɹɹɹͲΕ࠳தɻ ࢿ֨ɿԠ༻ใٕज़ऀ
࠷ۙɺ ػցֶशͷֶश ΛαϘͬͯ·͢ɻ ࣜʹർΕ·ͨ͠ɻ ॏճؼͰଉΕͰ͢ɻ
Կ͔ɺଉൈ͖Ͱ ؔ࿈͋Δ͜ͱ͍ͨ͠ͳɻ
εΫϨΠϐϯάͬͯ ໘നͦ͏ɻ
ҰԠɺεΫϨΠϐϯάʹ͍ͭͯ “ WebεΫϨΠϐϯάͱɺWebαΠτ͔Β WebϖʔδͷHTMLσʔλΛऩूͯ͠ɺಛ ఆͷσʔλΛநग़ɺܗ͢͜͠ͱͰ͋Δ ” (weblio ༷ΑΓҾ༻)
HTML File Python Something (DB,File, and more) Scrape Output About
scraping
Α͘ɺ͜Μͳ͜ͱ͋Γ·ͤΜ͔ʁ ໘നͦ͏ͳITܥͷΠϕϯτΛݟಀͪ͠Όͬͨɻ ؾ͍ͨΒɺఆһ͍ͬͺ͍ɻ ʢओʹͰ͢ʣ
࡞Γ·ͨ͠ LINEͰΠϕϯτΛ ௨ɾݕࡧͯ͘͠ΕΔౕ
heroku ߏ DB LINE API Flask python cron (ఆظతʹಈ͘) python
Connpassͷ ެࣜαΠτ (HTMLϑΝΠϧ)
݁ߏ؆୯ʹεΫϨΠϐϯάग़དྷͪΌ͍·ͨ͠ $ pip install beautifulsoup4 from bs4 import BeautifulSoup import
requests bash scrape.py Install Import
url = ’https:!//something.com' r = requests.get(url) soup = BeautifulSoup(r.content, "html.parser")
events_name = soup.select('a.url.summary') events_date = soup.select('p.date') events_year = soup.select('p.year') events_img = soup.select('p.event_thumbnail img') scrape.py
ͨͩɺਅ໘ͳɻ Web APIΛ͍ͬͯΔΘ͚Ͱͳ͍ͷͰɺ ϖʔδͷߏΛม͑ΒΕͨΒɺऴΘΓ·͢ɻ ࣗવݴޠॲཧͱ͔བྷΊͯͬͯΈ͍ͨͳ
͓͠·͍ɻ ͋Γ͕ͱ͏͍͟͝·ͨ͠ɻ
None
!<- ͜Εɺ શ֯ʮzʴhʯ Ͱೖྗग़དྷ·͢ ͓·͚