Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
F#でスクレイピングをしてみた!(F# SCRAPING)
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
callmekohei
July 02, 2016
Programming
960
1
Share
F#でスクレイピングをしてみた!(F# SCRAPING)
2016/07/03開催のF#談話室(23)の発表資料です。
callmekohei
July 02, 2016
Other Decks in Programming
See All in Programming
1人1案件のプロダクトエンジニア時代に、"プロセス監督"としてチャレンジしたこと
non0113
0
120
新規プロダクトを高速で生み出すハーネスエンジニアリング
seanchas116
3
210
Modding RubyKaigi for Myself
yui_knk
0
240
実践ハーネスエンジニアリング:ステアリングループを実例から読み解く / Practical Harness Engineering: Understanding Steering Loops Through Real-World Examples
nrslib
5
5.7k
TSKaigi2026-静的解析への投資がAI時代のコード品質を支える ── カスタムESLintルールの設計と運用
hayatokudou
3
340
My daily life on Ruby
a_matsuda
3
420
RailsTokyo 2026#4: AI様があれば、 Hotwireの弱点は消えるか?
naofumi
3
450
サークル参加から学ぶ、小さな事業の回し方
yuzneri
0
210
Sans tests, vos agents ne sont pas fiables
nabondance
0
140
Hive Metastoreを通して学ぶIceberg REST Catalog ― 仕様から実装まで
okumin
0
210
Stage 3 Decorators でできること / できないこと / TSKaigi 2026
susisu
0
150
ビジネスモデルから紐解く、AI+型駆動開発
hirokiomote
1
490
Featured
See All Featured
Why Mistakes Are the Best Teachers: Turning Failure into a Pathway for Growth
auna
0
140
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
55k
Rails Girls Zürich Keynote
gr2m
96
14k
SEO in 2025: How to Prepare for the Future of Search
ipullrank
3
3.4k
Redefining SEO in the New Era of Traffic Generation
szymonslowik
1
300
The AI Revolution Will Not Be Monopolized: How open-source beats economies of scale, even for LLMs
inesmontani
PRO
3
3.5k
Test your architecture with Archunit
thirion
1
2.2k
The Mindset for Success: Future Career Progression
greggifford
PRO
0
330
Crafting Experiences
bethany
1
150
[RailsConf 2023] Rails as a piece of cake
palkan
59
6.6k
It's Worth the Effort
3n
188
29k
brightonSEO & MeasureFest 2025 - Christian Goodrich - Winning strategies for Black Friday CRO & PPC
cargoodrich
3
700
Transcript
F# SCRAPING F#ͰεΫϨΠϐϯάΛͯ͠Έͨʂ callmekohei 2016/07/03 Sun
Overview • ࣗݾհ • εΫϨΠϐϯάͷ͖͔͚ͬ • HTMLͷ2ͭͷऔಘํ๏ • HTML͔ΒσʔλʔΛͱΓͩͯ͠ΈΔ
callmkohei VBA 3 years F# 9 months ͷ໊લ ϨΦɻ ΑΖ͘͠Ͷʂ
εΫϨΠϐϯάͷ͖͔͚ͬ
͜Μͳײ͡Ͱ ϩτ̓ͷ༧ଌΛ ͍ͨ͠
ϩτ̓ͷબ൪߸Λ ϗʔϜϖʔδΑΓ ೖख͍ͨ͠
None
ϗʔϜϖʔδ HTMLιʔεͰ Ͱ͖͍ͯΔ
None
ϗʔϜϖʔδͷσʔλʔΛ औಘ͢Δʹ HTMLιʔε͕ඞཁ
Ͳ͏ͬͯ HTMLιʔεΛ औಘ͢Δ͔
HTMLιʔεΛऔಘ͢Δ̎ͭͷํ๏
ͦͷ̍ System.NetΛ͔ͭ͏
ͦͷ̎ PhantomJS Λ͔ͭ͏
جຊతʹ System.NetͷΈͰ େৎʢͩͱࢥ;ɻɻɻʣ ͜͜ ߟ͑Ͳ͜Ζ
Ͳ͏্ͯ͠ख͘ औಘͰ͖ͳ͍߹ PhantomJSΛͬͯΈΔ
ͨͩ PhantomJS ͍ͷͰ͢
αʔόʔͷଓൺֱ System.Net: 2s PhantomJS: 7s ࣮ߦ࣌ؒࢀߟ
͜͜Ͱ ࣮ࡍʹαʔόʔʹ ͭͳ͛ͯΈΔ demo
HTMLιʔε͔Β σʔλʔΛ ͱΓͩͯ͠ΈΔ
͜͜Ͱ ศརͳ ϥΠϒϥϦ FSharp.Data
FSharp Data ͱʁ The F# Data library implements everything you
need to access data in your F# applications and scripts. CSV, HTML, JSON and XML ʹରͯ͠ͷศརϥΠϒϥϦ http://fsharp.github.io/FSharp.Data/
a tag
͜͜Ͱ a tag Λ ͱΓͩͯ͠ΈΔ demo
table tag
͜͜Ͱ table tag Λ ͱΓͩͯ͠ΈΔ demo
εΫϨΠϐϯά͢Δͱ͖ʹ ศརͩͱࢥ͏ؔΛ ·ͱΊͯΈͨ
SCRAPINGfs https://github.com/callmekohei/SCRAPINGfs
͞Βʹɺɺ
FSharp.Data HtmlDocument.Load ͍͚ͯͳ͍ͱࢥͬͯΔͱ͜Ζ
ϒϥβʔ͡Όͳ͍ͱ͔͡ΕΔ จࣈίʔυ͕͏·͘ॲཧ͞Εͳ͍ Http Utilities Λ͏͜ͱͰղܾʂ
Formॲཧ͕Ͱ͖ͳ͍
Formॲཧ͕Ͱ͖ͳ͍ Http Utilities Λ͏͜ͱͰͪΐͬͱղܾʂ
Formॲཧ͕Ͱ͖ͳ͍ System.Net Λ͏͜ͱͰղܾʂ
؆୯ͳൺֱද
͝ਗ਼ௌ͋Γ͕ͱ͏͍͟͝·ͨ͠