$30 off During Our Annual Pro Sale. View Details »
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
How to collect large scale data using Javascript
Search
Leonardo Rifeli
June 02, 2022
Programming
0
59
How to collect large scale data using Javascript
Leonardo Rifeli
June 02, 2022
Tweet
Share
More Decks by Leonardo Rifeli
See All by Leonardo Rifeli
Acate: Processamento distribuído - Como processamos milhões de dados diariamente
leonardorifeli
0
29
Reviewr Data Consolidation Case
leonardorifeli
0
120
Distributed processing: How we process millions of data daily with EMR
leonardorifeli
0
49
Building Crawlers with serverless
leonardorifeli
0
78
Other Decks in Programming
See All in Programming
AI Agent Dojo #4: watsonx Orchestrate ADK体験
oniak3ibm
PRO
0
110
Flutter On-device AI로 완성하는 오프라인 앱, 박제창 @DevFest INCHEON 2025
itsmedreamwalker
1
140
宅宅自以為的浪漫:跟 AI 一起為自己辦的研討會寫一個售票系統
eddie
0
520
愛される翻訳の秘訣
kishikawakatsumi
3
340
Spinner 軸ズレ現象を調べたらレンダリング深淵に飲まれた #レバテックMeetup
bengo4com
0
130
Findy AI+の開発、運用におけるMCP活用事例
starfish719
0
1.7k
Jetpack XR SDKから紐解くAndroid XR開発と技術選定のヒント / about-androidxr-and-jetpack-xr-sdk
drumath2237
1
180
AIエンジニアリングのご紹介 / Introduction to AI Engineering
rkaga
8
3.2k
堅牢なフロントエンドテスト基盤を構築するために行った取り組み
shogo4131
8
2.5k
Navigating Dependency Injection with Metro
l2hyunwoo
1
170
Vibe codingでおすすめの言語と開発手法
uyuki234
0
100
AIエージェントの設計で注意するべきポイント6選
har1101
5
2k
Featured
See All Featured
Neural Spatial Audio Processing for Sound Field Analysis and Control
skoyamalab
0
130
So, you think you're a good person
axbom
PRO
0
1.8k
The Hidden Cost of Media on the Web [PixelPalooza 2025]
tammyeverts
2
120
Balancing Empowerment & Direction
lara
5
810
Facilitating Awesome Meetings
lara
57
6.7k
Stewardship and Sustainability of Urban and Community Forests
pwiseman
0
69
Lessons Learnt from Crawling 1000+ Websites
charlesmeaden
0
940
ReactJS: Keep Simple. Everything can be a component!
pedronauck
666
130k
[RailsConf 2023 Opening Keynote] The Magic of Rails
eileencodes
31
9.8k
We Have a Design System, Now What?
morganepeng
54
7.9k
Done Done
chrislema
186
16k
Fashionably flexible responsive web design (full day workshop)
malarkey
407
66k
Transcript
How to collect large scale data using Javascript seo local
| reviews | pesquisas
None
Agora a experiência é o novo marketing
Somos a Harmo, a plataforma de marketing de experiência mais
completa do Brasil.
SEO Local A única plataforma 3 x 1 do Brasil
Faça a gestão da presença digital da sua rede de lojas e seja encontrado no topo do ranking das pesquisas de forma 100% orgânica. 1 2 3
Reviews A única plataforma 3 x 1 do Brasil Colete,
analise e responda todos os reviews dos seus clientes, conquiste a confiança do consumidor e seja a marca escolhida. 1 2 3
A única plataforma 3 x 1 do Brasil Pesquisas multimétricas
para medir a experiência do cliente durante toda a jornada. Identifique promotores e ative o programa de indicação de reviews. Pesquisas 1 2 3
Harmo, uma poderosa máquina de geração de ROI. Escute, interaja,
analise e atue focado nos anseios dos clientes, durante toda a jornada, transformando os seus clientes no principal canal de aquisição de novos clientes.
Grandes marcas atestam a qualidade da nossa plataforma e metodologia
com foco em resultados
NUMBERS Establishments +30k Reviews +15kk Integrations +54k Emails +6,6kk SMS
+250k Answer of Review +1kk
▷ Distributed Process ▷ Scrapping vs Crawlers ▷ Some Concepts
▷ Why Javascript? ▷ Architecture for Scale ▷ Lessons Learning ▷ Example ▷ Conclusion Topics
Distributed Process
None
Scraping vs Crawlers
None
Collector Concepts
Be "Browserless"
Recursion is your friends
Single Responsability
Normalize Data (input & output)
Code reuse with packages
Collector !== Processor
Why Javascript?
Use native streams
Dynamic typing
Do more with less
Most used in the world
Architecture for Scale
None
None
None
Lessons Learning
Use code-base version alert
Code reuse with packages
Create E2E tests from the begin
Be "Browserless"
Use Puppeteer *reduce images
None
None
Use Promise.all
None
Use monorepos
Otherwise it will be chaos
None
None
▷ Web Scraping vs Web Crawling: The Differences ▷ HOW
TO RUN ASYNC JAVASCRIPT FUNCTIONS IN SEQUENCE OR PARALLEL Links
Collector Example
None
None
Leonardo Rifeli | CTO
[email protected]
harmo.me seo local | reviews
| pesquisas