Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
OpenTalks.AI - Дмитрий Пагин, Fast cars detecti...
Search
OpenTalks.AI
February 21, 2020
Science
0
2.1k
OpenTalks.AI - Дмитрий Пагин, Fast cars detection and traffic estimation
OpenTalks.AI
February 21, 2020
Tweet
Share
More Decks by OpenTalks.AI
See All by OpenTalks.AI
OpenTalks.AI - Виктор Лемпицкий, Моделирование 3Д сцен: новые подходы в 2020 году
opentalks
0
490
OpenTalks.AI - Алексей Чернявский, Нейросетевые алгоритмы для повышения качества медицинских изображений
opentalks
0
440
OpenTalks.AI - Александр Громов, Устойчивость нейросетевых моделей при анализе КТ/НДКТ-исследований
opentalks
0
380
OpenTalks.AI - Денис Тимонин, Megatron-LM: Обучение мультимиллиардных LMs при помощи техники Model Parallelism
opentalks
0
520
OpenTalks.AI - Егор Филимонов, Возможности платформы Huawei Atlas и эффективный гетерогенный инференс.
opentalks
0
160
OpenTalks.AI - Александр Прозоров, Референсная архитектура робота сервисного центра в отраслях с изменчивыми бизнес-процессами
opentalks
0
390
OpenTalks.AI - Наталья Лукашевич, Анализ тональности по отношению к компании — с чем не справился BERT
opentalks
0
340
OpenTalks.AI - Константин Воронцов, Фейковые новости и другие типы потенциально опасного дискурса: типология, подходы, датасеты, соревнования
opentalks
0
450
OpenTalks.AI - Дмитрий Ветров, Фрактальность функции потерь, эффект двойного спуска и степенные законы в глубинном обучении - фрагменты одной мозаики
opentalks
0
480
Other Decks in Science
See All in Science
【RSJ2025】PAMIQ Core: リアルタイム継続学習のための⾮同期推論・学習フレームワーク
gesonanko
0
710
データマイニング - グラフ埋め込み入門
trycycle
PRO
1
190
論文紹介 音源分離:SCNET SPARSE COMPRESSION NETWORK FOR MUSIC SOURCE SEPARATION
kenmatsu4
0
580
データマイニング - ノードの中心性
trycycle
PRO
0
350
Rashomon at the Sound: Reconstructing all possible paleoearthquake histories in the Puget Lowland through topological search
cossatot
0
740
データベース15: ビッグデータ時代のデータベース
trycycle
PRO
0
470
HDC tutorial
michielstock
1
580
俺たちは本当に分かり合えるのか? ~ PdMとスクラムチームの “ずれ” を科学する
bonotake
2
2.1k
Optimization of the Tournament Format for the Nationwide High School Kyudo Competition in Japan
konakalab
0
170
Celebrate UTIG: Staff and Student Awards 2025
utig
0
1.3k
データベース14: B+木 & ハッシュ索引
trycycle
PRO
0
680
ド文系だった私が、 KaggleのNCAAコンペでソロ金取れるまで
wakamatsu_takumu
2
2.2k
Featured
See All Featured
Self-Hosted WebAssembly Runtime for Runtime-Neutral Checkpoint/Restore in Edge–Cloud Continuum
chikuwait
0
410
Done Done
chrislema
186
16k
It's Worth the Effort
3n
188
29k
The Curious Case for Waylosing
cassininazir
0
280
Organizational Design Perspectives: An Ontology of Organizational Design Elements
kimpetersen
PRO
1
650
Why Mistakes Are the Best Teachers: Turning Failure into a Pathway for Growth
auna
0
94
職位にかかわらず全員がリーダーシップを発揮するチーム作り / Building a team where everyone can demonstrate leadership regardless of position
madoxten
62
53k
Intergalactic Javascript Robots from Outer Space
tanoku
273
27k
No one is an island. Learnings from fostering a developers community.
thoeni
21
3.6k
SERP Conf. Vienna - Web Accessibility: Optimizing for Inclusivity and SEO
sarafernandez
1
1.4k
Building a Modern Day E-commerce SEO Strategy
aleyda
45
9k
The agentic SEO stack - context over prompts
schlessera
0
710
Transcript
Fast cars detection and traffic estimation Dmitriy Pagin, ML and
CV developer
Task Road traffic analysis in Russia is manual. It takes
more than 8 hours for 15 minutes video today
Task • detect cars
Task • detect cars • track cars
Baseline - people tracking
Problems Cars: - faster (2 metres per frame!) - smaller
(10 px in minimal dimension) + more predictable movement
YOLOv2 - blinking - problems on small cars - problems
on edges
YOLOv2 1 fps
YOLOv3 - bigger + accurate on small + fullHD frame
+ robust
YOLOv3 7 fps
> 70k cars on 4k images Dataset
better than 1024x1024x1 Learning and Fine-tuning - 608x608 px -
batchSize = 3 - custom augmenters
None
Learning and Fine-tuning - 608x608 px - batchSize = 3
- custom augmenters - Radam optimizer (instead warmup + reduce LR) - Hard negative mining for trucks
Learning and Fine-tuning - 608x608 px - batchSize = 3
- custom augmenters - Radam optimizer (instead warmup + reduce LR) - Hard negative mining for trucks mAP75 = 0.96
Baseline Inference Speed 7 fps
Weights Pruning
Weights Pruning -25% convs = size: 240 mb mAp: 0.9656
inf: 150 ms size: 155 mb mAp: 0.9622 inf: 100 ms 10 fps
OpticalFlow step or classical cv is alive ! - find
good features to track - calculate sparse optical flow
OpticalFlow step 19 fps Calculation doesnt work for 3 consistent
frames
Speed extrapolation step - estimate speed as pixels/frame - extrapolate
next position 28 fps
Final pipeline 1 2 3 4 5 6 Update trajectories
4 5 6 step 1 step 2 Speed Extrapolation OpticalFlow YOLOv3 Detection Engine
1 fps -> 28 fps on FULLHD
Tracking - IoU - Color descriptor (it’s enough!)
Bridges! - Allowed zone by motion vector - Size overlap
- Color descriptor
Bridges! - Allowed zone by motion vector - Size overlap
- Color descriptor
Thanks! Questions?
[email protected]
+7 952 335 65 70
Appendix. Examples
Appendix. Examples
Appendix. Examples
Appendix. Yolov3
Weights Pruning Шаг mAP75 Число параметров, млн Размер сети, мб
От изначальной, % Время прогона, мс Условие обрезания 0 0.965 60 241 100 150 - 1 0.962 55 218 91 140 5% от всех 2 0.962 50 197 83 132 5% от всех 3 0.963 39 155 64 112 15% для слоев с 400+ сверток 4 0.955 31 124 51 100 10% для слоев с 100+ сверток
Appendix. Radam
Pruning convs
Pruning convs. Good choice 2000
Pruning convs. Bad choice 25
Pruning flat