Experimental Setup
▸ In-house simulator modeling a commercial
mobile SoC: Nvidia Tegra X2
▹ Real board measurement
▸ Develop RTL models for IPs unavailable on TX2
▹ CNN Accelerator (651 mW, 1.58 mm2)
▹ Motion Controller (2.2 mW, 0.035 mm2)
14
▸ Evaluate on Object Tracking and Object Detection
▹Important domains that are building blocks for many vision applications
▹IP vendors have started shipping standalone tracking/detection IPs
▸ Object Detection
▹Baseline CNN: YOLOv2 (state-of-the-art detection results)
▸ SCALESim: A systolic array-based, cycle-accurate CNN accelerator
simulator. https://github.com/ARM-software/SCALE-Sim.