Slide 89
Slide 89 text
Impact of GPU
Accelerated computations on float32 data
Matrix multiplication, convolution, and large
element-wise operations can be accelerated a lot
Difficult to parallelize dense neural networks on
multiple GPU efficiently (Active area of research)
Copying of large quantities of data to and from a
device is relatively slow
CUDA has released cuDNN