M. Isaev, and P. Micikevicius. Integer Quantization for Deep Learning Inference: Principles and Empirical Evaluation. arXiv:2004.09602, 2020. 2. R. Krishnamoorthi. Quantizing deep convolutional networks for efficient inference: A whitepaper. arXiv:1806.08342, 2017. 3. T. Sheng, C. Feng, S. Zhuo, X. Zhang, L. Shen, and M. Aleksic. A Quantization-Friendly Separable Convolution for MobileNets. In EMC2 Workshop, 2018. 参考文献 14