References¶
Yun-Nan Chang. Design of low-cost iou circuit for post-processing of object detection. In 2023 IEEE 12th Global Conference on Consumer Electronics (GCCE), volume, 407–408. 2023. doi:10.1109/GCCE59613.2023.10315334.
Suyog Gupta, Ankur Agrawal, Kailash Gopalakrishnan, and Pritish Narayanan. Deep learning with limited numerical precision. 2015. arXiv:1502.02551.
Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. 2015. arXiv:1409.1556.
Vivienne Sze, Yu-Hsin Chen, Tien-Ju Yang, and Joel S. Emer. Efficient Processing of Deep Neural Networks. Synthesis Lectures on Computer Architecture. Morgan & Claypool Publishers, 2020. ISBN 978-3-031-00638-8. URL: https://doi.org/10.2200/S01004ED1V01Y202004CAC050, doi:10.2200/S01004ED1V01Y202004CAC050.
Various. 8-bit inference with tensorrt. 2017. URL: https://on-demand.gputechconf.com/gtc/2017/presentation/s7310-8-bit-inference-with-tensorrt.pdf.
Hao Wu, Patrick Judd, Xiaojie Zhang, Mikhail Isaev, and Paulius Micikevicius. Integer quantization for deep learning inference: principles and empirical evaluation. CoRR, 2020. URL: https://arxiv.org/abs/2004.09602, arXiv:2004.09602.
Yangjie Zhou, Mengtian Yang, Cong Guo, Jingwen Leng, Yun Liang, Quan Chen, Minyi Guo, and Yuhao Zhu. Characterizing and demystifying the implicit convolution algorithm on commercial matrix-multiplication accelerators. In 2021 IEEE International Symposium on Workload Characterization (IISWC), volume, 214–225. 2021. doi:10.1109/IISWC53511.2021.00029.