Conference papers

Cross-Layer CNN Approximations for Hardware Implementation

Abstract : Convolutional Neural Networks (CNNs) are widely used for image classification and object detection. Deploying these architectures in embedded applications is a major challenge because of their high computational complexity, which must be implemented on platforms with limited hardware resources, such as FPGAs. Since these applications are inherently error-resilient, approximate computing (AC) offers an interesting trade-off between resource utilization and accuracy. In this paper, we study the impact on CNN performance when several approximation techniques are applied simultaneously. We focus on two widely used approximation techniques, namely quantization and pruning. Our experimental results show that, for CNNs of different parameter sizes and a 3% loss in accuracy, we can obtain reductions of up to 27.9%–47.2% in computational complexity, measured in FLOPs, on the CIFAR-10 and MNIST datasets.

https://hal-uphf.archives-ouvertes.fr/hal-03430302
Contributor : Mylène Delrue
Submitted on : Tuesday, November 16, 2021 - 10:52:31 AM
Last modification on : Saturday, May 14, 2022 - 11:35:57 AM

Citation

Karim Mohamed Abedallah Ali, Ihsen Alouani, Abdessamad Ait El Cadi, Hamza Ouarnoughi, Smail Niar. Cross-Layer CNN Approximations for Hardware Implementation. ARC 2020 : Applied Reconfigurable Computing, Apr 2020, Toledo (Virtual Event), Spain. pp.151-165, ⟨10.1007/978-3-030-44534-8_12⟩. ⟨hal-03430302⟩
