
Performance prediction for convolutional neural networks on edge GPUs

Abstract: Edge computing is increasingly used for Artificial Intelligence (AI) workloads to address latency, privacy, and energy challenges. Convolutional Neural Networks (CNNs) are increasingly deployed on Edge devices for a range of applications. However, because of their constrained computing resources and energy budget, Edge devices struggle to meet the latency requirements of CNNs while maintaining good accuracy. It is therefore crucial to choose the CNN with the best accuracy and latency trade-off while respecting hardware constraints. This paper presents and compares five widely used Machine Learning (ML) based approaches for predicting CNN inference execution time on Edge GPUs. For these five methods, in addition to their prediction accuracy, we also examine the time needed to train them and to tune their hyperparameters. Finally, we compare the time needed to run the prediction models on different platforms. These methods greatly facilitate design space exploration by quickly identifying the best CNN for a target Edge GPU. Experimental results show that XGBoost achieves a competitive average prediction error, even for unexplored and unseen CNN architectures. Random Forest shows comparable accuracy but needs more effort and time to train. The other three approaches, Ordinary Least Squares (OLS), Multi-Layer Perceptron (MLP), and Support Vector Regression (SVR), are less accurate for CNN performance estimation.
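The selection step the abstract describes (pick the most accurate CNN whose predicted latency fits the hardware constraint) and the "average prediction error" metric can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the paper: the `Candidate` fields, the two architecture descriptors, the candidate values, and `toy_latency_ms`, a hand-written linear formula that merely stands in for a trained predictor such as XGBoost or Random Forest. Mean absolute percentage error is used as one common way to express an average prediction error; the paper may define its metric differently.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple

Features = Tuple[float, float]  # assumed descriptors: (GFLOPs, millions of parameters)


@dataclass(frozen=True)
class Candidate:
    name: str           # hypothetical CNN identifier
    accuracy: float     # assumed known top-1 accuracy in [0, 1]
    features: Features  # assumed input to the latency predictor


def mape(measured: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean absolute percentage error between measured and predicted latency, in percent."""
    if not measured or len(measured) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(m - p) / m for m, p in zip(measured, predicted))
    return 100.0 * total / len(measured)


def select_best(
    candidates: Sequence[Candidate],
    predict_latency_ms: Callable[[Features], float],
    budget_ms: float,
) -> Optional[Candidate]:
    """Return the most accurate candidate whose predicted latency fits the budget."""
    feasible = [c for c in candidates if predict_latency_ms(c.features) <= budget_ms]
    return max(feasible, key=lambda c: c.accuracy, default=None)


def toy_latency_ms(features: Features) -> float:
    """Made-up linear stand-in for a trained latency predictor (not from the paper)."""
    gflops, mparams = features
    return 2.0 + 3.0 * gflops + 0.1 * mparams


# Illustrative values only; these are not measurements.
CANDIDATES = [
    Candidate("cnn_a", accuracy=0.70, features=(0.5, 4.0)),
    Candidate("cnn_b", accuracy=0.76, features=(2.0, 12.0)),
    Candidate("cnn_c", accuracy=0.80, features=(6.0, 30.0)),
]

best = select_best(CANDIDATES, toy_latency_ms, budget_ms=10.0)
print(best.name if best else "no feasible CNN")  # prints "cnn_b"
```

Because the predictor is passed in as a plain callable, any of the five regressors compared in the paper could be substituted for `toy_latency_ms` without changing the selection logic, which is what makes fast design space exploration possible once a predictor has been trained.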
Document type: Conference papers

https://hal-uphf.archives-ouvertes.fr/hal-03379997
Contributor: Kathleen Torck
Submitted on: Friday, October 15, 2021 - 12:12:03 PM
Last modification on: Wednesday, October 27, 2021 - 1:16:08 PM

Citation

Halima Bouzidi, Hamza Ouarnoughi, Smail Niar, Abdessamad Ait El Cadi. Performance prediction for convolutional neural networks on edge GPUs. CF '21: Computing Frontiers Conference, May 2021, Catania, Italy. pp.54-62, ⟨10.1145/3457388.3458666⟩. ⟨hal-03379997⟩
