Execution Time Modeling for CNN Inference on Embedded GPUs

Noureddine Bouhali; Hamza Ouarnoughi; Smail Niar; Abdessamad Ait El Cadi

doi:10.1145/3444950.3447284

Communication Dans Un Congrès Année : 2021

Execution Time Modeling for CNN Inference on Embedded GPUs

(1) , (2) , (2) , (2)

1
2

Noureddine Bouhali

Fonction : Auteur
PersonId : 1295323

École nationale polytechnique [Alger, Algérie]

Hamza Ouarnoughi

Fonction : Auteur
PersonId : 1103143

Laboratoire d'Automatique, de Mécanique et d'Informatique industrielles et Humaines - UMR 8201

Smail Niar

Fonction : Auteur
PersonId : 921561
IdHAL : smail-niar
ORCID : 0000-0002-7550-484X
IdRef : 084751142

Laboratoire d'Automatique, de Mécanique et d'Informatique industrielles et Humaines - UMR 8201

Abdessamad Ait El Cadi

Fonction : Auteur
PersonId : 173525
IdHAL : aaitelcadi
ORCID : 0000-0001-6382-6588
IdRef : 223418056

Laboratoire d'Automatique, de Mécanique et d'Informatique industrielles et Humaines - UMR 8201

Résumé

Machine learning is one of the most cutting edge methods in computer vision. Convolutional Neural Networks (CNN) in particular are widely used in edge computing based applications such as autonomous driving for image recognition or object tracking. Different constraints exist in this application area such as real-time, energy consumption, memory resources, etc. Choosing the optimal CNN for each GPU at hand is really hard to do, while maintaining high levels of accuracy and performance. This makes prior knowledge about the execution time a necessary prerequisite information before the final deployment of the CNN on the edge GPU platform. In this paper, we compare 5 execution time prediction models on a large set of CNNs-based applications. The tested predictors use machine learning regression approach. The proposed methodology is based on the utilization of high level CNN features. At the opposite of state-of-the-art approaches, no implementation or profiling on the hardware is required. A Mean Absolute Percentage Error (MAPE) of 5% using Support Vector Regression and Artificial Neural Networks has been obtained in the experiments. Our comparison shows the efficiency of these models to rapidly explore a large space of CNN models or Hardware configurations.

Mots clés

Computing methodologies Machine learning Machine learning approaches Neural networks

Domaines

Informatique [cs]

Kathleen TORCK : Connectez-vous pour contacter le contributeur

https://uphf.hal.science/hal-03381837

Soumis le : lundi 18 octobre 2021-08:44:47

Dernière modification le : jeudi 7 mars 2024-10:34:03

Dates et versions

hal-03381837 , version 1 (18-10-2021)

Identifiants

HAL Id : hal-03381837 , version 1
DOI : 10.1145/3444950.3447284

Citer

Noureddine Bouhali, Hamza Ouarnoughi, Smail Niar, Abdessamad Ait El Cadi. Execution Time Modeling for CNN Inference on Embedded GPUs. DroneSE and RAPIDO '21: Methods and Tools, Jan 2021, Budapest, Hungary. pp.59-65, ⟨10.1145/3444950.3447284⟩. ⟨hal-03381837⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-VALENCIENNES INSA-GROUPE LAMIH INSA-HAUTS-DE-FRANCE

29 Consultations

0 Téléchargements

Execution Time Modeling for CNN Inference on Embedded GPUs

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager