Conference paper, Year: 2024

Automatic Speech Interruption Detection: Analysis, Corpus, and System

Abstract

Interruption detection is a new and challenging task in the field of speech processing. This article presents a comprehensive study of automatic speech interruption detection, from the definition of the task to the assembly of a specialized corpus and the development of an initial baseline system. We provide three main contributions. Firstly, we define the task, taking into account the nuanced nature of interruptions within spontaneous conversations. Secondly, we introduce a new corpus of conversational data, annotated for interruptions, to facilitate research in this domain. This corpus serves as a valuable resource for evaluating and advancing interruption detection techniques. Lastly, we present a first baseline system, which uses speech processing methods to automatically identify interruptions in speech, with promising results. In this article, we start from theoretical notions of interruption and derive a simplified version of this notion based on overlapped speech detection. Our findings can serve not only as a foundation for further research in the field but also as a benchmark for assessing future advances in automatic speech interruption detection.

Main file
LREC2024.pdf (1.42 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-04576488, version 1 (16-05-2024)

Identifiers

  • HAL Id: hal-04576488, version 1

Cite

Martin Lebourdais, Marie Tahon, Antoine Laurent, Sylvain Meignier. Automatic Speech Interruption Detection: Analysis, Corpus, and System. Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-Coling 2024), ELRA Language Resources Association (ELRA); International Committee on Computational Linguistics (ICCL), May 2024, Torino, Italy. To appear. ⟨hal-04576488⟩