Sparsity-based audio declipping methods: overview, new algorithms, and large-scale evaluation - ENS de Lyon - École normale supérieure de Lyon Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2020

Sparsity-based audio declipping methods: overview, new algorithms, and large-scale evaluation

Résumé

Recent advances in audio declipping have substantially improved the state of the art in certain saturation regimes. Yet, practitioners need guidelines to choose a method, and while existing benchmarks have been instrumental in advancing the field, larger-scale experiments are needed to guide such choices. First, we show that the saturation levels in existing small-scale benchmarks are moderate and call for benchmarks with more perceptually significant saturation levels. We then propose a general algorithmic framework for declipping that covers existing and new combinations of flavors of state-of-the-art techniques exploiting time-frequency sparsity: synthesis vs analysis sparsity, with plain or structured sparsity. Finally, we systematically compare these combinations and state-of-the-art methods. Using a large-scale numerical benchmarks and a smaller scale formal listening test, we provide guidelines for various saturation levels, both for speech and various musical “genres” from the RWC database. The code is made publicly available for the purpose of reproducible research and benchmarking.
Fichier principal
Vignette du fichier
Declip2020.pdf (4.01 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-02611226 , version 1 (18-05-2020)
hal-02611226 , version 2 (30-11-2020)
hal-02611226 , version 3 (28-01-2021)

Identifiants

Citer

Clément Gaultier, Srđan Kitić, Rémi Gribonval, Nancy Bertin. Sparsity-based audio declipping methods: overview, new algorithms, and large-scale evaluation. 2020. ⟨hal-02611226v1⟩
365 Consultations
648 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More