Résumé
Machine listening systems often rely on fixed taxonomies to organize and label audio data, key for training and evaluating deep neural networks (DNNs) and other supervised algorithms. However, such taxonomies face significant constraints: they are composed of application-dependent predefined categories, which hinders the integration of new or varied sounds, and exhibits limited cross-dataset compatibility due to inconsistent labeling standards. To overcome these limitations, we introduce SALT: Standardized Audio event Label Taxonomy. Building upon the hierarchical structure of AudioSet's ontology, our taxonomy extends and standardizes labels across 24 publicly available environmental sound datasets, allowing the mapping of class labels from diverse datasets to a unified system. Our proposal comes with a new Python package designed for navigating and utilizing this taxonomy, easing cross-dataset label searching and hierarchical exploration. Notably, our package allows effortless data aggregation from diverse sources, hence easy experimentation with combined datasets.
Origine | Fichiers produits par l'(les) auteur(s) |
---|
Paraskevas Stamatiadis : Connectez-vous pour contacter le contributeur
https://hal.science/hal-04695595
Soumis le : jeudi 12 septembre 2024-14:18:43
Dernière modification le : mercredi 23 octobre 2024-10:30:04
Dates et versions
- HAL Id : hal-04695595 , version 1