3D Dynamic Spatiotemporal Atlas of the Vocal Tract during Consonant-Vowel Production from 2D Real Time MRI.
Publication record
Publication date
August 2022
Journal
Journal of Imaging
Authors
Identified members of Cancéropôle Est:
Pr FELBLINGER Jacques, Dr VUISSOZ Pierre-André
All authors:
Douros IK, Xie Y, Dourou C, Isaieva K, Vuissoz PA, Felblinger J, Laprie Y
PubMed link
Abstract
In this work, we address the problem of creating a 3D dynamic atlas of the vocal tract that captures the dynamics of the articulators in all three dimensions in order to create a global speaker model independent of speaker-specific characteristics. The core steps of the proposed method are the temporal alignment of the real-time MR images acquired in several sagittal planes and their combination with adaptive kernel regression. As a preprocessing step, a reference space was created to be used in order to remove anatomical information of the speakers and keep only the variability in speech production for the construction of the atlas. The adaptive kernel regression makes the choice of atlas time points independently of the time points of the frames that are used as an input for the construction. The evaluation of this atlas construction method was made by mapping two new speakers to the atlas and by checking how similar the resulting mapped images are. The use of the atlas helps in reducing subject variability. The results show that the use of the proposed atlas can capture the dynamic behavior of the articulators and is able to generalize the speech production process by creating a universal-speaker reference space.
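The kernel regression step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it is a generic Nadaraya-Watson estimator with a Gaussian kernel, and the function name `adaptive_kernel_regression`, the parameter `k`, and the rule that sets the bandwidth to the distance of the k-th nearest frame are all assumptions made for illustration. It only shows how atlas time points can be chosen independently of the time points of the input frames.

```python
import numpy as np

def adaptive_kernel_regression(frame_times, frames, atlas_times, k=3):
    """Estimate atlas frames as Gaussian-weighted averages of input frames.

    frame_times: (n_frames,) time stamps of the already aligned input frames
    frames:      (n_frames, H, W) image stack
    atlas_times: (n_atlas,) time points at which the atlas is evaluated
    k:           hypothetical knob; the bandwidth at each atlas time is the
                 distance to the k-th nearest input frame (an assumption,
                 not the rule used in the paper)
    """
    frame_times = np.asarray(frame_times, dtype=float)
    frames = np.asarray(frames, dtype=float)
    atlas_times = np.asarray(atlas_times, dtype=float)

    # Pairwise temporal distances, shape (n_atlas, n_frames).
    dist = np.abs(atlas_times[:, None] - frame_times[None, :])

    # Adaptive bandwidth: wider where input frames are sparse in time.
    kth = min(k, frame_times.size) - 1
    sigma = np.maximum(np.sort(dist, axis=1)[:, kth], 1e-6)

    # Gaussian weights, normalised so each atlas frame is a convex combination.
    weights = np.exp(-0.5 * (dist / sigma[:, None]) ** 2)
    weights /= weights.sum(axis=1, keepdims=True)

    # Contract over the frame axis: (n_atlas, n_frames) x (n_frames, H, W).
    return np.tensordot(weights, frames, axes=(1, 0))

# Usage: 5 input frames resampled onto 7 atlas time points.
times = np.linspace(0.0, 1.0, 5)
stack = np.random.default_rng(0).random((5, 4, 4))
atlas = adaptive_kernel_regression(times, stack, np.linspace(0.0, 1.0, 7))
```

Because the weights are normalised, every atlas pixel stays within the intensity range of the input frames, and the number of atlas time points is unconstrained by the number of acquired frames.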
Keywords
adaptive Gaussian kernel, generic speaker model, spatiotemporal atlas
Reference
J Imaging. 2022 08 25;8(9):