A comparison of high precision F0 extraction algorithms for sustained vowels

Perturbation analysis of sustained vowel waveforms is used routinely in the clinical evaluation of pathological voices and in monitoring patient progress during treatment. Accurate estimation of voice fundamental frequency (F0) is essential for accurate perturbation analysis. Several algorithms have...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Journal of speech, language, and hearing research Ročník 42; číslo 1; s. 112 - 126
Hlavní autoři: Parsa, V, Jamieson, D G
Médium: Journal Article
Jazyk:angličtina
Vydáno: United States 01.02.1999
Témata:
ISSN:1092-4388
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Perturbation analysis of sustained vowel waveforms is used routinely in the clinical evaluation of pathological voices and in monitoring patient progress during treatment. Accurate estimation of voice fundamental frequency (F0) is essential for accurate perturbation analysis. Several algorithms have been proposed for fundamental frequency extraction. To be appropriate for clinical use, a key consideration is that an F0 extraction algorithm be robust to such extraneous factors as the presence of noise and modulations in voice frequency and amplitude that are commonly associated with the voice pathologies under study. This work examines the performance of seven F0 algorithms, based on the average magnitude difference function (AMDF), the input autocorrelation function (AC), the autocorrelation function of the center-clipped signal (ACC), the autocorrelation function of the inverse filtered signal (IFAC), the signal cepstrum (CEP), the Harmonic Product Spectrum (HPS) of the signal, and the waveform matching function (WM) respectively. These algorithms were evaluated using sustained vowel samples collected from normal and pathological subjects. The effect of background noise and of frequency and amplitude modulations on these algorithms was also investigated, using synthetic vowel waveforms.
Bibliografie:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ObjectType-Article-1
ObjectType-Feature-2
ISSN:1092-4388
DOI:10.1044/jslhr.4201.112