A method for pitch extraction of speech signals using autocorrelation functions through multiple window lengths

✍ Scribed by Tohru Takagi; Nobumasa Seiyama; Eiichi Miyasaka


Publisher: John Wiley and Sons
Year: 2000
Tongue: English
Weight: 356 KB
Volume: 83
Category: Article
ISSN: 1042-0967


✦ Synopsis


A high-performance pitch extraction method is proposed for real-time sequential speech processing, for use in applications such as speech rate conversion systems. In this method, autocorrelation functions of the input speech waveform are calculated at each analysis point using analysis windows of several lengths, and the largest peak of each autocorrelation function is detected within an appropriate range. The optimum pitch period is then selected by weighting the resulting pitch period candidates according to the number of windows. This selection is carried out independently at each analysis point, without relying on characteristics such as the continuity of the fundamental frequency over the entire speech segment. The method was applied to a large body of speech material, including recordings of different speakers and speech mixed with noise. The tests showed that the proposed method achieves pitch extraction performance superior to that of the cepstrum pitch determination method and the LPC residual autocorrelation method over a wide range of fundamental frequencies and power levels.
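
The general idea described in the synopsis (one autocorrelation per window length at a single analysis point, one peak candidate per window, then a weighted choice among the candidates) can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the synopsis gives no window lengths, search range, or weighting rule, so the values of `window_lengths_ms`, `f0_min`, `f0_max`, and `tol`, the agreement-based scoring, and the function names `autocorr_peak_lag` and `estimate_pitch` are all assumptions made for illustration only.

```python
import numpy as np


def autocorr_peak_lag(frame, min_lag, max_lag):
    """Return (lag, normalized peak height) of the largest autocorrelation
    peak in [min_lag, max_lag], or (None, 0.0) if no peak can be found."""
    frame = frame - frame.mean()
    # Full autocorrelation; keep only the non-negative lags.
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    if ac[0] <= 0.0:  # silent frame, nothing to measure
        return None, 0.0
    ac = ac / ac[0]
    max_lag = min(max_lag, len(ac) - 1)
    if max_lag <= min_lag:
        return None, 0.0
    lag = min_lag + int(np.argmax(ac[min_lag:max_lag + 1]))
    return lag, float(ac[lag])


def estimate_pitch(signal, center, fs,
                   window_lengths_ms=(10, 20, 40),  # assumed values
                   f0_min=60.0, f0_max=400.0,       # assumed search range
                   tol=0.1):                        # assumed agreement tolerance
    """Estimate F0 (Hz) at sample index `center`, or None if unvoiced/silent.

    Each window length contributes one pitch period candidate. A candidate's
    score is the summed peak height of all candidates within `tol` (relative)
    of it. This scoring is an illustrative stand-in for the paper's weighting,
    which the synopsis does not specify.
    """
    min_lag = int(fs / f0_max)
    max_lag = int(fs / f0_min)
    candidates = []
    for ms in window_lengths_ms:
        n = int(fs * ms / 1000)
        start = max(0, center - n // 2)
        frame = np.asarray(signal[start:start + n], dtype=float)
        lag, strength = autocorr_peak_lag(frame, min_lag, max_lag)
        if lag is not None:
            candidates.append((lag, strength))
    if not candidates:
        return None

    best_lag, best_score = None, float("-inf")
    for lag, _ in candidates:
        score = sum(s for other, s in candidates
                    if abs(other - lag) <= tol * lag)
        if score > best_score:
            best_lag, best_score = lag, score
    return fs / best_lag


if __name__ == "__main__":
    fs = 16000
    t = np.arange(fs) / fs
    tone = np.sin(2 * np.pi * 200.0 * t)  # synthetic 200 Hz test tone
    print(estimate_pitch(tone, center=fs // 2, fs=fs))
```

As in the synopsis, the decision here uses only the samples around one analysis point and no information from neighbouring frames. A practical implementation would also need a voiced/unvoiced decision, which this sketch reduces to a check for silent frames.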