A LSP Analysis-Synthesis Method on Mel Frequency Scale Combined with Linear One

Shuuichi ARAI  Arata MIYAUCHI  Shinji OZAWA  

IEICE TRANSACTIONS (1976-1990)   Vol.E71   No.7   pp.648-653
Publication Date: 1988/07/25
Online ISSN: 
Print ISSN: 0000-0000
Type of Manuscript: PAPER
Category: Speech

Full Text: PDF>>
Buy this Article

In general, the analysis-synthesis systems are constructed on a linear frequency scale. On the other hand, the frequency resolution of human hearing system have non-linear characteristics. So, it is interesting to study about the analysis-synthesis system on such a non-linear frequency scale like MEL scale. And it is well known that LSP analysis-synthesis method is superior to LPC or PARCOR method in frame rate and quantization characteristics. In this paper, we describe an LSP analysis-synthesis system on MEL frequency scale. At first, we propose the way to obtain LSP parameters on Mel frequency scale (Mel LSP parameters) from the speech signal in linear time domain. Next we propose how to construct the analysis and synthesis filters in linear time domain using the MEL LSP parameters. Furthermore, we combine this system with the ordinary LSP analysis-synthesis system to improve the quality of the synthetic speech. We carried out some experiments to make clear the characteristics of the combined system. The results of tests show that the quality of synthetic speech with the combined system is higher than that with the ordinary LSP system and that with the MEL LSP system on condition that total prediction order is 10. Through the further experiments, we confirm that the synthetic speech quality with the combined system is as good as the that with the standard LSP system at prediction order 12.