Phrase Recognition in Conversational Speech Using Prosodic and Phonemic Information
Shigeki OKAWA Takashi ENDO Tetsunori KOBAYASHI Katsuhiko SHIRAI
IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/25
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
phrase recognition, prosody, DP matching, information criteria
In this paper, a new scheme for phrase recognition in conversational speech is proposed, in which prosodic and phonemic information processing are combined. This approach is employed both to produce candidate phrase boundaries and to discriminate phonemes. The fundamental frequency patterns of continuous utterances are statistically analyzed, and the likelihood of the occurrence of a phrase boundary is calculated for every frame. At the same time, the likelihood of the phonemic characteristics of each frame is obtained using a hierarchical clustering method. These two scores, along with lexical and grammatical constraints, can be used to develop possible word sequences or word lattices that correspond to the continuous speech utterances. Our preliminary experiment shows the feasibility of applying prosody to continuous speech recognition, especially for conversational-style utterances.
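
The combination of the two frame-level scores can be illustrated with a small dynamic-programming sketch. This is not the authors' implementation: the abstract does not give the scoring formula, so the additive log-score, the `weight` parameter, the per-phrase frame scores, and the function name `segment_phrases` are all assumptions made for illustration. Lexical and grammatical constraints are omitted.

```python
from typing import List, Tuple


def segment_phrases(
    frame_scores: List[List[float]],
    boundary_scores: List[float],
    weight: float = 1.0,
) -> List[Tuple[int, int, int]]:
    """Split an utterance into phrases by dynamic programming.

    frame_scores[k][t]  : assumed log-likelihood that frame t belongs to
                          phrase candidate k (stand-in for phonemic scores).
    boundary_scores[t]  : assumed log-likelihood that a phrase boundary
                          falls right after frame t (stand-in for the
                          prosodic score).
    Returns a list of (start_frame, end_frame_exclusive, phrase_index).
    """
    n_frames = len(boundary_scores)
    n_phrases = len(frame_scores)

    # Prefix sums so that the score of any frame span is an O(1) lookup.
    prefix = [[0.0] * (n_frames + 1) for _ in range(n_phrases)]
    for k in range(n_phrases):
        for t in range(n_frames):
            prefix[k][t + 1] = prefix[k][t] + frame_scores[k][t]

    neg_inf = float("-inf")
    best = [neg_inf] * (n_frames + 1)  # best[t]: best score covering frames [0, t)
    back = [None] * (n_frames + 1)     # back[t]: (start, phrase) of the last segment
    best[0] = 0.0

    for end in range(1, n_frames + 1):
        for start in range(end):
            if best[start] == neg_inf:
                continue
            for k in range(n_phrases):
                # Phonemic score of the span plus the weighted prosodic
                # score of ending a phrase at frame end - 1.
                score = (
                    best[start]
                    + prefix[k][end] - prefix[k][start]
                    + weight * boundary_scores[end - 1]
                )
                if score > best[end]:
                    best[end] = score
                    back[end] = (start, k)

    # Trace the best path back from the final frame.
    segments = []
    end = n_frames
    while end > 0 and back[end] is not None:
        start, k = back[end]
        segments.append((start, end, k))
        end = start
    segments.reverse()
    return segments
```

In this sketch, a phrase boundary is favored only where both the prosodic score and the phonemic scores of the adjacent spans agree, which is the intuition behind combining the two information sources. A full system would replace the per-phrase frame scores with phoneme-level matching against a lexicon and would prune hypotheses using grammatical constraints.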