Knowledge-Based Manner Class Segmentation Based on the Acoustic Event and Landmark Detection Algorithm

Jung-In LEE  Jeung-Yoon CHOI  Hong-Goo KANG  

IEICE TRANSACTIONS on Information and Systems   Vol.E97-D   No.6   pp.1682-1685
Publication Date: 2014/06/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.E97.D.1682
Type of Manuscript: LETTER
Category: Speech and Hearing
speech recognition,  speech segmentation,  acoustic events,  landmark detection,  

Full Text: PDF(2.3MB)>>
Buy this Article

There have been steady demands for a speech segmentation method to handle various speech applications. Conventional segmentation algorithms show reliable performance but they require a sufficient training database. This letter proposes a manner class segmentation method based on the acoustic event and landmark detection used in the knowledge-based speech recognition system. Measurements of sub-band abruptness and additional parameters are used to detect the acoustic events. Candidates of manner classes are segmented from the acoustic events and determined based on the knowledge of acoustic phonetics and acoustic parameters. Manners of vowel/glide, nasal, fricative, stop burst, stop closure, and silence are segmented in this system. In total, 71% of manner classes are correctly segmented with 20-ms error boundaries.