A Robust Speech Communication into Smart Info-Media System

Yoshikazu MIYANAGA  Wataru TAKAHASHI  Shingo YOSHIZAWA  

Publication
IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences   Vol.E96-A   No.11   pp.2074-2080
Publication Date: 2013/11/01
Online ISSN: 1745-1337
DOI: 10.1587/transfun.E96.A.2074
Print ISSN: 0916-8508
Type of Manuscript: INVITED PAPER (Special Section on Smart Multimedia & Communication Systems)
Category: 
Keyword: smart info-media system, robust speech recognition, voice activity detection, speech rejection, ASIC, low power consumption design

Summary: 
This paper introduces the noise-robust speech communication techniques we have developed and describes their implementation in a smart info-media system, i.e., a small robot. The speech communication system consists of automatic speech detection, recognition, and rejection. With automatic speech detection and recognition, an observed speech waveform can be recognized without a manual trigger. With speech rejection, the system accepts only registered speech phrases and rejects all other words. In other words, although an arbitrary input speech waveform can be fed into the system and recognized, the system responds only to the registered phrases. The noise-robust speech processing reduces various kinds of noise in many environments. In addition to the design of noise-robust speech recognition, the paper presents the LSI design of the system. By designing the speech recognizer as an application-specific IC (ASIC), low power consumption and real-time processing can be realized simultaneously. The paper describes the LSI architecture of the system and its performance in several field experiments. The system currently achieves a speech recognition accuracy of 85-99% at 0-20 dB SNR and in echo environments.
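The detection, recognition, and rejection stages named in the summary can be illustrated with a minimal sketch. It is not the authors' method: the summary does not specify the algorithms, so this example assumes a simple frame-energy detector and a score-threshold rejector, and every function name, parameter, and threshold below is hypothetical.

```python
import math

def frame_energy_db(frame):
    """Mean power of one frame in dB, floored to avoid log of zero."""
    power = sum(x * x for x in frame) / len(frame)
    return 10.0 * math.log10(max(power, 1e-12))

def detect_speech(samples, frame_len=160, threshold_db=-30.0):
    """Return (start, end) sample indices of the first run of frames whose
    energy exceeds threshold_db, or None if no frame does.
    Assumption: plain energy thresholding stands in for the paper's detector."""
    start = None
    end = None
    for i in range(0, len(samples) - frame_len + 1, frame_len):
        active = frame_energy_db(samples[i:i + frame_len]) > threshold_db
        if active:
            if start is None:
                start = i
            end = i + frame_len
        elif start is not None:
            break  # the first active run has ended
    return None if start is None else (start, end)

def recognize_or_reject(scores, accept_threshold=0.6):
    """scores maps each registered phrase to a similarity in [0, 1].
    Return the best-scoring phrase, or None when even the best score falls
    below accept_threshold (the input is rejected as unregistered).
    Assumption: the scores come from some recognizer not shown here."""
    if not scores:
        return None
    phrase, best = max(scores.items(), key=lambda kv: kv[1])
    return phrase if best >= accept_threshold else None

# Usage: silence, a loud segment, then silence again.
samples = [0.0] * 320 + [0.5] * 480 + [0.0] * 320
segment = detect_speech(samples)
result = recognize_or_reject({"hello": 0.9, "stop": 0.4})
```

The detector here removes the need for a manual trigger, and the rejector keeps the system from responding to phrases that are not registered. A practical design would add noise-robust features and a hangover period to the detector.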