Comparison of Two Speech and Audio Coders at 8 kb/s from the Viewpoints of Coding Scheme and Quality

Nobuhiko KITAWAKI  Takehiro MORIYA  Takao KANEKO  Naoki IWAKAMI  

Publication
IEICE TRANSACTIONS on Communications   Vol.E81-B   No.11   pp.2007-2012
Publication Date: 1998/11/25
Online ISSN: 
DOI: 
Print ISSN: 0916-8516
Type of Manuscript: Special Section PAPER (Special Issue on Performance and Quality of Service (QoS) of Multimedia Networks)
Category: Media Management
Keyword: 
TwinVQ,  CS-ACELP,  G. 729,  coding quality,  Internet,  

Full Text: PDF>>
Buy this Article




Summary: 
Low bit-rate speech and audio codings are key technologies for multimedia communications. A number of coding scheme have been developed for various applications. In Internet application, good speech and audio quality at very low bit-rate (8-16 kb/s) is valuable. Two recently proposed speech and audio-coding schemes, CS-ACELP (Conjugate Structure Algebraic Code Excited Linear Prediction, standardized by the ITU-T in Recommendation G. 729) and TwinVQ (Transform-domain Weighted INterleave Vector Quantization, one of the candidates for MPEG-4 audio) were compared from the viewpoints of coding schemes and quality. Although there are significant differences in their basic structures and frame lengths, this paper describes that both use the same compression techniques, such as LPC (Linear Predictive Coding)-analysis pitch-period estimation and vector quantization. While CS-ACELP provides toll quality for speech at 8 kb/s, the quality it provides for music signals is insufficient. The TwinVQ transform coder is based on LPC and vector quantization and is also capable of operating at 8 kb/s. Evaluation of these two schemes in terms of their fundamental technologies, quality, delay, and complexity showed that the quality of TwinVQ for music signals is better than that of CS-ACELP, and that the quality of CS-ACELP is better for speech signals. Therefore, TwinVQ may be better suited for one-directional Internet applications, and CS-ACELP may be better for two-directional communication.