Keyword : speech recognition


Development of the “VoiceTra” Multi-Lingual Speech Translation System
Shigeki MATSUDA Teruaki HAYASHI Yutaka ASHIKARI Yoshinori SHIGA Hidenori KASHIOKA Keiji YASUDA Hideo OKUMA Masao UCHIYAMA Eiichiro SUMITA Hisashi KAWAI Satoshi NAKAMURA 
Publication:   
Publication Date: 2017/04/01
Vol. E100-D  No. 4 ; pp. 621-632
Type of Manuscript:  INVITED PAPER (Special Section on Award-winning Papers)
Category: 
Keyword: 
speech translationstatistical machine translationspeech recognitionspeech synthesis
 Summary | Full Text:PDF(1.2MB)

Speeding up Deep Neural Networks in Speech Recognition with Piecewise Quantized Sigmoidal Activation Function
Anhao XING Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/10/01
Vol. E99-D  No. 10 ; pp. 2558-2561
Type of Manuscript:  Special Section LETTER (Special Section on Recent Advances in Machine Learning for Spoken Language Processing)
Category: Acoustic modeling
Keyword: 
deep neural networksspeech recognitionactivation functionfixed-point quantization
 Summary | Full Text:PDF(151.4KB)

Multi-Task Learning in Deep Neural Networks for Mandarin-English Code-Mixing Speech Recognition
Mengzhe CHEN Jielin PAN Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/10/01
Vol. E99-D  No. 10 ; pp. 2554-2557
Type of Manuscript:  Special Section LETTER (Special Section on Recent Advances in Machine Learning for Spoken Language Processing)
Category: Acoustic modeling
Keyword: 
multi-task learningdeep neural networkMandarin-English code mixingspeech recognition
 Summary | Full Text:PDF(205.3KB)

Error Correction Using Long Context Match for Smartphone Speech Recognition
Yuan LIANG Koji IWANO Koichi SHINODA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2015/11/01
Vol. E98-D  No. 11 ; pp. 1932-1942
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionerror correctionmultimodal interfaceword confusion networkcontext match
 Summary | Full Text:PDF(674KB)

Automatic Lecture Transcription Based on Discriminative Data Selection for Lightly Supervised Acoustic Model Training
Sheng LI Yuya AKITA Tatsuya KAWAHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2015/08/01
Vol. E98-D  No. 8 ; pp. 1545-1552
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionacoustic modellightly supervised traininglecture transcription
 Summary | Full Text:PDF(1.1MB)

One-Step Error Detection and Correction Approach for Voice Word Processor
Junhwi CHOI Seonghan RYU Kyusong LEE Gary Geunbae LEE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2015/08/01
Vol. E98-D  No. 8 ; pp. 1517-1525
Type of Manuscript:  PAPER
Category: Artificial Intelligence, Data Mining
Keyword: 
speech recognitionnatural language processinglanguages and software systems
 Summary | Full Text:PDF(545KB)

Speaker Adaptation Based on PPCA of Acoustic Models in a Two-Way Array Representation
Yongwon JEONG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/08/01
Vol. E97-D  No. 8 ; pp. 2200-2204
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
expectation-maximizationprobabilistic principal component analysisspeaker adaptationspeech recognition
 Summary | Full Text:PDF(90.3KB)

Adaptation of Acoustic Models in Joint Speaker and Noise Space Using Bilinear Models
Yongwon JEONG Hyung Soon KIM 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/08/01
Vol. E97-D  No. 8 ; pp. 2195-2199
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
bilinear modeleigenvoice speaker adaptationenvironment adaptationspeaker adaptationspeech recognition
 Summary | Full Text:PDF(100.3KB)

Knowledge-Based Manner Class Segmentation Based on the Acoustic Event and Landmark Detection Algorithm
Jung-In LEE Jeung-Yoon CHOI Hong-Goo KANG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/06/01
Vol. E97-D  No. 6 ; pp. 1682-1685
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
speech recognitionspeech segmentationacoustic eventslandmark detection
 Summary | Full Text:PDF(2.3MB)

Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages
Van Hai DO Xiong XIAO Eng Siong CHNG Haizhou LI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/02/01
Vol. E97-D  No. 2 ; pp. 285-295
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionunder-resourced languagecross-lingual LVCSRcontext-dependentphone mapping
 Summary | Full Text:PDF(474.6KB)

Discriminative Approach to Build Hybrid Vocabulary for Conversational Telephone Speech Recognition of Agglutinative Languages
Xin LI Jielin PAN Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/11/01
Vol. E96-D  No. 11 ; pp. 2478-2482
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
agglutinative languagesspeech recognitionsub-wordsdiscriminative learninghybrid system
 Summary | Full Text:PDF(554.1KB)

Speaker Adaptation Based on PARAFAC2 of Transformation Matrices for Continuous Speech Recognition
Yongwon JEONG Sangjun LIM Young Kuk KIM Hyung Soon KIM 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/09/01
Vol. E96-D  No. 9 ; pp. 2152-2155
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
maximum likelihood linear regressionparallel factor analysisPARAFAC2speaker adaptationspeech recognition
 Summary | Full Text:PDF(135.3KB)

Speaker Adaptation in Sparse Subspace of Acoustic Models
Yongwon JEONG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/06/01
Vol. E96-D  No. 6 ; pp. 1402-1405
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
eigenvoice speaker adaptationrobust speech recognitionsparse principal component analysisspeaker adaptationspeech recognition
 Summary | Full Text:PDF(397.7KB)

A Bayesian Framework Using Multiple Model Structures for Speech Recognition
Sayaka SHIOTA Kei HASHIMOTO Yoshihiko NANKAKU Keiichi TOKUDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/04/01
Vol. E96-D  No. 4 ; pp. 939-948
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionacoustic modelingBayesian approachmodel structure integrationdeterministic annealing
 Summary | Full Text:PDF(548.8KB)

Refinement of Landmark Detection and Extraction of Articulator-Free Features for Knowledge-Based Speech Recognition
Jung-In LEE Jeung-Yoon CHOI Hong-Goo KANG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/03/01
Vol. E96-D  No. 3 ; pp. 746-749
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
speech recognitionacoustic eventslandmark detection
 Summary | Full Text:PDF(336.4KB)

Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition
Arata ITOH Sunao HARA Norihide KITAOKA Kazuya TAKEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/10/01
Vol. E95-D  No. 10 ; pp. 2479-2485
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionacoustic model trainingpseudo speakersfeature generationMLLR
 Summary | Full Text:PDF(724.6KB)

Class-Based N-Gram Language Model for New Words Using Out-of-Vocabulary to In-Vocabulary Similarity
Welly NAPTALI Masatoshi TSUCHIYA Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/09/01
Vol. E95-D  No. 9 ; pp. 2308-2317
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
out-of-vocabularyclass-based n-gramlanguage modeladjusted perplexityspeech recognition
 Summary | Full Text:PDF(365.4KB)

Hidden Conditional Neural Fields for Continuous Phoneme Speech Recognition
Yasuhisa FUJII Kazumasa YAMAMOTO Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/08/01
Vol. E95-D  No. 8 ; pp. 2094-2104
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
hidden conditional neural fieldshidden conditional random fieldshidden Markov modelspeech recognitiondeep learning
 Summary | Full Text:PDF(324.9KB)

A VLSI Architecture with Multiple Fast Store-Based Block Parallel Processing for Output Probability and Likelihood Score Computations in HMM-Based Isolated Word Recognition
Kazuhiro NAKAMURA Ryo SHIMAZAKI Masatoshi YAMAMOTO Kazuyoshi TAKAGI Naofumi TAKAGI 
Publication:   IEICE TRANSACTIONS on Electronics
Publication Date: 2012/04/01
Vol. E95-C  No. 4 ; pp. 456-467
Type of Manuscript:  Special Section PAPER (Special Section on Solid-State Circuit Design – Architecture, Circuit, Device and Design Methodology)
Category: 
Keyword: 
speech recognitionhidden Markov model (HMM)VLSI architectureisolated word recognition
 Summary | Full Text:PDF(2.3MB)

Decision Tree-Based Acoustic Models for Speech Recognition with Improved Smoothness
Masami AKAMINE Jitendra AJMERA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/11/01
Vol. E94-D  No. 11 ; pp. 2250-2258
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionacoustic modelingdecision treesprobability estimationlikelihood computation
 Summary | Full Text:PDF(495.5KB)

Enhancing Eigenspace-Based MLLR Speaker Adaptation Using a Fuzzy Logic Learning Control Scheme
Ing-Jr DING 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/10/01
Vol. E94-D  No. 10 ; pp. 1909-1916
Type of Manuscript:  Special Section PAPER (Special Section on Information-Based Induction Sciences and Machine Learning)
Category: 
Keyword: 
speech recognitionspeaker adaptationHMMEigen-MLLRfuzzy control
 Summary | Full Text:PDF(225.7KB)

VLSI Architecture of GMM Processing and Viterbi Decoder for 60,000-Word Real-Time Continuous Speech Recognition
Hiroki NOGUCHI Kazuo MIURA Tsuyoshi FUJINAGA Takanobu SUGAHARA Hiroshi KAWAGUCHI Masahiko YOSHIMOTO 
Publication:   IEICE TRANSACTIONS on Electronics
Publication Date: 2011/04/01
Vol. E94-C  No. 4 ; pp. 458-467
Type of Manuscript:  Special Section PAPER (Special Section on Circuits and Design Techniques for Advanced Large Scale Integration)
Category: 
Keyword: 
speech recognitionhidden Markov model (HMM)VLSI architecture
 Summary | Full Text:PDF(1.8MB)

Bayesian Context Clustering Using Cross Validation for Speech Recognition
Kei HASHIMOTO Heiga ZEN Yoshihiko NANKAKU Akinobu LEE Keiichi TOKUDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/03/01
Vol. E94-D  No. 3 ; pp. 668-678
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
Bayesian approachspeech recognitionHMMcontext clusteringcross validation
 Summary | Full Text:PDF(453.4KB)

Estimation of Speech Intelligibility Using Speech Recognition Systems
Yusuke TAKANO Kazuhiro KONDO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/12/01
Vol. E93-D  No. 12 ; pp. 3368-3376
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
objective estimationspeech intelligibilityspeech recognitionJapanese Diagnostic Rhyme Testnoise adaptation
 Summary | Full Text:PDF(1.8MB)

Learning Speech Variability in Discriminative Acoustic Model Adaptation
Shoei SATO Takahiro OKU Shinichi HOMMA Akio KOBAYASHI Toru IMAI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9 ; pp. 2370-2378
Type of Manuscript:  Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Adaptation
Keyword: 
speech recognitionspeech variabilitydiscriminative trainingacoustic model
 Summary | Full Text:PDF(666.8KB)

Acoustic Model Adaptation for Speech Recognition
Koichi SHINODA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9 ; pp. 2348-2362
Type of Manuscript:  INVITED PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: 
Keyword: 
speech recognitionacoustic model adaptationhidden Markov models
 Summary | Full Text:PDF(311.2KB)

Intentional Voice Command Detection for Trigger-Free Speech Interface
Yasunari OBUCHI Takashi SUMIYOSHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9 ; pp. 2440-2450
Type of Manuscript:  Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Robust Speech Recognition
Keyword: 
speech recognitionspeech/non-speech discriminationVADutterance verificationemotion recognitionhands-freetrigger-freeIVCD
 Summary | Full Text:PDF(833.5KB)

Enhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition
Yanqing SUN Yu ZHOU Qingwei ZHAO Pengyuan ZHANG Fuping PAN Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9 ; pp. 2431-2439
Type of Manuscript:  Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Robust Speech Recognition
Keyword: 
OOVspeech recognitionconfidence measureentropy informationphoneme-level posterior
 Summary | Full Text:PDF(1.2MB)

Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition
Tetsuo KOSAKA Yuui TAKEDA Takashi ITO Masaharu KATO Masaki KOHDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9 ; pp. 2363-2369
Type of Manuscript:  Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Adaptation
Keyword: 
speech recognitionspeaker adaptationspeaker-class modelLVCSRcorpus of spontaneous Japanese
 Summary | Full Text:PDF(393.6KB)

A New Subband-Weighted MVDR-Based Front-End for Robust Speech Recognition
Sanaz SEYEDIN Seyed Mohammad AHADI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/08/01
Vol. E93-D  No. 8 ; pp. 2252-2261
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
feature extractionrobust MVDR power spectral estimationspeech recognition
 Summary | Full Text:PDF(584.8KB)

Novel Confidence Feature Extraction Algorithm Based on Latent Topic Similarity
Wei CHEN Gang LIU Jun GUO Shinichiro OMACHI Masako OMACHI Yujing GUO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/08/01
Vol. E93-D  No. 8 ; pp. 2243-2251
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionconfidence annotationconfidence featurelatent topic similarity
 Summary | Full Text:PDF(438.6KB)

Acoustic Feature Transformation Combining Average and Maximum Classification Error Minimization Criteria
Makoto SAKAI Norihide KITAOKA Kazuya TAKEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/07/01
Vol. E93-D  No. 7 ; pp. 2005-2008
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
speech recognitiondimensionality reductionBayes error
 Summary | Full Text:PDF(131.9KB)

Acoustic Feature Transformation Based on Discriminant Analysis Preserving Local Structure for Speech Recognition
Makoto SAKAI Norihide KITAOKA Kazuya TAKEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/05/01
Vol. E93-D  No. 5 ; pp. 1244-1252
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionfeature extractionmultidimensional signal processing
 Summary | Full Text:PDF(214.2KB)

Speech Enhancement Using a Square Microphone Array in the Presence of Directional and Diffuse Noise
Tetsuji OGAWA Shintaro TAKADA Kenzo AKAGIRI Tetsunori KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2010/05/01
Vol. E93-A  No. 5 ; pp. 926-935
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
directional noise reductiondiffuse noise reductionsquare microphone arrayspeech recognitionmobile devices
 Summary | Full Text:PDF(2.1MB)

Evaluation of Combinational Use of Discriminant Analysis-Based Acoustic Feature Transformation and Discriminative Training
Makoto SAKAI Norihide KITAOKA Yuya HATTORI Seiichi NAKAGAWA Kazuya TAKEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/02/01
Vol. E93-D  No. 2 ; pp. 395-398
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
speech recognitionfeature extractiondiscriminative training
 Summary | Full Text:PDF(78.2KB)

Cepstral Domain Feature Extraction Utilizing Entropic Distance-Based Filterbank
Youngjoo SUH Hoirin KIM 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/02/01
Vol. E93-D  No. 2 ; pp. 392-394
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
cepstral featureentropic distancefilterbankspeech recognition
 Summary | Full Text:PDF(198.4KB)

A VLSI Architecture for Output Probability Computations of HMM-Based Recognition Systems with Store-Based Block Parallel Processing
Kazuhiro NAKAMURA Masatoshi YAMAMOTO Kazuyoshi TAKAGI Naofumi TAKAGI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/02/01
Vol. E93-D  No. 2 ; pp. 300-305
Type of Manuscript:  PAPER
Category: VLSI Systems
Keyword: 
speech recognitionhidden Markov model (HMM)VLSI architecture
 Summary | Full Text:PDF(549KB)

A Single-Chip Speech Dialogue Module and Its Evaluation on a Personal Robot, PaPeRo-Mini
Miki SATO Toru IWASAWA Akihiko SUGIYAMA Toshihiro NISHIZAWA Yosuke TAKANO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2010/01/01
Vol. E93-A  No. 1 ; pp. 261-271
Type of Manuscript:  PAPER
Category: Digital Signal Processing
Keyword: 
speech recognitionDOA estimationnoise cancellationmicrophone arrayecho cancellationspeech dialogue module
 Summary | Full Text:PDF(1MB)

Effective Prediction of Errors by Non-native Speakers Using Decision Tree for Speech Recognition-Based CALL System
Hongcui WANG Tatsuya KAWAHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2009/12/01
Vol. E92-D  No. 12 ; pp. 2462-2468
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionCALLgrammar networkdecision tree
 Summary | Full Text:PDF(480.7KB)

Robust Feature Extraction Using Variable Window Function in Autocorrelation Domain for Speech Recognition
Sangho LEE Jeonghyun HA Jaekeun HONG 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2009/11/01
Vol. E92-A  No. 11 ; pp. 2917-2921
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
variable windowAMFCCspeech recognitionrobust feature
 Summary | Full Text:PDF(584.9KB)

Voice Activity Detection Based on High Order Statistics and Online EM Algorithm
David COURNAPEAU Tatsuya KAWAHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/12/01
Vol. E91-D  No. 12 ; pp. 2854-2861
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionvoice activity detectionhigh order statisticsonline EM
 Summary | Full Text:PDF(1008KB)

A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System
Keiichiro OURA Heiga ZEN Yoshihiko NANKAKU Akinobu LEE Keiichi TOKUDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/11/01
Vol. E91-D  No. 11 ; pp. 2693-2700
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionhidden Markov modelhidden semi-Markov modelweighted finite-state transducer
 Summary | Full Text:PDF(531.3KB)

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech
Fengpei GE Changliang LIU Jian SHAO Fuping PAN Bin DONG Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/10/01
Vol. E91-D  No. 10 ; pp. 2485-2492
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
CALLspeech recognitionHLDAspeaker-dependent CMNe-learning
 Summary | Full Text:PDF(734.2KB)

HMM-Based Mask Estimation for a Speech Recognition Front-End Using Computational Auditory Scene Analysis
Ji Hun PARK Jae Sam YOON Hong Kook KIM 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/09/01
Vol. E91-D  No. 9 ; pp. 2360-2364
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
computational auditory scene analysismask estimationhidden Markov modelspeech recognition
 Summary | Full Text:PDF(344.1KB)

Multichannel Speech Enhancement Based on Generalized Gamma Prior Distribution with Its Online Adaptive Estimation
Tran HUY DAT Kazuya TAKEDA Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 439-447
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Speech Enhancement
Keyword: 
multi-channel speech enhancementspeech recognitiongeneralized gamma distributionmoment matching
 Summary | Full Text:PDF(805.1KB)

Mutual Information Based Dynamic Integration of Multiple Feature Streams for Robust Real-Time LVCSR
Shoei SATO Akio KOBAYASHI Kazuo ONOE Shinichi HOMMA Toru IMAI Tohru TAKAGI Tetsunori KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 815-824
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionstream integrationentropymutual informationactive hypotheses
 Summary | Full Text:PDF(777.1KB)

Local Peak Enhancement for In-Car Speech Recognition in Noisy Environment
Osamu ICHIKAWA Takashi FUKUDA Masafumi NISHIMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 635-639
Type of Manuscript:  Special Section LETTER (Special Section on Robust Speech Processing in Realistic Environments)
Category: 
Keyword: 
harmonicsformantspeech enhancementnoise reductionspeech recognition
 Summary | Full Text:PDF(1MB)

Bi-Spectral Acoustic Features for Robust Speech Recognition
Kazuo ONOE Shoei SATO Shinichi HOMMA Akio KOBAYASHI Toru IMAI Tohru TAKAGI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 631-634
Type of Manuscript:  Special Section LETTER (Special Section on Robust Speech Processing in Realistic Environments)
Category: 
Keyword: 
bi-spectrum non-Gaussianityphase informationspeech recognition
 Summary | Full Text:PDF(394.2KB)

An Improved Greedy Search Algorithm for the Development of a Phonetically Rich Speech Corpus
Jin-Song ZHANG Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 615-630
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Corpus
Keyword: 
greedy searchminimum sentence setspeech recognitionspeech corpus
 Summary | Full Text:PDF(1.2MB)

Selection of Optimum Vocabulary and Dialog Strategy for Noise-Robust Spoken Dialog Systems
Akinori ITO Takanobu OBA Takashi KONASHI Motoyuki SUZUKI Shozo MAKINO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 538-548
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: ASR System Architecture
Keyword: 
spoken dialog systemnoisy environmentdialog strategyneural networkspeech recognition
 Summary | Full Text:PDF(809.4KB)

Using Mutual Information Criterion to Design an Efficient Phoneme Set for Chinese Speech Recognition
Jin-Song ZHANG Xin-Hui HU Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 508-513
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Acoustic Modeling
Keyword: 
mutual informationChinese lexical tonestone dependent unitsspeech recognition
 Summary | Full Text:PDF(342.5KB)

Linear Discriminant Analysis Using a Generalized Mean of Class Covariances and Its Application to Speech Recognition
Makoto SAKAI Norihide KITAOKA Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 478-487
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Feature Extraction
Keyword: 
speech recognitionfeature extractionmultidimensional signal processing
 Summary | Full Text:PDF(421.5KB)

Recognizing Reverberant Speech Based on Amplitude and Frequency Modulation
Yotaro KUBO Shigeki OKAWA Akira KUREMATSU Katsuhiko SHIRAI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 448-456
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: ASR under Reverberant Conditions
Keyword: 
speech recognitiontemporal featuretandem approachmultistream combinationreverberant speech
 Summary | Full Text:PDF(1.4MB)

Feature Compensation Employing Multiple Environmental Models for Robust In-Vehicle Speech Recognition
Wooil KIM John H.L. HANSEN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 430-438
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Noisy Speech Recognition
Keyword: 
speech recognitionin-vehicle conditionfeature compensationenvironment transition modelmixture sharing
 Summary | Full Text:PDF(331.8KB)

Noise Suppression Based on Multi-Model Compositions Using Multi-Pass Search with Multi-Label N-gram Models
Takatoshi JITSUHIRO Tomoji TORIYAMA Kiyoshi KOGURE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3 ; pp. 402-410
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Noisy Speech Recognition
Keyword: 
speech recognitionnoise suppressionmodel compositionmulti-pass searchE-Nightingale project
 Summary | Full Text:PDF(2.3MB)

Ears of the Robot: Three Simultaneous Speech Segregation and Recognition Using Robot-Mounted Microphones
Naoya MOCHIKI Tetsuji OGAWA Tetsunori KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/09/01
Vol. E90-D  No. 9 ; pp. 1465-1468
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
robot auditionsound source segregationspeech recognitionSAFIAspectral subtraction
 Summary | Full Text:PDF(772.7KB)

Online Speech Detection and Dual-Gender Speech Recognition for Captioning Broadcast News
Toru IMAI Shoei SATO Shinichi HOMMA Kazuo ONOE Akio KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/08/01
Vol. E90-D  No. 8 ; pp. 1286-1291
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionspeech detectiongender identificationlow latencybroadcast captioning
 Summary | Full Text:PDF(454.4KB)

Dynamic Bayesian Network Inversion for Robust Speech Recognition
Lei XIE Hongwu YANG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/07/01
Vol. E90-D  No. 7 ; pp. 1117-1120
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
speech recognitionhidden Markov modeldynamic Bayesian network
 Summary | Full Text:PDF(198.8KB)

Effective Energy Feature Compensation Using Modified Log-energy Dynamic Range Normalization for Robust Speech Recognition
Yoonjae LEE Hanseok KO 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 2007/06/01
Vol. E90-B  No. 6 ; pp. 1508-1511
Type of Manuscript:  LETTER
Category: Fundamental Theories for Communications
Keyword: 
log-energy dynamic range normalization (ERN)energy-subtractionmodified ERNspeech recognition
 Summary | Full Text:PDF(351.5KB)

Response Time Reduction of Speech Recognizers Using Single Gaussians
Sangbae JEONG Hoirin KIM Minsoo HAHN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/05/01
Vol. E90-D  No. 5 ; pp. 868-871
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
speech recognitionfast likelihood computation
 Summary | Full Text:PDF(194.9KB)

Incremental Language Modeling for Automatic Transcription of Broadcast News
Katsutoshi OHTSUKI Long NGUYEN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/02/01
Vol. E90-D  No. 2 ; pp. 526-532
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionout-of-vocabularylanguage modelbroadcast news
 Summary | Full Text:PDF(176.7KB)

A Systolic FPGA Architecture of Two-Level Dynamic Programming for Connected Speech Recognition
Yong KIM Hong JEONG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/02/01
Vol. E90-D  No. 2 ; pp. 562-568
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionhidden Markov model (HMM)two-level dynamic programming (TLDP)FPGA
 Summary | Full Text:PDF(659KB)

Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics
Randy GOMEZ Tomoki TODA Hiroshi SARUWATARI Kiyohiro SHIKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/02/01
Vol. E90-D  No. 2 ; pp. 554-561
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
HMM-sufficient statisticsunsupervisedrapid adaptationspeech recognition
 Summary | Full Text:PDF(1.5MB)

Feature Compensation with Model-Based Estimation for Noise Masking
Young Joon KIM Nam Soo KIM 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/02/01
Vol. E90-D  No. 2 ; pp. 603-605
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
speech recognitionfeature compensationIMMnoise masking
 Summary | Full Text:PDF(95.9KB)

N-gram Adaptation with Dynamic Interpolation Coefficient Using Information Retrieval Technique
Joon-Ki CHOI Yung-Hwan OH 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/09/01
Vol. E89-D  No. 9 ; pp. 2579-2582
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
language model adaptationadaptation corpusdynamic interpolation coefficientspeech recognition
 Summary | Full Text:PDF(210.5KB)

Verification of Speech Recognition Results Incorporating In-domain Confidence and Discourse Coherence Measures
Ian R. LANE Tatsuya KAWAHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3 ; pp. 931-938
Type of Manuscript:  Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Recognition
Keyword: 
speech recognitionconfidence measureutterance verificationin-domain confidencediscourse coherence
 Summary | Full Text:PDF(936.6KB)

Trigger-Based Language Model Adaptation for Automatic Transcription of Panel Discussions
Carlos TRONCOSO Tatsuya KAWAHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3 ; pp. 1024-1031
Type of Manuscript:  Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Recognition
Keyword: 
speech recognitionlanguage modeltrigger-based language modelTF/IDF
 Summary | Full Text:PDF(536.3KB)

Robust Speech Recognition by Using Compensated Acoustic Scores
Shoei SATO Kazuo ONOE Akio KOBAYASHI Toru IMAI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3 ; pp. 915-921
Type of Manuscript:  Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Recognition
Keyword: 
speech recognitionnoisy environmentacoustic score
 Summary | Full Text:PDF(307.5KB)

Speech Recognition Based on Student's t-Distribution Derived from Total Bayesian Framework
Shinji WATANABE Atsushi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3 ; pp. 970-980
Type of Manuscript:  Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Recognition
Keyword: 
speech recognitiontotal Bayesian framework VBECBayesian predictionstudent's t-distribution
 Summary | Full Text:PDF(574.8KB)

Production-Oriented Models for Speech Recognition
Erik MCDERMOTT Atsushi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3 ; pp. 1006-1014
Type of Manuscript:  Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Recognition
Keyword: 
speech recognitionspeech productionarticulatory modelinglinear dynamical systems
 Summary | Full Text:PDF(566.6KB)

Single-Channel Multiple Regression for In-Car Speech Enhancement
Weifeng LI Katsunobu ITOU Kazuya TAKEDA Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3 ; pp. 1032-1039
Type of Manuscript:  Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Enhancement
Keyword: 
speech enhancementspeech recognitionmulti-layer perceptronmean opinion scorepairwise preference testenvironmental adaptationK-means clustering
 Summary | Full Text:PDF(505.6KB)

Gamma Modeling of Speech Power and Its On-Line Estimation for Statistical Speech Enhancement
Tran Huy DAT Kazuya TAKEDA Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3 ; pp. 1040-1049
Type of Manuscript:  Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Enhancement
Keyword: 
speech enhancementspeech recognitiongamma modelingfourth-order momentMMSEMAPspectral magnitudepowerlog-spectral magnitude
 Summary | Full Text:PDF(576.3KB)

Training Augmented Models Using SVMs
Mark J.F. GALES Martin I. LAYTON 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3 ; pp. 892-899
Type of Manuscript:  INVITED PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: 
Keyword: 
speech recognitionhidden Markov modelssupport vector machinesaugmented statistical models
 Summary | Full Text:PDF(232.9KB)

Non-Audible Murmur (NAM) Recognition
Yoshitaka NAKAJIMA Hideki KASHIOKA Nick CAMPBELL Kiyohiro SHIKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/01/01
Vol. E89-D  No. 1 ; pp. 1-8
Type of Manuscript:  Special Section PAPER (Special Section on the 2004 IEICE Excellent Paper Award)
Category: 
Keyword: 
interfacespeech recognitionNon-Audible Murmur recognitionNAMwearable computing
 Summary | Full Text:PDF(3MB)

Frequency Domain Microphone Array Calibration and Beamforming for Automatic Speech Recognition
Jwu-Sheng HU Chieh-Cheng CHENG 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/09/01
Vol. E88-A  No. 9 ; pp. 2401-2411
Type of Manuscript:  PAPER
Category: Noise and Vibration
Keyword: 
beamformermicrophone arraycalibrationspeech recognitionspeech enhancement
 Summary | Full Text:PDF(1.9MB)

An Adaptive Noise Canceller with Low Signal-Distortion Based on Variable Stepsize Subfilters for Human-Robot Communication
Miki SATO Akihiko SUGIYAMA Shin'ichi OHNAKA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/08/01
Vol. E88-A  No. 8 ; pp. 2055-2061
Type of Manuscript:  Special Section PAPER (Special Section on Papers Selected from the 19th Symposium on Signal Processing)
Category: Digital Signal Processing
Keyword: 
noise cancellerdistortioncrosstalkadaptive filteralgorithmspeech recognitionhuman-robot communication
 Summary | Full Text:PDF(619.7KB)

Simultaneous Adaptation of Echo Cancellation and Spectral Subtraction for In-Car Speech Recognition
Osamu ICHIKAWA Masafumi NISHIMURA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/07/01
Vol. E88-A  No. 7 ; pp. 1732-1738
Type of Manuscript:  Special Section PAPER (Special Section on Multi-channel Acoustic Signal Processing)
Category: Speech Enhancement
Keyword: 
speech enhancementecho cancellernoise reductionspectral subtractionspeech recognition
 Summary | Full Text:PDF(775.1KB)

Adaptive Nonlinear Regression Using Multiple Distributed Microphones for In-Car Speech Recognition
Weifeng LI Chiyomi MIYAJIMA Takanori NISHINO Katsunobu ITOU Kazuya TAKEDA Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/07/01
Vol. E88-A  No. 7 ; pp. 1716-1723
Type of Manuscript:  Special Section PAPER (Special Section on Multi-channel Acoustic Signal Processing)
Category: Speech Enhancement
Keyword: 
speech recognitionsupport vector machinemulti-layer perceptronsignal-to-deviation ratioK-means clusteringadaptive beamforming
 Summary | Full Text:PDF(1MB)

Interface for Barge-in Free Spoken Dialogue System Combining Adaptive Sound Field Control and Microphone Array
Tatsunori ASAI Hiroshi SARUWATARI Kiyohiro SHIKANO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/06/01
Vol. E88-A  No. 6 ; pp. 1613-1618
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
spoken dialogue systembarge-inadaptive sound field controlmicrophone arrayspeech recognition
 Summary | Full Text:PDF(1.3MB)

Bayesian Confidence Scoring and Adaptation Techniques for Speech Recognition
Tae-Yoon KIM Hanseok KO 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 2005/04/01
Vol. E88-B  No. 4 ; pp. 1756-1759
Type of Manuscript:  LETTER
Category: Multimedia Systems for Communications" Multimedia Systems for Communications
Keyword: 
speech recognitionconfidence measureadaptationOOV rejection
 Summary | Full Text:PDF(182.3KB)

Improving Keyword Recognition of Spoken Queries by Combining Multiple Speech Recognizer's Outputs for Speech-driven WEB Retrieval Task
Masahiko MATSUSHITA Hiromitsu NISHIZAKI Takehito UTSURO Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3 ; pp. 472-480
Type of Manuscript:  Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Spoken Language Systems
Keyword: 
speech recognitionmachine learningmultiple LVCSR modelsWEB retrieval
 Summary | Full Text:PDF(1.7MB)

Dialogue Speech Recognition by Combining Hierarchical Topic Classification and Language Model Switching
Ian R. LANE Tatsuya KAWAHARA Tomoko MATSUI Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3 ; pp. 446-454
Type of Manuscript:  Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Spoken Language Systems
Keyword: 
speech recognitiontopic detectiontopic-dependent language modelingsupport vector machinesmulti-domain spoken dialogue
 Summary | Full Text:PDF(578.2KB)

Applying Sparse KPCA for Feature Extraction in Speech Recognition
Amaro LIMA Heiga ZEN Yoshihiko NANKAKU Keiichi TOKUDA Tadashi KITAMURA Fernando G. RESENDE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3 ; pp. 401-409
Type of Manuscript:  Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Feature Extraction and Acoustic Medelings
Keyword: 
kernelsparsityprincipal component analysisfeature extractionspeech recognition
 Summary | Full Text:PDF(582.4KB)

Automatic Generation of Non-uniform and Context-Dependent HMMs Based on the Variational Bayesian Approach
Takatoshi JITSUHIRO Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3 ; pp. 391-400
Type of Manuscript:  Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Feature Extraction and Acoustic Medelings
Keyword: 
speech recognitionacoustic modeltopology trainingSSS algorithmvariational Bayesian approach
 Summary | Full Text:PDF(866.7KB)

Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones
Weifeng LI Tetsuya SHINDE Hiroshi FUJIMURA Chiyomi MIYAJIMA Takanori NISHINO Katunobu ITOU Kazuya TAKEDA Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3 ; pp. 384-390
Type of Manuscript:  Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Feature Extraction and Acoustic Medelings
Keyword: 
speech recognitionmicrophone arraysadaptive beamformingsignal-to-deviation ratiomultiple regression
 Summary | Full Text:PDF(568.7KB)

Selection of Shared-State Hidden Markov Model Structure Using Bayesian Criterion
Shinji WATANABE Yasuhiro MINAMI Atsushi NAKAMURA Naonori UEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/01/01
Vol. E88-D  No. 1 ; pp. 1-9
Type of Manuscript:  Special Section PAPER (Special Section on the 2003 IEICE Excellent Paper Award)
Category: 
Keyword: 
speech recognitionshared-state HMMmodel structure selectionvariational BayesBayesian criterion
 Summary | Full Text:PDF(521.1KB)

On the Use of Kernel PCA for Feature Extraction in Speech Recognition
Amaro LIMA Heiga ZEN Yoshihiko NANKAKU Chiyomi MIYAJIMA Keiichi TOKUDA Tadashi KITAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/12/01
Vol. E87-D  No. 12 ; pp. 2802-2811
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
kernelfeature spaceprincipal component analysisfeature extractionspeech recognition
 Summary | Full Text:PDF(425.6KB)

Automatic Generation of Non-uniform HMM Topologies Based on the MDL Criterion
Takatoshi JITSUHIRO Tomoko MATSUI Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/08/01
Vol. E87-D  No. 8 ; pp. 2121-2129
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionacoustic modeltopology trainingMDL criterionSSS algorithm
 Summary | Full Text:PDF(1.2MB)

Cepstral Amplitude Range Normalization for Noise Robust Speech Recognition
Shingo YOSHIZAWA Noboru HAYASAKA Naoya WADA Yoshikazu MIYANAGA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/08/01
Vol. E87-D  No. 8 ; pp. 2130-2137
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionrobust featurescepstrumNoisex92
 Summary | Full Text:PDF(371KB)

A Statistical Method of Evaluating Pronunciation Proficiency for English Words Spoken by Japanese
Seiichi NAKAGAWA Naoki NAKAMURA Kazumasa MORI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/07/01
Vol. E87-D  No. 7 ; pp. 1917-1922
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
CALLevaluation of pronunciation proficiencyEnglish learningspeech recognition
 Summary | Full Text:PDF(463.1KB)

A Spoken Dialogue Interface for TV Operations Based on Data Collected by Using WOZ Method
Jun GOTO Kazuteru KOMINE Masaru MIYAZAKI Yeun-Bae KIM Noriyoshi URATANI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/06/01
Vol. E87-D  No. 6 ; pp. 1397-1404
Type of Manuscript:  Special Section PAPER (Special Section on Human Communication I)
Category: 
Keyword: 
spoken dialogue interfaceTV operationWOZspeech recognition
 Summary | Full Text:PDF(2.6MB)

One-Pass Semi-Dynamic Network Decoding Using a Subnetwork Caching Model for Large Vocabulary Continuous Speech Recongnition
Dong-Hoon AHN Minhwa CHUNG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/05/01
Vol. E87-D  No. 5 ; pp. 1164-1174
Type of Manuscript:  Special Section PAPER (Special Section on Speech Dynamics by Ear, Eye, Mouth and Machine)
Category: 
Keyword: 
speech recognitionsemi-dynamic network decodingsubnetwork cachingtail-sharing algorithm
 Summary | Full Text:PDF(1.5MB)

Improved Phoneme-History-Dependent Search Method for Large-Vocabulary Continuous-Speech Recognition
Takaaki HORI Yoshiaki NODA Shoichi MATSUNAGA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/06/01
Vol. E86-D  No. 6 ; pp. 1059-1067
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionsearch algorithmmulti-pass searchword graphphoneme-history-dependent search
 Summary | Full Text:PDF(403.4KB)

Speaker Tracking for Hands-Free Continuous Speech Recognition in Noise Based on a Spectrum-Entropy Beamforming Method
George NOKAS Evangelos DERMATAS 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/04/01
Vol. E86-D  No. 4 ; pp. 755-758
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
speaker trackingmicrophone arrayspectrum entropyspeech recognitionspeaker beam-former
 Summary | Full Text:PDF(167KB)

Filter Bank Subtraction for Robust Speech Recognition
Kazuo ONOE Hiroyuki SEGI Takeshi KOBAYAKAWA Shoei SATO Shinichi HOMMA Toru IMAI Akio ANDO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3 ; pp. 483-488
Type of Manuscript:  Special Section PAPER (Special Issue on Speech Information Processing)
Category: Robust Speech Recognition and Enhancement
Keyword: 
filter bankspectral subtractionspeech recognitionnoise
 Summary | Full Text:PDF(347.3KB)

Continuous Speech Recognition Using an On-Line Speaker Adaptation Method Based on Automatic Speaker Clustering
Wei ZHANG Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3 ; pp. 464-473
Type of Manuscript:  Special Section PAPER (Special Issue on Speech Information Processing)
Category: Speech and Speaker Recognition
Keyword: 
speaker adaptationspeech recognitionspeaker clusteringMLLRMAP
 Summary | Full Text:PDF(910.3KB)

Language Modeling Using Patterns Extracted from Parse Trees for Speech Recognition
Takatoshi JITSUHIRO Hirofumi YAMAMOTO Setsuo YAMADA Genichiro KIKUI Yoshinori SAGISAKA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3 ; pp. 446-453
Type of Manuscript:  Special Section PAPER (Special Issue on Speech Information Processing)
Category: Speech and Speaker Recognition
Keyword: 
speech recognitionlanguage modeln-gram modelparserpattern model
 Summary | Full Text:PDF(913.3KB)

Face-to-Talk: Audio-Visual Speech Detection for Robust Speech Recognition in Noisy Environment
Kazumasa MURAI Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3 ; pp. 505-513
Type of Manuscript:  Special Section PAPER (Special Issue on Speech Information Processing)
Category: Robust Speech Recognition and Enhancement
Keyword: 
speech recognitionspeech section detectionmulti-modalityface detection"face-to-talk"
 Summary | Full Text:PDF(682.8KB)

Speech Enhancement by Profile Fitting Method
Osamu ICHIKAWA Tetsuya TAKIGUCHI Masafumi NISHIMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3 ; pp. 514-521
Type of Manuscript:  Special Section PAPER (Special Issue on Speech Information Processing)
Category: Robust Speech Recognition and Enhancement
Keyword: 
speech enhancementmicrophone arraybeamformernoise reductionspectral subtractionspeech recognition
 Summary | Full Text:PDF(840.5KB)

Simultaneous Subtitling System for Broadcast News Programs with a Speech Recognizer
Akio ANDO Toru IMAI Akio KOBAYASHI Shinich HOMMA Jun GOTO Nobumasa SEIYAMA Takeshi MISHIMA Takeshi KOBAYAKAWA Shoei SATO Kazuo ONOE Hiroyuki SEGI Atsushi IMAI Atsushi MATSUI Akira NAKAMURA Hideki TANAKA Tohru TAKAGI Eiichi MIYASAKA Haruo ISONO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/01/01
Vol. E86-D  No. 1 ; pp. 15-25
Type of Manuscript:  INVITED PAPER (Special Issue on the 2001 IEICE Excellent Paper Award)
Category: 
Keyword: 
closed-caption service for news programspeech recognitionrecognition error correctionreal-time processingsystem in the practical use
 Summary | Full Text:PDF(1.1MB)

Duration Modeling Using Cumulative Duration Probability
Tae-Young YANG Chungyong LEE Dae-Hee YOUN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2002/09/01
Vol. E85-D  No. 9 ; pp. 1452-1454
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
speech recognitionconnected digit recognitionduration modelingcumulative duration probability
 Summary | Full Text:PDF(108.9KB)

VLSI Architecture and Implementation for Speech Recognizer Based on Discriminative Bayesian Neural Network
Jhing-Fa WANG Jia-Ching WANG An-Nan SUEN Chung-Hsien WU Fan-Min LI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2002/08/01
Vol. E85-A  No. 8 ; pp. 1861-1869
Type of Manuscript:  Special Section PAPER (Special Section on Digital Signal Processing)
Category: Implementations of Signal Processing Systems
Keyword: 
discriminative Bayesian neural networkspeech recognitionVLSI
 Summary | Full Text:PDF(835.6KB)

A Survey on Automatic Speech Recognition
Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2002/03/01
Vol. E85-D  No. 3 ; pp. 465-486
Type of Manuscript:  INVITED SURVEY PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionacoustic modelHMMlanguage modelngram
 Summary | Full Text:PDF(1.1MB)

Recognition of Connected Digit Speech in Japanese Collected over the Telephone Network
Hisashi KAWAI Tohru SHIMIZU Norio HIGUCHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2001/03/01
Vol. E84-D  No. 3 ; pp. 374-383
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitiondigittelephonedata sizelow performance speakerssheep and goats
 Summary | Full Text:PDF(436KB)

Speaker Adaptation Based on a Maximum Observation Probability Criterion
Tae-Young YANG Chungyong LEE Dae-Hee YOUN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2001/02/01
Vol. E84-D  No. 2 ; pp. 286-288
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: 
speech recognitionspeaker adaptationmaximum observation probability criterion
 Summary | Full Text:PDF(106.2KB)

Japanese Pronunciation Instruction System Using Speech Recognition Methods
Chul-Ho JO Tatsuya KAWAHARA Shuji DOSHITA Masatake DANTSUJI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2000/11/25
Vol. E83-D  No. 11 ; pp. 1960-1968
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionCALLHMMarticulatory categoryformant
 Summary | Full Text:PDF(1.8MB)

Maximum Likelihood Successive State Splitting Algorithm for Tied-Mixture HMnet
Alexandre GIRARDI Harald SINGER Kiyohiro SHIKANO Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2000/10/25
Vol. E83-D  No. 10 ; pp. 1890-1897
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitionacoustic modelingHMMtied-mixtureclustering
 Summary | Full Text:PDF(520.3KB)

Spectral Peak-Weighted Liftering of Cepstral Coefficients for Speech Recognition
Hong Kook KIM Hwang Soo LEE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2000/07/25
Vol. E83-D  No. 7 ; pp. 1540-1549
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: 
speech recognitioncepstral analysispeak-weighted cepstral lifterframe-adaptive cepstral lifter
 Summary | Full Text:PDF(616.6KB)

Speech Enhancement Using Nonlinear Microphone Array Based on Complementary Beamforming
Hiroshi SARUWATARI Shoji KAJITA Kazuya TAKEDA Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1999/08/25
Vol. E82-A  No. 8 ; pp. 1501-1510
Type of Manuscript:  Special Section PAPER (Special Section on Digital Signal Processing)
Category: 
Keyword: 
speech enhancementmicrophone arraycomplementary beamformingspectral subtractionspeech recognition
 Summary | Full Text:PDF(1MB)

Realization of Wide-Band Directivity with Three Microphones
Masataka NAKAMURA Katsuhito KOUNO Toshitaka YAMATO Kazuhiro SAKIYAMA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1999/04/25
Vol. E82-A  No. 4 ; pp. 619-625
Type of Manuscript:  Special Section PAPER (Special Section on Advanced Signal Processing Techniques for Analysis of Acoustical and Vibrational Signals)
Category: 
Keyword: 
speech recognitionbeamformermicrophone arrayhigh SN ratioanalog signal processing
 Summary | Full Text:PDF(770.5KB)

Dynamic Cepstral Representations Based on Order-Dependent Windowing Methods
Hong Kook KIM Seung Ho CHOI Hwang Soo LEE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1998/05/25
Vol. E81-D  No. 5 ; pp. 434-440
Type of Manuscript:  PAPER
Category: Speech Processing and Acoustics
Keyword: 
cepstrumdynamic cepstrumorder-dependent windowingspeech recognition
 Summary | Full Text:PDF(636.4KB)

An Isolated Word Speech Recognition Based on Fusion of Visual and Auditory Information Usisng 30-frame/s and 24-bit Color Image
Akio OGIHARA Shinobu ASAO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1997/08/25
Vol. E80-A  No. 8 ; pp. 1417-1422
Type of Manuscript:  Special Section PAPER (Special Section on Digital Signal Processing)
Category: 
Keyword: 
speech recognitionfusion of visual and auditoryhidden Markov modelsensor fusionfull-frame (30-frame/s) and full-color (24-bit color) image
 Summary | Full Text:PDF(457.5KB)

Discriminative Training Based on Minimum Classification Error for a Small Amount of Data Enhanced by Vector-Field-Smoothed Bayesian Learning
Jun-ichi TAKAHASHI Shigeki SAGAYAMA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1996/12/25
Vol. E79-D  No. 12 ; pp. 1700-1707
Type of Manuscript:  PAPER
Category: Speech Processing and Acoustics
Keyword: 
speech recognitionhidden Markov modeldiscriminative trainingspeaker adaptation
 Summary | Full Text:PDF(745.3KB)

Speech Recognition Based on Fusion of Visual and Auditory Information Using Full-Framse Color Image
Satoru IGAWA Akio OGIHARA Akira SHINTANI Shinobu TAKAMATSU 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1996/11/25
Vol. E79-A  No. 11 ; pp. 1836-1840
Type of Manuscript:  Special Section LETTER (Special Section of Letters Selected from the 1996 IEICE General Conference)
Category: 
Keyword: 
speech recognitionfusion of visual and auditorysensor fusionhidden Markov model
 Summary | Full Text:PDF(327.2KB)

An Isolated Word Speech Recognition Using Fusion of Auditory and Visual Information
Akira SHINTANI Akiko OGIHARA Naoshi DOI Shinobu TAKAMATSU 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1996/06/25
Vol. E79-A  No. 6 ; pp. 777-783
Type of Manuscript:  Special Section PAPER (Special Section of Papers Selected from 1995 Joint Technical Conference on Circuits/Systems, Computers and Communications (JTC-CSCC '95))
Category: 
Keyword: 
HMMfusionlinear combinationspeech recognitionauditory and visual information
 Summary | Full Text:PDF(548.3KB)

The Performance Prediction on Sentence Recognition Using a Finite State Word Automaton
Takashi OTSUKI Akinori ITO Shozo MAKINO Teruhiko OHTOMO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1996/01/25
Vol. E79-D  No. 1 ; pp. 47-53
Type of Manuscript:  PAPER
Category: Speech Processing and Acoustics
Keyword: 
speech recognitionsentence recognitionperformance predictionfinite state word automaton
 Summary | Full Text:PDF(494.6KB)

A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information
Naoshi DOI Akira SHINTANI Yasuhisa HAYASHI Akio OGIHARA Shinobu TAKAMATSU 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1995/11/25
Vol. E78-A  No. 11 ; pp. 1548-1552
Type of Manuscript:  Special Section LETTER (Special Section of Letters Selected from the 1995 IEICE General Conference)
Category: 
Keyword: 
speech recognitionfusion of visual and auditory informationfeature of mouth shapeimage processing
 Summary | Full Text:PDF(285.7KB)

A Minimum Error Approach to Spotting-Based Pattern Recognition
Takashi KOMORI Shigeru KATAGIRI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/08/25
Vol. E78-D  No. 8 ; pp. 1032-1043
Type of Manuscript:  PAPER
Category: Speech Processing and Acoustics
Keyword: 
pattern recognitionword spottingMCE/GPDspeech recognition
 Summary | Full Text:PDF(1014.6KB)

Unsupervised Speaker Adaptation Using All-Phoneme Ergodic Hidden Markov Network
Yasunage MIYAZAWA Jun-ichi TAKAMI Shigeki SAGAYAMA Shoichi MATSUNAGA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/08/25
Vol. E78-D  No. 8 ; pp. 1044-1050
Type of Manuscript:  PAPER
Category: Speech Processing and Acoustics
Keyword: 
speech recognitionunsupervised speaker adaptationall-phoneme ergodic hidden Markov networkcontext-dependent phoneme bigram
 Summary | Full Text:PDF(631.1KB)

Relationship among Recognition Rate, Rejection Rate and False Alarm Rate in a Spoken Word Recognition System
Atsuhiko KAI Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/25
Vol. E78-D  No. 6 ; pp. 698-704
Type of Manuscript:  Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
speech recognitionunknown word processingsimulationrejection rate
 Summary | Full Text:PDF(537.8KB)

Speech Recognition Using Function-Word N-Grams and Content-Word N-Grams
Ryosuke ISOTANI Shoichi MATSUNAGA Shigeki SAGAYAMA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/25
Vol. E78-D  No. 6 ; pp. 692-697
Type of Manuscript:  Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
speech recognitionstochastic language modelN-gramfunction wordscontent words
 Summary | Full Text:PDF(534.7KB)

A Scheme for Word Detection in Continuous Speech Using Likelihood Scores of Segments Modified by Their Context Within a Word
Sumio OHNO Keikichi HIROSE Hiroya FUJISAKI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/25
Vol. E78-D  No. 6 ; pp. 725-731
Type of Manuscript:  Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
word spottingspeech recognitiontemplate matchinghuman speech perception
 Summary | Full Text:PDF(614.2KB)

Duration Modeling with Decreased Intra-Group Temporal Variation for HMM-Based Phoneme Recognition
Nobuaki MINEMATSU Keikichi HIROSE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/25
Vol. E78-D  No. 6 ; pp. 654-661
Type of Manuscript:  Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
speech recognitionHMMGaussian mixturetemporal correspondenceduration modellooping ratesingle occupancy ratetemporal variation rate
 Summary | Full Text:PDF(796.8KB)

Error Analysis of Field Trial Results of a Spoken Dialogue System for Telecommunications Applications
Shingo KUROIWA Kazuya TAKEDA Masaki NAITO Naomi INOUE Seiichi YAMAMOTO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/25
Vol. E78-D  No. 6 ; pp. 636-641
Type of Manuscript:  Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
speech recognitiondialogue systemfield trialtelephone
 Summary | Full Text:PDF(589.1KB)

Speaker-Consistent Parsing for Speaker-Independent Continuous Speech Recognition
Kouichi YAMAGUCHI Harald SINGER Shoichi MATSUNAGA Shigeki SAGAYAMA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/25
Vol. E78-D  No. 6 ; pp. 719-724
Type of Manuscript:  Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
speech recognitionsearch algorithmhidden Markov modelspeaker adaptation
 Summary | Full Text:PDF(558.2KB)

Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information
Akira SHINTANI Akio OGIHARA Yoshikazu YAMAGUCHI Yasuhisa HAYASHI Kunio FUKUNAGA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1994/11/25
Vol. E77-A  No. 11 ; pp. 1875-1878
Type of Manuscript:  Special Section LETTER (Special Section of Letters Selected from the 1994 IEICE Spring Conference)
Category: 
Keyword: 
speech recognitionfusion of visual and auditorysensor fusionHidden Markov Model
 Summary | Full Text:PDF(239.5KB)

A MRF-Based Parallel Processing for Speech Recognition Using Linear Predictive HMM
Hideki NODA Mehdi N. SHIRAZI Mamoru NAKATSUI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1994/10/25
Vol. E77-D  No. 10 ; pp. 1142-1147
Type of Manuscript:  PAPER
Category: Speech Processing
Keyword: 
parallel processingspeech recognitionMRF modelHMMICM algorithm
 Summary | Full Text:PDF(510.9KB)

Spoken Sentence Recognition Based on HMM-LR with Hybrid Language Modeling
Kenji KITA Tsuyoshi MORIMOTO Kazumi OHKURA Shigeki SAGAYAMA Yaneo YANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1994/02/25
Vol. E77-D  No. 2 ; pp. 258-265
Type of Manuscript:  Special Section PAPER (Special Issue on Natural Language Processing and Understanding)
Category: 
Keyword: 
speech recognitionLR parsinghidden Markov modelHMM-LRlanguage model
 Summary | Full Text:PDF(720.1KB)

Speech Recognition of lsolated Digits Using Simultaneous Generative Histogram
Yasuhisa HAYASHI Akio OGIHARA Kunio FUKUNAGA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/12/25
Vol. E76-A  No. 12 ; pp. 2052-2054
Type of Manuscript:  Special Section LETTER (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
Category: 
Keyword: 
speech recognitionhidden Markov modelseparate vector quantizationsimultaneous generative histogram
 Summary | Full Text:PDF(214KB)

A Hardware Architecture Design Methodology for Hidden Markov Model Based Recognition Systems Using Parallel Processing
Jun-ichi TAKAHASHI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/06/25
Vol. E76-A  No. 6 ; pp. 990-1000
Type of Manuscript:  PAPER
Category: Digital Signal Processing
Keyword: 
hidden markov modelspeech recognitioncorrective trainingparallel processingarray processorsystem design
 Summary | Full Text:PDF(787.5KB)

Automatic Evaluation of English Pronunciation Based on Speech Recognition Techniques
Hiroshi HAMADA Satoshi MIKI Ryohei NAKATSU 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/03/25
Vol. E76-D  No. 3 ; pp. 352-359
Type of Manuscript:  PAPER
Category: Speech Processing
Keyword: 
speech processingpronunciationspeech recognitionspeaker adaptationeducationenglish
 Summary | Full Text:PDF(732.1KB)

Task Adaptation in Syllable Trigram Models for Continuous Speech Recognition
Sho-ichi MATSUNAGA Tomokazu YAMADA Kiyohiro SHIKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/25
Vol. E76-D  No. 1 ; pp. 38-43
Type of Manuscript:  Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
speech recognitionstochastic language model
 Summary | Full Text:PDF(509.6KB)

Three Different LR Parsing Algorithms for Phoneme-Context-Dependent HMM-Based Continuous Speech Recognition
Akito NAGAI Shigeki SAGAYAMA Kenji KITA Hideaki KIKUCHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/25
Vol. E76-D  No. 1 ; pp. 29-37
Type of Manuscript:  Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
speech recognitionHidden Markov ModelCFGLR parserallophonephoneme context
 Summary | Full Text:PDF(743.3KB)

LR Parsing with a Category Reachability Test Applied to Speech Recognition
Kenji KITA Tsuyoshi MORIMOTO Shigeki SAGAYAMA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/25
Vol. E76-D  No. 1 ; pp. 23-28
Type of Manuscript:  Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
speech recognitionHMMsLR parsingreachabilityLR-CRT algorithm
 Summary | Full Text:PDF(526.6KB)

Predicting the Next Utterance Linguistic Expressions Using Contextual Information
Hitoshi IIDA Takayuhi YAMAOKA Hidekazu ARITA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/25
Vol. E76-D  No. 1 ; pp. 62-73
Type of Manuscript:  Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
dialogue understandingplan recognitionuniversal pragmaticsutterance predictionspeech recognition
 Summary | Full Text:PDF(1MB)

How Might One Comfortably Converse with a Machine ?
Yasuhisa NIIMI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/25
Vol. E76-D  No. 1 ; pp. 9-16
Type of Manuscript:  INVITED PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
speech dialogue systemcomfirmationadaptationcooperativenessspeech recognitionspeech synthesis
 Summary | Full Text:PDF(710.2KB)

A Spoken Dialog System with Verification and Clarification Queries
Mikio YAMAMOTO Satoshi KOBAYASHI Yuji MORIYA Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/25
Vol. E76-D  No. 1 ; pp. 84-94
Type of Manuscript:  Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
natural language processingspeech recognitiondialog systemverificationclarification
 Summary | Full Text:PDF(938KB)

An SVQ-HMM Training Method Using Simultaneous Generative Histogram
Yasuhisa HAYASHI Satoshi KONDO Nobuyuki TAKASU Akio OGIHARA Shojiro YONEDA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1992/07/25
Vol. E75-A  No. 7 ; pp. 905-907
Type of Manuscript:  Special Section LETTER (Special Section on the 1992 IEICE Spring Conference)
Category: 
Keyword: 
speech recognitionseparate vector quantizationhidden Markov modelsimultaneous generative histogram
 Summary | Full Text:PDF(212KB)

Neural Networks Applied to Speech Recognition
Hiroaki SAKOE 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1992/05/25
Vol. E75-A  No. 5 ; pp. 546-551
Type of Manuscript:  INVITED PAPER (Special Section on Nonlinear Dynamics--Adaptive, Learning and Neural Systems--)
Category: 
Keyword: 
neural networkspeech recognitiondynamic programminghidden Markov model
 Summary | Full Text:PDF(493KB)

Future Perspective of Automatic Telephone Interpretation
Akira KUREMATSU 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 1992/01/25
Vol. E75-B  No. 1 ; pp. 14-19
Type of Manuscript:  INVITED PAPER (Special Section on Dreams of Future Communications)
Category: 
Keyword: 
speech recognitionmachine translationspeech synthesisspeech translation
 Summary | Full Text:PDF(515.5KB)

A Study of Line Spectrum Pair Frequency Representation for Speech Recognition
Fikret S. GURGEN Shigeki SAGAYAMA Sadaoki FURUI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1992/01/25
Vol. E75-A  No. 1 ; pp. 98-102
Type of Manuscript:  PAPER
Category: Speech
Keyword: 
speech recognitionline spectrum pair (LSP)transitional parameter
 Summary | Full Text:PDF(415KB)