Keyword : speech recognition


A Bayesian Framework Using Multiple Model Structures for Speech Recognition
Sayaka SHIOTA  Kei HASHIMOTO  Yoshihiko NANKAKU  Keiichi TOKUDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/04/01
Vol. E96-D  No. 4  pp. 939-948
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, acoustic modeling, Bayesian approach, model structure integration, deterministic annealing

Refinement of Landmark Detection and Extraction of Articulator-Free Features for Knowledge-Based Speech Recognition
Jung-In LEE  Jeung-Yoon CHOI  Hong-Goo KANG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/03/01
Vol. E96-D  No. 3  pp. 746-749
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
speech recognition, acoustic events, landmark detection

Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition
Arata ITOH  Sunao HARA  Norihide KITAOKA  Kazuya TAKEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/10/01
Vol. E95-D  No. 10  pp. 2479-2485
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, acoustic model training, pseudo speakers, feature generation, MLLR

Class-Based N-Gram Language Model for New Words Using Out-of-Vocabulary to In-Vocabulary Similarity
Welly NAPTALI  Masatoshi TSUCHIYA  Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/09/01
Vol. E95-D  No. 9  pp. 2308-2317
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
out-of-vocabulary, class-based n-gram, language model, adjusted perplexity, speech recognition

Hidden Conditional Neural Fields for Continuous Phoneme Speech Recognition
Yasuhisa FUJII  Kazumasa YAMAMOTO  Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/08/01
Vol. E95-D  No. 8  pp. 2094-2104
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
hidden conditional neural fields, hidden conditional random fields, hidden Markov model, speech recognition, deep learning

A VLSI Architecture with Multiple Fast Store-Based Block Parallel Processing for Output Probability and Likelihood Score Computations in HMM-Based Isolated Word Recognition
Kazuhiro NAKAMURA  Ryo SHIMAZAKI  Masatoshi YAMAMOTO  Kazuyoshi TAKAGI  Naofumi TAKAGI 
Publication:   IEICE TRANSACTIONS on Electronics
Publication Date: 2012/04/01
Vol. E95-C  No. 4  pp. 456-467
Type of Manuscript: Special Section PAPER (Special Section on Solid-State Circuit Design – Architecture, Circuit, Device and Design Methodology)
Category: 
Keyword: 
speech recognition, hidden Markov model (HMM), VLSI architecture, isolated word recognition

Decision Tree-Based Acoustic Models for Speech Recognition with Improved Smoothness
Masami AKAMINE  Jitendra AJMERA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/11/01
Vol. E94-D  No. 11  pp. 2250-2258
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, acoustic modeling, decision trees, probability estimation, likelihood computation

Enhancing Eigenspace-Based MLLR Speaker Adaptation Using a Fuzzy Logic Learning Control Scheme
Ing-Jr DING 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/10/01
Vol. E94-D  No. 10  pp. 1909-1916
Type of Manuscript: Special Section PAPER (Special Section on Information-Based Induction Sciences and Machine Learning)
Category: 
Keyword: 
speech recognition, speaker adaptation, HMM, Eigen-MLLR, fuzzy control

VLSI Architecture of GMM Processing and Viterbi Decoder for 60,000-Word Real-Time Continuous Speech Recognition
Hiroki NOGUCHI  Kazuo MIURA  Tsuyoshi FUJINAGA  Takanobu SUGAHARA  Hiroshi KAWAGUCHI  Masahiko YOSHIMOTO 
Publication:   IEICE TRANSACTIONS on Electronics
Publication Date: 2011/04/01
Vol. E94-C  No. 4  pp. 458-467
Type of Manuscript: Special Section PAPER (Special Section on Circuits and Design Techniques for Advanced Large Scale Integration)
Category: 
Keyword: 
speech recognition, hidden Markov model (HMM), VLSI architecture

Bayesian Context Clustering Using Cross Validation for Speech Recognition
Kei HASHIMOTO  Heiga ZEN  Yoshihiko NANKAKU  Akinobu LEE  Keiichi TOKUDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/03/01
Vol. E94-D  No. 3  pp. 668-678
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
Bayesian approach, speech recognition, HMM, context clustering, cross validation

Estimation of Speech Intelligibility Using Speech Recognition Systems
Yusuke TAKANO  Kazuhiro KONDO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/12/01
Vol. E93-D  No. 12  pp. 3368-3376
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
objective estimation, speech intelligibility, speech recognition, Japanese Diagnostic Rhyme Test, noise adaptation

Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition
Tetsuo KOSAKA  Yuui TAKEDA  Takashi ITO  Masaharu KATO  Masaki KOHDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9  pp. 2363-2369
Type of Manuscript: Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Adaptation
Keyword: 
speech recognition, speaker adaptation, speaker-class model, LVCSR, corpus of spontaneous Japanese

Intentional Voice Command Detection for Trigger-Free Speech Interface
Yasunari OBUCHI  Takashi SUMIYOSHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9  pp. 2440-2450
Type of Manuscript: Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Robust Speech Recognition
Keyword: 
speech recognition, speech/non-speech discrimination, VAD, utterance verification, emotion recognition, hands-free, trigger-free, IVCD

Enhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition
Yanqing SUN  Yu ZHOU  Qingwei ZHAO  Pengyuan ZHANG  Fuping PAN  Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9  pp. 2431-2439
Type of Manuscript: Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Robust Speech Recognition
Keyword: 
OOV, speech recognition, confidence measure, entropy information, phoneme-level posterior

Learning Speech Variability in Discriminative Acoustic Model Adaptation
Shoei SATO  Takahiro OKU  Shinichi HOMMA  Akio KOBAYASHI  Toru IMAI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9  pp. 2370-2378
Type of Manuscript: Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Adaptation
Keyword: 
speech recognition, speech variability, discriminative training, acoustic model

Acoustic Model Adaptation for Speech Recognition
Koichi SHINODA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9  pp. 2348-2362
Type of Manuscript: Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: INVITED
Keyword: 
speech recognition, acoustic model adaptation, hidden Markov models

Novel Confidence Feature Extraction Algorithm Based on Latent Topic Similarity
Wei CHEN  Gang LIU  Jun GUO  Shinichiro OMACHI  Masako OMACHI  Yujing GUO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/08/01
Vol. E93-D  No. 8  pp. 2243-2251
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, confidence annotation, confidence feature, latent topic similarity

A New Subband-Weighted MVDR-Based Front-End for Robust Speech Recognition
Sanaz SEYEDIN  Seyed Mohammad AHADI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/08/01
Vol. E93-D  No. 8  pp. 2252-2261
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
feature extraction, robust MVDR power spectral estimation, speech recognition

Acoustic Feature Transformation Combining Average and Maximum Classification Error Minimization Criteria
Makoto SAKAI  Norihide KITAOKA  Kazuya TAKEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/07/01
Vol. E93-D  No. 7  pp. 2005-2008
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
speech recognition, dimensionality reduction, Bayes error

Acoustic Feature Transformation Based on Discriminant Analysis Preserving Local Structure for Speech Recognition
Makoto SAKAI  Norihide KITAOKA  Kazuya TAKEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/05/01
Vol. E93-D  No. 5  pp. 1244-1252
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, feature extraction, multidimensional signal processing

Speech Enhancement Using a Square Microphone Array in the Presence of Directional and Diffuse Noise
Tetsuji OGAWA  Shintaro TAKADA  Kenzo AKAGIRI  Tetsunori KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2010/05/01
Vol. E93-A  No. 5  pp. 926-935
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
directional noise reduction, diffuse noise reduction, square microphone array, speech recognition, mobile devices

A VLSI Architecture for Output Probability Computations of HMM-Based Recognition Systems with Store-Based Block Parallel Processing
Kazuhiro NAKAMURA  Masatoshi YAMAMOTO  Kazuyoshi TAKAGI  Naofumi TAKAGI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/02/01
Vol. E93-D  No. 2  pp. 300-305
Type of Manuscript: PAPER
Category: VLSI Systems
Keyword: 
speech recognition, hidden Markov model (HMM), VLSI architecture

Evaluation of Combinational Use of Discriminant Analysis-Based Acoustic Feature Transformation and Discriminative Training
Makoto SAKAI  Norihide KITAOKA  Yuya HATTORI  Seiichi NAKAGAWA  Kazuya TAKEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/02/01
Vol. E93-D  No. 2  pp. 395-398
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
speech recognition, feature extraction, discriminative training

Cepstral Domain Feature Extraction Utilizing Entropic Distance-Based Filterbank
Youngjoo SUH  Hoirin KIM 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/02/01
Vol. E93-D  No. 2  pp. 392-394
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
cepstral feature, entropic distance, filterbank, speech recognition

A Single-Chip Speech Dialogue Module and Its Evaluation on a Personal Robot, PaPeRo-Mini
Miki SATO  Toru IWASAWA  Akihiko SUGIYAMA  Toshihiro NISHIZAWA  Yosuke TAKANO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2010/01/01
Vol. E93-A  No. 1  pp. 261-271
Type of Manuscript: PAPER
Category: Digital Signal Processing
Keyword: 
speech recognition, DOA estimation, noise cancellation, microphone array, echo cancellation, speech dialogue module

Effective Prediction of Errors by Non-native Speakers Using Decision Tree for Speech Recognition-Based CALL System
Hongcui WANG  Tatsuya KAWAHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2009/12/01
Vol. E92-D  No. 12  pp. 2462-2468
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, CALL, grammar network, decision tree

Robust Feature Extraction Using Variable Window Function in Autocorrelation Domain for Speech Recognition
Sangho LEE  Jeonghyun HA  Jaekeun HONG 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2009/11/01
Vol. E92-A  No. 11  pp. 2917-2921
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
variable window, AMFCC, speech recognition, robust feature

Voice Activity Detection Based on High Order Statistics and Online EM Algorithm
David COURNAPEAU  Tatsuya KAWAHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/12/01
Vol. E91-D  No. 12  pp. 2854-2861
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, voice activity detection, high order statistics, online EM

A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System
Keiichiro OURA  Heiga ZEN  Yoshihiko NANKAKU  Akinobu LEE  Keiichi TOKUDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/11/01
Vol. E91-D  No. 11  pp. 2693-2700
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, hidden Markov model, hidden semi-Markov model, weighted finite-state transducer

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech
Fengpei GE  Changliang LIU  Jian SHAO  Fuping PAN  Bin DONG  Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/10/01
Vol. E91-D  No. 10  pp. 2485-2492
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
CALL, speech recognition, HLDA, speaker-dependent CMN, e-learning

HMM-Based Mask Estimation for a Speech Recognition Front-End Using Computational Auditory Scene Analysis
Ji Hun PARK  Jae Sam YOON  Hong Kook KIM 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/09/01
Vol. E91-D  No. 9  pp. 2360-2364
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
computational auditory scene analysis, mask estimation, hidden Markov model, speech recognition

Local Peak Enhancement for In-Car Speech Recognition in Noisy Environment
Osamu ICHIKAWA  Takashi FUKUDA  Masafumi NISHIMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 635-639
Type of Manuscript: Special Section LETTER (Special Section on Robust Speech Processing in Realistic Environments)
Category: 
Keyword: 
harmonics, formant, speech enhancement, noise reduction, speech recognition

Multichannel Speech Enhancement Based on Generalized Gamma Prior Distribution with Its Online Adaptive Estimation
Tran HUY DAT  Kazuya TAKEDA  Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 439-447
Type of Manuscript: Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Speech Enhancement
Keyword: 
multi-channel speech enhancement, speech recognition, generalized gamma distribution, moment matching

Noise Suppression Based on Multi-Model Compositions Using Multi-Pass Search with Multi-Label N-gram Models
Takatoshi JITSUHIRO  Tomoji TORIYAMA  Kiyoshi KOGURE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 402-410
Type of Manuscript: Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Noisy Speech Recognition
Keyword: 
speech recognition, noise suppression, model composition, multi-pass search, E-Nightingale project

Feature Compensation Employing Multiple Environmental Models for Robust In-Vehicle Speech Recognition
Wooil KIM  John H.L. HANSEN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 430-438
Type of Manuscript: Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Noisy Speech Recognition
Keyword: 
speech recognition, in-vehicle condition, feature compensation, environment transition model, mixture sharing

Recognizing Reverberant Speech Based on Amplitude and Frequency Modulation
Yotaro KUBO  Shigeki OKAWA  Akira KUREMATSU  Katsuhiko SHIRAI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 448-456
Type of Manuscript: Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: ASR under Reverberant Conditions
Keyword: 
speech recognition, temporal feature, tandem approach, multistream combination, reverberant speech

Linear Discriminant Analysis Using a Generalized Mean of Class Covariances and Its Application to Speech Recognition
Makoto SAKAI  Norihide KITAOKA  Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 478-487
Type of Manuscript: Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Feature Extraction
Keyword: 
speech recognition, feature extraction, multidimensional signal processing

Using Mutual Information Criterion to Design an Efficient Phoneme Set for Chinese Speech Recognition
Jin-Song ZHANG  Xin-Hui HU  Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 508-513
Type of Manuscript: Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Acoustic Modeling
Keyword: 
mutual information, Chinese lexical tones, tone dependent units, speech recognition

Selection of Optimum Vocabulary and Dialog Strategy for Noise-Robust Spoken Dialog Systems
Akinori ITO  Takanobu OBA  Takashi KONASHI  Motoyuki SUZUKI  Shozo MAKINO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 538-548
Type of Manuscript: Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: ASR System Architecture
Keyword: 
spoken dialog system, noisy environment, dialog strategy, neural network, speech recognition

An Improved Greedy Search Algorithm for the Development of a Phonetically Rich Speech Corpus
Jin-Song ZHANG  Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 615-630
Type of Manuscript: Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Corpus
Keyword: 
greedy search, minimum sentence set, speech recognition, speech corpus

Bi-Spectral Acoustic Features for Robust Speech Recognition
Kazuo ONOE  Shoei SATO  Shinichi HOMMA  Akio KOBAYASHI  Toru IMAI  Tohru TAKAGI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 631-634
Type of Manuscript: Special Section LETTER (Special Section on Robust Speech Processing in Realistic Environments)
Category: 
Keyword: 
bi-spectrum non-Gaussianity, phase information, speech recognition

Mutual Information Based Dynamic Integration of Multiple Feature Streams for Robust Real-Time LVCSR
Shoei SATO  Akio KOBAYASHI  Kazuo ONOE  Shinichi HOMMA  Toru IMAI  Tohru TAKAGI  Tetsunori KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 815-824
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, stream integration, entropy, mutual information, active hypotheses

Ears of the Robot: Three Simultaneous Speech Segregation and Recognition Using Robot-Mounted Microphones
Naoya MOCHIKI  Tetsuji OGAWA  Tetsunori KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/09/01
Vol. E90-D  No. 9  pp. 1465-1468
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
robot audition, sound source segregation, speech recognition, SAFIA, spectral subtraction

Online Speech Detection and Dual-Gender Speech Recognition for Captioning Broadcast News
Toru IMAI  Shoei SATO  Shinichi HOMMA  Kazuo ONOE  Akio KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/08/01
Vol. E90-D  No. 8  pp. 1286-1291
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, speech detection, gender identification, low latency, broadcast captioning

Dynamic Bayesian Network Inversion for Robust Speech Recognition
Lei XIE  Hongwu YANG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/07/01
Vol. E90-D  No. 7  pp. 1117-1120
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
speech recognition, hidden Markov model, dynamic Bayesian network

Effective Energy Feature Compensation Using Modified Log-energy Dynamic Range Normalization for Robust Speech Recognition
Yoonjae LEE  Hanseok KO 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 2007/06/01
Vol. E90-B  No. 6  pp. 1508-1511
Type of Manuscript: LETTER
Category: Fundamental Theories for Communications
Keyword: 
log-energy dynamic range normalization (ERN), energy-subtraction, modified ERN, speech recognition

Response Time Reduction of Speech Recognizers Using Single Gaussians
Sangbae JEONG  Hoirin KIM  Minsoo HAHN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/05/01
Vol. E90-D  No. 5  pp. 868-871
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
speech recognition, fast likelihood computation

Feature Compensation with Model-Based Estimation for Noise Masking
Young Joon KIM  Nam Soo KIM 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/02/01
Vol. E90-D  No. 2  pp. 603-605
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
speech recognition, feature compensation, IMM, noise masking

Incremental Language Modeling for Automatic Transcription of Broadcast News
Katsutoshi OHTSUKI  Long NGUYEN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/02/01
Vol. E90-D  No. 2  pp. 526-532
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, out-of-vocabulary, language model, broadcast news

Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics
Randy GOMEZ  Tomoki TODA  Hiroshi SARUWATARI  Kiyohiro SHIKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/02/01
Vol. E90-D  No. 2  pp. 554-561
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
HMM-sufficient statistics, unsupervised, rapid adaptation, speech recognition

A Systolic FPGA Architecture of Two-Level Dynamic Programming for Connected Speech Recognition
Yong KIM  Hong JEONG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/02/01
Vol. E90-D  No. 2  pp. 562-568
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, hidden Markov model (HMM), two-level dynamic programming (TLDP), FPGA

N-gram Adaptation with Dynamic Interpolation Coefficient Using Information Retrieval Technique
Joon-Ki CHOI  Yung-Hwan OH 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/09/01
Vol. E89-D  No. 9  pp. 2579-2582
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
language model adaptation, adaptation corpus, dynamic interpolation coefficient, speech recognition

Robust Speech Recognition by Using Compensated Acoustic Scores
Shoei SATO  Kazuo ONOE  Akio KOBAYASHI  Toru IMAI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3  pp. 915-921
Type of Manuscript: Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Recognition
Keyword: 
speech recognition, noisy environment, acoustic score

Trigger-Based Language Model Adaptation for Automatic Transcription of Panel Discussions
Carlos TRONCOSO  Tatsuya KAWAHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3  pp. 1024-1031
Type of Manuscript: Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Recognition
Keyword: 
speech recognition, language model, trigger-based language model, TF/IDF

Single-Channel Multiple Regression for In-Car Speech Enhancement
Weifeng LI  Katsunobu ITOU  Kazuya TAKEDA  Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3  pp. 1032-1039
Type of Manuscript: Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Enhancement
Keyword: 
speech enhancement, speech recognition, multi-layer perceptron, mean opinion score, pairwise preference test, environmental adaptation, K-means clustering

Gamma Modeling of Speech Power and Its On-Line Estimation for Statistical Speech Enhancement
Tran Huy DAT  Kazuya TAKEDA  Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3  pp. 1040-1049
Type of Manuscript: Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Enhancement
Keyword: 
speech enhancement, speech recognition, gamma modeling, fourth-order moment, MMSE, MAP, spectral magnitude, power, log-spectral magnitude

Verification of Speech Recognition Results Incorporating In-domain Confidence and Discourse Coherence Measures
Ian R. LANE  Tatsuya KAWAHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3  pp. 931-938
Type of Manuscript: Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Recognition
Keyword: 
speech recognition, confidence measure, utterance verification, in-domain confidence, discourse coherence

Training Augmented Models Using SVMs
Mark J.F. GALES  Martin I. LAYTON 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3  pp. 892-899
Type of Manuscript: Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: INVITED
Keyword: 
speech recognition, hidden Markov models, support vector machines, augmented statistical models

Speech Recognition Based on Student's t-Distribution Derived from Total Bayesian Framework
Shinji WATANABE  Atsushi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3  pp. 970-980
Type of Manuscript: Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Recognition
Keyword: 
speech recognition, total Bayesian framework VBEC, Bayesian prediction, student's t-distribution

Production-Oriented Models for Speech Recognition
Erik MCDERMOTT  Atsushi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3  pp. 1006-1014
Type of Manuscript: Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)
Category: Speech Recognition
Keyword: 
speech recognition, speech production, articulatory modeling, linear dynamical systems

Non-Audible Murmur (NAM) Recognition
Yoshitaka NAKAJIMA  Hideki KASHIOKA  Nick CAMPBELL  Kiyohiro SHIKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/01/01
Vol. E89-D  No. 1  pp. 1-8
Type of Manuscript: Special Section PAPER (Special Section on the 2004 IEICE Excellent Paper Award)
Category: 
Keyword: 
interface, speech recognition, Non-Audible Murmur recognition, NAM, wearable computing

Frequency Domain Microphone Array Calibration and Beamforming for Automatic Speech Recognition
Jwu-Sheng HU  Chieh-Cheng CHENG 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/09/01
Vol. E88-A  No. 9  pp. 2401-2411
Type of Manuscript: PAPER
Category: Noise and Vibration
Keyword: 
beamformer, microphone array, calibration, speech recognition, speech enhancement

An Adaptive Noise Canceller with Low Signal-Distortion Based on Variable Stepsize Subfilters for Human-Robot Communication
Miki SATO  Akihiko SUGIYAMA  Shin'ichi OHNAKA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/08/01
Vol. E88-A  No. 8  pp. 2055-2061
Type of Manuscript: Special Section PAPER (Special Section on Papers Selected from the 19th Symposium on Signal Processing)
Category: Digital Signal Processing
Keyword: 
noise canceller, distortion, crosstalk, adaptive filter, algorithm, speech recognition, human-robot communication

Simultaneous Adaptation of Echo Cancellation and Spectral Subtraction for In-Car Speech Recognition
Osamu ICHIKAWA  Masafumi NISHIMURA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/07/01
Vol. E88-A  No. 7  pp. 1732-1738
Type of Manuscript: Special Section PAPER (Special Section on Multi-channel Acoustic Signal Processing)
Category: Speech Enhancement
Keyword: 
speech enhancement, echo canceller, noise reduction, spectral subtraction, speech recognition

Adaptive Nonlinear Regression Using Multiple Distributed Microphones for In-Car Speech Recognition
Weifeng LI  Chiyomi MIYAJIMA  Takanori NISHINO  Katsunobu ITOU  Kazuya TAKEDA  Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/07/01
Vol. E88-A  No. 7  pp. 1716-1723
Type of Manuscript: Special Section PAPER (Special Section on Multi-channel Acoustic Signal Processing)
Category: Speech Enhancement
Keyword: 
speech recognition, support vector machine, multi-layer perceptron, signal-to-deviation ratio, K-means clustering, adaptive beamforming

Interface for Barge-in Free Spoken Dialogue System Combining Adaptive Sound Field Control and Microphone Array
Tatsunori ASAI  Hiroshi SARUWATARI  Kiyohiro SHIKANO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/06/01
Vol. E88-A  No. 6  pp. 1613-1618
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
spoken dialogue system, barge-in, adaptive sound field control, microphone array, speech recognition

Bayesian Confidence Scoring and Adaptation Techniques for Speech Recognition
Tae-Yoon KIM  Hanseok KO 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 2005/04/01
Vol. E88-B  No. 4  pp. 1756-1759
Type of Manuscript: LETTER
Category: Multimedia Systems for Communications
Keyword: 
speech recognition, confidence measure, adaptation, OOV rejection
  Summary |  Full Text:PDF

Dialogue Speech Recognition by Combining Hierarchical Topic Classification and Language Model Switching
Ian R. LANE  Tatsuya KAWAHARA  Tomoko MATSUI  Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3  pp. 446-454
Type of Manuscript: Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Spoken Language Systems
Keyword: 
speech recognition, topic detection, topic-dependent language modeling, support vector machines, multi-domain spoken dialogue
  Summary |  Full Text:PDF

Applying Sparse KPCA for Feature Extraction in Speech Recognition
Amaro LIMA  Heiga ZEN  Yoshihiko NANKAKU  Keiichi TOKUDA  Tadashi KITAMURA  Fernando G. RESENDE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3  pp. 401-409
Type of Manuscript: Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Feature Extraction and Acoustic Modelings
Keyword: 
kernel, sparsity, principal component analysis, feature extraction, speech recognition
  Summary |  Full Text:PDF

Automatic Generation of Non-uniform and Context-Dependent HMMs Based on the Variational Bayesian Approach
Takatoshi JITSUHIRO  Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3  pp. 391-400
Type of Manuscript: Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Feature Extraction and Acoustic Modelings
Keyword: 
speech recognition, acoustic model, topology training, SSS algorithm, variational Bayesian approach
  Summary |  Full Text:PDF

Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones
Weifeng LI  Tetsuya SHINDE  Hiroshi FUJIMURA  Chiyomi MIYAJIMA  Takanori NISHINO  Katunobu ITOU  Kazuya TAKEDA  Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3  pp. 384-390
Type of Manuscript: Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Feature Extraction and Acoustic Modelings
Keyword: 
speech recognition, microphone arrays, adaptive beamforming, signal-to-deviation ratio, multiple regression
  Summary |  Full Text:PDF

Improving Keyword Recognition of Spoken Queries by Combining Multiple Speech Recognizer's Outputs for Speech-driven WEB Retrieval Task
Masahiko MATSUSHITA  Hiromitsu NISHIZAKI  Takehito UTSURO  Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3  pp. 472-480
Type of Manuscript: Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Spoken Language Systems
Keyword: 
speech recognition, machine learning, multiple LVCSR models, WEB retrieval
  Summary |  Full Text:PDF

Selection of Shared-State Hidden Markov Model Structure Using Bayesian Criterion
Shinji WATANABE  Yasuhiro MINAMI  Atsushi NAKAMURA  Naonori UEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/01/01
Vol. E88-D  No. 1  pp. 1-9
Type of Manuscript: Special Section PAPER (Special Section on the 2003 IEICE Excellent Paper Award)
Category: 
Keyword: 
speech recognition, shared-state HMM, model structure selection, variational Bayes, Bayesian criterion
  Summary |  Full Text:PDF

On the Use of Kernel PCA for Feature Extraction in Speech Recognition
Amaro LIMA  Heiga ZEN  Yoshihiko NANKAKU  Chiyomi MIYAJIMA  Keiichi TOKUDA  Tadashi KITAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/12/01
Vol. E87-D  No. 12  pp. 2802-2811
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
kernel, feature space, principal component analysis, feature extraction, speech recognition
  Summary |  Full Text:PDF

Cepstral Amplitude Range Normalization for Noise Robust Speech Recognition
Shingo YOSHIZAWA  Noboru HAYASAKA  Naoya WADA  Yoshikazu MIYANAGA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/08/01
Vol. E87-D  No. 8  pp. 2130-2137
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, robust features, cepstrum, Noisex92
  Summary |  Full Text:PDF

Automatic Generation of Non-uniform HMM Topologies Based on the MDL Criterion
Takatoshi JITSUHIRO  Tomoko MATSUI  Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/08/01
Vol. E87-D  No. 8  pp. 2121-2129
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, acoustic model, topology training, MDL criterion, SSS algorithm
  Summary |  Full Text:PDF

A Statistical Method of Evaluating Pronunciation Proficiency for English Words Spoken by Japanese
Seiichi NAKAGAWA  Naoki NAKAMURA  Kazumasa MORI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/07/01
Vol. E87-D  No. 7  pp. 1917-1922
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
CALL, evaluation of pronunciation proficiency, English learning, speech recognition
  Summary |  Full Text:PDF

A Spoken Dialogue Interface for TV Operations Based on Data Collected by Using WOZ Method
Jun GOTO  Kazuteru KOMINE  Masaru MIYAZAKI  Yeun-Bae KIM  Noriyoshi URATANI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/06/01
Vol. E87-D  No. 6  pp. 1397-1404
Type of Manuscript: Special Section PAPER (Special Section on Human Communication I)
Category: 
Keyword: 
spoken dialogue interface, TV operation, WOZ, speech recognition
  Summary |  Full Text:PDF

One-Pass Semi-Dynamic Network Decoding Using a Subnetwork Caching Model for Large Vocabulary Continuous Speech Recognition
Dong-Hoon AHN  Minhwa CHUNG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/05/01
Vol. E87-D  No. 5  pp. 1164-1174
Type of Manuscript: Special Section PAPER (Special Section on Speech Dynamics by Ear, Eye, Mouth and Machine)
Category: 
Keyword: 
speech recognition, semi-dynamic network decoding, subnetwork caching, tail-sharing algorithm
  Summary |  Full Text:PDF

Improved Phoneme-History-Dependent Search Method for Large-Vocabulary Continuous-Speech Recognition
Takaaki HORI  Yoshiaki NODA  Shoichi MATSUNAGA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/06/01
Vol. E86-D  No. 6  pp. 1059-1067
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, search algorithm, multi-pass search, word graph, phoneme-history-dependent search
  Summary |  Full Text:PDF

Speaker Tracking for Hands-Free Continuous Speech Recognition in Noise Based on a Spectrum-Entropy Beamforming Method
George NOKAS  Evangelos DERMATAS 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/04/01
Vol. E86-D  No. 4  pp. 755-758
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
speaker tracking, microphone array, spectrum entropy, speech recognition, speaker beam-former
  Summary |  Full Text:PDF

Speech Enhancement by Profile Fitting Method
Osamu ICHIKAWA  Tetsuya TAKIGUCHI  Masafumi NISHIMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3  pp. 514-521
Type of Manuscript: Special Section PAPER (Special Issue on Speech Information Processing)
Category: Robust Speech Recognition and Enhancement
Keyword: 
speech enhancement, microphone array, beamformer, noise reduction, spectral subtraction, speech recognition
  Summary |  Full Text:PDF

Face-to-Talk: Audio-Visual Speech Detection for Robust Speech Recognition in Noisy Environment
Kazumasa MURAI  Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3  pp. 505-513
Type of Manuscript: Special Section PAPER (Special Issue on Speech Information Processing)
Category: Robust Speech Recognition and Enhancement
Keyword: 
speech recognition, speech section detection, multi-modality, face detection, "face-to-talk"
  Summary |  Full Text:PDF

Filter Bank Subtraction for Robust Speech Recognition
Kazuo ONOE  Hiroyuki SEGI  Takeshi KOBAYAKAWA  Shoei SATO  Shinichi HOMMA  Toru IMAI  Akio ANDO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3  pp. 483-488
Type of Manuscript: Special Section PAPER (Special Issue on Speech Information Processing)
Category: Robust Speech Recognition and Enhancement
Keyword: 
filter bank, spectral subtraction, speech recognition, noise
  Summary |  Full Text:PDF

Continuous Speech Recognition Using an On-Line Speaker Adaptation Method Based on Automatic Speaker Clustering
Wei ZHANG  Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3  pp. 464-473
Type of Manuscript: Special Section PAPER (Special Issue on Speech Information Processing)
Category: Speech and Speaker Recognition
Keyword: 
speaker adaptation, speech recognition, speaker clustering, MLLR, MAP
  Summary |  Full Text:PDF

Language Modeling Using Patterns Extracted from Parse Trees for Speech Recognition
Takatoshi JITSUHIRO  Hirofumi YAMAMOTO  Setsuo YAMADA  Genichiro KIKUI  Yoshinori SAGISAKA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3  pp. 446-453
Type of Manuscript: Special Section PAPER (Special Issue on Speech Information Processing)
Category: Speech and Speaker Recognition
Keyword: 
speech recognition, language model, n-gram model, parser, pattern model
  Summary |  Full Text:PDF

Simultaneous Subtitling System for Broadcast News Programs with a Speech Recognizer
Akio ANDO  Toru IMAI  Akio KOBAYASHI  Shinich HOMMA  Jun GOTO  Nobumasa SEIYAMA  Takeshi MISHIMA  Takeshi KOBAYAKAWA  Shoei SATO  Kazuo ONOE  Hiroyuki SEGI  Atsushi IMAI  Atsushi MATSUI  Akira NAKAMURA  Hideki TANAKA  Tohru TAKAGI  Eiichi MIYASAKA  Haruo ISONO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/01/01
Vol. E86-D  No. 1  pp. 15-25
Type of Manuscript: INVITED PAPER (Special Issue on the 2001 IEICE Excellent Paper Award)
Category: 
Keyword: 
closed-caption service for news program, speech recognition, recognition error correction, real-time processing, system in the practical use
  Summary |  Full Text:PDF

Duration Modeling Using Cumulative Duration Probability
Tae-Young YANG  Chungyong LEE  Dae-Hee YOUN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2002/09/01
Vol. E85-D  No. 9  pp. 1452-1454
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
speech recognition, connected digit recognition, duration modeling, cumulative duration probability
  Summary |  Full Text:PDF

VLSI Architecture and Implementation for Speech Recognizer Based on Discriminative Bayesian Neural Network
Jhing-Fa WANG  Jia-Ching WANG  An-Nan SUEN  Chung-Hsien WU  Fan-Min LI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2002/08/01
Vol. E85-A  No. 8  pp. 1861-1869
Type of Manuscript: Special Section PAPER (Special Section on Digital Signal Processing)
Category: Implementations of Signal Processing Systems
Keyword: 
discriminative Bayesian neural network, speech recognition, VLSI
  Summary |  Full Text:PDF

A Survey on Automatic Speech Recognition
Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2002/03/01
Vol. E85-D  No. 3  pp. 465-486
Type of Manuscript: INVITED SURVEY PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, acoustic model, HMM, language model, ngram
  Summary |  Full Text:PDF

Recognition of Connected Digit Speech in Japanese Collected over the Telephone Network
Hisashi KAWAI  Tohru SHIMIZU  Norio HIGUCHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2001/03/01
Vol. E84-D  No. 3  pp. 374-383
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, digit, telephone, data size, low performance speakers, sheep and goats
  Summary |  Full Text:PDF

Speaker Adaptation Based on a Maximum Observation Probability Criterion
Tae-Young YANG  Chungyong LEE  Dae-Hee YOUN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2001/02/01
Vol. E84-D  No. 2  pp. 286-288
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
speech recognition, speaker adaptation, maximum observation probability criterion
  Summary |  Full Text:PDF

Japanese Pronunciation Instruction System Using Speech Recognition Methods
Chul-Ho JO  Tatsuya KAWAHARA  Shuji DOSHITA  Masatake DANTSUJI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2000/11/20
Vol. E83-D  No. 11  pp. 1960-1968
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, CALL, HMM, articulatory category, formant
  Summary |  Full Text:PDF

Maximum Likelihood Successive State Splitting Algorithm for Tied-Mixture HMnet
Alexandre GIRARDI  Harald SINGER  Kiyohiro SHIKANO  Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2000/10/20
Vol. E83-D  No. 10  pp. 1890-1897
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, acoustic modeling, HMM, tied-mixture, clustering
  Summary |  Full Text:PDF

Spectral Peak-Weighted Liftering of Cepstral Coefficients for Speech Recognition
Hong Kook KIM  Hwang Soo LEE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2000/07/20
Vol. E83-D  No. 7  pp. 1540-1549
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
speech recognition, cepstral analysis, peak-weighted cepstral lifter, frame-adaptive cepstral lifter
  Summary |  Full Text:PDF

Speech Enhancement Using Nonlinear Microphone Array Based on Complementary Beamforming
Hiroshi SARUWATARI  Shoji KAJITA  Kazuya TAKEDA  Fumitada ITAKURA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1999/08/20
Vol. E82-A  No. 8  pp. 1501-1510
Type of Manuscript: Special Section PAPER (Special Section on Digital Signal Processing)
Category: 
Keyword: 
speech enhancement, microphone array, complementary beamforming, spectral subtraction, speech recognition
  Summary |  Full Text:PDF

Realization of Wide-Band Directivity with Three Microphones
Masataka NAKAMURA  Katsuhito KOUNO  Toshitaka YAMATO  Kazuhiro SAKIYAMA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1999/04/20
Vol. E82-A  No. 4  pp. 619-625
Type of Manuscript: Special Section PAPER (Special Section on Advanced Signal Processing Techniques for Analysis of Acoustical and Vibrational Signals)
Category: 
Keyword: 
speech recognition, beamformer, microphone array, high SN ratio, analog signal processing
  Summary |  Full Text:PDF

Dynamic Cepstral Representations Based on Order-Dependent Windowing Methods
Hong Kook KIM  Seung Ho CHOI  Hwang Soo LEE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1998/05/20
Vol. E81-D  No. 5  pp. 434-440
Type of Manuscript: PAPER
Category: Speech Processing and Acoustics
Keyword: 
cepstrum, dynamic cepstrum, order-dependent windowing, speech recognition
  Summary |  Full Text:PDF

An Isolated Word Speech Recognition Based on Fusion of Visual and Auditory Information Using 30-frame/s and 24-bit Color Image
Akio OGIHARA  Shinobu ASAO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1997/08/20
Vol. E80-A  No. 8  pp. 1417-1422
Type of Manuscript: Special Section PAPER (Special Section on Digital Signal Processing)
Category: 
Keyword: 
speech recognition, fusion of visual and auditory, hidden Markov model, sensor fusion, full-frame (30-frame/s) and full-color (24-bit color) image
  Summary |  Full Text:PDF

Discriminative Training Based on Minimum Classification Error for a Small Amount of Data Enhanced by Vector-Field-Smoothed Bayesian Learning
Jun-ichi TAKAHASHI  Shigeki SAGAYAMA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1996/12/20
Vol. E79-D  No. 12  pp. 1700-1707
Type of Manuscript: PAPER
Category: Speech Processing and Acoustics
Keyword: 
speech recognition, hidden Markov model, discriminative training, speaker adaptation
  Summary |  Full Text:PDF

Speech Recognition Based on Fusion of Visual and Auditory Information Using Full-Frame Color Image
Satoru IGAWA  Akio OGIHARA  Akira SHINTANI  Shinobu TAKAMATSU 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1996/11/20
Vol. E79-A  No. 11  pp. 1836-1840
Type of Manuscript: Special Section LETTER (Special Section of Letters Selected from the 1996 IEICE General Conference)
Category: 
Keyword: 
speech recognition, fusion of visual and auditory, sensor fusion, hidden Markov model
  Summary |  Full Text:PDF

An Isolated Word Speech Recognition Using Fusion of Auditory and Visual Information
Akira SHINTANI  Akiko OGIHARA  Naoshi DOI  Shinobu TAKAMATSU 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1996/06/20
Vol. E79-A  No. 6  pp. 777-783
Type of Manuscript: Special Section PAPER (Special Section of Papers Selected from 1995 Joint Technical Conference on Circuits/Systems, Computers and Communications (JTC-CSCC '95))
Category: 
Keyword: 
HMM, fusion, linear combination, speech recognition, auditory and visual information
  Summary |  Full Text:PDF

The Performance Prediction on Sentence Recognition Using a Finite State Word Automaton
Takashi OTSUKI  Akinori ITO  Shozo MAKINO  Teruhiko OHTOMO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1996/01/20
Vol. E79-D  No. 1  pp. 47-53
Type of Manuscript: PAPER
Category: Speech Processing and Acoustics
Keyword: 
speech recognition, sentence recognition, performance prediction, finite state word automaton
  Summary |  Full Text:PDF

A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information
Naoshi DOI  Akira SHINTANI  Yasuhisa HAYASHI  Akio OGIHARA  Shinobu TAKAMATSU 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1995/11/20
Vol. E78-A  No. 11  pp. 1548-1552
Type of Manuscript: Special Section LETTER (Special Section of Letters Selected from the 1995 IEICE General Conference)
Category: 
Keyword: 
speech recognition, fusion of visual and auditory information, feature of mouth shape, image processing
  Summary |  Full Text:PDF

Unsupervised Speaker Adaptation Using All-Phoneme Ergodic Hidden Markov Network
Yasunage MIYAZAWA  Jun-ichi TAKAMI  Shigeki SAGAYAMA  Shoichi MATSUNAGA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/08/20
Vol. E78-D  No. 8  pp. 1044-1050
Type of Manuscript: PAPER
Category: Speech Processing and Acoustics
Keyword: 
speech recognition, unsupervised speaker adaptation, all-phoneme ergodic hidden Markov network, context-dependent phoneme bigram
  Summary |  Full Text:PDF

A Minimum Error Approach to Spotting-Based Pattern Recognition
Takashi KOMORI  Shigeru KATAGIRI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/08/20
Vol. E78-D  No. 8  pp. 1032-1043
Type of Manuscript: PAPER
Category: Speech Processing and Acoustics
Keyword: 
pattern recognition, word spotting, MCE/GPD, speech recognition
  Summary |  Full Text:PDF

A Scheme for Word Detection in Continuous Speech Using Likelihood Scores of Segments Modified by Their Context Within a Word
Sumio OHNO  Keikichi HIROSE  Hiroya FUJISAKI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/20
Vol. E78-D  No. 6  pp. 725-731
Type of Manuscript: Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
word spotting, speech recognition, template matching, human speech perception
  Summary |  Full Text:PDF

Error Analysis of Field Trial Results of a Spoken Dialogue System for Telecommunications Applications
Shingo KUROIWA  Kazuya TAKEDA  Masaki NAITO  Naomi INOUE  Seiichi YAMAMOTO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/20
Vol. E78-D  No. 6  pp. 636-641
Type of Manuscript: Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
speech recognition, dialogue system, field trial, telephone
  Summary |  Full Text:PDF

Duration Modeling with Decreased Intra-Group Temporal Variation for HMM-Based Phoneme Recognition
Nobuaki MINEMATSU  Keikichi HIROSE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/20
Vol. E78-D  No. 6  pp. 654-661
Type of Manuscript: Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
speech recognition, HMM, Gaussian mixture, temporal correspondence, duration model, looping rate, single occupancy rate, temporal variation rate
  Summary |  Full Text:PDF

Speech Recognition Using Function-Word N-Grams and Content-Word N-Grams
Ryosuke ISOTANI  Shoichi MATSUNAGA  Shigeki SAGAYAMA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/20
Vol. E78-D  No. 6  pp. 692-697
Type of Manuscript: Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
speech recognition, stochastic language model, N-gram, function words, content words
  Summary |  Full Text:PDF

Relationship among Recognition Rate, Rejection Rate and False Alarm Rate in a Spoken Word Recognition System
Atsuhiko KAI  Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/20
Vol. E78-D  No. 6  pp. 698-704
Type of Manuscript: Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
speech recognition, unknown word processing, simulation, rejection rate
  Summary |  Full Text:PDF

Speaker-Consistent Parsing for Speaker-Independent Continuous Speech Recognition
Kouichi YAMAGUCHI  Harald SINGER  Shoichi MATSUNAGA  Shigeki SAGAYAMA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1995/06/20
Vol. E78-D  No. 6  pp. 719-724
Type of Manuscript: Special Section PAPER (Special Issue on Spoken Language Processing)
Category: 
Keyword: 
speech recognition, search algorithm, hidden Markov model, speaker adaptation
  Summary |  Full Text:PDF

Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information
Akira SHINTANI  Akio OGIHARA  Yoshikazu YAMAGUCHI  Yasuhisa HAYASHI  Kunio FUKUNAGA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1994/11/20
Vol. E77-A  No. 11  pp. 1875-1878
Type of Manuscript: Special Section LETTER (Special Section of Letters Selected from the 1994 IEICE Spring Conference)
Category: 
Keyword: 
speech recognition, fusion of visual and auditory, sensor fusion, Hidden Markov Model
  Summary |  Full Text:PDF

A MRF-Based Parallel Processing for Speech Recognition Using Linear Predictive HMM
Hideki NODA  Mehdi N. SHIRAZI  Mamoru NAKATSUI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1994/10/20
Vol. E77-D  No. 10  pp. 1142-1147
Type of Manuscript: PAPER
Category: Speech Processing
Keyword: 
parallel processing, speech recognition, MRF model, HMM, ICM algorithm
  Summary |  Full Text:PDF

Spoken Sentence Recognition Based on HMM-LR with Hybrid Language Modeling
Kenji KITA  Tsuyoshi MORIMOTO  Kazumi OHKURA  Shigeki SAGAYAMA  Yaneo YANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1994/02/20
Vol. E77-D  No. 2  pp. 258-265
Type of Manuscript: Special Section PAPER (Special Issue on Natural Language Processing and Understanding)
Category: 
Keyword: 
speech recognition, LR parsing, hidden Markov model, HMM-LR, language model
  Summary |  Full Text:PDF

Speech Recognition of Isolated Digits Using Simultaneous Generative Histogram
Yasuhisa HAYASHI  Akio OGIHARA  Kunio FUKUNAGA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/12/20
Vol. E76-A  No. 12  pp. 2052-2054
Type of Manuscript: Special Section LETTER (Special Section of Letters Selected from the 1993 IEICE Fall Conference)
Category: 
Keyword: 
speech recognition, hidden Markov model, separate vector quantization, simultaneous generative histogram
  Summary |  Full Text:PDF

A Hardware Architecture Design Methodology for Hidden Markov Model Based Recognition Systems Using Parallel Processing
Jun-ichi TAKAHASHI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/06/20
Vol. E76-A  No. 6  pp. 990-1000
Type of Manuscript: PAPER
Category: Digital Signal Processing
Keyword: 
hidden Markov model, speech recognition, corrective training, parallel processing, array processor, system design
  Summary |  Full Text:PDF

Automatic Evaluation of English Pronunciation Based on Speech Recognition Techniques
Hiroshi HAMADA  Satoshi MIKI  Ryohei NAKATSU 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/03/20
Vol. E76-D  No. 3  pp. 352-359
Type of Manuscript: PAPER
Category: Speech Processing
Keyword: 
speech processing, pronunciation, speech recognition, speaker adaptation, education, English
  Summary |  Full Text:PDF

Three Different LR Parsing Algorithms for Phoneme-Context-Dependent HMM-Based Continuous Speech Recognition
Akito NAGAI  Shigeki SAGAYAMA  Kenji KITA  Hideaki KIKUCHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/20
Vol. E76-D  No. 1  pp. 29-37
Type of Manuscript: Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
speech recognition, Hidden Markov Model, CFG, LR parser, allophone, phoneme context
  Summary |  Full Text:PDF

Task Adaptation in Syllable Trigram Models for Continuous Speech Recognition
Sho-ichi MATSUNAGA  Tomokazu YAMADA  Kiyohiro SHIKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/20
Vol. E76-D  No. 1  pp. 38-43
Type of Manuscript: Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
speech recognition, stochastic language model
  Summary |  Full Text:PDF

Predicting the Next Utterance Linguistic Expressions Using Contextual Information
Hitoshi IIDA  Takayuhi YAMAOKA  Hidekazu ARITA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/20
Vol. E76-D  No. 1  pp. 62-73
Type of Manuscript: Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
dialogue understanding, plan recognition, universal pragmatics, utterance prediction, speech recognition
  Summary |  Full Text:PDF

A Spoken Dialog System with Verification and Clarification Queries
Mikio YAMAMOTO  Satoshi KOBAYASHI  Yuji MORIYA  Seiichi NAKAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/20
Vol. E76-D  No. 1  pp. 84-94
Type of Manuscript: Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
natural language processing, speech recognition, dialog system, verification, clarification
  Summary |  Full Text:PDF

LR Parsing with a Category Reachability Test Applied to Speech Recognition
Kenji KITA  Tsuyoshi MORIMOTO  Shigeki SAGAYAMA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/20
Vol. E76-D  No. 1  pp. 23-28
Type of Manuscript: Special Section PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
speech recognition, HMMs, LR parsing, reachability, LR-CRT algorithm
  Summary |  Full Text:PDF

How Might One Comfortably Converse with a Machine?
Yasuhisa NIIMI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/20
Vol. E76-D  No. 1  pp. 9-16
Type of Manuscript: INVITED PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: 
speech dialogue system, confirmation, adaptation, cooperativeness, speech recognition, speech synthesis
  Summary |  Full Text:PDF

An SVQ-HMM Training Method Using Simultaneous Generative Histogram
Yasuhisa HAYASHI  Satoshi KONDO  Nobuyuki TAKASU  Akio OGIHARA  Shojiro YONEDA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1992/07/20
Vol. E75-A  No. 7  pp. 905-907
Type of Manuscript: Special Section LETTER (Special Section on the 1992 IEICE Spring Conference)
Category: 
Keyword: 
speech recognition, separate vector quantization, hidden Markov model, simultaneous generative histogram
  Summary |  Full Text:PDF

Neural Networks Applied to Speech Recognition
Hiroaki SAKOE 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1992/05/20
Vol. E75-A  No. 5  pp. 546-551
Type of Manuscript: INVITED PAPER (Special Section on Nonlinear Dynamics--Adaptive, Learning and Neural Systems--)
Category: 
Keyword: 
neural network, speech recognition, dynamic programming, hidden Markov model
  Summary |  Full Text:PDF

Future Perspective of Automatic Telephone Interpretation
Akira KUREMATSU 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 1992/01/20
Vol. E75-B  No. 1  pp. 14-19
Type of Manuscript: INVITED PAPER (Special Section on Dreams of Future Communications)
Category: 
Keyword: 
speech recognition, machine translation, speech synthesis, speech translation
  Summary |  Full Text:PDF

A Study of Line Spectrum Pair Frequency Representation for Speech Recognition
Fikret S. GURGEN  Shigeki SAGAYAMA  Sadaoki FURUI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1992/01/20
Vol. E75-A  No. 1  pp. 98-102
Type of Manuscript: PAPER
Category: Speech
Keyword: 
speech recognition, line spectrum pair (LSP), transitional parameter
  Summary |  Full Text:PDF