Yonghong YAN


Short Text Classification Based on Distributional Representations of Words
Chenglong MA Qingwei ZHAO Jielin PAN Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/10/01
Vol. E99-D  No. 10  pp. 2562-2565
Type of Manuscript:  Special Section LETTER (Special Section on Recent Advances in Machine Learning for Spoken Language Processing)
Category: Text classification
Keyword: short text classification, word embedding, gaussian model

Speeding up Deep Neural Networks in Speech Recognition with Piecewise Quantized Sigmoidal Activation Function
Anhao XING Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/10/01
Vol. E99-D  No. 10  pp. 2558-2561
Type of Manuscript:  Special Section LETTER (Special Section on Recent Advances in Machine Learning for Spoken Language Processing)
Category: Acoustic modeling
Keyword: deep neural networks, speech recognition, activation function, fixed-point quantization

Multi-Task Learning in Deep Neural Networks for Mandarin-English Code-Mixing Speech Recognition
Mengzhe CHEN Jielin PAN Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/10/01
Vol. E99-D  No. 10  pp. 2554-2557
Type of Manuscript:  Special Section LETTER (Special Section on Recent Advances in Machine Learning for Spoken Language Processing)
Category: Acoustic modeling
Keyword: multi-task learning, deep neural network, Mandarin-English code mixing, speech recognition

Improved End-to-End Speech Recognition Using Adaptive Per-Dimensional Learning Rate Methods
Xuyang WANG Pengyuan ZHANG Qingwei ZHAO Jielin PAN Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/10/01
Vol. E99-D  No. 10  pp. 2550-2553
Type of Manuscript:  Special Section LETTER (Special Section on Recent Advances in Machine Learning for Spoken Language Processing)
Category: Acoustic modeling
Keyword: connectionist temporal classification, adaptive per-dimensional learning rate method, end-to-end ASR

Policy Optimization for Spoken Dialog Management Using Genetic Algorithm
Hang REN Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/10/01
Vol. E99-D  No. 10  pp. 2499-2507
Type of Manuscript:  Special Section PAPER (Special Section on Recent Advances in Machine Learning for Spoken Language Processing)
Category: Spoken dialog system
Keyword: spoken dialog management, spoken dialog system, genetic algorithm

A Hybrid Approach for Reverberation Simulation
Risheng XIA Junfeng LI Andrea PRIMAVERA Stefania CECCHI Yôiti SUZUKI Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2015/10/01
Vol. E98-A  No. 10  pp. 2101-2108
Type of Manuscript:  PAPER
Category: Engineering Acoustics
Keyword: image-source method, feedback delay network, energy decay relief

Discriminative Pronunciation Modeling Using the MPE Criterion
Meixu SONG Jielin PAN Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2015/03/01
Vol. E98-D  No. 3  pp. 717-720
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: automatic speech recognition, pronunciation models, discriminative training, Mandarin conversational speech recognition

Smoothing Method for Improved Minimum Phone Error Linear Regression
Yaohui QI Fuping PAN Fengpei GE Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/08/01
Vol. E97-D  No. 8  pp. 2105-2113
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: speaker adaptation (SA), maximum likelihood linear regression (MLLR), maximum a posteriori linear regression (MAPLR), minimum phone error linear regression (MPELR), discriminative maximum a posteriori linear regression (DMAPLR)

Discriminative Approach to Build Hybrid Vocabulary for Conversational Telephone Speech Recognition of Agglutinative Languages
Xin LI Jielin PAN Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/11/01
Vol. E96-D  No. 11  pp. 2478-2482
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: agglutinative languages, speech recognition, sub-words, discriminative learning, hybrid system

Speaker Recognition Using Sparse Probabilistic Linear Discriminant Analysis
Hai YANG Yunfei XU Qinwei ZHAO Ruohua ZHOU Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2013/10/01
Vol. E96-A  No. 10  pp. 1938-1945
Type of Manuscript:  Special Section PAPER (Special Section on Sparsity-aware Signal Processing)
Category: 
Keyword: speaker recognition, i-vectors, sparse representation, laplace prior

Fuzzy Matching of Semantic Class in Chinese Spoken Language Understanding
Yanling LI Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/08/01
Vol. E96-D  No. 8  pp. 1845-1852
Type of Manuscript:  PAPER
Category: Natural Language Processing
Keyword: fuzzy matching, Conditional Random Field (CRF), Spoken Language Understanding (SLU), Named Entity Recognition (NER), similarity function

A Novel Discriminative Method for Pronunciation Quality Assessment
Junbo ZHANG Fuping PAN Bin DONG Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/05/01
Vol. E96-D  No. 5  pp. 1145-1151
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: pronunciation assessment, automatic scoring, distinctiveness training, maximum entropy

A Forced Alignment Based Approach for English Passage Reading Assessment
Junbo ZHANG Fuping PAN Bin DONG Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/12/01
Vol. E95-D  No. 12  pp. 3046-3052
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: CALL, automatic assessment, forced alignment, formant

Factor Analysis of Neighborhood-Preserving Embedding for Speaker Verification
Chunyan LIANG Lin YANG Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/10/01
Vol. E95-D  No. 10  pp. 2572-2576
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: speaker verification, neighborhood-preserving embedding, total variability, support vector machine, cosine distance scoring

Noise Robust Feature Scheme for Automatic Speech Recognition Based on Auditory Perceptual Mechanisms
Shang CAI Yeming XIAO Jielin PAN Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/06/01
Vol. E95-D  No. 6  pp. 1610-1618
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: automatic speech recognition, noise robustness, critical bandwidth, frequency masking, temporal masking

Two-Microphone Noise Reduction Using Spatial Information-Based Spectral Amplitude Estimation
Kai LI Yanmeng GUO Qiang FU Junfeng LI Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/05/01
Vol. E95-D  No. 5  pp. 1454-1464
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: microphone array, phase difference, magnitude squared coherence, spectral amplitude estimator

Logarithmic Adaptive Quantization Projection for Audio Watermarking
Xuemin ZHAO Yuhong GUO Jian LIU Yonghong YAN Qiang FU 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/05/01
Vol. E95-D  No. 5  pp. 1436-1445
Type of Manuscript:  PAPER
Category: Information Network
Keyword: audio watermarking, quantization index modulation (QIM), psychoacoustic model

A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
Yu ZHOU Junfeng LI Yanqing SUN Jianping ZHANG Yonghong YAN Masato AKAGI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/10/01
Vol. E93-D  No. 10  pp. 2813-2821
Type of Manuscript:  PAPER
Category: Human-computer Interaction
Keyword: speech emotion recognition, non-uniform subband processing, spectral feature, prosodic feature

Enhancing the Robustness of the Posterior-Based Confidence Measures Using Entropy Information for Speech Recognition
Yanqing SUN Yu ZHOU Qingwei ZHAO Pengyuan ZHANG Fuping PAN Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9  pp. 2431-2439
Type of Manuscript:  Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Robust Speech Recognition
Keyword: OOV, speech recognition, confidence measure, entropy information, phoneme-level posterior

Acoustic Feature Optimization Based on F-Ratio for Robust Speech Recognition
Yanqing SUN Yu ZHOU Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9  pp. 2417-2430
Type of Manuscript:  Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Robust Speech Recognition
Keyword: mismatched speech, robust speech recognition, F-Ratio, subband design, feature optimization

Approximate Decision Function and Optimization for GMM-UBM Based Speaker Verification
Xiang XIAO Xiang ZHANG Haipeng WANG Hongbin SUO Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2009/09/01
Vol. E92-D  No. 9  pp. 1798-1802
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: automatic speaker verification, contribution weight re-estimation, optimization

An LVCSR Based Reading Miscue Detection System Using Knowledge of Reference and Error Patterns
Changliang LIU Fuping PAN Fengpei GE Bin DONG Hongbin SUO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2009/09/01
Vol. E92-D  No. 9  pp. 1716-1724
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: CALL, reading tutor, reading miscues, LVCSR, multiple pronunciation

Automatic Singing Performance Evaluation for Untrained Singers
Chuan CAO Ming LI Xiao WU Hongbin SUO Jian LIU Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2009/08/01
Vol. E92-D  No. 8  pp. 1596-1600
Type of Manuscript:  LETTER
Category: Music Information Processing
Keyword: automatic/objective evaluation, singing performance assessment, feature combination

Using a Kind of Novel Phonotactic Information for SVM Based Speaker Recognition
Xiang ZHANG Hongbin SUO Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2009/04/01
Vol. E92-D  No. 4  pp. 746-749
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: speaker recognition, Gaussian mixture model, universal background model, support vector machine

Speech Enhancement Using Improved Adaptive Null-Forming in Frequency Domain with Postfilter
Heng ZHANG Qiang FU Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2008/12/01
Vol. E91-A  No. 12  pp. 3812-3816
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: speech enhancement, adaptive null-forming, auditory subband, postfilter, robust ASR

Robust Speaker Clustering Using Affinity Propagation
Xiang ZHANG Ping LU Hongbin SUO Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/11/01
Vol. E91-D  No. 11  pp. 2739-2741
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: speaker clustering, agglomerative hierarchical clustering, affinity propagation, generalized likelihood ratio

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech
Fengpei GE Changliang LIU Jian SHAO Fuping PAN Bin DONG Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/10/01
Vol. E91-D  No. 10  pp. 2485-2492
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: CALL, speech recognition, HLDA, speaker-dependent CMN, e-learning

Melody Track Selection Using Discriminative Language Model
Xiao WU Ming LI Hongbin SUO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/06/01
Vol. E91-D  No. 6  pp. 1838-1840
Type of Manuscript:  LETTER
Category: Music Information Processing
Keyword: melody style, melody track selection, melody extraction

Development of a Mandarin-English Bilingual Speech Recognition System for Real World Music Retrieval
Qingqing ZHANG Jielin PAN Yang LIN Jian SHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 514-521
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Acoustic Modeling
Keyword: bilingual speech recognition, phone clustering, TCM, log-likelihood measure, non-native adaptation, Mandarin-English

Automatic Language Identification with Discriminative Language Characterization Based on SVM
Hongbin SUO Ming LI Ping LU Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 567-575
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: Language Identification
Keyword: language identification, supervised speaker clustering, support vector machine, discriminative language characterization score vector, pair-wise posterior probability estimation

A One-Pass Real-Time Decoder Using Memory-Efficient State Network
Jian SHAO Ta LI Qingqing ZHANG Qingwei ZHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/03/01
Vol. E91-D  No. 3  pp. 529-537
Type of Manuscript:  Special Section PAPER (Special Section on Robust Speech Processing in Realistic Environments)
Category: ASR System Architecture
Keyword: real-time, memory-efficient, layer-dependent beam pruning

Effects of the Temporal Fine Structure in Different Frequency Bands on Mandarin Tone Perception
Lin YANG Jianping ZHANG Jian SHAO Yonghong YAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2008/02/01
Vol. E91-D  No. 2  pp. 371-374
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: tone perception, auditory chimaeras, fine structure cues