Keyword : speech synthesis


DNN-Based Speech Synthesis Using Speaker Codes
Nobukatsu HOJO Yusuke IJIMA Hideyuki MIZUNO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2018/02/01
Vol. E101-D  No. 2 ; pp. 462-472
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: speech synthesis, acoustic model, deep neural network, speaker codes

Development and Evaluation of Online Infrastructure to Aid Teaching and Learning of Japanese Prosody
Nobuaki MINEMATSU Ibuki NAKAMURA Masayuki SUZUKI Hiroko HIRANO Chieko NAKAGAWA Noriko NAKAMURA Yukinori TAGAWA Keikichi HIROSE Hiroya HASHIMOTO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2017/04/01
Vol. E100-D  No. 4 ; pp. 662-669
Type of Manuscript:  INVITED PAPER (Special Section on Award-winning Papers)
Category: 
Keyword: speech training in Japanese, word accent, intonation, speech synthesis, accent prediction, assessment

Development of the “VoiceTra” Multi-Lingual Speech Translation System
Shigeki MATSUDA Teruaki HAYASHI Yutaka ASHIKARI Yoshinori SHIGA Hidenori KASHIOKA Keiji YASUDA Hideo OKUMA Masao UCHIYAMA Eiichiro SUMITA Hisashi KAWAI Satoshi NAKAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2017/04/01
Vol. E100-D  No. 4 ; pp. 621-632
Type of Manuscript:  INVITED PAPER (Special Section on Award-winning Papers)
Category: 
Keyword: speech translation, statistical machine translation, speech recognition, speech synthesis

Investigation of Using Continuous Representation of Various Linguistic Units in Neural Network Based Text-to-Speech Synthesis
Xin WANG Shinji TAKAKI Junichi YAMAGISHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/10/01
Vol. E99-D  No. 10 ; pp. 2471-2480
Type of Manuscript:  Special Section PAPER (Special Section on Recent Advances in Machine Learning for Spoken Language Processing)
Category: Speech synthesis
Keyword: text-to-speech, speech synthesis, recurrent neural network, contexts, word embedding

WORLD: A Vocoder-Based High-Quality Speech Synthesis System for Real-Time Applications
Masanori MORISE Fumiya YOKOMORI Kenji OZAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/07/01
Vol. E99-D  No. 7 ; pp. 1877-1884
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: speech analysis, speech synthesis, vocoder, sound quality, real-time processing

Unsupervised Prosodic Labeling of Speech Synthesis Databases Using Context-Dependent HMMs
Chen-Yu YANG Zhen-Hua LING Li-Rong DAI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/06/01
Vol. E97-D  No. 6 ; pp. 1449-1460
Type of Manuscript:  Special Section PAPER (Special Section on Advances in Modeling for Real-world Speech Information Processing and its Application)
Category: Speech Synthesis and Related Topics
Keyword: speech synthesis, prosodic labeling, hidden Markov model, prosodic phrase boundary, emphasis expression

Admissible Stopping in Viterbi Beam Search for Unit Selection Speech Synthesis
Shinsuke SAKAI Tatsuya KAWAHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/06/01
Vol. E96-D  No. 6 ; pp. 1359-1367
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: speech synthesis, unit selection, concatenation cost, Viterbi search

Spectral Features for Perceptually Natural Phoneme Replacement by Another Speaker's Speech
Reiko TAKOU Hiroyuki SEGI Tohru TAKAGI Nobumasa SEIYAMA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2012/04/01
Vol. E95-A  No. 4 ; pp. 751-759
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: individuality, speech perception, speech synthesis, voice quality

Probabilistic Concatenation Modeling for Corpus-Based Speech Synthesis
Shinsuke SAKAI Tatsuya KAWAHARA Hisashi KAWAI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/10/01
Vol. E94-D  No. 10 ; pp. 2006-2014
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: speech synthesis, unit selection, concatenation cost, join cost

Improvements of the One-to-Many Eigenvoice Conversion System
Yamato OHTANI Tomoki TODA Hiroshi SARUWATARI Kiyohiro SHIKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9 ; pp. 2491-2499
Type of Manuscript:  Special Section PAPER (Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction)
Category: Voice Conversion
Keyword: speech synthesis, eigenvoice conversion, STRAIGHT mixed excitation, global variance, adaptive training

Adaptive Training for Voice Conversion Based on Eigenvoices
Yamato OHTANI Tomoki TODA Hiroshi SARUWATARI Kiyohiro SHIKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/06/01
Vol. E93-D  No. 6 ; pp. 1589-1598
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: speech synthesis, voice conversion, Gaussian mixture model, eigenvoice, adaptive training

A Covariance-Tying Technique for HMM-Based Speech Synthesis
Keiichiro OURA Heiga ZEN Yoshihiko NANKAKU Akinobu LEE Keiichi TOKUDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/03/01
Vol. E93-D  No. 3 ; pp. 595-601
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: HMM, speech synthesis, decision tree, context-clustering, MDL criterion, embedded device

State Duration Modeling for HMM-Based Speech Synthesis
Heiga ZEN Takashi MASUKO Keiichi TOKUDA Takayoshi YOSHIMURA Takao KOBAYASHI Tadashi KITAMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2007/03/01
Vol. E90-D  No. 3 ; pp. 692-693
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: duration modeling, speech synthesis, hidden Markov model

Hybrid Voice Conversion of Unit Selection and Generation Using Prosody Dependent HMM
Tadashi OKUBO Ryo MOCHIZUKI Tetsunori KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/11/01
Vol. E89-D  No. 11 ; pp. 2775-2782
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: voice conversion, speech synthesis, HMM, unit selection, MLLR

Implementation and Evaluation of an HMM-Based Korean Speech Synthesis System
Sang-Jin KIM Jong-Jin KIM Minsoo HAHN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3 ; pp. 1116-1119
Type of Manuscript:  Special Section LETTER (Special Section on Statistical Modeling for Speech Processing)
Category: 
Keyword: HMM-based speech synthesis, speech synthesis, HMM, context clustering, Korean

Concatenative Speech Synthesis Based on the Plural Unit Selection and Fusion Method
Tatsuya MIZUTANI Takehiko KAGOSHIMA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/11/01
Vol. E88-D  No. 11 ; pp. 2565-2572
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: speech synthesis, plural unit selection, unit fusion, unit training, sense of stability and sense of voice

Rules and Algorithms for Phonetic Transcription of Standard Malay
Yousif A. EL-IMAM Zuraidah Mohd DON 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/10/01
Vol. E88-D  No. 10 ; pp. 2354-2372
Type of Manuscript:  PAPER
Category: Speech and Hearing
Keyword: grapheme-to-phoneme conversion, Standard Malay phonology and phonetics, speech synthesis, text-to-speech conversion

Fundamental Frequency Modeling for Speech Synthesis Based on a Statistical Learning Technique
Shinsuke SAKAI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3 ; pp. 489-495
Type of Manuscript:  Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Speech Synthesis and Prosody
Keyword: speech synthesis, fundamental frequency, additive models, statistical learning

Developments in Corpus-Based Speech Synthesis: Approaching Natural Conversational Speech
Nick CAMPBELL 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/03/01
Vol. E88-D  No. 3 ; pp. 376-383
Type of Manuscript:  INVITED PAPER (Special Section on Corpus-Based Speech Technologies)
Category: 
Keyword: speech synthesis, corpora, concatenation, paralinguistic information, communication, affect

A Low-Band Spectrum Envelope Reconstruction Method for PSOLA-Based F0 Modification
Ryo MOCHIZUKI Tetsunori KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/10/01
Vol. E87-D  No. 10 ; pp. 2426-2429
Type of Manuscript:  LETTER
Category: Speech and Hearing
Keyword: F0 modification, spectrum envelope, PSOLA, speech synthesis

A Stochastic F0 Contour Model Based on Clustering and a Probabilistic Measure
Yoichi YAMASHITA Tomoyoshi ISHIDA Kazuki SHIMADERA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/03/01
Vol. E86-D  No. 3 ; pp. 543-549
Type of Manuscript:  Special Section PAPER (Special Issue on Speech Information Processing)
Category: Speech Synthesis and Prosody
Keyword: speech synthesis, F0 model, clustering, bunsetsu F0 shape, stochastic method

Fractal Modeling of Fluctuations in the Steady Part of Sustained Vowels for High Quality Speech Synthesis
Naofumi AOKI Tohru IFUKUBE 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1998/09/25
Vol. E81-A  No. 9 ; pp. 1803-1810
Type of Manuscript:  Special Section PAPER (Special Section on Nonlinear Theory and Its Applications)
Category: Chaos, Bifurcation and Fractal
Keyword: naturalness of sustained vowels, speech synthesis, random fractal, 1/f^β fluctuation

Linguistic Intelligent CAI System Using Speech Data-Base
Kyu-Keon LEE Katsuhiko SHIRAI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1995/11/25
Vol. E78-A  No. 11 ; pp. 1562-1565
Type of Manuscript:  Special Section LETTER (Special Section of Letters Selected from the 1995 IEICE General Conference)
Category: 
Keyword: ICAI system, speech synthesis, pitch pattern

High Quality Synthetic Speech Generation Using Synchronized Oscillators
Kenji HASHIMOTO Takemi MOCHIDA Yasuaki SATO Tetsunori KOBAYASHI Katsuhiko SHIRAI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/11/25
Vol. E76-A  No. 11 ; pp. 1949-1956
Type of Manuscript:  Special Section PAPER (Special Section on Speech Synthesis: Current Technologies and Their Application)
Category: 
Keyword: speech synthesis, sinusoidal model, non-linear differential equation, pitch control

High Quality Speech Synthesis System Based on Waveform Concatenation of Phoneme Segment
Tomohisa HIROKAWA Kenzo ITOH Hirokazu SATO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/11/25
Vol. E76-A  No. 11 ; pp. 1964-1970
Type of Manuscript:  Special Section PAPER (Special Section on Speech Synthesis: Current Technologies and Their Application)
Category: 
Keyword: speech synthesis, text-to-speech, waveform dictionary, prosody set rule

Development of TTS Card for PCs and TTS Software for WSs
Yoshiyuki HARA Tsuneo NITTA Hiroyoshi SAITO Ken'ichiro KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/11/25
Vol. E76-A  No. 11 ; pp. 1999-2007
Type of Manuscript:  Special Section PAPER (Special Section on Speech Synthesis: Current Technologies and Their Application)
Category: 
Keyword: speech synthesis, text-to-speech, multimedia, media conversion, human interface

Speech Segment Selection for Concatenative Synthesis Based on Spectral Distortion Minimization
Naoto IWAHASHI Nobuyoshi KAIKI Yoshinori SAGISAKA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/11/25
Vol. E76-A  No. 11 ; pp. 1942-1948
Type of Manuscript:  Special Section PAPER (Special Section on Speech Synthesis: Current Technologies and Their Application)
Category: 
Keyword: speech synthesis, segment selection, dynamic programming, spectral distortion

Tree-Based Approaches to Automatic Generation of Speech Synthesis Rules for Prosodic Parameters
Yoichi YAMASHITA Manabu TANAKA Yoshitake AMAKO Yasuo NOMURA Yoshikazu OHTA Atsunori KITOH Osamu KAKUSHO Riichiro MIZOGUCHI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/11/25
Vol. E76-A  No. 11 ; pp. 1934-1941
Type of Manuscript:  Special Section PAPER (Special Section on Speech Synthesis: Current Technologies and Their Application)
Category: 
Keyword: speech synthesis, decision tree, automatic rule generation, accent component, long noun phrase

Phoneme Power Control for Speech Synthesis
Kenzo ITOH Tomohisa HIROKAWA Hirokazu SATO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/11/25
Vol. E76-A  No. 11 ; pp. 1911-1918
Type of Manuscript:  Special Section PAPER (Special Section on Speech Synthesis: Current Technologies and Their Application)
Category: 
Keyword: speech synthesis, text-to-speech, power control, phoneme environment

Significance of Suitability Assessment in Speech Synthesis Applications
Hideki KASUYA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1993/11/25
Vol. E76-A  No. 11 ; pp. 1893-1897
Type of Manuscript:  INVITED PAPER (Special Section on Speech Synthesis: Current Technologies and Their Application)
Category: 
Keyword: speech synthesis, quality assessment, human factors, prosody

How Might One Comfortably Converse with a Machine?
Yasuhisa NIIMI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1993/01/25
Vol. E76-D  No. 1 ; pp. 9-16
Type of Manuscript:  INVITED PAPER (Special Issue on Speech and Discourse Processing in Dialogue Systems)
Category: 
Keyword: speech dialogue system, confirmation, adaptation, cooperativeness, speech recognition, speech synthesis

Prosodic Control to Express Emotions for Man-Machine Speech Interaction
Yoshinori KITAHARA Yoh'ichi TOHKURA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1992/02/25
Vol. E75-A  No. 2 ; pp. 155-163
Type of Manuscript:  Special Section PAPER (Special Section on Fundamentals of Next Generation Human Interface)
Category: 
Keyword: emotion, prosody, speech synthesis, human interface

Future Perspective of Automatic Telephone Interpretation
Akira KUREMATSU 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 1992/01/25
Vol. E75-B  No. 1 ; pp. 14-19
Type of Manuscript:  INVITED PAPER (Special Section on Dreams of Future Communications)
Category: 
Keyword: speech recognition, machine translation, speech synthesis, speech translation