| 
[1]
 | 
Srikanth Ronanki, Oliver Watts, and Simon King.
 A Hierarchical Encoder-Decoder Model for Statistical Parametric
  Speech Synthesis.
 In Proc. Interspeech 2017, August 2017.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[2]
 | 
Felipe Espic, Cassia Valentini-Botinhao, and Simon King.
 Direct modelling of magnitude and phase spectra for statistical
  parametric speech synthesis.
 In Proc. Interspeech, Stockholm, Sweden, August 2017.
[ bib | 
.PDF | 
Abstract ]
 | 
| 
[3]
 | 
Joseph Mendelson, Pilar Oplustil, Oliver Watts, and Simon King.
 Nativization of foreign names in TTS for automatic reading of world
  news in Swahili.
 In Interspeech 2017, May 2017.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[4]
 | 
Srikanth Ronanki, Oliver Watts, Simon King, and Gustav Eje Henter.
 Median-Based Generation of Synthetic Speech Durations using a
  Non-Parametric Approach.
 In Proc. IEEE Workshop on Spoken Language Technology (SLT),
  December 2016.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[5]
 | 
Srikanth Ronanki, Siva Reddy, Bajibabu Bollepalli, and Simon King.
 DNN-based Speech Synthesis for Indian Languages from ASCII text.
 In Proc. 9th ISCA Speech Synthesis Workshop (SSW9), Sunnyvale,
  CA, USA, September 2016.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[6]
 | 
Srikanth Ronanki, Gustav Eje Henter, Zhizheng Wu, and Simon King.
 A template-based approach for speech synthesis intonation generation
  using LSTMs.
 In Proc. Interspeech, San Francisco, USA, September 2016.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[7]
 | 
Srikanth Ronanki, Zhizheng Wu, Oliver Watts, and Simon King.
 A Demonstration of the Merlin Open Source Neural Network Speech
  Synthesis System.
 In Proc. Speech Synthesis Workshop (SSW9), September 2016.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[8]
 | 
Felipe Espic, Cassia Valentini-Botinhao, Zhizheng Wu, and Simon King.
 Waveform generation based on signal reshaping for statistical
  parametric speech synthesis.
 In Proc. Interspeech, pages 2263-2267, San Francisco, CA, USA,
  September 2016.
[ bib | 
.PDF | 
Abstract ]
 | 
| 
[9]
 | 
Zhizheng Wu, Oliver Watts, and Simon King.
 Merlin: An open source neural network speech synthesis system.
 In 9th ISCA Speech Synthesis Workshop (2016), pages 218-223,
  September 2016.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[10]
 | 
Korin Richmond and Simon King.
 Smooth talking: Articulatory join costs for unit selection.
 In Proc. IEEE International Conference on Acoustics, Speech, and
  Signal Processing (ICASSP), pages 5150-5154, March 2016.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[11]
 | 
Gustav Eje Henter, Srikanth Ronanki, Oliver Watts, Mirjam Wester, Zhizheng Wu,
  and Simon King.
 Robust TTS duration modelling using DNNs.
 In Proc. ICASSP, volume 41, pages 5130-5134, Shanghai, China,
  March 2016.
[ bib | 
http | 
.pdf | 
Abstract ]
 | 
| 
[12]
 | 
Oliver Watts, Gustav Eje Henter, Thomas Merritt, Zhizheng Wu, and Simon King.
 From HMMs to DNNs: where do the improvements come from?
 In Proc. ICASSP, volume 41, pages 5505-5509, Shanghai, China,
  March 2016.
[ bib | 
http | 
.pdf | 
Abstract ]
 | 
| 
[13]
 | 
Adriana Stan, Yoshitaka Mamiya, Junichi Yamagishi, Peter Bell, Oliver Watts,
  Rob Clark, and Simon King.
 ALISA: An automatic lightly supervised speech segmentation and
  alignment tool.
 Computer Speech and Language, 35:116-133, 2016.
[ bib | 
DOI | 
http | 
.pdf | 
Abstract ]
 | 
| 
[14]
 | 
Thomas Merritt, Robert A J Clark, Zhizheng Wu, Junichi Yamagishi, and Simon
  King.
 Deep neural network-guided unit selection synthesis.
 In Proc. ICASSP, 2016.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[15]
 | 
Lau Chee Yong, Oliver Watts, and Simon King.
 Combining lightly-supervised learning and user feedback to construct
  and improve a statistical parametric speech synthesizer for Malay.
 Research Journal of Applied Sciences, Engineering and
  Technology, 11(11):1227-1232, December 2015.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[16]
 | 
C. Valentini-Botinhao, Z. Wu, and S. King.
 Towards minimum perceptual error training for DNN-based speech
  synthesis.
 In Proc. Interspeech, Dresden, Germany, September 2015.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[17]
 | 
Thomas Merritt, Junichi Yamagishi, Zhizheng Wu, Oliver Watts, and Simon King.
 Deep neural network context embeddings for model selection in
  rich-context HMM synthesis.
 In Proc. Interspeech, Dresden, September 2015.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[18]
 | 
Oliver Watts, Zhizheng Wu, and Simon King.
 Sentence-level control vectors for deep neural network speech
  synthesis.
 In INTERSPEECH 2015 16th Annual Conference of the International
  Speech Communication Association, pages 2217-2221. International Speech
  Communication Association, September 2015.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[19]
 | 
Marcus Tomalin, Mirjam Wester, Rasmus Dall, Bill Byrne, and Simon King.
 A lattice-based approach to automatic filled pause insertion.
 In Proc. DiSS 2015, Edinburgh, August 2015.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[20]
 | 
Z. Wu, C. Valentini-Botinhao, O. Watts, and S. King.
 Deep neural networks employing multi-task learning and stacked
  bottleneck features for speech synthesis.
 In Proc. ICASSP, pages 4460-4464, Brisbane, Australia, April
  2015.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[21]
 | 
Thomas Merritt, Javier Latorre, and Simon King.
 Attributing modelling errors in HMM synthesis by stepping gradually
  from natural to modelled speech.
 In Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing (ICASSP), pages 4220-4224,
  Brisbane, April 2015.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[22]
 | 
Zhizheng Wu and Simon King.
 Minimum trajectory error training for deep neural networks, combined
  with stacked bottleneck features.
 In Interspeech, 2015.
[ bib | 
.pdf ]
 | 
| 
[23]
 | 
Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, and Simon
  King.
 A study of speaker adaptation for DNN-based speech synthesis.
 In Interspeech, 2015.
[ bib | 
.pdf ]
 | 
| 
[24]
 | 
Zhizheng Wu, Cassia Valentini-Botinhao, Oliver Watts, and Simon King.
 Deep neural network employing multi-task learning and stacked
  bottleneck features for speech synthesis.
 In Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing (ICASSP), 2015.
[ bib | 
.pdf ]
 | 
| 
[25]
 | 
Zhizheng Wu, Ali Khodabakhsh, Cenk Demiroglu, Junichi Yamagishi, Daisuke Saito,
  Tomoki Toda, and Simon King.
 SAS: A speaker verification spoofing database containing diverse
  attacks.
 In Proceedings of the IEEE International Conference on
  Acoustics, Speech, and Signal Processing (ICASSP), 2015.
[ bib | 
.pdf ]
 | 
| 
[26]
 | 
Victor Poblete, Felipe Espic, Simon King, Richard M. Stern, Fernando Huenupan,
  Josue Fredes, and Nestor Becerra Yoma.
 A perceptually-motivated low-complexity instantaneous linear channel
  normalization technique applied to speaker verification.
 Computer Speech & Language, 31(1):1-27, 2015.
[ bib | 
DOI | 
http | 
.pdf | 
Abstract ]
 | 
| 
[27]
 | 
Cassia Valentini-Botinhao, Junichi Yamagishi, and Simon King.
 Intelligibility enhancement of speech in noise.
 In Proceedings of the Institute of Acoustics, volume 36 Pt. 2,
  pages 96-103, Birmingham, UK, October 2014.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[28]
 | 
Thomas Merritt, Tuomo Raitio, and Simon King.
 Investigating source and filter contributions, and their interaction,
  to statistical parametric speech synthesis.
 In Proc. Interspeech, pages 1509-1513, Singapore, September
  2014.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[29]
 | 
Gustav Eje Henter, Thomas Merritt, Matt Shannon, Catherine Mayo, and Simon
  King.
 Measuring the perceptual effects of modelling assumptions in speech
  synthesis using stimuli constructed from repeated natural speech.
 In Proc. Interspeech, volume 15, pages 1504-1508, September
  2014.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[30]
 | 
Oliver Watts, Siva Gangireddy, Junichi Yamagishi, Simon King, Steve Renals,
  Adriana Stan, and Mircea Giurgiu.
 Neural net word representations for phrase-break prediction without a
  part of speech tagger.
 In Proc. ICASSP, pages 2618-2622, Florence, Italy, May 2014.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[31]
 | 
Rasmus Dall, Junichi Yamagishi, and Simon King.
 Rating naturalness in speech synthesis: The effect of style and
  expectation.
 In Proc. Speech Prosody, May 2014.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[32]
 | 
C. Valentini-Botinhao, J. Yamagishi, S. King, and R. Maia.
 Intelligibility enhancement of HMM-generated speech in additive
  noise by modifying mel cepstral coefficients to increase the glimpse
  proportion.
 Computer Speech and Language, 28(2):665-686, 2014.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[33]
 | 
Moses Ekpenyong, Eno-Abasi Urua, Oliver Watts, Simon King, and Junichi
  Yamagishi.
 Statistical parametric speech synthesis for Ibibio.
 Speech Communication, 56:243-251, January 2014.
[ bib | 
DOI | 
http | 
.pdf | 
Abstract ]
 | 
| 
[34]
 | 
P. Lanchantin, M. J. F. Gales, S. King, and J. Yamagishi.
 Multiple-average-voice-based speech synthesis.
 In Proc. ICASSP, 2014.
[ bib | 
Abstract ]
 | 
| 
[35]
 | 
Rasmus Dall, Marcus Tomalin, Mirjam Wester, William Byrne, and Simon King.
 Investigating automatic & human filled pause insertion for speech
  synthesis.
 In Proc. Interspeech, 2014.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[36]
 | 
Herman Kamper, Aren Jansen, Simon King, and S. J. Goldwater.
 Unsupervised lexical clustering of speech segments using
  fixed-dimensional acoustic embeddings.
 In Proc. SLT, 2014.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[37]
 | 
C. Valentini-Botinhao, J. Yamagishi, S. King, and Y. Stylianou.
 Combining perceptually-motivated spectral shaping with loudness and
  duration modification for intelligibility enhancement of HMM-based synthetic
  speech in noise.
 In Proc. Interspeech, Lyon, France, August 2013.
[ bib | 
.pdf ]
 | 
| 
[38]
 | 
Cassia Valentini-Botinhao, Mirjam Wester, Junichi Yamagishi, and Simon King.
 Using neighbourhood density and selective SNR boosting to increase
  the intelligibility of synthetic speech in noise.
 In 8th ISCA Workshop on Speech Synthesis, pages 133-138,
  Barcelona, Spain, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[39]
 | 
Thomas Merritt and Simon King.
 Investigating the shortcomings of HMM synthesis.
 In 8th ISCA Workshop on Speech Synthesis, pages 185-190,
  Barcelona, Spain, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[40]
 | 
Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua
  Ling, Simon King, and Thierry Dutoit.
 Mage - reactive articulatory feature control of HMM-based
  parametric speech synthesis.
 In 8th ISCA Workshop on Speech Synthesis, pages 227-231,
  Barcelona, Spain, August 2013.
[ bib | 
.pdf ]
 | 
| 
[41]
 | 
Adriana Stan, Peter Bell, Junichi Yamagishi, and Simon King.
 Lightly supervised discriminative training of grapheme models for
  improved sentence-level alignment of speech and text data.
 In Proc. Interspeech, Lyon, France, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[42]
 | 
H. Christensen, M. Aniol, P. Bell, P. Green, T. Hain, S. King, and
  P. Swietojanski.
 Combining in-domain and out-of-domain speech data for automatic
  recognition of disordered speech.
 In Proc. Interspeech, Lyon, France, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[43]
 | 
Kayoko Yanagisawa, Javier Latorre, Vincent Wan, Mark J. F. Gales, and Simon
  King.
 Noise robustness in HMM-TTS speaker adaptation.
 In 8th ISCA Workshop on Speech Synthesis, pages 139-144,
  Barcelona, Spain, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[44]
 | 
Rubén San-Segundo, Juan Manuel Montero, Mircea Giurgiu, Ioana Muresan, and
  Simon King.
 Multilingual number transcription for text-to-speech conversion.
 In 8th ISCA Workshop on Speech Synthesis, pages 85-89,
  Barcelona, Spain, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[45]
 | 
Heng Lu, Simon King, and Oliver Watts.
 Combining a vector space representation of linguistic context with a
  deep neural network for text-to-speech synthesis.
 In 8th ISCA Workshop on Speech Synthesis, pages 281-285,
  Barcelona, Spain, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[46]
 | 
Yoshitaka Mamiya, Adriana Stan, Junichi Yamagishi, Peter Bell, Oliver Watts,
  Robert Clark, and Simon King.
 Using adaptation to improve speech transcription alignment in noisy
  and reverberant environments.
 In 8th ISCA Workshop on Speech Synthesis, pages 61-66,
  Barcelona, Spain, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[47]
 | 
Oliver Watts, Adriana Stan, Rob Clark, Yoshitaka Mamiya, Mircea Giurgiu,
  Junichi Yamagishi, and Simon King.
 Unsupervised and lightly-supervised learning for rapid construction
  of TTS systems in multiple languages from 'found' data: evaluation and
  analysis.
 In 8th ISCA Workshop on Speech Synthesis, pages 121-126,
  Barcelona, Spain, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[48]
 | 
Adriana Stan, Oliver Watts, Yoshitaka Mamiya, Mircea Giurgiu, Rob Clark,
  Junichi Yamagishi, and Simon King.
 TUNDRA: A Multilingual Corpus of Found Data for TTS Research Created
  with Light Supervision.
 In Proc. Interspeech, Lyon, France, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[49]
 | 
James Scobbie, Alice Turk, Christian Geng, Simon King, Robin Lickley, and Korin
  Richmond.
 The Edinburgh speech production facility DoubleTalk corpus.
 In Proc. Interspeech, Lyon, France, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[50]
 | 
Maria Astrinaki, Alexis Moinet, Junichi Yamagishi, Korin Richmond, Zhen-Hua
  Ling, Simon King, and Thierry Dutoit.
 Mage - HMM-based speech synthesis reactively controlled by the
  articulators.
 In 8th ISCA Workshop on Speech Synthesis, page 243, Barcelona,
  Spain, August 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[51]
 | 
Chee-Ming Ting, Simon King, Sh-Hussain Salleh, and A. K. Ariff.
 Discriminative tandem features for HMM-based EEG classification.
 In Proc. 35th Annual International Conference of the IEEE
  Engineering in Medicine and Biology Society (EMBC 13), Osaka, Japan, July
  2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[52]
 | 
C. Valentini-Botinhao, E. Godoy, Y. Stylianou, B. Sauert, S. King, and
  J. Yamagishi.
 Improving intelligibility in noise of HMM-generated speech via
  noise-dependent and -independent methods.
 In Proc. ICASSP, Vancouver, Canada, May 2013.
[ bib | 
.pdf ]
 | 
| 
[53]
 | 
H. Lu and S. King.
 Factorized context modelling for text-to-speech synthesis.
 In Proc. ICASSP, Vancouver, Canada, May 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[54]
 | 
Mark Sinclair and Simon King.
 Where are the challenges in speaker diarization?
 In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE
  International Conference on, Vancouver, British Columbia, Canada, May 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[55]
 | 
John Dines, Hui Liang, Lakshmi Saheer, Matthew Gibson, William Byrne, Keiichiro
  Oura, Keiichi Tokuda, Junichi Yamagishi, Simon King, Mirjam Wester, Teemu
  Hirsimäki, Reima Karhila, and Mikko Kurimo.
 Personalising speech-to-speech translation: Unsupervised
  cross-lingual speaker adaptation for HMM-based speech synthesis.
 Computer Speech and Language, 27(2):420-437, February 2013.
[ bib | 
DOI | 
http | 
Abstract ]
 | 
| 
[56]
 | 
Javier Tejedor, Doroteo T. Toledano, Dong Wang, Simon King, and Jose Colas.
 Feature analysis for discriminative confidence estimation in spoken
  term detection.
 Computer Speech and Language, To appear, 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[57]
 | 
P. Lal and S. King.
 Cross-lingual automatic speech recognition using tandem features.
 IEEE Transactions on Audio, Speech, and Language Processing, To
  appear, 2013.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[58]
 | 
Yoshitaka Mamiya, Junichi Yamagishi, Oliver Watts, Robert A.J. Clark, Simon
  King, and Adriana Stan.
 Lightly supervised GMM VAD to use audiobook for speech synthesiser.
 In Proc. ICASSP, 2013.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[59]
 | 
Christian Geng, Alice Turk, James M. Scobbie, Cedric Macmartin, Philip Hoole,
  Korin Richmond, Alan Wrench, Marianne Pouplier, Ellen Gurman Bard, Ziggy
  Campbell, Catherine Dickie, Eddie Dubourg, William Hardcastle, Evia Kainada,
  Simon King, Robin Lickley, Satsuki Nakai, Steve Renals, Kevin White, and
  Ronny Wiegand.
 Recording speech articulation in dialogue: Evaluating a synchronized
  double electromagnetic articulography setup.
 Journal of Phonetics, 41(6):421-431, 2013.
[ bib | 
DOI | 
http | 
.pdf | 
Abstract ]
 | 
| 
[60]
 | 
Adriana Stan, Peter Bell, and Simon King.
 A grapheme-based method for automatic alignment of speech and text
  data.
 In Proc. IEEE Workshop on Spoken Language Technology, Miami,
  Florida, USA, December 2012.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[61]
 | 
Heng Lu and Simon King.
 Using Bayesian networks to find relevant context features for
  HMM-based speech synthesis.
 In Proc. Interspeech, Portland, Oregon, USA, September 2012.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[62]
 | 
Rasmus Dall, Christophe Veaux, Junichi Yamagishi, and Simon King.
 Analysis of speaker clustering techniques for HMM-based speech
  synthesis.
 In Proc. Interspeech, September 2012.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[63]
 | 
C. Valentini-Botinhao, J. Yamagishi, and S. King.
 Evaluating speech intelligibility enhancement for HMM-based
  synthetic speech in noise.
 In Proc. Sapa Workshop, Portland, USA, September 2012.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[64]
 | 
Ruben San-Segundo, Juan M. Montero, Veronica Lopez-Luden, and Simon King.
 Detecting acronyms from capital letter sequences in Spanish.
 In Proc. Interspeech, Portland, Oregon, USA, September 2012.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[65]
 | 
C. Valentini-Botinhao, J. Yamagishi, and S. King.
 Mel cepstral coefficient modification based on the Glimpse
  Proportion measure for improving the intelligibility of HMM-generated
  synthetic speech in noise.
 In Proc. Interspeech, Portland, USA, September 2012.
[ bib | 
Abstract ]
 | 
| 
[66]
 | 
C. Valentini-Botinhao, J. Yamagishi, and S. King.
 Using an intelligibility measure to create noise robust cepstral
  coefficients for HMM-based speech synthesis.
 In Proc. LISTA Workshop, Edinburgh, UK, May 2012.
[ bib | 
.pdf ]
 | 
| 
[67]
 | 
C. Valentini-Botinhao, R. Maia, J. Yamagishi, S. King, and H. Zen.
 Cepstral analysis based on the Glimpse proportion measure for
  improving the intelligibility of HMM-based synthetic speech in noise.
 In Proc. ICASSP, pages 3997-4000, Kyoto, Japan, March 2012.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[68]
 | 
Chen-Yu Yang, G. Brown, Liang Lu, J. Yamagishi, and S. King.
 Noise-robust whispered speech recognition using a non-audible-murmur
  microphone with VTS compensation.
 In Chinese Spoken Language Processing (ISCSLP), 2012 8th
  International Symposium on, pages 220-223, 2012.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[69]
 | 
Jaime Lorenzo-Trueba, Oliver Watts, Roberto Barra-Chicote, Junichi Yamagishi,
  Simon King, and Juan M Montero.
 Simple4All proposals for the Albayzin evaluations in speech
  synthesis.
 In Proc. Iberspeech 2012, 2012.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[70]
 | 
Dong Wang, Javier Tejedor, Simon King, and Joe Frankel.
 Term-dependent confidence normalization for out-of-vocabulary spoken
  term detection.
 Journal of Computer Science and Technology, 27(2), 2012.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[71]
 | 
Keiichiro Oura, Junichi Yamagishi, Mirjam Wester, Simon King, and Keiichi
  Tokuda.
 Analysis of unsupervised cross-lingual speaker adaptation for
  HMM-based speech synthesis using KLD-based transform mapping.
 Speech Communication, 54(6):703-714, 2012.
[ bib | 
DOI | 
http | 
Abstract ]
 | 
| 
[72]
 | 
Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, and Keiichi
  Tokuda.
 Impacts of machine translation and speech synthesis on
  speech-to-speech translation.
 Speech Communication, 54(7):857-866, 2012.
[ bib | 
DOI | 
http | 
Abstract ]
 | 
| 
[73]
 | 
Junichi Yamagishi, Christophe Veaux, Simon King, and Steve Renals.
 Speech synthesis technologies for individuals with vocal
  disabilities: Voice banking and reconstruction.
 Acoustical Science and Technology, 33(1):1-5, 2012.
[ bib | 
DOI | 
http | 
.pdf | 
Abstract ]
 | 
| 
[74]
 | 
Oliver Watts, Junichi Yamagishi, and Simon King.
 Unsupervised continuous-valued word features for phrase-break
  prediction without a part-of-speech tagger.
 In Proc. Interspeech, pages 2157-2160, Florence, Italy, August
  2011.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[75]
 | 
Cassia Valentini-Botinhao, Junichi Yamagishi, and Simon King.
 Can objective measures predict the intelligibility of modified
  HMM-based synthetic speech in noise?
 In Proc. Interspeech, August 2011.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[76]
 | 
Korin Richmond, Phil Hoole, and Simon King.
 Announcing the electromagnetic articulography (day 1) subset of the
  mngu0 articulatory corpus.
 In Proc. Interspeech, pages 1505-1508, Florence, Italy, August
  2011.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[77]
 | 
Ming Lei, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, and
  Li-Rong Dai.
 Formant-controlled HMM-based speech synthesis.
 In Proc. Interspeech, pages 2777-2780, Florence, Italy, August
  2011.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[78]
 | 
S. Andraszewicz, J. Yamagishi, and S. King.
 Vocal attractiveness of statistical speech synthesisers.
 In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE
  International Conference on, pages 5368-5371, May 2011.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[79]
 | 
Cassia Valentini-Botinhao, Junichi Yamagishi, and Simon King.
 Evaluation of objective measures for intelligibility prediction of
  HMM-based synthetic speech in noise.
 In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE
  International Conference on, pages 5112-5115, May 2011.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[80]
 | 
K. Hashimoto, J. Yamagishi, W. Byrne, S. King, and K. Tokuda.
 An analysis of machine translation and speech synthesis in
  speech-to-speech translation system.
 In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE
  International Conference on, pages 5108-5111, May 2011.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[81]
 | 
Dong Wang, Nicholas Evans, Raphael Troncy, and Simon King.
 Handling overlaps in spoken term detection.
 In Proc. International Conference on Acoustics, Speech and
  Signal Processing, pages 5656-5659, May 2011.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[82]
 | 
Dong Wang and Simon King.
 Letter-to-sound pronunciation prediction using conditional random
  fields.
 IEEE Signal Processing Letters, 18(2):122-125, February 2011.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[83]
 | 
J. Dines, J. Yamagishi, and S. King.
 Measuring the gap between HMM-based ASR and TTS.
 IEEE Selected Topics in Signal Processing, 2011.
 (in press).
[ bib | 
DOI | 
Abstract ]
 | 
| 
[84]
 | 
Adriana Stan, Junichi Yamagishi, Simon King, and Matthew Aylett.
 The Romanian speech synthesis (RSS) corpus: Building a high
  quality HMM-based speech synthesis system using a high sampling rate.
 Speech Communication, 53(3):442-450, 2011.
[ bib | 
DOI | 
http | 
Abstract ]
 | 
| 
[85]
 | 
C. Mayo, R. A. J. Clark, and S. King.
 Listeners' weighting of acoustic cues to synthetic speech
  naturalness: A multidimensional scaling analysis.
 Speech Communication, 53(3):311-326, 2011.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[86]
 | 
Dong Wang, Simon King, Nick Evans, and Raphael Troncy.
 Direct posterior confidence for out-of-vocabulary spoken term
  detection.
 In Proc. ACM Multimedia 2010 Searching Spontaneous
  Conversational Speech Workshop, October 2010.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[87]
 | 
Dong Wang, Simon King, Nick Evans, and Raphael Troncy.
 CRF-based stochastic pronunciation modelling for out-of-vocabulary
  spoken term detection.
 In Proc. Interspeech, Makuhari, Chiba, Japan, September 2010.
[ bib | 
Abstract ]
 | 
| 
[88]
 | 
Oliver Watts, Junichi Yamagishi, and Simon King.
 The role of higher-level linguistic features in HMM-based speech
  synthesis.
 In Proc. Interspeech, pages 841-844, Makuhari, Japan,
  September 2010.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[89]
 | 
Junichi Yamagishi, Oliver Watts, Simon King, and Bela Usabaev.
 Roles of the average voice in speaker-adaptive HMM-based speech
  synthesis.
 In Proc. Interspeech, pages 418-421, Makuhari, Japan,
  September 2010.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[90]
 | 
Mirjam Wester, John Dines, Matthew Gibson, Hui Liang, Yi-Jian Wu, Lakshmi
  Saheer, Simon King, Keiichiro Oura, Philip N. Garner, William Byrne, Yong
  Guan, Teemu Hirsimäki, Reima Karhila, Mikko Kurimo, Matt Shannon, Sayaka
  Shiota, Jilei Tian, Keiichi Tokuda, and Junichi Yamagishi.
 Speaker adaptation and the evaluation of speaker similarity in the
  EMIME speech-to-speech translation project.
 In Proc. 7th ISCA Speech Synthesis Workshop, Kyoto, Japan,
  September 2010.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[91]
 | 
Javier Tejedor, Doroteo T. Toledano, Miguel Bautista, Simon King, Dong Wang,
  and Jose Colas.
 Augmented set of features for confidence estimation in spoken term
  detection.
 In Proc. Interspeech, September 2010.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[92]
 | 
Oliver Watts, Junichi Yamagishi, and Simon King.
 Letter-based speech synthesis.
 In Proc. Speech Synthesis Workshop 2010, pages 317-322, Nara,
  Japan, September 2010.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[93]
 | 
O. Watts, J. Yamagishi, S. King, and K. Berkling.
 Synthesis of child speech with HMM adaptation and voice conversion.
 Audio, Speech, and Language Processing, IEEE Transactions on,
  18(5):1005-1016, July 2010.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[94]
 | 
Alice Turk, James Scobbie, Christian Geng, Barry Campbell, Catherine Dickie,
  Eddie Dubourg, Ellen Gurman Bard, William Hardcastle, Mariam Hartinger, Simon
  King, Robin Lickley, Cedric Macmartin, Satsuki Nakai, Steve Renals, Korin
  Richmond, Sonja Schaeffler, Kevin White, Ronny Wiegand, and Alan Wrench.
 An Edinburgh speech production facility.
 Poster presented at the 12th Conference on Laboratory Phonology,
  Albuquerque, New Mexico, July 2010.
[ bib | 
.pdf ]
 | 
| 
[95]
 | 
D. Wang, S. King, and J. Frankel.
 Stochastic pronunciation modelling for out-of-vocabulary spoken term
  detection.
 Audio, Speech, and Language Processing, IEEE Transactions on,
  PP(99), July 2010.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[96]
 | 
Mikko Kurimo, William Byrne, John Dines, Philip N. Garner, Matthew Gibson, Yong
  Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro
  Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi
  Tokuda, Mirjam Wester, Yi-Jian Wu, and Junichi Yamagishi.
 Personalising speech-to-speech translation in the EMIME project.
 In Proc. ACL 2010 System Demonstrations, Uppsala, Sweden, July
  2010.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[97]
 | 
J. Yamagishi, B. Usabaev, S. King, O. Watts, J. Dines, J. Tian, R. Hu, Y. Guan,
  K. Oura, K. Tokuda, R. Karhila, and M. Kurimo.
 Thousands of voices for HMM-based speech synthesis - analysis and
  application of TTS systems built on various ASR corpora.
 IEEE Transactions on Audio, Speech and Language Processing,
  18(5):984-1004, July 2010.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[98]
 | 
R. Barra-Chicote, J. Yamagishi, S. King, J. Manuel Montero, and
  J. Macias-Guarasa.
 Analysis of statistical parametric and unit-selection speech
  synthesis systems applied to emotional speech.
 Speech Communication, 52(5):394-404, May 2010.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[99]
 | 
Dong Wang, Simon King, Joe Frankel, and Peter Bell.
 Stochastic pronunciation modelling and soft match for
  out-of-vocabulary spoken term detection.
 In Proc. ICASSP, Dallas, Texas, USA, March 2010.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[100]
 | 
Simon King.
 Speech synthesis.
 In Morgan and Ellis, editors, Speech and Audio Signal
  Processing. Wiley, 2010.
[ bib | 
Abstract ]
 | 
| 
[101]
 | 
Steve Renals and Simon King.
 Automatic speech recognition.
 In William J. Hardcastle, John Laver, and Fiona E. Gibbon, editors,
  Handbook of Phonetic Sciences, chapter 22. Wiley Blackwell, 2010.
[ bib ]
 | 
| 
[102]
 | 
Keiichiro Oura, Keiichi Tokuda, Junichi Yamagishi, Mirjam Wester, and Simon
  King.
 Unsupervised cross-lingual speaker adaptation for HMM-based speech
  synthesis.
 In Proc. ICASSP, volume I, pages 4954-4957, 2010.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[103]
 | 
Volker Strom and Simon King.
 A classifier-based target cost for unit selection speech synthesis
  trained on perceptual data.
 In Proc. Interspeech, Makuhari, Japan, 2010.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[104]
 | 
Alice Turk, James Scobbie, Christian Geng, Cedric Macmartin, Ellen Bard, Barry
  Campbell, Catherine Dickie, Eddie Dubourg, Bill Hardcastle, Phil Hoole, Evia
  Kainada, Robin Lickley, Satsuki Nakai, Marianne Pouplier, Simon King, Steve
  Renals, Korin Richmond, Sonja Schaeffler, Ronnie Wiegand, Kevin White, and
  Alan Wrench.
 The Edinburgh Speech Production Facility's articulatory corpus of
  spontaneous dialogue.
 The Journal of the Acoustical Society of America,
  128(4):2429-2429, 2010.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[105]
 | 
J. Yamagishi and S. King.
 Simple methods for improving speaker-similarity of HMM-based speech
  synthesis.
 In Proc. ICASSP 2010, Dallas, Texas, USA, 2010.
[ bib | 
.pdf ]
 | 
| 
[106]
 | 
Simon King.
 A tutorial on HMM speech synthesis (invited paper).
 In Sadhana - Academy Proceedings in Engineering Sciences,
  Indian Institute of Sciences, 2010.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[107]
 | 
Peter Bell and Simon King.
 Diagonal priors for full covariance speech recognition.
 In Proc. IEEE Workshop on Automatic Speech Recognition and
  Understanding, Merano, Italy, December 2009.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[108]
 | 
Dong Wang, Simon King, and Joe Frankel.
 Stochastic pronunciation modelling for spoken term detection.
 In Proc. Interspeech, pages 2135-2138, Brighton, UK, September
  2009.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[109]
 | 
Oliver Watts, Junichi Yamagishi, Simon King, and Kay Berkling.
 HMM adaptation and voice conversion for the synthesis of child
  speech: A comparison.
 In Proc. Interspeech 2009, pages 2627-2630, Brighton, U.K.,
  September 2009.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[110]
 | 
Simon King and Vasilis Karaiskos.
 The Blizzard Challenge 2009.
 In Proc. Blizzard Challenge Workshop, Edinburgh, UK, September
  2009.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[111]
 | 
Dong Wang, Simon King, Joe Frankel, and Peter Bell.
 Term-dependent confidence for out-of-vocabulary term detection.
 In Proc. Interspeech, pages 2139-2142, Brighton, UK, September
  2009.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[112]
 | 
Junichi Yamagishi, Mike Lincoln, Simon King, John Dines, Matthew Gibson, Jilei
  Tian, and Yong Guan.
 Analysis of unsupervised and noise-robust speaker-adaptive
  HMM-based speech synthesis systems toward a unified ASR and TTS
  framework.
 In Proc. Interspeech 2009, Brighton, U.K., September 2009.
[ bib | 
Abstract ]
 | 
| 
[113]
 | 
J. Dines, J. Yamagishi, and S. King.
 Measuring the gap between HMM-based ASR and TTS.
 In Proc. Interspeech, pages 1391-1394, Brighton, U.K.,
  September 2009.
[ bib | 
Abstract ]
 | 
| 
[114]
 | 
Javier Tejedor, Dong Wang, Simon King, Joe Frankel, and Jose Colas.
 A posterior probability-based system hybridisation and combination
  for spoken term detection.
 In Proc. Interspeech, pages 2131-2134, Brighton, UK, September
  2009.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[115]
 | 
J. Yamagishi, Bela Usabaev, Simon King, Oliver Watts, John Dines, Jilei Tian,
  Rile Hu, Yong Guan, Keiichiro Oura, Keiichi Tokuda, Reima Karhila, and Mikko
  Kurimo.
 Thousands of voices for HMM-based speech synthesis.
 In Proc. Interspeech, pages 420-423, Brighton, U.K., September
  2009.
[ bib | 
http | 
Abstract ]
 | 
| 
[116]
 | 
Dong Wang, Javier Tejedor, Joe Frankel, and Simon King.
 Posterior-based confidence measures for spoken term detection.
 In Proc. ICASSP09, Taiwan, April 2009.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[117]
 | 
Matthew P. Aylett, Simon King, and Junichi Yamagishi.
 Speech synthesis without a phone inventory.
 In Interspeech, pages 2087-2090, 2009.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[118]
 | 
Junichi Yamagishi, Takashi Nose, Heiga Zen, Zhenhua Ling, Tomoki Toda, Keiichi
  Tokuda, Simon King, and Steve Renals.
 Robust speaker-adaptive HMM-based text-to-speech synthesis.
 IEEE Transactions on Audio, Speech and Language Processing,
  17(6):1208-1230, 2009.
[ bib | 
http | 
www: | 
Abstract ]
 | 
| 
[119]
 | 
R. Barra-Chicote, J. Yamagishi, J.M. Montero, S. King, S. Lutfi, and
  J. Macias-Guarasa.
 Generacion de una voz sintetica en Castellano basada en HSMM para
  la Evaluacion Albayzin 2008: conversion texto a voz.
 In V Jornadas en Tecnologia del Habla, pages 115-118, November
  2008.
 (in Spanish).
[ bib | 
.pdf ]
 | 
| 
[120]
 | 
Javier Tejedor, Dong Wang, Joe Frankel, Simon King, and José Colás.
 A comparison of grapheme and phoneme-based units for Spanish spoken
  term detection.
 Speech Communication, 50(11-12):980-991, November 2008.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[121]
 | 
Oliver Watts, Junichi Yamagishi, Kay Berkling, and Simon King.
 HMM-based synthesis of child speech.
 In Proc. 1st Workshop on Child, Computer and Interaction
  (ICMI'08 post-conference workshop), Crete, Greece, October 2008.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[122]
 | 
Peter Bell and Simon King.
 A shrinkage estimator for speech recognition with full covariance
  HMMs.
 In Proc. Interspeech, Brisbane, Australia, September 2008.
 Shortlisted for best student paper award.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[123]
 | 
Junichi Yamagishi, Zhenhua Ling, and Simon King.
 Robustness of HMM-based speech synthesis.
 In Proc. Interspeech 2008, pages 581-584, Brisbane, Australia,
  September 2008.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[124]
 | 
Dong Wang, Ivan Himawan, Joe Frankel, and Simon King.
 A posterior approach for microphone array based speech recognition.
 In Proc. Interspeech, pages 996-999, September 2008.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[125]
 | 
Joe Frankel, Dong Wang, and Simon King.
 Growing bottleneck features for tandem ASR.
 In Proc. Interspeech, page 1549, September 2008.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[126]
 | 
Simon King, Keiichi Tokuda, Heiga Zen, and Junichi Yamagishi.
 Unsupervised adaptation for HMM-based speech synthesis.
 In Proc. Interspeech, pages 1869-1872, Brisbane, Australia,
  September 2008.
[ bib | 
.PDF | 
Abstract ]
 | 
| 
[127]
 | 
Laszlo Toth, Joe Frankel, Gabor Gosztolya, and Simon King.
 Cross-lingual portability of MLP-based tandem features - a case
  study for English and Hungarian.
 In Proc. Interspeech, pages 2695-2698, Brisbane, Australia,
  September 2008.
[ bib | 
.PDF | 
Abstract ]
 | 
| 
[128]
 | 
Vasilis Karaiskos, Simon King, Robert A. J. Clark, and Catherine Mayo.
 The Blizzard Challenge 2008.
 In Proc. Blizzard Challenge Workshop, Brisbane, Australia,
  September 2008.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[129]
 | 
Peter Bell and Simon King.
 Covariance updates for discriminative training by constrained line
  search.
 In Proc. Interspeech, Brisbane, Australia, September 2008.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[130]
 | 
Olga Goubanova and Simon King.
 Bayesian networks for phone duration prediction.
 Speech Communication, 50(4):301-311, April 2008.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[131]
 | 
Dong Wang, Joe Frankel, Javier Tejedor, and Simon King.
 A comparison of phone and grapheme-based spoken term detection.
 In Proc. ICASSP, pages 4969-4972, March 2008.
[ bib | 
DOI | 
Abstract ]
 | 
| 
[132]
 | 
Matthew P. Aylett and Simon King.
 Single speaker segmentation and inventory selection using dynamic
  time warping self organization and joint multigram mapping.
 In SSW06, pages 258-263, 2008.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[133]
 | 
Volker Strom and Simon King.
 Investigating Festival's target cost function using perceptual
  experiments.
 In Proc. Interspeech, Brisbane, 2008.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[134]
 | 
J. Frankel and S. King.
 Factoring Gaussian precision matrices for linear dynamic models.
 Pattern Recognition Letters, 28(16):2264-2272, December 2007.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[135]
 | 
Ö. Çetin, M. Magimai-Doss, A. Kantor, S. King, C. Bartels, J. Frankel, and
  K. Livescu.
 Monolingual and crosslingual comparison of tandem features derived
  from articulatory and phone MLPs.
 In Proc. ASRU, Kyoto, December 2007. IEEE.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[136]
 | 
J. Frankel, M. Wester, and S. King.
 Articulatory feature recognition using dynamic Bayesian networks.
 Computer Speech & Language, 21(4):620-640, October 2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[137]
 | 
J. Frankel, M. Magimai-Doss, S. King, K. Livescu, and Ö. Çetin.
 Articulatory feature classifiers trained on 2000 hours of telephone
  speech.
 In Proc. Interspeech, Antwerp, Belgium, August 2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[138]
 | 
Junichi Yamagishi, Takao Kobayashi, Steve Renals, Simon King, Heiga Zen, Tomoki
  Toda, and Keiichi Tokuda.
 Improved average-voice-based speech synthesis using gender-mixed
  modeling and a parameter generation algorithm considering GV.
 In Proc. 6th ISCA Workshop on Speech Synthesis (SSW-6), August
  2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[139]
 | 
Robert A. J. Clark, Monika Podsiadlo, Mark Fraser, Catherine Mayo, and Simon
  King.
 Statistical analysis of the Blizzard Challenge 2007 listening
  test results.
 In Proc. Blizzard 2007 (in Proc. Sixth ISCA Workshop on Speech
  Synthesis), Bonn, Germany, August 2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[140]
 | 
Mark Fraser and Simon King.
 The Blizzard Challenge 2007.
 In Proc. Blizzard 2007 (in Proc. Sixth ISCA Workshop on Speech
  Synthesis), Bonn, Germany, August 2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[141]
 | 
Volker Strom, Ani Nenkova, Robert Clark, Yolanda Vazquez-Alvarez, Jason
  Brenier, Simon King, and Dan Jurafsky.
 Modelling prominence and emphasis improves unit-selection synthesis.
 In Proc. Interspeech 2007, Antwerp, Belgium, August 2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[142]
 | 
Peter Bell and Simon King.
 Sparse Gaussian graphical models for speech recognition.
 In Proc. Interspeech 2007, Antwerp, Belgium, August 2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[143]
 | 
Ö. Çetin, A. Kantor, S. King, C. Bartels, M. Magimai-Doss, J. Frankel, and
  K. Livescu.
 An articulatory feature-based tandem approach and factored
  observation modeling.
 In Proc. ICASSP, Honolulu, April 2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[144]
 | 
K. Livescu, Ö. Çetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges,
  A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel,
  M. Magimai-Doss, and K. Saenko.
 Articulatory feature-based methods for acoustic and audio-visual
  speech recognition: Summary from the 2006 JHU Summer Workshop.
 In Proc. ICASSP, Honolulu, April 2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[145]
 | 
K. Livescu, A. Bezman, N. Borges, L. Yung, Ö. Çetin, J. Frankel, S. King,
  M. Magimai-Doss, X. Chi, and L. Lavoie.
 Manual transcription of conversational speech at the articulatory
  feature level.
 In Proc. ICASSP, Honolulu, April 2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[146]
 | 
S. King, J. Frankel, K. Livescu, E. McDermott, K. Richmond, and M. Wester.
 Speech production knowledge in automatic speech recognition.
 Journal of the Acoustical Society of America, 121(2):723-742,
  February 2007.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[147]
 | 
J. Frankel and S. King.
 Speech recognition using linear dynamic models.
 IEEE Transactions on Speech and Audio Processing,
  15(1):246-256, January 2007.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[148]
 | 
Robert A. J. Clark, Korin Richmond, and Simon King.
 Multisyn: Open-domain unit selection for the Festival speech
  synthesis system.
 Speech Communication, 49(4):317-330, 2007.
[ bib | 
DOI | 
.pdf | 
Abstract ]
 | 
| 
[149]
 | 
Jithendra Vepa and Simon King.
 Subjective evaluation of join cost and smoothing methods for unit
  selection speech synthesis.
 IEEE Transactions on Speech and Audio Processing,
  14(5):1763-1771, September 2006.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[150]
 | 
J. Frankel and S. King.
 Observation process adaptation for linear dynamic models.
 Speech Communication, 48(9):1192-1199, September 2006.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[151]
 | 
R. Clark, K. Richmond, V. Strom, and S. King.
 Multisyn voices for the Blizzard Challenge 2006.
 In Proc. Blizzard Challenge Workshop (Interspeech Satellite),
  Pittsburgh, USA, September 2006.
 (http://festvox.org/blizzard/blizzard2006.html).
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[152]
 | 
Robert A. J. Clark and Simon King.
 Joint prosodic and segmental unit selection speech synthesis.
 In Proc. Interspeech 2006, Pittsburgh, USA, September 2006.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[153]
 | 
Simon King.
 Handling variation in speech and language processing.
 In Keith Brown, editor, Encyclopedia of Language and
  Linguistics. Elsevier, 2nd edition, 2006.
[ bib ]
 | 
| 
[154]
 | 
Simon King.
 Language variation in speech technologies.
 In Keith Brown, editor, Encyclopedia of Language and
  Linguistics. Elsevier, 2nd edition, 2006.
[ bib ]
 | 
| 
[155]
 | 
Volker Strom, Robert Clark, and Simon King.
 Expressive prosody for unit-selection speech synthesis.
 In Proc. Interspeech, Pittsburgh, 2006.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[156]
 | 
Robert A.J. Clark, Korin Richmond, and Simon King.
 Multisyn voices from ARCTIC data for the Blizzard Challenge.
 In Proc. Interspeech 2005, September 2005.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[157]
 | 
C. Mayo, R. A. J. Clark, and S. King.
 Multidimensional scaling of listener responses to synthetic speech.
 In Proc. Interspeech 2005, Lisbon, Portugal, September 2005.
[ bib | 
.pdf ]
 | 
| 
[158]
 | 
J. Frankel and S. King.
 A hybrid ANN/DBN approach to articulatory feature recognition.
 In Proc. Eurospeech, Lisbon, September 2005.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[159]
 | 
Alexander Gutkin and Simon King.
 Inductive String Template-Based Learning of Spoken
  Language.
 In Hugo Gamboa and Ana Fred, editors, Proc. 5th International
  Workshop on Pattern Recognition in Information Systems (PRIS-2005), In
  conjunction with the 7th International Conference on Enterprise Information
  Systems (ICEIS-2005), pages 43-51, Miami, USA, May 2005. INSTICC Press.
[ bib | 
.ps.gz | 
.pdf | 
Abstract ]
 | 
| 
[160]
 | 
Alexander Gutkin and Simon King.
 Detection of Symbolic Gestural Events in Articulatory
  Data for Use in Structural Representations of Continuous Speech.
 In Proc. IEEE International Conference on Acoustics, Speech, and
  Signal Processing (ICASSP-05), volume I, pages 885-888, Philadelphia, PA,
  USA, March 2005. IEEE Signal Processing Society Press.
[ bib | 
.ps.gz | 
.pdf | 
Abstract ]
 | 
| 
[161]
 | 
Simon King, Chris Bartels, and Jeff Bilmes.
 SVitchboard 1: Small vocabulary tasks from Switchboard 1.
 In Proc. Interspeech 2005, Lisbon, Portugal, 2005.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[162]
 | 
Olga Goubanova and Simon King.
 Predicting consonant duration with Bayesian belief networks.
 In Proc. Interspeech 2005, Lisbon, Portugal, 2005.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[163]
 | 
M. Wester, J. Frankel, and S. King.
 Asynchronous articulatory feature recognition using dynamic
  Bayesian networks.
 In Proc. IEICE Beyond HMM Workshop, Kyoto, December 2004.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[164]
 | 
Yoshinori Shiga and Simon King.
 Source-filter separation for articulation-to-speech synthesis.
 In Proc. ICSLP, Jeju, Korea, October 2004.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[165]
 | 
Jithendra Vepa and Simon King.
 Subjective evaluation of join cost functions used in unit selection
  speech synthesis.
 In Proc. 8th International Conference on Spoken Language
  Processing (ICSLP), Jeju, Korea, October 2004.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[166]
 | 
Yoshinori Shiga and Simon King.
 Estimating detailed spectral envelopes using articulatory clustering.
 In Proc. ICSLP, Jeju, Korea, October 2004.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[167]
 | 
Alexander Gutkin and Simon King.
 Phone classification in pseudo-Euclidean vector spaces.
 In Proc. 8th International Conference on Spoken Language
  Processing (ICSLP), volume II, pages 1453-1457, Jeju Island, Korea, October
  2004.
[ bib | 
.ps.gz | 
.pdf | 
Abstract ]
 | 
| 
[168]
 | 
J. Frankel, M. Wester, and S. King.
 Articulatory feature recognition using dynamic Bayesian networks.
 In Proc. ICSLP, September 2004.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[169]
 | 
Alexander Gutkin and Simon King.
 Structural Representation of Speech for Phonetic
  Classification.
 In Proc. 17th International Conference on Pattern Recognition
  (ICPR), volume 3, pages 438-441, Cambridge, UK, August 2004. IEEE Computer
  Society Press.
[ bib | 
.ps.gz | 
.pdf | 
Abstract ]
 | 
| 
[170]
 | 
J. Vepa and S. King.
 Subjective evaluation of join cost and smoothing methods.
 In Proc. 5th ISCA Speech Synthesis Workshop, Pittsburgh, USA,
  June 2004.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[171]
 | 
Yoshinori Shiga and Simon King.
 Accurate spectral envelope estimation for articulation-to-speech
  synthesis.
 In Proc. 5th ISCA Speech Synthesis Workshop, pages 19-24, CMU,
  Pittsburgh, USA, June 2004.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[172]
 | 
Jithendra Vepa and Simon King.
 Join cost for unit selection speech synthesis.
 In Abeer Alwan and Shri Narayanan, editors, Speech Synthesis.
  Prentice Hall, 2004.
[ bib | 
.ps ]
 | 
| 
[173]
 | 
Robert A.J. Clark, Korin Richmond, and Simon King.
 Festival 2 - build your own general purpose unit selection speech
  synthesiser.
 In Proc. 5th ISCA Speech Synthesis Workshop, 2004.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[174]
 | 
Ben Gillett and Simon King.
 Transforming F0 contours.
 In Proc. Eurospeech, Geneva, September 2003.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[175]
 | 
Yoshinori Shiga and Simon King.
 Estimating the spectral envelope of voiced speech using multi-frame
  analysis.
 In Proc. Eurospeech-2003, volume 3, pages 1737-1740, Geneva,
  Switzerland, September 2003.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[176]
 | 
James Horlock and Simon King.
 Named entity extraction from word lattices.
 In Proc. Eurospeech, Geneva, September 2003.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[177]
 | 
James Horlock and Simon King.
 Discriminative methods for improving named entity extraction on
  speech data.
 In Proc. Eurospeech, Geneva, September 2003.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[178]
 | 
Ben Gillett and Simon King.
 Transforming voice quality.
 In Proc. Eurospeech, Geneva, September 2003.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[179]
 | 
Yoshinori Shiga and Simon King.
 Estimation of voice source and vocal tract characteristics based on
  multi-frame analysis.
 In Proc. Eurospeech, volume 3, pages 1749-1752, Geneva,
  Switzerland, September 2003.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[180]
 | 
K. Richmond, S. King, and P. Taylor.
 Modelling the uncertainty in recovering articulation from acoustics.
 Computer Speech and Language, 17:153-172, 2003.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[181]
 | 
Christophe Van Bael and Simon King.
 An accent-independent lexicon for automatic speech recognition.
 In Proc. ICPhS, pages 1165-1168, 2003.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[182]
 | 
J. Vepa and S. King.
 Kalman-filter based join cost for unit-selection speech synthesis.
 In Proc. Eurospeech, Geneva, Switzerland, 2003.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[183]
 | 
Simon King.
 Dependence and independence in automatic speech recognition and
  synthesis.
 Journal of Phonetics, 31(3-4):407-411, 2003.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[184]
 | 
J. Vepa, S. King, and P. Taylor.
 Objective distance measures for spectral discontinuities in
  concatenative speech synthesis.
 In Proc. ICSLP, Denver, USA, September 2002.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[185]
 | 
J. Vepa, S. King, and P. Taylor.
 New objective distance measures for spectral discontinuities in
  concatenative speech synthesis.
 In Proc. IEEE 2002 workshop on speech synthesis, Santa
  Monica, USA, September 2002.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[186]
 | 
Jesper Salomon, Simon King, and Miles Osborne.
 Framewise phone classification using support vector machines.
 In Proceedings International Conference on Spoken Language
  Processing, Denver, 2002.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[187]
 | 
J. Frankel and S. King.
 ASR - articulatory speech recognition.
 In Proc. Eurospeech, pages 599-602, Aalborg, Denmark,
  September 2001.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[188]
 | 
J. Frankel and S. King.
 Speech recognition in the articulatory domain: investigating an
  alternative to acoustic HMMs.
 In Proc. Workshop on Innovations in Speech Processing, April
  2001.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[189]
 | 
J. Frankel, K. Richmond, S. King, and P. Taylor.
 An automatic speech recognition system using neural networks and
  linear dynamic models to recover and model articulatory traces.
 In Proc. ICSLP, 2000.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[190]
 | 
S. King, P. Taylor, J. Frankel, and K. Richmond.
 Speech recognition via phonetically-featured syllables.
 In PHONUS, volume 5, pages 15-34, Institute of Phonetics,
  University of the Saarland, 2000.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[191]
 | 
Simon King and Paul Taylor.
 Detection of phonological features in continuous speech using neural
  networks.
 Computer Speech and Language, 14(4):333-353, 2000.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[192]
 | 
Simon King and Alan Wrench.
 Dynamical system modelling of articulator movement.
 In Proc. ICPhS 99, pages 2259-2262, San Francisco, August
  1999.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[193]
 | 
Simon King, Todd Stephenson, Stephen Isard, Paul Taylor, and Alex Strachan.
 Speech recognition via phonetically featured syllables.
 In Proc. ICSLP '98, pages 1031-1034, Sydney, Australia,
  December 1998.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[194]
 | 
Paul A. Taylor, S. King, S. D. Isard, and H. Wright.
 Intonation and dialogue context as constraints for speech
  recognition.
 Language and Speech, 41(3):493-512, 1998.
[ bib | 
.ps | 
.pdf ]
 | 
| 
[195]
 | 
Simon King.
 Using Information Above the Word Level for Automatic Speech
  Recognition.
 PhD thesis, University of Edinburgh, 1998.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[196]
 | 
Simon King, Thomas Portele, and Florian Höfer.
 Speech synthesis using non-uniform units in the Verbmobil project.
 In Proc. Eurospeech 97, volume 2, pages 569-572, Rhodes,
  Greece, September 1997.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[197]
 | 
Simon King.
 Final report for Verbmobil Teilprojekt 4.4.
 Technical Report ISSN 1434-8845, IKP, Universität Bonn, January
  1997.
 Verbmobil-Report 195 available at http://verbmobil.dfki.de.
[ bib | 
Abstract ]
 | 
| 
[198]
 | 
Paul A. Taylor, Simon King, Stephen Isard, Helen Wright, and Jacqueline Kowtko.
 Using intonation to constrain language models in speech recognition.
 In Proc. Eurospeech'97, Rhodes, 1997.
[ bib | 
.pdf | 
Abstract ]
 | 
| 
[199]
 | 
Simon King.
 Users Manual for Verbmobil Teilprojekt 4.4.
 IKP, Universität Bonn, October 1996.
[ bib | 
Abstract ]
 | 
| 
[200]
 | 
Simon King.
 Inventory design for Verbmobil Teilprojekt 4.4.
 Technical report, IKP, Universität Bonn, October 1996.
[ bib | 
Abstract ]
 | 
| 
[201]
 | 
Paul A. Taylor, Hiroshi Shimodaira, Stephen Isard, Simon King, and Jacqueline
  Kowtko.
 Using prosodic information to constrain language models for spoken
  dialogue.
 In Proc. ICSLP '96, Philadelphia, 1996.
[ bib | 
.ps | 
.pdf | 
Abstract ]
 | 
| 
[202]
 | 
Stephen Isard, Simon King, Paul A. Taylor, and Jacqueline Kowtko.
 Prosodic information in a speech recognition system intended for
  dialogue.
 In IEEE Workshop in speech recognition, Snowbird, Utah, 1995.
[ bib | 
Abstract ]
 |