Benigno Uria, Steve Renals, and Korin Richmond. A deep neural network for acoustic-articulatory speech inversion. In Proc. NIPS 2011 Workshop on Deep Learning and Unsupervised Feature Learning, Sierra Nevada, Spain, December 2011. [ bib | .pdf | Abstract ]

Atef Ben Youssef. Control of talking heads by acoustic-to-articulatory inversion for language learning and rehabilitation. PhD thesis, Grenoble University, October 2011. [ bib | .pdf | Abstract ]

Oliver Watts, Junichi Yamagishi, and Simon King. Unsupervised continuous-valued word features for phrase-break prediction without a part-of-speech tagger. In Proc. Interspeech, pages 2157-2160, Florence, Italy, August 2011. [ bib | .pdf | Abstract ]

Cassia Valentini-Botinhao, Junichi Yamagishi, and Simon King. Can objective measures predict the intelligibility of modified HMM-based synthetic speech in noise? In Proc. Interspeech, August 2011. [ bib | .pdf | Abstract ]

Korin Richmond, Phil Hoole, and Simon King. Announcing the electromagnetic articulography (day 1) subset of the mngu0 articulatory corpus. In Proc. Interspeech, pages 1505-1508, Florence, Italy, August 2011. [ bib | .pdf | Abstract ]

Ming Lei, Junichi Yamagishi, Korin Richmond, Zhen-Hua Ling, Simon King, and Li-Rong Dai. Formant-controlled HMM-based speech synthesis. In Proc. Interspeech, pages 2777-2780, Florence, Italy, August 2011. [ bib | .pdf | Abstract ]

Oliver Watts and Bowen Zhou. Unsupervised features from text for speech synthesis in a speech-to-speech translation system. In Proc. Interspeech, pages 2153-2156, Florence, Italy, August 2011. [ bib | .pdf | Abstract ]

Zhen-Hua Ling, Korin Richmond, and Junichi Yamagishi. Feature-space transform tying in unified acoustic-articulatory modelling of articulatory control of HMM-based speech synthesis. In Proc. Interspeech, pages 117-120, Florence, Italy, August 2011. [ bib | .pdf | Abstract ]

Atef Ben Youssef, Thomas Hueber, Pierre Badin, and Gérard Bailly. Toward a multi-speaker visual articulatory feedback system. In Proc. Interspeech, pages 589-592, Florence, Italy, August 2011. [ bib | .pdf | Abstract ]

Fergus R. McInnes and Sharon J. Goldwater. Unsupervised extraction of recurring words from infant-directed speech. In Proceedings of CogSci 2011, Boston, Massachusetts, July 2011. [ bib | .pdf | Abstract ]

Myroslava Dzikovska, Amy Isard, Peter Bell, Johanna Moore, Natalie Steinhauser, and Gwendolyn Campbell. Beetle II: an adaptable tutorial dialogue system. In Proceedings of the SIGDIAL 2011 Conference, demo session, pages 338-340, Portland, Oregon, June 2011. Association for Computational Linguistics. [ bib | http | Abstract ]

S. Andraszewicz, J. Yamagishi, and S. King. Vocal attractiveness of statistical speech synthesisers. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pages 5368-5371, May 2011. [ bib | DOI | Abstract ]

P.L. De Leon, I. Hernaez, I. Saratxaga, M. Pucher, and J. Yamagishi. Detection of synthetic speech for the problem of imposture. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pages 4844-4847, May 2011. [ bib | DOI | Abstract ]

Cassia Valentini-Botinhao, Junichi Yamagishi, and Simon King. Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech in noise. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pages 5112-5115, May 2011. [ bib | DOI | .pdf | Abstract ]

J.P. Cabral, S. Renals, J. Yamagishi, and K. Richmond. HMM-based speech synthesiser using the LF-model of the glottal source. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pages 4704-4707, May 2011. [ bib | DOI | .pdf | Abstract ]

K. Hashimoto, J. Yamagishi, W. Byrne, S. King, and K. Tokuda. An analysis of machine translation and speech synthesis in speech-to-speech translation system. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pages 5108-5111, May 2011. [ bib | DOI | Abstract ]

Dong Wang, Nicholas Evans, Raphael Troncy, and Simon King. Handling overlaps in spoken term detection. In Proc. International Conference on Acoustics, Speech and Signal Processing, pages 5656-5659, May 2011. [ bib | DOI | .pdf | Abstract ]

Dong Wang and Simon King. Letter-to-sound pronunciation prediction using conditional random fields. IEEE Signal Processing Letters, 18(2):122-125, February 2011. [ bib | DOI | .pdf | Abstract ]

Reima Karhila and Mirjam Wester. Rapid adaptation of foreign-accented HMM-based speech synthesis. In Proc. Interspeech, Florence, Italy, 2011. [ bib | .pdf | Abstract ]

Myroslava Dzikovska, Amy Isard, Peter Bell, Johanna D. Moore, Natalie B. Steinhauser, Gwendolyn E. Campbell, Leanne S. Taylor, Simon Caine, and Charlie Scott. Adaptive intelligent tutorial dialogue in the Beetle II system. In Artificial Intelligence in Education - 15th International Conference (AIED 2011), interactive event, volume 6738 of Lecture Notes in Computer Science, page 621, Auckland, New Zealand, 2011. Springer. [ bib | DOI ]

Mirjam Wester and Hui Liang. Cross-lingual speaker discrimination using natural and synthetic speech. In Proc. Interspeech, Florence, Italy, 2011. [ bib | .pdf | Abstract ]

T. Raitio, A. Suni, J. Yamagishi, H. Pulakka, J. Nurminen, M. Vainio, and P. Alku. HMM-based speech synthesis utilizing glottal inverse filtering. IEEE Transactions on Audio, Speech and Language Processing, 19(1):153-165, January 2011. [ bib | DOI | Abstract ]

Theresa Wilson and Gregor Hofer. Using linguistic and vocal expressiveness in social role recognition. In Proc. Int. Conf. on Intelligent User Interfaces, IUI2011, Palo Alto, USA, 2011. ACM. [ bib | .pdf | Abstract ]

J. Dines, J. Yamagishi, and S. King. Measuring the gap between HMM-based ASR and TTS. IEEE Journal of Selected Topics in Signal Processing, 2011. (in press). [ bib | DOI | Abstract ]

Mirjam Wester and Reima Karhila. Speaker similarity evaluation of foreign-accented speech synthesis using HMM-based speaker adaptation. In Proc. ICASSP, pages 5372-5375, Prague, Czech Republic, 2011. [ bib | .pdf | Abstract ]

Maria Klara Wolters, Christine Johnson, and Karl B. Isaac. Can the hearing handicap inventory for adults be used as a screen for perception experiments? In Proc. ICPhS XVII, Hong Kong, 2011. [ bib | .pdf | Abstract ]

Adriana Stan, Junichi Yamagishi, Simon King, and Matthew Aylett. The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate. Speech Communication, 53(3):442-450, 2011. [ bib | DOI | http | Abstract ]

L. Lu, A. Ghoshal, and S. Renals. Regularized subspace Gaussian mixture models for speech recognition. IEEE Signal Processing Letters, 18(7):419-422, 2011. [ bib | .pdf | Abstract ]

A. G. Pipe, R. Vaidyanathan, C. Melhuish, P. Bremner, P. Robinson, R. A. J. Clark, A. Lenz, K. Eder, N. Hawes, Z. Ghahramani, M. Fraser, M. Mirmehdi, P. Healey, and S. Skachek. Affective robotics: Human motion and behavioural inspiration for cooperation between humans and assistive robots. In Yoseph Bar-Cohen, editor, Biomimetics: Nature-Based Innovation, chapter 15. Taylor and Francis, 2011. [ bib ]

Michael A. Berger, Gregor Hofer, and Hiroshi Shimodaira. Carnival - combining speech technology and computer animation. IEEE Computer Graphics and Applications, 31:80-89, 2011. [ bib | DOI ]

Jonathan Kilgour, Jean Carletta, and Steve Renals. The Ambient Spotlight: Personal meeting capture with a microphone array. In Proc. HSCMA, 2011. [ bib | DOI | .pdf | Abstract ]

S. Renals. Automatic analysis of multiparty meetings. SADHANA - Academy Proceedings in Engineering Sciences, 36(5):917-932, 2011. [ bib | DOI | .pdf | Abstract ]

Mirjam Wester and Hui Liang. The EMIME Mandarin Bilingual Database. Technical Report EDI-INF-RR-1396, The University of Edinburgh, 2011. [ bib | .pdf | Abstract ]

Andi K. Winterboer, Martin I. Tietze, Maria K. Wolters, and Johanna D. Moore. The user-model based summarize and refine approach improves information presentation in spoken dialog systems. Computer Speech and Language, 25(2):175-191, 2011. [ bib | .pdf | Abstract ]

C. Mayo, R. A. J. Clark, and S. King. Listeners' weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis. Speech Communication, 53(3):311-326, 2011. [ bib | DOI | Abstract ]

L. Lu, A. Ghoshal, and S. Renals. Regularized subspace Gaussian mixture models for cross-lingual speech recognition. In Proc. ASRU, 2011. [ bib | .pdf | Abstract ]

Atef Ben Youssef, Thomas Hueber, Pierre Badin, Gérard Bailly, and Frédéric Elisei. Toward a speaker-independent visual articulatory feedback system. In 9th International Seminar on Speech Production, ISSP9, Montreal, Canada, 2011. [ bib | .pdf ]

Thomas Hueber, Pierre Badin, Gérard Bailly, Atef Ben Youssef, Frédéric Elisei, Bruce Denby, and Gérard Chollet. Statistical mapping between articulatory and acoustic data: application to silent speech interface and visual articulatory feedback. In Proceedings of the 1st International Workshop on Performative Speech and Singing Synthesis (p3s), Vancouver, Canada, 2011. [ bib | .pdf | Abstract ]