The Centre for Speech Technology Research, The University of Edinburgh

Publications by Steve Renals

[1] Ahmed Ali, Preslav Nakov, Peter Bell, and Steve Renals. WERd: Using social text spelling variants for evaluating dialectal speech recognition. In Proc. ASRU. IEEE, December 2017. [ bib | .pdf | Abstract ]
[2] Joanna Rownicka, Steve Renals, and Peter Bell. Simplifying very deep convolutional neural network architectures for robust speech recognition. In Proc. 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, Japan, December 2017. [ bib | .pdf | Abstract ]
[3] Emiru Tsunoo, Ondrej Klejch, Peter Bell, and Steve Renals. Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features. In Proc. IEEE Automatic Speech Recognition and Understanding Workshop, Okinawa, Japan, December 2017. [ bib | .pdf | Abstract ]
[4] Emiru Tsunoo, Ondrej Klejch, Peter Bell, and Steve Renals. Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features. In Proc. ASRU. IEEE, August 2017. [ bib | .pdf | Abstract ]
[5] Emiru Tsunoo, Peter Bell, and Steve Renals. Hierarchical recurrent neural network for story segmentation. In Proc. Interspeech, August 2017. [ bib | .pdf | Abstract ]
[6] Renars Liepins, Ulrich Germann, Guntis Barzdins, Alexandra Birch, Steve Renals, Susanne Weber, Peggy van der Kreeft, Hervé Bourlard, João Prieto, Ondřej Klejch, Peter Bell, Alexandros Lazaridis, Alfonso Mendes, Sebastian Riedel, Mariana S. C. Almeida, Pedro Balage, Shay Cohen, Tomasz Dwojak, Phil Garner, Andreas Giefer, Marcin Junczys-Dowmunt, Hina Imrani, David Nogueira, Ahmed Ali, Sebastião Miranda, Andrei Popescu-Belis, Lesly Miculicich Werlen, Nikos Papasarantopoulos, Abiola Obamuyide, Clive Jones, Fahim Dalvi, Andreas Vlachos, Yang Wang, Sibo Tong, Rico Sennrich, Nikolaos Pappas, Shashi Narayan, Marco Damonte, Nadir Durrani, Sameer Khurana, Ahmed Abdelali, Hassan Sajjad, Stephan Vogel, David Sheppey, and Chris Hernon. The SUMMA platform prototype. In Proceedings of the EACL 2017 Software Demonstrations, pages 116–119. Association for Computational Linguistics (ACL), April 2017. [ bib | .pdf | Abstract ]
[7] Ondrej Klejch, Peter Bell, and Steve Renals. Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), New Orleans, USA, March 2017. [ bib | .pdf | Abstract ]
[8] Joachim Fainberg, Steve Renals, and Peter Bell. Factorised representations for neural network adaptation to diverse acoustic environments. In Proc. Interspeech 2017, pages 749-753, 2017. [ bib | .pdf | Abstract ]
[9] Peter Bell, Pawel Swietojanski, and Steve Renals. Multitask learning of context-dependent targets in deep neural network acoustic models. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(2):238-247, 2017. [ bib | .pdf | Abstract ]
[10] Ondrej Klejch, Peter Bell, and Steve Renals. Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches. In Proc. IEEE Workshop on Spoken Language Technology, San Diego, USA, December 2016. [ bib | .pdf | Abstract ]
[11] P. Swietojanski and S. Renals. Differentiable Pooling for Unsupervised Acoustic Model Adaptation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(10):1773-1784, October 2016. [ bib | DOI | .pdf | Abstract ]
[12] Joachim Fainberg, Peter Bell, Mike Lincoln, and Steve Renals. Improving children's speech recognition through out-of-domain data augmentation. In Proc. Interspeech, San Francisco, USA, September 2016. [ bib | .pdf | Abstract ]
[13] Siva Reddy Gangireddy, Pawel Swietojanski, Peter Bell, and Steve Renals. Unsupervised adaptation of Recurrent Neural Network Language Models. In Proc. Interspeech, San Francisco, USA, September 2016. [ bib | .pdf | Abstract ]
[14] P. Swietojanski, J. Li, and S. Renals. Learning hidden unit contributions for unsupervised acoustic model adaptation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(8):1450-1463, August 2016. [ bib | DOI | .pdf | Abstract ]
[15] P. Swietojanski and S. Renals. SAT-LHUC: Speaker adaptive training for learning hidden unit contributions. In Proc. IEEE ICASSP, Shanghai, China, March 2016. [ bib | .pdf | Abstract ]
[16] Ahmed Ali, Najim Dehak, Patrick Cardinal, Sameer Khurana, Sree Harsha Yella, James Glass, Peter Bell, and Steve Renals. Automatic dialect detection in Arabic broadcast speech. In Proc. Interspeech, 2016. [ bib | .pdf | Abstract ]
[17] P. Swietojanski and S. Renals. SAT-LHUC: Speaker adaptive training for learning hidden unit contributions. In Proc. IEEE Int. Conf. Acoustic, Speech Signal Processing (ICASSP), pages 5010-5014, 2016. [ bib | .pdf | Abstract ]
[18] A. Ali, P. Bell, J. Glass, Y. Messaoui, H. Mubarak, S. Renals, and Y. Zhang. The MGB-2 Challenge: Arabic multi-dialect broadcast media recognition. In Proc. SLT, 2016. [ bib | .pdf | Abstract ]
[19] P. Swietojanski, P. Bell, and S. Renals. Structured output layer with auxiliary targets for context-dependent acoustic modelling. In Proc. Interspeech, Dresden, Germany, September 2015. [ bib | DOI | .pdf | Abstract ]
[20] Peter Bell and Steve Renals. Complementary tasks for context-dependent deep neural network acoustic models. In Proc. Interspeech, Dresden, Germany, September 2015. [ bib | .pdf | Abstract ]
[21] Siva Reddy Gangireddy, Steve Renals, Yoshihiko Nankaku, and Akinobu Lee. Prosodically-enhanced recurrent neural network language models. In Proc. Interspeech, pages 2390-2394, Dresden, Germany, September 2015. [ bib | .pdf | Abstract ]
[22] B. Uria, I. Murray, S. Renals, C. Valentini-Botinhao, and J. Bridle. Modelling acoustic feature dependencies with artificial neural networks: Trajectory-RNADE. In Proc. ICASSP, pages 4465-4469, Brisbane, Australia, April 2015. [ bib | .pdf | Abstract ]
[23] P. Bell and S. Renals. Regularization of context-dependent deep neural networks with context-independent multi-task training. In Proc. ICASSP, Brisbane, Australia, April 2015. [ bib | .pdf | Abstract ]
[24] P. Swietojanski and S. Renals. Differentiable pooling for unsupervised speaker adaptation. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015. [ bib | .pdf | Abstract ]
[25] Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, and Simon King. A study of speaker adaptation for DNN-based speech synthesis. In Proc. Interspeech, 2015. [ bib | .pdf ]
[26] Liang Lu, Xingxing Zhang, KyungHyun Cho, and Steve Renals. A study of the recurrent neural network encoder-decoder for large vocabulary speech recognition. In Proc. Interspeech, 2015. [ bib | .pdf | Abstract ]
[27] Liang Lu and Steve Renals. Feature-space speaker adaptation for probabilistic linear discriminant analysis acoustic models. In Proc. Interspeech, 2015. [ bib | .pdf | Abstract ]
[28] Liang Lu and Steve Renals. Multi-frame factorisation for long-span acoustic modelling. In Proc. ICASSP, 2015. [ bib | .pdf | Abstract ]
[29] Peter Bell and Steve Renals. A system for automatic alignment of broadcast media captions using weighted finite-state transducers. In Proc. ASRU, 2015. [ bib | .pdf | Abstract ]
[30] Ahmed Ali, Walid Magdy, Peter Bell, and Steve Renals. Multi-reference WER for evaluating ASR for languages with no orthographic rules. In Proc. ASRU, 2015. [ bib | .pdf | Abstract ]
[31] Peter Bell, Mark Gales, Thomas Hain, Jonathan Kilgour, Pierre Lanchantin, Xunying Liu, Andrew McParland, Steve Renals, Oscar Saz, Mirjam Wester, and Phil Woodland. The MGB challenge: Evaluating multi-genre broadcast media recognition. In Proc. ASRU, 2015. [ bib | .pdf | Abstract ]
[32] P. Swietojanski and S. Renals. Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models. In Proc. IEEE Workshop on Spoken Language Technology, Lake Tahoe, USA, December 2014. [ bib | .pdf | Abstract ]
[33] Peter Bell, Pawel Swietojanski, Joris Driesen, Mark Sinclair, Fergus McInnes, and Steve Renals. The UEDIN ASR systems for the IWSLT 2014 evaluation. In Proc. IWSLT, South Lake Tahoe, USA, December 2014. [ bib | .pdf | Abstract ]
[34] P. Swietojanski, A. Ghoshal, and S. Renals. Convolutional neural networks for distant speech recognition. Signal Processing Letters, IEEE, 21(9):1120-1124, September 2014. [ bib | DOI | .pdf | Abstract ]
[35] Siva Reddy Gangireddy, Fergus McInnes, and Steve Renals. Feed forward pre-training for recurrent neural network language models. In Proc. Interspeech, pages 2620-2624, September 2014. [ bib | .pdf | Abstract ]
[36] Oliver Watts, Siva Gangireddy, Junichi Yamagishi, Simon King, Steve Renals, Adriana Stan, and Mircea Giurgiu. Neural net word representations for phrase-break prediction without a part of speech tagger. In Proc. ICASSP, pages 2618-2622, Florence, Italy, May 2014. [ bib | .pdf | Abstract ]
[37] J.P. Cabral, K. Richmond, J. Yamagishi, and S. Renals. Glottal spectral separation for speech synthesis. Selected Topics in Signal Processing, IEEE Journal of, 8(2):195-208, April 2014. [ bib | DOI | .pdf | Abstract ]
[38] Liang Lu, Arnab Ghoshal, and Steve Renals. Cross-lingual subspace Gaussian mixture model for low-resource speech recognition. IEEE Transactions on Audio, Speech and Language Processing, 22(1):17-27, 2014. [ bib | DOI | .pdf | Abstract ]
[39] S. Renals and P. Swietojanski. Neural networks for distant speech recognition. In The 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014. [ bib | .pdf | Abstract ]
[40] Liang Lu and Steve Renals. Probabilistic linear discriminant analysis for acoustic modelling. IEEE Signal Processing Letters, 21(6):702-706, 2014. [ bib | DOI | .pdf | Abstract ]
[41] Catherine Lai and Steve Renals. Incorporating lexical and prosodic information at different levels for meeting summarization. In Proc. Interspeech 2014, 2014. [ bib | .pdf | Abstract ]
[42] Liang Lu and Steve Renals. Probabilistic linear discriminant analysis with bottleneck features for speech recognition. In Proc. Interspeech, 2014. [ bib | .pdf | Abstract ]
[43] P. Bell, J. Driesen, and S. Renals. Cross-lingual adaptation with multi-task adaptive networks. In Proc. Interspeech, 2014. [ bib | .pdf | Abstract ]
[44] P. Swietojanski, A. Ghoshal, and S. Renals. Hybrid acoustic models for distant and multichannel large vocabulary speech recognition. In Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), December 2013. [ bib | DOI | .pdf | Abstract ]
[45] Joris Driesen and Steve Renals. Lightly supervised automatic subtitling of weather forecasts. In Proc. Automatic Speech Recognition and Understanding Workshop, Olomouc, Czech Republic, December 2013. [ bib | DOI | .pdf | Abstract ]
[46] Joris Driesen, Peter Bell, Mark Sinclair, and Steve Renals. Description of the UEDIN system for German ASR. In Proc. IWSLT, Heidelberg, Germany, December 2013. [ bib | .pdf | Abstract ]
[47] Peter Bell, Hitoshi Yamamoto, Pawel Swietojanski, Youzheng Wu, Fergus McInnes, Chiori Hori, and Steve Renals. A lecture transcription system combining neural network acoustic and language models. In Proc. Interspeech, Lyon, France, August 2013. [ bib | .pdf | Abstract ]
[48] H. Bourlard, M. Ferras, N. Pappas, A. Popescu-Belis, S. Renals, F. McInnes, P. Bell, S. Ingram, and M. Guillemot. Processing and linking audio events in large multimedia archives: The EU inEvent project. In Proceedings of SLAM 2013 (First Workshop on Speech, Language and Audio in Multimedia), Marseille, France, August 2013. [ bib | .pdf | Abstract ]
[49] Peter Bell, Pawel Swietojanski, and Steve Renals. Multi-level adaptive networks in tandem and hybrid ASR systems. In Proc. ICASSP, Vancouver, Canada, May 2013. [ bib | DOI | .pdf | Abstract ]
[50] Liang Lu, KK Chin, Arnab Ghoshal, and Steve Renals. Joint uncertainty decoding for noise robust subspace Gaussian mixture models. IEEE Transactions on Audio, Speech and Language Processing, 21(9):1791-1804, 2013. [ bib | DOI | .pdf | Abstract ]
[51] Pawel Swietojanski, Arnab Ghoshal, and Steve Renals. Revisiting hybrid and GMM-HMM system combination techniques. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013. [ bib | DOI | .pdf | Abstract ]
[52] Arnab Ghoshal, Pawel Swietojanski, and Steve Renals. Multilingual training of deep neural networks. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013. [ bib | DOI | .pdf | Abstract ]
[53] Catherine Lai, Jean Carletta, and Steve Renals. Detecting summarization hot spots in meetings using group level involvement and turn-taking features. In Proc. Interspeech 2013, Lyon, France, 2013. [ bib | .pdf | Abstract ]
[54] Catherine Lai, Jean Carletta, and Steve Renals. Modelling participant affect in meetings with turn-taking features. In Proceedings of WASSS 2013, Grenoble, France, 2013. [ bib | .pdf | Abstract ]
[55] P. Lanchantin, P. Bell, M. Gales, T. Hain, X. Liu, Y. Long, J. Quinnell, S. Renals, O. Saz, M. Seigel, P. Swietojanski, and P. Woodland. Automatic transcription of multi-genre media archives. In Proc. Workshop on Speech, Language and Audio in Multimedia, Marseille, France, 2013. [ bib | .pdf | Abstract ]
[56] Liang Lu, Arnab Ghoshal, and Steve Renals. Noise adaptive training for subspace Gaussian mixture models. In Proc. Interspeech, 2013. [ bib | .pdf | Abstract ]
[57] Liang Lu, Arnab Ghoshal, and Steve Renals. Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition. In Proc. ASRU, 2013. [ bib | DOI | .pdf | Abstract ]
[58] Christian Geng, Alice Turk, James M. Scobbie, Cedric Macmartin, Philip Hoole, Korin Richmond, Alan Wrench, Marianne Pouplier, Ellen Gurman Bard, Ziggy Campbell, Catherine Dickie, Eddie Dubourg, William Hardcastle, Evia Kainada, Simon King, Robin Lickley, Satsuki Nakai, Steve Renals, Kevin White, and Ronny Wiegand. Recording speech articulation in dialogue: Evaluating a synchronized double electromagnetic articulography setup. Journal of Phonetics, 41(6):421-431, 2013. [ bib | DOI | http | .pdf | Abstract ]
[59] Peter Bell, Fergus McInnes, Siva Reddy Gangireddy, Mark Sinclair, Alexandra Birch, and Steve Renals. The UEDIN English ASR system for the IWSLT 2013 evaluation. In Proc. International Workshop on Spoken Language Translation, 2013. [ bib | .pdf | Abstract ]
[60] E. Zwyssig, F. Faubel, S. Renals, and M. Lincoln. Recognition of overlapping speech using digital MEMS microphone arrays. In Proc. IEEE ICASSP, 2013. [ bib | DOI | .pdf | Abstract ]
[61] P. Swietojanski, A. Ghoshal, and S. Renals. Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR. In Proc. IEEE Workshop on Spoken Language Technology, pages 246-251, Miami, Florida, USA, December 2012. [ bib | DOI | .pdf | Abstract ]
[62] P. Bell, M. Gales, P. Lanchantin, X. Liu, Y. Long, S. Renals, P. Swietojanski, and P. Woodland. Transcription of multi-genre media archives using out-of-domain data. In Proc. IEEE Workshop on Spoken Language Technology, pages 324-329, Miami, Florida, USA, December 2012. [ bib | DOI | .pdf | Abstract ]
[63] Korin Richmond and Steve Renals. Ultrax: An animated midsagittal vocal tract display for speech therapy. In Proc. Interspeech, Portland, Oregon, USA, September 2012. [ bib | .pdf | Abstract ]
[64] Benigno Uria, Iain Murray, Steve Renals, and Korin Richmond. Deep architectures for articulatory inversion. In Proc. Interspeech, Portland, Oregon, USA, September 2012. [ bib | .pdf | Abstract ]
[65] Eva Hasler, Peter Bell, Arnab Ghoshal, Barry Haddow, Philipp Koehn, Fergus McInnes, Steve Renals, and Pawel Swietojanski. The UEDIN system for the IWSLT 2012 evaluation. In Proc. International Workshop on Spoken Language Translation, 2012. [ bib | .pdf | Abstract ]
[66] Ravichander Vipperla, Maria Wolters, and Steve Renals. Spoken dialogue interfaces for older people. In Kenneth J. Turner, editor, Advances in Home Care Technologies. IOS Press, 2012. [ bib | .pdf | Abstract ]
[67] E. Zwyssig, S. Renals, and M. Lincoln. On the effect of SNR and superdirective beamforming in speaker diarisation in meetings. In Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, pages 4177-4180, 2012. [ bib | DOI | .pdf | Abstract ]
[68] E. Zwyssig, S. Renals, and M. Lincoln. Determining the number of speakers in a meeting using microphone array features. In Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, pages 4765-4768, 2012. [ bib | DOI | .pdf | Abstract ]
[69] L. Lu, A. Ghoshal, and S. Renals. Maximum a posteriori adaptation of subspace Gaussian mixture models for cross-lingual speech recognition. In Proc. ICASSP, pages 4877-4880, 2012. [ bib | DOI | .pdf | Abstract ]
[70] L. Lu, A. Ghoshal, and S. Renals. Joint uncertainty decoding with unscented transform for noise robust subspace Gaussian mixture model. In Proc. Sapa-Scale workshop, 2012. [ bib | .pdf | Abstract ]
[71] L. Lu, KK Chin, A. Ghoshal, and S. Renals. Noise compensation for subspace Gaussian mixture models. In Proc. Interspeech, 2012. [ bib | .pdf | Abstract ]
[72] Junichi Yamagishi, Christophe Veaux, Simon King, and Steve Renals. Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction. Acoustical Science and Technology, 33(1):1-5, 2012. [ bib | DOI | http | .pdf | Abstract ]
[73] Steve Renals, Hervé Bourlard, Jean Carletta, and Andrei Popescu-Belis, editors. Multimodal Signal Processing: Human Interactions in Meetings. Cambridge University Press, 2012. [ bib ]
[74] Benigno Uria, Steve Renals, and Korin Richmond. A deep neural network for acoustic-articulatory speech inversion. In Proc. NIPS 2011 Workshop on Deep Learning and Unsupervised Feature Learning, Sierra Nevada, Spain, December 2011. [ bib | .pdf | Abstract ]
[75] J.P. Cabral, S. Renals, J. Yamagishi, and K. Richmond. HMM-based speech synthesiser using the LF-model of the glottal source. In Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pages 4704-4707, May 2011. [ bib | DOI | .pdf | Abstract ]
[76] L. Lu, A. Ghoshal, and S. Renals. Regularized subspace Gaussian mixture models for speech recognition. IEEE Signal Processing Letters, 18(7):419-422, 2011. [ bib | .pdf | Abstract ]
[77] Jonathan Kilgour, Jean Carletta, and Steve Renals. The Ambient Spotlight: Personal meeting capture with a microphone array. In Proc. HSCMA, 2011. [ bib | DOI | .pdf | Abstract ]
[78] L. Lu, A. Ghoshal, and S. Renals. Regularized subspace Gaussian mixture models for cross-lingual speech recognition. In Proc. ASRU, 2011. [ bib | .pdf | Abstract ]
[79] João Cabral, Steve Renals, Korin Richmond, and Junichi Yamagishi. Transforming voice source parameters in a HMM-based speech synthesiser with glottal post-filtering. In Proc. 7th ISCA Speech Synthesis Workshop (SSW7), pages 365-370, NICT/ATR, Kyoto, Japan, September 2010. [ bib | .pdf | Abstract ]
[80] Ravi Chander Vipperla, Steve Renals, and Joe Frankel. Augmentation of adaptation data. In Proc. Interspeech, pages 530-533, Makuhari, Japan, September 2010. [ bib | .pdf | Abstract ]
[81] Alice Turk, James Scobbie, Christian Geng, Barry Campbell, Catherine Dickie, Eddie Dubourg, Ellen Gurman Bard, William Hardcastle, Mariam Hartinger, Simon King, Robin Lickley, Cedric Macmartin, Satsuki Nakai, Steve Renals, Korin Richmond, Sonja Schaeffler, Kevin White, Ronny Wiegand, and Alan Wrench. An Edinburgh speech production facility. Poster presented at the 12th Conference on Laboratory Phonology, Albuquerque, New Mexico, July 2010. [ bib | .pdf ]
[82] Erich Zwyssig, Mike Lincoln, and Steve Renals. A digital microphone array for distant speech recognition. In Proc. IEEE ICASSP-10, pages 5106-5109, 2010. [ bib | DOI | .pdf | Abstract ]
[83] Steve Renals. Recognition and understanding of meetings. In Proc. NAACL/HLT, pages 1-9, 2010. [ bib | .pdf | Abstract ]
[84] Jonathan Kilgour, Jean Carletta, and Steve Renals. The Ambient Spotlight: Queryless desktop search from meeting speech. In Proc. ACM Multimedia 2010 Workshop SSCS 2010, 2010. [ bib | DOI | .pdf | Abstract ]
[85] Songfang Huang and Steve Renals. Hierarchical Bayesian language models for conversational speech recognition. IEEE Transactions on Audio, Speech and Language Processing, 18(8):1941-1954, January 2010. [ bib | DOI | http | .pdf | Abstract ]
[86] Songfang Huang and Steve Renals. Power law discounting for n-gram language models. In Proc. IEEE ICASSP-10, pages 5178-5181, 2010. [ bib | DOI | http | .pdf | Abstract ]
[87] Maria K. Wolters, Karl B. Isaac, and Steve Renals. Evaluating speech synthesis intelligibility using Amazon Mechanical Turk. In Proc. 7th Speech Synthesis Workshop (SSW7), pages 136-141, 2010. [ bib | .pdf | Abstract ]
[88] Steve Renals and Simon King. Automatic speech recognition. In William J. Hardcastle, John Laver, and Fiona E. Gibbon, editors, Handbook of Phonetic Sciences, chapter 22. Wiley Blackwell, 2010. [ bib ]
[89] Ravi Chander Vipperla, Steve Renals, and Joe Frankel. Ageing voices: The effect of changes in voice parameters on ASR performance. EURASIP Journal on Audio, Speech, and Music Processing, 2010. [ bib | DOI | http | .pdf | Abstract ]
[90] Steve Renals and Thomas Hain. Speech recognition. In Alex Clark, Chris Fox, and Shalom Lappin, editors, Handbook of Computational Linguistics and Natural Language Processing. Wiley Blackwell, 2010. [ bib ]
[91] Alice Turk, James Scobbie, Christian Geng, Cedric Macmartin, Ellen Bard, Barry Campbell, Catherine Dickie, Eddie Dubourg, Bill Hardcastle, Phil Hoole, Evia Kainada, Robin Lickley, Satsuki Nakai, Marianne Pouplier, Simon King, Steve Renals, Korin Richmond, Sonja Schaeffler, Ronnie Wiegand, Kevin White, and Alan Wrench. The Edinburgh Speech Production Facility's articulatory corpus of spontaneous dialogue. The Journal of the Acoustical Society of America, 128(4):2429-2429, 2010. [ bib | DOI | Abstract ]
[92] Jonathan Kilgour, Jean Carletta, and Steve Renals. The Ambient Spotlight: Personal multimodal search without query. In Proc. ICMI-MLMI, 2010. [ bib | DOI | http | .pdf | Abstract ]
[93] Maria Wolters, Ravichander Vipperla, and Steve Renals. Age recognition for spoken dialogue systems: Do we need it? In Proc. Interspeech, September 2009. [ bib | .pdf | Abstract ]
[94] Songfang Huang and Steve Renals. A parallel training algorithm for hierarchical Pitman-Yor process language models. In Proc. Interspeech'09, pages 2695-2698, Brighton, UK, September 2009. [ bib | .pdf | Abstract ]
[95] J. Cabral, S. Renals, K. Richmond, and J. Yamagishi. HMM-based speech synthesis with an acoustic glottal source model. In Proc. The First Young Researchers Workshop in Speech Technology, April 2009. [ bib | .pdf | Abstract ]
[96] Gabriel Murray, Thomas Kleinbauer, Peter Poller, Tilman Becker, Steve Renals, and Jonathan Kilgour. Extrinsic summarization evaluation: A decision audit task. ACM Transactions on Speech and Language Processing, 6(2):1-29, 2009. [ bib | DOI | http | .pdf | Abstract ]
[97] Ravi Chander Vipperla, Maria Wolters, Kallirroi Georgila, and Steve Renals. Speech input from older users in smart environments: Challenges and perspectives. In Proc. HCI International: Universal Access in Human-Computer Interaction. Intelligent and Ubiquitous Interaction Environments, number 5615 in Lecture Notes in Computer Science. Springer, 2009. [ bib | DOI | http | .pdf | Abstract ]
[98] Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Evaluation of a hierarchical reinforcement learning spoken dialogue system. Computer Speech and Language, 24(2):395-429, 2009. [ bib | DOI | .pdf | Abstract ]
[99] Junichi Yamagishi, Takashi Nose, Heiga Zen, Zhenhua Ling, Tomoki Toda, Keiichi Tokuda, Simon King, and Steve Renals. Robust speaker-adaptive HMM-based text-to-speech synthesis. IEEE Transactions on Audio, Speech and Language Processing, 17(6):1208-1230, 2009. [ bib | http | www: | Abstract ]
[100] Y. Hifny and S. Renals. Speech recognition using augmented conditional random fields. IEEE Transactions on Audio, Speech and Language Processing, 17(2):354-365, 2009. [ bib | http | .pdf | Abstract ]
[101] Songfang Huang and Steve Renals. Unsupervised language model adaptation based on topic and role information in multiparty meetings. In Proc. Interspeech'08, pages 833-836, Brisbane, Australia, September 2008. [ bib | .pdf | Abstract ]
[102] C. Qin, M. Carreira-Perpiñán, K. Richmond, A. Wrench, and S. Renals. Predicting tongue shapes from a few landmark locations. In Proc. Interspeech, pages 2306-2309, Brisbane, Australia, September 2008. [ bib | .PDF | Abstract ]
[103] J. Cabral, S. Renals, K. Richmond, and J. Yamagishi. Glottal spectral separation for parametric speech synthesis. In Proc. Interspeech, pages 1829-1832, Brisbane, Australia, September 2008. [ bib | .PDF | Abstract ]
[104] Songfang Huang and Steve Renals. Using participant role in multiparty meetings as prior knowledge for nonparametric topic modeling. In Proc. ICML/UAI/COLT Workshop on Prior Knowledge for Text and Language Processing, pages 21-24, Helsinki, Finland, July 2008. [ bib | .pdf | Abstract ]
[105] Steve Renals, Thomas Hain, and Hervé Bourlard. Interpretation of multiparty meetings: The AMI and AMIDA projects. In IEEE Workshop on Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008, pages 115-118, 2008. [ bib | DOI | http | .pdf | Abstract ]
[106] Ravichander Vipperla, Steve Renals, and Joe Frankel. Longitudinal study of ASR performance on ageing voices. In Proc. Interspeech, Brisbane, 2008. [ bib | .pdf | Abstract ]
[107] Le Zhang and Steve Renals. Acoustic-articulatory modelling with the trajectory HMM. IEEE Signal Processing Letters, 15:245-248, 2008. [ bib | .pdf | Abstract ]
[108] Gabriel Murray, Thomas Kleinbauer, Peter Poller, Steve Renals, and Jonathan Kilgour. Extrinsic summarization evaluation: A decision audit task. In Machine Learning for Multimodal Interaction (Proc. MLMI '08), number 5237 in Lecture Notes in Computer Science, pages 349-361. Springer, 2008. [ bib | DOI | .pdf | Abstract ]
[109] Gabriel Murray and Steve Renals. Detecting action items in meetings. In Machine Learning for Multimodal Interaction (Proc. MLMI '08), number 5237 in Lecture Notes in Computer Science, pages 208-213. Springer, 2008. [ bib | DOI | http | .pdf | Abstract ]
[110] Giulia Garau and Steve Renals. Combining spectral representations for large vocabulary continuous speech recognition. IEEE Transactions on Audio, Speech and Language Processing, 16(3):508-518, 2008. [ bib | DOI | http | .pdf | Abstract ]
[111] Heidi Christensen, Yoshihiko Gotoh, and Steve Renals. A cascaded broadcast news highlighter. IEEE Transactions on Audio, Speech and Language Processing, 16:151-161, 2008. [ bib | DOI | http | .pdf | Abstract ]
[112] Songfang Huang and Steve Renals. Modeling topic and role information in meetings using the hierarchical Dirichlet process. In A. Popescu-Belis and R. Stiefelhagen, editors, Machine Learning for Multimodal Interaction V, volume 5237 of Lecture Notes in Computer Science, pages 214-225. Springer, 2008. [ bib | .pdf | Abstract ]
[113] Gabriel Murray and Steve Renals. Meta comments for summarizing meeting speech. In Machine Learning for Multimodal Interaction (Proc. MLMI '08), number 5237 in Lecture Notes in Computer Science, pages 236-247. Springer, 2008. [ bib | DOI | http | .pdf | Abstract ]
[114] Giulia Garau and Steve Renals. Pitch adaptive features for LVCSR. In Proc. Interspeech '08, 2008. [ bib | .pdf | Abstract ]
[115] Alfred Dielmann and Steve Renals. Recognition of dialogue acts in multiparty meetings using a switching DBN. IEEE Transactions on Audio, Speech and Language Processing, 16(7):1303-1314, 2008. [ bib | DOI | http | .pdf | Abstract ]
[116] Herve Bourlard and Steve Renals. Recognition and understanding of meetings: Overview of the European AMI and AMIDA projects. In Proc. LangTech 2008, 2008. [ bib | .pdf | Abstract ]
[117] Songfang Huang and Steve Renals. Hierarchical Pitman-Yor language models for ASR in meetings. In Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU'07), pages 124-129, Kyoto, Japan, December 2007. [ bib | .pdf | Abstract ]
[118] Junichi Yamagishi, Takao Kobayashi, Steve Renals, Simon King, Heiga Zen, Tomoki Toda, and Keiichi Tokuda. Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV. In Proc. 6th ISCA Workshop on Speech Synthesis (SSW-6), August 2007. [ bib | .pdf | Abstract ]
[119] Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Hierarchical dialogue optimization using semi-Markov decision processes. In Proc. Interspeech, August 2007. [ bib | .pdf | Abstract ]
[120] A. Dielmann and S. Renals. DBN based joint dialogue act recognition of multiparty meetings. In Proc. IEEE ICASSP, volume 4, pages 133-136, April 2007. [ bib | .pdf | Abstract ]
[121] A. Dielmann and S. Renals. Automatic dialogue act recognition using a dynamic Bayesian network. In S. Renals, S. Bengio, and J. Fiscus, editors, Proc. Multimodal Interaction and Related Machine Learning Algorithms Workshop (MLMI-06), pages 178-189. Springer, 2007. [ bib | .pdf | Abstract ]
[122] Songfang Huang and Steve Renals. Modeling prosodic features in language models for meetings. In A. Popescu-Belis, S. Renals, and H. Bourlard, editors, Machine Learning for Multimodal Interaction IV, volume 4892 of Lecture Notes in Computer Science, pages 191-202. Springer, 2007. [ bib | .pdf | Abstract ]
[123] Alejandro Jaimes, Hervé Bourlard, Steve Renals, and Jean Carletta. Recording, indexing, summarizing, and accessing meeting videos: An overview of the AMI project. In Proc. IEEE ICIAPW, pages 59-64, 2007. [ bib | DOI | http | .pdf | Abstract ]
[124] Steve Renals, Thomas Hain, and Hervé Bourlard. Recognition and interpretation of meetings: The AMI and AMIDA projects. In Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU '07), 2007. [ bib | .pdf | Abstract ]
[125] Gabriel Murray and Steve Renals. Towards online speech summarization. In Proc. Interspeech '07, 2007. [ bib | .PDF | Abstract ]
[126] Gabriel Murray and Steve Renals. Term-weighting for summarization of multi-party spoken dialogues. In A. Popescu-Belis, S. Renals, and H. Bourlard, editors, Machine Learning for Multimodal Interaction IV, volume 4892 of Lecture Notes in Computer Science, pages 155-166. Springer, 2007. [ bib | .pdf | Abstract ]
[127] J. Cabral, S. Renals, K. Richmond, and J. Yamagishi. Towards an improved modeling of the glottal source in statistical parametric speech synthesis. In Proc. of the 6th ISCA Workshop on Speech Synthesis, Bonn, Germany, 2007. [ bib | .pdf | Abstract ]
[128] Alfred Dielmann and Steve Renals. Automatic meeting segmentation using dynamic Bayesian networks. IEEE Transactions on Multimedia, 9(1):25-36, 2007. [ bib | DOI | http | .pdf | Abstract ]
[129] Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Reinforcement learning of dialogue strategies with hierarchical abstract machines. In Proc. IEEE/ACL Workshop on Spoken Language Technology (SLT), December 2006. [ bib | .pdf | Abstract ]
[130] Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces. In Proc. Interspeech, September 2006. [ bib | .pdf | Abstract ]
[131] Le Zhang and Steve Renals. Phone recognition analysis for trajectory HMM. In Proc. Interspeech 2006, Pittsburgh, USA, September 2006. [ bib | .pdf | Abstract ]
[132] G. Murray and S. Renals. Dialogue act compression via pitch contour preservation. In Proceedings of the 9th International Conference on Spoken Language Processing, Pittsburgh, USA, September 2006. [ bib | .pdf | Abstract ]
[133] G. Murray, S. Renals, J. Moore, and J. Carletta. Incorporating speaker and discourse features into speech summarization. In Proceedings of the Human Language Technology Conference - North American Chapter of the Association for Computational Linguistics Meeting (HLT-NAACL) 2006, New York City, USA, June 2006. [ bib | .pdf | Abstract ]
[134] G. Murray, S. Renals, and M. Taboada. Prosodic correlates of rhetorical relations. In Proceedings of HLT/NAACL ACTS Workshop, 2006, New York City, USA, June 2006. [ bib | .pdf | Abstract ]
[135] M. Al-Hames, A. Dielmann, D. Gatica-Perez, S. Reiter, S. Renals, G. Rigoll, and D. Zhang. Multimodal integration for meeting group action segmentation and recognition. In S. Renals and S. Bengio, editors, Proc. Multimodal Interaction and Related Machine Learning Algorithms Workshop (MLMI-05), pages 52-63. Springer, 2006. [ bib | Abstract ]
[136] Steve Renals, Samy Bengio, and Jonathan Fiscus, editors. Machine learning for multimodal interaction (Proceedings of MLMI '06), volume 4299 of Lecture Notes in Computer Science. Springer-Verlag, 2006. [ bib ]
[137] P. Hsueh, J. Moore, and S. Renals. Automatic segmentation of multiparty dialogue. In Proc. EACL06, 2006. [ bib | .pdf | Abstract ]
[138] Marc Al-Hames, Thomas Hain, Jan Cernocky, Sascha Schreiber, Mannes Poel, Ronald Mueller, Sebastien Marcel, David van Leeuwen, Jean-Marc Odobez, Sileye Ba, Hervé Bourlard, Fabien Cardinaux, Daniel Gatica-Perez, Adam Janin, Petr Motlicek, Stephan Reiter, Steve Renals, Jeroen van Rest, Rutger Rienks, Gerhard Rigoll, Kevin Smith, Andrew Thean, and Pavel Zemcik. Audio-video processing in meetings: Seven questions and current AMI answers. In S. Renals, S. Bengio, and J. G. Fiscus, editors, Machine Learning for Multimodal Interaction (Proc. MLMI '06), volume 4299 of Lecture Notes in Computer Science, pages 24-35. Springer, 2006. [ bib ]
[139] Steve Renals and Samy Bengio, editors. Machine learning for multimodal interaction (Proceedings of MLMI '05), volume 3869 of Lecture Notes in Computer Science. Springer-Verlag, 2006. [ bib ]
[140] Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Human-computer dialogue simulation using hidden Markov models. In Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), November 2005. [ bib | .pdf | Abstract ]
[141] G. Garau, S. Renals, and T. Hain. Applying vocal tract length normalization to meeting recordings. In Proc. Interspeech, September 2005. [ bib | .pdf | Abstract ]
[142] G. Murray, S. Renals, and J. Carletta. Extractive summarization of meeting recordings. In Proc. Interspeech, September 2005. [ bib | .pdf | Abstract ]
[143] G. Murray, S. Renals, J. Carletta, and J. Moore. Evaluating automatic summaries of meeting recordings. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, Ann Arbor, MI, USA, June 2005. [ bib | .pdf | Abstract ]
[144] H. Christensen, B. Kolluru, Y. Gotoh, and S. Renals. Maximum entropy segmentation of broadcast news. In Proc. IEEE ICASSP, 2005. [ bib | .ps.gz | .pdf | Abstract ]
[145] T. Hain, J. Dines, G. Garau, M. Karafiat, D. Moore, V. Wan, R. Ordelman, and S. Renals. Transcription of conference room meetings: an investigation. In Proc. Interspeech, 2005. [ bib | .pdf | Abstract ]
[146] T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, I. McCowan, D. Moore, V. Wan, R. Ordelman, and S. Renals. The 2005 AMI system for the transcription of speech in meetings. In Proceedings of the Rich Transcription 2005 Spring Meeting Recognition Evaluation, 2005. [ bib | .pdf | Abstract ]
[147] S. J. Wrigley, G. J. Brown, V. Wan, and S. Renals. Speech and crosstalk detection in multi-channel audio. IEEE Trans. on Speech and Audio Processing, 13:84-91, 2005. [ bib | .pdf | Abstract ]
[148] Jerry Goldman, Steve Renals, Steven Bird, Franciska de Jong, Marcello Federico, Carl Fleischhauer, Mark Kornbluh, Lori Lamel, Doug Oard, Clare Stewart, and Richard Wright. Accessing the spoken word. International Journal of Digital Libraries, 5(4):287-298, 2005. [ bib | .ps.gz | .pdf | Abstract ]
[149] Y. Hifny, S. Renals, and N. Lawrence. A hybrid MaxEnt/HMM based ASR system. In Proc. Interspeech, 2005. [ bib | .pdf | Abstract ]
[150] A. Dielmann and S. Renals. Multistream dynamic Bayesian network for meeting segmentation. In S. Bengio and H. Bourlard, editors, Proc. Multimodal Interaction and Related Machine Learning Algorithms Workshop (MLMI-04), pages 76-86. Springer, 2005. [ bib | .ps.gz | .pdf | Abstract ]
[151] V. Wan and S. Renals. Speaker verification using sequence discriminant support vector machines. IEEE Trans. on Speech and Audio Processing, 13:203-210, 2005. [ bib | .ps.gz | .pdf | Abstract ]
[152] Konstantinos Koumpis and Steve Renals. Automatic summarization of voicemail messages using lexical and prosodic features. ACM Transactions on Speech and Language Processing, 2(1):1-24, 2005. [ bib | .ps.gz | .pdf | Abstract ]
[153] Konstantinos Koumpis and Steve Renals. Content-based access to spoken audio. IEEE Signal Processing Magazine, 22(5):61-69, 2005. [ bib | .pdf | Abstract ]
[154] T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, I. McCowan, D. Moore, V. Wan, R. Ordelman, and S. Renals. The development of the AMI system for the transcription of speech in meetings. In 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, 2005. [ bib | .pdf | Abstract ]
[155] H. Christensen, B. Kolluru, Y. Gotoh, and S. Renals. From text summarisation to style-specific summarisation for broadcast news. In Proc. ECIR-2004, 2004. [ bib | .ps.gz | .pdf | Abstract ]
[156] A. Dielmann and S. Renals. Dynamic Bayesian networks for meeting structuring. In Proc. IEEE ICASSP, 2004. [ bib | .ps.gz | .pdf | Abstract ]
[157] A. Dielmann and S. Renals. Multi-stream segmentation of meetings. In Proc. IEEE Workshop on Multimedia Signal Processing, 2004. [ bib | .ps.gz | .pdf | Abstract ]
[158] Y. H. Abdel-Haleem, S. Renals, and N. D. Lawrence. Acoustic space dimensionality selection and combination using the maximum entropy principle. In Proc. IEEE ICASSP, 2004. [ bib | .pdf | Abstract ]
[159] Y. Gotoh and S. Renals. Language modelling. In Renals and Grefenstette [167], pages 78-105. [ bib | Abstract ]
[160] K. Koumpis and S. Renals. Evaluation of extractive voicemail summarization. In Proc. ISCA Workshop on Multilingual Spoken Document Retrieval, pages 19-24, 2003. [ bib | .ps.gz | .pdf | Abstract ]
[161] S. Renals and D. Ellis. Audio information access from meeting rooms. In Proc. IEEE ICASSP, volume 4, pages 744-747, 2003. [ bib | .ps.gz | .pdf | Abstract ]
[162] B. Kolluru, H. Christensen, Y. Gotoh, and S. Renals. Exploring the style-technique interaction in extractive summarization of broadcast news. In Proc. IEEE Automatic Speech Recognition and Understanding Workshop, 2003. [ bib | .ps.gz | .pdf | Abstract ]
[163] K. Koumpis and S. Renals. Multi-class extractive voicemail summarization. In Proc. Eurospeech, pages 2785-2788, 2003. [ bib | .pdf | Abstract ]
[164] V. Wan and S. Renals. SVMSVM: Support vector machine speaker verification methodology. In Proc. IEEE ICASSP, volume 2, pages 221-224, 2003. [ bib | .ps.gz | .pdf | Abstract ]
[165] H. Christensen, Y. Gotoh, B. Kolluru, and S. Renals. Are extractive text summarisation techniques portable to broadcast news? In Proc. IEEE Automatic Speech Recognition and Understanding Workshop, 2003. [ bib | .ps.gz | .pdf | Abstract ]
[166] S. Wrigley, G. Brown, V. Wan, and S. Renals. Feature selection for the classification of crosstalk in multi-channel audio. In Proc. Eurospeech, pages 469-472, 2003. [ bib | .pdf | Abstract ]
[167] S. Renals and G. Grefenstette, editors. Text and Speech Triggered Information Access. Number 2705 in Lecture Notes in Computer Science. Springer-Verlag, 2003. [ bib | http | Abstract ]
[168] A. J. Robinson, G. D. Cook, D. P. W. Ellis, E. Fosler-Lussier, S. J. Renals, and D. A. G. Williams. Connectionist speech recognition of broadcast news. Speech Communication, 37:27-45, 2002. [ bib | .ps.gz | .pdf | Abstract ]
[169] O. Pietquin and S. Renals. ASR system modeling for automatic evaluation and optimization of dialogue systems. In Proc. IEEE ICASSP, pages 46-49, 2002. [ bib | .pdf | Abstract ]
[170] V. Wan and S. Renals. Evaluation of kernel methods for speaker verification and identification. In Proc. IEEE ICASSP, pages 669-672, 2002. [ bib | .pdf | Abstract ]
[171] K. Koumpis, S. Renals, and M. Niranjan. Extractive summarization of voicemail using lexical and prosodic feature subset selection. In Proc. Eurospeech, pages 2377-2380, Aalborg, Denmark, 2001. [ bib | .ps.gz | .pdf | Abstract ]
[172] K. Koumpis, C. Ladas, and S. Renals. An advanced integrated architecture for wireless voicemail retrieval. In Proc. 15th IEEE International Conference on Information Networking, pages 403-410, 2001. [ bib | .ps.gz | Abstract ]
[173] S. Renals and D. Abberley. The THISL SDR system at TREC-9. In Proc. Ninth Text Retrieval Conference (TREC-9), 2001. [ bib | .ps.gz | .pdf | Abstract ]
[174] H. Christensen, Y. Gotoh, and S. Renals. Punctuation annotation using statistical prosody models. In Proc. ISCA Workshop on Prosody in Speech Recognition and Understanding, Red Bank, NJ, USA, 2001. [ bib | .ps.gz | .pdf | Abstract ]
[175] K. Koumpis and S. Renals. The role of prosody in a voicemail summarization system. In Proc. ISCA Workshop on Prosody in Speech Recognition and Understanding, Red Bank, NJ, USA, 2001. [ bib | .ps.gz | .pdf | Abstract ]
[176] Y. Gotoh and S. Renals. Information extraction from broadcast news. Philosophical Transactions of the Royal Society of London, Series A, 358:1295-1310, 2000. [ bib | .ps.gz | .pdf | Abstract ]
[177] S. Renals, D. Abberley, D. Kirby, and T. Robinson. Indexing and retrieval of broadcast news. Speech Communication, 32:5-20, 2000. [ bib | .ps.gz | .pdf | Abstract ]
[178] M. Carreira-Perpiñán and S. Renals. Practical identifiability of finite mixtures of multivariate Bernoulli distributions. Neural Computation, 12:141-152, 2000. [ bib | .ps.gz | .pdf | Abstract ]
[179] K. Koumpis and S. Renals. Transcription and summarization of voicemail speech. In Proc. ICSLP, volume 2, pages 688-691, Beijing, 2000. [ bib | .ps.gz | .pdf | Abstract ]
[180] Y. Gotoh and S. Renals. Variable word rate n-grams. In Proc. IEEE ICASSP, pages 1591-1594, Istanbul, 2000. [ bib | .ps.gz | .pdf | Abstract ]
[181] Y. Gotoh and S. Renals. Sentence boundary detection in broadcast speech transcripts. In ISCA ITRW: ASR2000, pages 228-235, Paris, 2000. [ bib | .ps.gz | .pdf | Abstract ]
[182] D. Abberley, S. Renals, D. Ellis, and T. Robinson. The THISL SDR system at TREC-8. In Proc. Eighth Text Retrieval Conference (TREC-8), 2000. [ bib | .ps.gz | .pdf | Abstract ]
[183] G. Cook, K. Al-Ghoneim, D. Ellis, E. Fosler-Lussier, Y. Gotoh, B. Kingsbury, N. Morgan, S. Renals, T. Robinson, and G. Williams. The SPRACH system for the transcription of broadcast news. In Proc. DARPA Broadcast News Workshop, pages 161-166, 1999. [ bib | .html | .ps.gz | .pdf | Abstract ]
[184] T. Robinson, D. Abberley, D. Kirby, and S. Renals. Recognition, indexing and retrieval of British broadcast news with the THISL system. In Proc. Eurospeech, pages 1067-1070, Budapest, 1999. [ bib | .ps.gz | .pdf | Abstract ]
[185] Y. Gotoh and S. Renals. Statistical annotation of named entities in spoken audio. In Proc. ESCA Workshop on Accessing Information In Spoken Audio, pages 43-48, Cambridge, 1999. [ bib | .ps.gz | .pdf | Abstract ]
[186] M. Carreira-Perpiñán and S. Renals. A latent-variable modelling approach to the acoustic-to-articulatory mapping problem. In Proc. 14th Int. Congress of Phonetic Sciences, pages 2013-2016, San Francisco, 1999. [ bib | .ps.gz | .pdf | Abstract ]
[187] S. Renals and M. Hochberg. Start-synchronous search for large vocabulary continuous speech recognition. IEEE Trans. on Speech and Audio Processing, 7:542-553, 1999. [ bib | .ps.gz | .pdf | Abstract ]
[188] Y. Gotoh, S. Renals, and G. Williams. Named entity tagged language models. In Proc. IEEE ICASSP, pages 513-516, Phoenix AZ, 1999. [ bib | .ps.gz | .pdf | Abstract ]
[189] S. Renals and Y. Gotoh. Integrated transcription and identification of named entities in broadcast speech. In Proc. Eurospeech, pages 1039-1042, Budapest, 1999. [ bib | .ps.gz | .pdf | Abstract ]
[190] G. Williams and S. Renals. Confidence measures from local posterior probability estimates. Computer Speech and Language, 13:395-411, 1999. [ bib | .ps.gz | .pdf | Abstract ]
[191] Y. Gotoh and S. Renals. Topic-based mixture language modelling. Journal of Natural Language Engineering, 5:355-375, 1999. [ bib | .ps.gz | .pdf | Abstract ]
[192] S. Renals, D. Abberley, D. Kirby, and T. Robinson. The THISL system for indexing and retrieval of broadcast news. In Proc. IEEE Workshop on Multimedia Signal Processing, pages 77-82, Copenhagen, 1999. [ bib | http | .ps.gz | .pdf | Abstract ]
[193] D. Abberley, D. Kirby, S. Renals, and T. Robinson. The THISL broadcast news retrieval system. In Proc. ESCA Workshop on Accessing Information In Spoken Audio, pages 19-24, Cambridge, 1999. [ bib | http | .ps.gz | .pdf | Abstract ]
[194] S. Renals, Y. Gotoh, R. Gaizauskas, and M. Stevenson. The SPRACH/LaSIE system for named entity identification in broadcast news. In Proc. DARPA Broadcast News Workshop, pages 47-50, 1999. [ bib | .html | .ps.gz | .pdf | Abstract ]
[195] D. Abberley, S. Renals, G. Cook, and T. Robinson. Retrieval of broadcast news documents with the THISL system. In Proc. Seventh Text Retrieval Conference (TREC-7), pages 181-190, 1999. [ bib | .ps.gz | .pdf | Abstract ]
[196] D. Abberley, S. Renals, and G. Cook. Retrieval of broadcast news documents with the THISL system. In Proc. IEEE ICASSP, pages 3781-3784, Seattle, 1998. [ bib | .ps.gz | .pdf | Abstract ]
[197] S. Renals and D. Abberley. The THISL spoken document retrieval system. In Proc. 14th Twente Workshop on Language Technology, pages 129-140, 1998. [ bib | .ps.gz | .pdf | Abstract ]
[198] M. Carreira-Perpiñán and S. Renals. Experimental evaluation of latent variable models for dimensionality reduction. In IEEE Proc. Neural Networks for Signal Processing, volume 8, pages 165-173, Cambridge, 1998. [ bib | .ps.gz | .pdf | Abstract ]
[199] J. Barker, G. Williams, and S. Renals. Acoustic confidence measures for segmenting broadcast news. In Proc. ICSLP, pages 2719-2722, Sydney, 1998. [ bib | .ps.gz | .pdf | Abstract ]
[200] D. Abberley, S. Renals, G. Cook, and T. Robinson. The 1997 THISL spoken document retrieval system. In Proc. Sixth Text Retrieval Conference (TREC-6), pages 747-752, 1998. [ bib | .ps.gz | .pdf | Abstract ]
[201] G. Williams and S. Renals. Confidence measures derived from an acceptor HMM. In Proc. ICSLP, pages 831-834, Sydney, 1998. [ bib | .ps.gz | .pdf | Abstract ]
[202] M. Carreira-Perpiñán and S. Renals. Dimensionality reduction of electropalatographic data using latent variable models. Speech Communication, 26:259-282, 1998. [ bib | .ps.gz | .pdf | Abstract ]
[203] G. Williams and S. Renals. Confidence measures for evaluating pronunciation models. In ESCA Workshop on Modeling pronunciation variation for automatic speech recognition, pages 151-155, Kerkrade, Netherlands, 1998. [ bib | .ps.gz | .pdf | Abstract ]
[204] J. Hennebert, C. Ris, H. Bourlard, S. Renals, and N. Morgan. Estimation of global posteriors and forward-backward training of hybrid HMM/ANN systems. In Proc. Eurospeech, pages 1951-1954, Rhodes, 1997. [ bib | .ps.gz | .pdf | Abstract ]
[205] G. Williams and S. Renals. Confidence measures for hybrid HMM/ANN speech recognition. In Proc. Eurospeech, pages 1955-1958, Rhodes, 1997. [ bib | .ps.gz | .pdf | Abstract ]
[206] Y. Gotoh and S. Renals. Document space models using latent semantic analysis. In Proc. Eurospeech, pages 1443-1446, Rhodes, 1997. [ bib | .ps.gz | .pdf | Abstract ]
[207] B. L. Karlsen, G. J. Brown, M. Cooke, P. Green, and S. Renals. Analysis of a simultaneous speaker sound corpus. In D. F. Rosenthal and H. G. Okuno, editors, Computational Auditory Scene Analysis, pages 321-334. Lawrence Erlbaum Associates, 1997. [ bib ]
[208] S. Renals. Phone deactivation pruning in large vocabulary continuous speech recognition. IEEE Signal Processing Letters, 3:4-6, 1996. [ bib | .ps.gz | Abstract ]
[209] D. Kershaw, T. Robinson, and S. Renals. The 1995 Abbot LVCSR system for multiple unknown microphones. In Proc. ICSLP, pages 1325-1328, Philadelphia PA, 1996. [ bib ]
[210] T. Robinson, M. Hochberg, and S. Renals. The use of recurrent networks in continuous speech recognition. In C.-H. Lee, K. K. Paliwal, and F. K. Soong, editors, Automatic Speech and Speaker Recognition - Advanced Topics, pages 233-258. Kluwer Academic Publishers, 1996. [ bib | .ps.gz | Abstract ]
[211] S. Renals and M. Hochberg. Efficient evaluation of the LVCSR search space using the NOWAY decoder. In Proc. IEEE ICASSP, pages 149-152, Atlanta, 1996. [ bib | .ps.gz | Abstract ]
[212] D. Kershaw, T. Robinson, and S. Renals. The 1995 Abbot hybrid connectionist-HMM large vocabulary recognition system. In Proc. ARPA Spoken Language Technology Conference, pages 93-99, 1996. [ bib ]
[213] M. Hochberg, G. Cook, S. Renals, T. Robinson, and R. Schechtman. The 1994 Abbot hybrid connectionist-HMM large vocabulary recognition system. In Proc. ARPA Spoken Language Technology Workshop, pages 170-175, 1995. [ bib | .ps.gz ]
[214] T. Robinson, J. Fransen, D. Pye, J. Foote, and S. Renals. WSJCAM0: A British English speech corpus for large vocabulary continuous speech recognition. In Proc. IEEE ICASSP, pages 81-84, Detroit, 1995. [ bib ]
[215] S. Renals and M. Hochberg. Efficient search using posterior phone probability estimates. In Proc. IEEE ICASSP, pages 596-599, Detroit, 1995. [ bib | .ps.gz | Abstract ]
[216] J. Neto, L. Almeida, M. Hochberg, C. Martins, L. Nunes, S. Renals, and T. Robinson. Speaker adaptation for hybrid HMM-ANN continuous speech recognition system. In Proc. Eurospeech, pages 2171-2174, Madrid, 1995. [ bib | .ps.gz | Abstract ]
[217] M. Hochberg, S. Renals, T. Robinson, and G. Cook. Recent improvements to the Abbot large vocabulary CSR system. In Proc. IEEE ICASSP, pages 69-72, Detroit, 1995. [ bib | .ps.gz | Abstract ]
[218] M. Hochberg, S. Renals, and T. Robinson. Abbot: The CUED hybrid connectionist/HMM large vocabulary recognition system. In Proc. ARPA Spoken Language Technology Workshop, pages 102-105, 1994. [ bib ]
[219] N. Morgan, H. Bourlard, S. Renals, M. Cohen, and H. Franco. Hybrid neural network/hidden Markov model systems for continuous speech recognition. In I. Guyon and P. S. P. Wang, editors, Advances in Pattern Recognition Systems using Neural Networks Technologies, volume 7 of Series in Machine Perception and Artificial Intelligence. World Scientific Publications, 1994. [ bib ]
[220] M. Hochberg, S. Renals, T. Robinson, and D. Kershaw. Large vocabulary continuous speech recognition using a hybrid connectionist/HMM system. In Proc. ICSLP, pages 1499-1502, Yokohama, 1994. [ bib ]
[221] S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco. Connectionist probability estimators in HMM speech recognition. IEEE Trans. on Speech and Audio Processing, 2:161-175, 1994. [ bib | .ps.gz | Abstract ]
[222] S. Renals, M. Hochberg, and T. Robinson. Learning temporal dependencies in connectionist speech recognition. In J. D. Cowan, G. Tesauro, and J. Alspector, editors, Advances in Neural Information Processing Systems, volume 6, pages 1051-1058. Morgan Kaufmann, 1994. [ bib | .ps.gz | .pdf ]
[223] S. Renals and M. Hochberg. Using Gamma filters to model temporal dependencies in speech. In Proc. ICSLP, pages 1491-1494, Yokohama, 1994. [ bib | .ps.gz ]
[224] T. Robinson, M. Hochberg, and S. Renals. IPA: Improved phone modelling with recurrent neural networks. In Proc. IEEE ICASSP, pages 37-40, Adelaide, 1994. [ bib ]
[225] M. Hochberg, G. Cook, S. Renals, and T. Robinson. Connectionist model combination for large vocabulary speech recognition. In IEEE Proc. Neural Networks for Signal Processing, volume 4, pages 269-278, 1994. [ bib | .ps.gz ]
[226] N. Morgan, H. Bourlard, S. Renals, M. Cohen, and H. Franco. Hybrid neural network/hidden Markov model systems for continuous speech recognition. Intl. J. Pattern Recog. and Artific. Intell., 7:899-916, 1993. [ bib ]
[227] A. J. Robinson, L. Almeida, J.-M. Boite, H. Bourlard, F. Fallside, M. Hochberg, D. Kershaw, P. Kohn, Y. Konig, N. Morgan, J. P. Neto, S. Renals, M. Saerens, and C. Wooters. A neural network based, speaker independent, large vocabulary, continuous speech recognition system: the Wernicke project. In Proc. Eurospeech, pages 1941-1944, Berlin, 1993. [ bib ]
[228] S. Renals and D. MacKay. Bayesian regularisation methods in a hybrid MLP-HMM system. In Proc. Eurospeech, pages 1719-1722, Berlin, 1993. [ bib | .ps.gz ]
[229] H. Bourlard, N. Morgan, and S. Renals. Neural nets and hidden Markov models: Review and generalizations. Speech Communication, 11:237-246, 1992. [ bib ]
[230] S. Renals, N. Morgan, M. Cohen, and H. Franco. Connectionist probability estimation in the Decipher speech recognition system. In Proc. IEEE ICASSP, pages 601-604, San Francisco, 1992. [ bib | .ps.gz ]
[231] S. Renals, H. Bourlard, N. Morgan, H. Franco, and M. Cohen. Connectionist optimisation of tied mixture hidden Markov models. In J. E. Moody, S. J. Hanson, and R. P. Lippmann, editors, Advances in Neural Information Processing Systems, volume 4, pages 167-174. Morgan-Kaufmann, 1992. [ bib ]
[232] S. Renals, N. Morgan, M. Cohen, H. Franco, and H. Bourlard. Improving statistical speech recognition. In Proc. IJCNN, volume 2, pages 301-307, Baltimore MD, 1992. [ bib | .ps.gz ]
[233] H. Bourlard, N. Morgan, C. Wooters, and S. Renals. CDNN: A context-dependent neural network for continuous speech recognition. In Proc. IEEE ICASSP, pages 349-352, San Francisco, 1992. [ bib ]
[234] S. Renals, D. McKelvie, and F. McInnes. A comparative study of continuous speech recognition using neural networks and hidden Markov models. In Proc. IEEE ICASSP, pages 369-372, Toronto, 1991. [ bib ]
[235] S. Renals, N. Morgan, and H. Bourlard. Probability estimation by feed-forward networks in continuous speech recognition. In IEEE Proc. Neural Networks for Signal Processing, pages 309-318, Princeton NJ, 1991. [ bib | .ps.gz ]
[236] S. Renals. Chaos in neural networks. In L. B. Almeida and C. J. Wellekens, editors, Neural Networks, number 412 in Lecture Notes in Computer Science, pages 90-99. Springer-Verlag, 1990. [ bib ]
[237] S. Renals and R. Rohwer. A study of network dynamics. J. Stat. Phys., 58:825-847, 1990. [ bib ]
[238] S. Renals and R. Rohwer. Phoneme classification experiments using radial basis functions. In Proc. IJCNN, pages 461-468, Washington DC, 1989. [ bib ]
[239] S. Renals and R. Rohwer. Neural networks for speech pattern classification. In IEE Conference Publication 313, 1st IEE Conference on Artificial Neural Networks, pages 292-296, London, 1989. [ bib ]
[240] S. Renals and R. Rohwer. Learning phoneme recognition using neural networks. In Proc. IEEE ICASSP, pages 413-416, Glasgow, 1989. [ bib ]
[241] S. Renals and J. Dalby. Analysis of a neural network model for speech recognition. In Proc. Eurospeech, volume 1, pages 333-336, Paris, 1989. [ bib ]
[242] R. Rohwer and S. Renals. Training recurrent networks. In L. Personnaz and G. Dreyfus, editors, Neural networks from models to applications (Proc. nEuro '88), pages 207-216, Paris, 1988. I.D.S.E.T. [ bib ]
[243] M. Terry, S. Renals, R. Rohwer, and J. Harrington. A connectionist approach to speech recognition using peripheral auditory modelling. In Proc. IEEE ICASSP, pages 699-702, New York, 1988. [ bib ]
[244] S. Renals, R. Rohwer, and M. Terry. A comparison of speech recognition front ends using a connectionist classifier. In Proc. FASE Speech '88, pages 1381-1388, Edinburgh, 1988. [ bib ]
[245] R. Rohwer, S. Renals, and M. Terry. Unstable connectionist networks in speech recognition. In Proc. IEEE ICASSP, pages 426-428, New York, 1988. [ bib ]
[246] S. Renals. Radial basis functions network for speech pattern classification. Electronics Letters, 25:437-439, 1988. [ bib ]