The Centre for Speech Technology Research, The university of Edinburgh

Publications by Hiroshi Shimodaira

[1] Atef Ben Youssef, Hiroshi Shimodaira, and David Braude. Speech driven talking head from estimated articulatory features. In Proc. ICASSP, pages 4606-4610, Florence, Italy, May 2014. [ bib | .pdf | Abstract ]
[2] Atef Ben Youssef, Hiroshi Shimodaira, and David A. Braude. Head motion analysis and synthesis over different tasks. In Proc. Intelligent Virtual Agents, pages 285-294. Springer, September 2013. [ bib | .pdf | Abstract ]
[3] Atef Ben Youssef, Hiroshi Shimodaira, and David A. Braude. Articulatory features for speech-driven head motion synthesis. In Proc. Interspeech, pages 2758-2762, Lyon, France, August 2013. [ bib | .pdf | Abstract ]
[4] David A. Braude, Hiroshi Shimodaira, and Atef Ben Youssef. Template-warping based speech driven head motion synthesis. In Proc. Interspeech, pages 2763-2767, Lyon, France, August 2013. [ bib | .pdf | Abstract ]
[5] David Adam Braude, Hiroshi Shimodaira, and Atef Ben Youssef. Template-warping based speech driven head motion synthesis. In Interspeech, pages 2763 - 2767, 2013. [ bib | .pdf | Abstract ]
[6] David A. Braude, Hiroshi Shimodaira, and Atef Ben Youssef. The University of Edinburgh head-motion and audio storytelling (UoE-HaS) dataset. In Proc. Intelligent Virtual Agents, pages 466-467. Springer, 2013. [ bib | .pdf | Abstract ]
[7] Michael A. Berger, Gregor Hofer, and Hiroshi Shimodaira. Carnival - combining speech technology and computer animation. IEEE Computer Graphics and Applications, 31:80-89, 2011. [ bib | DOI ]
[8] Michael Berger, Gregor Hofer, and Hiroshi Shimodaira. Carnival: a modular framework for automated facial animation. Poster at SIGGRAPH 2010, 2010. Bronze award winner, ACM Student Research Competition. [ bib | .pdf ]
[9] Michal Dziemianko, Gregor Hofer, and Hiroshi Shimodaira. HMM-based automatic eye-blink synthesis from speech. In Proc. Interspeech, pages 1799-1802, Brighton, UK, September 2009. [ bib | .pdf | Abstract ]
[10] Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Evaluation of a hierarchical reinforcement learning spoken dialogue system. Computer Speech and Language, 24(2):395-429, 2009. [ bib | DOI | .pdf | Abstract ]
[11] Gregor Hofer, Junichi Yamagishi, and Hiroshi Shimodaira. Speech-driven lip motion generation with a trajectory HMM. In Proc. Interspeech 2008, pages 2314-2317, Brisbane, Australia, September 2008. [ bib | .pdf | Abstract ]
[12] Gregor Hofer and Hiroshi Shimodaira. Automatic head motion prediction from speech data. In Proc. Interspeech 2007, Antwerp, Belgium, August 2007. [ bib | .pdf | Abstract ]
[13] Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Hierarchical dialogue optimization using semi-markov decision processes. In Proc. Interspeech, August 2007. [ bib | .pdf | Abstract ]
[14] Gregor Hofer, Hiroshi Shimodaira, and Junichi Yamagishi. Speech-driven head motion synthesis based on a trajectory model. Poster at Siggraph 2007, 2007. [ bib | .pdf ]
[15] Gregor Hofer, Hiroshi Shimodaira, and Junichi Yamagishi. Lip motion synthesis using a context dependent trajectory hidden Markov model. Poster at SCA 2007, 2007. [ bib | .pdf ]
[16] Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Reinforcement learning of dialogue strategies with hierarchical abstract machines. In Proc. IEEE/ACL Workshop on Spoken Language Technology (SLT), December 2006. [ bib | .pdf | Abstract ]
[17] Chie Shimodaira, Hiroshi Shimodaira, and Susumu Kunifuji. A Divergent-Style Learning Support Tool for English Learners Using a Thesaurus Diagram. In Proc. KES2006, Bournemouth, United Kingdom, October 2006. [ bib | .pdf | Abstract ]
[18] Junko Tokuno, Mitsuru Nakai, Hiroshi Shimodaira, Shigeki Sagayama, and Masaki Nakagawa. On-line Handwritten Character Recognition Selectively employing Hierarchical Spatial Relationships among Subpatterns. In Proc. IWFHR-10, La Baule, France, October 2006. [ bib | Abstract ]
[19] Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces. In Proc. Interspeech, September 2006. [ bib | .pdf | Abstract ]
[20] Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. Human-computer dialogue simulation using hidden markov models. In Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), November 2005. [ bib | .pdf | Abstract ]
[21] Mitsuru Nakai, Shigeki Sagayama, and Hiroshi Shimodaira. On-line Handwriting Recognition Based on Sub-stroke HMM. Trans. IEICE D-II, J88-D2(8), August 2005. (in press) (in Japanese). [ bib | Abstract ]
[22] Junko Tokuno, Nobuhito Inami, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Context-dependent Sub-stroke Model for HMM-based On-line Handwriting Recognition. Trans. IEICE D-II, J88-D2(8), August 2005. (in press), (in Japanese). [ bib | Abstract ]
[23] Hiroshi Shimodaira, Keisuke Uematsu, Shin'ichi Kawamoto, Gregor Hofer, and Mitsuru Nakai. Analysis and Synthesis of Head Motion for Lifelike Conversational Agents. In Proc. MLMI2005, July 2005. [ bib | .pdf ]
[24] Shin-ichi Kawamoto, Hiroshi Shimodaira, Shigeki Sagayama, et al. Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents. Life-Like Characters. Tools, Affective Functions, and Applications. Helmut Prendinger et al. (Eds.) Springer, pages 187-212, November 2003. [ bib | .pdf | Abstract ]
[25] Hiroshi Shimodaira, Takashi Sudo, Mitsuru Nakai, and Shigeki Sagayama. On-line Overlaid-Handwriting Recognition Based on Substroke HMMs. In ICDAR'03, pages 1043-1047, August 2003. [ bib | .pdf | Abstract ]
[26] Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Generation of Hierarchical Dictionary for Stroke-order Free Kanji Handwriting Recognition Based on Substroke HMM. In Proc. ICDAR2003, pages 514-518, August 2003. [ bib | .pdf | Abstract ]
[27] Shigeki Matsuda, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Speech Recognition Using Asynchronous Transition HMM. IEICE Trans. D-II, J86-D-II(6):741-754, June 2003. (in Japanese). [ bib | Abstract ]
[28] Kanad Keeni, Kunio Goto, and Hiroshi Shimodaira. Automatic Filtering of Network IntrusionDetection System Alarms Using Multi-layer Feed-forward Neural Networks. In International Conference on Neural Information Processing (ICONIP2003), June 2003. [ bib ]
[29] Tokuno Junko, Naoto Akira, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Blind-handwriting Interface for Wearable Computing. In Proc. Human - Computer Interaction (HCI) International 2003, Volume 2, pages 303-307, June 2003. [ bib | Abstract ]
[30] Kanad Keeni, Kunio Goto, and Hiroshi Shimodaira. On fast learning of Multi-layer Feed-forward Neural Networks Using Back Propagation. In International Conference on Enterprise and Information Systems (ICEIS2003), pages 266-271, April 2003. [ bib | Abstract ]
[31] Tu Bao Ho, Trong Dung Nguyen, Hiroshi Shimodaira, and Masayuki Kimura. A Knowledge Discovery System with Support for Model Selection and Visualization. Applied Intelligence, 19:125-141, 2003. [ bib ]
[32] Haruto Takeda, Naoki Saito, Tomoshi Otsuki, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Hidden Markov Model for AUtomatic Transcription of MIDI Signals. In 2002 International Workshop on Multimedia Signal Processing, December 2002. [ bib | .pdf ]
[33] Kanad Keeni and Hiroshi Shimodaira. On Selection of Training Data for Fast Learning of Neural Networks Using Back Propagation. In IASTED International Conference on Artificial Intelligence and Application (AIA2002), pages 474-478, September 2002. [ bib ]
[34] Junko Tokuno, Nobuhito Inami, Shigeki Matsuda, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Context-Dependent Substroke Model for HMM-based On-line Handwriting Recognition. In Proc. IWFHR-8, pages 78-83, August 2002. [ bib | .pdf | Abstract ]
[35] Mitsuru Nakai, Takashi Sudo, Hiroshi Shimodaira, and Shigeki Sagayama. Pen Pressure Features for Writer-Independent On-Line Handwriting Recognition Based on Substroke HMM. In Proc. ICPR2002, III, pages 220-223, August 2002. [ bib | .pdf ]
[36] Shin-ichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito Utsuro, and Shigeki Sagayama. Open-source software for developing anthropomorphic spoken dialog agent. In Proc. PRICAI-02, International Workshop on Lifelike Animated Agents, pages 64-69, August 2002. [ bib | .pdf ]
[37] Shin-ichi Kawamoto, Hiroshi Shimodaira, et al. Design of Software Toolkit for Anthromorphic Spoken Dialog Agent Software with Customization-oriented Features. Information Processing Society of Japan (IPSJ) Journal, 43(7):2249-2263, July 2002. (in Japanese). [ bib ]
[38] Jun Rokui, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Speaker Normalization Using Linear Transformation of Vocal Tract Length Based on Maximum Likelihood Estimation. Information Processing Society of Japan (IPSJ), 43(7):2030-2037, July 2002. (in Japanese). [ bib | Abstract ]
[39] Hiroshi Shimodaira, Nobuyoshi Sakai, Mitsuru Nakai, and Shigeki Sagayama. Jacobian Joint Adaptation to Noise, Channel and Vocal Tract Length. In Proc. ICASSP2002, pages 197-200, May 2002. [ bib | .pdf | Abstract ]
[40] Yoshinori Matsushita, Shinnichi Kawamoto, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. A Head-Behavior Synchronization Model with Utterance for Anthropomorphic Spoken-Dialog Agent. In Technical Report of IEICE, HIS2001, March 2002. (in Japanese). [ bib | Abstract ]
[41] Tomoshi Otsuki, Naoki Saitou, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Musical Rhythm Recognition Using Hidden Markov Model. Information Processing Society of Japan (IPSJ) JOURNAL, 43(2), February 2002. (in Japanese). [ bib ]
[42] Hiroshi Shimodaira, Ken-ichi Noma, Mitsuru Nakai, and Shigeki Sagayama. Dynamic Time-Alignment Kernel in Support Vector Machine. Advances in Neural Information Processing Systems 14, NIPS2001, 2:921-928, December 2001. [ bib | .pdf | Abstract ]
[43] Mitsuru Nakai, Naoto Akira, Hiroshi Shimodaira, and Shigeki Sagayama. Substroke Approach to HMM-based On-line Kanji Handwriting Recognition. In Proc. ICDAR'01, pages 491-495, September 2001. [ bib | .pdf | Abstract ]
[44] Shigeki Sagayama, Yutaka Kato, Mitsuru Nakai, and Hiroshi Shimodaira. Jacobian Approach to Joint Adaptation to Noise, Channel and Vocal Tract Length. In Proc. ISCA Workshop on Adaptation Methods (Sophia Antipolis, France), pages 117-120, August 2001. [ bib ]
[45] Shigeki Sagayama, Koichi Shinoda, Mitsuru Nakai, and Hiroshi Shimodaira. Analytic Methods for Acoustic Model Adaptation: A Review. In Proc. ISCA Workshop on Adaptation Methods (Sophia Antipolis France), pages 67-76, August 2001. Invited Paper. [ bib ]
[46] Kanad Keeni, Kunio Goto, and Hiroshi Shimodaira. On Extraction of E-Mail Address from Fax Message for Automatic Delivery to Individual Recipient. In IASTED International Conference on Siganl Processing Pattern Recognition and Application, July 2001. [ bib ]
[47] Katsuhisa Fujinaga, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Multiple-Regression Hidden Markov Model. In Proc. ICASSP 2001, May 2001. [ bib | .pdf ]
[48] Shigeki Matsuda, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Feature-dependent Allophone Clustering. In Proc. ICSLP2000, pages 413-416, October 2000. [ bib | .pdf | Abstract ]
[49] Hiroshi Shimodaira, Toshihiko Akae, Mitsuru Nakai, and Shigeki Sagayama. Jacobian Adaptation of HMM with Initial Model Selection for Noisy Speech Recognition. In Proc. ICSLP2000, pages 1003-1006, October 2000. [ bib | .pdf | Abstract ]
[50] Shigeki Matsuda, Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Asynchronous-Transition HMM. In Proc. ICASSP 2000 (Istanbul, Turkey), Vol. II, pages 1001-1004, June 2000. [ bib | .pdf | Abstract ]
[51] Jun Rokui and Hiroshi Shimodaira. Multistage Building Learning based on Misclassification Measure. In 9-th International Conference on Artificial Neural Networks, Edinburgh, UK, September 1999. [ bib ]
[52] Kanad Keeni, Kenji Nakayama, and Hiroshi Shimodaira. A Training Scheme for Pattern Classification Using Multi-layer Feed-forward Neural Networks. In IEEE International Conference on Computational Intelligence and Multimedia Applications, pages 307-311, September 1999. [ bib ]
[53] Kanad Keeni, Kenji Nakayama, and Hiroshi Shimodaira. Estimation of Initial Weights and Hidden Units for Fast Learning of Multi-layer Neural Networks for Pattern Classification. In IEEE International Joint Conference on Neural Networks (IJCNN'99), July 1999. [ bib ]
[54] Hiroshi Shimodaira, Jun Rokui, and Mitsuru Nakai. Improving The Generalization Performance Of The MCE/GPD Learning. In ICSLP'98, Australia, December 1998. [ bib | .pdf | Abstract ]
[55] Mitsuru Nakai and Hiroshi Shimodaira. The Use of F0 Reliability Function for Prosodic Command Analysis on F0 Contour Generation Model. In Proc. ICSLP'98, December 1998. [ bib | .pdf ]
[56] Kanad Keeni, Kenji Nakayama, and Hiroshi Shimodaira. Automatic Generation of Initial Weights and Target Outputs of Multi-layer Neural Networks and its Application to Pattern Classification. In International Conference on Neural Information Processing (ICONIP'98), pages 1622-1625, October 1998. [ bib ]
[57] Jun Rokui and Hiroshi Shimodaira. Modified Minimum Classification Error Learning and Its Application to Neural Networks. In ICONIP'98, Kitakyushu, Japan, October 1998. [ bib ]
[58] Eiji Iida, Hiroshi Shimodaira, Susumu Kunifuji, and Masayuki Kimura. A system to Perform Human Problem Solving. In The 5th International Conference on Soft Computing and Information / Intelligent Systems (IIZUKA'98), October 1998. [ bib ]
[59] Kanad Keeni, Kenji Nakayama, and Hiroshi Shimodaira. Automatic Generation of Initial Weights and Estimation of Hidden Units for Pattern Classification Using Neural Networks. In 14th International Conference on Pattern Recognition (ICPR'98), pages 1568-1571, August 1998. [ bib ]
[60] Eiji Iida, Susumu Kunifuji, Hiroshi Shimodaira, and Masayuki Kimura. A Scale-Down Solution of N^2-1 Puzzle. Trans. IEICE(D-I), J81-D-I(6):604-614, June 1998. (in Japanese). [ bib ]
[61] Kanad Keeni, Hiroshi Shimodaira, Kenji Nakayama, and Kazunori Kotani. On Parameter Initialization of Multi-layer Feed-forward Neural Networks for Pattern Recognition. In International Conference on Computational Linguistics, Speech and Document Processing (ICCLSDP-'98), Calcutta, India, pages D8-12, February 1998. [ bib ]
[62] Hiroshi Shimodaira, Jun Rokui, and Mitsuru Nakai. Modified Minimum Classification Error Learning and Its Application to Neural Networks. In 2nd International Workshop on Statistical Techniques in Pattern Recognition (SPR'98), Sydney, Australia, 1998. [ bib | .pdf | Abstract ]
[63] Mitsuru Nakai, Harald Singer, Yoshimori Sagisaka, and Hiroshi Shimodaira. Accent Phrase Segmentation Based on F0 Templates Using a Superpositional Prosodic Model. Trans. IEICE (D-II), J80-D-II(10):2605-2614, October 1997. (in Japanese). [ bib ]
[64] Hiroshi Shimodaira, Mitsuru Nakai, and Akihiro Kumata. Restration of Pitch Pattern of Speech Based on a Pitch Gereration Model. In Proc. EuroSpeech'97, pages 512-524, September 1997. [ bib | .pdf | Abstract ]
[65] Mitsuru Nakai and Hiroshi Shimodaira. On Representation of Fundamental Frequency of Speech for Prosody Analysis Using Reliability Function. In Proc. EuroSpeech'97, pages 243-246, September 1997. [ bib | .pdf ]
[66] Kanad Keeni, Hiroshi Shimodaira, and Kenji Nakayama. On Distributed Representation of Output Layer for Recognizing Japanese Kana Characters Using Neural Networks. In Proceedings of the 4'th International Conference on Document Analysis and Recognition, ICDAR'97, pages 600-603, July 1997. Ulm, Germany. [ bib ]
[67] Tu Bao Ho, Nguyen Trong Dung, Hiroshi Shimodaira, and Masayuki Kimura. An Interactive-Graphic Environment for Discovering and Using Conceptual Knowledge. In 7th European-Japanese Conference on Information Modelling and Knowledge Bases, pages 327-343, May 1997. [ bib ]
[68] Kanad Keeni and Hiroshi Shimodaira. On Representation of Output Layer for Recognizing Japanese Kana Characters Using Neural Networks. In Proc. the `17'th International Conference on Computer Processing of Oriental Languages, pages 305-308, April 1997. Baptist University, Kowloon Tong, Hong Kong. [ bib ]
[69] Mitsuru Nakai, Harald Singer, Yoshinori Sagisaka, and Hiroshi Shimodaira. Accent Phrase Segmentation by F0 Clustering Using Superpositional Modeling, pages 343-360. January 1997. [ bib ]
[70] Sukeyasu Kanno and Hiroshi Shimodaira. Voiced Sound Detection under Nonstationary and Heavy Noisy Environment Using the Prediction Error of Low-Frequency Spectrum. Trans. IEICE(D-II), J80-D-II(1):26-35, January 1997. (in Japanese). [ bib ]
[71] Kanad Keeni, Hiroshi Shimodaira, Tetsuro Nishino, and Yasuo Tan. Recognition of Devanagari Characters Using Neural Networks. IEICE, E79-D(5):523-528, May 1996. [ bib ]
[72] Paul A. Taylor, Hiroshi Shimodaira, Stephen Isard, Simon King, and Jacqueline Kowtko. Using prosodic information to constrain language models for spoken dialogue. In Proc. ICSLP `96, Philadelphia, 1996. [ bib | .ps | .pdf | Abstract ]
[73] Hisao Koba, hiroshi Shimodaira, and Masayuki Kimura. Intelligent Automatic Document Transcription System for Braille: To Improve Accessibility to Printed Matter for the Visually Impaired. In HIC International'95, July 1995. [ bib ]
[74] and Hiroshi Shimodaira. HI Design Based on the Costs of Human Information-processing Model. In HIC international'95, July 1995. [ bib ]
[75] Mitsuru Nakai, Singer Harald, Yoshinori Sagisaka, and Hiroshi Shimodaira. Automatic Prosodic Segmentation by F0 Clustering Using Superpositional Modeling. In Proc. ICASSP-95, PR08.6, pages 624-627, May 1995. [ bib | .pdf ]
[76] Mitsuru Nakai and Hiroshi Shimodaira. Accent Phrase Segmentation by Finding N-best Sequences of Pitch Pattern Templates. In Proc. ICSLP94, 8.10, pages 347-350, September 1994. [ bib | .pdf ]
[77] Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Prosodic Phrase Segmentation Based on Pitch-Pattern Clustering. Electronics and Communications in Japan, Part 3, 77(6):80-91, June 1994. (in Japanese). [ bib ]
[78] Hiroshi Shimodaira and Mitsuru Nakai. Prosodic phrase segmentation by pitch pattern clustering. In Proc. ICASSP-94, 76.5, vol.II, pages 185-188, March 1994. [ bib | .pdf | Abstract ]
[79] Mitsuru Nakai, Hiroshi Shimodaira, and Shigeki Sagayama. Prosodic phrase segmentation based on pitch-pattern clustering. Trans. IEICE (A), J77-A(2):206-214, February 1994. (in Japanese). [ bib ]
[80] Hiroshi Shimodaira and Mitsuru Nakai. Accent phrase segmentation using transition probabilities between pitch pattern templates. In Proc. EuroSpeech'93, pages 1767-1770, September 1993. [ bib | .ps.gz | Abstract ]
[81] Hiroshi Shimodaira and Mitsuru Nakai. Robust pitch detection by narrow band spectrum analysis. In Proc. ICSLP-92, pages 1597-1600, October 1992. [ bib | .pdf | Abstract ]