The Centre for Speech Technology Research, The university of Edinburgh

Submissions and listening test results from previous Blizzard Challenges

These distributions include the synthetic speech submitted by participants in the challenge along with listeners' scores for the subset synthetic speech used in the listening test. Possible uses of these materials include:

Terms of use

Generally, these data (including the wav files submitted by participants and the corresponding listening test scores) may be used for: and some parts of the data also allow

The following restrictions apply to all data:

However, varying restrictions are associated with certain parts of the data (e.g., wav files from certain systems may be used for commercial research and development, whilst others may only be used for academic research). Where applicable, this is specified within the distributions. It is your responsibility to check the permissions described within each distribution and to ensure that you only use the data in a way that is consistent with those permissions. File sizes and checksums are given in parentheses. If you obtain any results based on this data, please:
  1. Let us know (email Simon King)
  2. Put an acknowledgement in all publications to "The organisers of the Blizzard Challenge"
  3. Cite an appropriate reference, such as:
    • "Measuring a decade of progress in Text-to-Speech", Simon King. Loquens, Vol 1, No 1 (2014). doi:10.3989/loquens.2014.006
    • "The Blizzard Challenge 2013", Simon King and Vasilis Karaiskos, in Proc. Blizzard Challenge workshop 2013.
    • "The Blizzard Challenge 2012", Simon King and Vasilis Karaiskos, in Proc. Blizzard Challenge workshop 2012.
    • "The Blizzard Challenge 2011", Simon King and Vasilis Karaiskos, in Proc. Blizzard Challenge workshop 2011.
    • "The Blizzard Challenge 2010", Simon King and Vasilis Karaiskos, in Proc. Blizzard Challenge workshop 2010.
    • "The Blizzard Challenge 2009", Simon King and Vasilis Karaiskos, in Proc. Blizzard Challenge workshop 2009.
    • "The Blizzard Challenge 2008", Vasilis Karaiskos, Simon King, Robert A. J. Clark, Catherine Mayo, in Proc. Blizzard Challenge workshop 2008.
    • "The Blizzard Challenge -- 2005: Evaluating Corpus-Based Speech Synthesis on Common Datasets", Alan W. Black, Keiichi Tokuda, in Proc. Interspeech 2005, Lisbon, Portugal.
    which can be found via the Blizzard Challenge website.

Additional data

Samsung have released a data set comprising newly-created synthetic speech for the test materials from the above 2007-2016 Challenges, and other sources, along with a very substantial number of listener ratings, as SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis