|
[1]
|
Alexander Gutkin.
Towards Formal Structural Representation of Spoken
Language: An Evolving Transformation System (ETS) Approach.
PhD thesis, School of Informatics, University of Edinburgh, UK,
December 2005.
Internal version.
[ bib |
.pdf ]
|
|
[2]
|
Alexander Gutkin and David R. Gay.
Structural representation and matching of articulatory speech
structures based on the evolving transformation system (ETS) formalism.
In Proc. Nineteenth International Joint Conference on Artificial
Intelligence (IJCAI-05), Edinburgh, UK, August 2005.
[ bib |
.pdf ]
|
|
[3]
|
Alexander Gutkin and David R. Gay.
Structural Representation and Matching of Articulatory
Speech Structures based on the Evolving Transformation System
(ETS) Formalism.
In Michael Hofbaur, Bernhard Rinner, and Franz Wotawa, editors,
Proc. 19th International Workshop on Qualitative Reasoning (QR-05), pages
89-96, Graz, Austria, May 2005.
[ bib |
.pdf ]
A formal structural representation of speech
consistent with the principles of combinatorial
structure theory is presented in this paper. The
representation is developed within the Evolving
Transformation System (ETS) formalism and encapsulates
speech processes at the articulatory level. We show how
the class structure of several consonantal phonemes of
English can be expressed with the help of articulatory
gestures-the atomic combinatorial units of speech. As
a preliminary step towards the design of a speech
recognition architecture based on the structural
approaches to physiology and articulatory phonology, we
present an algorithm for the structural detection of
phonemic class elements inside gestural ETS structures
derived from continuous speech. Experiments designed to
verify the adequacy of the hypothesised gestural class
structure conducted on the MOCHA articulatory corpus
are then described. Our experimental results support
the hypothesis that the articulatory representation
captures sufficient information for the accurate
structural identification of the phonemic classes in
question.
|
|
[4]
|
Alexander Gutkin and Simon King.
Inductive String Template-Based Learning of Spoken
Language.
In Hugo Gamboa and Ana Fred, editors, Proc. 5th International
Workshop on Pattern Recognition in Information Systems (PRIS-2005), In
conjunction with the 7th International Conference on Enterprise Information
Systems (ICEIS-2005), pages 43-51, Miami, USA, May 2005. INSTICC Press.
[ bib |
.ps.gz |
.pdf ]
This paper deals with formulation of alternative
structural approach to the speech recognition problem.
In this approach, we require both the representation
and the learning algorithms defined on it to be
linguistically meaningful, which allows the speech
recognition system to discover the nature of the
linguistic classes of speech patterns corresponding to
the speech waveforms. We briefly discuss the current
formalisms and propose an alternative - a
phonologically inspired string-based inductive speech
representation, defined within an analytical framework
specifically designed to address the issues of class
and object representation. We also present the results
of the phoneme classification experiments conducted on
the TIMIT corpus of continuous speech.
|
|
[5]
|
Alexander Gutkin and Simon King.
Detection of Symbolic Gestural Events in Articulatory
Data for Use in Structural Representations of Continuous Speech.
In Proc. IEEE International Conference on Acoustics, Speech, and
Signal Processing (ICASSP-05), volume I, pages 885-888, Philadelphia, PA,
USA, March 2005. IEEE Signal Processing Society Press.
[ bib |
.ps.gz |
.pdf ]
One of the crucial issues which often needs to be
addressed in structural approaches to speech
representation is the choice of fundamental symbolic
units of representation. In this paper, a
physiologically inspired methodology for defining these
symbolic atomic units in terms of primitive
articulatory events is proposed. It is shown how the
atomic articulatory events (gestures) can be detected
directly in the articulatory data. An algorithm for
evaluating the reliability of the articulatory events
is described and promising results of the experiments
conducted on MOCHA articulatory database are presented.
|
|
[6]
|
Alexander Gutkin and Simon King.
Phone classification in pseudo-Euclidean vector spaces.
In Proc. 8th International Conference on Spoken Language
Processing (ICSLP), volume II, pages 1453-1457, Jeju Island, Korea, October
2004.
[ bib |
.ps.gz |
.pdf ]
Recently we have proposed a structural framework for
modelling speech, which is based on patterns of
phonological distinctive features, a linguistically
well-motivated alternative to standard vector-space
acoustic models like HMMs. This framework gives
considerable representational freedom by working with
features that have explicit linguistic interpretation,
but at the expense of the ability to apply the wide
range of analytical decision algorithms available in
vector spaces, restricting oneself to more
computationally expensive and less-developed symbolic
metric tools. In this paper we show that a
dissimilarity-based distance-preserving transition from
the original structural representation to a
corresponding pseudo-Euclidean vector space is
possible. Promising results of phone classification
experiments conducted on the TIMIT database are
reported.
|
|
[7]
|
Alexander Gutkin and Simon King.
Structural Representation of Speech for Phonetic
Classification.
In Proc. 17th International Conference on Pattern Recognition
(ICPR), volume 3, pages 438-441, Cambridge, UK, August 2004. IEEE Computer
Society Press.
[ bib |
.ps.gz |
.pdf ]
This paper explores the issues involved in using
symbolic metric algorithms for automatic speech
recognition (ASR), via a structural representation of
speech. This representation is based on a set of
phonological distinctive features which is a
linguistically well-motivated alternative to the
“beads-on-a-string” view of speech that is standard
in current ASR systems. We report the promising results
of phoneme classification experiments conducted on a
standard continuous speech task.
|
|
[8]
|
Alexander Gutkin, David Gay, Lev Goldfarb, and Mirjam Wester.
On the Articulatory Representation of Speech within the
Evolving Transformation System Formalism.
In Lev Goldfarb, editor, Pattern Representation and the Future
of Pattern Recognition (Proc. Satellite Workshop of 17th International
Conference on Pattern Recognition), pages 57-76, Cambridge, UK, August
2004.
[ bib |
.ps.gz |
.pdf ]
This paper deals with the formulation of an
alternative, structural, approach to the speech
representation and recognition problem. In this
approach, we require both the representation and the
learning algorithms to be linguistically meaningful and
to naturally represent the linguistic data at hand.
This allows the speech recognition system to discover
the emergent combinatorial structure of the linguistic
classes. The proposed approach is developed within the
ETS formalism, the first formalism in applied
mathematics specifically designed to address the issues
of class and object/event representation. We present an
initial application of ETS to the articulatory
modelling of speech based on elementary physiological
gestures that can be reliably represented as the ETS
primitives. We discuss the advantages of this gestural
approach over prevalent methods and its promising
potential to mathematical modelling and representation
in linguistics.
|
|
[9]
|
Alexander Gutkin.
Log-Linear Interpolation of Language Models.
MPhil. thesis, Department of Engineering, University of Cambridge,
UK, December 2000.
[ bib |
.ps.gz |
.pdf ]
|