|
[1]
|
E. Zwyssig, S. Renals, and M. Lincoln.
Determining the number of speakers in a meeting using microphone
array features.
In Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE
International Conference on, pages 4765-4768, 2012.
[ bib ]
|
|
[2]
|
E. Zwyssig, S. Renals, and M. Lincoln.
On the effect of SNR and superdirective beamforming in speaker
diarisation in meetings.
In Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE
International Conference on, pages 4177-4180, 2012.
[ bib ]
|
|
[3]
|
Erich Zwyssig, Mike Lincoln, and Steve Renals.
A digital microphone array for distant speech recognition.
In Proc. IEEE ICASSP-10, pages 5106-5109, 2010.
[ bib |
DOI |
.pdf ]
In this paper, the design, implementation and testing
of a digital microphone array is presented. The array
uses digital MEMS microphones which integrate the
microphone, amplifier and analogue to digital converter
on a single chip in place of the analogue microphones
and external audio interfaces currently used. The
device has the potential to be smaller, cheaper and
more flexible than typical analogue arrays, however the
effect on speech recognition performance of using
digital microphones is as yet unknown. In order to
evaluate the effect, an analogue array and the new
digital array are used to simultaneously record test
data for a speech recognition experiment. Initial
results employing no adaptation show that performance
using the digital array is significantly worse (14%
absolute WER) than the analogue device. Subsequent
experiments using MLLR and CMLLR channel adaptation
reduce this gap, and employing MLLR for both channel
and speaker adaptation reduces the difference between
the arrays to 4.5% absolute WER.
|