Moore, R. & Maier, V. (2007) Preserving FPD using episodic memory: automatic speech recognition with Minerva2. In Trouvain, J. & Barry, W. (eds.), Proceedings of the 16th International Congress of Phonetic Sciences, p197-204. Paper ID 1724.

Full text

Previous research has demonstrated competitive recognition results using a simulation of episodic memory - ‘MINERVA2’ - on the Peterson & Barney corpus of vowel formant data. This paper presents a modified implementation designed to work on real speech data, and results are reported on isolated-word recognition experiments conducted using the TI-ALPHA corpus. It is shown that access to fine phonetic detail is critical for achieving high recognition accuracy, whether it is provided by the episodic model or by hidden Markov models incorporating large numbers of Gaussian mixture components. However, it is confirmed that although MINERVA2 offers a powerful means for generalizing by accessing the fine detail retained in all the training data, it is severely hampered by its inability to model temporal sequence. It is concluded that a new episodic model is needed that is based on the principles of MINERVA2 but which overcomes such limitations.

< Back to Publications

August 2010
S M T W T F S
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30 31        

Marie Curie Logo