
Eurasian Journal of Mathematical and Computer Applications, 2013, Volume 1, Issue 2, Pages 102–109 (Mi ejmca75)

A ‘by part’ method of Russian word speech recognition

A. V. Nitsenko

Institute of Artificial Intelligence, the Ministry of Education and Science of Ukraine and National Academy of Science of Ukraine, Donetsk

Abstract: This article describes a speech recognition method based on recognizing words by their component parts. The method starts with automatic phonetic segmentation, which uses a digital analogue of full variation, then builds a diphone base and performs recognition with the dynamic time warping (DTW) algorithm: first for the variable part of the word (a quasiflexion) and then for its static part (a quasibase), with reference templates formed automatically from diphone templates. This considerably reduces the running time and improves the reliability of word-form recognition. The method can be applied to recognition with large and very large vocabularies.

Keywords: segmentation of speech signal, diphone, dynamic time warping, feature vector, quasiflexion.

MSC: 68T10, 68T50

Received: 02.12.2013

Language: English
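
Illustration: the two-stage matching outlined in the abstract can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the author's implementation. It assumes the utterance has already been split into feature-vector sequences for the static and variable parts, that templates are plain lists of feature vectors, and that each ending has its own set of compatible stem templates. The names `dtw_distance`, `recognize_by_parts`, `ending_templates`, and `stems_by_ending` are hypothetical; the abstract does not describe how templates are stored or how candidate stems are selected.

```python
import math


def dtw_distance(a, b):
    """Textbook DTW distance between two sequences of feature vectors."""
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated distance aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean frame distance
            cost[i][j] = d + min(cost[i - 1][j],      # skip a frame of b
                                 cost[i][j - 1],      # skip a frame of a
                                 cost[i - 1][j - 1])  # match both frames
    return cost[n][m]


def recognize_by_parts(stem_feats, ending_feats,
                       ending_templates, stems_by_ending):
    """Pick the ending first, then the stem among stems allowed for it.

    ending_templates: {ending_label: template}           (assumed layout)
    stems_by_ending:  {ending_label: {stem_label: template}}  (assumed layout)
    """
    # Stage 1: variable part (quasiflexion)
    ending = min(ending_templates,
                 key=lambda e: dtw_distance(ending_feats, ending_templates[e]))
    # Stage 2: static part (quasibase), restricted to compatible stems
    candidates = stems_by_ending[ending]
    stem = min(candidates,
               key=lambda s: dtw_distance(stem_feats, candidates[s]))
    return stem, ending
```

Restricting the second stage to stems compatible with the chosen ending is one plausible reason such a scheme would compare against fewer templates than whole-word matching; that restriction is an assumption of this sketch.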


