Abstract:
For extra-large vocabulary continuous speech recognition a language model that describes permissible phrases is needed. In the paper, the results of experiments on extra-large vocabulary (above $100$ K words) continuous speech recognition, with usage of $n$-gram models, are presented. A quantitative comparison of recognition accuracy of words, symbols, and phonemes depending on $n$-gram model, with $n$ value varying from $0$ till $3$, was made.
Keywords:continuous Russian speech recognition, extra-large vocabulary, language models.