RUS  ENG
Full version
JOURNALS // Informatics and Automation // Archive

Tr. SPIIRAN, 2011 Issue 19, Pages 87–101 (Mi trspy471)

This article is cited in 1 paper

A quantitative analysis of the English lexicon in Wiktionaries and WordNet

A. A. Krizhanovsky

St. Petersburg Institute for Informatics and Automation of RAS

Abstract: A quantitative analysis of the English lexicon was performed in the paper. The three electronic dictionaries are under examination: the English Wiktionary, WordNet, and the Russian Wiktionary. The quantity of English words and their meanings (senses) are calculated. The distribution of words for each part of speech, the quantity of monosemous and polysemous words and the distribution of words by number of meanings were calculated and compared across these dictionaries. The analysis shows that the average polysemy, the number and the distribution of word senses follow similar patterns in both expert and collaborative resources with relatively minor differences.

Keywords: computational linguistics, lexicography, lexical analysis, English language.

UDC: 004.912

PACS: 01.30.Kj

MSC: 68T50, 68P20

Received: 29.11.2011
Accepted: 29.11.2011



© Steklov Math. Inst. of RAS, 2026