Vestnik Sankt-Peterburgskogo Universiteta. Seriya 10. Prikladnaya Matematika. Informatika. Protsessy Upravleniya

Vestnik S.-Petersburg Univ. Ser. 10. Prikl. Mat. Inform. Prots. Upr., 2011 Issue 3, Pages 127–133 (Mi vspui52)

Informatics

Thematic text document segmentation

A. N. Mishenin

St. Petersburg State University, Department of Mathematics and Mechanics

Abstract: A method for automatic text segmentation and annotation is presented. It first discovers the themes present in the document collection and then splits each document according to these themes.

Keywords: text segmentation, natural language processing, information retrieval.

UDC: 519.688


Accepted: March 10, 2011


