RUS  ENG
Full version
JOURNALS // Journal of Siberian Federal University. Mathematics & Physics // Archive

J. Sib. Fed. Univ. Math. Phys., 2016 Volume 9, Issue 2, Pages 235–245 (Mi jsfu481)

This article is cited in 2 papers

Topic categorization based on collectives of term weighting methods for natural language call routing

Roman B. Sergienkoa, Muhammad Shana, Wolfgang Minkera, Eugene S. Semenkinb

a Institute of Telecommunication Engineering, Ulm University, Albert-Einstein-Allee, 43, Ulm, 89081, Germany
b Informatics and Telecommunications Institute, Siberian State Aerospace University, Krasnoyarskiy Rabochiy, 31, Krasnoyarsk, 660037, Russia

Abstract: Natural language call routing is an important data analysis problem which can be applied in different domains including airspace industry. This paper presents the investigation of collectives of term weighting methods for natural language call routing based on text classification. The main idea is that collectives of different term weighting methods can provide classification effectiveness improvement with the same classification algorithm. Seven different unsupervised and supervised term weighting methods were tested and compared with each other for classification with k-NN. After that different combinations of term weighting methods were formed as collectives. Two approaches for the handling of the collectives were considered: the meta-classifier based on the rule induction and the majority vote procedure. The numerical experiments have shown that the best result is provided with the vote of all seven different term weighting methods. This combination provides a significant increasing of classification effectiveness in comparison with the most effective term weighting methods.

Keywords: natural language call routing, text classification, term weighting.

UDC: 004.93

Received: 26.12.2015
Received in revised form: 11.01.2016
Accepted: 20.02.2016

Language: English

DOI: 10.17516/1997-1397-2016-9-2-235-245



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2026