A Document Classification System Using Modified ECCD and Category Weight for each Document 


Vol. 19,  No. 4, pp. 237-242, Aug.  2012
10.3745/KIPSTB.2012.19.4.237


PDF
  Abstract

Web information service needs a document classification system for efficient management and conveniently searches. Existing document classification systems have a problem of low accuracy in classification, if a few number of feature words is selected in documents or if the number of documents that belong to a specific category is excessively large. To solve this problem, we propose a document classification system using ``Modified ECCD`` feature selection method and ``Category Weight for each Document``. Experimental results show that the ``Modified ECCD`` feature selection method has higher accuracy in classification than  and the ECCD method. Moreover, combining the ``Category Weight for each Document`` feature value and ``Modified ECCD`` feature selection method results better accuracy in classification.

  Statistics


  Cite this article

[IEEE Style]

S. W. Lee, C. S. Han, S. Y. Park, "A Document Classification System Using Modified ECCD and Category Weight for each Document," The KIPS Transactions:PartB , vol. 19, no. 4, pp. 237-242, 2012. DOI: 10.3745/KIPSTB.2012.19.4.237.

[ACM Style]

Soo Won Lee, Chung Seok Han, and Sang Yong Park. 2012. A Document Classification System Using Modified ECCD and Category Weight for each Document. The KIPS Transactions:PartB , 19, 4, (2012), 237-242. DOI: 10.3745/KIPSTB.2012.19.4.237.