Construction Scheme of Training Data using Automated Exploring of Boundary Categories 


Vol. 16,  No. 6, pp. 479-488, Dec.  2009
10.3745/KIPSTB.2009.16.6.479


PDF
  Abstract

This paper shows a reinforced construction scheme of training data for improvement of text classification by automatic search of boundary category. The documents laid on boundary area are usually misclassified as they are including multiple topics and features. which is the main factor that we focus on. In this paper, we propose an automated exploring methodology of optimal boundary category based on previous research. We consider the boundary area among target categories to new category to be required training, which are then added to the target category sementically. In experiments, we applied our method to complex documents by intentionally making errors in training process. The experimental results show that our system has high accuracy and reliability in noisy environment.

  Statistics


  Cite this article

[IEEE Style]

Y. J. Choi, J. G. Jee, S. S. Park, "Construction Scheme of Training Data using Automated Exploring of Boundary Categories," The KIPS Transactions:PartB , vol. 16, no. 6, pp. 479-488, 2009. DOI: 10.3745/KIPSTB.2009.16.6.479.

[ACM Style]

Yun Jeong Choi, Jeong Gyu Jee, and Seung Soo Park. 2009. Construction Scheme of Training Data using Automated Exploring of Boundary Categories. The KIPS Transactions:PartB , 16, 6, (2009), 479-488. DOI: 10.3745/KIPSTB.2009.16.6.479.