A Korean Language Stemmer based on Unsupervised Learning 


Vol. 8,  No. 6, pp. 675-684, Dec.  2001
10.3745/KIPSTB.2001.8.6.675


PDF
  Abstract

This paper describes a method for stemming of Korean language by using unsupervised learning from raw corpus. This technique does not require a lexicon or any language-specific knowledge. Since we use unsupervised learning, the time and effort required for learning is negligible. Unlike heuristic approaches that are theoretically ungrounded, this method is based on widely accepted statistical methods, and therefore can be easily extended. The method is currently applied only to Korean language, but it can easily be adapted to other agglutinative languages, since it is not language-dependent.

  Statistics


  Cite this article

[IEEE Style]

S. H. Cho, "A Korean Language Stemmer based on Unsupervised Learning," The KIPS Transactions:PartB , vol. 8, no. 6, pp. 675-684, 2001. DOI: 10.3745/KIPSTB.2001.8.6.675.

[ACM Style]

Se Hyeong Cho. 2001. A Korean Language Stemmer based on Unsupervised Learning. The KIPS Transactions:PartB , 8, 6, (2001), 675-684. DOI: 10.3745/KIPSTB.2001.8.6.675.