Collection and Extraction Algorithm of Field Associated Terms 


Vol. 10, No. 3, pp. 347-358, Jun. 2003
10.3745/KIPSTB.2003.10.3.347


  Abstract

A field-associated term (FT) is a single or compound word whose terms can occur in any document and which makes it possible to recognize the field of a text using common human knowledge. For example, a reader recognizes the corresponding field name of a text when she encounters a word such as 'pitcher' or 'election'. We propose an efficient method for constructing FTs for specialized fields in order to decide the field of a text. The document classification scheme can be fixed from a well-classified document database or corpus. With respect to the focus field, we discuss the levels and stability ranks of FTs. To build a balanced FT collection, we construct single FTs. From these collections, FT levels and stability ranks can be constructed automatically. We propose a new FT extraction algorithm for document classification that uses each FT's concentration rate and its occurrence frequencies.
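The abstract does not define the concentration rate, so the following is only a rough sketch under stated assumptions, not the authors' algorithm. It assumes the concentration rate of a term in a field is that term's frequency in the field divided by its total frequency across all fields, and it keeps a term as a candidate FT when both this rate and the in-field frequency pass a threshold. The function names, the thresholds `min_rate` and `min_freq`, and the toy corpus are all hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical toy corpus: field name -> list of tokenized documents.
SAMPLE_CORPUS = {
    "sports": [["pitcher", "throws", "the", "ball"], ["the", "pitcher", "wins"]],
    "politics": [["the", "election", "results"], ["election", "day", "the", "vote"]],
}


def term_field_counts(corpus):
    """Count how often each term occurs in each field."""
    counts = defaultdict(Counter)  # term -> Counter(field -> frequency)
    for field, docs in corpus.items():
        for doc in docs:
            for term in doc:
                counts[term][field] += 1
    return counts


def extract_field_terms(corpus, min_rate=0.8, min_freq=2):
    """Return candidate field-associated terms per field.

    Assumption: concentration rate = frequency in the field divided by
    the term's total frequency over all fields.
    """
    counts = term_field_counts(corpus)
    result = defaultdict(list)
    for term, by_field in counts.items():
        total = sum(by_field.values())
        # Field in which the term is most frequent.
        field, freq = by_field.most_common(1)[0]
        rate = freq / total
        if rate >= min_rate and freq >= min_freq:
            result[field].append(term)
    return {field: sorted(terms) for field, terms in result.items()}


if __name__ == "__main__":
    print(extract_field_terms(SAMPLE_CORPUS))
```

With this toy corpus, 'the' is spread evenly over both fields and is rejected by the rate threshold, while words that occur only once are rejected by the frequency threshold.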

  Cite this article

[IEEE Style]

S. S. K. Lee and W. K. Lee, "Collection and Extraction Algorithm of Field Associated Terms," The KIPS Transactions:PartB, vol. 10, no. 3, pp. 347-358, 2003. DOI: 10.3745/KIPSTB.2003.10.3.347.

[ACM Style]

Samuel Sang Kon Lee and Wan Kwon Lee. 2003. Collection and Extraction Algorithm of Field Associated Terms. The KIPS Transactions:PartB, 10, 3, (2003), 347-358. DOI: 10.3745/KIPSTB.2003.10.3.347.