Generalization of error decision rules in a grammar checker using Korean WordNet, KorLex 


Vol. 18,  No. 6, pp. 405-414, Dec.  2011
10.3745/KIPSTB.2011.18.6.405


PDF
  Abstract

Korean grammar checkers typically detect context-dependent errors by employing heuristic rules that are manually formulated by a language expert. These rules are appended each time a new error pattern is detected. However, such grammar checkers are not consistent. In order to resolve this shortcoming, we propose new method for generalizing error decision rules to detect the above errors. For this purpose, we use an existing thesaurus KorLex, which is the Korean version of Princeton WordNet. KorLex has hierarchical word senses for nouns, but does not contain any information about the relationships between cases in a sentence. Through the Tree Cut Model and the MDL(minimum description length) model based on information theory, we extract noun classes from KorLex and generalize error decision rules from these noun classes. In order to verify the accuracy of the new method in an experiment, we extracted nouns used as an object of the four predicates usually confused from a large corpus, and subsequently extracted noun classes from these nouns. We found that the number of error decision rules generalized from these noun classes has decreased to about 64.8%. In conclusion, the precision of our grammar checker exceeds that of conventional ones by 6.2%.

  Statistics


  Cite this article

[IEEE Style]

S. G. Ja, S. H. Lee, H. C. Kwon, "Generalization of error decision rules in a grammar checker using Korean WordNet, KorLex," The KIPS Transactions:PartB , vol. 18, no. 6, pp. 405-414, 2011. DOI: 10.3745/KIPSTB.2011.18.6.405.

[ACM Style]

So Gil Ja, Seung Hee Lee, and Hyuk Chul Kwon. 2011. Generalization of error decision rules in a grammar checker using Korean WordNet, KorLex. The KIPS Transactions:PartB , 18, 6, (2011), 405-414. DOI: 10.3745/KIPSTB.2011.18.6.405.