Automatic Extraction of Collocations based on Corpus using mutual information 


Vol. 1,  No. 4, pp. 461-468, Nov.  1994
10.3745/KIPSTE.1994.1.4.461


PDF
  Abstract

This paper describes the automatic extraction of collocations based on corpus. The collocations are extracted from corpus using cooccurrence frequency and mutual information between words. In English, 5 types of collocations are defined. These collocations are transitive verb and object, intransitive verb and subject, adjective and noun, verb and adverb, and adverb and adjective. In this paper another type of collocation is recognized and extracted, which consists of verb and preposition. So 6 types of collocations are extracted based on corpus.

  Statistics


  Cite this article

[IEEE Style]

L. H. Suk, "Automatic Extraction of Collocations based on Corpus using mutual information," The Transactions of the Korea Information Processing Society (1994 ~ 2000), vol. 1, no. 4, pp. 461-468, 1994. DOI: 10.3745/KIPSTE.1994.1.4.461.

[ACM Style]

Lee Ho Suk. 1994. Automatic Extraction of Collocations based on Corpus using mutual information. The Transactions of the Korea Information Processing Society (1994 ~ 2000), 1, 4, (1994), 461-468. DOI: 10.3745/KIPSTE.1994.1.4.461.