Multilingual Story Link Detection based on Properties of Event Terms 


Vol. 12,  No. 1, pp. 81-90, Feb.  2005
10.3745/KIPSTB.2005.12.1.81


PDF
  Abstract

In this paper, we propose a novel approach which models multilingual story link detection by adapting the features such as timelines and multilingual spaces as weighting components to give distinctive weights to terms related to events. On timelines term significance is calculated by comparing term distribution of the documents on that dat with that on the total document collection reported, and used to represent the document vectors on that day. Since two languages can provide more information than one language, term significance is measured on each language space and used to refer the other language space as a bridge on multilingual spaces. Evaluating the method on Korean and Japanese news articles, our method archieved 14.3% and 16.7% improvement for mono- and multi-lingual story pairs, and for multilingual story pairs, respectively. By measuring the space density, the proposed weighting components are verified with a high density of the intra-event stories and a low density of the inter-events stories. This result indicates that the proposed method is helpful for multilingual story link detection.

  Statistics


  Cite this article

[IEEE Style]

K. S. Lee, "Multilingual Story Link Detection based on Properties of Event Terms," The KIPS Transactions:PartB , vol. 12, no. 1, pp. 81-90, 2005. DOI: 10.3745/KIPSTB.2005.12.1.81.

[ACM Style]

Kyung Soon Lee. 2005. Multilingual Story Link Detection based on Properties of Event Terms. The KIPS Transactions:PartB , 12, 1, (2005), 81-90. DOI: 10.3745/KIPSTB.2005.12.1.81.