Extracting curved text Lines using the chain composition and the expanded grouping method
Vol. 14, No. 6, pp. 453-460, Oct. 2007
10.3745/KIPSTB.2007.14.6.453
Abstract
In this paper, we present a method for extracting text lines from poorly structured documents. The text lines may have different orientations and considerably curved shapes, and a single line may contain a few wide inter-word gaps. Such text lines appear in posters, address blocks, and artistic documents. Our method is based on traditional perceptual grouping, but we develop novel solutions to two of its problems: insufficient seed points and varied orientations within a single line. We assume that a text line consists of connected components, where each connected component is a set of black pixels belonging to a single letter or to several touching letters. In our scheme, connected components that lie closer together than an iteratively incremented threshold are joined into a chain. Elongated chains are identified as the seed chains of lines. The seed chains are then extended to the left and to the right according to the local orientation, which is re-evaluated at each end of a chain whenever that chain is extended. This process continues until all text lines have been constructed. In our experiments, the proposed method performed well on considerably curved text lines from logos and slogans, achieving 98% on straight-line extraction and 94% on curved-line extraction.
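The chain composition step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes each connected component is reduced to a centroid, uses single-linkage grouping for "closer than the threshold", and uses a bounding-box aspect ratio as a stand-in elongation test. The function names and the parameters `start`, `step`, `limit`, `min_size`, and `min_ratio` are all hypothetical, and the extension of seed chains along local orientations is not covered.

```python
import math
from itertools import combinations


def group_by_distance(points, threshold):
    """Single-linkage grouping: points closer than `threshold` share a chain."""
    parent = list(range(len(points)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(points)), 2):
        if math.dist(points[i], points[j]) < threshold:
            parent[find(i)] = find(j)

    chains = {}
    for i in range(len(points)):
        chains.setdefault(find(i), []).append(i)
    return sorted(chains.values())


def is_elongated(points, chain, min_size=3, min_ratio=3.0):
    """Assumed elongation test: long side of the bounding box vs. short side."""
    if len(chain) < min_size:
        return False
    xs = [points[i][0] for i in chain]
    ys = [points[i][1] for i in chain]
    width, height = max(xs) - min(xs), max(ys) - min(ys)
    long_side, short_side = max(width, height), min(width, height)
    # Clamp the short side so perfectly collinear chains do not divide by zero.
    return long_side >= min_ratio * max(short_side, 1.0)


def find_seed_chains(points, start=1.0, step=1.0, limit=20.0):
    """Raise the threshold step by step until some chain qualifies as a seed."""
    threshold = start
    while threshold <= limit:
        chains = group_by_distance(points, threshold)
        seeds = [c for c in chains if is_elongated(points, c)]
        if seeds:
            return threshold, seeds
        threshold += step
    return None, []
```

Calling `find_seed_chains(centroids)` on a list of `(x, y)` centroids returns the first threshold at which an elongated chain appears, together with the seed chains as lists of indices into the input. An axis-aligned bounding box is a poor elongation measure for diagonal or strongly curved chains; a principal-axis measure would be a more robust choice in practice.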
Cite this article
[IEEE Style]
J. S. Yoon, Y. J. Song, N. Kim, Y. G. Kim, "Extracting curved text Lines using the chain composition and the expanded grouping method," The KIPS Transactions: Part B, vol. 14, no. 6, pp. 453-460, 2007. DOI: 10.3745/KIPSTB.2007.14.6.453.
[ACM Style]
Jin Seon Yoon, Young Jun Song, Nam Kim, and Yong Gi Kim. 2007. Extracting curved text Lines using the chain composition and the expanded grouping method. The KIPS Transactions: Part B, 14, 6, (2007), 453-460. DOI: 10.3745/KIPSTB.2007.14.6.453.