An Efficient Block Segmentation and Classification Method for Document Image Analysis Using SGLDM and BP 


Vol. 2,  No. 6, pp. 937-946, Nov.  1995
10.3745/KIPSTE.1995.2.6.937


PDF
  Abstract

We proposed an efficient block segmentation and classification method for the document analysis using SGLDM(spatial gray level dependence matrix) and BP(back propagation) neural network. Seven texture features are extracted directly from the SGLDM of each gray-level block image, and by using the nonlinear classifier of neural network BP, we can classify document blocks into 9 categories. The proposed method classifies the equation block, the table block and the flow chart block, which are mostly composed of the characters, out of the blocks that are conventionally classified as non-character blocks. By applying Sobel operator on the gray-level document image before binarization, we can reduce the effect of the background noises and by using the additional horizontal-vertical smoothing aw well as the vertical-horizontal smoothing of images, we can obtain an effective block segmentation that does not lead to the segmentation into small pieces. The result of experiment shows that a document can be segmented and classified into the character blocks of large fonts, medium fonts, small fonts, the character recognigible candidates of tables, flow charts, equations, and the non-character blocks of photos, figures, and graphs.

  Statistics


  Cite this article

[IEEE Style]

K. J. Soo, L. J. Hwan, C. H. Moon, "An Efficient Block Segmentation and Classification Method for Document Image Analysis Using SGLDM and BP," The Transactions of the Korea Information Processing Society (1994 ~ 2000), vol. 2, no. 6, pp. 937-946, 1995. DOI: 10.3745/KIPSTE.1995.2.6.937.

[ACM Style]

Kim Joong Soo, Lee Jeong Hwan, and Choi Heung Moon. 1995. An Efficient Block Segmentation and Classification Method for Document Image Analysis Using SGLDM and BP. The Transactions of the Korea Information Processing Society (1994 ~ 2000), 2, 6, (1995), 937-946. DOI: 10.3745/KIPSTE.1995.2.6.937.