Statistical Multi-resolution Schemes For Historical Document Binarization
In previous work, we proposed the application of the Expectation- Maximization (EM) algorithm in the binarization of historical documents by defining a multi-resolution framework. In this work, we extend the multiresolution framework to the Otsu algorithm for effective binarization of historical documents. We compare the effectiveness of the EM based binarization technique to the Otsu thresholding algorithm on historical documents. We demonstrate how the EM can be extended to perform an effective segmentation of historical documents by taking into account multiple features beyond the intensity of the document image. Experimental results, analysis and comparisons to known techniques are presented using the document image collection from the DIBCO 2009 contest. © 2011 SPIE-IS&T.
T. Obafemi-Ajayi and G. Agam, "Statistical Multi-resolution Schemes For Historical Document Binarization," Proceedings of SPIE - The International Society for Optical Engineering, vol. 7874, article no. 78740S, Society of Photo-optical Instrumentation Engineers, May 2011.
The definitive version is available at https://doi.org/10.1117/12.876582
Electrical and Computer Engineering
Keywords and Phrases
binarization; document image analysis; historical documents; image thresholding
International Standard Book Number (ISBN)
International Standard Serial Number (ISSN)
Article - Conference proceedings
© 2023 Society of Photo-optical Instrumentation Engineers, All rights reserved.
12 May 2011