Statistical Multi-resolution Schemes For Historical Document Binarization
Abstract
In previous work, we proposed applying the Expectation-Maximization (EM) algorithm to the binarization of historical documents by defining a multi-resolution framework. In this work, we extend the multi-resolution framework to the Otsu algorithm for effective binarization of historical documents. We compare the effectiveness of the EM-based binarization technique with that of the Otsu thresholding algorithm on historical documents. We demonstrate how EM can be extended to perform an effective segmentation of historical documents by taking into account multiple features beyond the intensity of the document image. Experimental results, analysis, and comparisons to known techniques are presented using the document image collection from the DIBCO 2009 contest. © 2011 SPIE-IS&T.
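As a rough illustration of the two base techniques the abstract names, the sketch below shows a global Otsu threshold and a two-component Gaussian mixture fitted by EM, both on pixel intensity alone. It is not the authors' multi-resolution framework, which this record does not describe; the function names (otsu_threshold, em_binarize), the 8-bit grayscale input, the min/max initialization, and the fixed iteration count are all assumptions made for the example.

```python
import numpy as np


def otsu_threshold(gray):
    """Return the 8-bit level t that maximizes between-class variance.

    Assumes `gray` is a uint8 array; pixels <= t form one class.
    """
    hist = np.bincount(gray.ravel(), minlength=256).astype(np.float64)
    prob = hist / hist.sum()
    levels = np.arange(256)
    w0 = np.cumsum(prob)              # weight of the class with values <= t
    w1 = 1.0 - w0                     # weight of the class with values > t
    mu = np.cumsum(prob * levels)     # cumulative first moment up to t
    mu_total = mu[-1]
    valid = (w0 > 0) & (w1 > 0)       # skip thresholds with an empty class
    sigma_b = np.zeros(256)
    sigma_b[valid] = (mu_total * w0[valid] - mu[valid]) ** 2 / (w0[valid] * w1[valid])
    return int(np.argmax(sigma_b))


def em_binarize(gray, iters=50):
    """Fit a two-component 1-D Gaussian mixture to intensities with EM.

    Illustrative only: returns (weights, means, variances, text_mask),
    where text_mask marks pixels assigned to the darker component.
    """
    x = np.asarray(gray, dtype=np.float64).ravel()
    mu = np.array([x.min(), x.max()])          # assumed initialization
    var = np.full(2, x.var() + 1e-6)
    pi = np.array([0.5, 0.5])
    for _ in range(iters):
        # E-step: posterior responsibility of each component for each pixel
        diff = x[:, None] - mu
        pdf = np.exp(-diff ** 2 / (2.0 * var)) / np.sqrt(2.0 * np.pi * var)
        resp = pi * pdf
        resp /= resp.sum(axis=1, keepdims=True) + 1e-300
        # M-step: re-estimate weights, means, and variances
        nk = resp.sum(axis=0) + 1e-12
        mu = (resp * x[:, None]).sum(axis=0) / nk
        var = (resp * (x[:, None] - mu) ** 2).sum(axis=0) / nk + 1e-6
        pi = nk / x.size
    dark = int(np.argmin(mu))                  # darker component taken as text
    text_mask = (resp.argmax(axis=1) == dark).reshape(np.shape(gray))
    return pi, mu, var, text_mask
```

For a uint8 image img with dark text on a light background, img <= otsu_threshold(img) gives a text mask directly, while em_binarize(img) returns a comparable mask along with the fitted mixture parameters. Because EM works with per-pixel posteriors, the same loop could in principle take feature vectors rather than a single intensity value, which is the direction the abstract points to.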
Recommended Citation
T. Obafemi-Ajayi and G. Agam, "Statistical Multi-resolution Schemes For Historical Document Binarization," Proceedings of SPIE - The International Society for Optical Engineering, vol. 7874, article no. 78740S, Society of Photo-optical Instrumentation Engineers, May 2011.
The definitive version is available at https://doi.org/10.1117/12.876582
Department(s)
Electrical and Computer Engineering
Keywords and Phrases
binarization; document image analysis; historical documents; image thresholding
International Standard Book Number (ISBN)
978-081948411-6
International Standard Serial Number (ISSN)
0277-786X
Document Type
Article - Conference proceedings
Document Version
Final Version
File Type
text
Language(s)
English
Rights
© 2023 Society of Photo-optical Instrumentation Engineers, All rights reserved.
Publication Date
12 May 2011