Statistical Multi-resolution Schemes For Historical Document Binarization

Abstract

In previous work, we proposed the application of the Expectation- Maximization (EM) algorithm in the binarization of historical documents by defining a multi-resolution framework. In this work, we extend the multiresolution framework to the Otsu algorithm for effective binarization of historical documents. We compare the effectiveness of the EM based binarization technique to the Otsu thresholding algorithm on historical documents. We demonstrate how the EM can be extended to perform an effective segmentation of historical documents by taking into account multiple features beyond the intensity of the document image. Experimental results, analysis and comparisons to known techniques are presented using the document image collection from the DIBCO 2009 contest. © 2011 SPIE-IS&T.

Department(s)

Electrical and Computer Engineering

Keywords and Phrases

binarization; document image analysis; historical documents; image thresholding

International Standard Book Number (ISBN)

978-081948411-6

International Standard Serial Number (ISSN)

0277-786X

Document Type

Article - Conference proceedings

Document Version

Final Version

File Type

text

Language(s)

English

Rights

© 2023 Society of Photo-optical Instrumentation Engineers, All rights reserved.

Publication Date

12 May 2011

Share

 
COinS