Efficient Implementation of Local Adaptive Thresholding Techniques Using Integral Images

Faisal Shafait, Daniel Keysers, Thomas Breuel

In: Proceedings of the 15th Document Recognition and Retrieval Conference (DRR-2008), Part of the IS&T/SPIE International Symposium on Electronic Imaging, January 26-31, San Jose, CA, USA. SPIE Conference on Document Recognition and Retrieval (DRR) 6815 SPIE 1/2008.


Adaptive binarization is an important first step in many document analysis and OCR processes. This paper describes a fast adaptive binarization algorithm∗ that yields the same quality of binarization as the Sauvola method,1 but runs in time close to that of global thresholding methods (like Otsu’s method2 ), independent of the window size. The algorithm combines the statistical constraints of Sauvola’s method with integral images.3 Testing on the UW-1 dataset demonstrates a 20-fold speedup compared to the original Sauvola algorithm.

FsDkTmbEfficientImplSpie2008.pdf (pdf, 594 KB )

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence