Gestural Interaction for an Automatic Document Capture System

Christian Kofler, Daniel Keysers, A. Koetsier, J. Laagland, Thomas Breuel

In: Proceedings of the 2nd International Workshop on Camera-Based Document Analysis and Recognition (CBDAR-2007) at (ICDAR-2007), September 22, Curitiba, Brasilien. International Workshop on Camera-Based Document Analysis and Recognition (CBDAR) Pages 161-167 Online-Proceedings 2007.


The amount of printed documents used today is still very large despite increased use of digital formats. To bridge the gap between analog paper and digital media, paper documents need to be captured. We present a prototype that allows for cost-effective, fast, and robust document capture using a standard consumer camera. The user's physical desktop is continuously monitored. Whenever a document is detected, the system acquires its content in one of two ways. Either the entire document is captured or a region of interest is extracted, which the user can specify easily by pointing at it. In both modes a high resolution image is taken and the contained information is digitized. The main challenges in designing and implementing such a capturing system are real-time performance, accurate detection of documents, reliable detection of the user's hand and robustness against perturbations such as lighting changes and shadows. This paper presents approaches that address these challenges and discusses the integration into a robust document capture system with gestural interaction.

CkDkTmbiDesk-cbdar07.pdf (pdf, 1 MB )

German Research Center for Artificial Intelligence
Deutsches Forschungszentrum für Künstliche Intelligenz