Proceedings-Artikel
IAMonDo-Database: an Online Handwritten Document Database with Non-Uniform Contents
Emanuel Indermühle; Marcus Liwicki; Horst Bunke
In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems. IAPR International Workshop on Document Analysis Systems (DAS-10), June 9-11, Boston, MA, United States, Pages 97-104, o.A. 2010.
Abstract
In this paper we present a new database of online handwritten documents with dierent contents such as text, drawings, diagrams, formulas, tables, lists, and markings. It was designed to serve as a standard dataset for the development, training, testing and comparison of methods in the eld of handwritten document analysis. The database can serve as a basis for layout analysis, and dierent segmentation and recognition tasks considering online or just oine information. Its size is 1,000 documents produced by approximately 200 writers including a total of 329,849 online strokes. Few constraints were imposed on the writers when creating the documents. Nonetheless, the database has a stable distribution of the dierent content types. A software tool was developed to allow easy access to the documents which are stored in InkML. In this paper we also present two experiments which show the challenge this database poses. They may gure as references for further research in this area.
