ABSTRACT
Transcription of large-scale historical handwritten document images is a tedious task. Machine learning techniques, such as deep learning, are popularly used for quick transcription, but often require a substantial amount of pre-transcribed word examples for training. Instead of line-by-line word transcription, this paper proposes a simple training-free gamification strategy where all occurrences of each arbitrarily selected word is transcribed once, using an intelligent user interface implemented in this work. The proposed approach offers a fast and user-friendly semi-automatic transcription that allows multiple users to work on the same document collection simultaneously.
- Vicent Alabau and Luis Leiva. 2012. Transcribing handwritten text images with a word soup game. In CHI'12 Extended Abstracts on Human Factors in Computing Systems. ACM, 2273--2278. Google ScholarDigital Library
- Sebastian Deterding, Dan Dixon, Rilla Khaled, and Lennart Nacke. 2011. From game design elements to gamefulness: defining gamification. In Proceedings of the 15th international academic MindTrek conference: Envisioning future media environments. ACM, 9--15. Google ScholarDigital Library
- Anders Hast, Per Cullhed, and Ekta Vats. 2017. TexT - Text Extractor Tool for Handwritten Document Transcription and Annotation. In 14:th Italian Research Conference on Digital Libraries, IRCDL. 1--12.Google Scholar
- Anders Hast and Alicia Fornés. 2016. A Segmentation-free Handwritten Word Spotting Approach by Relaxed Feature Matching. In Document Analysis Systems, 2016 12th IAPR Workshop on. IEEE, 150--155.Google ScholarCross Ref
- Oriol Ramos Terrades, A. H. Toselli, N. Serrano, V. Romero, E. Vidal, and A. Juan. 2010. Interactive layout analysis and transcription systems for historic handwritten documents. In 10th ACM Symposium on Document Engineering. 219--222. Google ScholarDigital Library
Index Terms
- An Intelligent User Interface for Efficient Semi-automatic Transcription of Historical Handwritten Documents
Recommendations
Handwritten text recognition for historical documents in the transcriptorium project
DATeCH '14: Proceedings of the First International Conference on Digital Access to Textual Cultural HeritageTranscription of historical handwritten documents is a crucial problem for making easier the access to these documents to the general public. Currently, huge amount of historical handwritten documents are being made available by on-line portals ...
Large Vocabulary Recognition of On-Line Handwritten Cursive Words
This paper presents a writer independent system for large vocabulary recognition of on-line handwritten cursive words. The system first uses a filtering module, based on simple letter features, to quickly reduce a large reference dictionary (lexicon) to ...
An online overlaid handwritten Japanese text recognition system for small tablet
The paper presents a recognition system of online overlaid handwritten Japanese text patterns on a smart phone or baby-face tablet. The proposed system oversegments a sequence of strokes into primitive segments at candidate off-strokes between strokes ...
Comments