Abstract
This paper presents a template-based approach to survey form analysis that identifies the user input areas on survey forms. Four types of region of interest (ROI) are considered: checkboxes, radio buttons, underlined handwritten regions, and parenthesized handwritten regions. Scale-invariant vertical and horizontal projection profile patterns are defined for each of the four ROI types and serve as matching templates. The templates are matched against the projection profiles of the connected components in a given survey form. Experiments with 158 different forms scanned at 100 dpi yielded a recognition rate of over 99%.
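The matching step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the actual template definitions, so the bin count, the max-pooling resampling, the mean absolute difference, the tolerance value, and all function names below are assumptions chosen for illustration. Connected-component extraction is omitted; each component is assumed to be already cropped to a binary grid.

```python
from typing import List, Sequence, Tuple

Grid = Sequence[Sequence[int]]  # binary image: 1 = ink, 0 = background


def projection_profiles(component: Grid) -> Tuple[List[int], List[int]]:
    """Return ink counts per row (horizontal) and per column (vertical)."""
    horizontal = [sum(row) for row in component]
    vertical = [sum(col) for col in zip(*component)]
    return horizontal, vertical


def normalize(profile: Sequence[int], bins: int = 8) -> List[float]:
    """Resample a profile to a fixed number of bins and scale it to [0, 1].

    Each bin keeps the maximum of the samples it covers, and all values
    are divided by the profile peak, so the result does not depend on
    the pixel size of the component (an assumed normalization).
    """
    n = len(profile)
    if n == 0:
        return [0.0] * bins
    peak = max(profile) or 1
    out = []
    for b in range(bins):
        start = b * n // bins
        end = max(start + 1, (b + 1) * n // bins)
        out.append(max(profile[start:end]) / peak)
    return out


def signature(component: Grid, bins: int = 8) -> List[float]:
    """Concatenate the normalized horizontal and vertical profiles."""
    horizontal, vertical = projection_profiles(component)
    return normalize(horizontal, bins) + normalize(vertical, bins)


def distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean absolute difference between two equal-length signatures."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def matches(component: Grid, template: Sequence[float], tol: float = 0.15) -> bool:
    """Accept the component if its signature is close to the template."""
    return distance(signature(component), template) <= tol


def hollow_box(size: int) -> List[List[int]]:
    """Synthetic checkbox: a square outline one pixel thick."""
    edge = (0, size - 1)
    return [[1 if r in edge or c in edge else 0 for c in range(size)]
            for r in range(size)]


def upscale(grid: Grid, k: int) -> List[List[int]]:
    """Enlarge a grid by replicating every pixel k times in both directions."""
    return [[v for v in row for _ in range(k)] for row in grid for _ in range(k)]


if __name__ == "__main__":
    checkbox_template = signature(hollow_box(8))
    print(matches(upscale(hollow_box(8), 3), checkbox_template))
    print(matches([[1] * 8 for _ in range(8)], checkbox_template))
```

In this sketch a checkbox outline enlarged by pixel replication produces the same signature as the original, whereas a solid square does not, which is the sense in which the profiles are scale invariant here. Real scans would also need binarization and skew correction before this step.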
Additional information
The article is published in the original.
Cite this article
Chien, P.H., Lee, G.C. A template-based method for identifying input regions in survey forms. Pattern Recognit. Image Anal. 21, 469 (2011). https://doi.org/10.1134/S1054661811020210
DOI: https://doi.org/10.1134/S1054661811020210