Skip to main content
Log in

A template-based method for identifying input regions in survey forms

  • Applied Problems
  • Published:
Pattern Recognition and Image Analysis Aims and scope Submit manuscript

Abstract

This paper presents a template-based approach for survey form analysis. User input areas on the survey forms are identified. Regions of interest (ROI) include checkboxes, radio buttons, underlined and parenthesized handwritten regions. Scale invariant vertical and horizontal projection profile patterns for each of those four types of ROI are defined as matching templates. The templates are matched against projection profiles of connected components in the given survey form. Experiments with 158 different forms at 100-dpi scan resolution resulted in recognition rate of over 99%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. B. Yu and A. K. Jain, “A Generic System for Form Dropout,” IEEE Trans. Pattern Anal. Mach. Intell. 18(11), 1127–1134 (1996).

    Article  Google Scholar 

  2. D. Tuganbaev, A. Pakhchanian, and D. Deryagin, “Universal Data Capture Technology from Semistructured Forms,” in Proc. 8th Int. Conf. on Document Analysis and Recognition (Washington, 2005), pp. 458–462.

  3. J. Perez-Cortes, L. Andreu, and J. Arlandis, “A Model-Based Field Frame Detection for Hand-written Filled-in Forms,” in Proc. 8th IAPR Int. Workshop on Document Analysis Systems (IEEE Computer Society, Washington, 2008), pp. 362–368.

    Chapter  Google Scholar 

  4. J. Liu, X. Ding, and Y. Wu, “Description and Recognition of Form and Automated Form Data Entry,” in 3rd Int. Conf. Document Analysis and Recognition (Montreal, 1995), pp. 579–582.

  5. L. Y. Tseng and R. C. Chen, “Recognition and Data Extraction of Form Documents Based on Three Types of Line Segments,” Pattern Recogn. 31(10), 1525–1540 (1998).

    Article  Google Scholar 

  6. M. Dar, P. Nagabhushan, and A. H. Mir, “Pre-Printed Form Recognition and Extraction of Data,” in Int. Electronic Conf. of Computer Sci., AIP Conf. Proc. (2008), Vol. 1060, pp. 233–239.

    Google Scholar 

  7. R. G. Casey, D. R. Ferguson, K. Mohiuddin, and E. Walach, “Intelligent Forms Processing System,” Mach. Vision Appl. 5(3), 143–155 (1992).

    Article  Google Scholar 

  8. S. L. Taylor, R. Fritzson, and J. A. Pastor, “Extraction of Data from Preprinted Forms,” Mach. Vision Appl. 5(3), 211–222 (1998).

    Article  Google Scholar 

  9. J. L. Chen and H. J. Lee, “An Efficient Algorithm for Form Structure Extraction Using Strip Projection,” Pattern. Recogn. 31(9), 1353–1368 (1998).

    Article  MathSciNet  Google Scholar 

  10. N. Sherkat, T. Allen, and W. S. Wong, “Use of Colour for Hand-Filled Form Analysis and Recognition,” Pattern Anal. Appl. 8(1), 163–180 (2005).

    Article  MathSciNet  Google Scholar 

  11. N. Otsu, “A Threshold Selection Method from Gray-Level Histograms,” IEEE Trans. Syst., Man, Cybern. 9(1), 62–66 (1979).

    Article  MathSciNet  Google Scholar 

  12. C. Singh, N. Bhatia, and A. Kaur, “Hough Transform Based Fast Skew Detection and Accurate Skew Correction Methods,” Pattern Recogn. 41(12), 3528–3546 (2008).

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to P. H. Chien.

Additional information

The article is published in the original.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chien, P.H., Lee, G.C. A template-based method for identifying input regions in survey forms. Pattern Recognit. Image Anal. 21, 469 (2011). https://doi.org/10.1134/S1054661811020210

Download citation

  • Received:

  • Published:

  • DOI: https://doi.org/10.1134/S1054661811020210

Keywords

Navigation