Abstract
Recognition of Indian language scripts is a challenging problem. Work for the development of complete OCR systems for Indian language scripts is still in infancy. Complete OCR systems have recently been developed for Devanagri and Bangla scripts. Research in the field of recognition of Gurmukhi script faces major problems mainly related to the unique characteristics of the script like connectivity of characters on the headline, characters in a word present in both horizontal and vertical directions, two or more characters in a word having intersecting minimum bounding rectangles along horizontal direction, existence of a large set of visually similar character pairs, multi-component characters, touching characters which are present even in clean documents and horizontally overlapping text segments. This paper addresses the problems in the various stages of the development of a complete OCR for Gurmukhi script and discusses potential solutions.
Chapter PDF
References
Govindan, V. K., Shivaprasad, A. P.: Character recognition-A review. Pattern Recognition. Vol. 23. (1990) 671–683.
S. N. S. Rajasekaran, S. N. S., Deekshatulu, B. L.: Recognition of printed Telugu characters. Computer Graphics and Image Processing. Vol. 6. (1977) 335–360.
G. Siromoney, G., Chandrasekaran, R., Chandrasekaran, M.: Machine recognition of printed Tamil characters. Pattern Recognition. Vol. 10. (1978) 243–247.
Sinha, R. M. K., Mahabala, H. N.: Machine recognition of Devanagari script. IEEE Trans on Systems, Man and Cybernetics. Vol. 9. (1979) 435–449.
Chaudhuri, B. B., Pal, U.: A complete printed Bangla OCR system. Pattern Recognition. Vol. 31. (1998) 531–549.
Bansal, V.: Integrating knowledge sources in Devanagri text recognition. Ph.D. thesis. IIT Kanpur (1999).
Lehal, G. S., Singh, C.: Text segmentation of machine printed Gurmukhi script. Document Recognition and Retrieval VIII. Paul B. Kantor, Daniel P. Lopresti, Jiangying Zhou (eds.), Proceedings SPIE, USA. Vol. 4307. (2001) 223–231.
Lehal, G. S., Singh, C.: A shape based post processor for Gurmukhi OCR. Proceedings 6th International Conference on Document Analysis and Recognition, Seattle, USA. (2001) 1105–1109.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lehal, G.S., Singh, C. (2002). A Complete OCR System for Gurmukhi Script. In: Caelli, T., Amin, A., Duin, R.P.W., de Ridder, D., Kamel, M. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR /SPR 2002. Lecture Notes in Computer Science, vol 2396. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-70659-3_37
Download citation
DOI: https://doi.org/10.1007/3-540-70659-3_37
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44011-6
Online ISBN: 978-3-540-70659-5
eBook Packages: Springer Book Archive