A CCD based machine vision system for real-time text detection

Zhao, Shihua; Sun, Lipeng; Li, Gang; Liu, Yun; Liu, Binbing

doi:10.1007/s12200-019-0854-0

A CCD based machine vision system for real-time text detection

Research Article
Published: 05 August 2019

Volume 13, pages 418–424, (2020)
Cite this article

Frontiers of Optoelectronics Aims and scope Submit manuscript

Shihua Zhao¹,
Lipeng Sun¹,
Gang Li²,
Yun Liu¹ &
…
Binbing Liu³

104 Accesses
1 Citation
Explore all metrics

Abstract

Text detection and recognition is a hot topic in computer vision, which is considered to be the further development of the traditional optical character recognition (OCR) technology. With the rapid development of machine vision system and the wide application of deep learning algorithms, text recognition has achieved excellent performance. In contrast, detecting text block from complex natural scenes is still a challenging task. At present, many advanced natural scene text detection algorithms have been proposed, but most of them run slow due to the complexity of the detection pipeline and cannot be applied to industrial scenes. In this paper, we proposed a CCD based machine vision system for real-time text detection in invoice images. In this system, we applied optimizations from several aspects including the optical system, the hardware architecture, and the deep learning algorithm to improve the speed performance of the machine vision system. The experimental data confirms that the optimization methods can significantly improve the running speed of the machine vision system and make it meeting the real-time text detection requirements in industrial scenarios.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Text Detection and Recognition from the Scene Images Using RCNN and EasyOCR

Exploiting colour information for better scene text detection and recognition

Article 19 February 2015

Text Detection with Deep Neural Network

References

Contes A, Carpenter B, Case C, Satheesh S, Suresh B, Wang T, Wu J D, Ng A Y. Text detection and character recognition in scene images with unsupervised feature learning. In: Proceedings of International Conference on Document Analysis and Recognition. Beijing: IEEE, 2011, 440–445
Google Scholar
Ye Q, Doermann D. Text detection and recognition in imagery: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(7): 1480–1500
Article Google Scholar
Zhang X, Gao X, Tian C. Text detection in natural scene images based on color prior guided MSER. Neurocomputing, 2018, 307: 61–71
Article Google Scholar
Smith R. An overview of the tesseract OCR engine. In: Proceedings of International Conference on Document Analysis and Recognition. Parana: IEEE, 2007, 629–633
Google Scholar
Epshtein B, Ofek E, Wexler Y. Detecting text in natural scenes with stroke width transform. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Francisco: IEEE, 2010, 2963–2970
Google Scholar
Jaderberg M, Simonyan K, Vedaldi A, Zisserman A. Reading text in the wild with convolutional neural networks. International Journal of Computer Vision, 2016, 116(1): 1–20
Article MathSciNet Google Scholar
Ren S, He K, Girshick R, Sun J. Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137–1149
Article Google Scholar
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S E, Fu C, Berg A C. SSD: single shot MultiBox detector. In: Proceedings of European Conference on Computer Vision. Berlin: Springer, 2016, 21–37
Google Scholar
Redmon J, Divvala S K, Girshick R B, Farhadi A. You only look once: unified, real-time object detection. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016, 779–788
Google Scholar
Tian Z, Huang W, He T, He P, Qiao Y. Detecting text in natural image with connectionist text proposal network. In: Proceedings of European Conference on Computer Vision. Berlin: Springer, 2016, 56–72
Google Scholar
Ma J, Shao W, Ye H,Wang L,Wang H, Zheng Y, Xue X. Arbitrary-oriented scene text detection via rotation proposals. IEEE Transactions on Multimedia, 2018, 20(11): 3111–3122
Article Google Scholar
Liu Y, Jin L. Deep matching prior network: toward tighter multioriented text detection. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017, 3454–3461
Google Scholar
Shi B, Bai X, Belongie S. Detecting oriented text in natural images by linking segments. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Honolulu: IEEE, 2017: 3482–3490
Google Scholar
Liao M, Shi B, Bai X, Wang X, Liu W. TextBoxes: A fast text detector with a single deep neural network. 2016, arXiv:1611.06779
Liao M, Shi B, Bai X. TextBoxes ++: a single-shot oriented scene text detector. IEEE Transactions on Image Processing, 2018, 27(8): 3676–3690
Article MathSciNet Google Scholar
Dai Y, Huang Z, Gao Y, Xu Y, Chen K, Guo J, Qiu W. Fused text segmentation networks for multi-oriented scene text detection. 2018, arXiv:1709.03272
Hu H, Zhang C, Luo Y, Wang Y, Han J, Ding E. WordSup: exploiting word annotations for character based text detection. In: Proceedings of IEEE International Conference on Computer Vision. Venice: IEEE, 2017, 4950–4959
Google Scholar
Deng J, Dong W, Socher R, Li L J, Li K, Li F F. ImageNet: a largescale hierarchical image database. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Miami: IEEE, 2009, 248–255
Google Scholar
Karatzas D, Shafait F, Uchida S, Iwamura M, Bigorda L G I, Mestre S R, Mas J, Mota D F, Almazàn J A, Heras L P D L. ICDAR 2013 robust reading competition. In: Proceedings of International Conference on Document Analysis and Recognition. Washington, DC: IEEE, 2013, 1484–1493
Google Scholar
Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. 2014, arXiv:1409.1556
Howard A G, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H. MobileNets: efficient convolutional neural networks for mobile vision applications. 2017, arXiv:1704.04861
Zhang X, Zhou X, Lin M, Sun J. ShuffleNet: an extremely efficient convolutional neural network for mobile devices. 2017, arXiv:1707.01083v2
Iandola F N, Han S, Moskewicz M W, Ashraf K, DallyW J, Keutzer K. SqueezeNet: AlexNet-level accuracy with 50 × fewer parameters and < 0.5 MB model size. 2016, arXiv:1602.07360v4

Download references

Author information

Authors and Affiliations

State Grid Hunan Electric Power Corporation Limited Research Institute, Changsha, 410007, China
Shihua Zhao, Lipeng Sun & Yun Liu
State Grid Hunan Electric Power Corporation Limited, Changsha, 410007, China
Gang Li
School of Optical and Electronics Information, Huazhong University of Science and Technology, Wuhan, 430074, China
Binbing Liu

Authors

Shihua Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Lipeng Sun
View author publications
You can also search for this author in PubMed Google Scholar
Gang Li
View author publications
You can also search for this author in PubMed Google Scholar
Yun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Binbing Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Binbing Liu.

Additional information

Shihua Zhao graduated from Chongqing University and obtained the Ph.D. degree in 2013. He is now working in State Grid Hunan Electric Power Corporation Limited Research Institute. His main research interests include high voltage technology, power transformer fault detection and diagnosis. He is the author or the co-author of several technical papers.

Lipeng Sun completed the B.S. degree in 2005 from Chongqing University, Chongqing, China. He also received his master degree in 2008 from Chongqing University, Chongqing, China. His current research interest is high voltage and insulation technology.

Gang Li completed the B.S. degree in 1996 from Xi’an Jiaotong University, Xi’an, China. He has been engaged in operation and maintenance technology of transformer over 20 years. His current research interest is high voltage and insulation technology.

Yun Liu received his master degree in 2009 from Guangxi University, Nanning, China. He received deputy senior engineer in 2013. His research interest is high voltage and insulation technology.

Binbing Liu received the bachelor degree in Optoelectronics from Huazhong University of Science and Technology, Wuhan, in 2000, and got the master degree and Ph.D. degree in Physical Electronics from the same university, in 2003 and 2013, respectively. Now, he is working at School of Optical and Electronics Information as a lecturer. His research interests include computer vision and machine learning.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhao, S., Sun, L., Li, G. et al. A CCD based machine vision system for real-time text detection. Front. Optoelectron. 13, 418–424 (2020). https://doi.org/10.1007/s12200-019-0854-0

Download citation

Received: 28 August 2018
Accepted: 08 October 2018
Published: 05 August 2019
Issue Date: December 2020
DOI: https://doi.org/10.1007/s12200-019-0854-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A CCD based machine vision system for real-time text detection

Abstract

Access this article

Similar content being viewed by others

Text Detection and Recognition from the Scene Images Using RCNN and EasyOCR

Exploiting colour information for better scene text detection and recognition

Text Detection with Deep Neural Network

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A CCD based machine vision system for real-time text detection

Abstract

Access this article

Similar content being viewed by others

Text Detection and Recognition from the Scene Images Using RCNN and EasyOCR

Exploiting colour information for better scene text detection and recognition

Text Detection with Deep Neural Network

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation