Skip to main content

An Automated Invoice Handling Method Using OCR

  • Conference paper
  • First Online:
Data Intelligence and Cognitive Informatics

Part of the book series: Algorithms for Intelligent Systems ((AIS))

Abstract

Conventional methods of managing invoices also call for unnecessary human capital. Invoice handling (IIH) has gained significant attention because it can increase process performance and save manpower costs as well. Among these procedures, one of the most critical steps is the extraction of usable data from invoices. Optical character recognition (OCR) technology is a technique that gives you full alphanumeric recognition of handwritten or printed characters present in images and the export of information. This paper focuses on automated invoice processing using OCR which is necessary in this case and that could help in the real-time applications. The objective is to extract details from invoices like invoice number, date, final payment amounts, and related descriptions of bills and invoices. The required information extracted from the invoice is exported into a database for future reference. The records from these invoices or bills can be used extensively later on for statistical analysis or in machine learning.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Bayar S (2016) Invoice processing performance analysis on e-archive on various embedded platforms. In: October 2016 in proceedings of IEEE 10th international conference on application of information communication technology (AICT), Baku, Azerbaijan, pp 1–4

    Google Scholar 

  2. Delie M, Jian L, Jinwen T (2002) Designing and implementing a Chinese framework for the identification of financial invoices. In June 2002 in proceedings of international symposium on VIPromCom video/image processing multimedia communication, Zadar, Croatia, pp 79–82

    Google Scholar 

  3. Alippi C, Pessina F, Roveri M (2005) Adaptive program for the classification of automated invoices –documents. In: September 2005 in proceedings of international conference on image processing, Genova, Italy, pp II-526

    Google Scholar 

  4. Wang X, Hong Y, Chang H, Park K, Langdale G, Hu J, Zhu H (2019) Hyperscan: for modern CPUs, a fast regex matcher with multi patterns. In: 2019 in 16th {USENIX} symposium on the development and implementation of networked systems ({DINS} 2019), pp 631–648

    Google Scholar 

  5. Peng KC, Singhose W (2009) Machine vision and wand following crane control. In: 2009 IEEE international conference on mechatronics, Malaga, pp 1–6

    Google Scholar 

  6. Chui C, Parmar M (2019) Outward Inc. Image processing techniques based on machine learning. U.S. Patent Application 16/056,110

    Google Scholar 

  7. Sun Y, Zhang J, Meng Y, Yang J, Gui G (2019) Intelligent, smartphone-based invoice classification system using deep learning. IEEE Access 7:118046–118054

    Google Scholar 

  8. Alotaibi F, Abdullah MT, Abdullah RB, Rahmat RW, Hashem IA, Sangaiah AK (2017) Optical character recognition for matching similarity of quranic image. IEEE Access 6:554–562

    Google Scholar 

  9. Zhu C, Chen Y, Zhang Y, Liu S, Li G (2019) ResGAN: a low-level image processing network to restore compressed images to the original quality of JPEG. In: 2019 Data Compression Conference (DCC). IEEE, pp 616

    Google Scholar 

  10. Bartoli A, De Lorenzo A, Medvet E, Tarlao F (2017) Active learning of regular expressions for entity extraction. IEEE Trans Cybern 48(3):1067–1080

    Google Scholar 

  11. Manoharan S (2019) A smart image processing algorithm for text recognition, information extraction and vocalization for the visually challenged. J Innov Image Process (JIIP) 1(01):31–38

    Article  Google Scholar 

  12. Tesseract For Open Source http://code.google.com/p/tesseract-ocr. Accessed November 2017

  13. A Description of the Tesseract OCR Engine—Analysis at Google. https://research.google.com/pubs/archive/33418.pdf

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pranay Kumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kumar, P., Revathy, S. (2021). An Automated Invoice Handling Method Using OCR. In: Jeena Jacob, I., Kolandapalayam Shanmugam, S., Piramuthu, S., Falkowski-Gilski, P. (eds) Data Intelligence and Cognitive Informatics. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-15-8530-2_19

Download citation

Publish with us

Policies and ethics