Image Classification Using Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN): A Review

Dhruv, Patel; Naskar, Subham

doi:10.1007/978-981-15-1884-3_34

Patel Dhruv¹⁷ &
Subham Naskar¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1101))

39 Citations

Abstract

With the advent of technologies, real-time data is essentially required for future development. Everyday, a huge amount of visual data is being collected, but to use it efficiently, we need to recognize, understand and arrange the visual data for a perfect approach. So, the neural network was introduced to find out patterns from images, a form of visual data as the neuron functionality in a human brain. It is biologically inspired programming approach to allow the machine to learn from observational data. Neural networks have provided solutions to several problems of image recognition, and it is actively utilized in the medical field due to its efficiency. This paper concentrates upon the use of RNN and CNN in the feature extraction of images and the challenges. The paper also presents a brief literature review of the neural networks like CNN and RNN.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Image shows the sideview of Varanasi, and is downloaded from https://www.cleartrip.com/activities/Varanasi/ganga-aarti-tour-4-hours. on August 16, 2019.
LeCun, Y., L. Bottou, Y. Bengio, and P. Haffner. 1998. Gradient-based learning applied to document recognition. Proceedings of the IEEE 86: 2278–2324.
Article Google Scholar
Yamins, D, H. Hong, C. Cadieu, and J.J. Dicarlo. 2013. Hierarchical modular optimization of convolutional networks achieves representations similar to Macaque IT and human ventral stream. In 27th Annual Conference on Neural Information Processing Systems, NIPS 2013, December 5–10, 2013, Lake Tahoe, NV, United States, 2013. Air Force Office of Scientific Research (AFOSR), Amazon.com, Facebook, Google, Microsoft Research.
Google Scholar
Hochreiter, S., and J. Schmidhuber. Long short-term memory. Neural computation.
Google Scholar
Understanding a 3D CNN and its Uses. https://missinglink.ai/guides/neural-network-concepts. on 16 August, 2019.
Ouadfel, S., and S. Meshoul. 2012. Handling fuzzy image clustering with a modified ABC algorithm. International Journal of Intelligent Systems and Applications 4: 65.
Article Google Scholar
Das, S., and A. Konar. 2009. Automatic image pixel clustering with an improved differential evolution. Applied Soft Computing 9: 226–236.
Article Google Scholar
Yu, Z., W. Yu, R. Zou, and S. Yu. 2009. On ACO-based fuzzy clustering for image segmentation. In Advances in Neural Networks–ISNN 2009, 717–726. Berlin, Heidelberg: Springer.
Google Scholar
Prabhu. 2018. Understanding of Convolutional Neural Network (CNN)—Deep Learning. https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning-99760835f148. March 4, 2018.
Wikipedia. 2019. Convolution Neural Network. https://en.wikipedia.org/wiki/Convolutional_neural_network. July 9, 2019.
Oruganti, Ram Manohar. 2016. Image description using deep neural networks. Thesis, Rochester Institute of Technology.
Google Scholar
Ava Soleimany. 2019. MIT 6.S191: Convolutional Neural Networks. https://www.youtube.com/watch?v=HHVZJ7kGI0&list=PLtBwnjQRUrwp5__7C0oIVt26ZgjG9NI&index=3. on February 11, 2019.
Functionality of CNN. https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-networkcnn-deep-learning-99760835f148. on August 16, 2019.
Banerjee, Survo. 2018. An Introduction to Recurrent Neural Networks. on May 23, 2018.
Google Scholar
Gupta, Dishashree. 2017. Fundamentals of Deep Learning—Introduction to Recurrent Neural Networks.
Google Scholar
Thapliyal, Manish. 2018. Vanishing Gradients in RNN.
Google Scholar
Sammani, Fawaz. 2019. Applied Deep Learning with Pytorch published.
Google Scholar
LeCun, B.B., J.S. Denker, D. Henderson, R.E. Howard, W. Hubbard, and L.D. Jackel. 1990. Handwritten digit recognition with a back-propagation network. In NIPS.
Google Scholar
Krizhevsky, Alex I. Sutskever, and G.E. Hinton. 2012. Imagenet classification with deep convolutional neural networks. In NIPS.
Google Scholar
Zeiler, M., Taylor, G., and Fergus, R. 2011. Adaptive deconvolutional networks or mid and high-level feature learning. In ICCV.
Google Scholar
Szegedy, C., W. Liu, Y. Jia, P. Sermanet, S. Reed. 2014. Going deeper with convolutions. In CVPR.
Google Scholar
Ioffe, S., and C. Szegedy. 2015. Batch normalization: accelerating deep network training by reducing internal covariate shift. In Proceedings of the 32nd ICML.
Google Scholar
Szegedy, C., V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna. 2015. Rethinking the inception architecture for computer vision. arXiv:1512.
Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke. 2016. Inception-v4, Inception-ResNet and the impact of residual connections on learning. arXiv:1602.07261.
Hinton, G.E., N. Srivastava, A. Krizhevsky, I. Sutskever, and R.R. Salakhutdinov. 2012. Improving neural networks by preventing co-adaptation of feature detectors. Preprint at arXiv:1207.0580.
Thad Hughes, and Keir Mierle. 2013. RNN for voice activity detection published. In IEEE International Conference on Acoustics, Speech and Signal Processing.
Google Scholar
Heigold, G.V., A. Vanhoucke, P. Senior, M. Nguyen, M. Ranzato, and Devin J. Dean. 2013. Multilingual acoustic models using distributed deep neural networks. In IEEE International Conference on Acoustics, Speech and Signal Processing.
Google Scholar
Vanhoucke, Vincent, Matthieu Devi, and Georg Heigold. 2013. Multiframe deep neural networks for acoustic modeling. In IEEE International Conference on Acoustics, Speech and Signal Processing.
Google Scholar
Yik-Cheung Tam, Yun Lei, Jing Zheng, and Wen Wang. 2014. ASR error detection using recurrent neural network language model and complementary ASR. In IEEE (ICASSP).
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Engineering, KIIT, Bhubaneswar, India
Patel Dhruv & Subham Naskar

Authors

Patel Dhruv
View author publications
You can also search for this author in PubMed Google Scholar
Subham Naskar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Subham Naskar .

Editor information

Editors and Affiliations

Department of Computer Science, Rama Devi Women’s University, Bhubaneswar, Odisha, India
Debabala Swain
School of Computer Engineering, Kalinga Institute of Industrial Technology Deemed to be University, Bhubaneswar, Odisha, India
Prasant Kumar Pattnaik
Department of Computer Science and Engineering, Jaypee University of Information Technology, Waknaghat, Himachal Pradesh, India
Pradeep K. Gupta

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dhruv, P., Naskar, S. (2020). Image Classification Using Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN): A Review. In: Swain, D., Pattnaik, P., Gupta, P. (eds) Machine Learning and Information Processing. Advances in Intelligent Systems and Computing, vol 1101. Springer, Singapore. https://doi.org/10.1007/978-981-15-1884-3_34

Download citation

DOI: https://doi.org/10.1007/978-981-15-1884-3_34
Published: 24 March 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1883-6
Online ISBN: 978-981-15-1884-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics