Abstract
Compared with other existing video coding standards, H.264/AVC can achieve a significant improvement in compression performances. A robust criterion named the rate distortion optimization (RDO) is employed to select the optimal coding modes and motion vectors for each macroblock (MB), which achieves a high compression ratio while leading to a great increase in the complexity and computational load unfortunately. In this paper, a fast mode decision algorithm for H.264/AVC intra prediction based on integer transform and adaptive threshold is proposed. Before the intra prediction, integer transform operations on the original image are executed to find the directions of local textures. According to this direction, only a small part of the possible intra prediction modes are tested for RDO calculation at the first step. If the minimum mean absolute error (MMAE) of the reconstructed block corresponding to the best mode is smaller than an adaptive threshold which depends on the quantization parameter (QP), the RDO calculation is terminated. Otherwise, more possible modes need to be tested. The adaptive threshold aims to balance the compression performance and the computational load. Simulation results with various video sequences show that the fast mode decision algorithm proposed in this paper can accelerate the encoding speed significantly only with negligible PSNR loss or bit rate increment.
Similar content being viewed by others
References
Iain, E., Richardson, G.: H.264 and MPEG-4 Video Compression: Video Coding for Next-generation Multimedia, Chap. 6, pp. 159–224 John Wiley, New York, English (2003)
Sullivan, G., Wiegnad, T.: Video compression-from concepts to the H.264/AVC standard. In: Proceedings of IEEE, 93, pp. (1), 18–31 (2005)
Sullivan, G., Wiegand, T., Lim, K.P.: Joint model reference encoding methods and decoding concealment methods. Presented at the 9th Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VECG Metting (JVT-I049d0), San Diego CA (2003)
Pan F., Lin X., Rahardja S., Lim K.P., Li Z.G., Wu D.J. and Wu S. (2005). Fast mode decision algorithm for intraprediction in H.264/AVC Video Coding. IEEE Tans. Circuits Syst. Video Technol. 15(7): 813–822
Pan, F., Lin, X., Rahardja, S., Lim, K.P., Li, Z.G., Feng, G.N., Wu, D.J., Wu, S.: Fast mode decision algorithm for JVT intra prediction. Presented at the 7th Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VECG Metting (JVT-G013), Pattaya, Thailand, March 7–14 (2003)
Wu D., Pan F., Lim K.P., Wu S., Li Z.G., Lin X., Rahardja S. and Ko C.C. (2005). Fast intermode decision in H.264/ AVC video coding. IEEE Tans. Circuits Syst. Video Technol. 15(7): 953–958
Chen, Z., Zhou, P., He, Y.: Fast integer pel and fractional pel motion estimation for JVT. In: Presented at the 6th Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VECG Metting (JVT-F017), Awaji, Island, JP, 5–13, December 2002
Chen, Z., Zhou, P., He, Y.: Fast motion estimation for JVT. In: Presented at the 7th Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VECG Metting (JVT-G016), Pattaya, Thailand, 7–14, March 2003
Chio, W.I., Lee, J., Yang, S., Jeon, B.: Fast motion estimation and mode decision with variable motion block sizes. In: Proceedings SPIE—The International Society for Optical Engineering, vol. 5150 III, pp. 1561–1572 (2003)
Xin, J., Vetro, A., Sun, H.: Efficient macroblock coding-mode decision for H.264/AVC video coding. In: Proceedings of Picture Coding Symposium (PCS) (2004)
Lu, X., Tourapis, A., Yin, P., Boyce, J.: Fast mode decision and motion estimation for H.264 with a focus on MPEG-2/H.264 transcoding. In: Proceedings of 2005 IEEE International Symposium on Circuits and Systems (ISCAS), vol. 2, 23–26 May, pp. 1246–1249 (2005)
Jayant N.S. and Noll P. (1984). Digital Coding of Waveforms. Prentice-Hall, Englewood Cliffs
Pao I-M. and Sun M.-T. (1999). Modeling DCT coefficients for fast video encoding. IEEE Trans. Circuits Syst. Video Technol. 9(4): 608–616
Marpe, D., George, V., Cycon, H.L., Barthel, K.U.: Performance evaluation of motion-JPEG2000 in comparison with H.264/AVC operated in intra coding mode. In: SPIE Conference Wavelet Application in Industrial Processing, pp. 129–137 (2003)
Information Technology-Coding of Audio-Visual Objects-Part 10: Advanced Video Coding, Final Draft International Standard, ISO/IEC FDIS 14496-10, (2003)
Jain A.K. (1989). Fundamentals of Digital Image Processing. Prentice-Hall, Englewood Cliffs
JM6.1, Reference Software of JVT. [online] http://bs.hhi.de/ suehring/tml/download/jm61d.zip
Sullivan, G., Bjontegarrd, G.: Recommended simulation common conditions for H.26L coding efficiency experiments on low-resolution progressive-scan source material. In: Presented at the 14th VCEG-N81Meeting, Santa Barbara (2001)
JVT Test Model Ad Hoc Group: Evalution Sheet for Motion Estimation. ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6, Draft version 4, February 19 (2003)
Bjontegarrd, G.: Calculation of average PSNR differences between RD-curves. In: Presented at 13th VCEG-M33 Meeting, Austin (2001)
Su, R., Liu, G., Zhang, T.: Fast mode decision algorithm for intra prediction in H.264/AVC. In: Acoustics, Speech and Signal Processing, vol. 2, 14–19 May, pp. 921–924 (2006)
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is supported in part by China National Natural Science Foundation (CNSF) under Project No.60572045, the Ministry of Education of China Ph.D. Program Foundation under Project No.20050698033, and by a Cooperation Project (2005.7– 2007.7) with Microsoft Research Asia.
Rights and permissions
About this article
Cite this article
Su, R., Liu, G. & Zhang, T. Fast mode decision algorithm for intra prediction in H.264/AVC with integer transform and adaptive threshold. SIViP 1, 11–27 (2007). https://doi.org/10.1007/s11760-007-0002-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-007-0002-y