Subjective quality evaluation of compressed digital compound images

https://doi.org/10.1016/j.jvcir.2014.11.001Get rights and content

Highlights

Abstract

Visual quality evaluation of compressed Digital Compound Images (DCIs) becomes important in many multi-device communication systems. In this paper, we study subjective quality evaluation for compressed DCIs and investigate whether existing Image Quality Assessment (IQA) metrics are effective to evaluate the visual quality of compressed DCIs. A new Compound Image Quality Assessment Database (CIQAD) is therefore constructed, including 24 reference and 576 compressed DCIs. The subjective scores of these DCIs are obtained via visual judgement of 62 subjects using Paired Comparison (PC) in which the Hodgerank decomposition is adopted to generate uncompleted but near balanced pairs. Fourteen state-of-the-art IQA metrics are adopted to assess quality of images in CIQAD, and experimental results indicate that the existing IQA methods are limited in evaluating visual quality of DCIs. Compression results of five coding methods are thus compared with respect to different quality metrics to illustrate the limitation.

Introduction

Inspired by various Internet related applications, such as remote computing platforms [1], virtualized screen sharing systems [2], multi-client communication systems [3], and increasing visual contents are shared between different digital devices (i.e., computers, mobile phones, tablets, etc.). In these applications, visual contents (e.g., web pages, PowerPoint and PDF files, and computer screen images) are represented in the form of Digital Compound Images (DCIs). For efficient transmission among different devices, transmission systems should be deployed in a platform-free manners (without consideration of different protocols of different systems) and DCIs have to be highly compressed. For this requirement, cloud computing systems provide a strategy to share DCIs from the cloud to thin-clients; such systems render visual contents as DCIs in the cloud and then transmit them after compression to clients [4]. Many other computer generated compound images, such as promotional posters and microblogs, are also represented and stored in the form of DCIs.

Existing studies have demonstrated that traditional compression algorithms (such as JPEG, JPEG2000 and H.264 intra coding) cannot encode DCIs efficiently due to the high frequency components in textual regions [5], [6]. To address this problem, many studies [4], [5], [6], [7], [8] propose segmentation techniques to decompose DCIs into textual and pictorial parts, and encode these two different parts using different coding techniques. In these methods, the compression performance is generally evaluated by Peak Signal-to-Noise Ratio (PSNR), which is proved to be inconsistent with human visual perception [9]. The associated allocation of coding bits to textual and pictorial parts does not align with characteristics of Human Visual System (HVS). Generally, higher clarity of textual contents is required for better reading experience, with less degradation in pictorial regions.

Quality of Experience (QoE) has been investigated to evaluate users’ visual experience on web pages, so-called Web QoE [10]. Unfortunately, current Web QoE studies mainly focus on objective Quality of Service (QoS) metrics, e.g., loss ratio, round-trip times, full rendering time, etc. They rarely take into account the human perception for the webpage contents. For example, if texts and pictures are encoded and transmitted independently, given a certain overall loss ratio, different allocations to pictorial and textual parts may lead to quite different visual experiences.

Therefore, it is much desired to investigate the quality evaluation of compressed DCIs from the perspective of human visual perception. According to the results of visual quality evaluation, we can compare the compression performance of different algorithms for encoding DCIs. Moreover, visual quality evaluation for DCIs can be used to guide the processing (e.g., encoding, transmitting, rendering and enhancement) of DCIs. Currently, the visual quality evaluation of compressed DCIs has not yet been studied. Although various Image Quality Assessment (IQA) approaches have been proposed to evaluate quality of distorted natural images [11], [12], whether these IQA methods can be applied to evaluate the quality of compressed DCIs is still an open problem.

In this paper, we present an in-depth study on subjective quality assessment of DCIs with compression artifacts. A Compound Image Quality Assessment Database (CIQAD) is built, consisting of 24 reference images and 576 compressed versions. The reference images are obtained from Internet with various layouts of content. The distorted images are derived by five classical compression methods: JPEG, JPEG2000, H.264 intra coding, DjVu [13] and Layer Segmentation based Compression (LSC) [4]. We adopt the Pairwise Comparison (PC) method [14] to calculate Mean Opinion Score (MOS) values of these images towards better evaluation accuracy [15]. In particular, we employ the Hodgerank framework [16] to generate incomplete pairs in subjective test. This framework induces less effort and labor to human observers without significantly sacrificing evaluation accuracy and reliability. Based on the computed MOS values, 14 commonly used IQA approaches are employed to evaluate the quality of DCIs in CIQAD. To the best of our knowledge, this is the first attempt to study the quality assessment of DCIs. The CIQAD as well as the subjective scores are availabe to the research community for further quality assessment investigation of DCIs at https://sites.google.com/site/ciqadatabase/.

In the remaining of this paper, we first introduce the related work in Section 2. In Section 3, we present the detailed configuration of CIQAD. The subjective assessment methodology is described in Section 4. We report the experimental results in Section 5, and conclude the paper in the final section.

Section snippets

Related work

Based on image content, IQA methods can generally be classified into the following three categories: (1) IQA for natural images; (2) Document Image Quality Assessment (DIQA); and (3) Compound Image Quality Assessment (CIQA).

IQA for natural images has been studied tremendously during the past decade, and various metrics have been proposed [11], [17]. For subjective IQA, ITU-R BT.500-13 [14] defines several subjective methodologies for quality assessment of pictures, e.g. Single-Stimulus (SS),

Compound Image Quality Assessment Database (CIQAD)

To analyze visual quality of DCIs and the compression performance of different coding methods, we construct a new digital compound image database, CIQAD, which consists of 24 reference and 576 distorted DCIs. A subjective test is then conducted based on this database. In this section, we will describe the database in detail.

Subjective testing methodology

In this paper, we employ the PC approach for the subjective test in the experiment: each subject is simply asked to compare two images juxtaposed on a screen, and judge which one has better quality based on his/her visual perception. This methodology has been proven to be less annoying for subjects, as the multi-scale rating (e.g. 1–5) is reduced to a dichotomous choice, and has inherently less variation for images with highly apparent distortions. The subjective test is described in detail as

Experimental results

In this section, we first analyze the reliability of the computed MOS values, and then check the effectiveness of existing IQA methods for evaluating visual quality of images in CIQAD. Additionally, we compare the performance of the five compression algorithms for DCIs with respect to MOS values. Corresponding analysis and discussion for the experiments are also provided.

Conclusion and future work

In this paper, a new digital compound image database, CIQAD, has been constructed to investigate the quality evaluation of digital compound images. The PC strategy is adopted to implement the subjective testing, and the Hodgerank decomposition is employed to reduce effort and execution time with good reliability of the comparison results. The built database will facilitate further research in the related quality evaluation and other processing algorithms (e.g., compression, post-processing, and

References (58)

  • W. Lin et al.

    Perceptual visual quality metrics: a survey

    J. Vis. Commun. Image Represent.

    (2011)
  • H. Shen, Y. Lu, F. Wu, S. Li, A high-performance remote computing platform, in: IEEE International Conference on...
  • Y. Lu, S. Li, H. Shen, Virtualized Screen: a third element for cloud-mobile convergence, in: IEEE Computer Society,...
  • T.-H. Chang, Y. Li, Deep shot: a framework for migrating tasks across devices using mobile phone cameras, in: ACM...
  • Z. Pan et al.

    A low-complexity screen compression scheme for interactive screen sharing

    IEEE Trans. Circ. Syst. Video Technol.

    (2013)
  • T. Lin et al.

    Compound image compression for real-time computer screen image transmission

    IEEE Trans. Image Process.

    (2005)
  • C. Lan et al.

    Compress compound images in H.264/MPGE-4 AVC by exploiting spatial correlation

    IEEE Trans. Image Process.

    (2010)
  • H. Yang, W. Lin, C. Deng, Learning based screen image compression, in: IEEE International Workshop on Multimedia Signal...
  • Z. Pan, H. Shen, Y. Lu, Brower-friendly hybrid codec for compound image compression, in: IEEE International Symposium...
  • Z. Wang et al.

    Mean squared error: love it or leave it?

    IEEE Signal Process. Magaz.

    (2009)
  • T. Ciszkowski et al.

    Towards quality of experience-based reputation models for future web service provisioning

    Telecommun. Syst.

    (2012)
  • D.M. Chandler, Seven challenges in image quality assessment: past, present, and future research, ISRN Signal Processing...
  • L. Bottou et al.

    High quality document image compression with DjVu

    J. Electron. Imag.

    (1998)
  • I.-R. BT.500-13, Methodology for the subjective assessment of the quality of television pictures, in: International...
  • J. Li, M. Barkowsky, P.L. Callet, Subjective assessment methodology for preference of experience in 3DTV, in: IEEE...
  • Q. Xu et al.

    Hodgerank on random graphs for subjective video quality assessment

    IEEE Trans. Multimedia

    (2012)
  • A. Moorthy et al.

    Visual quality assessment algorithms: what does the future hold?

    Multimedia Tools Appl.

    (2011)
  • H. Quan et al.

    Study of rating scales for subjective quality assessment of high-definition video

    IEEE Trans. Broadcast.

    (2011)
  • R. Mantiuk et al.

    Comparison of four subjective methods for image quality assessment

    Comp. Graph. Forum

    (2012)
  • S. Winkler

    Analysis of public image and video databases for quality assessment

    IEEE J. Select. Topics Signal Process.

    (2012)
  • J. Wang et al.

    Computational model of stereoscopic 3D visual saliency

    IEEE Trans. Image Process.

    (2013)
  • M. Rubinstein, D. Gutierrez, O. Sorkine, A. Shamir, A comparative study of image retargeting, in: ACM SIGGRAPH Asia,...
  • H. Sheikh et al.

    A statistical evaluation of recent full reference image quality assessment algorithms

    IEEE Trans. Image Process.

    (2006)
  • N. Ponomarenko et al.

    TID2008A database for evaluation of full-reference visual quality assessment metrics

    Adv. Modern Radioelect.

    (2009)
  • E.C. Larson et al.

    Most apparent distortion: full-reference image quality assessment and the role of strategy

    J. Electron. Imag.

    (2010)
  • P.L. Callet, F. Autrusseau, Subjective Quality Assessment IRCCYN/IVC Database, 2005...
  • Z. Wang et al.

    Image quality assessment: from error visibility to structural similarity

    IEEE Trans. Image Process.

    (2004)
  • H. Sheikh et al.

    Image information and visual quality

    IEEE Trans. n Image Process.

    (2006)
  • L. Zhang et al.

    FSIM: a feature similarity index for image quality assessment

    IEEE Trans. Image Process.

    (2011)
  • Cited by (11)

    View all citing articles on Scopus

    This paper has been recommended for acceptance by M.T. Sun.

    View full text