Transcript Slide 1
ON BETWEEN-COEFFICIENT
CONTRAST MASKING OF DCT BASIS
FUNCTIONS
1
Nikolay Ponomarenko (*), Flavia Silvestri(**),
Karen Egiazarian (***), Marco Carli (**),
Jaakko Astola (***) and Vladimir Lukin (*)
(*) National Aerospace University, Kharkov, Ukraine
(**) University of Rome "Roma TRE", Rome, Italy
(***) Tampere University of Technology, Tampere, Finland
Marco Carli
VPQM 2006
26/01/2007
2
Outline
Outline
1.
2.
3.
4.
5.
Introduction
Proposed model of between-coefficient contrast masking of DCT basis functions
Modification of PSNR using a new masking model
MATLAB implementation of the proposed measure
A set of test images for comparative analysis for taking into account the masking
effect in quality metrics
6. Subjective experiment to test quality measures
7. Results of the experiment
8. Examples of quality assessment of test images
9. Example of use of the proposed model to masking noise on a real image
10.Summary and Conclusion
Marco Carli
VPQM 2006
26/01/2007
3
Introduction
Human visual sensitivity
varies as a function of several key
image properties, such as:
Light level
Spatial frequency
Color
Masking model can be used in :
Image and video compression
Image filtering
Digital watermarking
Validation of effectiveness of image processing
methods
Local image contrast
Eccentricity
Temporal frequency
Goal of the research:
Efficient accounting for local image
contrast using a model of betweencoefficient contrast masking of
DCT basis functions
Marco Carli
Requirements to the model:
Images compressed (filtered or processed) with
accounting the model can be visualized in
unknown illumination conditions, monitor
brightness, distance to the monitor, viewing angle,
etc. Thus such model should operate by only
some averaged parameters of image visualization
VPQM 2006
26/01/2007
4
Proposed model of between-coefficient
contrast masking of DCT basis functions
Let us denote a weighted energy of DCT coefficients of an image block 8x8 as Ew(X):
7
7
E w (X ) X ij C ij
2
i 0
(1)
j 0
where Xij is a DCT coefficient with indices i,j, Cij is a correcting factor determined by the CSF.
The DCT coefficients X and Y are visually undistinguished if Ew(X-Y) < max(Ew(X)/16,
Ew(Y)/16), where Ew(X)/16 is a masking effect Em of DCT coefficients X (normalizing factor 16
has been selected experimentally).
Reducing of the masking effect due to an edge presence in the
analyzed image block: we propose to reduce a masking effect
for a block D proportionally to the local variances V(.) in blocks
D1, D2, D3, D4 in comparison to the entire block:
Em(D) = Ew(D)δ(D)/16,
(2)
where δ(D) = (V(D1)+V(D2)+V(D3)+V(D4))/4V(D), V(D) is the
variance of the pixel values in block D.
Marco Carli
VPQM 2006
26/01/2007
5
Proposed model of between-coefficient
contrast masking of DCT basis functions
Values of Cij have been obtained using the quantization table for the color component Y of
JPEG (the values of quantization table JPEG have been normalized by 10 and squared)
JPEG Quantization table of Y
component
16
11
10
16
24
40
51
Values of Cij
i\j
0
1
2
3
4
5
6
7
0
0
0.8264
1.0000
0.3906
0.1736
0.0625
0.0384
0.0269
1
0.6944 0.6944
0.5102
0.2770
0.1479
0.0297
0.0278
0.0331
2
0.5102 0.5917
0.3906
0.1736
0.0625
0.0308
0.0210
0.0319
3
0.5102 0.3460
0.2066
0.1189
0.0384
0.0132
0.0156
0.0260
4
0.3086 0.2066
0.0730
0.0319
0.0216
0.0084
0.0094
0.0169
61
12
12
14
19
26
58
60
55
14
13
16
24
40
57
69
56
14
17
22
29
51
87
80
62
18
22
37
56
68 109 103 77
24
35
55
64
81 104 113 92
49
64
78
87 103 121 120 101
5
0.1736 0.0816
0.0331
0.0244
0.0152
0.0092
0.0078
0.0118
72
92
95
98 112 100 103 99
6
0.0416 0.0244
0.0164
0.0132
0.0094
0.0068
0.0069
0.0098
7
0.0193 0.0118
0.0111
0.0104
0.0080
0.0100
0.0094
0.0102
Marco Carli
VPQM 2006
26/01/2007
6
Modification of PSNR using a new
masking model
A basis of the proposed metric is a PSNR-HVS (Egiazarian K., Astola J., Ponomarenko N., Lukin V.,
Battisti F., Carli M. “New full-reference quality metrics based on HVS”, CD-ROM Proceedings of the Second
International Workshop on Video Processing and Quality Metrics, Scottsdale, USA, 2006, 4 p).
Block 8x8
of original
image
Block 8x8
of distorted
image
DCT of
difference
between
pixel
values
Reduction
by value of
contrast
masking
MSEH
calculation
of the block
Flow-chart of PSNR-HVS-M calculation
Reduction by value of contrast masking in
accordance to the proposed model is carried out in
the following manner. First, the maximal masking
effect Emax is calculated as max(Em(Xe), Em(Xd))
where Xe and Xd are the DCT coefficients of a
original image block and a distorted image block,
respectively. Then, the visible difference between Xe
and Xd is determined as:
Marco Carli
X eij X dij , i 0, j 0
0, X eij X dij E norm / C ij
X∆ij =
X eij X dij E norm / C ij , X eij X dij E norm / C ij
X X E / C , otherwise
dij
norm
ij
eij
where Enorm is
E max / 64 .
VPQM 2006
26/01/2007
7
MATLAB implementation of the
proposed measure
The MATLAB implementation of PSNR-HVS-M is available on www.cs.tut.fi/~ponom/psnrhvsm.htm
Marco Carli
VPQM 2006
26/01/2007
A set of test images for comparative
analysis for taking into account the
masking effect in quality metrics
8
While creating an image test set we took into
consideration the following:
Such set should contain images with both spatially
uncorrelated and correlated noise (the latter one is
typical for images formed by digital cameras and is
more visible for humans);
The set should contain images with noise distributed
spatially uniformly and with noise which is masked or
unmasked (concentrated in regions with maximal and
minimal masking properties, respectively);
The set is to be maximally simple for visual
comparison by humans (because of this in our set we
used only three values of noise variance σ2 and a total
number of distorted test images was 2x3x3 = 18
images).
Original test images having a lot of different type
regions with high masking effect
Marco Carli
VPQM 2006
26/01/2007
9
Subjective experiment to test quality
measures
Result of the experiment: the test image set ordered according to subjective visual quality.
Number of observers: 155 (45 from Finland, 43 from Italy, 67 from Ukraine).
Number of comparisons of visual appearance of test images: 8192 (on average 53 for each observer).
17” or 19” Monitor Resolution: 1152x864 pixels.
Number of experiments carried out using CRT monitors: 128.
Number of experiments carried out using LCD monitors: 27.
Cross correlation factors
Group of observers
Spearman correlation
Kendall correlation
Finland – Italy
0.996
0.895
Finland – Ukraine
0.996
0.935
Italy - Ukraine
0.997
0.961
CRT - LCD
0.998
0.922
Marco Carli
VPQM 2006
26/01/2007
10
Results of the experiment
Spearman Kendall
correlation correlation
Measure
Reference
PSNR-HVS-M
This paper
0.984
0.948
PSNR-HVS
Egiazarian K., Astola J., Ponomarenko N., Lukin V., Battisti F., Carli M. “New fullreference quality metrics based on HVS”, CD-ROM Proceedings of the Second Intern.
Workshop on Video Processing and Quality Metrics, Scottsdale, USA, 2006, 4 p
0.895
0.712
NQM
Damera-Venkata N., Kite T., Geisler W., Evans B. and Bovik A. "Image Quality
Assessment Based on a Degradation Model", IEEE Trans. on Image Processing, Vol.
9, 2000, pp. 636-650
0.857
0.673
Solomon J. A., Watson A. B., and Ahumada A. “Visibility of DCT basis functions:
Effects of contrast masking”. Proc. of Data Compression Conf., 1994, pp. 361-370
http://vision.arc.nasa.gov/dctune/ - DCTune 2.0 page
0.829
0.712
Wang Z., Bovik A. “A universal image quality index”, IEEE Signal Processing Letters,
vol. 9, March, 2002, pp. 81–84
0.550
0.438
0.537
0.359
DCTune
UQI
PSNR
Peak Signal to Noise Ratio
VQM
Xiao F. “DCT-based Video Quality Evaluation”, Final Project for EE392J, 2000
0.441
0.281
SSIM
Wang Z., Bovik A., Sheikh H., Simoncelli E. “Image quality assessment: from error
visibility to structural similarity”, IEEE Trans. on Image Proc., vol.13, 2004, pp.600-612
0.406
0.358
VIF
Sheikh H. R. and Bovik A. C., "Image Information and Visual Quality", IEEE
Transactions on Image Processing, vol. 15, February, 2006, pp. 430-444
0.377
0.255
PQS
Miyahara, M., Kotani, K., Algazi, V.R. ”Objective picture quality scale (PQS) for image
coding”, IEEE Transactions on Communications, vol. 46, issue 9, 1998, pp. 1215-1226
0.302
0.242
Marco Carli
VPQM 2006
26/01/2007
11
Examples of quality assessment of test
images
DCTune = 24.9, PSNR-HVS-M = 33.20 dB
PSNR-HVS-M says: “This is better!”
Marco Carli
DCTune = 24.5, PSNR-HVS-M = 29.31 dB
DCTune says: “This is better!”
VPQM 2006
26/01/2007
12
Examples of quality assessment of test
images
SSIM = 0.80, PSNR-HVS-M = 25.50 dB
SSIM says: “This is better!”
Marco Carli
SSIM = 0.79, PSNR-HVS-M = 31.29 dB
PSNR-HVS-M says: “This is better!”
VPQM 2006
26/01/2007
13
Example of use of the proposed model
to masking noise on a real image
Original test image Baboon
Marco Carli
The image with masked noise,
PSNR=26.18 dB, MSE=158,
PSNR-HVS=34.43 dB, PSNR-HVS-M=51.67 dB
VPQM 2006
26/01/2007
14
Summary and Conclusion
Summary
A simple and efficient model of between-coefficient contrast masking of DCT basis functions
is proposed;
A modification of PSNR that takes into account this masking model is proposed;
Subjective experiments on comparison of known quality metrics are carried out;
Conclusions
The proposed measure based on the designed masking model has demonstrated the best
correspondence to the results of the subjective experiments. However for providing more
reliable conclusions on efficiency of the proposed model it is necessary to carry out additional
more extensive experiments and research.
The proposed test set has allowed to demonstrate drawbacks of many well known metrics
that do not fully or even badly correspond to human visual perception.
Marco Carli
VPQM 2006
26/01/2007