TY - JOUR
T1 - Divisively Normalized Sparse Coding
T2 - Toward Perceptual Visual Signal Representation
AU - Zhang, Xiang
AU - Ma, Siwei
AU - Wang, Shiqi
AU - Zhang, Jian
AU - Sun, Huifang
AU - Gao, Wen
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2021/8
Y1 - 2021/8
N2 - Sparse representation has been shown to be highly correlated with the visual perception of natural images, which can be characterized by a linear combination of neuronal responses in the visual cortex. Divisive normalization transform (DNT) has been proven to be an effective method in reducing statistical and perceptual dependencies for nonlinear properties in primary visual cortex. In this paper, we develop a divisively normalized sparse coding scheme, aiming to further bridge the gap between sparse representation and human visual perception. We show that such a scheme is perceptually meaningful for representing visual signals, with which the pixel-domain image representation and processing tasks can be feasibly and efficiently achieved in the divisively normalized sparse-domain. Specifically, we develop a sparse-domain similarity (SDS) index for perceptual quality evaluation, where the DNT is employed for transforming image signals into a perceptually uniform space. Furthermore, the proposed SDS index is employed to optimize the sparse coding process when representing natural images. The experimental results indicate that the SDS can provide accurate and consistent predictions of perceived image quality, and the performance of sparse coding can be significantly improved in terms of both objective and subjective quality evaluations.
AB - Sparse representation has been shown to be highly correlated with the visual perception of natural images, which can be characterized by a linear combination of neuronal responses in the visual cortex. Divisive normalization transform (DNT) has been proven to be an effective method in reducing statistical and perceptual dependencies for nonlinear properties in primary visual cortex. In this paper, we develop a divisively normalized sparse coding scheme, aiming to further bridge the gap between sparse representation and human visual perception. We show that such a scheme is perceptually meaningful for representing visual signals, with which the pixel-domain image representation and processing tasks can be feasibly and efficiently achieved in the divisively normalized sparse-domain. Specifically, we develop a sparse-domain similarity (SDS) index for perceptual quality evaluation, where the DNT is employed for transforming image signals into a perceptually uniform space. Furthermore, the proposed SDS index is employed to optimize the sparse coding process when representing natural images. The experimental results indicate that the SDS can provide accurate and consistent predictions of perceived image quality, and the performance of sparse coding can be significantly improved in terms of both objective and subjective quality evaluations.
KW - Divisive normalization
KW - image quality assessment (IQA)
KW - sparse representation
KW - visual perception
UR - https://www.scopus.com/pages/publications/85104509521
U2 - 10.1109/TCYB.2019.2899005
DO - 10.1109/TCYB.2019.2899005
M3 - 文章
C2 - 30843814
AN - SCOPUS:85104509521
SN - 2168-2267
VL - 51
SP - 4237
EP - 4250
JO - IEEE Transactions on Cybernetics
JF - IEEE Transactions on Cybernetics
IS - 8
M1 - 8657363
ER -