TY - JOUR
T1 - Learning the Scale in Reference Picture Resampling for Versatile Video Coding
AU - Lu, Riyu
AU - Zhang, Yingwen
AU - Man, Hengyu
AU - Wang, Meng
AU - Wang, Shiqi
AU - Fan, Xiaopeng
N1 - Publisher Copyright:
© IEEE. 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Compressing high-resolution videos under low bitrate constraints is a challenging task. Resampling-based compression, which reduces the resolution before encoding and restores it after decoding, has great potential to improve the rate-distortion performance in such scenarios. In this paper, we propose a learning-based frame-level coding scale control scheme that enhances the coding performance by adjusting the coding scale for each frame. The scheme cooperates with the Reference Picture Resampling of the latest video coding standard Versatile Video Coding (VVC), which allows coding scale variations on each frame. More specifically, a dataset with 5200 videos is created by a greedy rate-distortion optimization algorithm employed to select the optimal coding scale for each frame. A neural network-based decision model is further incorporated into VVC, learning to predict the coding scale for each frame in one pass. The scheme is implemented into the Fraunhofer Versatile Video Encoder (VVenC), a fast and efficient VVC encoder, and evaluated on 4 K contents. Experimental results show that the proposed scheme outperforms GOP-based coding scale adaptation methods, achieving average bitrate savings of 3.06% and 4.14% in terms of PSNR and MS-SSIM.
AB - Compressing high-resolution videos under low bitrate constraints is a challenging task. Resampling-based compression, which reduces the resolution before encoding and restores it after decoding, has great potential to improve the rate-distortion performance in such scenarios. In this paper, we propose a learning-based frame-level coding scale control scheme that enhances the coding performance by adjusting the coding scale for each frame. The scheme cooperates with the Reference Picture Resampling of the latest video coding standard Versatile Video Coding (VVC), which allows coding scale variations on each frame. More specifically, a dataset with 5200 videos is created by a greedy rate-distortion optimization algorithm employed to select the optimal coding scale for each frame. A neural network-based decision model is further incorporated into VVC, learning to predict the coding scale for each frame in one pass. The scheme is implemented into the Fraunhofer Versatile Video Encoder (VVenC), a fast and efficient VVC encoder, and evaluated on 4 K contents. Experimental results show that the proposed scheme outperforms GOP-based coding scale adaptation methods, achieving average bitrate savings of 3.06% and 4.14% in terms of PSNR and MS-SSIM.
KW - Rate-distortion optimization
KW - resampling-based compression
KW - versatile video coding
KW - video coding
UR - https://www.scopus.com/pages/publications/85218713041
U2 - 10.1109/TMM.2025.3543098
DO - 10.1109/TMM.2025.3543098
M3 - 文章
AN - SCOPUS:85218713041
SN - 1520-9210
VL - 27
SP - 5831
EP - 5842
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
ER -