跳到主要导航 跳到搜索 跳到主要内容

Learning the Scale in Reference Picture Resampling for Versatile Video Coding

  • Riyu Lu
  • , Yingwen Zhang
  • , Hengyu Man
  • , Meng Wang
  • , Shiqi Wang*
  • , Xiaopeng Fan*
  • *此作品的通讯作者
  • Harbin Institute of Technology
  • City University of Hong Kong
  • Lingnan University
  • Peng Cheng Laboratory

科研成果: 期刊稿件文章同行评审

摘要

Compressing high-resolution videos under low bitrate constraints is a challenging task. Resampling-based compression, which reduces the resolution before encoding and restores it after decoding, has great potential to improve the rate-distortion performance in such scenarios. In this paper, we propose a learning-based frame-level coding scale control scheme that enhances the coding performance by adjusting the coding scale for each frame. The scheme cooperates with the Reference Picture Resampling of the latest video coding standard Versatile Video Coding (VVC), which allows coding scale variations on each frame. More specifically, a dataset with 5200 videos is created by a greedy rate-distortion optimization algorithm employed to select the optimal coding scale for each frame. A neural network-based decision model is further incorporated into VVC, learning to predict the coding scale for each frame in one pass. The scheme is implemented into the Fraunhofer Versatile Video Encoder (VVenC), a fast and efficient VVC encoder, and evaluated on 4 K contents. Experimental results show that the proposed scheme outperforms GOP-based coding scale adaptation methods, achieving average bitrate savings of 3.06% and 4.14% in terms of PSNR and MS-SSIM.

源语言英语
页(从-至)5831-5842
页数12
期刊IEEE Transactions on Multimedia
27
DOI
出版状态已出版 - 2025
已对外发布

指纹

探究 'Learning the Scale in Reference Picture Resampling for Versatile Video Coding' 的科研主题。它们共同构成独一无二的指纹。

引用此