跳到主要导航 跳到搜索 跳到主要内容

Bi-Directional Deep Contextual Video Compression

  • Xihua Sheng
  • , Li Li*
  • , Dong Liu
  • , Shiqi Wang
  • *此作品的通讯作者
  • University of Science and Technology of China
  • Hefei Comprehensive National Science Center
  • City University of Hong Kong

科研成果: 期刊稿件文章同行评审

摘要

Deep video compression has made impressive process in recent years, with the majority of advancements concentrated on P-frame coding. Although efforts to enhance B-frame coding are ongoing, their compression performance is still far behind that of traditional bi-directional video codecs. In this article, we introduce a bi-directional deep contextual video compression scheme tailored for B-frames, termed DCVC-B, to improve the compression performance of deep B-frame coding. Our scheme mainly has three key innovations. First, we develop a bi-directional motion difference context propagation method for effective motion difference coding, which significantly reduces the bit cost of bi-directional motions. Second, we propose a bi-directional contextual compression model and a corresponding bi-directional temporal entropy model, to make better use of the multi-scale temporal contexts. Third, we propose a hierarchical quality structure-based training strategy, leading to an effective bit allocation across large groups of pictures (GOP). Experimental results show that our DCVC-B achieves an average reduction of 26.6% in BD-Rate compared to the reference software for H.265/HEVC under random access conditions. Remarkably, it surpasses the performance of the H.266/VVC reference software on certain test datasets under the same configuration. We anticipate our work can provide valuable insights and bring up deep B-frame coding to the next level.

源语言英语
页(从-至)5632-5646
页数15
期刊IEEE Transactions on Multimedia
27
DOI
出版状态已出版 - 2025
已对外发布

指纹

探究 'Bi-Directional Deep Contextual Video Compression' 的科研主题。它们共同构成独一无二的指纹。

引用此