跳到主要导航 跳到搜索 跳到主要内容

Towards Modality Transferable Visual Information Representation with Optimal Model Compression

  • Rongqun Lin
  • , Linwei Zhu
  • , Shiqi Wang*
  • , Sam Kwong*
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Compactly representing the visual signals is of fundamental importance in various image/video-centered applications. Although numerous approaches were developed for improving the image and video coding performance by removing the redundancies within visual signals, much less work has been dedicated to the transformation of the visual signals to another well-established modality for better representation capability. In this paper, we propose a new scheme for visual signal representation that leverages the philosophy of transferable modality. In particular, the deep learning model, which characterizes and absorbs the statistics of the input scene with online training, could be efficiently represented in the sense of rate-utility optimization to serve as the enhancement layer in the bitstream. As such, the overall performance can be further guaranteed by optimizing the new modality incorporated. The proposed framework is implemented on the state-of-the-art video coding standard (i.e., versatile video coding), and significantly better representation capability has been observed based on extensive evaluations.

源语言英语
主期刊名MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia
出版商Association for Computing Machinery, Inc
3705-3714
页数10
ISBN(电子版)9781450379885
DOI
出版状态已出版 - 12 10月 2020
已对外发布
活动28th ACM International Conference on Multimedia, MM 2020 - Virtual, Online, 美国
期限: 12 10月 202016 10月 2020

出版系列

姓名MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia

会议

会议28th ACM International Conference on Multimedia, MM 2020
国家/地区美国
Virtual, Online
时期12/10/2016/10/20

指纹

探究 'Towards Modality Transferable Visual Information Representation with Optimal Model Compression' 的科研主题。它们共同构成独一无二的指纹。

引用此