
Compressing Scene Dynamics: A Generative Approach

Shanzhi Yin, Zihan Zhang, Bolin Chen, Shiqi Wang, Yan Ye

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

This paper proposes learning generative priors from motion patterns, rather than from video content, for generative video compression. The priors are derived from small motion dynamics in common scenes, such as trees swinging in the wind and boats floating on the sea. Using these compact motion priors, a novel generative scene dynamics compression framework is built to realize ultra-low bit-rate communication and high-quality reconstruction for diverse scene contents. At the encoder side, motion priors are characterized into compact representations in a dense-to-sparse manner. At the decoder side, the decoded motion priors serve as trajectory hints for scene dynamics reconstruction via a diffusion-based, flow-driven generator. Experimental results show that the proposed method achieves superior rate-distortion performance and outperforms the state-of-the-art conventional video codec, Versatile Video Coding (VVC), on scene dynamics sequences.

Original language: English
Title of host publication: Proceedings - DCC 2025
Subtitle of host publication: 2025 Data Compression Conference
Editors: Ali Bilgin, James E. Fowler, Joan Serra-Sagrista, Yan Ye, James A. Storer
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 414
Number of pages: 1
ISBN (Electronic): 9798331534714
State: Published - 2025
Externally published: Yes
Event: 2025 Data Compression Conference, DCC 2025 - Snowbird, United States
Duration: 18 Mar 2025 to 21 Mar 2025

Publication series

Name: Data Compression Conference Proceedings
ISSN (Print): 1068-0314

Conference

Conference: 2025 Data Compression Conference, DCC 2025
Country/Territory: United States
City: Snowbird
Period: 18/03/25 to 21/03/25

Keywords

  • diffusion model
  • generative coding
  • motion tokenization
