Occupancy Map Guided Attributes Artifacts Removal for Video-Based Point Cloud Compression

  • Peilin Chen
  • Shiqi Wang*
  • Zhu Li
  • *Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Point clouds offer realistic 3D representations of objects and scenes at the expense of large data volumes. To represent such data compactly in real-world applications, Video-Based Point Cloud Compression (V-PCC) converts their texture into 2D attribute and occupancy maps before applying lossy video compression. Unfortunately, the coding artifacts introduced in the decoded attribute maps degrade the quality of the reconstructed point cloud and, in turn, the immersive experience. This article proposes a deep learning-based attribute map enhancement method that fully leverages the occupancy map's guidance. The design philosophy is that cross-modality guidance from occupancy can be leveraged as critical information to enhance the attribute. Therefore, instead of treating attribute and occupancy as two separate signal sources, the proposed framework uses occupancy as an indispensable auxiliary input and explicitly provides the model with abundant clues through local feature modification and global dependency aggregation. The framework is also compatible with existing V-PCC bitstreams and can be incorporated into the standardized decoder pipeline. Extensive evaluations show the effectiveness of the proposed framework in attribute enhancement, with an equivalent Bjontegaard Delta-rate (BD-rate) saving of 6.0%.

Original language: English
Article number: 387
Journal: ACM Transactions on Multimedia Computing, Communications and Applications
Volume: 20
Issue number: 12
DOIs
State: Published - 26 Nov 2024
Externally published: Yes

Keywords

  • 3-D texture enhancement
  • occupancy utilization
  • Point cloud compression
