Skip to main navigation Skip to search Skip to main content

Semantic Face Compression for Metaverse: A Compact 3D Descriptor Based Approach

  • Binzhe Li
  • , Bolin Chen
  • , Zhao Wang
  • , Shiqi Wang*
  • , Yan Ye
  • *Corresponding author for this work
  • Peking University
  • City University of Hong Kong
  • Alibaba Group Holding Ltd.

Research output: Contribution to journalArticlepeer-review

Abstract

The metaverse, a 3D virtual world, requires efficient interactive avatar communication. To achieve this goal, we envision a new metaverse paradigm for virtual avatar faces and develop semantic face compression with compact 3D facial descriptors. The paradigm comprises a compression framework that transmits 3D face descriptors for semantic compression and applications based on the semantic descriptors. The fundamental principle is that the communication of virtual avatar faces primarily emphasizes the conveyance of semantic information. In light of this, the proposed scheme offers the advantages of being highly flexible, efficient, and semantically meaningful. The promise of the proposed paradigm is also demonstrated by performance comparisons with the state-of-the-art video coding standard, Versatile Video Coding. A significant improvement in terms of rate-accuracy performance has been achieved. The proposed scheme is expected to enable numerous applications especially for real-time communication in the metaverse, such as digital human communication based on machine analysis, and to form the cornerstone of interactions.

Original languageEnglish
Pages (from-to)8978-8982
Number of pages5
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume34
Issue number9
DOIs
StatePublished - 2024
Externally publishedYes

Keywords

  • 3D descriptor
  • Metaverse
  • emotion recognition
  • face identification
  • intelligent machine task
  • real-time communication
  • semantic face compression

Fingerprint

Dive into the research topics of 'Semantic Face Compression for Metaverse: A Compact 3D Descriptor Based Approach'. Together they form a unique fingerprint.

Cite this