TY - GEN
T1 - An efficient coding framework for compact descriptors extracted from video sequence
AU - Huang, Zhangshuai
AU - Duan, Ling Yu
AU - Lin, Jie
AU - Wang, Shiqi
AU - Ma, Siwei
AU - Huang, Tiejun
N1 - Publisher Copyright: © 2015 IEEE.
PY - 2015/12/9
Y1 - 2015/12/9
AB - Towards effective and efficient image matching and retrieval, the emerging MPEG standard Compact Descriptors for Visual Search (CDVS) defines compact descriptors for still images, consisting of compressed local and global descriptors. However, coding CDVS descriptors frame by frame for a video sequence does not exploit inter-frame redundancy, and may therefore consume considerable bandwidth and storage resources. In this work, we propose an efficient coding framework for CDVS descriptors that generates compact descriptors for video sequences. For local descriptors, we propose a multiple-reference predictive technique that exploits the temporal correlation of local descriptors and location coordinates over a sequence of frames. To further improve prediction performance, keypoint tracking is applied to identify temporally repeated keypoints. For global descriptors, propagation coding is employed to compress the global descriptors of adjacent frames. Empirical evaluation shows that the proposed coding approach yields a low bit rate of less than 40 kbps on average while maintaining comparable matching and retrieval performance. Compared to the sequence of original frame-level CDVS descriptors, the proposed approach achieves over 25× bit rate reduction.
KW - Compact descriptor
KW - Interest points tracking
KW - MPEG CDVS
KW - Predictive coding
KW - Propagation coding
UR - https://www.scopus.com/pages/publications/84956650755
U2 - 10.1109/ICIP.2015.7351520
DO - 10.1109/ICIP.2015.7351520
M3 - Conference contribution
AN - SCOPUS:84956650755
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 3822
EP - 3826
BT - 2015 IEEE International Conference on Image Processing, ICIP 2015 - Proceedings
PB - IEEE Computer Society
T2 - IEEE International Conference on Image Processing, ICIP 2015
Y2 - 27 September 2015 through 30 September 2015
ER -