中国邮电高校学报(英文) ›› 2017, Vol. 24 ›› Issue (5): 68-76.doi: 10.1016/S1005-8885(17)60235-8

• Signal processing • 上一篇    下一篇

Extraction technique of region of interest from stereoscopic video

Lü Chaohui, Pan Jiaying   

  1. School of Information Engineering, Communication University of China, Beijing 100024, China
  • 收稿日期:2017-06-07 修回日期:2017-09-28 出版日期:2017-10-30 发布日期:2017-12-18
  • 通讯作者: Lü Chaohui, E-mail: llvch@hotmail.com E-mail:llvch@hotmail.com
  • 作者简介:Lü Chaohui, E-mail: llvch@hotmail.com
  • 基金资助:
    This work was supported by the National Natural Science Foundation of China (61201236), National Key Technology Support Program (2012BAH01F04), and Beijing Key Laboratory of Science and Technology (Z141101004414045).

Extraction technique of region of interest from stereoscopic video

Lü Chaohui, Pan Jiaying   

  1. School of Information Engineering, Communication University of China, Beijing 100024, China
  • Received:2017-06-07 Revised:2017-09-28 Online:2017-10-30 Published:2017-12-18
  • Contact: Lü Chaohui, E-mail: llvch@hotmail.com E-mail:llvch@hotmail.com
  • About author:Lü Chaohui, E-mail: llvch@hotmail.com
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61201236), National Key Technology Support Program (2012BAH01F04), and Beijing Key Laboratory of Science and Technology (Z141101004414045).

摘要: A feature fusion approach is presented to extract the region of interest (ROI) from the stereoscopic video. Based on human vision system (HVS), the depth feature, the color feature and the motion feature are chosen as vision features. The algorithm is shown as follows. Firstly, color saliency is calculated on superpixel scale. Color space distribution of the superpixel and the color difference between the superpixel and background pixel are used to describe color saliency and color salient region is detected. Then, the classic visual background extractor (Vibe) algorithm is improved from the update interval and update region of background model. The update interval is adjusted according to the image content. The update region is determined through non-obvious movement region and background point detection. So the motion region of stereoscopic video is extracted using improved Vibe algorithm. The depth salient region is detected by selecting the region with the highest gray value. Finally, three regions are fused into final ROI. Experiment results show that the proposed method can extract ROI from stereoscopic video effectively. In order to further verify the proposed method, stereoscopic video coding application is also carried out on the joint model (JM) encoder with different bit allocation in ROI and the background region.

关键词: stereoscopic video, depth, saliency, ROI, Vibe

Abstract: A feature fusion approach is presented to extract the region of interest (ROI) from the stereoscopic video. Based on human vision system (HVS), the depth feature, the color feature and the motion feature are chosen as vision features. The algorithm is shown as follows. Firstly, color saliency is calculated on superpixel scale. Color space distribution of the superpixel and the color difference between the superpixel and background pixel are used to describe color saliency and color salient region is detected. Then, the classic visual background extractor (Vibe) algorithm is improved from the update interval and update region of background model. The update interval is adjusted according to the image content. The update region is determined through non-obvious movement region and background point detection. So the motion region of stereoscopic video is extracted using improved Vibe algorithm. The depth salient region is detected by selecting the region with the highest gray value. Finally, three regions are fused into final ROI. Experiment results show that the proposed method can extract ROI from stereoscopic video effectively. In order to further verify the proposed method, stereoscopic video coding application is also carried out on the joint model (JM) encoder with different bit allocation in ROI and the background region.

Key words: stereoscopic video, depth, saliency, ROI, Vibe

中图分类号: