JOURNAL OF CHINA UNIVERSITIES OF POSTS AND TELECOM, 2017, Vol. 24, Issue (5): 68-76. doi: 10.1016/S1005-8885(17)60235-8

Extraction technique of region of interest from stereoscopic video

Lü Chaohui, Pan Jiaying   

  1. School of Information Engineering, Communication University of China, Beijing 100024, China
  • Received: 2017-06-07 Revised: 2017-09-28 Online: 2017-10-30 Published: 2017-12-18
  • Contact: Lü Chaohui, E-mail: llvch@hotmail.com
  • About author: Lü Chaohui, E-mail: llvch@hotmail.com
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61201236), National Key Technology Support Program (2012BAH01F04), and Beijing Key Laboratory of Science and Technology (Z141101004414045).

Abstract: A feature-fusion approach is presented for extracting the region of interest (ROI) from stereoscopic video. Based on the human visual system (HVS), depth, color, and motion are chosen as the visual features. The algorithm proceeds as follows. First, color saliency is calculated at the superpixel scale: the color space distribution of each superpixel and the color difference between the superpixel and the background pixels are used to describe color saliency, and the color-salient region is detected. Next, the classic visual background extractor (Vibe) algorithm is improved in two respects, the update interval and the update region of the background model. The update interval is adjusted according to the image content, and the update region is determined through non-obvious movement region detection and background point detection. The motion region of the stereoscopic video is then extracted with the improved Vibe algorithm. The depth-salient region is detected by selecting the region with the highest gray value. Finally, the three regions are fused into the final ROI. Experimental results show that the proposed method extracts the ROI from stereoscopic video effectively. To further verify the proposed method, a stereoscopic video coding application is also carried out on the joint model (JM) encoder with different bit allocations for the ROI and the background region.
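
The last two steps of the pipeline (depth-salient region selection and fusion of the three regions) can be illustrated with a minimal sketch. The abstract does not give the selection threshold or the fusion rule, so everything specific here is an assumption for illustration only: the `tolerance` parameter, the pixel-wise union used for fusion, and the function names `depth_salient_region` and `fuse_regions` are hypothetical and are not taken from the paper. Masks are plain nested lists to keep the example self-contained.

```python
from typing import List

Mask = List[List[bool]]


def depth_salient_region(depth_map: List[List[int]], tolerance: int = 10) -> Mask:
    """Mark pixels whose gray value lies within `tolerance` of the maximum.

    ASSUMPTION: `tolerance` is a hypothetical parameter; the abstract only
    says that the region with the highest gray value is selected.
    """
    peak = max(max(row) for row in depth_map)
    return [[value >= peak - tolerance for value in row] for row in depth_map]


def fuse_regions(color: Mask, motion: Mask, depth: Mask) -> Mask:
    """Fuse three binary masks into one ROI mask.

    ASSUMPTION: a pixel-wise union (logical OR) is used here; the paper's
    actual fusion rule is not stated in the abstract.
    """
    return [
        [c or m or d for c, m, d in zip(color_row, motion_row, depth_row)]
        for color_row, motion_row, depth_row in zip(color, motion, depth)
    ]


# Toy 4x4 inputs standing in for the outputs of the earlier stages.
depth = [
    [10, 10, 10, 10],
    [10, 200, 205, 10],
    [10, 198, 205, 10],
    [10, 10, 10, 10],
]
color = [[False] * 4 for _ in range(4)]
color[1][1] = True   # pretend the color-saliency stage flagged this pixel
motion = [[False] * 4 for _ in range(4)]
motion[3][3] = True  # pretend the improved Vibe stage flagged this pixel

roi = fuse_regions(color, motion, depth_salient_region(depth))
for row in roi:
    print("".join("#" if cell else "." for cell in row))
```

A union keeps any pixel flagged by at least one feature, which favors recall; an intersection or a weighted vote would be a stricter alternative under the same interface.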

Key words: stereoscopic video, depth, saliency, ROI, Vibe
