中国邮电高校学报(英文) ›› 2024, Vol. 31 ›› Issue (1): 1-11.doi: 10.19682/j.cnki.1005-8885.2024.2001

• •    下一篇

SNR-adaptive deep joint source-channel coding scheme for imagesemantic transmission with convolutional block attention module

Yang Yujia, Liu Yiming, Zhang Wenjia, Zhang Zhi   

  1. State Key Laboratory of Networking and Switch Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • 收稿日期:2023-11-21 修回日期:2024-01-04 接受日期:2024-02-22 出版日期:2024-02-29 发布日期:2024-02-29
  • 通讯作者: Corresponding author: Liu Yiming, E-mail: liuyiming@bupt.edu.cn E-mail:liuyiming@bupt.edu.cn
  • 基金资助:
    This work was supported in part by the National Natural Science Foundation of China (62293481), in part by the Young Elite Scientists Sponsorship Program by CAST (2023QNRC001), in part by the National Natural Science Foundation for Young Scientists of China (62001050), and in part by the Fundamental Research Funds for the Central Universities (2023RC95).

SNR-adaptive deep joint source-channel coding scheme for imagesemantic transmission with convolutional block attention module

Yang Yujia, Liu Yiming, Zhang Wenjia, Zhang Zhi   

  1. State Key Laboratory of Networking and Switch Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2023-11-21 Revised:2024-01-04 Accepted:2024-02-22 Online:2024-02-29 Published:2024-02-29
  • Contact: Corresponding author: Liu Yiming, E-mail: liuyiming@bupt.edu.cn E-mail:liuyiming@bupt.edu.cn
  • Supported by:
    This work was supported in part by the National Natural Science Foundation of China (62293481), in part by the Young Elite Scientists Sponsorship Program by CAST (2023QNRC001), in part by the National Natural Science Foundation for Young Scientists of China (62001050), and in part by the Fundamental Research Funds for the Central Universities (2023RC95).

摘要: With the development of deep learning (DL), joint source-channel coding (JSCC) solutions for end-to-end transmission have gained a lot of attention. Adaptive deep JSCC schemes support dynamically adjusting the rate according to different channel conditions during transmission, enhancing robustness in dynamic wireless environment. However, most of the existing adaptive JSCC schemes only consider different channel conditions, ignoring the different feature importance in the image processing and transmission. The uniform compression of different features in the image may result in the compromise of critical image details, particularly in low signal-to-noise ratio (SNR) scenarios. To address the above issues, in this paper, a dual attention mechanism is introduced and an SNR-adaptive deep JSCC mechanism with a convolutional block attention module (CBAM) is proposed, in which matrix operations are applied to features in spatial and channel dimensions respectively. The proposedsolution concatenates the pooling feature with the SNR level and passes it sequentially through the channel attention network and spatial attention network to obtain the importance evaluation result. Experiments show that the proposed solution outperforms other baseline schemes in terms of peak SNR (PSNR) and structural similarity (SSIM), particularly in low SNR scenarios or when dealing with complex image content.

关键词: semantic communication, joint source-channel coding, image transmission

Abstract: With the development of deep learning (DL), joint source-channel coding (JSCC) solutions for end-to-end transmission have gained a lot of attention. Adaptive deep JSCC schemes support dynamically adjusting the rate according to different channel conditions during transmission, enhancing robustness in dynamic wireless environment. However, most of the existing adaptive JSCC schemes only consider different channel conditions, ignoring the different feature importance in the image processing and transmission. The uniform compression of different features in the image may result in the compromise of critical image details, particularly in low signal-to-noise ratio (SNR) scenarios. To address the above issues, in this paper, a dual attention mechanism is introduced and an SNR-adaptive deep JSCC mechanism with a convolutional block attention module (CBAM) is proposed, in which matrix operations are applied to features in spatial and channel dimensions respectively. The proposedsolution concatenates the pooling feature with the SNR level and passes it sequentially through the channel attention network and spatial attention network to obtain the importance evaluation result. Experiments show that the proposed solution outperforms other baseline schemes in terms of peak SNR (PSNR) and structural similarity (SSIM), particularly in low SNR scenarios or when dealing with complex image content.

Key words: semantic communication, joint source-channel coding, image transmission