中国邮电高校学报(英文) ›› 2025, Vol. 32 ›› Issue (1): 74-87.doi: 10.19682/j.cnki.1005-8885.2025.0004

• IC and System Design • 上一篇    下一篇

Design and implementation of reconfigurable CNN accelerator architecture based on elastic storage 

山蕊,霍紫晴,许佳宁   

  1. 西安邮电大学
  • 收稿日期:2023-10-09 修回日期:2024-10-15 出版日期:2025-02-28 发布日期:2025-02-28
  • 通讯作者: 山蕊 E-mail:shanrui0112@163.com
  • 基金资助:
    科技创新2030——“新一代人工智能”重大项目;国家自然基金重点项目;国家自然基金青年项目

Design and implementation of reconfigurable CNN accelerator architecture based on elastic storage

  • Received:2023-10-09 Revised:2024-10-15 Online:2025-02-28 Published:2025-02-28
  • Contact: SHAN Rui E-mail:shanrui0112@163.com
  • Supported by:
    National Key R&D Program of China;National Natural Science Foundation of China;National Natural Science Foundation of China

摘要:

With the rapid iteration of neural network algorithms, higher requirements were placed on the computational performance and memory access bandwidth of neural network accelerators. Simply increasing bandwidth cannot improve energy efficiency, so improving the data reuse rate is a hot research topic. From the perspective of supporting data reuse, a reconfigurable convolutional neural network ( CNN) accelerator based on elastic storage ( RCAES) was designed in this paper. Supporting elastic memory access and flexible data flow reduces data movement between the processor and memory, eases the bandwidth pressure and enhances CNN acceleration performance. The experimental results indicate that by conducting 1 × 1 convolution and 3 × 3 convolution when performing convolution calculations, the execution speed increased by 25.00% and 61.61% , respectively. The   3 × 3 maximum pooling speed was increased by 76.04% .


关键词:

reconfigurable, array processor, distributed storage, neural network accelerator, data reuse


Abstract:

With the rapid iteration of neural network algorithms, higher requirements were placed on the computational performance and memory access bandwidth of neural network accelerators. Simply increasing bandwidth cannot improve energy efficiency, so improving the data reuse rate is a hot research topic. From the perspective of supporting data reuse, a reconfigurable convolutional neural network ( CNN) accelerator based on elastic storage ( RCAES) was designed in this paper. Supporting elastic memory access and flexible data flow reduces data movement between the processor and memory, eases the bandwidth pressure and enhances CNN acceleration performance. The experimental results indicate that by conducting 1 × 1 convolution and 3 × 3 convolution when performing convolution calculations, the execution speed increased by 25.00% and 61.61% , respectively. The   3 × 3 maximum pooling speed was increased by 76.04% .

Key words: reconfigurable, array processor, distributed storage, neural network accelerator, data reuse