The Journal of China Universities of Posts and Telecommunications ›› 2025, Vol. 32 ›› Issue (1): 74-87.doi: 10.19682/j.cnki.1005-8885.2025.0004

Previous Articles     Next Articles

Design and implementation of reconfigurable CNN accelerator architecture based on elastic storage

  

  • Received:2023-10-09 Revised:2024-10-15 Online:2025-02-28 Published:2025-02-28
  • Contact: SHAN Rui E-mail:shanrui0112@163.com
  • Supported by:
    National Key R&D Program of China;National Natural Science Foundation of China;National Natural Science Foundation of China

Abstract:

With the rapid iteration of neural network algorithms, higher requirements were placed on the computational performance and memory access bandwidth of neural network accelerators. Simply increasing bandwidth cannot improve energy efficiency, so improving the data reuse rate is a hot research topic. From the perspective of supporting data reuse, a reconfigurable convolutional neural network ( CNN) accelerator based on elastic storage ( RCAES) was designed in this paper. Supporting elastic memory access and flexible data flow reduces data movement between the processor and memory, eases the bandwidth pressure and enhances CNN acceleration performance. The experimental results indicate that by conducting 1 × 1 convolution and 3 × 3 convolution when performing convolution calculations, the execution speed increased by 25.00% and 61.61% , respectively. The   3 × 3 maximum pooling speed was increased by 76.04% .

Key words: reconfigurable, array processor, distributed storage, neural network accelerator, data reuse