The Journal of China Universities of Posts and Telecommunications, 2024, Vol. 31, Issue (2): 94-104. doi: 10.19682/j.cnki.1005-8885.2024.0009

Special Issue: Integrated Circuits

Design and implementation of a multi-tile parallel scanning rasterization accelerator


  • Received: 2023-12-04 Revised: 2024-03-09 Online: 2024-04-30 Published: 2024-04-30
  • Contact: Li-Dong XING
  • Supported by:
    Scientific Research Program Funded by Shaanxi Provincial Education Department


In graphics processing unit (GPU) design, the speed of triangle rasterization is an important determinant of overall GPU performance. This paper proposes an architecture for a multi-tile parallel-scan rasterization accelerator. The accelerator uses a bounding box algorithm to improve scanning efficiency; it rasterizes multiple tiles in parallel and scans multiple lines simultaneously within each tile. This highly parallel approach substantially improves rasterization performance. Synthesized with the 65 nm standard cell library of Semiconductor Manufacturing International Corporation (SMIC), the accelerator reaches a maximum clock frequency of 220 MHz. An implementation on the Genesys2 field programmable gate array (FPGA) board verifies the functionality of the accelerator. The implementation shows a significant improvement in rendering speed and efficiency, demonstrating its suitability for high-performance rasterization.
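
The scanning scheme summarized in the abstract can be illustrated with a small software model. The sketch below is not the authors' hardware design; it is a minimal sequential illustration that assumes integer vertex coordinates, pixels sampled at integer positions, edges treated as inclusive (no top-left fill rule), and hypothetical names and parameters (`rasterize`, `tile`, `lines_per_step`). The loops over tiles and over groups of lines mark the work that an accelerator of this kind could perform in parallel.

```python
def edge(ax, ay, bx, by, px, py):
    """Edge function: twice the signed area of triangle (A, B, P)."""
    return (bx - ax) * (py - ay) - (by - ay) * (px - ax)


def rasterize(tri, width, height, tile=8, lines_per_step=2):
    """Return the set of (x, y) pixels covered by an integer-vertex triangle.

    Assumptions (illustrative only): square tiles of side `tile`, and
    `lines_per_step` scan lines handled together inside each tile.
    """
    (x0, y0), (x1, y1), (x2, y2) = tri

    # Normalize winding so that interior points give non-negative edge values.
    if edge(x0, y0, x1, y1, x2, y2) < 0:
        (x1, y1), (x2, y2) = (x2, y2), (x1, y1)

    # Bounding box of the triangle, clipped to the screen.
    xmin, xmax = max(min(x0, x1, x2), 0), min(max(x0, x1, x2), width - 1)
    ymin, ymax = max(min(y0, y1, y2), 0), min(max(y0, y1, y2), height - 1)

    covered = set()
    # Visit only the tiles that overlap the bounding box. The tiles are
    # independent of each other, so hardware could process several at once.
    for ty in range(ymin // tile, ymax // tile + 1):
        for tx in range(xmin // tile, xmax // tile + 1):
            x_lo, x_hi = max(tx * tile, xmin), min(tx * tile + tile - 1, xmax)
            y_lo, y_hi = max(ty * tile, ymin), min(ty * tile + tile - 1, ymax)
            # Inside a tile, scan lines are taken in groups; the lines of one
            # group are also independent and could be scanned simultaneously.
            for y_base in range(y_lo, y_hi + 1, lines_per_step):
                for y in range(y_base, min(y_base + lines_per_step, y_hi + 1)):
                    for x in range(x_lo, x_hi + 1):
                        if (edge(x0, y0, x1, y1, x, y) >= 0
                                and edge(x1, y1, x2, y2, x, y) >= 0
                                and edge(x2, y2, x0, y0, x, y) >= 0):
                            covered.add((x, y))
    return covered


pixels = rasterize([(0, 0), (4, 0), (0, 4)], width=16, height=16, tile=4)
print(len(pixels))
```

Because every pixel is tested independently against the same three edge functions, the covered set does not depend on the tile size or on how many lines are grouped together; those two parameters only change how the work is partitioned, which is the property that makes the tile-level and line-level parallelism possible.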

Key words:  GPU, rasterization, multi-tile, multi-line, parallelism