Design and implementation of a multi-tile parallel scanning rasterization accelerator

doi:10.19682/j.cnki.1005-8885.2024.0009

中国邮电高校学报(英文) ›› 2024, Vol. 31 ›› Issue (2): 94-104.doi: 10.19682/j.cnki.1005-8885.2024.0009

所属专题：集成电路

• IC and System Design • 上一篇下一篇

Design and implementation of a multi-tile parallel scanning rasterization accelerator

邢立冬,郭强,彭欣龙,冯臻夫

西安邮电大学

收稿日期:2023-12-04 修回日期:2024-03-09 出版日期:2024-04-30 发布日期:2024-04-30
通讯作者: 邢立冬 E-mail:zmy_xld@163.com
基金资助:
陕西省教育厅科研计划项目资助

Design and implementation of a multi-tile parallel scanning rasterization accelerator

Received:2023-12-04 Revised:2024-03-09 Online:2024-04-30 Published:2024-04-30
Contact: Li-Dong XING E-mail:zmy_xld@163.com
Supported by:
Scientific Research Program Funded by Shaanxi Provincial Education Department

摘要/Abstract

摘要：

In the design of a graphic processing unit (GPU), the processing speed of triangle rasterization is an important factor that determines the performance of the GPU. An architecture of a multi-tile parallel-scan rasterization accelerator was proposed in this paper. The accelerator uses a bounding box algorithm to improve scanning efficiency. It rasterizes multiple tiles in parallel and scans multiple lines at the same time within each tile. This highly parallel approach drastically improves the performance of rasterization. Using 65nm process standard cell library of Semiconductor Manufacturing International Corporation (SMIC), the accelerator can be synthesized to a maximum clock frequency of 220MHz. An implementation on the Genesys2 field programmable gate array (FPGA) board fully verifies the functionality of the accelerator. The implementation shows a significant improvement in rendering speed and efficiency and proves its suitability for high- performance rasterization.

关键词: GPU, rasterization, multi-tile, multi-line, parallelism

Abstract:

Key words: GPU, rasterization, multi-tile, multi-line, parallelism

参考文献

1. CAI Z C. Research on key approach of hybrid real-time rendering. Master Thesis. Chengdu, China: University of ElectronicScience and Technology of China, 2020 (in Chinese).

2. AKENINE-MÖLLER T , HAINES E, HOFFMAN N, et al. Real-time rendering. 4th edn. Boca Raton, FL, USA: CRC Press, 2018.

3. FROLOV V A, GALAKTIONOV V A, BARLADYAN B H. Comparative study of high performance software rasterization techniques. Mathematica Montisnigri, 2020, 47: 152-175.

4. BAEK N. A prototype implementation of a CUDA-based customized rasterizer. International Journal of Advanced Computer Science and Applications, 2022, 13(8): 776-781.

5. LIU Q N, ZENG Z C G, DU H M, et al. An embedded GPU triangle rasterization algorithm for hardware implementation.Microelectronics & Computer, 2018, 35(9): 26-31 (in Chinese).

6. LIU Z. The research and implementation of triangle rasterization algorithm based on tile. Master Thesis. Xi’an China: Xidian University, 2018 (in Chinese).

7. CAI Y F. Design and verification of GPU’s 3D engine graphics rendering pipeline. Master Thesis. Xi’an China: Xidian University, 2019 (in Chinese).

8. XUE J J, AI D H, WANG M J. An improved midpoint rasterization algorithm implemented in FPGA. Displays, 2023, 76: Article 102364/1-19.

9. FU H, XIE Y F. High efficiency triangle raster algorithm in GPU based on TBR architecture. Journal of Image and Graphics, 2015, 20(4): 527-532 (in Chinese).

10. FAN M, JIANG L, YANG B W, et al. Design of multi-directional and parallel triangle scanning algorithm of tile. Computer Engineering and Design, 2020, 41(8): 2360-2366 (in Chinese).

11. ZHAO J L, LI T, DONG Z H, et al. Multi-threaded bidirectional parallel scan rasterization hardware accelerator. Transducer and Microsystem Technologies, 2021, 40(5): 91-93, 97 (in Chinese).

12. HEARN D, BAKER M P, BAKER M P. Computer graphics with OpenGL. Upper Saddle River, NJ, USA: Pearson Prentice Hall, 2004.

[1]	Song Yue, Wu Chengmao, Tian Xiaoping, Song Qiuyu. Enhanced kernel-based fuzzy local information clustering integrating neighborhood membership [J]. 中国邮电高校学报(英文版), 2021, 28(6): 65-81.
[2]	李齐山胡智群温向明路兆铭亓航. On-line learning algorithm for dynamic sensitivity control in IEEE 802.11ax network[J]. 中国邮电高校学报(英文版), 2018, 25(5): 67-74.

Design and implementation of a multi-tile parallel scanning rasterization accelerator

Design and implementation of a multi-tile parallel scanning rasterization accelerator

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 2

编辑推荐

Metrics

本文评价