1. Meinerzhagen P A, Tokunaga C, Malavasi A, et al. An Energy-Efficient Graphics Processor in 14-nm Tri-Gate CMOS Featuring Integrated Voltage Regulators for Fine-Grain DVFS, Retentive Sleep, and vmin Optimization. IEEE Journal of Solid-State Circuits, 2018, 54(1): 144-157. 2. Zhu M, Zhuo Y, Wang C, et al. Performance evaluation and optimization of HBM-Enabled GPU for data-intensive applications. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2018, 26(5): 831-840. 3. Deng J, Tao L I, Jiang L, et al. Design and optimization for multiprocessor interactive GPU. The Journal of China Universities of Posts and Telecommunications, 2014, 21(3): 85-97. 4. Park S I, Cao Y, Watson L T, et al. Performance analysis of a novel GPU computation-to-core mapping scheme for robust facet image modeling. Journal of Real-Time Image Processing, 2015, 10(3): 485-500. 5. Merrill D, Garland M, Grimshaw A. High-performance and scalable GPU graph traversal. ACM Transactions on Parallel Computing, 2015, 1(2): 14:1-14:30. 6. Makimoto T. Implications of Makimoto's Wave. Computer, 2013, 46(12): 32-37. 7. J. Xu and Y. Yin, “Semiconductor features cycle and reconfigurable chips,” Embedded Systems and Applications. 2005(2): 2-4 (in Chinese). 8. Wei S, Liu L, Yin S. Key techniques of recon gurable computing processor. SCIENTIA SINICA Informationis, 2012, 42(12): 1559-1576 (in Chinese). 9. Zhu M, Liu L, Yin S, et al. A reconfigurable multi-processor SoC for media applications. Proceedings of 2010 IEEE International Symposium on Circuits and Systems. IEEE, 2010: 2011-2014. 10. Phong B T. Illumination for computer generated pictures. Communications of the ACM, 1975, 18(6): 311-317. 11. Cook R L, Torrance K E. A reflectance model for computer graphics. ACM Transactions on Graphics , 1982, 1(1): 7-24. 12. Blinn J F. Models of light reflection for computer synthesized pictures. ACM SIGGRAPH computer graphics. ACM, 1977, 11(2): 192-198. 13. Torrance K E, Sparrow E M. Theory for off-specular reflection from roughened surfaces. Journal of the Optical Society of America, 1967, 57(9): 1105-1114. 14. M. Segal and K. Akeley, The OpenGL Graphics System:A Specification(Version 4.6 (Core Profile)). The Khronos Group Inc. (2018-05-14).https://www.khronos.org/registry/OpenGL/specs/gl/glspec46.core.pdf. 15. Issa J, Figueira S. Graphics Processor performance analysis for 3D applications. 2012 2nd International Conference on Advances in Computational Tools for Engineering Applications. IEEE, 2012: 269-272. 16. Garcia O G, Lambert P A. Light mounting fixture assembly: U.S. Patent 4,222,093. 1980-9-9. 17. Basri R, Jacobs D W. Lambertian reflectance and linear subspaces. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2003 (2): 218-233. 18. Panda R, Song S, Dean J, et al. Wait of a decade: Did spec cpu 2017 broaden the performance horizon? 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA). IEEE, 2018: 271-282. 19. Jiang L, Chen L, Qiu J. Performance characterization of multi-threaded graph processing applications on many-integrated-core architecture. 2018 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). IEEE, 2018: 199-208. 20. De Melo A C. The new linux’perf’tools. Slides from Linux Kongress. 2010, 18.p42. 21. Reinders J. VTune performance analyzer essentials. Intel Press, 2005.p15. 22. Keckler S W, Dally W J, Khailany B, et al. GPUs and the future of parallel computing. IEEE Micro, 2011, 31(5): 7-17. 23. Han S, Liu X, Mao H, et al. EIE: efficient inference engine on compressed deep neural network. 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture. IEEE, 2016: 243-254. 24. Phansalkar A, Joshi A, John L K. Analysis of redundancy and application balance in the SPEC CPU2006 benchmark suite. ACM SIGARCH Computer Architecture News, 2007, 35(2): 412-423. 25. Borshukov G, Lewis J P. Realistic human face rendering for the matrix reloaded. ACM Siggraph 2005 Courses. ACM, 2005: 13. |