李介民, 张善从. 五级流水线RISC-V处理器的研究与性能优化[J]. 微电子学与计算机, 2022, 39(3): 78-85. DOI: 10.19304/J.ISSN1000-7180.2021.0856
引用本文: 李介民, 张善从. 五级流水线RISC-V处理器的研究与性能优化[J]. 微电子学与计算机, 2022, 39(3): 78-85. DOI: 10.19304/J.ISSN1000-7180.2021.0856
LI Jiemin, ZHANG Shancong. Research and performance optimization of five stage pipelined RISC-V processor[J]. Microelectronics & Computer, 2022, 39(3): 78-85. DOI: 10.19304/J.ISSN1000-7180.2021.0856
Citation: LI Jiemin, ZHANG Shancong. Research and performance optimization of five stage pipelined RISC-V processor[J]. Microelectronics & Computer, 2022, 39(3): 78-85. DOI: 10.19304/J.ISSN1000-7180.2021.0856

五级流水线RISC-V处理器的研究与性能优化

Research and performance optimization of five stage pipelined RISC-V processor

  • 摘要: 国内基于RISC-V指令集的嵌入式处理器的研究在近几年内得到了快速发展.在性能评估研究上多集中于2-3级流水的小规模、低功耗处理器,针对5级流水架构处理器的性能量化研究较少.针对该问题,在传统5级顺序流水架构的基础上,分别从RISC-V指令预测、流水线机制、乘除法算法、存储架构等方面分析处理器优化的策略.针对不同策略优化的处理器以AHB片上互联、APB桥接的方案实现外部模块的搭载.在FPGA上完成软硬件协同仿真验证,并在Xilinx的XC7K325T开发板上运行性能评估测试程序CoreMark,依据运行结果着重分析不同静态预测算法、不同周期乘除法运算、外挂存储的容量与设计等因素对处理器的影响.最终实现的处理器基于RV32IMZicsr架构,采用了半静态预测(资源优化)、流水线检测转发机制(处理优化)、短周期乘除法(计算优化)、最优存储架构(取指与访存优化)等性能优化策略.CoreMark跑分达到3.06 CoreMark/MHz.

     

    Abstract: In recent years, the research of embedded processor based on RISC-V instruction set has developed rapidly in China. The research on performance evaluation is mostly focused on the small scale and low power processors with 2-3 level pipelined architectures, and there are few studies on the performance quantification of the processors with 5-level pipelined architectures. To solve this problem, based on the traditional five-level sequential pipelining architecture, the strategy of processor optimization is analyzed from the aspects of RISC-V instruction prediction, pipelining mechanism, multiplication and division algorithm, storage architecture, etc. For the processors optimized for different strategies, AHB on-chip interconnection and APB-bridge are used to carry external modules. The software and hardware co-simulation verification is completed on FPGA, and the performance evaluation test program Coremark is run on Xilinx's XC7K325T development board. According to the running results, the influences of different static prediction algorithms, different cycle multiplication and division operations, capacity and design of plug-in storage on the processor are analyzed emphatically. The final processor is based on RV32IMZicsr architecture, and adopts performance optimization strategies such as semi-static prediction (resource optimization), pipeline detection and forwarding mechanism (processing optimization), short-period multiplication-division method (computing optimization), and optimal storage architecture (pointing and memory access optimization). Coremark scored 3.06 Coremark /MHz.

     

/

返回文章
返回