李笑天, 郭德源, 何虎. 分支预测与值预测在VLIW处理器中的实现[J]. 微电子学与计算机, 2015, 32(1): 54-59.
引用本文: 李笑天, 郭德源, 何虎. 分支预测与值预测在VLIW处理器中的实现[J]. 微电子学与计算机, 2015, 32(1): 54-59.
LI Xiao-tian, GUO De-yuan, HE Hu. Realization of Branch Prediction and Value Prediction in VLIW[J]. Microelectronics & Computer, 2015, 32(1): 54-59.
Citation: LI Xiao-tian, GUO De-yuan, HE Hu. Realization of Branch Prediction and Value Prediction in VLIW[J]. Microelectronics & Computer, 2015, 32(1): 54-59.

分支预测与值预测在VLIW处理器中的实现

Realization of Branch Prediction and Value Prediction in VLIW

  • 摘要: 为了降低超长指令字(VLIW)架构的平均跳转开销和平均访存时延,并减少VLIW程序的代码体积,提出了一种全新的将分支预测与值预测技术应用于VLIW架构的方法.首先分析现有超标量(Superscalar)架构中动态预测技术与VLIW架构中指令静态并行之间所存在的矛盾;通过拓展原有跳转指令和读内存指令,使之与不同的延时槽个数相对应,并根据不同的指令来阻塞流水线或延时写回寄存器,从而解决动态预测技术造成VLIW架构静态调度周期错乱的问题.基于Gem5仿真平台和清华大学Magnolia VLIW数字信号处理器(DSP)的基准测试程序实验表明,该分支预测与值预测技术能显著地提高VLIW架构的性能,缩小VLIW程序的代码体积.

     

    Abstract: To reduce the average branch penalty, the average memory reference latency and the program code size in VLIW(Very Long Instruction Word) architecture, A new method to implement branch prediction and value prediction in VLIW is presented. Firstly, the confliction of the dynamic prediction in Superscalar and the static instruction parallelism in VLIW is analyzed. Then the branch and load instruction is expanded, one-to-one correspondence between the delay slot and the new expanded instructions. The pipeline is stalled and the write back stage is delayed according to the given instruction. Benchmark tests based on Gem5 and Magnolia VLIW DSP of Tsinghua University are presented to prove the advantage of the branch prediction and value prediction in VLIW.

     

/

返回文章
返回