Abstract:
To improve the efficiency of the GPU texture unit, a multiported texture cache architecture is proposed. The architecture employs a tiled rasterization order and a block-interleaved memory organization, which together exploit data locality and improve the cache hit rate. In addition, the architecture uses cache prefetching to hide memory latency. To further improve data throughput, the cache provides four read ports that support four parallel reads. Simulation results show that the proposed cache achieves a hit rate of about 92% and attains 90% of the performance of a zero-latency memory system. Its data throughput is about 3 to 4 times that of a single-ported cache.
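As a rough illustration of how a blocked, interleaved texel layout can feed four read ports in parallel, the sketch below maps texel coordinates to a block-tiled address and to one of four banks. It is a minimal sketch under stated assumptions, not the design evaluated here: the 4x4 block size, the parity-based bank selection, and the names `blocked_address`, `bank_of`, and `bilinear_footprint` are all hypothetical and do not come from the abstract.

```python
# Hypothetical parameters (not given in the abstract): 4x4-texel blocks
# and one bank per read port.
BLOCK_W = 4
BLOCK_H = 4
NUM_BANKS = 4


def blocked_address(x, y, tex_width):
    """Map texel (x, y) to a linear address in a block-tiled layout.

    Texels of the same BLOCK_W x BLOCK_H block are stored contiguously,
    so spatially close texels are also close in memory.
    Assumes tex_width is a multiple of BLOCK_W.
    """
    blocks_per_row = tex_width // BLOCK_W
    block_index = (y // BLOCK_H) * blocks_per_row + (x // BLOCK_W)
    offset_in_block = (y % BLOCK_H) * BLOCK_W + (x % BLOCK_W)
    return block_index * (BLOCK_W * BLOCK_H) + offset_in_block


def bank_of(x, y):
    """Assumed interleaving: pick the bank from the parity of x and y."""
    return (x & 1) | ((y & 1) << 1)


def bilinear_footprint(x, y):
    """The 2x2 texel neighbourhood read by one bilinear sample."""
    return [(x, y), (x + 1, y), (x, y + 1), (x + 1, y + 1)]


if __name__ == "__main__":
    for tx, ty in bilinear_footprint(5, 2):
        print((tx, ty), blocked_address(tx, ty, 16), bank_of(tx, ty))
```

With this assumed interleaving, the four texels of any 2x2 footprint always fall in four different banks, so one bilinear sample could in principle be served by four ports without a bank conflict.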