张多利, 张玲佳, 宋宇鲲. 变维度FFT硬件加速器结构设计及FPGA实现[J]. 微电子学与计算机, 2017, 34(12): 34-39, 44.
引用本文: 张多利, 张玲佳, 宋宇鲲. 变维度FFT硬件加速器结构设计及FPGA实现[J]. 微电子学与计算机, 2017, 34(12): 34-39, 44.
ZHANG Duo-li, ZHANG Ling-jia, SONG Yu-kun. Structure Design and FPGA Implementation of a Variable Dimension FFT Hardware Accelerator[J]. Microelectronics & Computer, 2017, 34(12): 34-39, 44.
Citation: ZHANG Duo-li, ZHANG Ling-jia, SONG Yu-kun. Structure Design and FPGA Implementation of a Variable Dimension FFT Hardware Accelerator[J]. Microelectronics & Computer, 2017, 34(12): 34-39, 44.

变维度FFT硬件加速器结构设计及FPGA实现

Structure Design and FPGA Implementation of a Variable Dimension FFT Hardware Accelerator

  • 摘要: 大点数FFT变换对处理器的运算能力和访存带宽要求非常高, 通常是高实时性DSP应用的瓶颈, 因此研究高速、低资源消耗且便于硬件化的FFT加速器变得极有实用价值.本文设计了一种变维度FFT硬件加速器, 其采用体-面-线的数据组织形式, 提出了一种面划分1兼多路并行的架构, 从面和线2个层次展开计算, 以面为基本存储单位, 以线为基本计算单位, 提高了FFT运算的并行度, 减少了处理器间的数据交互, 并通过乒乓预读取的设计和无冲突的地址调整, 提高了整机的运算访存比.本文设计的FFT加速器内含32个并行计算单元, 支持IEEE-754标准下的32位单精度浮点数32点到64 K点一维FFT运算, 32点到256点的二维/三维FFT运算, 且具有较强的可扩展性, 可根据需要实现m×n×p序列的FFT运算.该设计已在Xilinx Virtex6 FPGA芯片上进行原型验证, 最高工作频率184.88 MHz.

     

    Abstract: The large number of Fast Fourier Transform (FFT) has high requirements on the processor's computing and memory bandwidth. It is usually the bottlenecks of DSP applications with strong real-time demand. It is of great practical value to study the FFT accelerator with high speed, low resource consumption and easy hardware.In this paper, a variable dimension FFT hardware accelerator is designed. By using the data form of body-surface-line, the calculation is carried out from two levels of surface and line. It adopts surface partitioning1and multiple parallel architecture to improve the parallelism of FFT. Furthermore by using Ping-Pong operation, pre-read and flexible address adjustment to conceal data transmitting time. In this paper, the FFT accelerator consists of 32 parallel computing units to carry out 2n(n is from range of 5 to 16) single-precision floating-point 1-D FFT and 2n (n is from range of 5 to 8) single-precision floating-point 2-D/3-D FFT. What is more, it is important that the structure has strong expansibility which can achieve FFT of the sequence of m×n×p. As a result the design's maximal frequency is up to 184.88 MHz. It has been successfully applied on the Xilinx Virtex6 FPGA chip.

     

/

返回文章
返回