Ramaswami A, Kenter T, Kühne TD, Plessl C (2021)
Publication Type: Conference contribution
Publication year: 2021
Publisher: Springer Science and Business Media Deutschland GmbH
Book Volume: 12700 LNCS
Pages Range: 285-294
Conference Proceedings Title: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Event location: Virtual, Online
ISBN: 9783030790240
DOI: 10.1007/978-3-030-79025-7_21
The 3D Fast Fourier Transformation (3D FFT) is a critical routine in a number of applications of today’s HPC workloads. Optimising it for computational performance with the use of FPGAs has been a focus of several studies. However, a systematic study has been missing on the viability of different scenarios how FPGA-accelerated 3D FFT implementations can be integrated into real world applications originally implemented on high-end CPUs. In this paper, we address this with two scenarios for offloading 3D FFT computations to an FPGA and investigate their feasibility in comparison to highly optimised FFTW based executions on CPU in terms of computation time and power consumption. In the first scenario, performance of individual 3D FFT offloading to FPGA is found to be limited by latency and unidirectional bandwidth of PCIe data transfers. This bottleneck is overcome in the second scenario by overlapping in a batched mode data transfers and computations in the FPGA to find performance competitive to CPU. In both these scenarios, projections to next generation PCIe connections show additional potential for the FPGA with up to 2x speedup over CPU executions. Furthermore, measurements indicate 3.7x to 4.1x lower average power consumption on the FPGA.
APA:
Ramaswami, A., Kenter, T., Kühne, T.D., & Plessl, C. (2021). Evaluating the Design Space for Offloading 3D FFT Calculations to an FPGA for High-Performance Computing. In Steven Derrien, Frank Hannig, Pedro C. Diniz, Daniel Chillet (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 285-294). Virtual, Online: Springer Science and Business Media Deutschland GmbH.
MLA:
Ramaswami, Arjun, et al. "Evaluating the Design Space for Offloading 3D FFT Calculations to an FPGA for High-Performance Computing." Proceedings of the 17th International Symposium on Applied Reconfigurable Computing, ARC 2021, Virtual, Online Ed. Steven Derrien, Frank Hannig, Pedro C. Diniz, Daniel Chillet, Springer Science and Business Media Deutschland GmbH, 2021. 285-294.
BibTeX: Download