Efficient Number Theoretic Transform accelerator on the versal platform powered by the AI Engine

Lattice-based cryptography, essential for fully homomorphic encryption, primarily relies on the computationally intensive Number Theoretic Transform (NTT). This paper proposes an NTT accelerator based on AMD/Xilinx Versal ACAP and AI Engine (AIE), featuring data engines on Programmable Logic (PL) an...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Future generation computer systems Ročník 166; s. 107728
Hlavní autori: Bao, Zhenshan, Zang, Tianhao, Liu, Yiqi, Zhang, Wenbo
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Elsevier B.V 01.05.2025
Predmet:
ISSN:0167-739X
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:Lattice-based cryptography, essential for fully homomorphic encryption, primarily relies on the computationally intensive Number Theoretic Transform (NTT). This paper proposes an NTT accelerator based on AMD/Xilinx Versal ACAP and AI Engine (AIE), featuring data engines on Programmable Logic (PL) and compute engines on the AIE. For inter-core parallelism on the AIE array, we propose an efficient method that applies the communication avoidance strategy to meet resource constraints; for intra-core data parallelism, we explore the modular multiplication algorithm suitable for AIE’s SIMD processors, proposing optimized software to support extensive NTT parameters while ensuring efficiency. Specialized data units are also proposed to compensate the slow DDR interface, enhancing data flow and overall performance. Our design outperforms CPU-based solutions by an average of 8.30× and Tesla V100 GPU-based solutions by 1.44× to 1.89×. Compared to most FPGA-based solutions, our approach shows shorter latency, improving by an average of 2.62×, while ensuring scalability and flexibility.
ISSN:0167-739X
DOI:10.1016/j.future.2025.107728