Lai, Zewei and Ye, Jinhui and Wang, Xiaohang and Fu, Zheang and Singh, Amit Kumar and Jiang, Yingtao and Ren, Kui and Yang, Mei and Qiu, Sihai and Li, Xiaodong and Tang, Xin and Song, Jie and Zhang, Mingzhe (2025) On Optimizing Intra- and Inter-chiplet Interconnection Networks in Multi-chiplet Systems for Accelerating FHE Encrypted Neural Network Applications. Transactions on Embedded Computing Systems. (In Press)
Lai, Zewei and Ye, Jinhui and Wang, Xiaohang and Fu, Zheang and Singh, Amit Kumar and Jiang, Yingtao and Ren, Kui and Yang, Mei and Qiu, Sihai and Li, Xiaodong and Tang, Xin and Song, Jie and Zhang, Mingzhe (2025) On Optimizing Intra- and Inter-chiplet Interconnection Networks in Multi-chiplet Systems for Accelerating FHE Encrypted Neural Network Applications. Transactions on Embedded Computing Systems. (In Press)
Lai, Zewei and Ye, Jinhui and Wang, Xiaohang and Fu, Zheang and Singh, Amit Kumar and Jiang, Yingtao and Ren, Kui and Yang, Mei and Qiu, Sihai and Li, Xiaodong and Tang, Xin and Song, Jie and Zhang, Mingzhe (2025) On Optimizing Intra- and Inter-chiplet Interconnection Networks in Multi-chiplet Systems for Accelerating FHE Encrypted Neural Network Applications. Transactions on Embedded Computing Systems. (In Press)
Abstract
Fully Homomorphic Encryption (FHE) is regarded as a promising way to protect data privacy with encrypted computation. Due to high computation overhead, hardware based FHE accelerators were proposed to speed up FHE applications. To support complicated FHE-encrypted neural network applications, multi-chiplet based FHE accelerators were further proposed for scaling up system size, whereas one of the challenges is designing efficient intra- and inter-chiplet interconnection networks to accelerate data transfer. Conventional regular topologies like mesh or Kite either lead to high inter-chiplet transmission latency or excessive power consumption as these topologies assume uniform bandwidth or radix for nodes/links, ignoring the highly irregular distribution of inter-chiplet communication volumes. On the other hand, the problem of generating customized intra- and inter-chiplet interconnection networks has high complexity and previous network- on-chip topology generation works cannot efficiently improve the intra- and inter-chiplet interconnection networks. In this paper, the intra- and inter-chiplet interconnection optimization problem is defined, aiming to minimize the execution time of FHE applications under cost and power constraints. To efficiently solve this problem, we propose a bilevel optimization algorithm, which decomposes the problem into three sub-problems: (1) FHE parameters selection, (2) task-to-core mapping, and (3) intra-/inter-chiplet interconnection network topology generation. These sub-problems are then solved iteratively. Experimental results demonstrate that our proposed method reduces execution time by 51.66%, 43.16%, 39.44%, 43.34%, and 27.70% compared to REED and four multi-chiplet based FHE accelerators with mesh, Kite, Butterfly, and Florets as inter-chiplet networks. Therefore, the proposed method can effectively accelerate FHE applications on large-scale multi-chiplet systems.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Multi-chiplet Systems; Intra- and Inter-chiplet Interconnection Network optimization |
Subjects: | Z Bibliography. Library Science. Information Resources > ZR Rights Retention |
Divisions: | Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 15 Jul 2025 11:37 |
Last Modified: | 15 Jul 2025 11:39 |
URI: | http://repository.essex.ac.uk/id/eprint/41252 |
Available files
Filename: 25_ESWEEK_Paper_Revise_V4.pdf
Licence: Creative Commons: Attribution 4.0