Hybrid Communication with TCA and InfiniBand on a Parallel Programming Language XcalableACC for GPU Clusters

For parallel HPC applications running on GPU-ready clusters, high communication latency between GPUs on different nodes is a serious obstacle to strong scalability. To reduce the communication latency between GPUs, we proposed the Tightly Coupled Accelerator (TCA) architecture and developed the...

Bibliographic Details
Published in: Proceedings / IEEE International Conference on Cluster Computing, pp. 627-634
Main Authors: Odajima, Tetsuya; Boku, Taisuke; Hanawa, Toshihiro; Murai, Hitoshi; Nakao, Masahiro; Tabuchi, Akihiro; Sato, Mitsuhisa
Format: Conference Proceeding
Language: English
Published: IEEE 01.09.2015
ISSN: 1552-5244