Designing High-Performance MPI Libraries with On-the-fly Compression for Modern GPU Clusters *

2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS)(2021)

引用 12|浏览17
暂无评分
摘要
While the memory bandwidth of accelerators such as GPU has significantly improved over the last decade, the commodity networks such as Ethernet and InfiniBand are lagging in terms of raw throughput creating. Although there are significant research efforts on improving the large message data transfers for GPU-resident data, the inter-node communication remains the major performance bottleneck due t...
更多
查看译文
关键词
Liquids,Heuristic algorithms,Graphics processing units,Data science,Throughput,Libraries,Real-time systems
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要