Scaling communication-intensive applications on BlueGene/P using one-sided communication and overlap

Rome (2009)

Abstract
In earlier work, we showed that the one-sided communication model found in PGAS languages (such as UPC) offers significant advantages in communication efficiency by decoupling data transfer from processor synchronization. We explore the use of the PGAS model on IBM BlueGene/P, an architecture that combines low-power, quad-core processors with extreme scalability. We demonstrate that the PGAS model, using a new port of the Berkeley UPC compiler and the GASNet one-sided communication layer, outperforms two-sided (MPI) communication both in microbenchmarks and in a case study of the communication-limited benchmark NAS FT. We scale the benchmark up to 16,384 cores of the BlueGene/P and demonstrate that UPC consistently outperforms MPI, by as much as 66% for some processor configurations and by 32% on average. In addition, the results demonstrate the scalability of the PGAS model and the Berkeley implementation of UPC, the viability of using it on machines with multicore nodes, and the effectiveness of the BG/P communication layer for supporting one-sided communication and PGAS languages.
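The distinction the abstract draws between one-sided and two-sided communication can be illustrated with a toy model. The sketch below is not UPC, GASNet, or MPI code; the `Node` class and the three functions are hypothetical names invented for illustration, and the model ignores real-world details such as eager protocols and network latency. It only shows that a two-sided transfer completes when the target executes a matching receive, whereas a one-sided put lands in the target's memory without any matching call on the target.

```python
# Toy model only (assumption: not the paper's code, and not a real MPI/UPC API).
# It contrasts *when* data reaches the target's memory under each model.

class Node:
    def __init__(self, size):
        self.memory = [0] * size   # remotely accessible memory segment
        self.pending = []          # sends waiting for a matching receive

def two_sided_send(target, offset, data):
    # The payload is only queued; nothing reaches target.memory yet.
    target.pending.append((offset, list(data)))

def two_sided_recv(node):
    # The target must participate: it matches the queued message and copies it.
    offset, data = node.pending.pop(0)
    node.memory[offset:offset + len(data)] = data

def one_sided_put(target, offset, data):
    # The initiator writes directly; the target executes no matching call.
    target.memory[offset:offset + len(data)] = list(data)

# Two-sided: the data is visible only after the target's receive.
a = Node(4)
two_sided_send(a, 0, [1, 2])
before_recv = list(a.memory)
two_sided_recv(a)
after_recv = list(a.memory)

# One-sided: the data is visible as soon as the put completes.
b = Node(4)
one_sided_put(b, 2, [7, 8])
after_put = list(b.memory)
```

In a real PGAS program the target still needs some separate synchronization (for example a barrier) before it reads the data; the point of the model is that this synchronization is no longer tied to each individual transfer.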
Keywords
Berkeley implementation, IBM BlueGene, Berkeley UPC compiler, PGAS model, PGAS language, communication efficiency, communication-intensive application, one-sided communication model, one-sided communication, BG/P communication layer, GASNet one-sided communication layer, data transfer, parallel processing, computer science, scalability, high level languages, application software, electronics packaging, bandwidth, quad core processors, benchmark testing, hardware, high performance computing