DG-RePlAce: A Dataflow-Driven GPU-Accelerated Analytical Global Placement Framework for Machine Learning Accelerators
arxiv(2024)
摘要
Global placement is a fundamental step in VLSI physical design. The wide use
of 2D processing element (PE) arrays in machine learning accelerators poses new
challenges of scalability and Quality of Results (QoR) for state-of-the-art
academic global placers. In this work, we develop DG-RePlAce, a new and fast
GPU-accelerated global placement framework built on top of the OpenROAD
infrastructure, which exploits the inherent dataflow and datapath structures of
machine learning accelerators. Experimental results with a variety of machine
learning accelerators using a commercial 12nm enablement show that, compared
with RePlAce (DREAMPlace), our approach achieves an average reduction in routed
wirelength by 10
global placement and on-par total runtimes relative to DREAMPlace. Empirical
studies on the TILOS MacroPlacement Benchmarks further demonstrate that
post-route improvements over RePlAce and DREAMPlace may reach beyond the
motivating application to machine learning accelerators.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要