Sensitivity Analysis for Time Dependent Problems: Optimal Checkpoint-Recompute HPC Workflows

Workflows in Support of Large-Scale Science(2014)

引用 3|浏览23
暂无评分
摘要
Sensitivity analysis (SA) is a fundamental tool of uncertainty quantification(UQ). Adjoint-based SA is the optimal approach in many large-scale applications, such as the direct numerical simulation (DNS) of combustion. However, one of the challenges of the adjoint workflow for time-dependent applications is the storage and I/O requirements for the application state. During the time-reversal portion of the workflow, forward state is required in last-in-first-out order. The resulting requirements for storage at exascale are enormous. To mitigate this requirement, application state is regenerated from checkpoints over short windows of application time. This approach drastically reduces the total volume of stored data, allows the caching of state in the regeneration window in memory and on local SSDs, may accelerate the application execution by reducing output frequency, and reduces the power overhead from I/O. We explore variations to this workflow, applied to a proxy for the SA of turbulent combustion, by varying checkpoint number, state storage, and other regeneration options to find efficient implementations for minimizing compute time or power consumption.
更多
查看译文
关键词
parallel processing,sensitivity analysis,storage management,workflow management software,DNS,I/O requirements,adjoint-based SA,direct numerical simulation,forward state,last-in-first-out order,local SSDs,optimal checkpoint-recompute HPC workflows,output frequency reduction,power consumption,power overhead,regeneration window,sensitivity analysis,state storage,time dependent problems,time-reversal portion,turbulent combustion,uncertainty quantification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要