The Design and Evaluation of a Practical System for Fault-Tolerant Virtual Machines

msra(2010)

Abstract
We have implemented a commercial enterprise-grade system for providing fault-tolerant virtual machines, based on the approach of replicating the execution of a primary virtual machine (VM) via a backup virtual machine on another server. We have designed a complete system in VMware vSphere 4.0 that is easy to use, runs on commodity servers, and typically reduces performance of real applications by less than 10%. Our method for replicating VM execution is similar to that described in Bressoud (3), but we have made a number of significant design changes that greatly improve performance. In addition, an easy-to-use, commercial system that automatically restores redundancy after failure requires many additional components beyond replicated VM execution. We have designed and implemented these extra components and addressed many practical issues encountered in supporting VMs running enterprise applications. In this paper, we describe our basic design, discuss alternate design choices and a number of the implementation details, and provide an evaluation of our performance for both micro-benchmarks and real applications.
Keywords
virtual machines, deterministic replay, fault tolerance, virtual machine, fault tolerant