Ripple: Improved Architecture and Programming Model for Bulk Synchronous Parallel Style of Analytics

Distributed Computing Systems(2013)

引用 4|浏览4
暂无评分
摘要
We present Ripple, an architecture and a programming model for a broad set of data analytics. Ripple builds on the ideas of iterated MapReduce and adds two innovations. First it has a richer programming model, including more ideas from the Bulk Synchronous Parallel (BSP) model of computation and others. By doing so, Ripple creates a flexible and higher-level platform that is easier for both application programmers and platform implementors. Second, Ripple is based on a limited interface for key/value storage making it portable among many different key/value store implementations. By building on these two ideas Ripple improves the scope, performance, and openness of the data analytics platform. We evaluate Ripple using three representative, and non-trivial, data analysis scenarios requiring iterative computation. Using these examples, we show how Ripple achieves clear performance advantages over iterated MapReduce.
更多
查看译文
关键词
data analysis,distributed databases,iterative methods,parallel programming,software architecture,BSP model,MapReduce,Ripple,application programmers,architecture,bulk synchronous parallel model,data analysis,data analytics,distributed database,iterative computation,key/value storage,platform implementors,programming model,Distributed databases,Distributed programming
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要