Towards an Energy Complexity Model for Distributed Data Processing Algorithms

IEEE Transactions on Big Data (2023)

Abstract
Modern data centers are core infrastructure in the era of Big Data, and Big Data processing applications are their major computing workload. Electricity accounts for about 50% of data centers' operational costs. The energy consumed by running distributed data processing algorithms in a data center is therefore starting to attract attention from both academia and industry. Most works study energy consumption from the hardware perspective, and only a few from the algorithm perspective. A general, hardware-independent energy evaluation model for algorithms is in demand. With such a model, algorithm designers can evaluate the energy consumption of distributed data processing algorithms, compare their energy consumption features, and optimize their energy use. Inspired by the time complexity model, we propose an energy complexity model that describes how an algorithm's energy consumption grows with its input size. We argue that a good algorithm, especially one for processing Big Data, should have a 'small' energy complexity. We define E(n) to represent the functional relationship that associates an algorithm's input size n with its notional energy consumption E. Based on the well-known abstract Bulk Synchronous Parallel (BSP) computer and programming model, we present a complete E(n) solution, including abstraction, generalization, quantification, derivation, comparison, analysis, examples, verification, and applications. Comprehensive experimental analysis shows that the proposed energy complexity model is practical, interesting, and not equivalent to time complexity.
Keywords
Energy consumption, energy complexity, distributed data processing algorithm, green computing