Optimizing Distributed Top-k Queries

WEB INFORMATION SYSTEMS ENGINEERING - WISE 2008, PROCEEDINGS(2008)

引用 5|浏览0
暂无评分
摘要
Top-kquery processing is a fundamental building block for efficient ranking in a large number of applications. Efficiency is a central issue, especially for distributed settings, when the data is spread across different nodes in a network. This paper introduces novel optimization methods for top-kaggregation queries in such distributed environments that can be applied to all algorithms that fall into the frameworks of the prior TPUT and KLEE methods. The optimizations address 1) hierarchically grouping input lists into top-koperator trees and optimizing the tree structure, and 2) computing data-adaptive scan depths for different input sources. The paper presents comprehensive experiments with two different real-life datasets, using the ns-2 network simulator for a packet-level simulation of a large Internet-style network.
更多
查看译文
关键词
top-k queries,hierarchically grouping input list,different input source,central issue,different node,ns-2 network simulator,top-kquery processing,large number,different real-life datasets,large internet-style network,klee method,tree structure,distributed environment,network simulator
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要