Efficient Multidimensional Fuzzy Search for Personal Information Management Systems

IEEE Transactions on Knowledge and Data Engineering(2012)

引用 11|浏览1
暂无评分
摘要
With the explosion in the amount of semistructured data users access and store in personal information management systems, there is a critical need for powerful search tools to retrieve often very heterogeneous data in a simple and efficient way. Existing tools typically support some IR-style ranking on the textual part of the query, but only consider structure (e.g., file directory) and metadata (e.g., date, file type) as filtering conditions. We propose a novel multidimensional search approach that allows users to perform fuzzy searches for structure and metadata conditions in addition to keyword conditions. Our techniques individually score each dimension and integrate the three dimension scores into a meaningful unified score. We also design indexes and algorithms to efficiently identify the most relevant files that match multidimensional queries. We perform a thorough experimental evaluation of our approach and show that our relaxation and scoring framework for fuzzy query conditions in noncontent dimensions can significantly improve ranking accuracy. We also show that our query processing strategies perform and scale well, making our fuzzy search approach practical for every day usage.
更多
查看译文
关键词
heterogeneous data,personal information management system,multidimensional query,fuzzy search,fuzzy search approach,personal information management systems,efficient multidimensional fuzzy search,dimension score,powerful search tool,multidimensional search,fuzzy query condition,novel multidimensional search approach,ir-style ranking,query processing strategy,information retrieval,query processing,optimization,indexing,xml,information management
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要