Incremental discovery of denial constraints

Chaoqin Qian, Menglu Li, Zijing Tan, Ai Ran, Shuai Ma

The VLDB Journal (2023)

Abstract
We investigate the problem of incremental denial constraint (DC) discovery, aiming at discovering DCs in response to a set Δr of tuple insertions to a given relational instance r and the known set Σ of DCs holding on r. The need for the study is evident since real-life data are frequently updated, and it is often prohibitively expensive to perform DC discovery from scratch for every update. We tackle this problem in two steps. We first employ indexing techniques to efficiently identify the incremental evidences caused by Δr. We present algorithms to build indexes for Σ and r in the pre-processing step, and to visit and update the indexes in response to Δr. In particular, we propose a novel indexing technique for two inequality comparisons, possibly across the attributes of r. By leveraging the indexes, we can identify all the tuple pairs incurred by Δr that simultaneously satisfy the two comparisons, with a cost dependent on log(|r|). We then compute the changes ΔΣ to Σ based on the incremental evidences, such that Σ ⊕ ΔΣ is the set of DCs holding on r + Δr. ΔΣ may contain new DCs that are added into Σ and obsolete DCs that are removed from Σ. Our experimental evaluations show that our incremental approach is faster, by orders of magnitude, than the two state-of-the-art batch DC discovery approaches that compute from scratch on r + Δr, even when Δr is up to 30...
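To make the idea of finding tuple pairs incurred by Δr concrete, the following is a minimal sketch and not the index proposed in the paper. It assumes a single pair of comparisons, new[a] > old[a] and new[b] < old[b], keeps one sorted list on attribute a, uses binary search to cut the candidates for the first comparison, and then filters the candidates on the second comparison by a plain scan. The names `build_index` and `incremental_pairs` and the attributes `salary` and `tax` are hypothetical. Only the direction (inserted tuple, earlier tuple) is covered.

```python
from bisect import bisect_left, insort


def build_index(rows, a):
    """Pre-processing: sorted list of (value of attribute a, row id) over r."""
    return sorted((row[a], rid) for rid, row in enumerate(rows))


def incremental_pairs(rows, index, new_rows, a, b):
    """Return pairs (new_id, old_id) with new[a] > old[a] and new[b] < old[b].

    `rows` and `index` are updated in place, so tuples inserted earlier
    in the same batch are visible to tuples inserted later.
    """
    pairs = []
    for t in new_rows:
        new_id = len(rows)
        # Binary search: every index entry before `cut` has old[a] < t[a].
        cut = bisect_left(index, (t[a], -1))
        # Second comparison is checked by scanning the candidates
        # (a simplification; the paper's index handles both comparisons).
        for _, old_id in index[:cut]:
            if t[b] < rows[old_id][b]:
                pairs.append((new_id, old_id))
        # Update the relation and the index for the next insertion.
        rows.append(t)
        insort(index, (t[a], new_id))
    return pairs


r = [
    {"salary": 10, "tax": 5},
    {"salary": 20, "tax": 3},
    {"salary": 30, "tax": 9},
]
idx = build_index(r, "salary")
delta_r = [{"salary": 25, "tax": 4}, {"salary": 40, "tax": 1}]
found = incremental_pairs(r, idx, delta_r, "salary", "tax")
```

Each returned pair is a candidate contribution to the incremental evidence for a DC that forbids the two comparisons from holding together. The scan over the candidates is linear in the worst case, which is the cost that a dedicated index for two inequality comparisons is meant to avoid.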
Keywords
Data profiling, Denial constraint, Dependency discovery