String Similarity Joins and Search Under Edit Distance

user-5d54d98b530c705f51c2fe5a(2020)

引用 0|浏览30
暂无评分
摘要
As one of the most important distance metrics, edit distance can reflect noise and errors in sequence data and thus has various applications in data cleaning and integration, databases, bioinformatics, collaborative filtering, and natural language processing. On the other hand, edit distance is also difficult to compute and estimate, which draws both theorists’ and practitioners’ interest for decades. In this thesis, we will investigate several of the most critical problems related to edit distance and propose algorithms that are efficient in practice and have provable theoretical guarantees. We summarize our contributions as follows:
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要