Extracting Sequences from the Web

Meeting of the Association for Computational Linguistics(2010)

引用 24|浏览84
暂无评分
摘要
Classical Information Extraction (IE) systems fill slots in domain-specific frames. This paper reports on SEQ, a novel open IE system that leverages a domain-independent frame to extract ordered sequences such as presidents of the United States or the most common causes of death in the U.S. SEQ leverages regularities about sequences to extract a coherent set of sequences from Web text. SEQ nearly doubles the area under the precision-recall curve compared to an extractor that does not exploit these regularities.
更多
查看译文
关键词
web text,united states,extracting sequence,domain-specific frame,common cause,classical information extraction,u.s. seq leverages regularity,novel open ie system,domain-independent frame,coherent set,paper report
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要