Extracting Sequences from the Web

Anthony Fader,Stephen Soderland,Oren Etzioni

Meeting of the Association for Computational Linguistics（2010）

引用 24|浏览84

暂无评分

摘要

Classical Information Extraction (IE) systems fill slots in domain-specific frames. This paper reports on SEQ, a novel open IE system that leverages a domain-independent frame to extract ordered sequences such as presidents of the United States or the most common causes of death in the U.S. SEQ leverages regularities about sequences to extract a coherent set of sequences from Web text. SEQ nearly doubles the area under the precision-recall curve compared to an extractor that does not exploit these regularities.

查看译文

关键词

web text,united states,extracting sequence,domain-specific frame,common cause,classical information extraction,u.s. seq leverages regularity,novel open ie system,domain-independent frame,coherent set,paper report

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要