Evidence combination in biomedical natural-language processing.

BIOKDD'03: Proceedings of the 3rd International Conference on Data Mining in Bioinformatics (2003)

Abstract
In many natural language tasks, such as information extraction and semantic lexicon building, individual entities and relations of interest may be found in multiple contexts within the corpus. In deciding which putative entities and relations should be extracted, a key problem is how to combine evidence across the multiple occurrences of these entities and relations. We present a novel statistical approach to address this issue, and evaluate it in the context of extracting protein names and protein-protein interactions from MEDLINE abstracts. We experimentally compare our method against a number of intuitive and simpler baselines. Our experimental results suggest that the issue of combining evidence is indeed important in these tasks. Furthermore, we show that our proposed method outperforms the baselines considered in a variety of settings.