Unsupervised Discovery of a Statistical Verb Lexicon.

EMNLP '06: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing(2006)

引用 52|浏览32
暂无评分
摘要
This paper demonstrates how unsupervised techniques can be used to learn models of deep linguistic structure. Determining the semantic roles of a verb's dependents is an important step in natural language understanding. We present a method for learning models of verb argument patterns directly from unannotated text. The learned models are similar to existing verb lexicons such as VerbNet and PropBank, but additionally include statistics about the linkings used by each verb. The method is based on a structured probabilistic model of the domain, and unsupervised learning is performed with the EM algorithm. The learned models can also be used discriminatively as semantic role labelers, and when evaluated relative to the PropBank annotation, the best learned model reduces 28% of the error between an informed baseline and an oracle upper bound.
更多
查看译文
关键词
verb argument pattern,verb lexicon,PropBank annotation,semantic role,semantic role labelers,structured probabilistic model,unsupervised learning,unsupervised technique,EM algorithm,deep linguistic structure,Unsupervised discovery,statistical verb lexicon
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要