Multi-View Source Ablation for Faithful Summarization

17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023(2023)

引用 0|浏览37
暂无评分
摘要
In this paper, we present MUFASSA (Multiview Faithfulness Scoring via Source Ablation), a metric for evaluating faithfulness of abstractive summaries, and for guiding training of more faithful summarizers. For evaluation, MUFASSA employs different strategies (e.g., masking entity mentions) to first remove information from the source document to form multiple ablated views. Then, the faithfulness level of each token in a generated summary is measured by the difference between the token generation probabilities when given the original document and the ablated document as inputs to trained summarizers. For training, MUFASSA uses a novel word truncation objective that drops unfaithful tokens located by MUFASSA in both the decoder input and output. Alignments with human-annotated faithfulness labels on AGGREFACT show that MUFASSA is comparable to or better than existing metrics built on classifiers or QA models pre-trained on other tasks. In experiments on summarization with XSum and CNN/DailyMail, models trained with word truncation using MUFASSA outperform competitive methods according to both automatic faithfulness metrics and human assessments.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要