Stable and actionable explanations of black-box models through factual and counterfactual rules

Riccardo Guidotti, Anna Monreale, Salvatore Ruggieri, Francesca Naretto, Franco Turini, Dino Pedreschi, Fosca Giannotti

Data Mining and Knowledge Discovery (2022)

Citations: 3 | Views: 93

Abstract

Recent years have witnessed the rise of accurate but obscure classification models that hide the logic of their internal decision processes. Explaining the decision taken by a black-box classifier on a specific input instance is therefore of striking interest. We propose a local rule-based model-agnostic explanation method providing stable and actionable explanations. An explanation consists of a factual logic rule, stating the reasons for the black-box decision, and a set of actionable counterfactual logic rules, proactively suggesting the changes in the instance that lead to a different outcome. Explanations are computed from a decision tree that mimics the behavior of the black-box locally to the instance to explain. The decision tree is obtained through a bagging-like approach that favors stability and fidelity: first, an ensemble of decision trees is learned from neighborhoods of the instance under investigation; then, the ensemble is merged into a single decision tree. Neighbor instances are synthetically generated through a genetic algorithm whose fitness function is driven by the black-box behavior. Experiments show that the proposed method advances the state-of-the-art towards a comprehensive approach that successfully covers stability and actionability of factual and counterfactual explanations.
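
To illustrate the factual and counterfactual rules described above, here is a minimal Python sketch. It is not the authors' implementation. It assumes the merged local decision tree is already available and stands in for it with a tiny hand-written tree; the features `income` and `debt`, the thresholds, the labels, and all function names are hypothetical. The factual rule is the root-to-leaf path that the instance satisfies. Counterfactual rules are taken here to be the paths with a different outcome that the instance violates in the fewest conditions, which is one plausible reading of the abstract rather than a confirmed detail. Neighborhood generation, tree merging, and actionability constraints are left out.

```python
from typing import Dict, List, Tuple, Union

Condition = Tuple[str, str, float]   # (feature, "<=" or ">", threshold)
Tree = Union[str, Tuple[str, float, "Tree", "Tree"]]  # leaf label or split node

# Hypothetical surrogate tree (assumption, not taken from the paper):
# income <= 30 -> deny; otherwise debt <= 10 -> grant, else deny.
TREE: Tree = ("income", 30.0, "deny", ("debt", 10.0, "grant", "deny"))


def holds(cond: Condition, x: Dict[str, float]) -> bool:
    """Check whether instance x satisfies a single split condition."""
    feature, op, threshold = cond
    return x[feature] <= threshold if op == "<=" else x[feature] > threshold


def rules(tree: Tree, premise: Tuple[Condition, ...] = ()):
    """Enumerate every root-to-leaf path as a (premise, label) rule."""
    if isinstance(tree, str):  # leaf: the premise collected so far is a rule
        return [(list(premise), tree)]
    feature, threshold, left, right = tree
    return (rules(left, premise + ((feature, "<=", threshold),))
            + rules(right, premise + ((feature, ">", threshold),)))


def factual_rule(tree: Tree, x: Dict[str, float]):
    """Return the rule whose premise is fully satisfied by x."""
    return next(r for r in rules(tree) if all(holds(c, x) for c in r[0]))


def counterfactual_rules(tree: Tree, x: Dict[str, float]):
    """Return rules with a different outcome and the fewest violated conditions.

    Each result is (premise, label, violated), where `violated` lists the
    conditions x would have to meet to follow that rule.
    """
    outcome = factual_rule(tree, x)[1]
    candidates = []
    for premise, label in rules(tree):
        if label != outcome:
            violated = [c for c in premise if not holds(c, x)]
            candidates.append((premise, label, violated))
    if not candidates:
        return []
    fewest = min(len(v) for _, _, v in candidates)
    return [c for c in candidates if len(c[2]) == fewest]


instance = {"income": 25.0, "debt": 5.0}
factual = factual_rule(TREE, instance)
counterfactuals = counterfactual_rules(TREE, instance)
print("factual:", factual)
print("counterfactuals:", counterfactuals)
```

For this instance the factual rule reads "income <= 30, therefore deny", and the only counterfactual rule requires a single change, income > 30, to reach "grant".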

Keywords

Explainable AI, Local explanations, Model-agnostic explanations, Rule-based explanations, Counterfactuals
