Mining Pareto-Optimal Counterfactual Antecedents with a Branch-And-Bound Model-Agnostic Algorithm

Marcos Medeiros Raimundo, Luis Gustavo Nonato, Jorge Poco
Data Mining and Knowledge Discovery · 2024 · December 16, 2022
Mining Pareto-Optimal Counterfactual Antecedents with a Branch-And-Bound Model-Agnostic Algorithm

Enumeration for Logistic Regression of counterfactual antecedents with Pareto-optimal values when each feature is considered as an objective function. Orig column is the original sample and the columns C1 to C28 are counterfactual antecedents.

Publication Details

Venue
Data Mining and Knowledge Discovery
Year
2024
Publication Date
December 16, 2022
DOI
https://doi.org/10.1007/s10618-022-00906-4

Abstract

Mining counterfactual antecedents became a valuable tool to discover knowledge and explain machine learning models. It consists of generating synthetic samples from an original sample to achieve the desired outcome in a machine learning model, thus helping to understand the prediction. An insightful methodology would explore a broader set of counterfactual antecedents to reveal multiple possibilities while operating on any classifier. Therefore, we create a tree-based search that requires monotonicity from the objective functions (a.k.a. cost functions); it allows pruning branches that will not improve the objective functions. Since monotonicity is only required for the objective function, this method can be used for any family of classifiers (e.g., linear models, neural networks, and decision trees). However, additional classifier properties speed up the tree search when it foresees branches that will not result in feasible actions. Moreover, the proposed optimization generates a diverse set of Pareto-optimal counterfactual antecedents by relying on multi-objective concepts. The results show an algorithm with working guarantees that enumerates a wide range of counterfactual antecedents. It helps the decision-maker understand the machine learning decision and finds alternatives to achieve the desired outcome. The user can inspect these multiple counterfactual antecedents to find the most suitable one and better understand the prediction.

Cite this publication (BIBTEX)

@article{2024-MAPOCAM, 
  title={Mining Pareto-Optimal Counterfactual Antecedents with a Branch-And-Bound Model-Agnostic Algorithm}, 
  author={Marcos Medeiros Raimundo and Luis Gustavo Nonato and Jorge Poco}, 
  journal={Data Mining and Knowledge Discovery}, 
  year={2024}, 
  url={https://doi.org/10.1007/s10618-022-00906-4},
  date={2022-12-16},
  volume={38},
  issue={5}
}