🤖 AI Summary
This work addresses the limited decision transparency of multiclass Bayesian network classifiers, which stems from the absence of efficient and interpretable logical representations. The paper proposes a novel algorithm that, for the first time, compiles multiclass Bayesian network classifiers into class formulas—logical expressions that precisely capture their input–output behavior. Moving beyond existing approaches restricted to binary classification, the method significantly improves compilation efficiency and produces negation normal form (NNF) circuits endowed with OR-decomposability, thereby offering a structured foundation for downstream logic-based explanations. Experimental results demonstrate that the proposed approach drastically reduces compilation time while yielding logical representations that are more amenable to interpretation.
📝 Abstract
We propose a new algorithm for compiling Bayesian network classifier (BNC) into class formulas. Class formulas are logical formulas that represent a classifier's input-output behavior, and are crucial in the recent line of work that uses logical reasoning to explain the decisions made by classifiers. Compared to prior work on compiling class formulas of BNCs, our proposed algorithm is not restricted to binary classifiers, shows significant improvement in compilation time, and outputs class formulas as negation normal form (NNF) circuits that are OR-decomposable, which is an important property when computing explanations of classifiers.