Mythos of Model Interpretability
Rigorous Science of Interpretable ML
Decision Tree Interpretability
Counterfactual Explanations for Support Vector Machine Models
ReduNet- A White-box Deep Network from the Principle ofMaximizing Rate Reduction
Identifying Interpretable Subspaces in Image Representations
A Mathematical Framework for Transformer Circuits
Mathematical Framework for Transformer Circuits