Interpretable Machine Learning
How can we make a machine learning model convincing? While accuracy is undoubtedly necessary, it is rarely sufficient. Models such as neural networks typically involve millions of operations to turn their input data into a prediction. This complexity allows them to accurately solve hard problems like computer vision and protein structure prediction. However, this accuracy comes at the expense of interpretability: these complex models appear as black boxes to human users. As models penetrate critical areas such as medicine, finance and the criminal justice system, their black-box nature becomes a major issue. An important question follows: is it possible to explain the predictions of complex machine-learning models?
Explainable AI tackles this question by providing an interface between complex models and human users. To illustrate, let us consider the example of a medical machine learning model that recommends a treatment for a patient. By using post-hoc explainability, we can answer crucial questions such as “What part of this patient’s data motivates the model’s recommendation?” or “Are there similar patients previously seen by the model for which this treatment worked?”. In a setting where human knowledge is available (e.g. computer vision), this type of information is crucial to validate or debug the model. In a setting where little human knowledge is available (e.g. scientific discovery), this type of information allows us to extract knowledge from the model.
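As a concrete illustration of the first question, the minimal sketch below scores each input feature by how much the prediction changes when that feature is occluded, a simple form of post-hoc attribution that treats the model as a black box. The toy model, patient features and baseline value are all hypothetical placeholders, standing in for any predictor and dataset.

```python
import numpy as np

# Hypothetical black-box model: any callable mapping a feature vector to a score.
# Here we use a toy logistic model purely for illustration.
rng = np.random.default_rng(0)
weights = rng.normal(size=5)

def model(x):
    """Toy 'treatment recommendation' score in [0, 1]."""
    return 1.0 / (1.0 + np.exp(-x @ weights))

def occlusion_attribution(model, x, baseline=0.0):
    """Attribute the prediction to each feature by measuring how much the
    output changes when that feature is replaced with a baseline value."""
    reference = model(x)
    attributions = np.zeros_like(x)
    for i in range(len(x)):
        x_occluded = x.copy()
        x_occluded[i] = baseline          # "remove" feature i
        attributions[i] = reference - model(x_occluded)
    return attributions

patient = rng.normal(size=5)              # hypothetical patient features
print(occlusion_attribution(model, patient))
```

Features with large positive attributions are those whose presence most increases the recommendation score, which is precisely the kind of evidence a clinician would want to inspect before trusting the model.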