Interpretable AI Series (3) Model-agnostic methods: Global interpretability
The 3rd chapter is mainly about the first part of the model-agnostic methods: global interpretability, which interprets the influences of features on the dependent variable (label) on the whole model level. The model used in this chapter is the random forest, or decision tree ensemble, while the method can be easily extended to other black-box models. Here "model-agnostic" means these methods can be used to interpret any model without modifying the details of the method.
The first section introduced the dataset: high school student performance and several other features, such as race, parent level of education, etc, followed by an introduction of exploratory data analysis (EDA for short). Here the author mainly used histograms to show the grade distribution according to different categorical features, and their combinations.
The following section recaps tree ensembles, including 2 possible approaches of an ensemble: bagging (random forest) and boosting (adaptive boosting and gradient boosting). How to train a random forest, adaptive boost, and gradient boosting trees with Scikit-learn, followed by how to use precision, recall, and f1 score to evaluate the performance of a model. All these provided a necessary background for the topic of this chapter.
The third section is a brief introduction to the random-forest-specific interpretation technique, a weighted sum of the feature importance of each element (tree), and the corresponding implementation in Python.
The last section is the essence of this chapter, which consists of 2 methods: partial dependence plots (PDP for short), and the method how investigating the interaction between features.
A package named "pdpbox" is used to plot the partial dependence plots. The author plotted PDP on the student performance dataset and interpreted the plot thoroughly. While PDP provides many insights about the influences of features, it assumes features are independent of each other, which is not true in most cases. In others words, if there're interactions between features, the PDP won't be trusted completely. Here comes the feature interaction plot. Like a heatmap, the x- and y-axis are values of that two categorical interactive features. For example, different races on the x-axis, and different parent education levels on the y-axis. The color of an intersected block represents the influence of the combination of the two categorical values.
Comments
Post a Comment