Skip to main content Skip to main navigation


Enhanced Explanations for Knowledge-Augmented Clustering using Subgroup Discovery

Maciej Szelążek; Dan Hudson; Szymon Bobek; Grzegorz J. Nalepa; Martin Atzmueller
In: 2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA). International Conference on Data Science and Advanced Analytics (DSAA-2023), October 9-13, Thessaloniki, Greece, Pages 1-11, IEEE, 11/2023.


Contemporary machine learning techniques are capable of extracting complex structure from data in a way that complements or exceeds manual examination, yet, as is welldocumented, many of these techniques suffer from a lack of interpretability. This paper extends previous work on explainable and interpretable machine learning, in particular on the ‘Knowledge-Augmented Clusters (KnAC)’ approach, allowing human users to benefit from uninterpretable ‘black box’ models to extract structure from datasets by clustering and to make this better understandable. One of the key functions of KnAC is to relate expert-annotated clusters to clusters that have been identified by a machine learning method, and then provide a comprehensible explanation, thus clarifying the relationships that KnAC discovered. Our novel contribution in this paper is to examine the usefulness of subgroup discovery as a way to generate comprehensible explanations within KnAC, and to compare this to the existing approach based on the XAI algorithm Anchors through a detailed evaluation. We find that the approach using subgroup discovery performs equally or better in our extensive experimentation testing this on six different datasets.