Abstract
Objective: Clustering is widely used to identify meaningful subgroups in biomedical data, but interpretation remains challenging, especially in the absence of ground-truth labels. Moreover, clustering often produces multiple plausible solutions without a single correct answer. Using dementia phenotypes as a case study, we introduce a Formal Explanation Space (FES) to improve interpretability and facilitate comparison across competing cluster solutions. Methods: We used spectral coclustering and spectral biclustering to cluster a dataset of dementia patients based on clinical phenotypes (signs and symptoms). To enhance interpretability, we constructed an FES to explain algorithm behavior, assess cluster quality, identify influential features, and characterize cluster composition. Although simultaneous clustering is unsupervised, interpretation was aided by diagnostic labels, which we used for external validation of cluster composition. Results: Spectral coclustering and spectral biclustering each identified five biologically plausible dementia subgroups, though subgroup composition differed by method. The FES provided a structured framework for comparing these divergent outputs. Conclusions: Clustering complex biomedical data often produces multiple biologically plausible solutions. Retaining and comparing such solutions within a formal explanation space enhances interpretability and supports the discovery of complementary insights across methods.
Recommended Citation
R. Yelugam et al., "A Formal Explanation Space for the Simultaneous Clustering of Neurologic Diseases based on their Signs and Symptoms," BMC Medical Informatics and Decision Making, vol. 26, no. 1, article no. 11, BioMed Central, Dec 2026.
The definitive version is available at https://doi.org/10.1186/s12911-025-03297-w
Department(s)
Electrical and Computer Engineering
Second Department
Computer Science
Publication Status
Open Access
Keywords and Phrases
Biclusters; Biological plausibility; Cluster composition; Cluster interpretation; Explainable AI; Explanation space; Heat maps; Phenotype; Simultaneous clustering; Word clouds
International Standard Serial Number (ISSN)
1472-6947
Document Type
Article - Journal
Document Version
Final Version
File Type
text
Language(s)
English
Rights
© 2026 The Authors, All rights reserved.
Creative Commons Licensing

This work is licensed under a Creative Commons Attribution 4.0 License.
Publication Date
01 Dec 2026
PubMed ID
41361443

Comments
National Science Foundation, Grant 2420248