Abstract

Objective: Clustering is widely used to identify meaningful subgroups in biomedical data, but interpretation remains challenging, especially in the absence of ground-truth labels. Moreover, clustering often produces multiple plausible solutions without a single correct answer. Using dementia phenotypes as a case study, we introduce a Formal Explanation Space (FES) to improve interpretability and facilitate comparison across competing cluster solutions. Methods: We used spectral coclustering and spectral biclustering to cluster a dataset of dementia patients based on clinical phenotypes (signs and symptoms). To enhance interpretability, we constructed an FES to explain algorithm behavior, assess cluster quality, identify influential features, and characterize cluster composition. Although simultaneous clustering is unsupervised, interpretation was aided by diagnostic labels, which we used for external validation of cluster composition. Results: Spectral coclustering and spectral biclustering each identified five biologically plausible dementia subgroups, though subgroup composition differed by method. The FES provided a structured framework for comparing these divergent outputs. Conclusions: Clustering complex biomedical data often produces multiple biologically plausible solutions. Retaining and comparing such solutions within a formal explanation space enhances interpretability and supports the discovery of complementary insights across methods.

Department(s)

Electrical and Computer Engineering

Second Department

Computer Science

Publication Status

Open Access

Comments

National Science Foundation, Grant 2420248

Keywords and Phrases

Biclusters; Biological plausibility; Cluster composition; Cluster interpretation; Explainable AI; Explanation space; Heat maps; Phenotype; Simultaneous clustering; Word clouds

International Standard Serial Number (ISSN)

1472-6947

Document Type

Article - Journal

Document Version

Final Version

File Type

text

Language(s)

English

Rights

© 2026 The Authors, All rights reserved.

Creative Commons Licensing

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Publication Date

01 Dec 2026

PubMed ID

41361443

Available for download on Tuesday, December 01, 2026

Share

 
COinS