Electrical and Computer Engineering Faculty Research & Creative Works

Towards Explainability of Dimension Reduction Plots of Unsupervised Learning Model Outcomes

Tony E. Astuhuaman Davila
Daniel B. Hier, Missouri University of Science and Technology
Tayo Obafemi-Ajayi, Missouri University of Science and Technology

Abstract

Dimension reduction methods are used to visualize the output of unsupervised learning models when applied to complex data. These techniques improve interpretability by transforming a high-dimension space to a lower-dimension space (usually 2D or 3D). The results are typically viewed as 2D scatter plots, and class centroids may be added to increase interpretability. Although useful, the relationship of these class centroids to the underlying feature space remains opaque. The innovative aspect of this work is to create a strong link between the dimension-reduced space and the underlying high-dimension feature space by adding selected feature centroids to the 2D scatter plots. This approach simultaneously visualizes the centers for the classes and the features on the same 2D scatter plot. Since classes are often imbalanced, we provide a method to balance class sizes. We present an automated framework that performs a grid search to find the optimal dimension reduction parameters, balances the class sizes, uses an ensemble approach to find the most important features, and adds class centroids and selected feature centroids to 2D dimension-reduced plots. This is especially useful when applied to complex, feature-rich biomedical data, as the feature centroids added to 2D scatter plots serve as landmarks for the previously featureless dimension-reduced space. The utility of this approach is demonstrated by its application to seven classes of neurogenetic diseases with 31 defining phenotypic features.
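
The idea of placing class centroids and feature centroids on the same 2D plot can be illustrated with a minimal sketch. The abstract does not state how a feature centroid is computed, so the sketch assumes it is the mean 2D position of the samples, weighted by each sample's value for that feature (for binary phenotype features, the mean position of the samples that show the feature). The function names `class_centroids` and `feature_centroids` and the toy coordinates are hypothetical; in practice the 2D points would come from a dimension reduction method rather than being typed in by hand.

```python
from collections import defaultdict


def class_centroids(points, labels):
    """Return the mean 2D position of the points in each class."""
    sums = defaultdict(lambda: [0.0, 0.0, 0])  # running x-sum, y-sum, count
    for (x, y), label in zip(points, labels):
        acc = sums[label]
        acc[0] += x
        acc[1] += y
        acc[2] += 1
    return {label: (sx / n, sy / n) for label, (sx, sy, n) in sums.items()}


def feature_centroids(points, features, names):
    """Return a feature-weighted mean 2D position for each feature.

    ASSUMPTION: this weighted-mean definition is illustrative only and is
    not taken from the paper. Features absent from every sample are skipped.
    """
    result = {}
    for j, name in enumerate(names):
        total = sum(row[j] for row in features)
        if total == 0:
            continue  # no sample carries this feature, so no centroid exists
        cx = sum(row[j] * x for row, (x, _) in zip(features, points)) / total
        cy = sum(row[j] * y for row, (_, y) in zip(features, points)) / total
        result[name] = (cx, cy)
    return result


# Toy data: four samples already embedded in 2D, two classes, three features.
points = [(0.0, 0.0), (2.0, 0.0), (10.0, 10.0), (12.0, 10.0)]
labels = ["A", "A", "B", "B"]
names = ["f1", "f2", "f3"]
features = [
    [1, 0, 0],
    [1, 1, 0],
    [0, 1, 0],
    [0, 1, 0],
]

class_cents = class_centroids(points, labels)
feat_cents = feature_centroids(points, features, names)
```

In this toy case the centroid of `f1` coincides with the centroid of class `A`, since only class `A` samples carry it, while `f2` falls between the two classes, pulled toward `B`. That is the sense in which feature centroids could act as landmarks relating the reduced space back to the original features.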

Recommended Citation

T. E. Davila et al., "Towards Explainability of Dimension Reduction Plots of Unsupervised Learning Model Outcomes," 21st IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2024, Institute of Electrical and Electronics Engineers, Jan 2024.

The definitive version is available at https://doi.org/10.1109/CIBCB58642.2024.10702157

Department(s)

Electrical and Computer Engineering

Document Type

Article - Conference proceedings

Document Version

Citation

File Type

text

Language(s)

English

Publication Date

01 Jan 2024

Included in

Electrical and Computer Engineering Commons
