Computer Science Faculty Research & Creative Works

MP²SDA: Multi-Party Parallelized Sparse Discriminant Learning

Jiang Bian
Haoyi Xiong, Missouri University of Science and TechnologyFollow
Yanjie Fu, Missouri University of Science and TechnologyFollow
Jun Huan
Zhishan Guo

Abstract

Sparse Discriminant Analysis (SDA) has been widely used to improve the performance of classical Fisher's Linear Discriminant Analysis in supervised metric learning, feature selection, and classification. With the increasing needs of distributed data collection, storage, and processing, enabling the Sparse Discriminant Learning to embrace the multi-party distributed computing environments becomes an emerging research topic. This article proposes a novel multi-party SDA algorithm, which can learn SDA models effectively without sharing any raw data and basic statistics among machines. The proposed algorithm (1) leverages the direct estimation of SDA to derive a distributed loss function for the discriminant learning, (2) parameterizes the distributed loss function with local/global estimates through bootstrapping, and (3) approximates a global estimation of linear discriminant projection vector by optimizing the "distributed bootstrapping loss function" with gossip-based stochastic gradient descent. Experimental results on both synthetic and real-world benchmark datasets show that our algorithm can compete with the aggregated SDA with similar performance, and significantly outperforms the most recent distributed SDA in terms of accuracy and F1-score.

Recommended Citation

J. Bian et al., "MP²SDA: Multi-Party Parallelized Sparse Discriminant Learning," ACM Transactions on Knowledge Discovery from Data, vol. 14, no. 3, Association for Computing Machinery (ACM), May 2020.

The definitive version is available at https://doi.org/10.1145/3374919

Department(s)

Computer Science

Comments

Work supported by NSF RAISE CA-FW-HTF 1937833 and NSF CRII CSR 1755965.

Keywords and Phrases

Distributed; Multi-party; Parallelized; Sparse discriminant analysis

International Standard Serial Number (ISSN)

1556-4681; 1556-472X

Document Type

Article - Journal

Document Version

Citation

File Type

text

Language(s)

English

Rights

Publication Date

08 May 2020

Link to Full Text

COinS

Computer Science Faculty Research & Creative Works

MP²SDA: Multi-Party Parallelized Sparse Discriminant Learning

Abstract

Recommended Citation

Department(s)

Comments

Keywords and Phrases

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Search

Browse

Author Corner

Related Content

Useful Links

Article Locations

Computer Science Faculty Research & Creative Works

MP²SDA: Multi-Party Parallelized Sparse Discriminant Learning

Author

Abstract

Recommended Citation

Department(s)

Comments

Keywords and Phrases

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Share

Search

Browse

Author Corner

Related Content

Useful Links

Article Locations