OGM: Online Gaussian Graphical Models on the Fly
Gaussian graphical models are widely used to understand the dependencies between variables in high-dimensional data and enable a wide range of applications such as principal component analysis, discriminant analysis, and canonical analysis. Motivated by the streaming nature of big data, in this paper we study a novel Online Gaussian Graphical Model (OGM) that estimates the inverse covariance matrix over high-dimensional streaming data. Specifically, given a small number of samples to initialize the learning process, OGM first computes a low-rank estimate of the inverse covariance matrix; then, as each new sample arrives, it updates that estimate using a low-complexity updating rule that requires neither the past data nor a matrix inversion. The significant edges of the Gaussian graphical model can be discovered by thresholding the estimated inverse covariance matrix. Theoretical analysis shows that convergence of OGM to the true parameters is guaranteed at a Bernstein-style rate under mild conditions. We evaluate OGM in extensive experiments, and the results back up our theory.
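The initialize, update, and threshold workflow described in the abstract can be illustrated with a generic sketch. This is not the paper's updating rule, which the abstract does not spell out. As a stand-in it uses the standard Sherman-Morrison rank-one identity, which also updates an inverse in O(d^2) per sample without storing past data or inverting a matrix. The function names, the `ridge` regularizer, the zero-mean assumption, and the threshold value are all assumptions made for illustration.

```python
import numpy as np


def init_precision(X0, ridge=1.0):
    """Return P = (ridge * I + X0^T X0)^{-1} from a small initial batch.

    This is the only explicit matrix inversion in the sketch. The ridge
    term is an assumption that keeps the matrix invertible when the
    batch has fewer samples than dimensions.
    """
    d = X0.shape[1]
    return np.linalg.inv(ridge * np.eye(d) + X0.T @ X0)


def sherman_morrison_update(P, x):
    """Given symmetric P = A^{-1}, return (A + x x^T)^{-1} in O(d^2).

    Uses the Sherman-Morrison identity, so neither past samples nor a
    new matrix inversion are needed.
    """
    Px = P @ x
    return P - np.outer(Px, Px) / (1.0 + x @ Px)


def significant_edges(P, n_samples, threshold):
    """Boolean adjacency mask from thresholding the precision estimate.

    Assumes zero-mean data, so the covariance estimate is A / n_samples
    and the precision estimate is n_samples * P.
    """
    theta = n_samples * P
    mask = np.abs(theta) > threshold
    np.fill_diagonal(mask, False)  # ignore self-loops
    return mask


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    stream = rng.standard_normal((200, 5))  # synthetic zero-mean stream

    P = init_precision(stream[:10])         # small initial batch
    for x in stream[10:]:                   # one sample at a time
        P = sherman_morrison_update(P, x)

    print(significant_edges(P, n_samples=len(stream), threshold=0.2))
```

In this sketch each update costs one matrix-vector product and one outer product, which is what makes per-sample processing cheap compared with re-inverting a d-by-d matrix after every arrival.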
S. Yang et al., "OGM: Online Gaussian Graphical Models on the Fly," Applied Intelligence, vol. 52, no. 3, pp. 3103 - 3117, Springer Verlag, Feb 2022.
The definitive version is available at https://doi.org/10.1007/s10489-021-02563-4
Keywords and Phrases
Advanced Analytics; Gaussian Graphical Models; Online Learning over Streaming Data
International Standard Serial Number (ISSN)
Article - Journal
© 2022 Springer, All rights reserved.
01 Feb 2022