Bayesian Robust PCA for Incomplete Data

Reference:

Jaakko Luttinen, Alexander Ilin, and Juha Karhunen. Bayesian robust PCA for incomplete data. In Proceedings of the 8th International Conference on Independent Component Analysis and Blind Signal Separation (ICA 2009), pages 66–73, Paraty, Brazil, March 2009.

Abstract:

We present a probabilistic model for robust principal component analysis (PCA) in which the observation noise is modelled by Student-t distributions that are independent for different data dimensions. A heavy-tailed noise distribution is used to reduce the negative effect of outliers. Intractability of posterior evaluation is solved using variational Bayesian approximation methods. We show experimentally that the proposed model can be a useful tool for PCA preprocessing for incomplete noisy data. We also demonstrate that the assumed noise model can yield more accurate reconstructions of missing values: Corrupted dimensions of a "bad" sample may be reconstructed well from other dimensions of the same data vector. The model was motivated by a real-world weather dataset which was used for comparison of the proposed technique to relevant probabilistic PCA models.
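
Illustrative sketch (not from the paper): the kind of data the abstract describes can be simulated by drawing from a low-rank linear model, adding Student-t noise independently in each dimension, and hiding some entries. The function name, parameter names, and default values below are assumptions chosen for illustration. The sketch only generates data; it does not implement the variational Bayesian inference used in the paper.

```python
import numpy as np

def sample_robust_pca_data(n_samples=200, n_dims=5, n_components=2,
                           dof=3.0, noise_scale=0.1, missing_rate=0.2, seed=0):
    """Sample incomplete data from a PCA-style model with Student-t noise.

    All names and defaults are hypothetical; they are not taken from the paper.
    """
    rng = np.random.default_rng(seed)

    # Low-rank structure: loadings W, bias mu, latent components X.
    W = rng.normal(size=(n_dims, n_components))
    mu = rng.normal(size=n_dims)
    X = rng.normal(size=(n_components, n_samples))

    # Student-t noise written as a Gaussian scale mixture:
    # u ~ Gamma(shape=dof/2, rate=dof/2), noise | u ~ N(0, noise_scale^2 / u).
    # A separate u is drawn for every dimension and every sample, so an
    # outlier in one dimension does not affect the others.
    u = rng.gamma(shape=dof / 2.0, scale=2.0 / dof, size=(n_dims, n_samples))
    noise = noise_scale * rng.normal(size=(n_dims, n_samples)) / np.sqrt(u)

    Y = W @ X + mu[:, None] + noise

    # Mark a random subset of entries as missing (NaN).
    missing = rng.random(size=Y.shape) < missing_rate
    Y_obs = np.where(missing, np.nan, Y)
    return Y_obs, W, X, mu

Y_obs, W, X, mu = sample_robust_pca_data()
```

Small values of dof give heavier tails and therefore more frequent outliers; large values make the noise approach a Gaussian.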

Suggested BibTeX entry:

@inproceedings{Luttinen09rpca,
    address = {Paraty, Brazil},
    author = {Jaakko Luttinen and Alexander Ilin and Juha Karhunen},
    booktitle = {Proceedings of the 8th International Conference on Independent Component Analysis and Blind Signal Separation ({ICA} 2009)},
    month = {March},
    pages = {66--73},
    title = {{B}ayesian Robust {PCA} for Incomplete Data},
    year = {2009},
}

This work is not available online here.