ABSTRACT
Building a common representation for several related data sets is an important problem in multi-view learning. CCA and its extensions have shown that they are effective in finding the shared variation among all data sets. However, these models generally fail to exploit the common structure of the data when the views are with private information. Recently, methods explicitly modeling the information into shared part and private parts have been proposed, but they presume to know the prior knowledge about the latent space, which is usually impossible to obtain. In this paper, we propose a probabilistic model, which could simultaneously learn the structure of the latent space whilst factorize the information correctly, therefore the prior knowledge of the latent space is unnecessary. Furthermore, as a probabilistic model, our method is able to deal with missing data problem in a natural way. We show that our approach attains the performance of state-of-art methods on the task of human pose estimation when the motion capture view is completely missing, and significantly improves the inference accuracy with only a few observed data.
- C. Archambeau and F. Bach. Sparse probabilistic projections. In Proceedings of the Conference on Neural Information Processing Systems (NIPS), volume 21. MIT Press, 2008.Google Scholar
- F. Bach and M. Jordan. A probabilistic interpretation of canonical correlation analysis. Technical report, Department of Statistics, University of California, Berkeley, 2005.Google Scholar
- C. Ek, J. Rihan, P. Torr, G. Rogez, and N. Lawrence. Ambiguity modeling in latent spaces. In Proceedings of the International Conference on Machine Learning for Multimodal Interaction, pages 62--73. Springer-Verlag, 2008. Google ScholarDigital Library
- C. Ek, P. Torr, and N. Lawrence. Gaussian process latent variable models for human pose estimation. In Proceedings of the International Conference on Machine learning for Multimodal Interaction, pages 132--143. Springer-Verlag, 2007. Google ScholarDigital Library
- J. Fan, W. G. Aref, A. K. Elmagarmid, M.-S. Hacid, M. S. Marzouk, and X. Zhu. Multiview: Multilevel video content representation and retrieval. J. Electronic Imaging.Google Scholar
- D. Hardoon and J. Shawe-Taylor. Sparse canonical correlation analysis. Machine Learning, pages 1--23, 2008. Google ScholarDigital Library
- Y. Jia, M. Salzmann, and T. Darrell. Factorized Latent Spaces with Structured Sparsity. In Proceedings of the Conference on Neural Information Processing Systems (NIPS), volume 23. MIT Press, 2010.Google Scholar
- A. Klami and S. Kaski. Probabilistic approach to detecting dependencies between data sets. Neurocomputing, 72(1--3):39--46, 2008. Google ScholarDigital Library
- M. Kuss and T. Graepel. The geometry of kernel canonical correlation analysis. Technical report, Max Planck Institution, 2003.Google Scholar
- K. Lange, D. Hunter, and I. Yang. Optimization transfer using surrogate objective functions. Journal of Computational and Graphical Statistics, 9(1):1--20, 2000.Google Scholar
- N. Lawrence. Gaussian process latent variable models for visualisation of high dimensional data. In Proceedings of the Conference on Neural Information Processing Systems (NIPS), volume 16, pages 329--336. MIT Press, 2004.Google Scholar
- G. Leen. Context assisted information extraction. PhD thesis, Univerisyt of the West of Scotland,, 2008.Google Scholar
- R. Navaratnam, A. Fitzgibbon, and R. Cipolla. The joint manifold model for semi-supervised multi-valued regression. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2007.Google ScholarCross Ref
- M. Salzmann, C. Ek, R. Urtasun, and T. Darrell. Factorized orthogonal latent spaces. In International Conference on Artificial Intelligence and Statistics, 2010.Google Scholar
- A. Shon, K. Grochow, A. Hertzmann, and R. Rao. Learning shared latent structure for image synthesis and robotic imitation. In Proceedings of the Conference on Neural Information Processing Systems (NIPS), volume 18, pages 1233--1240. MIT Press, 2006.Google Scholar
- L. Sigal and M. Black. Humaneva: Synchronized video and motion capture dataset for evaluation of articulated human motion. Technical report, Brown Univertsity, 2006.Google Scholar
- L. Sigal, R. Memisevic, and D. Fleet. Shared kernel information embedding for discriminative inference. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009.Google ScholarCross Ref
- A. Tipping. Analysis of sparse Bayesian learning. In Proceedings of the Conference on Neural Information Processing Systems (NIPS), pages 383--390. MIT Press, 2002.Google Scholar
- M. Tipping and C. Bishop. Probabilistic principal component analysis. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 61(3):611--622, 1999.Google Scholar
Recommendations
Factorized latent spaces with structured sparsity
NIPS'10: Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 1Recent approaches to multi-view learning have shown that factorizing the information into parts that are shared across all views and parts that are private to each view could effectively account for the dependencies and independencies between the ...
Latent-Space Variational Bayes
Variational Bayesian Expectation-Maximization (VBEM), an approximate inference method for probabilistic models based on factorizing over latent variables and model parameters, has been a standard technique for practical Bayesian inference. In this paper,...
Probabilistic latent maximal marginal relevance
SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrievalDiversity has been heavily motivated in the information retrieval literature as an objective criterion for result sets in search and recommender systems. Perhaps one of the most well-known and most used algorithms for result set diversification is that ...
Comments