A Method of Moments for Mixture Models and Hidden Markov Models

Animashree Anandkumar; Daniel Hsu; Sham M. Kakade

A Method of Moments for Mixture Models and Hidden Markov Models

Animashree Anandkumar, Daniel Hsu, Sham M. Kakade

Proceedings of the 25th Annual Conference on Learning Theory, PMLR 23:33.1-33.34, 2012.

Abstract

Mixture models are a fundamental tool in applied statistics and machine learning for treating data taken from multiple subpopulations. The current practice for estimating the parameters of such models relies on local search heuristics (\emphe.g., the EM algorithm) which are prone to failure, and existing consistent methods are unfavorable due to their high computational and sample complexity which typically scale exponentially with the number of mixture components. This work develops an efficient \emphmethod of moments approach to parameter estimation for a broad class of high-dimensional mixture models with many components, including multi-view mixtures of Gaussians (such as mixtures of axis-aligned Gaussians) and hidden Markov models. The new method leads to rigorous unsupervised learning results for mixture models that were not achieved by previous works; and, because of its simplicity, it offers a viable alternative to EM for practical deployment.

Cite this Paper

BibTeX


@InProceedings{pmlr-v23-anandkumar12,
  title = 	 {A Method of Moments for Mixture Models and Hidden Markov Models},
  author = 	 {Anandkumar, Animashree and Hsu, Daniel and Kakade, Sham M.},
  booktitle = 	 {Proceedings of the 25th Annual Conference on Learning Theory},
  pages = 	 {33.1--33.34},
  year = 	 {2012},
  editor = 	 {Mannor, Shie and Srebro, Nathan and Williamson, Robert C.},
  volume = 	 {23},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Edinburgh, Scotland},
  month = 	 {25--27 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v23/anandkumar12/anandkumar12.pdf},
  url = 	 {https://proceedings.mlr.press/v23/anandkumar12.html},
  abstract = 	 {Mixture models are a fundamental tool in applied statistics and machine learning for treating data taken from multiple subpopulations. The current practice for estimating the parameters of such models relies on local search heuristics (\emphe.g., the EM algorithm) which are prone to failure, and existing consistent methods are unfavorable due to their high computational and sample complexity which typically scale exponentially with the number of mixture components. This work develops an efficient \emphmethod of moments approach to parameter estimation for a broad class of high-dimensional mixture models with many components, including multi-view mixtures of Gaussians (such as mixtures of axis-aligned Gaussians) and hidden Markov models. The new method leads to rigorous unsupervised learning results for mixture models that were not achieved by previous works; and, because of its simplicity, it offers a viable alternative to EM for practical deployment.}
}

Endnote

%0 Conference Paper
%T A Method of Moments for Mixture Models and Hidden Markov Models
%A Animashree Anandkumar
%A Daniel Hsu
%A Sham M. Kakade
%B Proceedings of the 25th Annual Conference on Learning Theory
%C Proceedings of Machine Learning Research
%D 2012
%E Shie Mannor
%E Nathan Srebro
%E Robert C. Williamson	
%F pmlr-v23-anandkumar12
%I PMLR
%P 33.1--33.34
%U https://proceedings.mlr.press/v23/anandkumar12.html
%V 23
%X Mixture models are a fundamental tool in applied statistics and machine learning for treating data taken from multiple subpopulations. The current practice for estimating the parameters of such models relies on local search heuristics (\emphe.g., the EM algorithm) which are prone to failure, and existing consistent methods are unfavorable due to their high computational and sample complexity which typically scale exponentially with the number of mixture components. This work develops an efficient \emphmethod of moments approach to parameter estimation for a broad class of high-dimensional mixture models with many components, including multi-view mixtures of Gaussians (such as mixtures of axis-aligned Gaussians) and hidden Markov models. The new method leads to rigorous unsupervised learning results for mixture models that were not achieved by previous works; and, because of its simplicity, it offers a viable alternative to EM for practical deployment.

RIS


TY  - CPAPER
TI  - A Method of Moments for Mixture Models and Hidden Markov Models
AU  - Animashree Anandkumar
AU  - Daniel Hsu
AU  - Sham M. Kakade
BT  - Proceedings of the 25th Annual Conference on Learning Theory
DA  - 2012/06/16
ED  - Shie Mannor
ED  - Nathan Srebro
ED  - Robert C. Williamson	
ID  - pmlr-v23-anandkumar12
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 23
SP  - 33.1
EP  - 33.34
L1  - http://proceedings.mlr.press/v23/anandkumar12/anandkumar12.pdf
UR  - https://proceedings.mlr.press/v23/anandkumar12.html
AB  - Mixture models are a fundamental tool in applied statistics and machine learning for treating data taken from multiple subpopulations. The current practice for estimating the parameters of such models relies on local search heuristics (\emphe.g., the EM algorithm) which are prone to failure, and existing consistent methods are unfavorable due to their high computational and sample complexity which typically scale exponentially with the number of mixture components. This work develops an efficient \emphmethod of moments approach to parameter estimation for a broad class of high-dimensional mixture models with many components, including multi-view mixtures of Gaussians (such as mixtures of axis-aligned Gaussians) and hidden Markov models. The new method leads to rigorous unsupervised learning results for mixture models that were not achieved by previous works; and, because of its simplicity, it offers a viable alternative to EM for practical deployment.
ER  -

APA


Anandkumar, A., Hsu, D. & Kakade, S.M.. (2012). A Method of Moments for Mixture Models and Hidden Markov Models. Proceedings of the 25th Annual Conference on Learning Theory, in Proceedings of Machine Learning Research 23:33.1-33.34 Available from https://proceedings.mlr.press/v23/anandkumar12.html.

Related Material

Download PDF