27
• Topic Models (LDA etc., 2003~)
• Define a generative structure involving latent variables (e.g topics)
using well-structured distributions and infer the parameters
• Represent documents / words using low-dimensional, highly
interpretable distributions
• Extensively used in industry. Many open source tools
• Extensive research on speeding up / scaling up
• D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent Dirichlet Allocation.
Journal of Machine Learning Research, 3:993–1022, 2003
• Tutorial: Parameter estimation for text analysis, Gregor Heinrich 2008
Solutions
Copyright 2017 Aaron Li (
[email protected])
Copyright 2017 Aaron Li
(
[email protected])