Latent Dirichlet Allocation (LDA) is a common method for topic modeling. That is, if I have a document and want to figure out whether it is a sports article or a mathematics paper, I can use LDA to build a system that looks at a collection of other documents, such as sports articles and mathematics papers, discovers the topics they contain without needing labels, and then automatically decides whether this unseen document's topic is mostly sports or mostly math.
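To make the idea concrete, here is a minimal sketch of LDA fitted with collapsed Gibbs sampling, which is one common way to do inference for this model (the explanation above does not say which method to use). The function name `lda_gibbs`, the hyperparameters `alpha` and `beta`, and the four-document toy corpus are all assumptions made up for illustration. It uses only the Python standard library.

```python
import random


def lda_gibbs(docs, num_topics=2, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Fit LDA to tokenized documents with collapsed Gibbs sampling.

    Returns (theta, topic_word, vocab), where theta[d][k] is the estimated
    share of topic k in document d. All names here are illustrative.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]                  # tokens in doc d assigned to topic k
    topic_word = [[0] * vocab_size for _ in range(num_topics)]    # times word v is assigned to topic k
    topic_total = [0] * num_topics                                # total tokens assigned to topic k
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_d)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Unnormalized conditional probability of each topic for this token.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                # Record the newly sampled assignment.
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    # Smoothed per-document topic proportions from the final counts.
    theta = [
        [(doc_topic[d][t] + alpha) / (len(doc) + num_topics * alpha)
         for t in range(num_topics)]
        for d, doc in enumerate(docs)
    ]
    return theta, topic_word, vocab


# Toy corpus (made up): two sports-like and two math-like documents.
docs = [
    "team scored goal match player".split(),
    "player coach team season match".split(),
    "theorem proof lemma matrix algebra".split(),
    "matrix algebra proof equation theorem".split(),
]
theta, topic_word, vocab = lda_gibbs(docs)
for row in theta:
    print([round(p, 2) for p in row])
```

Each printed row is one document's estimated mix of the two topics. Note that the topics come out as unlabeled indices (0 and 1); deciding that one of them means "sports" and the other "math" is an interpretation a person makes by looking at the words each topic favors. On a corpus this small the split may not always be clean, so treat the output as a demonstration of the mechanics rather than a reliable classifier.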
References:
http://www.quora.com/What-is-a-good-explanation-of-Latent-Dirichlet-Allocation