http://spark.apache.org/docs/1.6.1/mllib-clustering.html#latent-dirichlet-allocation-lda  

http://spark.apache.org/docs/1.6.1/api/python/pyspark.mllib.html#pyspark.mllib.clustering.LDAModel