摘要: kmeans demo 摘自:http://spark.apache.org/docs/latest/api/python/pyspark.mllib.html#module-pyspark.mllib.feature pyspark.mllib.feature module Python pack 阅读全文
posted @ 2017-11-02 20:11 bonelee 阅读(2628) 评论(1) 推荐(0)
摘要: For (much) more power and flexibility, use a dedicated spellchecking library like PyEnchant. There's a tutorial, or you could just dive straight in: P 阅读全文
posted @ 2017-11-02 12:10 bonelee 阅读(18145) 评论(0) 推荐(1)
摘要: 公用函数的放到了 common.py 文件中. 通过 --py-files 可以在pyspark中可以顺利导入: pyspark --py-files lib/common.py > import common OK 阅读全文
posted @ 2017-11-02 12:09 bonelee 阅读(2883) 评论(0) 推荐(0)