Summary: features = sc.parallelize(data_group[idx]).map(lambda x: (x.host_ip+'^'+x.domain, 1)).reduceByKey(operator.add).map(get_domain_features) def get_domai
posted @ 2018-04-13 11:02 bonelee Views (556) Comments (0) Recommended (0)
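
The pipeline in the summary above keys each record by `host_ip^domain`, counts occurrences with `reduceByKey(operator.add)`, and then maps each `(key, count)` pair through `get_domain_features`. Because the summary is cut off before that function's body, the following is only a rough local sketch of the same counting pattern in plain Python, without Spark. The `Record` type, the sample data, and `example_domain_features` are all hypothetical stand-ins, not the author's code.

```python
import operator
from collections import namedtuple
from functools import reduce
from itertools import groupby

# Hypothetical record type: the summary only shows that records
# expose the attributes host_ip and domain.
Record = namedtuple("Record", ["host_ip", "domain"])


def count_by_key(records):
    """Local stand-in for .map(lambda x: (key, 1)).reduceByKey(operator.add)."""
    # Same key construction as in the summary: host_ip + '^' + domain.
    pairs = [(r.host_ip + "^" + r.domain, 1) for r in records]
    # groupby only merges adjacent items, so sort by key first.
    pairs.sort(key=lambda kv: kv[0])
    return [
        (key, reduce(operator.add, (value for _, value in group)))
        for key, group in groupby(pairs, key=lambda kv: kv[0])
    ]


def example_domain_features(pair):
    """Hypothetical placeholder for the truncated get_domain_features.

    It only splits the key back into its parts and keeps the count;
    the real function presumably computes more than this.
    """
    key, count = pair
    host_ip, domain = key.split("^", 1)
    return {"host_ip": host_ip, "domain": domain, "count": count}


# Made-up sample data for illustration.
records = [
    Record("10.0.0.1", "example.com"),
    Record("10.0.0.1", "example.com"),
    Record("10.0.0.1", "example.org"),
    Record("10.0.0.2", "example.com"),
]

counts = count_by_key(records)
features = [example_domain_features(p) for p in counts]
```

In Spark the grouping happens across partitions and the result order is not guaranteed, so the sorted output here is a property of this local sketch only.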