Summary:
union, intersection, subtract, cartesian rdd1 = sc.parallelize([1,2,4,5,2,3]) rdd2 = sc.parallelize([4,6,5,7,8,6]) rdd1.union(rdd2).collect(): all of rdd1 and rdd…
posted @ 2021-03-15 23:41
boye169
Summary:
[Example] from pyspark.sql import SparkSession def split_line(line): try: return line.split(b"\t") except:pass def map_partitions(partitions): for li…
posted @ 2021-03-15 23:31
boye169

