2021 年 4月 22 日随笔档案 - 红豆哒

2021年4月22日

摘要： 1. 用Pyspark自主实现词频统计过程。 >>> lines = sc.textFile('file:///home/hadoop/cipintongji.txt')>>> words = lines.flatmap(lambda line: line.lower().split())Trace 阅读全文

posted @ 2021-04-22 20:49 红豆哒阅读(118) 评论(0) 推荐(0)

红豆哒

公告