Word segmentation of Liaozhai (聊斋) with jieba

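import jieba

# Read the full text; the source file is GBK-encoded.
with open(r'C:\liaozhai.txt', mode='r', encoding='gbk') as f:
    txt = f.read()

# Segment the text into a list of words.
words = jieba.lcut(txt)

# Count each word, skipping single-character tokens
# (mostly punctuation, whitespace, and particles).
counts = {}
for word in words:
    if len(word) == 1:
        continue
    counts[word] = counts.get(word, 0) + 1

# Sort by frequency, highest first, and print the top 20.
# Slicing avoids an IndexError when there are fewer than 20 distinct words.
items = sorted(counts.items(), key=lambda x: x[1], reverse=True)
for word, count in items[:20]:
    print("{0:<10}{1:>5}".format(word, count))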
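
The counting and sorting steps can also be written with collections.Counter from the standard library. This is only a sketch of an alternative, not part of the original script: the helper name top_words and its parameters are hypothetical, and it takes an already segmented word list so it can be tried without the text file.

```python
from collections import Counter


def top_words(words, n=20):
    """Return the n most frequent words, skipping single-character tokens.

    `top_words` is a hypothetical helper introduced for illustration;
    `words` is any list of tokens, e.g. the result of jieba.lcut(txt).
    """
    counts = Counter(w for w in words if len(w) > 1)
    # most_common sorts by count, highest first.
    return counts.most_common(n)


if __name__ == "__main__":
    sample = ["聊斋", "的", "聊斋", "书生", "的"]
    for word, count in top_words(sample, 2):
        print("{0:<10}{1:>5}".format(word, count))
```

With the script above, the call would be top_words(jieba.lcut(txt)), and the printing loop stays the same.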

posted on 2020-11-15 11:59 by 汤圆喵喵
