随笔分类 - Python
摘要:文本处理工具 - TextBlob 文本处理工具 - TextBlob 文本处理工具 - TextBlob 文本处理工具 - TextBlob TextBlob基本介绍 TextBlob是一个用Python编写的开源的文本处理库。它可以用来执行很多自然语言处理的任务,比如,词性标注,名词性成分提取,
阅读全文
摘要:https://towardsdatascience.com/how-to-extract-keywords-from-pdfs-and-arrange-in-order-of-their-weights-using-python-841556083341 Problem Statement - G
阅读全文
摘要:1)查看DataFrame数据及属性 1 2 3 4 5 6 7 8 9 10 11 2)使用DataFrame选择数据: 1 2 3 4 3)使用DataFrame重置数据: 1 4)使用DataFrame筛选数据(类似SQL中的WHERE): 1 2 3 5)使用DataFrame模糊筛选数据(
阅读全文
摘要:import pymysql from pandas import DataFrame import pandas as pd import matplotlib.pylab as pyl conn = pymysql.connect(host="127.0.0.1", user="root", passwd="wangmianny111", db="galaxy_macau_ad",chars...
阅读全文
摘要:import jiebaimport numpy as np#打开词典文件,返回列表def open_dict(Dict = 'mini', path=r'/Users/apple888/PycharmProjects/Textming/Sent_Dict/Hownet/'): path = pat
阅读全文
摘要:#Author:Mini#!/usr/bin/env pythonimport numpy as nimport pymysqlimport pandas as pdconn = pymysql.connect(host="127.0.0.1", user="root", passwd="wangm
阅读全文
摘要:#Author:Mini#!/usr/bin/env pythonimport jiebaimport jieba.possegsentence=""jieba.load_userdict("C:/Users/Administrator/Desktop/tripadvisor_gm/tripadvi
阅读全文
摘要:#Author:Mini#!/usr/bin/env pythonimport jiebaimport numpy as nimport pymysqlconn = pymysql.connect(host="127.0.0.1", user="root", passwd="wangmianny11
阅读全文
摘要:synonyms.txt: 北京,首都,京城,北平城,故都******************************************************#Author:Mini#!/usr/bin/env pythonimport jiebacombine_dict = {}for l
阅读全文
摘要:a b c d 类 p(a)>p(b)>p(c)>p(d) m 属于 a 类
阅读全文
摘要:from numpy import *import operatorfrom os import listdir#从列方向扩展#tile(a,(size,1))def knn(k,testdata,traindata,labels): traindatasize=traindata.shape[0]
阅读全文
摘要:THEN reboot(repare the VMware) Shell: sbin/hadoop-deamon.sh start mamennode(shell=.text (commands)) cd
阅读全文
摘要:#tf-idf (term frequency inverse document frequency) 1:读取文档 2:分词 3:对文档整理成所需格式 4:计算词频 5:对词频低的词语进行过滤 6:通过语料库建立成词典 7:加载要计算对比的文档 8:将要对比文档转化为系数向量(doc2bow) 9
阅读全文
摘要:恢复内容开始 1. observe accoding to the purpose of analysis 2. decide a model of specific algorithm 3. clear the steps 4. write the codes classify algorithm
阅读全文
摘要:抽象的组织 数据分析处理 分类,聚类,关联,回归
阅读全文
摘要:import urllib.requestdata=urllib.request.urlopen("http://127.0.0.1/txt1.txt").read().decode("utf-8","ignore")word10=jieba.analyse.extract_tags(data,20
阅读全文
摘要:import win32api import win32con import win32gui from ctypes import * import time VK_CODE = { 'backspace':0x08, 'tab':0x09, 'clear':0x0C, 'enter':0x0D,
阅读全文
摘要:import numpy as n[] import pandas as pd conn= sql1=select * from table data=pd.read_sql(sql1,conn) print(data.describe()) #cleaning missing numbers da
阅读全文