07 2017 档案
摘要:import pandas as pd import numpy as np data = pd.read_csv(r'data.csv') train = data.iloc[:,0:4] #计算不同样本之间的欧几里得距离, #如果不同样本数据的刻度不一致,要对数据进行规格化处理 def nearest(traini,center): distance = np.zeros((len...
阅读全文
摘要:写了识别手写数字的KNN算法,如下图所示。参考链接http://blog.csdn.net/april_newnew/article/details/44176059。 注:训练数据集有2,210条记录,测试数据有670条。准确率并不高,只有0.45。目前不知道为什么,以后多学习,争取优化代码。
阅读全文
摘要:# -*- coding: utf-8 -*- import pandas as pd import numpy as np data = pd.read_csv(r'data.csv') train = data.ix[0:4000,:] Y = list(train.columns)[-1] C = lis...
阅读全文
摘要:# -*- coding: utf-8 -*- #导入数据 import pandas as pd data = pd.read_csv(r'E:\Python\machine learning\own\decision_tree\test.csv') X = data.ix[:,0:4].valu
阅读全文
摘要:# -*- coding: utf-8 -*- #计算各个属性各个值的嫡 import numpy as np def H(tdata): n = tdata.shape[1] -1 C = tdata.ix[:,n] result = 0 counts = list(C.value_counts()) for i in range(len(counts)...
阅读全文

浙公网安备 33010602011771号