何美玲

2018年12月17日

摘要：二、中文文本分类import numpy as np #导入数组numpy库import ospath='C:\\Users\\Administrator\\Desktop\\data1' import jieba #导入结巴库with open(r'C:\Users\Administrator\D 阅读全文

posted @ 2018-12-17 18:40 何美玲阅读(280) 评论(0) 推荐(0)

2018年12月13日

期末大作业，最后一周上课检查

摘要： #读取数据 from sklearn.datasets import load_boston #导入房价数据集 boston=load_boston() boston.data boston.target boston.data.shape #将数据转化为数据框形式 import pandas as pd df=pd.DataFrame(boston.data) df #建立变... 阅读全文

posted @ 2018-12-13 11:57 何美玲阅读(201) 评论(0) 推荐(0)

2018年12月6日

回归模型与房价预测

摘要： import matplotlib.pyplot as pltx=boston.data[:,12].reshape(-1,1)y=boston.targetplt.figure(figsize=(10,6))plt.scatter(x,y) from sklearn.linear_model im 阅读全文

posted @ 2018-12-06 10:23 何美玲阅读(563) 评论(0) 推荐(0)

2018年12月1日

朴素贝叶斯应用：垃圾邮件分类(更新)

摘要： #读取数据集 import csv file_path=r'jiangnan.txt' sms=open(file_path,'r',encoding='utf-8') sms_data=[] sms_label=[] text=csv.reader(sms,delimiter='\t') text #预处理 def preprocessing(text): #text=text.de... 阅读全文

posted @ 2018-12-01 21:06 何美玲阅读(329) 评论(1) 推荐(0)

2018年11月29日

朴素贝叶斯应用：垃圾邮件分类

摘要： #读取数据集 import csv file_path=r'C:\Users\Administrator\Desktop\江南.txt' sms=open(file_path,'r',encoding='utf-8') text=csv.reader(sms,delimiter='\t') text #预处理 def preprocessing(text): #text=text.de... 阅读全文

posted @ 2018-11-29 11:53 何美玲阅读(161) 评论(0) 推荐(0)

2018年11月22日

sklearn中的朴素贝叶斯模型及其应用

摘要： from sklearn.datasets import load_iris iris=load_iris() from sklearn.naive_bayes import GaussianNB gnb=GaussianNB() #模型 pred=gnb.fit(iris.data,iris.ta 阅读全文

posted @ 2018-11-22 11:54 何美玲阅读(242) 评论(0) 推荐(0)

2018年11月16日

分类与监督学习，朴素贝叶斯分类算法

摘要： from sklearn.datasets import load_irisiris=load_iris()from sklearn.naive_bayes import GaussianNBgnb=GaussianNB() #模型gnb.fit(iris.data,iris.target) #训练阅读全文

posted @ 2018-11-16 21:06 何美玲阅读(184) 评论(0) 推荐(0)

2018年11月4日

K-means算法应用：图片压缩

摘要：理解贝叶斯定理： M桶：7红3黄 N桶：1红9黄现在：拿出了一个红球试问：这个红球是M、N桶拿出来的概率分别是多少？阅读全文

posted @ 2018-11-04 00:00 何美玲阅读(761) 评论(0) 推荐(0)

2018年10月28日

聚类--K均值算法：自主实现与sklearn.cluster.KMeans调用

摘要： import numpy as np x=np.random.randint(1,100,[20,1]) #在1-100范围中产生20行1列的随机数组 y=np.zeros(20) k=3 #取数据中k个对象为初始中心,每个对象代表一个聚类中心 def initcenter(x,k): return x[:k] kc=initcenter(x,k)... 阅读全文

posted @ 2018-10-28 12:06 何美玲阅读(670) 评论(0) 推荐(0)

2018年10月20日

numpy 统计分布显示

摘要： import numpy as np #导入numpy数组并命名np from sklearn.datasets import load_iris #导入鸢尾花数据集 data=load_iris() iris_len=data.data[:,2] print('鸢尾花数组：',iris_len) print('最大值:',np.max(iris_len)) #计算鸢尾花长度的最大... 阅读全文

posted @ 2018-10-20 11:33 何美玲阅读(198) 评论(0) 推荐(0)

公告