04 2018 档案

摘要:# 导入必备的包 # 本文爬取的是顶点小说中的完美世界为列。文中的aa.text,bb.text为自己创建的text文件 import requests from bs4 import BeautifulSoup # 爬取目标url url = 'https://www.x23us.com/html/42/42377/' headers = { 'User-Agent': 'Mozi... 阅读全文
posted @ 2018-04-28 14:27 python赵小弟 阅读(1664) 评论(1) 推荐(0)
摘要:import pandas as pd import os os.chdir(u'E:\内网通得东西\练习4') #参数初始化 filename = 'bankloan.xls' data = pd.read_excel(filename) x = data.iloc[:,:8].as_matrix() y = data.iloc[:,8].as_matrix() from sklearn.... 阅读全文
posted @ 2018-04-21 09:27 python赵小弟 阅读(1852) 评论(0) 推荐(0)
摘要:import os import pandas as pd from sklearn.cross_validation import train_test_split from sklearn import tree from sklearn import metrics infile = 'sales_data.xls' os.chdir('E:\pycharm\machine learni... 阅读全文
posted @ 2018-04-21 09:26 python赵小弟 阅读(162) 评论(0) 推荐(0)
摘要:#encoding=utf-8 from __future__ import unicode_literals import sys sys.path.append("../") import jieba import jieba.posseg import jieba.analyse print('='*40) print('1. 分词') print('-'*40) seg_list ... 阅读全文
posted @ 2018-04-21 09:21 python赵小弟 阅读(257) 评论(0) 推荐(0)
摘要:from sklearn import datasets import pandas as pd import numpy as np from sklearn.cross_validation import train_test_split from sklearn.preprocessing import StandardScaler from sklearn.linear_model im... 阅读全文
posted @ 2018-04-21 09:18 python赵小弟 阅读(2042) 评论(0) 推荐(0)
摘要:上面的为最终结果import requests import re import xlwt import json # 导入必须的包: xlwt,json,requests,re. headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Ch... 阅读全文
posted @ 2018-04-16 20:35 python赵小弟 阅读(5905) 评论(0) 推荐(1)