Python Data Analysis - Post Category (Page 3) - OTAKU_nicole

AttributeError: module 'pandas' has no attribute 'scatter_matrix'

Summary: pd.scatter_matrix(trans_data,diagonal='kde',color='k',alpha=0.3) raises this error; change it to pd.plotting.scatter_matrix(trans_data,diagonal='kde',color='k',alpha=0.3).

posted @ 2021-09-01 14:19 OTAKU_nicole Views (323) Comments (0) Recommended (0)
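
A minimal sketch of the fix above. The contents of trans_data are not shown in the post, so random stand-in data is assumed here, and the Agg backend is chosen only so the example runs without a display. The post's diagonal='kde' additionally requires SciPy, so this sketch uses diagonal='hist' to stay self-contained.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: headless backend so no window is needed
import numpy as np
import pandas as pd

# Stand-in for the post's trans_data (hypothetical values).
rng = np.random.default_rng(0)
trans_data = pd.DataFrame(rng.standard_normal((50, 3)), columns=["a", "b", "c"])

# scatter_matrix is reached through the pandas.plotting namespace.
axes = pd.plotting.scatter_matrix(trans_data, diagonal="hist", color="k", alpha=0.3)

# One subplot per pair of columns.
print(axes.shape)
```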

AttributeError: 'Rectangle' object has no property 'normed'

Summary: values.hist(bins=100,alpha=0.3,color='g',normed=True) raises this error; change normed=True to density=True: values.hist(bins=100,alpha=0.3,color='g',density=True).

posted @ 2021-09-01 11:35 OTAKU_nicole Views (1508) Comments (0) Recommended (0)
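
The replacement can be sketched as follows. The variable values is assumed to be a pandas Series (the post does not show how it was built), and the Agg backend is only there so the example runs headless.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: headless backend
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# Stand-in data; the original `values` is not shown in the post.
values = pd.Series(np.random.default_rng(0).standard_normal(1000))

fig, ax = plt.subplots()
# density=True normalizes the histogram so that it integrates to 1.
values.hist(bins=100, alpha=0.3, color="g", density=True, ax=ax)

# Sum of bar areas (height * width) over all histogram bars.
area = sum(p.get_height() * p.get_width() for p in ax.patches)
print(round(area, 6))
```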

Chinese titles in matplotlib

Summary:
fig, ax = plt.subplots()
ax.plot(2, 3)
plt.rcParams['font.sans-serif'] = ['SimHei']  # display Chinese text correctly
ax.set_title('中文标题')
plt.show ...

posted @ 2021-08-31 11:39 OTAKU_nicole Views (305) Comments (0) Recommended (0)
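
A self-contained version of the idea above, as a sketch. It assumes the SimHei font is installed on the machine; if it is not, matplotlib falls back to another font and the Chinese glyphs may not render. The axes.unicode_minus line is an optional companion setting that is not part of the post.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: headless backend
import matplotlib.pyplot as plt

# Set the font before creating any text so the title picks it up.
plt.rcParams["font.sans-serif"] = ["SimHei"]  # assumes SimHei is installed
plt.rcParams["axes.unicode_minus"] = False    # optional: avoid broken minus signs with CJK fonts

fig, ax = plt.subplots()
ax.plot([1, 2, 3], [2, 3, 5])
ax.set_title("中文标题")  # "Chinese title"
print(ax.get_title())
```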

Regular expressions: Python for Data Analysis notes

Summary:
import re
# the regex that describes one or more whitespace characters is \s+
text = "foo bar\t baz \tqux"
regex = re.compile(r'\s+')
print(regex.split(text))
# equivalent to re.split(r'\s+', text)
# ['foo', ' ...

posted @ 2021-08-30 10:25 OTAKU_nicole Views (56) Comments (0) Recommended (0)
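
The splitting described above, written out as a runnable sketch. The findall call is an addition for illustration and is not in the visible part of the summary.

```python
import re

text = "foo bar\t baz \tqux"

# A raw string avoids Python treating "\s" as a string escape.
regex = re.compile(r"\s+")

parts = regex.split(text)    # split on runs of whitespace
found = regex.findall(text)  # the whitespace runs themselves

print(parts)  # ['foo', 'bar', 'baz', 'qux']
```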

Computing indicator/dummy variables

Summary:
from pandas import DataFrame,Series
import pandas as pd
import numpy as np
# if a column of a DataFrame contains K distinct values, a K-column matrix can be derived from it
df = DataFrame({'key':['b','b','a' ...

posted @ 2021-08-27 17:29 OTAKU_nicole Views (160) Comments (0) Recommended (0)
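
One way to derive the K-column indicator matrix is pd.get_dummies. The summary is cut off, so the DataFrame below uses illustrative values rather than the post's exact data, and the key_ prefix is a choice made for this sketch.

```python
import pandas as pd

# Illustrative data (hypothetical; the post's full DataFrame is truncated).
df = pd.DataFrame({"key": ["b", "b", "a", "c", "a", "b"], "data1": range(6)})

# Three distinct values in "key" give three indicator columns.
dummies = pd.get_dummies(df["key"], prefix="key", dtype=int)

# Attach the indicator columns to the remaining data.
df_with_dummy = df[["data1"]].join(dummies)
print(df_with_dummy)
```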

ParserWarning: Falling back to the 'python' engine because the 'c' engine does not support regex separators

Summary:
pd.read_table('movies.dat', sep='::')
ParserWarning: Falling back to the 'python' engine because the 'c' engine does not support regex separators (sep ...

posted @ 2021-08-26 17:58 OTAKU_nicole Views (1946) Comments (0) Recommended (0)
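
The summary is cut off before the fix. A common way to avoid this warning is to request the Python engine explicitly with engine='python', sketched below. The sample rows and the column names are made up for illustration and do not come from movies.dat.

```python
import io
import warnings

import pandas as pd

# Hypothetical rows in a '::'-separated layout.
raw = "1::Movie A (1995)::Comedy\n2::Movie B (1995)::Adventure\n"

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    movies = pd.read_table(
        io.StringIO(raw),
        sep="::",                                # multi-character separator
        header=None,
        names=["movie_id", "title", "genres"],  # hypothetical column names
        engine="python",                         # explicit engine, so no fallback warning
    )

parser_warnings = [w for w in caught if issubclass(w.category, pd.errors.ParserWarning)]
print(movies)
```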

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 3114: invalid continuation byte

Summary: for pd.read_table('movies.dat', sep='::'), adding encoding='ISO-8859-1' resolves the error: pd.read_table('movies.dat', sep='::',encoding='ISO-8859-1').

posted @ 2021-08-26 17:39 OTAKU_nicole Views (915) Comments (0) Recommended (0)
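
A small reproduction of the situation above, as a sketch. It writes a temporary file containing the byte 0xe9 (the ISO-8859-1 encoding of "é"), which is not valid UTF-8 in that position. The file name and its rows are hypothetical.

```python
import os
import tempfile

import pandas as pd

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "movies_sample.dat")  # hypothetical file
    with open(path, "wb") as f:
        f.write("1::Café Movie (1995)::Drama\n2::Plain Movie (1996)::Comedy\n".encode("ISO-8859-1"))

    # Default decoding is UTF-8, which cannot decode the 0xe9 byte here.
    try:
        pd.read_table(path, sep="::", header=None, engine="python")
        utf8_failed = False
    except UnicodeDecodeError:
        utf8_failed = True

    # Declaring the actual encoding lets the file load.
    movies = pd.read_table(path, sep="::", header=None, engine="python",
                           encoding="ISO-8859-1")

print(utf8_failed)
print(movies)
```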

Random sampling of a DataFrame

Summary:
from pandas import DataFrame,Series
import pandas as pd
import numpy as np
# numpy.random.permutation can be used to permute (randomly reorder) a Series or the rows of a DataFrame
df = DataFrame(np.aran ...

posted @ 2021-07-01 16:34 OTAKU_nicole Views (290) Comments (0) Recommended (0)
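
A sketch of the permutation approach. The DataFrame shape is an assumption because the summary is truncated, and the DataFrame.sample call at the end is an alternative that is not mentioned in the visible text.

```python
import numpy as np
import pandas as pd

# Assumed shape; the post's DataFrame definition is cut off.
df = pd.DataFrame(np.arange(5 * 4).reshape((5, 4)))

# A random ordering of the row positions 0..len(df)-1.
sampler = np.random.permutation(len(df))
shuffled = df.take(sampler)

# Taking only the first k positions gives a sample without replacement.
subset = df.take(np.random.permutation(len(df))[:3])

# Built-in alternative with a fixed seed.
sampled = df.sample(n=3, random_state=0)
print(shuffled)
```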

Detecting and filtering outliers

Summary:
from pandas import DataFrame,Series
import pandas as pd
import numpy as np
np.random.seed(12345)
data = DataFrame(np.random.randn(1000,4))
# find the values in a column whose absolute value is larger ...

posted @ 2021-07-01 16:16 OTAKU_nicole Views (111) Comments (0) Recommended (0)
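
The summary stops before the threshold is stated. The sketch below assumes a threshold of 3 and column 3, both chosen for illustration, and uses clip to cap the values, which may differ from what the full post does.

```python
import numpy as np
import pandas as pd

np.random.seed(12345)
data = pd.DataFrame(np.random.randn(1000, 4))

threshold = 3  # assumed threshold

# Values in one column whose absolute value exceeds the threshold.
col = data[3]
outlier_values = col[col.abs() > threshold]

# Rows where any column exceeds the threshold.
rows_any = data[(data.abs() > threshold).any(axis=1)]

# Cap every value to the interval [-threshold, threshold].
capped = data.clip(lower=-threshold, upper=threshold)
print(capped.describe())
```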

Discretization and binning

Summary:
from pandas import DataFrame,Series
import pandas as pd
import numpy as np
ages = [20, 22, 25, 27, 21, 23, 37, 31, 61, 45, 41, 32]
bins = [18,25,35,60 ...

posted @ 2021-07-01 16:00 OTAKU_nicole Views (80) Comments (0) Recommended (0)
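
Binning of this kind is usually done with pd.cut, sketched below. The bins list is truncated in the summary, so the final edge of 100 is an assumption, and the group labels are hypothetical.

```python
import numpy as np
import pandas as pd

ages = [20, 22, 25, 27, 21, 23, 37, 31, 61, 45, 41, 32]
bins = [18, 25, 35, 60, 100]  # the last edge (100) is assumed

# Each age falls into a right-closed interval such as (18, 25].
cats = pd.cut(ages, bins)
counts = np.bincount(cats.codes)  # number of ages per bin, in bin order

# The same binning with hypothetical readable labels.
labeled = pd.cut(ages, bins, labels=["Youth", "YoungAdult", "MiddleAged", "Senior"])
print(counts)
```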

AttributeError: 'Categorical' object has no attribute 'levels'

Summary: the Categorical.levels attribute is no longer available; use Categorical.categories instead.

posted @ 2021-07-01 15:31 OTAKU_nicole Views (314) Comments (0) Recommended (0)

AttributeError: 'Categorical' object has no attribute 'labels'

Summary: the Categorical.labels attribute is no longer available; use Categorical.codes instead.

posted @ 2021-07-01 15:30 OTAKU_nicole Views (672) Comments (0) Recommended (0)
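
Both replacement attributes from this entry and the previous one can be seen on the result of pd.cut. The input values below are illustrative only.

```python
import pandas as pd

# Illustrative values and bin edges.
cats = pd.cut([20, 27, 61], [18, 25, 35, 60, 100])

# categories replaces the old `levels`: the distinct bins.
first_bin = cats.categories[0]

# codes replaces the old `labels`: the integer bin position of each value.
codes = list(cats.codes)

print(cats.categories)
print(codes)
```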

Renaming axis indexes

Summary:
from pandas import DataFrame,Series
import pandas as pd
import numpy as np
data = DataFrame(np.arange(12).reshape((3,4)),
                 index=["Aa","Bb","Cc"],
                 colu ...

posted @ 2021-07-01 15:17 OTAKU_nicole Views (61) Comments (0) Recommended (0)
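
A sketch of the usual renaming options. The column names are cut off in the summary, so the ones used here are hypothetical.

```python
import numpy as np
import pandas as pd

data = pd.DataFrame(
    np.arange(12).reshape((3, 4)),
    index=["Aa", "Bb", "Cc"],
    columns=["one", "two", "three", "four"],  # hypothetical column names
)

# Modify the index in place by mapping a function over it.
data.index = data.index.map(str.upper)

# rename returns a new object and leaves the original untouched.
renamed = data.rename(index=str.lower, columns=str.title)

# A dict renames only the listed labels.
partial = data.rename(index={"AA": "first"}, columns={"three": "third"})
print(partial)
```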

Replacing values with replace

Summary:
from pandas import Series
import numpy as np
data = Series([1,-999,2,-999,-1000,3])
print(data)
'''
0       1
1    -999
2       2
3    -999
4   -1000
5       3
dtype: int64
'''
...

posted @ 2021-07-01 15:03 OTAKU_nicole Views (109) Comments (0) Recommended (0)
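
Continuing from the Series above, a sketch of three common replace forms. Treating -999 and -1000 as sentinel values for missing data is an assumption about the post's intent.

```python
import numpy as np
import pandas as pd

data = pd.Series([1, -999, 2, -999, -1000, 3])

# Replace a single sentinel value with NaN.
single = data.replace(-999, np.nan)

# Replace several values at once.
several = data.replace([-999, -1000], np.nan)

# Use a dict to give each value its own replacement.
mapped = data.replace({-999: np.nan, -1000: 0})
print(mapped)
```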

Transforming DataFrame data with a function or mapping: map

Summary:
from pandas import DataFrame,Series
import pandas as pd
import numpy as np
data = DataFrame({'k1':['A']*3+['B']*4,
                  'k2':[1,1,2,3,3,4,4]})
print(data)
...

posted @ 2021-03-15 14:50 OTAKU_nicole Views (673) Comments (0) Recommended (0)
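
A sketch of Series.map with a dict and with a function, using the DataFrame from the summary. The mapping dict and the new column names are hypothetical.

```python
import pandas as pd

data = pd.DataFrame({"k1": ["A"] * 3 + ["B"] * 4,
                     "k2": [1, 1, 2, 3, 3, 4, 4]})

# Hypothetical lookup table keyed by lowercase k1 values.
k1_to_group = {"a": "first", "b": "second"}

# Normalize case, then map each value through the dict.
data["group"] = data["k1"].str.lower().map(k1_to_group)

# map also accepts a function applied element by element.
data["k2_squared"] = data["k2"].map(lambda x: x ** 2)
print(data)
```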

TypeError: drop_duplicates() got an unexpected keyword argument 'take_last'

Summary: change take_last=True to keep='last'.

posted @ 2021-03-15 14:26 OTAKU_nicole Views (518) Comments (0) Recommended (0)
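
A short sketch of the keep parameter. The DataFrame is borrowed from the map entry in this listing and is only assumed to resemble the data in the full post.

```python
import pandas as pd

data = pd.DataFrame({"k1": ["A"] * 3 + ["B"] * 4,
                     "k2": [1, 1, 2, 3, 3, 4, 4]})

# Default: keep the first occurrence of each duplicated row.
first_kept = data.drop_duplicates()

# keep='last' keeps the last occurrence instead.
last_kept = data.drop_duplicates(keep="last")

# Deduplicate on a subset of columns, keeping the last row per k1 value.
last_per_k1 = data.drop_duplicates(["k1"], keep="last")
print(last_per_k1)
```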

Combining DataFrames: patching overlapping data with combine_first

Summary:
from pandas import DataFrame,Series
import numpy as np
a = Series([np.nan,2.5,np.nan,3.5,4.5,np.nan],
           index=['f','e','d','c','b','a'])
print(a)
'''
f ...

posted @ 2021-03-15 10:50 OTAKU_nicole Views (465) Comments (0) Recommended (0)
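
A sketch of combine_first using the Series a from the summary. The second Series b is not visible in the truncated text, so its values here are assumed.

```python
import numpy as np
import pandas as pd

a = pd.Series([np.nan, 2.5, np.nan, 3.5, 4.5, np.nan],
              index=["f", "e", "d", "c", "b", "a"])

# Assumed second Series sharing the same index.
b = pd.Series([0.0, 1.0, 2.0, 3.0, 4.0, np.nan],
              index=["f", "e", "d", "c", "b", "a"])

# Values from a win; missing entries in a are filled from b.
patched = a.combine_first(b)

# Equivalent element-wise selection with NumPy.
manual = pd.Series(np.where(a.isna(), b, a), index=a.index)
print(patched)
```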

Combining DataFrames: concatenating along an axis with concat

Summary:
from pandas import DataFrame,Series
import pandas as pd
import numpy as np
arr = np.arange(12).reshape((3,4))
print(arr)
'''
[[ 0  1  2  3]
 [ 4  5  6  7]
 [ ...

posted @ 2021-03-04 17:08 OTAKU_nicole Views (160) Comments (0) Recommended (0)
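
A sketch of concatenation along an axis, first with NumPy on the array from the summary and then with pandas. The two Series and the keys are hypothetical examples.

```python
import numpy as np
import pandas as pd

arr = np.arange(12).reshape((3, 4))

# NumPy: join two copies side by side (along columns).
wide = np.concatenate([arr, arr], axis=1)

# Hypothetical Series with non-overlapping indexes.
s1 = pd.Series([0, 1], index=["a", "b"])
s2 = pd.Series([2, 3, 4], index=["c", "d", "e"])

stacked = pd.concat([s1, s2])               # one longer Series
side_by_side = pd.concat([s1, s2], axis=1)  # DataFrame, NaN where a label is missing
keyed = pd.concat([s1, s2], keys=["one", "two"])  # hierarchical index
print(side_by_side)
```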

Merging DataFrames on index: join

Summary:
from pandas import DataFrame
left = DataFrame([[1,2],[3,4],[5,6]],index=['a','c','e'],columns=['item1','item2'])
right = DataFrame([[7,8],[9,10],[11,1 ...

posted @ 2021-02-25 15:35 OTAKU_nicole Views (760) Comments (0) Recommended (0)
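
A sketch of index-based joining with the left frame from the summary. The definition of right is cut off, so its values, index, and column names below are assumptions.

```python
import pandas as pd

left = pd.DataFrame([[1, 2], [3, 4], [5, 6]],
                    index=["a", "c", "e"], columns=["item1", "item2"])

# Assumed frame; the post's definition of `right` is truncated.
right = pd.DataFrame([[7, 8], [9, 10], [11, 12]],
                     index=["b", "c", "d"], columns=["item3", "item4"])

# Default is a left join on the index.
joined = left.join(right)

# Other join types keep the union or the intersection of the indexes.
outer = left.join(right, how="outer")
inner = left.join(right, how="inner")
print(outer)
```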

Merging DataFrame datasets: pandas.merge

Summary:
from pandas import DataFrame
import pandas as pd
df1 = DataFrame({'key':['b','b','a','c','a','a','b'],
                 'data1':range(7)})
df2 = DataFrame({'key':['a', ...

posted @ 2021-02-25 14:32 OTAKU_nicole Views (90) Comments (0) Recommended (0)
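
A sketch of key-based merging with df1 from the summary. The definition of df2 is truncated after its first key, so the rest of it is assumed here.

```python
import pandas as pd

df1 = pd.DataFrame({"key": ["b", "b", "a", "c", "a", "a", "b"],
                    "data1": range(7)})

# Assumed contents; only the first key 'a' is visible in the summary.
df2 = pd.DataFrame({"key": ["a", "b", "d"],
                    "data2": range(3)})

# Inner join (the default): only keys present in both frames.
inner = pd.merge(df1, df2, on="key")

# Outer join keeps every key; left join keeps every row of df1.
outer = pd.merge(df1, df2, on="key", how="outer")
left = pd.merge(df1, df2, on="key", how="left")
print(inner)
```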
