python处理csv文件
csv文件就是由逗号隔开的文件格式,可以用excel打开,简洁,读起来比较方便。
如下为本人曾经处理的一个csv文件的一部分:
author,year,journal,pub_typ,title,topic Santosh Devasia,2016,Automatica,article,"Iterative learning control with time-partitioned update for collaborative output tracking. ",17 Frederic Mazenc,2016,Automatica,article,"New control design for bounded backstepping under input delays. ",6 Michael Malisoff,2016,Automatica,article,"New control design for bounded backstepping under input delays. ",6 Alexey Matveev,2016,Automatica,article,"Observation of nonlinear systems via finite capacity channels: Constructive data rate limits. ",0
这里命名该文件名为1.csv,下面为读取该csv的代码:
#-*- coding:UTF-8 -*- import csv with open('1.csv', 'r') as csvFile: dict_reader1 = csv.DictReader(csvFile) for i in dict_reader1: print i['author'] print i['title'].replace('\n','')
----------------------------------------------------------------------------------------------------
上面的情况为csv存储在自己的磁盘,下面为在线的csv文件的处理
#-*- coding:UTF-8 -*- from urllib2 import urlopen from io import StringIO import csv
#去除所有非ASCII的字符 data=urlopen('http://pythonscraping.com/files/MontyPythonAlbums.csv').read().decode('ascii','ignore') dataFile=StringIO(data) # csvreader=csv.reader(dataFile) # for row in csvreader: # print row csvreader=csv.DictReader(dataFile) print (csvreader.fieldnames) for i in csvreader: print i
以上注释掉的为一种方法,没有注释掉的是另一种方法

浙公网安备 33010602011771号