python 面试题 - 随笔分类(第4页) - anobscureretreat

请写出一段python代码实现删除list里面的重复元素？

摘要：l1 = ['b','c','d','c','a','a'] l2 = list(set(l1)) print(l2) 阅读全文

posted @ 2019-07-16 01:12 anobscureretreat 阅读(334) 评论(0) 推荐(0)

给定两个列表，怎么找出他们相同的元素和不同的元素？

摘要：list1 = [1,2,3] list2 = [3,4,5] set1 = set(list1) set2 = set(list2) print(set1 & set2) print(set1 ^ set2) 阅读全文

posted @ 2019-07-16 01:11 anobscureretreat 阅读(539) 评论(0) 推荐(0)

写一个列表生成式，产生一个公差为11的等差数列

摘要：print([x*11 for x in range(10)]) 阅读全文

posted @ 2019-07-16 01:10 anobscureretreat 阅读(665) 评论(0) 推荐(0)

如果对方网站反爬取，封IP了怎么办？

摘要：放慢抓取熟速度，减小对目标网站造成的压力，但是这样会减少单位时间内的数据抓取量使用代理IP（免费的可能不稳定，收费的可能不划算）阅读全文

posted @ 2019-07-16 01:08 anobscureretreat 阅读(338) 评论(0) 推荐(0)

遇到的反爬虫策略以及解决方法?

摘要：通过headers反爬虫：自定义headers，添加网页中的headers数据。基于用户行为的反爬虫(封IP)：可以使用多个代理IP爬取或者将爬取的频率降低。动态网页反爬虫(JS或者Ajax请求数据)：动态网页可以使用 selenium + phantomjs 抓取。对部分数据加密处理(数据乱阅读全文

posted @ 2019-07-16 01:01 anobscureretreat 阅读(749) 评论(0) 推荐(0)

遇到反爬机制怎么处理？

摘要：headers方向判断User-Agent、判断Referer、判断Cookie。将浏览器的headers信息全部添加进去注意：Accept-Encoding；gzip,deflate需要注释掉阅读全文

posted @ 2019-07-16 00:53 anobscureretreat 阅读(426) 评论(0) 推荐(0)

列举网络爬虫所用到的网络数据包，解析包？

摘要：网络数据包 urllib、urllib2、requests 解析包 re、xpath、beautiful soup、lxml 阅读全文

posted @ 2019-07-16 00:51 anobscureretreat 阅读(943) 评论(0) 推荐(0)

python中的关键字yield有什么作用？

摘要：保存当前运行状态，然后暂停执行，即将函数挂起。yield关键字后面表达式的值作为返回值返回。当使用next(),send()函数从断点处继续执行。阅读全文

posted @ 2019-07-16 00:48 anobscureretreat 阅读(497) 评论(0) 推荐(0)

如下代码输出的是什么？

摘要：输出阅读全文

posted @ 2019-07-16 00:29 anobscureretreat 阅读(206) 评论(0) 推荐(0)

如何看到zen of python

摘要：Python之禅import this 阅读全文

posted @ 2019-07-16 00:26 anobscureretreat 阅读(207) 评论(0) 推荐(0)

不用中间变量交换a和b的值？

摘要：输出阅读全文

posted @ 2019-07-16 00:23 anobscureretreat 阅读(284) 评论(0) 推荐(0)

如何用Python删除一个文件？

摘要：删除文件 path，删除时候如果path是一个目录，抛出 OSError错误。 remove() 同 unlink() 的功能是一样的如果remove文件夹就会报错现在删除下面这个文件删除xx.txt os.removedirs(path)，删除文件夹，但是文件夹必须为空。递归地删除目录。阅读全文

posted @ 2019-07-16 00:18 anobscureretreat 阅读(16528) 评论(0) 推荐(0)

Python里面如何实现tuple和list的转换？

摘要：输出阅读全文

posted @ 2019-07-15 21:37 anobscureretreat 阅读(11700) 评论(0) 推荐(0)

filter方法求出列表所有奇数并构造新列

摘要：a = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10] b = filter(lambda x: x % 2 != 0, a) for i in b: print(i) 阅读全文

posted @ 2019-07-15 18:28 anobscureretreat 阅读(548) 评论(0) 推荐(0)

列表推导式求列表所有奇数并构造新列表

摘要：a = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10] b = [i for i in a if i % 2 != 0] print(b) 阅读全文

posted @ 2019-07-15 18:26 anobscureretreat 阅读(1140) 评论(0) 推荐(0)

a=”hello”和b=”世界”编码成bytes类型

摘要：输出阅读全文