正则表达式读取文件中的邮箱和固话
<div class="foot-box"> <p class="map-content-line-footer"></p> <div class="contact-mail"> <div class="mail-box"> <div class="mail-title"> 合作联系 </div> <div class="mail-content"> <span class="mail-content-text"> 市场合作:upmco@baidu.com </span><span class="mail-content-text"> 校园合作:campusmaster@baidu.com </span><span class="mail-content-text"> 战略合作:zhanzhangpingtai@baidu.com </span> </div> </div><div class="mail-box"> <div class="mail-title"> 业务联系 </div> <div class="mail-content"> <span class="mail-content-text"> 百度无线:mbaidu@baidu.com </span><span class="mail-content-text"> 百度推广售后热线:400-921-9999 </span><span class="mail-content-text"> 百度推广销售热线:400-800-8888 </span> </div> </div><div class="mail-box"> <div class="mail-title"> 投诉中心 </div> <div class="mail-content"> <span class="mail-content-text"> 投诉中心网址: http://help.baidu.com </span><span class="mail-content-text"> 职业道德举报邮箱:bdjb@baidu.com </span> </div> </div> </div> </div> </div> >>> with open(r'd:\pyhton\one\contact.txt','r') as fp: ... content = fp.readlines() #返回的列表 >>> with open(r'd:\pyhton\one\contact.txt','r',encoding='utf-8') as fp: ... content = fp.read() #返回的字符串 >>> type(content) <class 'str'>
直接匹配字符串 >>> import re >>> re.findall(r'[a-z]+@baidu.com',content) ['upmco@baidu.com', 'campusmaster@baidu.com', 'zhanzhangpingtai@baidu.com', 'mba idu@baidu.com', 'bdjb@baidu.com'] >>> re.findall(r'\d{3}-\d{3}-\d{4}',content) ['400-921-9999', '400-800-8888'] >>>
javascript
posted on 2020-10-28 20:23 sunny_2016 阅读(138) 评论(0) 编辑 收藏 举报