Using regular expressions to read email addresses and landline numbers from a file

        <div class="foot-box">
          <p class="map-content-line-footer"></p>
          <div class="contact-mail">
            <div class="mail-box">
                <div class="mail-title">
                  Partnership contacts                </div>
                <div class="mail-content">
                  <span class="mail-content-text">
                      Marketing partnerships:upmco@baidu.com                  </span><span class="mail-content-text">
                      Campus partnerships:campusmaster@baidu.com                  </span><span class="mail-content-text">
                      Strategic partnerships:zhanzhangpingtai@baidu.com                  </span>                </div>
              </div><div class="mail-box">
                <div class="mail-title">
                  Business contacts                </div>
                <div class="mail-content">
                  <span class="mail-content-text">
                      Baidu Wireless:mbaidu@baidu.com                  </span><span class="mail-content-text">
                      Baidu Promotion after-sales hotline:400-921-9999                  </span><span class="mail-content-text">
                      Baidu Promotion sales hotline:400-800-8888                  </span>                </div>
              </div><div class="mail-box">
                <div class="mail-title">
                  Complaint center                </div>
                <div class="mail-content">
                  <span class="mail-content-text">
                      Complaint center URL: http://help.baidu.com                  </span><span class="mail-content-text">
                      Professional ethics reporting email:bdjb@baidu.com                  </span>                </div>
              </div>          </div>
        </div>
        </div>

>>> with open(r'd:\pyhton\one\contact.txt','r') as fp:
...     content = fp.readlines()
# readlines() returns a list with one string per line
>>> with open(r'd:\pyhton\one\contact.txt','r',encoding='utf-8') as fp:
...     content = fp.read()
# read() returns the whole file as a single string
>>> type(content)
<class 'str'>
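
The difference between the two calls matters for the matching step, because `re.findall` expects a string and raises a `TypeError` when given a list. Here is a minimal sketch of that difference. It assumes a temporary file with made-up contents instead of the real `contact.txt`, so the file and the names `as_lines` and `as_text` are illustrative only.

```python
import os
import tempfile

# Write a small UTF-8 file so the example does not depend on a real path.
# The contents are placeholder text, not the real contact.txt.
with tempfile.NamedTemporaryFile('w', encoding='utf-8', suffix='.txt',
                                 delete=False) as tmp:
    tmp.write('line one\nline two\n')
    path = tmp.name

with open(path, 'r', encoding='utf-8') as fp:
    as_lines = fp.readlines()   # list of strings, one per line

with open(path, 'r', encoding='utf-8') as fp:
    as_text = fp.read()         # one string holding the whole file

os.remove(path)                 # clean up the temporary file

print(type(as_lines).__name__, as_lines)
print(type(as_text).__name__, repr(as_text))
```

If only the list is available, `''.join(as_lines)` rebuilds the same string that `read()` returns.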
Matching directly against the string:
>>> import re
>>> re.findall(r'[a-z]+@baidu\.com',content)
['upmco@baidu.com', 'campusmaster@baidu.com', 'zhanzhangpingtai@baidu.com', 'mbaidu@baidu.com', 'bdjb@baidu.com']
>>> re.findall(r'\d{3}-\d{3}-\d{4}',content)
['400-921-9999', '400-800-8888']
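
The same two searches can be wrapped in a small script, sketched here under a few assumptions. `SAMPLE` is a shortened stand-in for the file contents, and `EMAIL_RE`, `PHONE_RE` and `extract_contacts` are names introduced only for this illustration. The dot in the email pattern is escaped so that it matches a literal `.` and not any character.

```python
import re

# Hypothetical, shortened stand-in for the contents of contact.txt.
SAMPLE = """
<span class="mail-content-text">upmco@baidu.com</span>
<span class="mail-content-text">mbaidu@baidu.com</span>
<span class="mail-content-text">400-921-9999</span>
<span class="mail-content-text">400-800-8888</span>
"""

# Lowercase local part followed by the literal domain "baidu.com".
EMAIL_RE = re.compile(r'[a-z]+@baidu\.com')
# Three digits, three digits, four digits, separated by hyphens.
PHONE_RE = re.compile(r'\d{3}-\d{3}-\d{4}')


def extract_contacts(text):
    """Return (emails, phones) found in text, in order of appearance."""
    return EMAIL_RE.findall(text), PHONE_RE.findall(text)


emails, phones = extract_contacts(SAMPLE)
print(emails)
print(phones)
```

Both patterns are tailored to this page: the email pattern only accepts lowercase letters before `@baidu.com`, and the number pattern only covers the 3-3-4 hotline layout, so other address or number formats would need different patterns.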

 

posted on 2020-10-28 20:23  sunny_2016
