20.python提取字符串

1.获取指定内容
# 文本

<Server ServerId='0001' xxxx />
<Server ServerId='0002' xxxx />
<Server ServerId='0003' xxxx />
<Server ServerId='0004' xxxx />
<Server ServerId='0005' xxxx />
<Server ServerId='0006' xxxx />
<Server ServerId='0007' xxxx />
<Server ServerId='0008' xxxx />
<Server ServerId='0009' xxxx />
<Server ServerId='0010' xxxx />
# encoding: utf-8
import re

text = '''
<Server ServerId='0001' xxxx />
<Server ServerId='0002' xxxx />
<Server ServerId='0003' xxxx />
<Server ServerId='0004' xxxx />
<Server ServerId='0005' xxxx />
<Server ServerId='0006' xxxx />
<Server ServerId='0007' xxxx />
<Server ServerId='0008' xxxx />
<Server ServerId='0009' xxxx />
<Server ServerId='0010' xxxx />
'''
pattern = re.compile(r"^.*ServerId=\s*'(?P<ServerId>.*?)' .*")
def handle(matchobj):
    return matchobj.group(1)

def get_servers(text):
    servers = []
    for line in text.strip().split('\n'):
        if line.find('ServerId') != -1:
            servers.append(re.sub(pattern, handle, line))
    return servers

if __name__ == "__main__":
    print get_servers(text)

2.获取超链接
posted @ 2018-08-22 16:41  桃源仙居  阅读(242)  评论(0)    收藏  举报