xpath

xpath解析入门

1.xpath 是在xml文档中搜索内容的一门语言 2.html是xml的一个子集

<book>
<id>1</id>
<name>野花遍地香</name>
<price>1.23</price>
<author>
<nick>周大强</nick>
<nick>周芷若</nick>
</author>
</book>

/book/price=1.23 层层递进,靠着节点的关系来查找

 

需要安装lxml模块

pip install lxml

# from lxml import etree
#
# xml = """
# <book></book>
#     <id>1</id>
#     <name>野花遍地香</name>
#     <price>1.23</price>
#     <nick>臭豆腐</nick>
#     <author>
#         <nick id='10086'>周大强</nick>
#         <nick id='10010'>周芷若</nick>
#         <nick class='joy'>周杰伦</nick>
#         <nick class='jolin'>蔡依林</nick>
#         <div>
#             <nick>热热热热热热热1</nick>
#         </div>
#         <span>
#             <nick>热热热热热热热2</nick>
#         </span>
#     </author>
#     <partner>
#         <nick id="ppc">胖胖陈</nick>
#         <nick id="ppbc">胖胖不陈</nick>
#     </partner>
#
# """
# tree = etree.XML(xml)
#
# result = tree.xpath("/book") 表示层级关系,第一个/是根节点
result = tree.xpath("/book/name")
# result = tree.xpath("/book/name/text()") text() 拿文本
# result = tree.xpath("/book/author/nick/text()") ['周大强','周芷若'...]
# result = tree.xpath("/book//nick/text()")   所有


# print(result)
posted @ 2021-06-15 11:32  测试小能手1  阅读(78)  评论(0)    收藏  举报