爬虫xpath

阅读 63

2022-04-13

1 、xpath安装pip install lxml

2、xpath规则

3、xpath部分案列

from lxml import etree

text = """
<div>
<ul>
<li class="item-0"><a href="link1.html">first item</a></li>
....
</ul>
</div>
"""
resp_html = etree.HTML(text)

html = etree.parse('./test.html', etree.HTMLParser())
result = etree.tostring(html)
print(result.decode('utf-8'))
 

精彩评论(0)

0 0 举报