Getting a certain attribute value using XPath
From the following HTML snippet:
<link rel="index" href="/index.php" />
<link rel="contents" href="/getdata.php" />
<link rel="copyright" href="/blabla.php" />
<link rel="shortcut icon" href="/img/all/favicon.ico" />
I'm trying to get the href value of the link tag whose rel value is "shortcut icon", and I'd like to do that using XPath.
How can I do that in Python?
Like this:
from lxml import etree

data = """<link rel="index" href="/index.php" />
<link rel="contents" href="/getdata.php" />
<link rel="copyright" href="/blabla.php" />
<link rel="shortcut icon" href="/img/all/favicon.ico" />
"""

# Parse the fragment with lxml's lenient HTML parser
d = etree.HTML(data)
# Select the href attribute of the <link> whose rel is exactly "shortcut icon"
print(d.xpath('//link[@rel="shortcut icon"]/@href'))
# ['/img/all/favicon.ico']
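
If lxml is not installed, something similar can be sketched with the standard library. This is only a rough alternative, and it assumes the snippet is well-formed XML (every tag closed, as in the question), because xml.etree.ElementTree is not an HTML parser. ElementTree supports only a subset of XPath and cannot select attributes with /@href, so the element is located first and the attribute is read from it. The helper name find_link_href is made up for this example.

```python
import xml.etree.ElementTree as ET

data = """<link rel="index" href="/index.php" />
<link rel="contents" href="/getdata.php" />
<link rel="copyright" href="/blabla.php" />
<link rel="shortcut icon" href="/img/all/favicon.ico" />
"""


def find_link_href(fragment, rel):
    """Hypothetical helper: href of the first <link> with the given rel, or None."""
    # An XML document needs a single root element, so wrap the fragment.
    root = ET.fromstring("<root>" + fragment + "</root>")
    # ElementTree's limited XPath supports [@attrib='value'] predicates.
    # Assumption: rel contains no single quote, since it is pasted into the path.
    link = root.find(f".//link[@rel='{rel}']")
    # Read the attribute from the element; "/@href" is not supported here.
    return link.get("href") if link is not None else None


favicon_href = find_link_href(data, "shortcut icon")
print(favicon_href)  # /img/all/favicon.ico
```

For real-world HTML that is not well-formed, the lxml version above is the safer choice.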