How do I get attribute/option values from XML with Nokogiri?

2023-03-01 04:15 问答作者：

I need to extract the URL from this tag:

<media:content url="http://video.ted.com/talk/podcast/2011/None/MikeMatas_开发者_如何学C2011.mp4" fileSize="15533795" type="video/mp4" />

Currently I use this code but I only get nil values:

page_content = Nokogiri::XML(open("http://www.ted.com/talks/rss"))

page_content.xpath('//item').each {|item|
   @url = course_hash[:videoUrl] = item.at_xpath('[media:content]')['url']
   puts @url
}

The node you are trying to access has a media namespace, so you'll need to take that into account when you try to locate it.

Generally we'd do something like:

require 'nokogiri'

xml = %q{
<xml xmlns:media="http://xml.my.org/file">
 <media:content url="http://video.ted.com/talk/podcast/2011/None/MikeMatas_2011.mp4" fileSize="15533795" type="video/mp4" /> 
</xml>
}

doc = Nokogiri::XML(xml)
doc.search('//media:content', 'media' => 'http://xml.my.org/file').each do |n|
  puts n['url']
end
# >> http://video.ted.com/talk/podcast/2011/None/MikeMatas_2011.mp4

Nokogiri will automatically register the namespace if it is defined in the <xml> tag, meaning we could use a simpler form:

doc.search('//media:content').each do |n|
  puts n['url']
end
# >> http://video.ted.com/talk/podcast/2011/None/MikeMatas_2011.mp4

Nokogiri also supports using CSS accessors with namespaces:

doc.search('media|content').each do |n|
  puts n['url']
end
# >> http://video.ted.com/talk/podcast/2011/None/MikeMatas_2011.mp4

I think your xpath expression is messed up: try using item.at_xpath('media:content')['url'] instead.

继续阅读：nokogiri ruby xml

How do I get attribute/option values from XML with Nokogiri?

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？