RSS escaped HTML

Posted: 2023-02-16 07:13

My understanding of RSS's "escaped HTML" is that something like this:

HTML:

1 &lt; 3

becomes (RSS):

1 &amp;lt; 3
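As a minimal sketch of that escaping step, assuming Python's standard `xml.sax.saxutils` module is an acceptable stand-in for whatever the feed generator actually uses, the HTML fragment is XML-escaped once more when it is placed in the feed:

```python
from xml.sax.saxutils import escape, unescape

# The HTML source for the visible text "1 < 3".
html_fragment = "1 &lt; 3"

# XML-escape the HTML so it can sit inside an RSS <description> element.
rss_text = escape(html_fragment)
print(rss_text)  # 1 &amp;lt; 3

# Undoing the XML layer gives back the original HTML fragment.
round_tripped = unescape(rss_text)
print(round_tripped)  # 1 &lt; 3
```

The point of the sketch is only that the ampersand of an existing entity gets escaped again, so the feed carries two layers of entities.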

So, then, should this:

<img src="http://somehost/someimage?a=foo&amp;b=bar" />

Become:

&lt;img src="http://somehost/someimage?a=foo&amp;amp;b=bar" /&gt;

(Note the doubly escaped &amp;amp;.) If yes, is this then invalid RSS?

<description>
    ...
    &#60;img src="http://d.yimg.com/a/p/ap/20110309/capt.f6...02-0.jpg?x=91&amp;y=130&amp;q=85&amp;sig=6oI7fIgN0izc9olfgY56vw--" />
</description>

(Additionally, is the fact that the closing > isn't escaped bad?)

The problem I'm having with the above <description> is that once you decode the first layer of entities (XML) to arrive at the contents of the <description> tag, you get one long string of character data, which should be HTML. But the <img> then contains a bare &, which is not a valid entity reference. For the massive chunk above, I get something like <img src="....?x=1&y=2" />, which isn't valid HTML.
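The decoding problem described here can be reproduced with a short sketch. It uses Python's standard `xml.etree.ElementTree` parser and a shortened, hypothetical stand-in for the feed item (the `example.com` URL and its query values are assumptions, not the real feed):

```python
import xml.etree.ElementTree as ET

# Shortened, hypothetical stand-in for the <description> quoted above:
# "<" is written as &#60; and each "&" in the URL is escaped only once.
rss = (
    "<description>"
    '&#60;img src="http://example.com/pic.jpg?x=91&amp;y=130" />'
    "</description>"
)

# Decoding the XML layer resolves &#60; and &amp; exactly once.
html_fragment = ET.fromstring(rss).text
print(html_fragment)  # <img src="http://example.com/pic.jpg?x=91&y=130" />

# What is left for the HTML layer contains a bare "&" in the attribute value.
has_bare_ampersand = "&y=" in html_fragment
print(has_bare_ampersand)  # True
```

After one round of XML decoding nothing remains to protect the ampersand, which is why the resulting HTML is not strictly valid.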

Am I just looking at crappy HTML that got shoved into RSS, or am I missing something here?

You can use a CDATA section, so the HTML does not need a second layer of escaping:

<description><![CDATA[ <img src="http://somehost/someimage?a=foo&amp;b=bar" /> ]]>
</description>
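To illustrate, here is a rough sketch (again assuming Python's standard library, not any particular feed tool) that compares a CDATA-wrapped description with a fully entity-escaped one. Both should hand the same HTML to the consumer:

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# Valid HTML: the "&" inside the attribute value is written as &amp;
html_fragment = '<img src="http://somehost/someimage?a=foo&amp;b=bar" />'

# Option 1: escape the HTML a second time for the XML layer.
escaped_form = "<description>" + escape(html_fragment) + "</description>"
print(escaped_form)

# Option 2: wrap the HTML untouched in a CDATA section.
# (This breaks if the HTML itself contains the sequence "]]>".)
cdata_form = "<description><![CDATA[" + html_fragment + "]]></description>"

# After XML parsing, both forms yield the same HTML fragment.
escaped_text = ET.fromstring(escaped_form).text
cdata_text = ET.fromstring(cdata_form).text
print(escaped_text == cdata_text == html_fragment)  # True
```

Either form works as long as it is applied consistently; the CDATA variant is simply easier to read and harder to get half-escaped.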
