HTML Character Encoding

2023-01-28 00:41 问答作者：

When outputting HTML content from a database, some 开发者_高级运维encoded characters are being properly interpreted by the browser while others are not.

For example, %20 properly becomes a space, but %AE does not become the registered trademark symbol.

Am I missing some sort of content encoding specifier?

(note: I cannot realistically change the content to, for example, ® as I do not have control over the input editor's generated markup)

%AE is not valid for HTML safe ASCII, You can view the table here: http://www.ascii.cl/htmlcodes.htm

It looks like you are dealing with Windows Word encoding (windows-1252?? something like that) it really will NOT convert to html safe, unless you do some sort of translation in the middle.

The byte AE is the ISO-8859-1 representation for the registered trademark. If you don't see anything, then apparently the URL decoder is using other charset to URL-decode it. In for example UTF-8, this byte does not represent any valid character.

To fix this, you need to URL-decode it using ISO-8859-1, or to convert the existing data to be URL-encoded using UTF-8.

That said, you should not confuse HTML(XML) encoding like ® with URL encoding like %AE.

The '%20' encoding is URL encoding. It's only useful for URLs, not for displaying HTML.

If you want to display the reg character in an HTML page, you have two options: Either use an HTML entity, or transmit your page as UTF-8.

If you do decide to use the entity code, it's fairly simple to convert them en-masse, since you can use numeric entities; you don't have to use the named entities -- ie use ® rather than &#reg;.

If you need to know entity codes for every character, I find this cheat-sheet very helpful: http://www.evotech.net/blog/2007/04/named-html-entities-in-numeric-order/

What server side language are you using? Check for a URL Decode function.

If you are using php you can use urldecode() but you should be careful about + characters.

继续阅读：character-encoding

HTML Character Encoding

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？