Javascript or regex solution to make markup XHTML compliant

2023-01-14 05:39 问答作者：

I have an inline markup editor built into my website, which should produ开发者_开发知识库ce XHTML compliant markup. But as you can see, it uses the deprecated font tag and size attribute.

<font style="font-family: Courier New; color: rgb(0, 0, 153);" size="2">
   asdfa
   <span style="color: rgb(0, 51, 0);">
    a
    <font size="5">fds</font>
   </span>
</font>

On other browsers, it produces the  instead of 

Is there a Javascript/Regex solution to taking the first set of markup and replacing it with XHTML compliant markup using style attribute and span tag. Thanks in advance!!

(ps. jQuery can be used too)

The markup above is perfectly valid in XHTML 1.0 Transitional.

Whether deprecated elements like  are used are a completely orthogonal issue to whether XHTML or HTML syntax is used. XHTML 1.0 is nothing more or less than a restating of HTML 4.01 in XML syntax: consequently there are Transitional and Strict variants just as there are for HTML 4.

 and  are semantically equally useless. If you want markup to use a set of defined elements and classes that are meaningful in the context of your site, you'll have to hack the editor into using those, instead of being based purely on visual formatting.

You could parse the XHTML and alter it as a later step, to try to make it look better. But regex is not at all an adequate tool to do so, as previously mentioned. You would need an XML parser, then you'd fix up the elements and attributes, then re-serialise it to XHTML. It would be sensible to do this on the server-side, because getting an XML parser on the client-side is slightly tricky, and you will need to do it on the server side anyway if you're going to be cleaning non-whitelisted elements and attributes.

I wouldn't recommend REGEX for that sort of job. (see: the greatest 'Regex to Parse HTML' answer ever!) I know, you're not talking about a full-on parser, but I still think you'd be best off with JavaScript (or which ever back-end language you're using) and a library tailored to parsing html.

You may want to look at the Tidy open source project over on Sourceforge. There's an intro/overview at IBM: "Convert from HTML to XML with HTML Tidy".

Check out CKEDITOR if it's an option to implement an other WYSIWYG Editor in your application.

继续阅读：javascript jquery regex replace xhtml

Javascript or regex solution to make markup XHTML compliant

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？