Java regex for parsing the HTML tag "<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">"

2023-02-05 20:23

I am new to regexps. Can someone help me write a regex for parsing the tag

<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">

with all the possibilities?

To cover "all the possibilities", you really should be using HTML 5's "Determining the character encoding" rules. These can't be expressed as a regular expression.

There is an open source Java implementation of them in validator.nu.

If you insist on using a regular expression, then this will probably cover most cases where the encoding is declared with a meta element (it won't, for instance, cover XML declarations). It is, however, dirty, and it makes some assumptions that are usually (but may not always be) right. I do not recommend it.

/<meta[^>]+charset=['"]?(.*?)['"]?[\/\s>]/i
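Since the question is about Java, here is a rough sketch of how that expression might be used with java.util.regex. The class name MetaCharsetSniffer and the method findCharset are made up for illustration, and the /i flag is assumed to map to Pattern.CASE_INSENSITIVE. All the caveats above still apply.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper; the class and method names are illustrative only.
final class MetaCharsetSniffer {

    // Same expression as above, written as a Java string literal.
    // The trailing /i flag becomes Pattern.CASE_INSENSITIVE, and the
    // forward slash needs no escaping inside a Java character class.
    private static final Pattern META_CHARSET = Pattern.compile(
            "<meta[^>]+charset=['\"]?(.*?)['\"]?[/\\s>]",
            Pattern.CASE_INSENSITIVE);

    // Returns the first charset value found in a meta tag, if any.
    static Optional<String> findCharset(String html) {
        Matcher m = META_CHARSET.matcher(html);
        // find() scans for the pattern anywhere in the input;
        // matches() would require the whole input to match.
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    public static void main(String[] args) {
        String tag = "<meta http-equiv=\"Content-Type\" "
                + "content=\"text/html; charset=ISO-8859-1\">";
        System.out.println(findCharset(tag).orElse("(no charset found)"));
    }
}
```

For the tag in the question this should print ISO-8859-1. The captured name can then be passed to java.nio.charset.Charset.forName, which throws if the name is illegal or unsupported, so it is worth guarding that call.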
