开发者

How to configure nekohtml parser to properly close the anchor tag?

I'm using the nekohtml parser to p开发者_如何转开发arse my html code. Sometime my mistake while using anchor tag the content has been written like this.

<a href="http://abc.com"><a href="http://abc.com">abc</a></a>

After parsing throough the nekohtml i want the content to corrected like this.

<a href="http://abc.com"></a><a href="http://abc.com">abc</a>

For this to achieve please help to configure the nekohtml parsing.

Update:

After i tried with settings as

parser.setFeature( "http://cyberneko.org/html/features/balance-tags", true );

it is of no use. i doesn't give the result as i expected. it returns the same html content as i given


Need to set a balance-tags feature that specifies if the NekoHTML parser should attempt to balance the tags in the parsed document.

config.setFeature( "http://cyberneko.org/html/features/balance-tags", true );

from the docs:

  • Balancing the tags fixes up many common mistakes by adding missing parent elements, automatically closing elements with optional end tags, and correcting unbalanced inline element tags. In order to process HTML documents as XML, this feature should not be turned off. This feature is provided as a performance enhancement for applications that only care about the appearance of specific elements, attributes, and/or content regardless of the document's ill-formed structure.
0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜