Convert links in blockquotes to plain text

2023-03-16 07:58 问答作者：

So, I've been asking a lot of Xpath questions recently. Sorry, but I've only just started using it, and I'm working on a kind of hard project. You see, at the moment I'm parsing HTML like this (not a copy and paste, just an example):

<span id="no153434"></span>
<blockquote>Text here.<br/>More text.<br/>Some more text.</blockquote>

And I'm using

//span[starts-with(@id, 'no')]/following::*[1][name()='blockquote']//node()

To get the text inside. It's working fine, although it's very frustrating. I need to manually check for

then manually combine the strings before and after the br, add a newline, and so on. But it stills works. Until there is a link in the text, that is. Then the code is like this:

<span id="no153434"></span>
<blockquote>Text here.<br/>Text.<br/><font class = "unkfunc"><a href="linkhere" class="link">linkhere</a></font></blockquote>

I have absolutely NO idea where to go from here, as the link is incl开发者_StackOverflowuded as a completely seperate item (twice) in the array. Atleast with the br I knew where it had to be moved to. Really contemplating giving up in this project after all this effort.

You can use this XPath to obtain text inside element: //span[starts-with(@id, 'no')]/following::*[1][name()='blockquote']//text()

So you receive following result:

Text here.
Text.
linkhere

If you want only text nodes and br:

 //span
  [starts-with(@id, 'no')]/
  following::*[1][name()='blockquote']
   //node()
   [ count(.|..//text()) = count(..//text())
     or 
     name()='br'
   ]

returns

Text here.
<br />
Text.
<br />
linkhere

The answer is to not use XPath for this kind of work. Got it working 1,000,000x easier with Objective-C-HTML-Parser.

继续阅读：objective-c

Convert links in blockquotes to plain text

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？