Parse HTML in Adobe AIR

2022-12-15 19:21

I am trying to load and parse HTML in Adobe AIR. The main purpose is to extract the title, meta tags and links. I have been trying HTMLLoader, but I get all sorts of errors, mainly uncaught JavaScript exceptions.

I also tried loading the HTML content directly (using URLLoader) and pushing the text into HTMLLoader (using loadString(...)), but got the same errors. My last resort was to load the text into XML and then use E4X queries or XPath, but no luck there because the HTML is not well formed.

My questions are:

1. Is there a simple and reliable (AIR/ActionScript) DOM component out there? (I do not need to display the page; headless mode will do.)
2. Is there any library to convert (crappy) HTML into well-formed XML so I can use XPath/E4X?
3. Any other suggestions on how to do this?

thx

ActionScript is supposed to be a superset of JavaScript, and thankfully there's the Pure JavaScript/ActionScript HTML Parser, created by JavaScript guru and jQuery creator John Resig :-)

One approach is to run the HTML through HTMLtoXML() then use E4X as you please :)
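
Once the markup is well formed, the E4X side is short. The following is a minimal sketch, assuming the output of HTMLtoXML() is already available as a string. The sample string, the class name and the variable names are made up for illustration, and porting or loading the parser itself is not shown.

```actionscript
package {
    import flash.display.Sprite;

    public class E4XExtractSketch extends Sprite {
        public function E4XExtractSketch() {
            // Hypothetical stand-in for the string returned by HTMLtoXML().
            var wellFormed:String =
                '<html><head><title>Example page</title>' +
                '<meta name="description" content="A sample page"/></head>' +
                '<body><a href="http://example.com/a">A</a>' +
                '<a href="/b">B</a></body></html>';

            var doc:XML = new XML(wellFormed);

            // The ".." operator selects descendants at any depth.
            trace("title: " + doc..title.text());

            // name/content attributes of every <meta> element
            for each (var meta:XML in doc..meta) {
                trace("meta: " + meta.@name + " = " + meta.@content);
            }

            // href attribute of every <a> element
            for each (var link:XML in doc..a) {
                trace("link: " + link.@href);
            }
        }
    }
}
```

If the converted markup declares an XML namespace (as XHTML documents do), the element names in these queries have to be namespace-qualified, or they will match nothing.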

AFAIK:

1. No :-(
2. No :-(
3. I think the easiest way to grab the title and meta tags is to write some regular expressions. You can load the page's HTML code into a string and then read out whatever you need like this:

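var str:String = ""; // put HTML code in here

// [^>]* allows attributes on <title>; [\s\S]*? is non-greedy and spans line breaks
var pattern:RegExp = /<title[^>]*>([\s\S]*?)<\/title>/i;

var match:Object = pattern.exec(str); // null when there is no <title>
trace(match ? match[1] : "no title found");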
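
The same idea can be stretched to meta tags and links by using the global flag and calling exec() in a loop. This is only a rough sketch: the sample HTML, the class name and the attr() helper are made up for illustration, and a regular expression like this will miss unquoted attribute values and can be fooled by markup inside comments or scripts.

```actionscript
package {
    import flash.display.Sprite;

    public class RegexExtractSketch extends Sprite {
        public function RegexExtractSketch() {
            // Hypothetical sample input; in practice this would be the loaded page text.
            var html:String =
                '<html><head>\n' +
                '<meta name="description" content="A sample page">\n' +
                '</head><body><a href="http://example.com/a">A</a>\n' +
                "<a class='x' href='/b'>B</a></body></html>";

            var tag:Object;

            // First pass: find whole <meta ...> tags (g flag lets exec() advance).
            var metaTag:RegExp = /<meta\b[^>]*>/gi;
            while ((tag = metaTag.exec(html)) != null) {
                trace("meta: " + attr(tag[0], "name") + " = " + attr(tag[0], "content"));
            }

            // Same approach for the opening tag of every link.
            var anchor:RegExp = /<a\b[^>]*>/gi;
            while ((tag = anchor.exec(html)) != null) {
                trace("link: " + attr(tag[0], "href"));
            }
        }

        // Hypothetical helper: returns the value of a quoted attribute
        // inside a single tag, or "" if the attribute is absent.
        private function attr(tagText:String, name:String):String {
            var re:RegExp = new RegExp("\\b" + name + "\\s*=\\s*([\"'])(.*?)\\1", "i");
            var m:Object = re.exec(tagText);
            return m ? m[2] : "";
        }
    }
}
```

Matching the whole tag first and pulling attributes out in a second step keeps each pattern simple and does not depend on the order in which the attributes appear.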
