Need help extracting label from HTML page in C#

2023-03-06 19:41 Q&A author:

I want to load one label's value from a remote HTML page. So far I have done that by loading the whole page and then using a regex. I get the desired result, but this method is very slow. I want to quickly load only the label's value, not the whole web page. Any suggestions?

This is what I'm doing at the moment:

// Requires: using System.Net; using System.Text.RegularExpressions;
using (var client = new WebClient())
{
    // Download the entire archived page as a string.
    string result = client.DownloadString("http://web.archive.org/http://profiles.yahoo.com/italy_");
    var regex = new Regex(@"\w+([-+.]\w+)*@\w+([-.]\w+)*\.\w+([-.]\w+)*",
                          RegexOptions.Compiled);
    var s = result;
    foreach (Match email in regex.Matches(s))
    {
        // Console.WriteLine(email.Value);
        // Each match overwrites the label, so it ends up showing the last one.
        label2.Text = email.Value;
    }
}

You must load the whole page; that is how HTTP requests generally work.

Maybe your regex could be improved? Not my area of expertise though, sorry.
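
One possible direction for the regex side, offered as a sketch rather than a definitive fix: if only a single value is needed, `Regex.Match` returns the first hit without enumerating every match. The class and method names below (`EmailFinder`, `FirstEmail`) are made up for illustration. Note that this returns the first address on the page, whereas the loop in the question leaves the last one in the label.

```csharp
using System;
using System.Text.RegularExpressions;

public static class EmailFinder
{
    // Same pattern as in the question. RegexOptions.Compiled is left out here
    // because it adds start-up cost that may not pay off for a single scan.
    private static readonly Regex EmailPattern =
        new Regex(@"\w+([-+.]\w+)*@\w+([-.]\w+)*\.\w+([-.]\w+)*");

    // Returns the first e-mail address found in the text, or null if there is none.
    public static string FirstEmail(string html)
    {
        Match match = EmailPattern.Match(html);
        return match.Success ? match.Value : null;
    }
}
```

With this in place, the loop in the question could shrink to something like `label2.Text = EmailFinder.FirstEmail(result) ?? "";`. This only trims the matching step; the page download is unchanged.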

I found the desired result, but this method is very slow. I want it to quickly load only the label's value, not the whole web page.

Couple of thoughts:

1. Archive.org is usually very slow in my experience. My guess is that's your bottleneck.
2. No, there is not a way to only make a partial request to a third-party page unless they have a response mechanism capable of returning more specific data (for example, a JSON-enabled web service that returns little snippets of HTML used on the page).
3. You will usually have better luck with parsing by loading data into some kind of HTML parser rather than using a regex.
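
To check the guess above that the download, not the regex, is the bottleneck, each step can be timed separately. The following is a minimal sketch using `System.Diagnostics.Stopwatch`; the helper name `StepTimer.Time` is hypothetical and not part of any library.

```csharp
using System;
using System.Diagnostics;

public static class StepTimer
{
    // Runs one step, stores its return value in 'result',
    // and returns how long the step took.
    public static TimeSpan Time<T>(Func<T> step, out T result)
    {
        Stopwatch watch = Stopwatch.StartNew();
        result = step();
        watch.Stop();
        return watch.Elapsed;
    }
}
```

For instance, wrapping the `client.DownloadString(...)` call in one `StepTimer.Time` call and `regex.Matches(page).Count` in another (reading `Count` forces the lazily evaluated match collection to be computed) would show which of the two steps dominates.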
