I'm trying to use Zend_Dom for some very light screen scraping (I want to grab a headline, some body text and a link from a small block of news items on my website) and I'm not sure how to handle th
If you were to convert this HTML content into a small thumbnail image, how would you do it? P.S. I'm trying to do this to allow users on my site to browse through their posts (containing HTML elemen
I need help accomplishing the following: In my web app, users should be able to submit products, including a product image, from a certain product site. They do this by first entering a product URL, fo
Is there any way to use Mechanize with Python 3.x? Or is there a substitute that works in Python 3.x?
I am trying to access a web page using urllib2, and the automatic redirect in urllib2 does not seem to retrieve the entire page.
I need to write a scraper in Java + Groovy. I was wondering if there is something able to parse HTML documents and select the information I need through simple CSS selectors (instead of going through the
I have a function that pulls URLs from various web resources. Needless to say, some are full valid URLs and some are relative, as per the HTML of the page. Below is my ASP.NET/C# logic I derived for ex
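For the relative-versus-absolute URL problem described above, here is a minimal sketch of one common approach: resolving every extracted link against the URL of the page it came from. It is written in Python rather than the asker's C#, and it is not the asker's logic (which is cut off above). The helper name `resolve_links` and the example URLs are hypothetical; `urljoin` is the standard-library call doing the actual work.

```python
from urllib.parse import urljoin


def resolve_links(base_url, hrefs):
    """Resolve each href against base_url.

    Hypothetical helper for illustration. Absolute URLs pass through
    unchanged; relative ones are resolved against the page URL.
    """
    return [urljoin(base_url, href) for href in hrefs]


# Example values are made up for illustration.
links = resolve_links(
    "http://example.com/news/index.html",
    ["/about", "story.html", "http://other.example.org/x"],
)
for link in links:
    print(link)
```

Because resolution is done against the full page URL rather than just the host, a link like `story.html` ends up under `/news/`, which is usually what the page author intended.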
I'm giving BeautifulSoup an HTML document and simply by constructing a BeautifulSoup object instance with the full HTML, it seems to choke on the following line of a jQuery script that's embedded wi
I sat in on part of a Microsoft PDC and heard the presenter talk about the cool new way that .NET 4 and VS2010 allow for multi-threading. The code is smaller, cleaner and simpler, the logic is easie