I\'m parsing html pages to get specific information, but there are some pages that I cant get all the information displayed on the web page, for example in this page
i want scrap groupon.com now my problem is such sites when you load for the first time asks you to join their email service but when you reload the page they directly show you the content of the page.
I am trying to scrape a web page and to recieve the data i need to press a button. This is the source code for the button:
I am working on a Rails 3 project that relies heavily on screen scraping to开发者_开发知识库 collect data mainly using Nokogiri.I\'m aggregating essentially all the same data but I\'m grabbing it from
I\'m writing a Java screen-scraping application for a 3270 mainframe and rather than scroll through page after page of 80x24 chars I\'d like to output all pa开发者_如何学编程ges to a printer and then
I\'m building a basic screen scraper for personal use and learning purposes, so please do not post comments like \"You need to ask permission\" etc.
Have you ever looked for a web API for certain data or functionality, only to find that there isn\'t an API available to meet your needs, or that the APIs that are available are inadequate for some re
Have been trying several days to parse the following html code (notice that there is not a real hierarchal tree structure). Everything is pretty much on the same level.
I have a code that retrieves all the \"place names\" and all the \"addresses\" separately in this link:
I am in the following situation. I am trying to convert a messy scraped html code to a nice and neat xml structure.