Any idea on how to scrape pages which are behind __doPostBack('...');?
I am working on this php base scraper/crawler, which works fine until it get .net generated herf lin开发者_如何转开发k __doPostBack(...), any idea how to deal with this and crawl page behind those links ?
Instead of trying to automate clicking the JavaScript button, which requires additional libraries in PHP, try replicating what request is sent by your browser after clicking the button. There are various firefox extensions that will help you examine the request, such as TamperData, Firebug, and LiveHttp.
精彩评论