I'm parsing an HTML document using lxml, but I only want a small subset of tags. What's the most efficient way to do this?
I'd prefer not to build the entire tree in memory and just pick out the elements I'm looking for.
You could always use PyQuery, a jQuery-like library for quick XML and XHTML manipulation.