I have a directory with about 4500 XML (HTML5) files, and I want to create a "manifest" of their data (essentially title and base/@href).
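One way to build such a manifest is sketched below, using only Python's standard library. It assumes the files are well-formed XML (the XML serialization of HTML5) and that they end in `.xml`. The function names `local_name`, `extract_entry`, and `build_manifest` are made up for this illustration, and the output shape (a list of tuples) is only one possible choice. Tag names are compared without their namespace, so files that declare the XHTML namespace and files that don't are handled the same way.

```python
import sys
import xml.etree.ElementTree as ET
from pathlib import Path


def local_name(tag):
    """Strip a '{namespace}' prefix so XHTML and un-namespaced tags match alike."""
    return tag.rsplit("}", 1)[-1] if isinstance(tag, str) else ""


def extract_entry(path):
    """Return (filename, title, base href) for one file; missing values are ''."""
    title, href = "", ""
    for elem in ET.parse(path).iter():
        name = local_name(elem.tag)
        if name == "title" and not title:
            # Join all text inside <title> and collapse runs of whitespace.
            title = " ".join("".join(elem.itertext()).split())
        elif name == "base" and not href:
            href = elem.get("href", "")
    return Path(path).name, title, href


def build_manifest(directory, pattern="*.xml"):
    """Collect one entry per matching file, skipping files that fail to parse."""
    entries = []
    for path in sorted(Path(directory).glob(pattern)):
        try:
            entries.append(extract_entry(path))
        except ET.ParseError as err:
            # Assumption: a malformed file is reported and skipped, not fatal.
            print(f"skipping {path.name}: {err}", file=sys.stderr)
    return entries
```

Calling `build_manifest("path/to/files")` returns one `(file, title, base_href)` tuple per parsed file, which can then be written out with `csv.writer` or any other format. The sketch takes the first `title` element in document order, which is normally the one in `head`. Files that are not well-formed XML (for example, ones using undeclared entities such as `&nbsp;`) are skipped with a message on stderr.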
I've been exploring the Stack Overflow data dumps and have so far been taking advantage of the friendly XML by “parsing” it with regular expressions. My attempts with various Haskell XML libraries to find th