Converting Websites in HTML/XML into RDF [closed]
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
I want to convert the data on any HTML website to RDF. Do you know of any HTML-to-RDF converter? Or would you recommend using XQuery or GRDDL for this purpose?
In general, an automatic conversion is not possible, since HTML says what something looks like, not what it means. If the HTML already contains semantic markup (RDFa or microformats, for example), you could use Any23, "Anything To Triples" (http://developers.any23.org/), to get RDF out.
If it's just plain HTML, you have to write your own extraction rules. GRDDL would work, but I would probably simply use Python + BeautifulSoup. It depends on what technology/language you know already!
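To give an idea of what hand-written extraction rules can look like, here is a minimal sketch. It is only an illustration under some assumptions: it uses Python's standard-library `html.parser` rather than BeautifulSoup so that it runs without extra packages, the two rules (page title and outgoing links) are arbitrary examples, and mapping them to the Dublin Core terms `title` and `references` is my choice of vocabulary, not something the page itself dictates. The names `LinkAndTitleExtractor` and `html_to_ntriples` are made up for this example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Assumed vocabulary for this sketch: Dublin Core terms.
DC_TITLE = "http://purl.org/dc/terms/title"
DC_REFERENCES = "http://purl.org/dc/terms/references"


class LinkAndTitleExtractor(HTMLParser):
    """Collects the page title and the href of every <a> element."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:  # skip anchors without a target
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def escape_literal(text):
    """Escape backslashes, quotes and line breaks for an N-Triples literal."""
    return (text.replace("\\", "\\\\").replace('"', '\\"')
                .replace("\n", "\\n").replace("\r", "\\r"))


def html_to_ntriples(html, page_url):
    """Apply the two extraction rules and return N-Triples lines.

    Note: link targets are written out as-is; a real converter would
    also have to validate or percent-encode them as IRIs.
    """
    parser = LinkAndTitleExtractor()
    parser.feed(html)
    parser.close()

    triples = []
    title = parser.title.strip()
    if title:
        triples.append(f'<{page_url}> <{DC_TITLE}> "{escape_literal(title)}" .')
    for href in parser.links:
        target = urljoin(page_url, href)  # resolve relative links
        triples.append(f'<{page_url}> <{DC_REFERENCES}> <{target}> .')
    return triples
```

Calling `html_to_ntriples(page_source, "http://example.org/index.html")` returns a list of N-Triples lines that any RDF toolkit should be able to load. The point is that every rule encodes a decision about what a piece of markup means, which is exactly the part that cannot be automated for arbitrary sites.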
If the HTML contains embedded RDFa, then you can use an RDFa parser to extract the information. There are parsers available for various platforms and languages, so the choice depends on your development environment.