How do I convert a document made in Jsoup (the Java html parser) into a string
I have a document that was made in jsoup that looks like this
Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
How do i co开发者_如何学编程nvert that doc
into a string.
Have you tried:
Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
String htmlString = doc.toString();
As Document extends Element it also has got the method html() which "Retrieves the element's inner HTML" according to the API. So that should work:
Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
String htmlString = doc.html();
Additional Info:
Each Document object has got a reference to an instance of the inner class Document.OutputSettings which can be accessed via the method outputSettings() of Document. There you can enable/disable pretty-printing by using the setter prettyPrint(true/false). See the API for Document and Document.OutputSettings for furtherinformation
doc.toString()
works, as does doc.outerHtml()
.
Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
Elements post = doc.select("div.post-content");
String dd = post.toString();
Document ddd = Jsoup.parse(dd);
After parsing the string to document then you can use on it document functions
Elements scriptTag = ddd.getElementsByTag("script");
System.out.println(scriptTag);
精彩评论