开发者

How can I parse XML using Java?

<?xml version="1.0" ?> 
<input>
    <sys>
      <protocol>TL1</protocol> 
      <ipAddress>10.05.2.3</ipAddress>
      <port>2001</port>
      <prompt>agent</prompt>       
       <TL1Command>
           <type>get</type>
           <command_code>...........</command_code>
           <staging_block>      
                <tid>...........</tid>
                <aid>...........</aid>
                <ctag>..........</ctag>
                <gen_block>.....</gen_block>
           </staging_block>
           <payload_block>
                <data_block>.......</data_block>
           </payload_block>
    </TL1Command>
    </sys>
    <sys>
      <protocol>TL1</protocol> 
      <ipAddress>10.5.2.98</ipAddress>
      <port>2001</port>
      <prompt>agent</prompt>       
       <TL1Command>
           <type>get</type>
           <command_code>...........</command_code>
           <staging_block>      
                <tid>...........</tid>
                <aid>...........</aid>
                <ctag>..........</ctag>
                <gen_block>.....</gen_block>
           </staging_block>
           <payload_b开发者_如何学Clock>
                <data_block>.......</data_block>
                <data_block>.......</data_block>
                <data_block>.......</data_block>
           </payload_block>
    </TL1Command>
    </sys>
</input>

I want to know how to parse this XML using Java. Such that I can use that data as it is in the same given way for my program. I know of how to parse it but the problem is for each command there might be different number of data blocks. So after parsing I need to use respective datablocks for respective commands. I mean for first command while retriving I should get only one data block value and for 2nd command 3 data blocks and so on. Please let me know any sample code for solving this issue.


There are two basic approaches to parsing XML.

  1. A cursor (StAX) or event (SAX) based approach. Much more lightweight but often more verbose. This is particularly good when you only want to grab out small parts or the processing is easy; and
  2. Tree-based approaches (DOM). Typically DOM parsers are built on top of SAX/StAX parsers anyway. They require more processing and typically require you to have an entire document in memory before you can do anything (whereas SAX/StAX can be much more efficient and quicker to respond). But this is useful for "random" access to document parts.

Pick whichever is most appropriate for your circumstances.


Simplest way would be to load the document as a DOM Document

Then get what you need using XPath

Document document =  DocumentBuilderFactory.newInstance().newDocumentBuilder().parse(inputStream);

NodeList nodeList = XPathAPI.selectNodeList(document, "/sys");
for (int i = 0; i < nodeList.getLength(); i++) {
    Node node = nodeList.item(i);
    System.out.println(node.getTextContent());
}

Check out an xpath tutorial here.


You can consider Xpath (it falls in the "tree based approach" as listed by cletus in this topic). It's the most handy/easy approach if all you want is just getting the values out of the xml document.

Here are some useful Xpath tutorials:
To learn the syntax: http://www.w3schools.com/Xpath/
To learn how to use in Java: http://www.ibm.com/developerworks/library/x-javaxpathapi.html


Are you simply looking for a library that will allow you to read the xml into an object graph?

see here for a list of many parsers: http://java-source.net/open-source/xml-parsers

A very commonly used library:

dom4j: http://www.dom4j.org/

Another SO question: Fastest XML parser for small, simple documents in Java


use either DOM(Document Object Model)parser or SAX(Simple API for XML)parser to parse your xml document. First create an xml document. (file with .xml extension) download the parser from the link http://archive.apache.org/dist/xml/xerces-j/ and parse your document


As described by cletus, you have to choose between the event based approach or the DOM tree you´ll have to traverse. Consider the event based scenario as some sort of state machine where you, as you enter the element "payload_block", you set a corresponding property and as long as it is set and the events from "data_block" come in, you read them as long as the close event from playload_block comes.

When you traverse the tree, you´ll read the children of "payload_block" and expect a list that you can iterate over and collect your data.


Hard to tell the actual problem. For simplicity's sake I'd build a DOM tree and read the data from that structure.

A typical class for TL1Command then could look like:

class TL1Command {
  String type;
  String commandLine;
  StagingBlock stagingBlock;
  List<DataBlock> dataBlocks;
}

This structure is flexible enough to handle different numbers of data blocks in each TL1Command. (was that the issue?)


I really like(and its just my opinion) the SAX approach when you know well the structure of your files.Here is a link that may help you SAX2 Tutorial


If you only need to get XML snippets into/out of an object graph then you might consider XStream which is a simple lightweight marshalling/unmarshalling library.


You might want to use JAXB


You might also want to know vtd-xml, another open source XML parsing/indexing lib ...


I've written a very simple API for precisely this reason. It uses the DOM parser underneath, but exposes a very simple and easy-to-use API that allows you to get to the XML data really easily. It's just a single Java file that you can use as a library in your code. Hope that helps.

http://argonrain.wordpress.com/2009/10/27/000/

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜