使用StAX读取XML字符串

sdo*_*oca 10 java stax xml-parsing

我第一次使用stax来解析XML String.我找到了一些例子,但无法让我的代码工作.这是我的代码的最新版本:

public class AddressResponseParser
{
    private static final String STATUS = "status";
    private static final String ADDRESS_ID = "address_id";
    private static final String CIVIC_ADDRESS = "civic_address";

    String status = null;
    String addressId = null;
    String civicAddress = null;

    public static AddressResponse parseAddressResponse(String response)
    {

        try
        {
            byte[] byteArray = response.getBytes("UTF-8");
            ByteArrayInputStream inputStream = new ByteArrayInputStream(byteArray);
            XMLInputFactory inputFactory = XMLInputFactory.newInstance();
            XMLStreamReader reader = inputFactory.createXMLStreamReader(inputStream);

            while (reader.hasNext())
            {
                int event = reader.next();

                if (event == XMLStreamConstants.START_ELEMENT)
                {
                    String element = reader.getLocalName();

                    if (element.equals(STATUS))
                    {
                        status = reader.getElementText();
                        continue;
                    }

                    if (element.equals(ADDRESS_ID))
                    {
                        addressId = reader.getText();
                        continue;
                    }

                    if (element.equals(CIVIC_ADDRESS))
                    {
                        civicAddress = reader.getText();
                        continue;
                    }
                }
            }
        }
        catch (Exception e)
        {
            log.error("Couldn't parse AddressResponse", e);
        }
    }
}
Run Code Online (Sandbox Code Playgroud)

我把手表放在"event"和"reader.getElementText()"上.当代码停止时

String element = reader.getLocalName();
Run Code Online (Sandbox Code Playgroud)

显示"reader.getElementText()"值,但一旦离开该行,就无法对其进行评估.当代码停止时:

status = reader.getElementText();
Run Code Online (Sandbox Code Playgroud)

"元素"手表显示正确的值.最后,当我再一步执行代码时,我抓住了这个异常:

(com.ctc.wstx.exc.WstxParsingException) com.ctc.wstx.exc.WstxParsingException: Current state not START_ELEMENT
 at [row,col {unknown-source}]: [1,29]
Run Code Online (Sandbox Code Playgroud)

我尝试过使用status = reader.getText();,但后来我得到了这个例外:

(java.lang.IllegalStateException) java.lang.IllegalStateException: Not a textual event (END_ELEMENT)
Run Code Online (Sandbox Code Playgroud)

谁能指出我做错了什么?

编辑:

添加用于测试的JUnit代码:

public class AddressResponseParserTest
{
    private String status = "OK";
    private String address_id = "123456";
    private String civic_address = "727";

    @Test
    public void testAddressResponseParser() throws UnsupportedEncodingException, XMLStreamException
    {
        AddressResponse parsedResponse = AddressResponseParser.parseAddressResponse(this.responseXML());

        assertEquals(this.status, parsedResponse.getStatus());

        assertEquals(this.address_id, parsedResponse.getAddress()
                .getAddressId());
        assertEquals(this.civic_address, parsedResponse.getAddress()
                .getCivicAddress());
    }

    private String responseXML()
    {
        StringBuffer buffer = new StringBuffer();

        buffer.append("<response>");
        buffer.append("<status>OK</status>");
        buffer.append("<address>");
        buffer.append("<address_id>123456</address_id>");
        buffer.append("<civic_address>727</civic_address>");
        buffer.append("</address>");
        buffer.append("</response>");

        return buffer.toString();
    }
}
Run Code Online (Sandbox Code Playgroud)

sdo*_*oca 9

我找到了一个使用XMLEventReader而不是XMLStreamReader的解决方案:

public MyObject parseXML(String xml)
    throws XMLStreamException, UnsupportedEncodingException
{
    byte[] byteArray = xml.getBytes("UTF-8");
    ByteArrayInputStream inputStream = new ByteArrayInputStream(byteArray);
    XMLInputFactory inputFactory = XMLInputFactory.newInstance();
    XMLEventReader reader = inputFactory.createXMLEventReader(inputStream);

    MyObject object = new MyObject();

    while (reader.hasNext())
    {
        XMLEvent event = (XMLEvent) reader.next();

        if (event.isStartElement())
        {
            StartElement element = event.asStartElement();

            if (element.getName().getLocalPart().equals("ElementOne"))
            {
                event = (XMLEvent) reader.next();

                if (event.isCharacters())
                {
                     String elementOne = event.asCharacters().getData();
                     object.setElementOne(elementOne);
                }
                continue;
            }
            if (element.getName().getLocalPart().equals("ElementTwo"))
            {
                event = (XMLEvent) reader.next();
                if (event.isCharacters())
                {
                     String elementTwo = event.asCharacters().getData();
                     object.setElementTwo(elementTwo);
                }
                continue;
            }
        }
    }

    return object;
}
Run Code Online (Sandbox Code Playgroud)

我仍然有兴趣看到使用XMLStreamReader的解决方案.


Sta*_*Man 5

请确保您阅读了 Stax 的 javadocs:由于它是完全流式解析模式,因此只有当前事件包含的信息可用。然而,也有一些例外;例如,getElementText() 必须从 START_ELEMENT 开始,但随后会尝试组合当前元素内部的所有文本标记;返回时,它将指向匹配的 END_ELEMENT。

相反, START_ELEMENT 上的 getText() 不会返回任何有用的东西(因为 START_ELEMENT 指的是标记,而不是“内部”开始/结束元素对的子文本标记/节点)。如果你想改用它,你必须通过调用 streamReader.next(); 在流中显式移动光标。而 getElementText() 为你做。

那么是什么导致了错误?在您使用完所有开始/结束元素对后,下一个标记将是 END_ELEMENT(匹配父标记的任何内容)。因此,您必须检查获得 END_ELEMENT 的情况,而不是另一个 START_ELEMENT。