Java: How to prevent the 'systemId' in EntityResolver#resolveEntity(String publicId, String systemId) from being absolutized against the current working directory

His*_*His 8 java entityresolver xml-parsing

I want to parse the following XML document and resolve all the entities in it:

 <!DOCTYPE doc SYSTEM 'mydoc.dtd'>
 <doc>&title;</doc>

My EntityResolver is supposed to fetch the external entity with the given system ID from a database and then resolve it, as illustrated below:

 private static class MyEntityResolver implements EntityResolver
 {
    public InputSource resolveEntity(String publicId, String systemId)
        throws SAXException, IOException
    {
        // At this point, systemId is always absolutized to the current working directory, 
        // even though the XML document specified it as relative.
        // E.g. "file:///H:/mydoc.dtd" instead of just "mydoc.dtd"
        // Why???  How can I prevent this???

        SgmlEntity entity = findEntityFromDatabase(systemId);
        InputSource is = new InputSource(new ByteArrayInputStream(entity.getContents()));
        is.setPublicId(publicId);
        is.setSystemId(systemId);
        return is;
    }
 }

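I have tried both DOM (DocumentBuilder) and SAX (XMLReader), with the entity resolver set to MyEntityResolver (i.e. setEntityResolver(new MyEntityResolver())), but the systemId passed to MyEntityResolver#resolveEntity(String publicId, String systemId) is always absolutized against the current working directory.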

I have also tried calling setFeature("http://xml.org/sax/features/resolve-dtd-uris", false), but that did not help.

So how can I achieve what I want?

Thanks!

His*_*His 8

Apparently, there is another interface named EntityResolver2, which extends the old EntityResolver. (Talk about confusing names!)

Anyway, I found that EntityResolver2 does what I want: it does not change the systemId at all, so it is always exactly what was specified in the XML document.
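For illustration, here is a minimal sketch of how that might look with SAX. It assumes the parser honours EntityResolver2 (the JDK's built-in parser does by default, through the `http://xml.org/sax/features/use-entity-resolver2` feature). The names `EntityResolver2Demo`, `RecordingResolver`, and `parseAndCollectSystemIds` are made up for this example, and the hard-coded DTD string is only a stand-in for the database lookup (`findEntityFromDatabase`) from the question.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.ext.DefaultHandler2;

public class EntityResolver2Demo {

    // DefaultHandler2 implements EntityResolver2, so only the
    // four-argument resolveEntity needs to be overridden.
    static class RecordingResolver extends DefaultHandler2 {
        final List<String> seenSystemIds = new ArrayList<>();

        @Override
        public InputSource resolveEntity(String name, String publicId,
                                         String baseURI, String systemId) {
            // systemId arrives as written in the document; baseURI is
            // passed separately instead of being merged into it.
            seenSystemIds.add(systemId);

            // Assumption: a canned DTD stands in for the database lookup.
            String dtd = "<!ELEMENT doc (#PCDATA)>\n<!ENTITY title \"Hello\">";
            InputSource is = new InputSource(new StringReader(dtd));
            is.setPublicId(publicId);
            is.setSystemId(systemId);
            return is;
        }
    }

    // Parses the document from the question and returns every system ID
    // the resolver was asked to resolve.
    static List<String> parseAndCollectSystemIds() throws Exception {
        String xml = "<!DOCTYPE doc SYSTEM 'mydoc.dtd'>\n<doc>&title;</doc>";
        XMLReader reader =
                SAXParserFactory.newInstance().newSAXParser().getXMLReader();
        RecordingResolver resolver = new RecordingResolver();
        reader.setEntityResolver(resolver);
        reader.setContentHandler(resolver);
        reader.parse(new InputSource(new StringReader(xml)));
        return resolver.seenSystemIds;
    }

    public static void main(String[] args) throws Exception {
        System.out.println(parseAndCollectSystemIds());
    }
}
```

The same resolver object can be passed to DocumentBuilder#setEntityResolver as well, although whether the DOM path calls the four-argument method depends on the parser implementation, so it is worth checking with your own setup.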