我有一个包含这样数据的大文件
<releases>
<release>
......
<companies>
<company>
</company>
</companies>
</release>
<release>
......
</releases>
Run Code Online (Sandbox Code Playgroud)
我想生成和输出与输入文件相同的文件,但删除了所有公司元素及其子元素。我正在尝试使用 Stax,我认为它遍历元素,然后如果我得到匹配,我可以只是不写该元素,该部分将被删除。即,如果我扔掉导致其中的元素并被删除的元素,但它似乎只是删除了元素本身而不是其他任何东西,对吗?
IE
<releases>
<release>
......
</release>
<release>
......
</releases>
Run Code Online (Sandbox Code Playgroud)
这是我目前拥有的代码:
XMLInputFactory inputFactory = XMLInputFactory.newInstance();
XMLOutputFactory outputFactory = XMLOutputFactory.newInstance();
InputStream in = new FileInputStream(source);
XMLEventReader reader = inputFactory.createXMLEventReader(in);
OutputStream out = new FileOutputStream(target);
XMLEventWriter writer = outputFactory.createXMLEventWriter(out);
XMLEvent event;
while(reader.hasNext()){
event = reader.nextEvent();
writer.add(event);
if(event.getEventType() == XMLStreamConstants.START_ELEMENT)
{
if(event.asStartElement().getName().toString().equalsIgnoreCase("companies"))
{
System.out.println("Deleting:"+event);
continue;
}
else
{
writer.add(event);
}
}
else
{
writer.add(event);
}
}
Run Code Online (Sandbox Code Playgroud)
我想我现在有了它,Stax 会逐个元素地处理,所以我需要在找到起始元素时进行标记,设置一个变量以指示在到达结束元素之前应该忽略所有后续数据。
代码示例:
import javax.xml.stream.*;
import javax.xml.stream.events.XMLEvent;
import java.io.*;
public class FixDb
{
public static void main(String[] args) throws Exception
{
File source = new File(args[0]);
if(!source.exists())
{
System.out.println("File:"+source+ " does not exist");
}
File target = new File(source+".new");
XMLInputFactory inputFactory = XMLInputFactory.newInstance();
XMLOutputFactory outputFactory = XMLOutputFactory.newInstance();
InputStream in = new FileInputStream(source);
XMLEventReader reader = inputFactory.createXMLEventReader(in);
OutputStream out = new FileOutputStream(target);
XMLEventWriter writer = outputFactory.createXMLEventWriter(out);
XMLEvent event;
boolean deleteSection = false;
while(reader.hasNext())
{
event = reader.nextEvent();
if(event.getEventType() == XMLStreamConstants.START_ELEMENT && event.asStartElement().getName().toString().equalsIgnoreCase("companies"))
{
deleteSection=true;
continue;
}
else if(event.getEventType() == XMLStreamConstants.END_ELEMENT && (event.asEndElement().getName().toString().equalsIgnoreCase("companies")))
{
deleteSection=false;
continue;
}
else if(deleteSection)
{
continue;
}
else
{
writer.add(event);
}
}
}
}
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
2954 次 |
| 最近记录: |