我有以下xml文件
<?xml version="1.0"?>
<!DOCTYPE pathway SYSTEM "http://www.kegg.jp/kegg/xml/KGML_v0.7.1_.dtd">
<pathway name="path:ko01200" org="ko" >
<entry id="1" >
<graphics name="one"
type="circle" />
</entry>
<entry id="7" >
<graphics name="one"
type="rectangle" />
<graphics name="two"
type="rectangle"/>
</entry>
</pathway>
Run Code Online (Sandbox Code Playgroud)
我厌倦了使用xml简单解析它与下面的代码我被卡住,因为其中一个节点有2个图形元素.所以它抱怨.我假设我必须有另一个foreach循环的图形元素,但我不知道如何继续.
use strict;
use warnings;
use XML::Simple;
use Data::Dumper;
my $xml=new XML::Simple;
my $data=$xml->XMLin("file.xml",KeyAttr => ['id']);
print Dumper($data);
foreach my $entry ( keys %{$data->{entry}} ) {
print $data->{entry}->{$entry}->{graphics}->{type}."\n";
}
Run Code Online (Sandbox Code Playgroud)
这是代码结果
$VAR1 = {
'entry' => {
'1' => {
'graphics' => {
'name' => 'one...',
'type' => 'circle'
}
},
'7' => {
'graphics' => [
{
'name' => 'one',
'type' => 'rectangle'
},
{
'name' => 'two',
'type' => 'rectangle'
}
]
}
},
'org' => 'ko',
'name' => 'path:ko01200'
};
circle
Not a HASH reference at stack.pl line 12.
Run Code Online (Sandbox Code Playgroud)
XML::Simple缺乏一致性因为用户需要启用严格模式,所以graphics节点有时是哈希,有时是数组,具体取决于子元素的数量.
for my $entry ( keys %{$data->{entry}} ) {
my $graphics = $data->{entry}{$entry}{graphics};
$graphics = [ $graphics ] if ref $graphics eq "HASH";
print "$_->{type}\n" for @$graphics;
}
Run Code Online (Sandbox Code Playgroud)
有更好的XML解析模块,请检查XML :: LibXML
或者@RobEarl建议使用ForceArray参数:
XMLin("file.xml",KeyAttr => ['id'], ForceArray => [ 'graphics' ]);
Run Code Online (Sandbox Code Playgroud)