我正在提取CSV文件:
"ID","LASTNAME","FIRSTNAME","PERM_ADDR1","PERM_ADDR2","PERM_CITY","PERM_ST","PERM_ZIP","DOB","LIB_TYPE","BARCODE","EMAIL","LOCAL_ADDR1","LOCAL_ADDR2","LOCAL_CITY","LOCAL_ST","LOCAL_ZIP","CAMPUS_ADDR1","CAMPUS_ADDR2","CAMPUS_CITY","CAMPUS_ST","CAMPUS_ZIP","DEPARTMENT","MAJOR"
"123","Lastname","Firstname","123 Home St","","Home City","HS","12345-6789","0101","S","1234567890","last.first@domain.local","123 Local St","","Local City","LS","98765-4321","123 Campus St","","Campus City","CS","54321-6789","IT",""
Run Code Online (Sandbox Code Playgroud)
使用Text::CSV,我试图将其解析为哈希:
my $csv = Text::CSV->new();
chomp(my $line = <READ>);
$csv->column_names(split(/,/, $line));
until (eof(READ)) {
$line = $csv->getline_hr(*READ);
my %linein = %$line;
my %patron;
$patron{'patronid'} = $linein{'ID'};
$patron{'last'} = $linein{'LASTNAME'};
$patron{'first'} = $linein{'FIRSTNAME'};
print p(%linein)."\n";
print p(%patron)."\n";
}
Run Code Online (Sandbox Code Playgroud)
使用此代码,最后的print语句(使用Data::Printer)返回:
{
"BARCODE" 1234567890,
"CAMPUS_ADDR1" "123 Campus St",
"CAMPUS_ADDR2" "",
"CAMPUS_CITY" "Campus City",
"CAMPUS_ST" "CS",
"CAMPUS_ZIP" "54321-6789",
"DEPARTMENT" "IT",
"DOB" 0101,
"EMAIL" "last.first@domain.local",
"FIRSTNAME" "Firstname",
"ID" 123,
"LASTNAME" "Lastname",
"LIB_TYPE" "S",
"LOCAL_ADDR1" "123 Local St",
"LOCAL_ADDR2" "",
"LOCAL_CITY" "Local City",
"LOCAL_ST" "LS",
"LOCAL_ZIP" "98765-4321",
"MAJOR" "",
"PERM_ADDR1" "123 Home St",
"PERM_ADDR2" "",
"PERM_CITY" "Home City",
"PERM_ST" "HS",
"PERM_ZIP" "12345-6789"
}
{
first undef,
last undef,
patronid undef
}
Run Code Online (Sandbox Code Playgroud)
我不明白的是为什么%patron没有填充来自的价值观%linein.我想知道这是否与使用有关Text::CSV,因为我正在解析脚本中的其他文件并且它们工作得很好.但是,这些文件不是CSV,而是固定宽度,因此我手动解析它们.
尝试
$csv->column_names(map {/"(.*)"/ and $1} split(/,/, $line))
Run Code Online (Sandbox Code Playgroud)
代替
$csv->column_names(split(/,/, $line));
Run Code Online (Sandbox Code Playgroud)
您的CSV键被定义为文字字符串
'"LASTNAME"' , '"FIRSTNAME"'
Run Code Online (Sandbox Code Playgroud)
而不仅仅是
'LASTNAME' , 'FIRSTNAME'
Run Code Online (Sandbox Code Playgroud)
Data::Printer在向你展示正在发生的事情时,并没有做太糟糕的工作 - 所有的键p(%linein)都显示为包含双引号作为字符串的一部分,而不是p(%patron)
| 归档时间: |
|
| 查看次数: |
193 次 |
| 最近记录: |