FlatFileItemReader 制表符分隔符不起作用

Tyv*_*ain 4 java spring spring-batch

我从 spring 签出了这个项目: https ://github.com/spring-guides/gs-batch-processing

来源: https: //spring.io/guides/gs/batch-processing/

我将“sample-data.csv”中的“,”替换为“tab”:

Jill    Doe
Joe Doe
Justin  Doe
Jane    Doe
John    Doe
Run Code Online (Sandbox Code Playgroud)

然后我将新的分隔符添加到阅读器:

@Bean
public FlatFileItemReader<Person> reader() {
    return new FlatFileItemReaderBuilder<Person>()
        .name("personItemReader")
        .resource(new ClassPathResource("sample-data.csv"))
        .delimited()
        .delimiter(DelimitedLineTokenizer.DELIMITER_TAB) // NEW DELIMITER
        .names(new String[]{"firstName", "lastName"})
        .fieldSetMapper(new BeanWrapperFieldSetMapper<Person>() {{
            setTargetType(Person.class);
        }})
        .build();
}
Run Code Online (Sandbox Code Playgroud)

当我启动时,我收到此错误:

Caused by: org.springframework.batch.item.file.transform.IncorrectTokenCountException: Incorrect number of tokens found in record: expected 2 actual 1
    at org.springframework.batch.item.file.transform.AbstractLineTokenizer.tokenize(AbstractLineTokenizer.java:142) ~[spring-batch-infrastructure-4.0.1.RELEASE.jar:4.0.1.RELEASE]
    at org.springframework.batch.item.file.mapping.DefaultLineMapper.mapLine(DefaultLineMapper.java:43) ~[spring-batch-infrastructure-4.0.1.RELEASE.jar:4.0.1.RELEASE]
    at org.springframework.batch.item.file.FlatFileItemReader.doRead(FlatFileItemReader.java:180) ~[spring-batch-infrastructure-4.0.1.RELEASE.jar:4.0.1.RELEASE]
    ... 50 common frames omitted
Run Code Online (Sandbox Code Playgroud)

我尝试过使用“@”分隔符 - >它有效。由于某种原因,我无法使其与制表符分隔符一起使用......

当然,在我的实际项目中,我有一个带有“制表符”分隔符的输入文件......

这里有什么解决办法吗?

San*_*ose 6

您不能这样设置制表符分隔符。由于 tab ('\t') 不包含任何实际文本,因此它会被中DelimitedLineTokenizer的静态DelimitedBuilder类忽略FlatFileItemReaderBuilder.java。可以使用您在问题中给出的上述代码来设置任何非空白分隔符。

FlatFileItemReaderBuilder 源代码

这就是LineTokenizer实例的构建方式FlatFileItemReaderBuilder.java

public DelimitedLineTokenizer build() {
        Assert.notNull(this.fieldSetFactory, "A FieldSetFactory is required.");
        Assert.notEmpty(this.names, "A list of field names is required");

        DelimitedLineTokenizer tokenizer = new DelimitedLineTokenizer();

        tokenizer.setNames(this.names.toArray(new String[this.names.size()]));

        // the hasText ignores the tab delimiter.

        if(StringUtils.hasText(this.delimiter)) {
            tokenizer.setDelimiter(this.delimiter);
        }
// more code
Run Code Online (Sandbox Code Playgroud)

因此,要解决此问题,您需要提供显式配置有制表符分隔符的 DelimitedLineTokenizer 类型的 bean。

在 spring 配置文件中使用以下代码来设置制表符分隔符:

@Bean
public FlatFileItemReader<Person> reader() {
    return new FlatFileItemReaderBuilder<Person>().name("personItemReader")
            .resource(new ClassPathResource("sample-data.csv"))
            .lineMapper(lineMapper()).build();
}

@Bean
public DefaultLineMapper<Person> lineMapper(){
      DefaultLineMapper<Person> lineMapper = new DefaultLineMapper<>();
      lineMapper.setLineTokenizer(lineTokenizer());
      lineMapper.setFieldSetMapper(new BeanWrapperFieldSetMapper<Person>() {
                {
                    setTargetType(Person.class);
                }
            });
      return lineMapper;
}

@Bean
public DelimitedLineTokenizer lineTokenizer() {
    DelimitedLineTokenizer tokenizer = new DelimitedLineTokenizer(DelimitedLineTokenizer.DELIMITER_TAB);
    tokenizer.setNames(new String[] { "firstName", "lastName" });
    return tokenizer;
}
Run Code Online (Sandbox Code Playgroud)


Pav*_*vel 5

简单的方法:

@Bean
public FlatFileItemReader<Person> reader() {
    return new FlatFileItemReaderBuilder<Person>()
            .name("personItemReader")
            .resource(new ClassPathResource("sample-data.csv"))
            .lineTokenizer(new DelimitedLineTokenizer(DelimitedLineTokenizer.DELIMITER_TAB) {{
                setNames(new String[]{"firstName", "lastName"});
            }})
            .fieldSetMapper(new BeanWrapperFieldSetMapper<Person>() {{
                setTargetType(Person.class);
            }})
            .build();
}
Run Code Online (Sandbox Code Playgroud)