由于状态无效,无法为Kafka流打开商店

Yan*_*ann 1 java apache-kafka apache-kafka-streams

我正在尝试使用Kafka Streams,并且创建了以下拓扑:

    KStream<String, HistoryEvent> eventStream = builder.stream(applicationTopicName, Consumed.with(Serdes.String(),
            historyEventSerde));

    eventStream.selectKey((key, value) -> new HistoryEventKey(key, value.getIdentifier()))
            .groupByKey()
            .reduce((e1, e2) -> e2, Materialized.as(streamByKeyStoreName));
Run Code Online (Sandbox Code Playgroud)

稍后,我像这样启动流:

private void startKafkaStreams(KafkaStreams streams) {
    CompletableFuture<KafkaStreams.State> stateFuture = new CompletableFuture<>();
    streams.setStateListener((newState, oldState) -> {
        if(stateFuture.isDone()) {
            return;
        }

        if(newState == KafkaStreams.State.RUNNING || newState == KafkaStreams.State.ERROR) {
            stateFuture.complete(newState);
        }
    });

    streams.start();
    try {
        KafkaStreams.State finalState = stateFuture.get();
        if(finalState != KafkaStreams.State.RUNNING) {
            // ...
        }
    } catch (InterruptedException ex) {
        // ...
    } catch(ExecutionException ex) {
        // ...
    }
}
Run Code Online (Sandbox Code Playgroud)

我的流开始时没有错误,并且最终进入了RUNNING完成未来的状态。稍后,我尝试访问在KTable拓扑中创建的存储:

public KafkaFlowHistory createFlowHistory(String flowId) {
    ReadOnlyKeyValueStore<HistoryEventKey, HistoryEvent> store = streams.store(streamByKeyStoreName,
            QueryableStoreTypes.keyValueStore());
    return new KafkaFlowHistory(flowId, store, event -> topicProducer.send(new ProducerRecord<>(applicationTopicName, flowId, event)));
}
Run Code Online (Sandbox Code Playgroud)

我已经验证了createFlowHistoryRUNNING状态初始化完成后会调用,但是我始终无法执行此操作,并且KafkaStreams报告以下错误:

线程“主”中的异常org.apache.kafka.streams.errors.InvalidStateStoreException:由于流线程是PARTITIONS_ASSIGNED,而不是RUNNING,因此无法获取状态存储流事件流文件服务测试实例按键

显然,线程的状态已更改。在尝试查询商店并等待Kafka的内部线程进入正确状态时,是否需要手动进行此操作?

Mat*_*Sax 5

旧版本(2.2.0 之前

启动时,Kafka Streams会执行以下状态转换:

CREATED -> RUNNING -> REBALANCING -> RUNNING
Run Code Online (Sandbox Code Playgroud)

您需要等待第二个RUNNING状态才能查询。

新版本: 从2.2.0 版开始

启动时的状态转换行为已更改(通过https://issues.apache.org/jira/browse/KAFKA-7657)为:

CREATED -> REBALANCING -> RUNNING
Run Code Online (Sandbox Code Playgroud)

因此,您不应再遇到此问题。