复制队列因错误而卡住

wed*_*edi 5 clickhouse

我有 3 个节点集群,带有复制 2 和复制表stats

最近看到副本数据库有延迟使用 /replica_satatus

db.stats:   Absolute delay: 0. Relative delay: 0.
db2.stats:  Absolute delay: 912916. Relative delay: 912916.
Run Code Online (Sandbox Code Playgroud)

这里的数据来自 system.replication_queue

Row 1:
??????
database: db2
table: stats
replica_name:           replica_2
position:               3
node_name:              queue-0001743101
type:                   GET_PART
create_time:            2018-06-19 20:57:42
required_quorum:        0
source_replica:         replica_1
new_part_name:          20180619_20180619_823572_823572_0
parts_to_merge:         []
is_detach:              0
is_currently_executing: 0
num_tries:              917943
last_exception:
last_attempt_time:      2018-06-29 15:32:50
num_postponed:          118617
postpone_reason:
last_postpone_time:     2018-06-29 15:32:23

Row 2:
??????
database: db2
table: stats
replica_name:           replica_2
position:               4
node_name:              queue-0001743103
type:                   MERGE_PARTS
create_time:            2018-06-19 20:57:48
required_quorum:        0
source_replica:         replica_1
new_part_name:          20180619_20180619_823568_823573_1
parts_to_merge:         ['20180619_20180619_823568_823568_0','20180619_20180619_823569_823569_0','20180619_20180619_823570_823570_0','20180619_20180619_823571_823571_0','20180619_20180619_823572_823572_0','20180619_20180619_823573_823573_0']
is_detach:              0
is_currently_executing: 0
num_tries:              917943
last_exception:         Code: 234, e.displayText() = DB::Exception: No active replica has part 20180619_20180619_823568_823573_1 or covering part, e.what() = DB::Exception
last_attempt_time:      2018-06-29 15:32:50
num_postponed:          199384
postpone_reason:        Not merging into part 20180619_20180619_823568_823573_1 because part 20180619_20180619_823572_823572_0 is not ready yet (log entry for that part is being processed).
last_postpone_time:     2018-06-29 15:32:35
Run Code Online (Sandbox Code Playgroud)

任何线索如何处理它?

我应该分离损坏的 replika 分区并重新附加它吗?

del*_*nic 0

停止对此集群的所有插入,它应该自动清除复制队列。