spark.streams.addListener(new StreamingQueryListener() {\n ......\n override def onQueryProgress(queryProgress: QueryProgressEvent): Unit = {\n println("Query made progress: " + queryProgress.progress)\n }\n ......\n})\nRun Code Online (Sandbox Code Playgroud)\n\n当 StreamingQueryListener 添加到 Spark Structured Streaming 会话并连续输出 queryProgress 时,您将获得的指标之一是urationMs:
\n\nQuery made progress: {\n ......\n "durationMs" : {\n "addBatch" : 159136,\n "getBatch" : 0,\n "getEndOffset" : 0,\n "queryPlanning" : 38,\n "setOffsetRange" : 14,\n "triggerExecution" : 159518,\n "walCommit" : 182\n }\n ......\n}\xe2\x80\x8b\nRun Code Online (Sandbox Code Playgroud)\n\n谁能告诉我durationMs中的那些子指标在spark上下文中意味着什么?例如“addBatch 159136”是什么意思。
\n我在 Zookeeper 中看到了这个 shell 脚本zkClear.sh。
ZOOBIN="${BASH_SOURCE-$0}"
Run Code Online (Sandbox Code Playgroud)
谁能告诉我这个脚本的含义?谢谢。