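I'm using a Splunk forwarder to send my IBM MQ v9.1 error logs to a centralized cluster, so that I can see trends in the common errors occurring in my distributed messaging system.
However, I can't parse the fields I need because the format of the MQ error logs varies: a message's severity can be Error, Warning, Information, Severe, or Termination, and each severity has its own, inconsistent set of fields.
Please let me know if anyone has used regular expressions in Splunk to parse fields from IBM MQ v9.1 error logs.
I tried a few regex patterns, but they did not parse the logs as expected.
I have already looked at the link below, but it is for v8, and the v9 error log format is different: https://t-rob.net/2017/12/18/parsing-mq-error-logs-in-splunk/
In addition, the splunk user cannot access the error logs. I updated qm.ini with the following setting: ValidateAuth=No
I also ran chmod -R 755 on the /var/mqm/qmgrs/qmName/errors folder.
The permissions on the error logs do not change while the logs are being written to, but when the logs rotate the permissions are revoked and the splunk user can no longer read them.
Please let me know how to get around this without adding the splunk user to the mqm group.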
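I would suggest enabling JSON logging and forwarding those logs to Splunk, which should be able to parse this format.
In the IBM MQ v9.0.4 CD (Continuous Delivery) release, IBM added the ability to write logs in JSON format. MQ always writes to the original AMQERR0x.LOG files as well, even if you enable JSON logging. This feature is included in all MQ 9.1 LTS and CD releases.
The IBM MQ v9.1 Knowledge Center page IBM MQ > Configuring > Changing IBM MQ and queue manager configuration information > Attributes for changing queue manager configuration information > Diagnostic message logging > Diagnostic message service stanzas > Diagnostic message services has information on this topic. You can add the following to your qm.ini file to have the queue manager write log information to JSON-formatted files named AMQERR0x.json in the standard queue manager errors directory:
DiagnosticMessages:
   Service = File
   Name = JSONLogs
   Format = json
   FilePrefix = AMQERR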
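Once the JSON files are being written, field extraction no longer needs a regex. As a rough illustration of the trend analysis the question asks about, here is a minimal Python sketch that tallies records by severity and message ID. It assumes each record is written as one JSON object per line and uses the `loglevel` and `ibm_messageId` keys seen in the sample record in this answer; the function name `tally_messages` and the sample lines are hypothetical, and in practice Splunk's own JSON extraction would do this job.

```python
import json
from collections import Counter


def tally_messages(lines):
    """Count (loglevel, ibm_messageId) pairs across JSON log lines.

    Assumes one JSON object per line; blank or unparseable lines
    (for example a record that is still being written) are skipped.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip partial or corrupt lines
        if not isinstance(record, dict):
            continue
        key = (
            record.get("loglevel", "UNKNOWN"),
            record.get("ibm_messageId", "UNKNOWN"),
        )
        counts[key] += 1
    return counts


# Hypothetical, trimmed-down records for illustration only.
sample_lines = [
    '{"ibm_messageId":"AMQ9209E","loglevel":"ERROR","ibm_serverName":"QM1"}',
    '{"ibm_messageId":"AMQ9209E","loglevel":"ERROR","ibm_serverName":"QM1"}',
    '{"ibm_messageId":"AMQ92',  # truncated line, ignored
]

for (level, message_id), count in tally_messages(sample_lines).most_common():
    print(level, message_id, count)
```

To run it against a real file, pass an open file handle instead of the list, for example `tally_messages(open(path))` with `path` pointing at one of the AMQERR0x.json files.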
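As the OP noted, the JSON-formatted logs do not contain the EXPLANATION or ACTION sections that you see in the normal logs.
In IBM MQ v9.1 you can use the mqrc command to convert the JSON format into the familiar format you see in AMQERR01.LOG.
A simple example: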
cat <<EOL |mqrc -i json -o text -
{"ibm_messageId":"AMQ9209E","ibm_arithInsert1":0,"ibm_arithInsert2":0,"ibm_commentInsert1":"localhost (127.0.0.1)","ibm_commentInsert2":"TCP/IP","ibm_commentInsert3":"SYSTEM.DEF.SVRCONN","ibm_datetime":"2018-02-22T06:54:53.942Z","ibm_serverName":"QM1","type":"mq_log","host":"0df0ce19c711","loglevel":"ERROR","module":"amqccita.c:4214","ibm_sequence":"1519282493_947814358","ibm_remoteHost":"127.0.0.1","ibm_qmgrId":"QM1_2018-02-13_10.49.57","ibm_processId":4927,"ibm_threadId":4,"ibm_version":"9.1.0.5","ibm_processName":"amqrmppa","ibm_userName":"johndoe","ibm_installationName":"Installation1","ibm_installationDir":"/opt/mqm","message":"AMQ9209E: Connection to host 'localhost (127.0.0.1)' for channel 'SYSTEM.DEF.SVRCONN' closed."}
EOL
The output will be:
02/22/2018 06:54:53 AM - User(johndoe) Program(amqrmppa)
Host(0df0ce19c711) Installation(Installation1)
VRMF(9.1.0.5) QMgr(QM1)
Time(2018-02-22T11:54:53.942Z)
RemoteHost(127.0.0.1)
CommentInsert1(localhost (127.0.0.1))
CommentInsert2(TCP/IP)
CommentInsert3(SYSTEM.DEF.SVRCONN)
AMQ9209E: Connection to host 'localhost (127.0.0.1)' for channel
'SYSTEM.DEF.SVRCONN' closed.
EXPLANATION:
An error occurred receiving data from 'localhost (127.0.0.1)' over TCP/IP. The
connection to the remote host has unexpectedly terminated.
The channel name is 'SYSTEM.DEF.SVRCONN'; in some cases it cannot be determined
and so is shown as '????'.
ACTION:
Tell the systems administrator.
----- amqccita.c : 4214 -------------------------------------------------------
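If you want to do the same conversion from a script rather than a shell here-document, a minimal Python sketch follows. It simply pipes one JSON record to the `mqrc -i json -o text -` command shown above. The function name `json_record_to_text` and the injectable `run` parameter are hypothetical conveniences (the latter so the function can be exercised on a machine without MQ installed), and it assumes `mqrc` is on the PATH.

```python
import subprocess

# Same command line as the shell example: read JSON on stdin, write text.
MQRC_JSON_TO_TEXT = ["mqrc", "-i", "json", "-o", "text", "-"]


def json_record_to_text(json_line, run=subprocess.run):
    """Pipe one JSON log record to mqrc and return the text it prints.

    `run` defaults to subprocess.run; passing a stand-in makes the
    function usable in tests where mqrc is not available.
    """
    result = run(
        MQRC_JSON_TO_TEXT,
        input=json_line + "\n",
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if mqrc exits non-zero
    )
    return result.stdout
```

Calling `json_record_to_text(line)` with a line read from an AMQERR0x.json file should return the text-format entry, including the EXPLANATION and ACTION sections, provided mqrc accepts the record.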
You can also run mqrc with just the message ID from the JSON record. For AMQ9209E, for example, the command is:
mqrc AMQ9209E
The output will be:
536908297 0x20009209 rrcE_CONNECTION_CLOSED
536908297 0x20009209 urcMS_CONN_CLOSED
MESSAGE:
Connection to host '<insert one>' for channel '<insert three>' closed.
EXPLANATION:
An error occurred receiving data from '<insert one>' over <insert two>. The
connection to the remote host has unexpectedly terminated.
The channel name is '<insert three>'; in some cases it cannot be determined and
so is shown as '????'.
ACTION:
Tell the systems administrator.
You can go further and supply the inserts from the JSON record.
A sample portion of the JSON log:
"ibm_messageId":"AMQ9209E","ibm_arithInsert1":0,"ibm_arithInsert2":0,"ibm_commentInsert1":"localhost (127.0.0.1)","ibm_commentInsert2":"TCP/IP","ibm_commentInsert3":"SYSTEM.DEF.SVRCONN"
In the command below, each ibm_arithInsert value is passed with a -n flag and each ibm_commentInsert value is passed with a -c flag:
mqrc AMQ9209E -n 0 -n 0 -c "localhost (127.0.0.1)" -c "TCP/IP" -c "SYSTEM.DEF.SVRCONN"
The output is:
536908297 0x20009209 rrcE_CONNECTION_CLOSED
536908297 0x20009209 urcMS_CONN_CLOSED
MESSAGE:
Connection to host 'localhost (127.0.0.1)' for channel 'SYSTEM.DEF.SVRCONN'
closed.
EXPLANATION:
An error occurred receiving data from 'localhost (127.0.0.1)' over TCP/IP. The
connection to the remote host has unexpectedly terminated.
The channel name is 'SYSTEM.DEF.SVRCONN'; in some cases it cannot be determined
and so is shown as '????'.
ACTION:
Tell the systems administrator.
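The mapping from JSON inserts to -n and -c flags described above can be automated. The following is a minimal Python sketch, not an IBM-provided tool: it builds the mqrc argument list from one parsed JSON record, assuming the insert keys are named `ibm_arithInsertN` and `ibm_commentInsertN` as in the sample and that they should be passed in ascending order of N. The function name `build_mqrc_args` is hypothetical.

```python
import json
import re
import shlex


def build_mqrc_args(record):
    """Build the mqrc argument list for one parsed JSON log record.

    Each ibm_arithInsertN becomes "-n <value>" and each
    ibm_commentInsertN becomes "-c <value>", ordered by N.
    """

    def numbered(prefix):
        # Collect values whose key is the prefix followed by a number.
        pattern = re.compile(re.escape(prefix) + r"(\d+)$")
        found = []
        for key, value in record.items():
            match = pattern.match(key)
            if match:
                found.append((int(match.group(1)), value))
        found.sort(key=lambda pair: pair[0])
        return [value for _, value in found]

    args = ["mqrc", record["ibm_messageId"]]
    for value in numbered("ibm_arithInsert"):
        args += ["-n", str(value)]
    for value in numbered("ibm_commentInsert"):
        args += ["-c", str(value)]
    return args


record = json.loads(
    '{"ibm_messageId":"AMQ9209E","ibm_arithInsert1":0,"ibm_arithInsert2":0,'
    '"ibm_commentInsert1":"localhost (127.0.0.1)","ibm_commentInsert2":"TCP/IP",'
    '"ibm_commentInsert3":"SYSTEM.DEF.SVRCONN"}'
)
# shlex.join quotes arguments so the line can be pasted into a shell.
print(shlex.join(build_mqrc_args(record)))
```

The returned list can be passed directly to `subprocess.run`, which avoids shell quoting problems with values such as "localhost (127.0.0.1)".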