The following error appeared in /var/log/messages:
Sep 19 13:18:15 wdc kernel: [2772302.630416] Machine check events logged
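Shortly afterwards, the whole server became unresponsive. This is from the Dom0 log of a Xen server (running the latest version on Debian Squeeze).
Can anyone shed light on what this error means? Should I order new hardware?
Edit: Also, the message seems to imply that something was logged. Where can I find it?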
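For more information, check the log file below, which should contain a detailed description of the problem. This file may or may not exist, depending on how logging is configured in /etc/mcelog/mcelog.conf.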
/var/log/mcelog
Or just run the command:
mcelog
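The two options above (read the log file if it is there, otherwise run the decoder) can be sketched as a small check. This is only an illustration: `pick_mce_source` is a hypothetical helper name, and the default path is the one mentioned in this answer, which may differ depending on mcelog.conf.

```python
import os


def pick_mce_source(log_path="/var/log/mcelog"):
    """Decide where to look for decoded machine check events.

    Returns ("file", log_path) when the log file exists and is non-empty,
    otherwise ("command", "mcelog") to suggest running the decoder directly.
    The default path is an assumption taken from the answer text.
    """
    if os.path.isfile(log_path) and os.path.getsize(log_path) > 0:
        return ("file", log_path)
    return ("command", "mcelog")


if __name__ == "__main__":
    kind, target = pick_mce_source()
    print(f"look at {kind}: {target}")
```

The helper only reports where to look; it does not run mcelog or read the file itself.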
mcelog decodes the kernel's machine check logs on x86 machines. From man mcelog:
X86 CPUs report errors detected by the CPU as machine check events (MCEs). These
can be data corruption detected in the CPU caches, in main memory by an integrated
memory controller, data transfer errors on the front side bus or CPU interconnect or
other internal errors. Possible causes can be cosmic radiation, instable power
supplies, cooling problems, broken hardware, or bad luck.
Most errors can be corrected by the CPU by internal error correction mechanisms.
Uncorrected errors cause machine check exceptions which may panic the machine.
When a corrected error happens the x86 kernel writes a record describing the MCE into
a internal ring buffer available through the /dev/mcelog device. mcelog retrieves
errors from /dev/mcelog, decodes them into a human readable format and prints them on
the standard output or optionally into the system log.
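The kernel notice quoted in the question only says that events were recorded, not how often. To get a rough idea of frequency, one could scan the syslog file for that notice. The sketch below assumes the message text is exactly "Machine check events logged", as in the question; `find_mce_notices` is a hypothetical helper name.

```python
# Message text as it appears in the question's /var/log/messages excerpt.
MCE_MARKER = "Machine check events logged"


def find_mce_notices(lines):
    """Return the syslog lines that contain the kernel's MCE notice.

    `lines` is any iterable of strings, for example an open log file.
    Trailing newlines are stripped from the returned lines.
    """
    return [line.rstrip("\n") for line in lines if MCE_MARKER in line]
```

Passing an open handle on /var/log/messages as `lines` returns every matching notice with its timestamp. The notices themselves carry no detail, so the decoded output from mcelog is still needed to judge the hardware.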
You can find more information about mcelog and its configuration, errors, and triggers on the mcelog project web page.