CPU hardware errors in Ubuntu 17.04


12

Can someone explain these error messages that I get when I look at dmesg? I'm new to Ubuntu and the Linux world.

[ 7.802351] CPU4: Core temperature above threshold, cpu clock throttled (total events = 1)
[ 7.802352] CPU0: Core temperature above threshold, cpu clock throttled (total events = 1)
[ 7.802353] CPU5: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 7.802354] CPU0: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 7.802354] CPU4: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 7.802356] CPU1: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 7.802356] mce: [Hardware Error]: Machine check events logged
[ 7.802362] mce: [Hardware Error]: CPU 4: Machine Check: 0 Bank 128: 00000000880a0003
[ 7.802363] mce: [Hardware Error]: TSC 99561677c
[ 7.802385] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1501537538 SOCKET 0 APIC 1 microcode ba
[ 7.802387] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 128: 00000000880a0003
[ 7.802387] mce: [Hardware Error]: TSC 995616be4
[ 7.802388] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1501537538 SOCKET 0 APIC 0 microcode ba
[ 7.802389] CPU2: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 7.802390] CPU6: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 7.802391] CPU3: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 7.802392] CPU7: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 7.826359] CPU4: Core temperature/speed normal
[ 7.826359] CPU0: Core temperature/speed normal
[ 7.826360] CPU2: Package temperature/speed normal
[ 7.826361] CPU6: Package temperature/speed normal
[ 7.826361] CPU0: Package temperature/speed normal
[ 7.826362] CPU4: Package temperature/speed normal
[ 7.826363] mce: [Hardware Error]: Machine check events logged
[ 7.826367] mce: [Hardware Error]: CPU 4: Machine Check: 0 Bank 128: 00000000880b0002
[ 7.826368] mce: [Hardware Error]: TSC 99916f004
[ 7.826369] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1501537538 SOCKET 0 APIC 1 microcode ba
[ 7.826369] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 128: 00000000880b0002
[ 7.826370] mce: [Hardware Error]: TSC 99916f2ca
[ 7.826370] mce: [Hardware Error]: PROCESSOR 0:506e3 TIME 1501537538 SOCKET 0 APIC 0 microcode ba
[ 7.826400] CPU1: Package temperature/speed normal
[ 7.826401] CPU5: Package temperature/speed normal
[ 7.826402] CPU3: Package temperature/speed normal
[ 7.826402] CPU7: Package temperature/speed normal
[ 467.922330] CPU4: Core temperature above threshold, cpu clock throttled (total events = 73)
[ 467.922331] CPU0: Core temperature above threshold, cpu clock throttled (total events = 73)
[ 467.922332] CPU7: Package temperature above threshold, cpu clock throttled (total events = 86)
[ 467.922333] CPU3: Package temperature above threshold, cpu clock throttled 

I'm running Ubuntu 17.04 with the 4.10.0-29-generic kernel.


Or is it a software bug? I see the same errors in the logs of a new Lenovo T480S, see bugs.launchpad.net/ubuntu/+source/linux/+bug/1781924
Rolf Schumacher

Answers:


8

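The CPU is overheating and logging MCEs (Machine Check Events). Bank 128 in these records is the kernel's pseudo-bank for thermal events, so the "Hardware Error" lines document the same throttling episodes rather than a separate fault. If you see no other temperature-related events in your syslog, the likely cause is that your CPU cooler, fan, heat pipe, or thermal paste isn't doing its job.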
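
To make the hex values in those records less opaque, here is a minimal sketch that decodes one of them. It assumes the Bank 128 status word is a copy of the Intel IA32_THERM_STATUS register (bit 0 = throttling active, bit 1 = sticky "has throttled" log, bits 22:16 = degrees below the maximum junction temperature, bit 31 = reading valid). The function name `decode_therm_status` is made up for illustration.

```shell
#!/bin/sh
# Decode the status word from a line such as
#   mce: [Hardware Error]: CPU 4: Machine Check: 0 Bank 128: 00000000880a0003
# ASSUMPTION: the word mirrors the IA32_THERM_STATUS layout described above.
decode_therm_status() {
    val=$((0x$1))                       # hex string -> integer
    throttling=$(( val & 1 ))           # bit 0: thermal throttling active now
    logged=$(( (val >> 1) & 1 ))        # bit 1: throttled at some point (sticky)
    readout=$(( (val >> 16) & 0x7f ))   # bits 22:16: degrees C below TjMax
    valid=$(( (val >> 31) & 1 ))        # bit 31: readout field is valid
    echo "throttling_now=$throttling throttled_since_reset=$logged below_tjmax=${readout}C reading_valid=$valid"
}

# The two distinct values that appear in the dmesg output above.
decode_therm_status 00000000880a0003
decode_therm_status 00000000880b0002
```

Under that assumption, the first value reads as "throttling right now" and the second as "no longer throttling, but it did happen", which lines up with the "above threshold" and "temperature/speed normal" messages logged next to them.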

  • Check the system log with this terminal command...

    grep -i -e temp -e therm /var/log/syslog*
    
  • If the machine is dusty or dirty, that may play a major role in the overheating. Clean it out.

  • If your computer has an Intel processor, make sure intel-microcode is installed.

    sudo apt-get update
    sudo apt-get install intel-microcode
    reboot
    
  • Try installing thermald to help keep the temperature under control.

    sudo apt-get update
    sudo apt-get install thermald
    reboot
    
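  • Check your BIOS version. Enter the BIOS at boot and note the version number. Then go to the manufacturer's website for your computer's make and model, open the support/downloads section, and see whether a newer BIOS is available.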

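  • Lastly, and quite likely if this is an older machine, the thermal paste between the processor and its heat pipe/fan cooler needs to be reapplied. This requires some technical experience.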
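
Before and after trying the steps above, it can help to look at the live numbers rather than only the log. The following is a rough sketch, assuming the kernel exposes thermal zones under /sys/class/thermal (temperatures in millidegrees Celsius) and per-CPU throttle counters under /sys/devices/system/cpu/cpuN/thermal_throttle. Not every machine or kernel provides both. The function names are illustrative, and the optional directory argument exists only so the functions can be pointed at a different tree.

```shell
#!/bin/sh
# Print each thermal zone's type and temperature in whole degrees Celsius.
# ASSUMPTION: zones live in <base>/thermal_zone*/ with "type" and "temp" files.
print_zone_temps() {
    base=${1:-/sys/class/thermal}
    for zone in "$base"/thermal_zone*; do
        [ -r "$zone/temp" ] || continue
        ztype=$(cat "$zone/type" 2>/dev/null || echo unknown)
        milli=$(cat "$zone/temp" 2>/dev/null) || continue
        [ -n "$milli" ] || continue
        echo "$ztype: $((milli / 1000)) C"
    done
}

# Print how many core throttle events each CPU has recorded since boot.
# ASSUMPTION: counters live in <base>/cpuN/thermal_throttle/core_throttle_count.
print_throttle_counts() {
    base=${1:-/sys/devices/system/cpu}
    for f in "$base"/cpu[0-9]*/thermal_throttle/core_throttle_count; do
        [ -r "$f" ] || continue
        cpu=${f#"$base"/}     # strip the base directory
        cpu=${cpu%%/*}        # keep only the "cpuN" component
        echo "$cpu: $(cat "$f") core throttle events"
    done
}

print_zone_temps
print_throttle_counts
```

If the throttle counts keep climbing while the machine is idle, that points toward a cooling problem (dust, fan, paste) rather than a one-off spike during boot.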


1
What effect does intel-microcode have on temperature?
gene_wood

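@gene_wood As I mentioned, it's probably dust, a fan, or a heat-sink problem; checking the microcode is just one more step in diagnosing the problem remotely. A CPU running old microcode can cause all sorts of problems. It takes a few seconds to check.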
heynnema '18
Licensed under cc by-sa 3.0 with attribution required.