linux_dsm_epyc7002/arch/x86/kernel/cpu/mcheck
Borislav Petkov 17fea54bf0 x86/mce: Fix MCE severity messages
Derek noticed that a critical MCE gets reported with the wrong
error type description:

  [Hardware Error]: CPU 34: Machine Check Exception: 5 Bank 9: f200003f000100b0
  [Hardware Error]: RIP !INEXACT! 10:<ffffffff812e14c1> {intel_idle+0xb1/0x170}
  [Hardware Error]: TSC 49587b8e321cb
  [Hardware Error]: PROCESSOR 0:306e4 TIME 1431561296 SOCKET 1 APIC 29
  [Hardware Error]: Some CPUs didn't answer in synchronization
  [Hardware Error]: Machine check: Invalid
				   ^^^^^^^

The last line with 'Invalid' should have printed the high level
MCE error type description we get from mce_severity, i.e.
something like:

  [Hardware Error]: Machine check: Action required: data load error in a user process

this happens due to the fact that mce_no_way_out() iterates over
all MCA banks and possibly overwrites the @msg argument which is
used in the panic printing later.

Change behavior to take the message of only and the (last)
critical MCE it detects.

Reported-by: Derek <denc716@gmail.com>
Signed-off-by: Borislav Petkov <bp@suse.de>
Cc: <stable@vger.kernel.org>
Cc: H. Peter Anvin <hpa@zytor.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Tony Luck <tony.luck@intel.com>
Link: http://lkml.kernel.org/r/1431936437-25286-3-git-send-email-bp@alien8.de
Signed-off-by: Ingo Molnar <mingo@kernel.org>
2015-05-18 10:31:22 +02:00
..
Makefile ACPI, APEI, Generic Hardware Error Source memory error support 2010-05-19 22:41:16 -04:00
mce_amd.c x86/MCE/AMD: Enable thresholding interrupts by default if supported 2015-02-19 13:24:47 +01:00
mce_intel.c x86/MCE/intel: Cleanup CMCI storm logic 2015-02-19 13:24:25 +01:00
mce-apei.c ACPI, APEI, GHES: Do not report only correctable errors with SCI 2013-12-21 13:31:06 +01:00
mce-inject.c x86: Replace __get_cpu_var uses 2014-08-26 13:45:49 -04:00
mce-internal.h x86/mce: Define mce_severity function pointer 2015-03-24 12:14:15 +01:00
mce-severity.c x86/mce/severity: Fix warning about indented braces 2015-04-03 15:20:38 +02:00
mce.c x86/mce: Fix MCE severity messages 2015-05-18 10:31:22 +02:00
p5.c x86: Clean up cr4 manipulation 2015-02-04 12:10:41 +01:00
therm_throt.c x86/mce: Avoid showing repetitive message from intel_init_thermal() 2014-09-19 12:56:05 +02:00
threshold.c asmlinkage, x86: Add explicit __visible to arch/x86/* 2014-05-05 16:07:44 -07:00
winchip.c x86: Clean up cr4 manipulation 2015-02-04 12:10:41 +01:00