linux_dsm_epyc7002/drivers/gpu/drm/amd
Guchun Chen 12c17b9d62 drm/amdgpu: fix kernel page fault issue by ras recovery on sGPU
When running ras uncorrectable error injection and triggering GPU
reset on sGPU, below issue is observed. It's caused by the list
uninitialized when accessing.

[   80.047227] BUG: unable to handle page fault for address: ffffffffc0f4f750
[   80.047300] #PF: supervisor write access in kernel mode
[   80.047351] #PF: error_code(0x0003) - permissions violation
[   80.047404] PGD 12c20e067 P4D 12c20e067 PUD 12c210067 PMD 41c4ee067 PTE 404316061
[   80.047477] Oops: 0003 [#1] SMP PTI
[   80.047516] CPU: 7 PID: 377 Comm: kworker/7:2 Tainted: G           OE     5.4.0-rc7-guchchen #1
[   80.047594] Hardware name: System manufacturer System Product Name/TUF Z370-PLUS GAMING II, BIOS 0411 09/21/2018
[   80.047888] Workqueue: events amdgpu_ras_do_recovery [amdgpu]

Signed-off-by: Guchun Chen <guchun.chen@amd.com>
Reviewed-by: John Clements <John.Clements@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2020-04-22 18:11:46 -04:00
..
acp drm: amd/acp: fix broken menu structure 2020-03-13 11:52:35 -04:00
amdgpu drm/amdgpu: fix kernel page fault issue by ras recovery on sGPU 2020-04-22 18:11:46 -04:00
amdkfd drm/amdkfd: kfree the wrong pointer 2020-04-01 14:44:22 -04:00
display drm/amd/display: remove redundant assignment to variable dp_ref_clk_khz 2020-04-22 18:11:45 -04:00
include drm/amd/powerplay: revise the way to retrieve the board parameters 2020-04-22 18:11:45 -04:00
powerplay drm/amd/powerplay: fix resume failed as smu table initialize early exit 2020-04-22 18:11:46 -04:00