linux_dsm_epyc7002

mirror of https://github.com/AuxXxilium/linux_dsm_epyc7002.git synced 2024-12-28 11:18:45 +07:00

Author	SHA1	Message	Date
Ben Skeggs	a2ac09a03d	drm/nouveau/core: allow detected chipset to be overridden Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-05-14 16:58:06 +10:00
Colin Ian King	a2f07d4c1e	drm/nouveau/fb/ramgk104: fix spelling mistake "sucessfully" -> "successfully" There is a spelling mistake in a nvkm_debug message. Fix it. Signed-off-by: Colin Ian King <colin.king@canonical.com> Reviewed-by: Mukesh Ojha <mojha@codeaurora.org> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-05-01 11:08:39 +10:00
Lyude Paul	342406e4fb	drm/nouveau/i2c: Disable i2c bus access after ->fini() For a while, we've had the problem of i2c bus access not grabbing a runtime PM ref when it's being used in userspace by i2c-dev, resulting in nouveau spamming the kernel log with errors if anything attempts to access the i2c bus while the GPU is in runtime suspend. An example: [ 130.078386] nouveau 0000:01:00.0: i2c: aux 000d: begin idle timeout ffffffff Since the GPU is in runtime suspend, the MMIO region that the i2c bus is on isn't accessible. On x86, the standard behavior for accessing an unavailable MMIO region is to just return ~0. Except, that turned out to be a lie. While computers with a clean conscience will return ~0 in this scenario, some machines will actually completely hang a CPU on certain bad MMIO accesses. This was witnessed with someone's Lenovo ThinkPad P50, where sensors-detect attempting to access the i2c bus while the GPU was suspended would result in a CPU hang: CPU: 5 PID: 12438 Comm: sensors-detect Not tainted 5.0.0-0.rc4.git3.1.fc30.x86_64 #1 Hardware name: LENOVO 20EQS64N17/20EQS64N17, BIOS N1EET74W (1.47 ) 11/21/2017 RIP: 0010:ioread32+0x2b/0x30 Code: 81 ff ff ff 03 00 77 20 48 81 ff 00 00 01 00 76 05 0f b7 d7 ed c3 48 c7 c6 e1 0c 36 96 e8 2d ff ff ff b8 ff ff ff ff c3 8b 07 <c3> 0f 1f 40 00 49 89 f0 48 81 fe ff ff 03 00 76 04 40 88 3e c3 48 RSP: 0018:ffffaac3c5007b48 EFLAGS: 00000292 ORIG_RAX: ffffffffffffff13 RAX: 0000000001111000 RBX: 0000000001111000 RCX: 0000043017a97186 RDX: 0000000000000aaa RSI: 0000000000000005 RDI: ffffaac3c400e4e4 RBP: ffff9e6443902c00 R08: ffffaac3c400e4e4 R09: ffffaac3c5007be7 R10: 0000000000000004 R11: 0000000000000001 R12: ffff9e6445dd0000 R13: 000000000000e4e4 R14: 00000000000003c4 R15: 0000000000000000 FS: 00007f253155a740(0000) GS:ffff9e644f600000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 00005630d1500358 CR3: 0000000417c44006 CR4: 00000000003606e0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Call Trace: g94_i2c_aux_xfer+0x326/0x850 [nouveau] nvkm_i2c_aux_i2c_xfer+0x9e/0x140 [nouveau] __i2c_transfer+0x14b/0x620 i2c_smbus_xfer_emulated+0x159/0x680 ? _raw_spin_unlock_irqrestore+0x1/0x60 ? rt_mutex_slowlock.constprop.0+0x13d/0x1e0 ? __lock_is_held+0x59/0xa0 __i2c_smbus_xfer+0x138/0x5a0 i2c_smbus_xfer+0x4f/0x80 i2cdev_ioctl_smbus+0x162/0x2d0 [i2c_dev] i2cdev_ioctl+0x1db/0x2c0 [i2c_dev] do_vfs_ioctl+0x408/0x750 ksys_ioctl+0x5e/0x90 __x64_sys_ioctl+0x16/0x20 do_syscall_64+0x60/0x1e0 entry_SYSCALL_64_after_hwframe+0x49/0xbe RIP: 0033:0x7f25317f546b Code: 0f 1e fa 48 8b 05 1d da 0c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d ed d9 0c 00 f7 d8 64 89 01 48 RSP: 002b:00007ffc88caab68 EFLAGS: 00000246 ORIG_RAX: 0000000000000010 RAX: ffffffffffffffda RBX: 00005630d0fe7260 RCX: 00007f25317f546b RDX: 00005630d1598e80 RSI: 0000000000000720 RDI: 0000000000000003 RBP: 00005630d155b968 R08: 0000000000000001 R09: 00005630d15a1da0 R10: 0000000000000070 R11: 0000000000000246 R12: 00005630d1598e80 R13: 00005630d12f3d28 R14: 0000000000000720 R15: 00005630d12f3ce0 watchdog: BUG: soft lockup - CPU#5 stuck for 23s! [sensors-detect:12438] Yikes! While I wanted to try to make it so that accessing an i2c bus on nouveau would wake up the GPU as needed, airlied pointed out that pretty much any use case for userspace accessing an i2c bus on a GPU (mainly for the DDC brightness control that some displays have) is going to only be useful while there's at least one display enabled on the GPU anyway, and the GPU never sleeps while there are displays running. Since teaching the i2c bus to wake up the GPU on userspace accesses is a good deal more difficult than it might seem, mostly due to the fact that we have to use the i2c bus during runtime resume of the GPU, we instead opt for the easiest solution: don't let userspace access i2c busses on the GPU at all while it's in runtime suspend. Changes since v1: * Also disable i2c busses that run over DP AUX Signed-off-by: Lyude Paul <lyude@redhat.com> Cc: stable@vger.kernel.org Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-05-01 11:08:39 +10:00
Jon Derrick	15516bf9ab	drm/nouveau/mmu: qualify vmm during dtor If the BAR initialization failed it may leave the vmm structure in an uninitialized state, leading to a null-pointer-dereference when the vmm is dereferenced during teardown. Signed-off-by: Jon Derrick <jonathan.derrick@intel.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-05-01 11:08:39 +10:00
Jon Derrick	12e08beb32	drm/nouveau/bar/gf100: ensure BAR is mapped If the BAR is zero size, it indicates it was never successfully mapped. Ensure that the BAR is valid during initialization before attempting to use it. Signed-off-by: Jon Derrick <jonathan.derrick@intel.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-05-01 11:08:39 +10:00
Jon Derrick	f10b83de1f	drm/nouveau/bar/nv50: ensure BAR is mapped If the BAR is zero size, it indicates it was never successfully mapped. Ensure that the BAR is valid during initialization before attempting to use it. Signed-off-by: Jon Derrick <jonathan.derrick@intel.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-05-01 11:08:39 +10:00
Jon Derrick	307a312df9	drm/nouveau/bar/nv50: check bar1 vmm return value Check bar1's new vmm creation return value for errors. Signed-off-by: Jon Derrick <jonathan.derrick@intel.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-05-01 11:08:39 +10:00
Ben Skeggs	a261a20c01	drm/nouveau/fault/gv100-: expose VoltaFaultBufferA This nvclass exposes the replayable fault buffer, which will be used by SVM to manage GPU page faults. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:01 +10:00
Ben Skeggs	13e9572906	drm/nouveau/fault/gp100: expose MaxwellFaultBufferA This nvclass exposes the replayable fault buffer, which will be used by SVM to manage GPU page faults. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	ab2ee9ffa3	drm/nouveau/mmu/gp100-: support vmms with gcc/tex replayable faults enabled Some GPU units are capable of supporting "replayable" page faults, where the execution unit will wait for SW to fixup GPU page tables rather than triggering a channel-fatal fault. This feature isn't useful (it's harmful, even) unless something like HMM is being used to manage events appearing in the replayable fault buffer, so, it's disabled by default. This commit allows a client to request it be enabled. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	71871aa6df	drm/nouveau/mmu/gp100-: add privileged methods for fault replay/cancel Host methods exist to do at least some of what we need, but we are not currently pushing replay/cancels through a channel like UVM does as it's not clear whether it's necessary in our case (UVM also updates PTEs with the GPU). UVM also pushes a software method for fault cancels on Pascal, seemingly because the host methods don't appear to be sufficient. If/when we want to push the replay/cancel on the GPU, we can re-purpose the cancellation code here to implement that swmthd. Keep it simple for now, until we figure out exactly what we need here. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	a5ff307fe1	drm/nouveau/mmu: add a privileged method to directly manage PTEs This provides a somewhat more direct method of manipulating the GPU page tables, which will be required to support SVM. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	8e68271d7c	drm/nouveau/mmu: store mapped flag separately from memory pointer This will be used to support a privileged client providing PTEs directly, without a memory object to use as a reference. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	2606f29162	drm/nouveau/mmu: support initialisation of client-managed address-spaces NVKM is currently responsible for managing the allocation of a client's GPU address-space, but there are various use-cases (e.g. HMM address-space mirroring) where giving a client more direct control is desirable. This commit allows for a VMM to be created where the area allocated for NVKM is limited to a client-specified window; the remainder of the address-space is controlled directly by the client. Leaving a window is necessary to support various internal requirements, but also to support existing allocation interfaces, as not all of the HW is capable of working with a HMM allocation. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	ae5ea7f6a8	drm/nouveau/gr/gf100-: expose method to determine current context MMU will need access to this info. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	169f30b35d	drm/nouveau/gr/gf100-: expose fecs methods for pausing ctxsw MMU will need access to these. v2. Apply fix from Rhys Kidd to send correct FECS method for STOP_CTXSW. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Colin Ian King	8e083686ec	drm/nouveau/falcon: fix a few indentation issues There are a few statements that are indented incorrectly. Fix these. Signed-off-by: Colin Ian King <colin.king@canonical.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	d389fd4fa9	drm/nouveau/mmu/gf100-: virtualise setting pdb base address for invalidation It appears that Pascal and newer need something different. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	874c1b56f3	drm/nouveau/mmu/gf100-: make mmu invalidate function more general Will want to reuse this for fault replay/cancellation swmthds. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	8e44b987e8	drm/nouveau/gr/gf100-: store fecs/gpccs falcon pointers in substructures Future changes will want to add some additional things here, keep them grouped together. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	b7f713b8d3	drm/nouveau/gr/gf100-: move fecs bind_pointer into a function Makes the code somewhat less magic. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	8c7db76844	drm/nouveau/gr/gf100-: remove some unnecessary reg writes This is already done during golden context creation. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	7d51bc85d7	drm/nouveau/gr/gf100-: move fecs elpg setup into functions Makes the code somewhat less magic. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	8bf2d348bd	drm/nouveau/gr/gf100-: move fecs discover_pm_image_size into a function Makes the code somewhat less magic. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 09:00:00 +10:00
Ben Skeggs	7d3f06881d	drm/nouveau/gr/gf100-: move fecs discover_zcull_image_size into a function Makes the code somewhat less magic. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:59 +10:00
Ben Skeggs	0b89ca0dc3	drm/nouveau/gr/gf100-: move fecs discover_image_size into a function Makes the code somewhat less magic. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:59 +10:00
Ben Skeggs	eb383e629c	drm/nouveau/gr/gf100-: move fecs set_watchdog_timeout method into a function Makes the code somewhat less magic. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:59 +10:00
Ben Skeggs	a8ce8b65e1	drm/nouveau/disp/gf119-: decode exception reason to human-readable string We also change the error strings to match NVIDIA's naming. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:59 +10:00
Ben Skeggs	eb972d1474	drm/nouveau/bios/init: handle INIT_GENERIC_CONDITION_ID_NO_PANEL_SEQ_DELAYS As I currently understand it, this is related to features we have no support for as of yet. In theory, this change should be a noop, just without the warning. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:59 +10:00
Ben Skeggs	81f2bb5d65	drm/nouveau/bios/init: label existing INIT_GENERIC_CONDITION types Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:59 +10:00
Ben Skeggs	c774ce66c5	drm/nouveau/secboot: fix missing newline in error messages Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:59 +10:00
Ben Skeggs	8d2c1e3376	drm/nouveau/sec2/tu102-: instantiate SEC2 falcon Required for ACR. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:59 +10:00
Ben Skeggs	fdad518362	drm/nouveau/sec2: utilise engine PRI address from TOP Turing has its SEC2 instance in an alternate location, and this avoids needing to duplicate the code here for it. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:59 +10:00
Ben Skeggs	1a34693490	drm/nouveau/nvdec/tu102-: instantiate NVDEC0 falcon Required to run VPR scrubber binary as part of secboot. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
Ben Skeggs	0457427350	drm/nouveau/nvdec/gp102-: utilise engine PRI address from TOP Turing has its NVDEC instances in an alternate location. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
Ben Skeggs	2944b19b5c	drm/nouveau/gsp/gv100-: instantiate GSP falcon We need this for Turing ACR, but it's present from Volta onwards. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
Ben Skeggs	7975dfc36a	drm/nouveau/top/gv100-: translate entry for the GSP So we're able to connect fault/interrupt handling to the GSP subdev. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
Ben Skeggs	eec9ffe47f	drm/nouveau/top: add function to lookup PRI address for devices Will be using this in upcoming changes to avoid the need for entirely new subdevs to deal with Turing register moves. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
Ben Skeggs	78cdadb840	drm/nouveau/core: define GSP subdev Exact meaning of the acronym is unknown, but we need this for Turing ACR. Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
Colin Ian King	b1d03fc36e	drm/nouveau/pmu: don't print reply values if exec is false Currently the uninitialized values in the array reply are printed out when exec is false and nvkm_pmu_send has not updated the array. Avoid confusion by only dumping out these values if they have actually been updated. Detected by CoverityScan, CID#1271291 ("Uninitialized scalar variable") Fixes: `ebb58dc2ef` ("drm/nouveau/pmu: rename from pwr (no binary change)") Signed-off-by: Colin Ian King <colin.king@canonical.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
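The pattern behind b1d03fc36e can be sketched in plain C. This is only an illustration of the idea in the commit message, not the driver code: `demo_send` and `demo_report` are hypothetical names, and the reply values are made up.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-in for a send helper: writes reply[] only when exec is true. */
void demo_send(bool exec, uint32_t reply[2])
{
	if (exec) {
		reply[0] = 0x1234;
		reply[1] = 0x5678;
	}
}

/* Formats a status line; reply[] is read only after it has been written. */
int demo_report(bool exec, char *buf, size_t len)
{
	uint32_t reply[2]; /* deliberately left uninitialised */

	demo_send(exec, reply);
	if (!exec)
		return snprintf(buf, len, "exec skipped");
	return snprintf(buf, len, "reply %08x %08x",
			(unsigned)reply[0], (unsigned)reply[1]);
}

/* Small self-check exercising both paths. */
void demo_report_check(void)
{
	char buf[64];

	demo_report(false, buf, sizeof(buf));
	assert(strcmp(buf, "exec skipped") == 0);
	demo_report(true, buf, sizeof(buf));
	assert(strcmp(buf, "reply 00001234 00005678") == 0);
}
```

Guarding the read on the same condition that guards the write keeps the uninitialised array out of the log output.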
Colin Ian King	13649101a2	drm/nouveau/bios/ramcfg: fix missing parentheses when calculating RON Currently, the expression for calculating RON is always going to result in zero no matter the value of ram->mr[1] because the ! operator has higher precedence than the shift >> operator. I believe adding the missing parentheses around the expression before applying the ! operator will give the desired result. [ Note, not tested ] Detected by CoverityScan, CID#1324005 ("Operands don't affect result") Fixes: `c25bf7b615` ("drm/nouveau/bios/ramcfg: Separate out RON pull value") Signed-off-by: Colin Ian King <colin.king@canonical.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
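The precedence problem described in 13649101a2 can be reproduced in a few lines of C. The mask, shift, and function names below are assumptions chosen for illustration and are not taken from the driver source.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical mask and shift, used only to demonstrate the precedence issue. */
#define RON_MASK  0x300u
#define RON_SHIFT 8

/* Buggy form: `!` binds tighter than `>>`, so this parses as (!(x)) >> 8.
 * `!` yields 0 or 1, and either value shifted right by 8 is 0. */
int ron_buggy(uint32_t mr1)
{
	return !(mr1 & RON_MASK) >> RON_SHIFT;
}

/* Parenthesised form: shift the masked field down first, then negate it. */
int ron_fixed(uint32_t mr1)
{
	return !((mr1 & RON_MASK) >> RON_SHIFT);
}

/* Small self-check covering a clear and a set field. */
void ron_check(void)
{
	assert(ron_buggy(0x000) == 0);
	assert(ron_buggy(0x300) == 0);
	assert(ron_fixed(0x000) == 1);
	assert(ron_fixed(0x300) == 0);
}
```

With the extra parentheses the result depends on the masked bits again, which is what Coverity's "Operands don't affect result" report was pointing at.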
Colin Ian King	d83d345338	drm/nouveau/bios/dp: make array vsoff static, shrinks object size Don't populate the array vsoff on the stack but instead make it static. Makes the object code smaller by 67 bytes: Before: text data bss dec hex filename 5753 112 0 5865 16e9 .../nouveau/nvkm/subdev/bios/dp.o After: text data bss dec hex filename 5622 176 0 5798 16a6 .../nouveau/nvkm/subdev/bios/dp.o (gcc version 8.2.0 x86_64) Signed-off-by: Colin Ian King <colin.king@canonical.com> Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
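The change in d83d345338 follows a common C idiom, sketched below with made-up table values and hypothetical function names. Whether the stack version really costs extra instructions depends on the compiler and optimisation level, so treat the size effect as something to measure, as the commit does with its before/after figures.

```c
#include <assert.h>
#include <stdint.h>

/* Non-static local array: the compiler may emit code to build it on the
 * stack on every call. Values are illustrative only. */
uint16_t lookup_stack(unsigned idx)
{
	const uint16_t vsoff[] = { 0, 4, 7, 9 };

	return vsoff[idx & 3];
}

/* static const array: a single copy lives in the object's data section,
 * so no per-call initialisation code is needed. */
uint16_t lookup_static(unsigned idx)
{
	static const uint16_t vsoff[] = { 0, 4, 7, 9 };

	return vsoff[idx & 3];
}

/* Both variants must return the same values. */
void lookup_check(void)
{
	for (unsigned i = 0; i < 4; i++)
		assert(lookup_stack(i) == lookup_static(i));
}
```

This matches the shape of the numbers in the commit message: text shrinks while data grows by the size of the table.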
Ben Skeggs	b6c8285476	drm/nouveau/ce/tu102: rename implementation from tu104 Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
Ben Skeggs	f10271ffda	drm/nouveau/fifo/tu102: rename implementation from tu104 Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
Ben Skeggs	8603774233	drm/nouveau/disp/tu102: rename implementation from tu104 Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
Ben Skeggs	954f97983c	drm/nouveau/fault/tu102: rename implementation from tu104 Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:58 +10:00
Ben Skeggs	ef7664d9df	drm/nouveau/bar/tu102: rename implementation from tu104 Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:57 +10:00
Ben Skeggs	c011b25421	drm/nouveau/mmu/tu102: rename implementation from tu104 Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:57 +10:00
Ben Skeggs	fd95bfbdb9	drm/nouveau/mc/tu102: rename implementation from tu104 Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:57 +10:00
Ben Skeggs	b51f9dfac7	drm/nouveau/devinit/tu102: rename implementation from tu104 Signed-off-by: Ben Skeggs <bskeggs@redhat.com>	2019-02-20 08:59:57 +10:00


1540 Commits