linux_dsm_epyc7002

mirror of https://github.com/AuxXxilium/linux_dsm_epyc7002.git synced 2024-11-26 00:30:53 +07:00

History

Shaohua Li 9329672021 x86: Spread tlb flush vector between nodes Currently flush tlb vector allocation is based on below equation: sender = smp_processor_id() % 8 This isn't optimal, CPUs from different node can have the same vector, this causes a lot of lock contention. Instead, we can assign the same vectors to CPUs from the same node, while different node has different vectors. This has below advantages: a. if there is lock contention, the lock contention is between CPUs from one node. This should be much cheaper than the contention between nodes. b. completely avoid lock contention between nodes. This especially benefits kswapd, which is the biggest user of tlb flush, since kswapd sets its affinity to specific node. In my test, this could reduce > 20% CPU overhead in extreme case.The test machine has 4 nodes and each node has 16 CPUs. I then bind each node's kswapd to the first CPU of the node. I run a workload with 4 sequential mmap file read thread. The files are empty sparse file. This workload will trigger a lot of page reclaim and tlbflush. The kswapd bind is to easy trigger the extreme tlb flush lock contention because otherwise kswapd keeps migrating between CPUs of a node and I can't get stable result. Sure in real workload, we can't always see so big tlb flush lock contention, but it's possible. [ hpa: folded in fix from Eric Dumazet to use this_cpu_read() ] Signed-off-by: Shaohua Li <shaohua.li@intel.com> LKML-Reference: <1287544023.4571.8.camel@sli10-conroe.sh.intel.com> Cc: Eric Dumazet <eric.dumazet@gmail.com> Signed-off-by: H. Peter Anvin <hpa@linux.intel.com>		2010-10-20 14:44:42 -07:00
..
boot	x86, setup: move isdigit.h to ctype.h, header files on top.	2010-08-02 21:07:20 -07:00
configs	defconfig reduction	2010-08-14 22:26:53 +02:00
crypto	Merge git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux-2.6	2010-05-03 11:28:58 +08:00
ia32	Mark arguments to certain syscalls as being const	2010-08-13 16:53:13 -07:00
include/asm	x86, mm: Hold mm->page_table_lock while doing vmalloc_sync	2010-10-19 13:57:08 -07:00
kernel	mm, x86: Saving vmcore with non-lazy freeing of vmas	2010-09-17 09:11:56 +02:00
kvm	Merge branch 'kvm-updates/2.6.36' of git://git.kernel.org/pub/scm/virt/kvm/kvm	2010-08-22 11:27:36 -07:00
lguest	Merge branch 'ht-delete-2.6.35' into release	2010-05-28 16:20:35 -04:00
lib	Merge branch 'x86/urgent' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip	2010-08-13 10:35:48 -07:00
math-emu	x86, fpu: Unbreak FPU emulation	2010-05-10 13:37:16 -07:00
mm	x86: Spread tlb flush vector between nodes	2010-10-20 14:44:42 -07:00
oprofile	oprofile: add support for Intel processor model 30	2010-08-05 12:17:38 +02:00
pci	x86/PCI: use for_each_pci_dev()	2010-07-30 09:47:33 -07:00
power	update email address	2010-07-19 10:56:54 +02:00
tools	x86: Remove trailing spaces in messages	2010-02-07 17:47:51 +01:00
vdso	Merge branches 'x86-cleanups-for-linus', 'x86-vmware-for-linus', 'x86-mtrr-for-linus', 'x86-apic-for-linus', 'x86-fpu-for-linus' and 'x86-vdso-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip	2010-08-06 16:22:59 -07:00
video
xen	Merge branch 'stable/xen-swiotlb-0.8.6' of git://git.kernel.org/pub/scm/linux/kernel/git/konrad/xen	2010-08-12 09:09:41 -07:00
.gitignore	add random binaries to .gitignore	2010-04-08 11:34:34 +02:00
Kbuild
Kconfig	Replace Configure with Enable in description of MAXSMP	2010-08-21 12:38:58 -07:00
Kconfig.cpu	Merge branch 'x86-fpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip	2010-05-18 08:58:16 -07:00
Kconfig.debug	Merge branch 'x86-cleanups-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/linux-2.6-tip	2010-05-18 08:40:21 -07:00
Makefile	x86: Use .cfi_sections for assembly code	2010-05-13 22:15:18 -07:00
Makefile_32.cpu