linux_dsm_epyc7002

mirror of https://github.com/AuxXxilium/linux_dsm_epyc7002.git synced 2024-12-15 00:46:47 +07:00

Matt Fleming 6e5f32f7a4 sched/loadavg: Avoid loadavg spikes caused by delayed NO_HZ accounting

If we crossed a sample window while in NO_HZ we will add LOAD_FREQ to the pending sample window time on exit, setting the next update not one window into the future, but two.

This situation on exiting NO_HZ is described by:

    this_rq->calc_load_update < jiffies < calc_load_update

In this scenario, what we should be doing is:

    this_rq->calc_load_update = calc_load_update                [ next window ]

But what we actually do is:

    this_rq->calc_load_update = calc_load_update + LOAD_FREQ    [ next+1 window ]

This has the effect of delaying load average updates for potentially up to ~9 seconds.

This can result in huge spikes in the load average values due to per-cpu uninterruptible task counts being out of sync when accumulated across all CPUs.

It's safe to update the per-cpu active count if we wake between sample windows because any load that we left in 'calc_load_idle' will have been zero'd when the idle load was folded in calc_global_load().

This issue is easy to reproduce before commit `9d89c257df` ("sched/fair: Rewrite runnable load and utilization average tracking") just by forking short-lived process pipelines built from ps(1) and grep(1) in a loop. I'm unable to reproduce the spikes after that commit, but the bug still seems to be present from code review.

Signed-off-by: Matt Fleming <matt@codeblueprint.co.uk>
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Cc: Frederic Weisbecker <fweisbec@gmail.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Mike Galbraith <umgwanakikbuti@gmail.com>
Cc: Morten Rasmussen <morten.rasmussen@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Vincent Guittot <vincent.guittot@linaro.org>
Fixes: commit `5167e8d` ("sched/nohz: Rewrite and fix load-avg computation -- again")
Link: http://lkml.kernel.org/r/20170217120731.11868-2-matt@codeblueprint.co.uk
Signed-off-by: Ingo Molnar <mingo@kernel.org>

2017-03-16 09:21:00 +01:00
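The off-by-one-window behaviour described in the commit message can be illustrated with a small user-space model. This is only a sketch of the arithmetic, not the code in loadavg.c: the function names (`tick_before`, `crossed_window_in_nohz`, `resync_buggy`, `resync_intended`) are hypothetical, `HZ` is set to an assumed 250, and `LOAD_FREQ` uses the kernel's `5*HZ+1` definition.

```c
#include <stdbool.h>

#define HZ        250UL           /* assumed tick rate, for illustration only */
#define LOAD_FREQ (5 * HZ + 1)    /* length of one sample window, in ticks */

/* Wrap-safe "a is earlier than b" comparison on tick counters. */
static bool tick_before(unsigned long a, unsigned long b)
{
    return (long)(a - b) < 0;
}

/*
 * True for the situation from the commit message:
 *   rq_update < now < global_update
 * i.e. the CPU slept through its own pending window, but the
 * next global window has not started yet.
 */
bool crossed_window_in_nohz(unsigned long rq_update, unsigned long now,
                            unsigned long global_update)
{
    return tick_before(rq_update, now) && tick_before(now, global_update);
}

/* Behaviour described as the bug: resync to the window after next. */
unsigned long resync_buggy(unsigned long rq_update, unsigned long now,
                           unsigned long global_update)
{
    if (!crossed_window_in_nohz(rq_update, now, global_update))
        return rq_update;               /* nothing to resync in this model */
    return global_update + LOAD_FREQ;   /* [ next+1 window ] */
}

/* Behaviour described as intended: resync to the next window. */
unsigned long resync_intended(unsigned long rq_update, unsigned long now,
                              unsigned long global_update)
{
    if (!crossed_window_in_nohz(rq_update, now, global_update))
        return rq_update;               /* nothing to resync in this model */
    return global_update;               /* [ next window ] */
}
```

With these assumed values, a CPU whose pending update was at tick 1251 and which wakes at tick 2000 while the global update is at tick 2502 gets 3753 from `resync_buggy()` and 2502 from `resync_intended()`; the difference is exactly one `LOAD_FREQ` window, during which that CPU's counts stay out of sync with the other CPUs.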
autogroup.c	sched/autogroup: Rename auto_group.[ch] to autogroup.[ch]	2017-02-08 09:01:11 +01:00
autogroup.h	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/autogroup.h>	2017-03-02 08:42:28 +01:00
clock.c	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/nmi.h>	2017-03-02 08:42:30 +01:00
completion.c	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/debug.h>	2017-03-02 08:42:34 +01:00
core.c	Merge branch 'sched-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip	2017-03-07 14:42:34 -08:00
cpuacct.c	sched/cputime: Convert kcpustat to nsecs	2017-02-01 09:13:47 +01:00
cpuacct.h	sched/cpuacct: Simplify the cpuacct code	2016-03-21 11:00:28 +01:00
cpudeadline.c	sched/core: Remove the tsk_cpus_allowed() wrapper	2017-03-02 08:42:24 +01:00
cpudeadline.h	sched/deadline: Split cpudl_set() into cpudl_set() and cpudl_clear()	2016-09-05 13:29:43 +02:00
cpufreq_schedutil.c	cpufreq: schedutil: Pass sg_policy to get_next_freq()	2017-03-05 23:58:48 +01:00
cpufreq.c	cpufreq / sched: Pass flags to cpufreq_update_util()	2016-08-16 22:14:55 +02:00
cpupri.c	sched/core: Remove the tsk_cpus_allowed() wrapper	2017-03-02 08:42:24 +01:00
cpupri.h	sched/cpupri: Remove unnecessary definitions in cpupri.h	2014-11-16 10:58:59 +01:00
cputime.c	sched/headers: Prepare to move cputime functionality from <linux/sched.h> into <linux/sched/cputime.h>	2017-03-02 08:42:39 +01:00
deadline.c	sched/deadline: Add missing update_rq_clock() in dl_task_timer()	2017-03-16 09:20:59 +01:00
debug.c	sched/headers: Prepare to move the task_lock()/unlock() APIs to <linux/sched/task.h>	2017-03-02 08:42:38 +01:00
fair.c	Merge branch 'sched-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip	2017-03-07 14:42:34 -08:00
features.h	sched/fair: Make select_idle_cpu() more aggressive	2017-03-02 08:50:17 +01:00
idle_task.c	sched/core: Add wrappers for lockdep_(un)pin_lock()	2017-01-14 11:29:30 +01:00
idle.c	sched/headers: Prepare for new header dependencies before moving code to <linux/sched/idle.h>	2017-03-02 08:42:26 +01:00
loadavg.c	sched/loadavg: Avoid loadavg spikes caused by delayed NO_HZ accounting	2017-03-16 09:21:00 +01:00
Makefile	sched/autogroup: Rename auto_group.[ch] to autogroup.[ch]	2017-02-08 09:01:11 +01:00
rt.c	sched/core: Remove the tsk_nr_cpus_allowed() wrapper	2017-03-02 08:42:24 +01:00
sched.h	sched/headers: Prepare to move _init() prototypes from <linux/sched.h> to <linux/sched/init.h>	2017-03-02 08:42:40 +01:00
stats.c	sched: use %*pb[l] to print bitmaps including cpumasks and nodemasks	2015-02-13 21:21:37 -08:00
stats.h	sched/headers: Move cputime functionality from <linux/sched.h> and <linux/cputime.h> into <linux/sched/cputime.h>	2017-03-03 01:45:22 +01:00
stop_task.c	sched/core: Add wrappers for lockdep_(un)pin_lock()	2017-01-14 11:29:30 +01:00
swait.c	sched/headers: Prepare to move signal wakeup & sigpending methods from <linux/sched.h> into <linux/sched/signal.h>	2017-03-02 08:42:32 +01:00
topology.c	sched/topology: Split out scheduler topology code from core.c into topology.c	2017-02-07 10:58:12 +01:00
wait.c	sched/headers: fix up header file dependency on <linux/sched/signal.h>	2017-03-08 10:36:03 -08:00