linux_dsm_epyc7002

mirror of https://github.com/AuxXxilium/linux_dsm_epyc7002.git synced 2024-12-16 09:06:51 +07:00


Peter Zijlstra 7dc603c902 sched/fair: Fix PELT integrity for new tasks

Vincent and Yuyang found another few scenarios in which entity
tracking goes wobbly.

The scenarios are basically due to the fact that new tasks are not
immediately attached and thereby differ from the normal situation -- a
task is always attached to a cfs_rq load average (such that it
includes its blocked contribution) and are explicitly
detached/attached on migration to another cfs_rq.

Scenario 1: switch to fair class

  p->sched_class = fair_class;
  if (queued)
    enqueue_task(p);
      ...
        enqueue_entity()
          enqueue_entity_load_avg()
            migrated = !sa->last_update_time (true)
            if (migrated)
              attach_entity_load_avg()
  check_class_changed()
    switched_from() (!fair)
    switched_to()   (fair)
      switched_to_fair()
        attach_entity_load_avg()

If @p is a new task that hasn't been fair before, it will have
!last_update_time and, per the above, end up in
attach_entity_load_avg() _twice_.

Scenario 2: change between cgroups

  sched_move_group(p)
    if (queued)
      dequeue_task()
    task_move_group_fair()
      detach_task_cfs_rq()
        detach_entity_load_avg()
      set_task_rq()
      attach_task_cfs_rq()
        attach_entity_load_avg()
    if (queued)
      enqueue_task();
        ...
          enqueue_entity()
            enqueue_entity_load_avg()
              migrated = !sa->last_update_time (true)
              if (migrated)
                attach_entity_load_avg()

Similar as with scenario 1, if @p is a new task, it will have
!load_update_time and we'll end up in attach_entity_load_avg()
_twice_.

Furthermore, notice how we do a detach_entity_load_avg() on something
that wasn't attached to begin with.

As stated above; the problem is that the new task isn't yet attached
to the load tracking and thereby violates the invariant assumption.

This patch remedies this by ensuring a new task is indeed properly
attached to the load tracking on creation, through
post_init_entity_util_avg().

Of course, this isn't entirely as straightforward as one might think,
since the task is hashed before we call wake_up_new_task() and thus
can be poked at. We avoid this by adding TASK_NEW and teaching
cpu_cgroup_can_attach() to refuse such tasks.

Reported-by: Yuyang Du <yuyang.du@intel.com>
Reported-by: Vincent Guittot <vincent.guittot@linaro.org>
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Mike Galbraith <efault@gmx.de>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: linux-kernel@vger.kernel.org
Signed-off-by: Ingo Molnar <mingo@kernel.org>

2016-06-27 12:17:53 +02:00
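The two failure modes in the commit message can be reproduced in a small userspace model. This is only a sketch under stated assumptions: `toy_rq`, `toy_se` and every `toy_*` function are hypothetical stand-ins invented for illustration, not the kernel's `struct cfs_rq`, `struct sched_avg` or the real attach/detach helpers, and the load values are arbitrary. The model keeps just two ideas from the message: a zero `last_update_time` is read as "migrated" on enqueue, and detach does not check whether the entity was ever attached.

```c
#include <stdbool.h>
#include <stdint.h>

/* Toy stand-ins; these are NOT the kernel's struct cfs_rq / sched_avg. */
struct toy_rq {
	long load_avg;  /* sum of the load of every attached entity */
	uint64_t now;   /* current (nonzero) timestamp of this queue */
};

struct toy_se {
	long load_avg;              /* this entity's contribution */
	uint64_t last_update_time;  /* 0 means "never attached" */
};

static void toy_attach(struct toy_rq *rq, struct toy_se *se)
{
	se->last_update_time = rq->now;
	rq->load_avg += se->load_avg;
}

static void toy_detach(struct toy_rq *rq, struct toy_se *se)
{
	/* No check that se was ever attached, as in the reported problem. */
	rq->load_avg -= se->load_avg;
}

/* Enqueue path from the traces: a zero timestamp is read as "migrated". */
static void toy_enqueue(struct toy_rq *rq, struct toy_se *se)
{
	bool migrated = !se->last_update_time;

	if (migrated)
		toy_attach(rq, se);
}

/* Scenario 1: a queued new task switches to the fair class. */
static long toy_scenario1(void)
{
	struct toy_rq rq = { .load_avg = 0, .now = 1000 };
	struct toy_se p = { .load_avg = 50, .last_update_time = 0 };

	toy_enqueue(&rq, &p);  /* first attach, via the "migrated" test */
	toy_attach(&rq, &p);   /* second attach, as switched_to_fair() does */
	return rq.load_avg;    /* the task's load is now counted twice */
}

/*
 * Scenario 2, detach half: move a new task away from a queue that also
 * holds one other task with load 100. Returns the load left behind.
 */
static long toy_scenario2(bool attached_at_creation)
{
	struct toy_rq old_rq = { .load_avg = 100, .now = 1000 };
	struct toy_rq new_rq = { .load_avg = 0, .now = 1000 };
	struct toy_se p = { .load_avg = 50, .last_update_time = 0 };

	if (attached_at_creation)
		toy_attach(&old_rq, &p);  /* models the remedy in the message */

	toy_detach(&old_rq, &p);
	toy_attach(&new_rq, &p);
	return old_rq.load_avg;
}
```

In this model `toy_scenario1()` returns 100 for a task whose load is 50, which is the double attach. `toy_scenario2(false)` returns 50 even though the remaining task still contributes 100, because the detach removed load that was never added. `toy_scenario2(true)` returns 100, since attaching at creation keeps every detach paired with an earlier attach. The real patch also has to handle tasks that are not in the fair class at creation, which this sketch does not model.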
auto_group.c	sched/core: Move the sched_to_prio[] arrays out of line	2015-12-04 10:34:46 +01:00
auto_group.h	sched, timer: Convert usages of ACCESS_ONCE() in the scheduler to READ_ONCE()/WRITE_ONCE()	2015-05-08 12:11:32 +02:00
clock.c	sched/clock: Make local_clock()/cpu_clock() inline	2016-04-13 12:25:22 +02:00
completion.c	sched/completion: Serialize completion_done() with complete()	2015-02-18 14:27:40 +01:00
core.c	sched/fair: Fix PELT integrity for new tasks	2016-06-27 12:17:53 +02:00
cpuacct.c	sched/cpuacct: Check for NULL when using task_pt_regs()	2016-04-13 13:22:37 +02:00
cpuacct.h	sched/cpuacct: Simplify the cpuacct code	2016-03-21 11:00:28 +01:00
cpudeadline.c	sched/core: Use tsk_cpus_allowed() instead of accessing ->cpus_allowed	2016-05-12 09:55:35 +02:00
cpudeadline.h	sched/deadline: Unify dl_time_before() usage	2015-09-23 09:51:25 +02:00
cpufreq_schedutil.c	cpufreq: schedutil: Improve prints messages with pr_fmt	2016-05-19 01:02:52 +02:00
cpufreq.c	cpufreq: sched: Helpers to add and remove update_util hooks	2016-04-02 01:08:43 +02:00
cpupri.c	sched/core: Use tsk_cpus_allowed() instead of accessing ->cpus_allowed	2016-05-12 09:55:35 +02:00
cpupri.h	sched/cpupri: Remove unnecessary definitions in cpupri.h	2014-11-16 10:58:59 +01:00
cputime.c	sched/cputime: Add steal time support to full dynticks CPU time accounting	2016-06-14 11:13:16 +02:00
deadline.c	sched/core: Provide a tsk_nr_cpus_allowed() helper	2016-05-12 09:55:36 +02:00
debug.c	sched/debug: Always show 'nr_migrations'	2016-06-08 14:34:49 +02:00
fair.c	sched/fair: Fix PELT integrity for new tasks	2016-06-27 12:17:53 +02:00
features.h	sched/fair: Convert arch_scale_cpu_capacity() from weak function to #define	2015-09-13 09:52:55 +02:00
idle_task.c	locking/lockdep, sched/core: Implement a better lock pinning scheme	2016-05-05 09:23:59 +02:00
idle.c	Merge branch 'sched/urgent' into sched/core, to pick up fixes	2016-06-14 11:04:13 +02:00
loadavg.c	sched/loadavg: Fix loadavg artifacts on fully idle and on fully loaded systems	2016-05-12 09:55:34 +02:00
Makefile	cpufreq: schedutil: New governor based on scheduler utilization data	2016-04-02 01:09:12 +02:00
rt.c	sched/core: Provide a tsk_nr_cpus_allowed() helper	2016-05-12 09:55:36 +02:00
sched.h	sched/cgroup: Fix cpu_cgroup_fork() handling	2016-06-27 12:17:52 +02:00
stats.c	sched: use %*pb[l] to print bitmaps including cpumasks and nodemasks	2015-02-13 21:21:37 -08:00
stats.h	sched/debug: Fix /proc/sched_debug regression	2016-06-08 14:31:58 +02:00
stop_task.c	locking/lockdep, sched/core: Implement a better lock pinning scheme	2016-05-05 09:23:59 +02:00
swait.c	wait.[ch]: Introduce the simple waitqueue (swait) implementation	2016-02-25 11:27:16 +01:00
wait.c	sched/wait: Fix the signal handling fix	2015-12-13 14:30:59 -08:00