mirror of
https://github.com/AuxXxilium/linux_dsm_epyc7002.git
synced 2024-12-23 01:32:46 +07:00
4d2e26a38f
Convert docs to ReST and add them to the arch-specific book. The conversion here was trivial, as almost every file there was already using an elegant format close to ReST standard. The changes were mostly to mark literal blocks and add a few missing section title identifiers. One note with regards to "--": on Sphinx, this can't be used to identify a list, as it will format it badly. This can be used, however, to identify a long hyphen - and "---" is an even longer one. At its new index.rst, let's add a :orphan: while this is not linked to the main index.rst file, in order to avoid build warnings. Signed-off-by: Mauro Carvalho Chehab <mchehab+samsung@kernel.org> Acked-by: Andrew Donnellan <andrew.donnellan@au1.ibm.com> # cxl
61 lines
2.7 KiB
ReStructuredText
61 lines
2.7 KiB
ReStructuredText
============
|
|
CPU Features
|
|
============
|
|
|
|
Hollis Blanchard <hollis@austin.ibm.com>
|
|
5 Jun 2002
|
|
|
|
This document describes the system (including self-modifying code) used in the
|
|
PPC Linux kernel to support a variety of PowerPC CPUs without requiring
|
|
compile-time selection.
|
|
|
|
Early in the boot process the ppc32 kernel detects the current CPU type and
|
|
chooses a set of features accordingly. Some examples include Altivec support,
|
|
split instruction and data caches, and if the CPU supports the DOZE and NAP
|
|
sleep modes.
|
|
|
|
Detection of the feature set is simple. A list of processors can be found in
|
|
arch/powerpc/kernel/cputable.c. The PVR register is masked and compared with
|
|
each value in the list. If a match is found, the cpu_features of cur_cpu_spec
|
|
is assigned to the feature bitmask for this processor and a __setup_cpu
|
|
function is called.
|
|
|
|
C code may test 'cur_cpu_spec[smp_processor_id()]->cpu_features' for a
|
|
particular feature bit. This is done in quite a few places, for example
|
|
in ppc_setup_l2cr().
|
|
|
|
Implementing cpufeatures in assembly is a little more involved. There are
|
|
several paths that are performance-critical and would suffer if an array
|
|
index, structure dereference, and conditional branch were added. To avoid the
|
|
performance penalty but still allow for runtime (rather than compile-time) CPU
|
|
selection, unused code is replaced by 'nop' instructions. This nop'ing is
|
|
based on CPU 0's capabilities, so a multi-processor system with non-identical
|
|
processors will not work (but such a system would likely have other problems
|
|
anyways).
|
|
|
|
After detecting the processor type, the kernel patches out sections of code
|
|
that shouldn't be used by writing nop's over it. Using cpufeatures requires
|
|
just 2 macros (found in arch/powerpc/include/asm/cputable.h), as seen in head.S
|
|
transfer_to_handler::
|
|
|
|
#ifdef CONFIG_ALTIVEC
|
|
BEGIN_FTR_SECTION
|
|
mfspr r22,SPRN_VRSAVE /* if G4, save vrsave register value */
|
|
stw r22,THREAD_VRSAVE(r23)
|
|
END_FTR_SECTION_IFSET(CPU_FTR_ALTIVEC)
|
|
#endif /* CONFIG_ALTIVEC */
|
|
|
|
If CPU 0 supports Altivec, the code is left untouched. If it doesn't, both
|
|
instructions are replaced with nop's.
|
|
|
|
The END_FTR_SECTION macro has two simpler variations: END_FTR_SECTION_IFSET
|
|
and END_FTR_SECTION_IFCLR. These simply test if a flag is set (in
|
|
cur_cpu_spec[0]->cpu_features) or is cleared, respectively. These two macros
|
|
should be used in the majority of cases.
|
|
|
|
The END_FTR_SECTION macros are implemented by storing information about this
|
|
code in the '__ftr_fixup' ELF section. When do_cpu_ftr_fixups
|
|
(arch/powerpc/kernel/misc.S) is invoked, it will iterate over the records in
|
|
__ftr_fixup, and if the required feature is not present it will loop writing
|
|
nop's from each BEGIN_FTR_SECTION to END_FTR_SECTION.
|