License cleanup: add SPDX GPL-2.0 license identifier to files with no license
Many source files in the tree are missing licensing information, which
makes it harder for compliance tools to determine the correct license.
By default all files without license information are under the default
license of the kernel, which is GPL version 2.
Update the files which contain no license information with the 'GPL-2.0'
SPDX license identifier. The SPDX identifier is a legally binding
shorthand, which can be used instead of the full boiler plate text.
This patch is based on work done by Thomas Gleixner and Kate Stewart and
Philippe Ombredanne.
How this work was done:
Patches were generated and checked against linux-4.14-rc6 for a subset of
the use cases:
- file had no licensing information it it.
- file was a */uapi/* one with no licensing information in it,
- file was a */uapi/* one with existing licensing information,
Further patches will be generated in subsequent months to fix up cases
where non-standard license headers were used, and references to license
had to be inferred by heuristics based on keywords.
The analysis to determine which SPDX License Identifier to be applied to
a file was done in a spreadsheet of side by side results from of the
output of two independent scanners (ScanCode & Windriver) producing SPDX
tag:value files created by Philippe Ombredanne. Philippe prepared the
base worksheet, and did an initial spot review of a few 1000 files.
The 4.13 kernel was the starting point of the analysis with 60,537 files
assessed. Kate Stewart did a file by file comparison of the scanner
results in the spreadsheet to determine which SPDX license identifier(s)
to be applied to the file. She confirmed any determination that was not
immediately clear with lawyers working with the Linux Foundation.
Criteria used to select files for SPDX license identifier tagging was:
- Files considered eligible had to be source code files.
- Make and config files were included as candidates if they contained >5
lines of source
- File already had some variant of a license header in it (even if <5
lines).
All documentation files were explicitly excluded.
The following heuristics were used to determine which SPDX license
identifiers to apply.
- when both scanners couldn't find any license traces, file was
considered to have no license information in it, and the top level
COPYING file license applied.
For non */uapi/* files that summary was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 11139
and resulted in the first patch in this series.
If that file was a */uapi/* path one, it was "GPL-2.0 WITH
Linux-syscall-note" otherwise it was "GPL-2.0". Results of that was:
SPDX license identifier # files
---------------------------------------------------|-------
GPL-2.0 WITH Linux-syscall-note 930
and resulted in the second patch in this series.
- if a file had some form of licensing information in it, and was one
of the */uapi/* ones, it was denoted with the Linux-syscall-note if
any GPL family license was found in the file or had no licensing in
it (per prior point). Results summary:
SPDX license identifier # files
---------------------------------------------------|------
GPL-2.0 WITH Linux-syscall-note 270
GPL-2.0+ WITH Linux-syscall-note 169
((GPL-2.0 WITH Linux-syscall-note) OR BSD-2-Clause) 21
((GPL-2.0 WITH Linux-syscall-note) OR BSD-3-Clause) 17
LGPL-2.1+ WITH Linux-syscall-note 15
GPL-1.0+ WITH Linux-syscall-note 14
((GPL-2.0+ WITH Linux-syscall-note) OR BSD-3-Clause) 5
LGPL-2.0+ WITH Linux-syscall-note 4
LGPL-2.1 WITH Linux-syscall-note 3
((GPL-2.0 WITH Linux-syscall-note) OR MIT) 3
((GPL-2.0 WITH Linux-syscall-note) AND MIT) 1
and that resulted in the third patch in this series.
- when the two scanners agreed on the detected license(s), that became
the concluded license(s).
- when there was disagreement between the two scanners (one detected a
license but the other didn't, or they both detected different
licenses) a manual inspection of the file occurred.
- In most cases a manual inspection of the information in the file
resulted in a clear resolution of the license that should apply (and
which scanner probably needed to revisit its heuristics).
- When it was not immediately clear, the license identifier was
confirmed with lawyers working with the Linux Foundation.
- If there was any question as to the appropriate license identifier,
the file was flagged for further research and to be revisited later
in time.
In total, over 70 hours of logged manual review was done on the
spreadsheet to determine the SPDX license identifiers to apply to the
source files by Kate, Philippe, Thomas and, in some cases, confirmation
by lawyers working with the Linux Foundation.
Kate also obtained a third independent scan of the 4.13 code base from
FOSSology, and compared selected files where the other two scanners
disagreed against that SPDX file, to see if there was new insights. The
Windriver scanner is based on an older version of FOSSology in part, so
they are related.
Thomas did random spot checks in about 500 files from the spreadsheets
for the uapi headers and agreed with SPDX license identifier in the
files he inspected. For the non-uapi files Thomas did random spot checks
in about 15000 files.
In initial set of patches against 4.14-rc6, 3 files were found to have
copy/paste license identifier errors, and have been fixed to reflect the
correct identifier.
Additionally Philippe spent 10 hours this week doing a detailed manual
inspection and review of the 12,461 patched files from the initial patch
version early this week with:
- a full scancode scan run, collecting the matched texts, detected
license ids and scores
- reviewing anything where there was a license detected (about 500+
files) to ensure that the applied SPDX license was correct
- reviewing anything where there was no detection but the patch license
was not GPL-2.0 WITH Linux-syscall-note to ensure that the applied
SPDX license was correct
This produced a worksheet with 20 files needing minor correction. This
worksheet was then exported into 3 different .csv files for the
different types of files to be modified.
These .csv files were then reviewed by Greg. Thomas wrote a script to
parse the csv files and add the proper SPDX tag to the file, in the
format that the file expected. This script was further refined by Greg
based on the output to detect more types of files automatically and to
distinguish between header and source .c files (which need different
comment types.) Finally Greg ran the script using the .csv files to
generate the patches.
Reviewed-by: Kate Stewart <kstewart@linuxfoundation.org>
Reviewed-by: Philippe Ombredanne <pombredanne@nexb.com>
Reviewed-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
2017-11-01 21:07:57 +07:00
|
|
|
// SPDX-License-Identifier: GPL-2.0
|
2008-05-15 10:49:44 +07:00
|
|
|
/*
|
|
|
|
* Code for replacing ftrace calls with jumps.
|
|
|
|
*
|
|
|
|
* Copyright (C) 2007-2008 Steven Rostedt <srostedt@redhat.com>
|
|
|
|
*
|
|
|
|
* Thanks goes out to P.A. Semi, Inc for supplying me with a PPC64 box.
|
|
|
|
*
|
2009-02-10 12:10:27 +07:00
|
|
|
* Added function graph tracer code, taken from x86 that was written
|
|
|
|
* by Frederic Weisbecker, and ported to PPC by Steven Rostedt.
|
|
|
|
*
|
2008-05-15 10:49:44 +07:00
|
|
|
*/
|
|
|
|
|
2014-06-17 13:15:36 +07:00
|
|
|
#define pr_fmt(fmt) "ftrace-powerpc: " fmt
|
|
|
|
|
2008-05-15 10:49:44 +07:00
|
|
|
#include <linux/spinlock.h>
|
|
|
|
#include <linux/hardirq.h>
|
2008-11-15 07:21:20 +07:00
|
|
|
#include <linux/uaccess.h>
|
2008-11-15 11:47:03 +07:00
|
|
|
#include <linux/module.h>
|
2008-05-15 10:49:44 +07:00
|
|
|
#include <linux/ftrace.h>
|
|
|
|
#include <linux/percpu.h>
|
|
|
|
#include <linux/init.h>
|
|
|
|
#include <linux/list.h>
|
|
|
|
|
2017-03-06 15:49:46 +07:00
|
|
|
#include <asm/asm-prototypes.h>
|
2008-05-15 10:49:44 +07:00
|
|
|
#include <asm/cacheflush.h>
|
2008-11-15 11:47:03 +07:00
|
|
|
#include <asm/code-patching.h>
|
2008-06-22 01:17:27 +07:00
|
|
|
#include <asm/ftrace.h>
|
2011-02-03 00:27:24 +07:00
|
|
|
#include <asm/syscall.h>
|
2008-05-15 10:49:44 +07:00
|
|
|
|
|
|
|
|
2009-02-10 12:10:27 +07:00
|
|
|
#ifdef CONFIG_DYNAMIC_FTRACE
|
powerpc/ftrace: Handle large kernel configs
Currently, we expect to be able to reach ftrace_caller() from all
ftrace-enabled functions through a single relative branch. With large
kernel configs, we see functions outside of 32MB of ftrace_caller()
causing ftrace_init() to bail.
In such configurations, gcc/ld emits two types of trampolines for mcount():
1. A long_branch, which has a single branch to mcount() for functions that
are one hop away from mcount():
c0000000019e8544 <00031b56.long_branch._mcount>:
c0000000019e8544: 4a 69 3f ac b c00000000007c4f0 <._mcount>
2. A plt_branch, for functions that are farther away from mcount():
c0000000051f33f8 <0008ba04.plt_branch._mcount>:
c0000000051f33f8: 3d 82 ff a4 addis r12,r2,-92
c0000000051f33fc: e9 8c 04 20 ld r12,1056(r12)
c0000000051f3400: 7d 89 03 a6 mtctr r12
c0000000051f3404: 4e 80 04 20 bctr
We can reuse those trampolines for ftrace if we can have those
trampolines go to ftrace_caller() instead. However, with ABIv2, we
cannot depend on r2 being valid. As such, we use only the long_branch
trampolines by patching those to instead branch to ftrace_caller or
ftrace_regs_caller.
In addition, we add additional trampolines around .text and .init.text
to catch locations that are covered by the plt branches. This allows
ftrace to work with most large kernel configurations.
For now, we always patch the trampolines to go to ftrace_regs_caller,
which is slightly inefficient. This can be optimized further at a later
point.
Signed-off-by: Naveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2018-10-17 03:25:00 +07:00
|
|
|
|
|
|
|
/*
|
|
|
|
* We generally only have a single long_branch tramp and at most 2 or 3 plt
|
|
|
|
* tramps generated. But, we don't use the plt tramps currently. We also allot
|
|
|
|
* 2 tramps after .text and .init.text. So, we only end up with around 3 usable
|
|
|
|
* tramps in total. Set aside 8 just to be sure.
|
|
|
|
*/
|
|
|
|
#define NUM_FTRACE_TRAMPS 8
|
|
|
|
static unsigned long ftrace_tramps[NUM_FTRACE_TRAMPS];
|
|
|
|
|
2009-02-13 21:31:39 +07:00
|
|
|
static unsigned int
|
2009-02-11 13:19:54 +07:00
|
|
|
ftrace_call_replace(unsigned long ip, unsigned long addr, int link)
|
2008-05-15 10:49:44 +07:00
|
|
|
{
|
2009-02-13 21:31:39 +07:00
|
|
|
unsigned int op;
|
2008-05-15 10:49:44 +07:00
|
|
|
|
2009-05-29 02:33:34 +07:00
|
|
|
addr = ppc_function_entry((void *)addr);
|
2008-05-15 10:49:44 +07:00
|
|
|
|
2009-02-11 13:19:54 +07:00
|
|
|
/* if (link) set op to 'bl' else 'b' */
|
2009-02-13 21:45:27 +07:00
|
|
|
op = create_branch((unsigned int *)ip, addr, link ? 1 : 0);
|
2008-05-15 10:49:44 +07:00
|
|
|
|
2009-02-13 21:31:39 +07:00
|
|
|
return op;
|
2008-05-15 10:49:44 +07:00
|
|
|
}
|
|
|
|
|
2008-11-15 07:21:19 +07:00
|
|
|
static int
|
2009-02-13 21:31:39 +07:00
|
|
|
ftrace_modify_code(unsigned long ip, unsigned int old, unsigned int new)
|
2008-05-15 10:49:44 +07:00
|
|
|
{
|
2009-02-13 21:31:39 +07:00
|
|
|
unsigned int replaced;
|
2008-05-15 10:49:44 +07:00
|
|
|
|
|
|
|
/*
|
2015-12-06 09:02:56 +07:00
|
|
|
* Note:
|
|
|
|
* We are paranoid about modifying text, as if a bug was to happen, it
|
|
|
|
* could cause us to read or write to someplace that could cause harm.
|
|
|
|
* Carefully read and modify the code with probe_kernel_*(), and make
|
|
|
|
* sure what we read is what we expected it to be before modifying it.
|
2008-05-15 10:49:44 +07:00
|
|
|
*/
|
2008-11-15 07:21:20 +07:00
|
|
|
|
|
|
|
/* read the text we want to modify */
|
2009-02-13 21:31:39 +07:00
|
|
|
if (probe_kernel_read(&replaced, (void *)ip, MCOUNT_INSN_SIZE))
|
2008-11-15 07:21:20 +07:00
|
|
|
return -EFAULT;
|
|
|
|
|
|
|
|
/* Make sure it is what we expect it to be */
|
2016-03-03 11:26:59 +07:00
|
|
|
if (replaced != old) {
|
|
|
|
pr_err("%p: replaced (%#x) != old (%#x)",
|
|
|
|
(void *)ip, replaced, old);
|
2008-11-15 07:21:20 +07:00
|
|
|
return -EINVAL;
|
2016-03-03 11:26:59 +07:00
|
|
|
}
|
2008-11-15 07:21:20 +07:00
|
|
|
|
|
|
|
/* replace the text with the new text */
|
2012-04-26 15:31:19 +07:00
|
|
|
if (patch_instruction((unsigned int *)ip, new))
|
2008-11-15 07:21:20 +07:00
|
|
|
return -EPERM;
|
|
|
|
|
|
|
|
return 0;
|
2008-05-15 10:49:44 +07:00
|
|
|
}
|
|
|
|
|
2008-11-15 11:47:03 +07:00
|
|
|
/*
|
|
|
|
* Helper functions that are the same for both PPC64 and PPC32.
|
|
|
|
*/
|
2008-11-15 07:21:19 +07:00
|
|
|
static int test_24bit_addr(unsigned long ip, unsigned long addr)
|
|
|
|
{
|
2014-02-26 09:23:01 +07:00
|
|
|
addr = ppc_function_entry((void *)addr);
|
2008-11-15 07:21:19 +07:00
|
|
|
|
2008-11-26 05:06:19 +07:00
|
|
|
/* use the create_branch to verify that this offset can be branched */
|
|
|
|
return create_branch((unsigned int *)ip, addr, 0);
|
2008-11-15 07:21:19 +07:00
|
|
|
}
|
|
|
|
|
2008-11-15 11:47:03 +07:00
|
|
|
static int is_bl_op(unsigned int op)
|
|
|
|
{
|
|
|
|
return (op & 0xfc000003) == 0x48000001;
|
|
|
|
}
|
|
|
|
|
powerpc/ftrace: Handle large kernel configs
Currently, we expect to be able to reach ftrace_caller() from all
ftrace-enabled functions through a single relative branch. With large
kernel configs, we see functions outside of 32MB of ftrace_caller()
causing ftrace_init() to bail.
In such configurations, gcc/ld emits two types of trampolines for mcount():
1. A long_branch, which has a single branch to mcount() for functions that
are one hop away from mcount():
c0000000019e8544 <00031b56.long_branch._mcount>:
c0000000019e8544: 4a 69 3f ac b c00000000007c4f0 <._mcount>
2. A plt_branch, for functions that are farther away from mcount():
c0000000051f33f8 <0008ba04.plt_branch._mcount>:
c0000000051f33f8: 3d 82 ff a4 addis r12,r2,-92
c0000000051f33fc: e9 8c 04 20 ld r12,1056(r12)
c0000000051f3400: 7d 89 03 a6 mtctr r12
c0000000051f3404: 4e 80 04 20 bctr
We can reuse those trampolines for ftrace if we can have those
trampolines go to ftrace_caller() instead. However, with ABIv2, we
cannot depend on r2 being valid. As such, we use only the long_branch
trampolines by patching those to instead branch to ftrace_caller or
ftrace_regs_caller.
In addition, we add additional trampolines around .text and .init.text
to catch locations that are covered by the plt branches. This allows
ftrace to work with most large kernel configurations.
For now, we always patch the trampolines to go to ftrace_regs_caller,
which is slightly inefficient. This can be optimized further at a later
point.
Signed-off-by: Naveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2018-10-17 03:25:00 +07:00
|
|
|
static int is_b_op(unsigned int op)
|
|
|
|
{
|
|
|
|
return (op & 0xfc000003) == 0x48000000;
|
|
|
|
}
|
|
|
|
|
2008-11-15 11:47:03 +07:00
|
|
|
static unsigned long find_bl_target(unsigned long ip, unsigned int op)
|
|
|
|
{
|
2018-11-13 20:56:56 +07:00
|
|
|
int offset;
|
2008-11-15 11:47:03 +07:00
|
|
|
|
|
|
|
offset = (op & 0x03fffffc);
|
|
|
|
/* make it signed */
|
|
|
|
if (offset & 0x02000000)
|
|
|
|
offset |= 0xfe000000;
|
|
|
|
|
|
|
|
return ip + (long)offset;
|
|
|
|
}
|
|
|
|
|
powerpc/ftrace: Handle large kernel configs
Currently, we expect to be able to reach ftrace_caller() from all
ftrace-enabled functions through a single relative branch. With large
kernel configs, we see functions outside of 32MB of ftrace_caller()
causing ftrace_init() to bail.
In such configurations, gcc/ld emits two types of trampolines for mcount():
1. A long_branch, which has a single branch to mcount() for functions that
are one hop away from mcount():
c0000000019e8544 <00031b56.long_branch._mcount>:
c0000000019e8544: 4a 69 3f ac b c00000000007c4f0 <._mcount>
2. A plt_branch, for functions that are farther away from mcount():
c0000000051f33f8 <0008ba04.plt_branch._mcount>:
c0000000051f33f8: 3d 82 ff a4 addis r12,r2,-92
c0000000051f33fc: e9 8c 04 20 ld r12,1056(r12)
c0000000051f3400: 7d 89 03 a6 mtctr r12
c0000000051f3404: 4e 80 04 20 bctr
We can reuse those trampolines for ftrace if we can have those
trampolines go to ftrace_caller() instead. However, with ABIv2, we
cannot depend on r2 being valid. As such, we use only the long_branch
trampolines by patching those to instead branch to ftrace_caller or
ftrace_regs_caller.
In addition, we add additional trampolines around .text and .init.text
to catch locations that are covered by the plt branches. This allows
ftrace to work with most large kernel configurations.
For now, we always patch the trampolines to go to ftrace_regs_caller,
which is slightly inefficient. This can be optimized further at a later
point.
Signed-off-by: Naveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2018-10-17 03:25:00 +07:00
|
|
|
#ifdef CONFIG_MODULES
|
2008-11-15 11:47:03 +07:00
|
|
|
#ifdef CONFIG_PPC64
|
|
|
|
static int
|
|
|
|
__ftrace_make_nop(struct module *mod,
|
|
|
|
struct dyn_ftrace *rec, unsigned long addr)
|
|
|
|
{
|
powerpc/module: Mark module stubs with a magic value
When a module is loaded, calls out to the kernel go via a stub which is
generated at runtime. One of these stubs is used to call _mcount(),
which is the default target of tracing calls generated by the compiler
with -pg.
If dynamic ftrace is enabled (which it typically is), another stub is
used to call ftrace_caller(), which is the target of tracing calls when
ftrace is actually active.
ftrace then wants to disable the calls to _mcount() at module startup,
and enable/disable the calls to ftrace_caller() when enabling/disabling
tracing - all of these it does by patching the code.
As part of that code patching, the ftrace code wants to confirm that the
branch it is about to modify, is in fact a call to a module stub which
calls _mcount() or ftrace_caller().
Currently it does that by inspecting the instructions and confirming
they are what it expects. Although that works, the code to do it is
pretty intricate because it requires lots of knowledge about the exact
format of the stub.
We can make that process easier by marking the generated stubs with a
magic value, and then looking for that magic value. Altough this is not
as rigorous as the current method, I believe it is sufficient in
practice.
Reviewed-by: Balbir Singh <bsingharora@gmail.com>
Reviewed-by: Torsten Duwe <duwe@suse.de>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2016-03-03 11:26:55 +07:00
|
|
|
unsigned long entry, ptr, tramp;
|
2008-11-15 11:47:03 +07:00
|
|
|
unsigned long ip = rec->ip;
|
2016-03-03 11:26:59 +07:00
|
|
|
unsigned int op, pop;
|
2008-11-15 11:47:03 +07:00
|
|
|
|
|
|
|
/* read where this goes */
|
2016-03-03 11:26:59 +07:00
|
|
|
if (probe_kernel_read(&op, (void *)ip, sizeof(int))) {
|
|
|
|
pr_err("Fetching opcode failed.\n");
|
2008-11-15 11:47:03 +07:00
|
|
|
return -EFAULT;
|
2016-03-03 11:26:59 +07:00
|
|
|
}
|
2008-11-15 11:47:03 +07:00
|
|
|
|
|
|
|
/* Make sure that that this is still a 24bit jump */
|
2008-11-25 21:39:18 +07:00
|
|
|
if (!is_bl_op(op)) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("Not expected bl: opcode is %x\n", op);
|
2008-11-15 11:47:03 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* lets find where the pointer goes */
|
powerpc/module: Mark module stubs with a magic value
When a module is loaded, calls out to the kernel go via a stub which is
generated at runtime. One of these stubs is used to call _mcount(),
which is the default target of tracing calls generated by the compiler
with -pg.
If dynamic ftrace is enabled (which it typically is), another stub is
used to call ftrace_caller(), which is the target of tracing calls when
ftrace is actually active.
ftrace then wants to disable the calls to _mcount() at module startup,
and enable/disable the calls to ftrace_caller() when enabling/disabling
tracing - all of these it does by patching the code.
As part of that code patching, the ftrace code wants to confirm that the
branch it is about to modify, is in fact a call to a module stub which
calls _mcount() or ftrace_caller().
Currently it does that by inspecting the instructions and confirming
they are what it expects. Although that works, the code to do it is
pretty intricate because it requires lots of knowledge about the exact
format of the stub.
We can make that process easier by marking the generated stubs with a
magic value, and then looking for that magic value. Altough this is not
as rigorous as the current method, I believe it is sufficient in
practice.
Reviewed-by: Balbir Singh <bsingharora@gmail.com>
Reviewed-by: Torsten Duwe <duwe@suse.de>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2016-03-03 11:26:55 +07:00
|
|
|
tramp = find_bl_target(ip, op);
|
2008-11-15 11:47:03 +07:00
|
|
|
|
powerpc/module: Mark module stubs with a magic value
When a module is loaded, calls out to the kernel go via a stub which is
generated at runtime. One of these stubs is used to call _mcount(),
which is the default target of tracing calls generated by the compiler
with -pg.
If dynamic ftrace is enabled (which it typically is), another stub is
used to call ftrace_caller(), which is the target of tracing calls when
ftrace is actually active.
ftrace then wants to disable the calls to _mcount() at module startup,
and enable/disable the calls to ftrace_caller() when enabling/disabling
tracing - all of these it does by patching the code.
As part of that code patching, the ftrace code wants to confirm that the
branch it is about to modify, is in fact a call to a module stub which
calls _mcount() or ftrace_caller().
Currently it does that by inspecting the instructions and confirming
they are what it expects. Although that works, the code to do it is
pretty intricate because it requires lots of knowledge about the exact
format of the stub.
We can make that process easier by marking the generated stubs with a
magic value, and then looking for that magic value. Altough this is not
as rigorous as the current method, I believe it is sufficient in
practice.
Reviewed-by: Balbir Singh <bsingharora@gmail.com>
Reviewed-by: Torsten Duwe <duwe@suse.de>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2016-03-03 11:26:55 +07:00
|
|
|
pr_devel("ip:%lx jumps to %lx", ip, tramp);
|
2008-11-15 11:47:03 +07:00
|
|
|
|
2014-04-04 12:52:58 +07:00
|
|
|
if (module_trampoline_target(mod, tramp, &ptr)) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("Failed to get trampoline target\n");
|
2008-11-15 11:47:03 +07:00
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
|
2014-04-04 12:52:58 +07:00
|
|
|
pr_devel("trampoline target %lx", ptr);
|
2008-11-15 11:47:03 +07:00
|
|
|
|
2014-06-17 13:15:35 +07:00
|
|
|
entry = ppc_global_function_entry((void *)addr);
|
2008-11-15 11:47:03 +07:00
|
|
|
/* This should match what was called */
|
2014-06-17 13:15:35 +07:00
|
|
|
if (ptr != entry) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("addr %lx does not match expected %lx\n", ptr, entry);
|
2008-11-15 11:47:03 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2018-05-30 19:19:22 +07:00
|
|
|
#ifdef CONFIG_MPROFILE_KERNEL
|
powerpc/ftrace: Separate the heuristics for checking call sites
In __ftrace_make_nop() (the 64-bit version), we have code to deal with
two ftrace ABIs. There is the original ABI, which looks mostly like a
function call, and then the mprofile-kernel ABI which is just a branch.
The code tries to handle both cases, by looking for the presence of a
load to restore the TOC pointer (PPC_INST_LD_TOC). If we detect the TOC
load, we assume the call site is for an mcount() call using the old ABI.
That means we patch the mcount() call with a b +8, to branch over the
TOC load.
However if the kernel was built with mprofile-kernel, then there will
never be a call site using the original ftrace ABI. If for some reason
we do see a TOC load, then it's there for a good reason, and we should
not jump over it.
So split the code, using the existing CC_USING_MPROFILE_KERNEL. Kernels
built with mprofile-kernel will only look for, and expect, the new ABI,
and similarly for the original ABI.
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2016-07-19 11:48:30 +07:00
|
|
|
/* When using -mkernel_profile there is no load to jump over */
|
|
|
|
pop = PPC_INST_NOP;
|
|
|
|
|
|
|
|
if (probe_kernel_read(&op, (void *)(ip - 4), 4)) {
|
|
|
|
pr_err("Fetching instruction at %lx failed.\n", ip - 4);
|
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* We expect either a mflr r0, or a std r0, LRSAVE(r1) */
|
|
|
|
if (op != PPC_INST_MFLR && op != PPC_INST_STD_LR) {
|
|
|
|
pr_err("Unexpected instruction %08x around bl _mcount\n", op);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
#else
|
2008-11-15 11:47:03 +07:00
|
|
|
/*
|
2014-04-04 12:52:58 +07:00
|
|
|
* Our original call site looks like:
|
|
|
|
*
|
|
|
|
* bl <tramp>
|
|
|
|
* ld r2,XX(r1)
|
|
|
|
*
|
|
|
|
* Milton Miller pointed out that we can not simply nop the branch.
|
|
|
|
* If a task was preempted when calling a trace function, the nops
|
|
|
|
* will remove the way to restore the TOC in r2 and the r2 TOC will
|
|
|
|
* get corrupted.
|
|
|
|
*
|
|
|
|
* Use a b +8 to jump over the load.
|
2008-11-15 11:47:03 +07:00
|
|
|
*/
|
|
|
|
|
2016-03-03 11:26:59 +07:00
|
|
|
pop = PPC_INST_BRANCH | 8; /* b +8 */
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Check what is in the next instruction. We can see ld r2,40(r1), but
|
|
|
|
* on first pass after boot we will see mflr r0.
|
|
|
|
*/
|
|
|
|
if (probe_kernel_read(&op, (void *)(ip+4), MCOUNT_INSN_SIZE)) {
|
|
|
|
pr_err("Fetching op failed.\n");
|
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (op != PPC_INST_LD_TOC) {
|
powerpc/ftrace: Separate the heuristics for checking call sites
In __ftrace_make_nop() (the 64-bit version), we have code to deal with
two ftrace ABIs. There is the original ABI, which looks mostly like a
function call, and then the mprofile-kernel ABI which is just a branch.
The code tries to handle both cases, by looking for the presence of a
load to restore the TOC pointer (PPC_INST_LD_TOC). If we detect the TOC
load, we assume the call site is for an mcount() call using the old ABI.
That means we patch the mcount() call with a b +8, to branch over the
TOC load.
However if the kernel was built with mprofile-kernel, then there will
never be a call site using the original ftrace ABI. If for some reason
we do see a TOC load, then it's there for a good reason, and we should
not jump over it.
So split the code, using the existing CC_USING_MPROFILE_KERNEL. Kernels
built with mprofile-kernel will only look for, and expect, the new ABI,
and similarly for the original ABI.
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2016-07-19 11:48:30 +07:00
|
|
|
pr_err("Expected %08x found %08x\n", PPC_INST_LD_TOC, op);
|
|
|
|
return -EINVAL;
|
2016-03-03 11:26:59 +07:00
|
|
|
}
|
2018-05-30 19:19:22 +07:00
|
|
|
#endif /* CONFIG_MPROFILE_KERNEL */
|
2016-03-03 11:26:59 +07:00
|
|
|
|
|
|
|
if (patch_instruction((unsigned int *)ip, pop)) {
|
|
|
|
pr_err("Patching NOP failed.\n");
|
2008-11-15 11:47:03 +07:00
|
|
|
return -EPERM;
|
2016-03-03 11:26:59 +07:00
|
|
|
}
|
2008-11-15 11:47:03 +07:00
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
#else /* !PPC64 */
|
|
|
|
static int
|
|
|
|
__ftrace_make_nop(struct module *mod,
|
|
|
|
struct dyn_ftrace *rec, unsigned long addr)
|
|
|
|
{
|
2008-11-25 21:39:18 +07:00
|
|
|
unsigned int op;
|
|
|
|
unsigned int jmp[4];
|
2008-11-15 14:39:05 +07:00
|
|
|
unsigned long ip = rec->ip;
|
|
|
|
unsigned long tramp;
|
|
|
|
|
2008-11-25 21:39:18 +07:00
|
|
|
if (probe_kernel_read(&op, (void *)ip, MCOUNT_INSN_SIZE))
|
2008-11-15 14:39:05 +07:00
|
|
|
return -EFAULT;
|
|
|
|
|
|
|
|
/* Make sure that that this is still a 24bit jump */
|
2008-11-25 21:39:18 +07:00
|
|
|
if (!is_bl_op(op)) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("Not expected bl: opcode is %x\n", op);
|
2008-11-15 14:39:05 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* lets find where the pointer goes */
|
2008-11-25 21:39:18 +07:00
|
|
|
tramp = find_bl_target(ip, op);
|
2008-11-15 14:39:05 +07:00
|
|
|
|
|
|
|
/*
|
|
|
|
* On PPC32 the trampoline looks like:
|
2012-06-21 12:27:14 +07:00
|
|
|
* 0x3d, 0x80, 0x00, 0x00 lis r12,sym@ha
|
|
|
|
* 0x39, 0x8c, 0x00, 0x00 addi r12,r12,sym@l
|
|
|
|
* 0x7d, 0x89, 0x03, 0xa6 mtctr r12
|
2008-11-25 21:39:18 +07:00
|
|
|
* 0x4e, 0x80, 0x04, 0x20 bctr
|
2008-11-15 14:39:05 +07:00
|
|
|
*/
|
|
|
|
|
2009-05-14 03:30:24 +07:00
|
|
|
pr_devel("ip:%lx jumps to %lx", ip, tramp);
|
2008-11-15 14:39:05 +07:00
|
|
|
|
|
|
|
/* Find where the trampoline jumps to */
|
2008-11-25 21:39:18 +07:00
|
|
|
if (probe_kernel_read(jmp, (void *)tramp, sizeof(jmp))) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("Failed to read %lx\n", tramp);
|
2008-11-15 14:39:05 +07:00
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
|
2009-05-14 03:30:24 +07:00
|
|
|
pr_devel(" %08x %08x ", jmp[0], jmp[1]);
|
2008-11-25 21:39:18 +07:00
|
|
|
|
|
|
|
/* verify that this is what we expect it to be */
|
2012-06-21 12:27:14 +07:00
|
|
|
if (((jmp[0] & 0xffff0000) != 0x3d800000) ||
|
|
|
|
((jmp[1] & 0xffff0000) != 0x398c0000) ||
|
|
|
|
(jmp[2] != 0x7d8903a6) ||
|
2008-11-25 21:39:18 +07:00
|
|
|
(jmp[3] != 0x4e800420)) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("Not a trampoline\n");
|
2008-11-25 21:39:18 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
2008-11-15 14:39:05 +07:00
|
|
|
|
2008-11-25 21:39:18 +07:00
|
|
|
tramp = (jmp[1] & 0xffff) |
|
|
|
|
((jmp[0] & 0xffff) << 16);
|
2008-11-15 14:39:05 +07:00
|
|
|
if (tramp & 0x8000)
|
|
|
|
tramp -= 0x10000;
|
|
|
|
|
2009-05-14 03:30:24 +07:00
|
|
|
pr_devel(" %lx ", tramp);
|
2008-11-15 14:39:05 +07:00
|
|
|
|
|
|
|
if (tramp != addr) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("Trampoline location %08lx does not match addr\n",
|
2008-11-15 14:39:05 +07:00
|
|
|
tramp);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2009-02-11 03:10:44 +07:00
|
|
|
op = PPC_INST_NOP;
|
2008-11-15 14:39:05 +07:00
|
|
|
|
2012-04-26 15:31:19 +07:00
|
|
|
if (patch_instruction((unsigned int *)ip, op))
|
2008-11-15 14:39:05 +07:00
|
|
|
return -EPERM;
|
|
|
|
|
2008-11-15 11:47:03 +07:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
#endif /* PPC64 */
|
2009-02-06 12:33:09 +07:00
|
|
|
#endif /* CONFIG_MODULES */
|
2008-11-15 11:47:03 +07:00
|
|
|
|
powerpc/ftrace: Handle large kernel configs
Currently, we expect to be able to reach ftrace_caller() from all
ftrace-enabled functions through a single relative branch. With large
kernel configs, we see functions outside of 32MB of ftrace_caller()
causing ftrace_init() to bail.
In such configurations, gcc/ld emits two types of trampolines for mcount():
1. A long_branch, which has a single branch to mcount() for functions that
are one hop away from mcount():
c0000000019e8544 <00031b56.long_branch._mcount>:
c0000000019e8544: 4a 69 3f ac b c00000000007c4f0 <._mcount>
2. A plt_branch, for functions that are farther away from mcount():
c0000000051f33f8 <0008ba04.plt_branch._mcount>:
c0000000051f33f8: 3d 82 ff a4 addis r12,r2,-92
c0000000051f33fc: e9 8c 04 20 ld r12,1056(r12)
c0000000051f3400: 7d 89 03 a6 mtctr r12
c0000000051f3404: 4e 80 04 20 bctr
We can reuse those trampolines for ftrace if we can have those
trampolines go to ftrace_caller() instead. However, with ABIv2, we
cannot depend on r2 being valid. As such, we use only the long_branch
trampolines by patching those to instead branch to ftrace_caller or
ftrace_regs_caller.
In addition, we add additional trampolines around .text and .init.text
to catch locations that are covered by the plt branches. This allows
ftrace to work with most large kernel configurations.
For now, we always patch the trampolines to go to ftrace_regs_caller,
which is slightly inefficient. This can be optimized further at a later
point.
Signed-off-by: Naveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2018-10-17 03:25:00 +07:00
|
|
|
static unsigned long find_ftrace_tramp(unsigned long ip)
|
|
|
|
{
|
|
|
|
int i;
|
|
|
|
|
|
|
|
/*
|
|
|
|
* We have the compiler generated long_branch tramps at the end
|
|
|
|
* and we prefer those
|
|
|
|
*/
|
|
|
|
for (i = NUM_FTRACE_TRAMPS - 1; i >= 0; i--)
|
|
|
|
if (!ftrace_tramps[i])
|
|
|
|
continue;
|
|
|
|
else if (create_branch((void *)ip, ftrace_tramps[i], 0))
|
|
|
|
return ftrace_tramps[i];
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
static int add_ftrace_tramp(unsigned long tramp)
|
|
|
|
{
|
|
|
|
int i;
|
|
|
|
|
|
|
|
for (i = 0; i < NUM_FTRACE_TRAMPS; i++)
|
|
|
|
if (!ftrace_tramps[i]) {
|
|
|
|
ftrace_tramps[i] = tramp;
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
/*
|
|
|
|
* If this is a compiler generated long_branch trampoline (essentially, a
|
|
|
|
* trampoline that has a branch to _mcount()), we re-write the branch to
|
|
|
|
* instead go to ftrace_[regs_]caller() and note down the location of this
|
|
|
|
* trampoline.
|
|
|
|
*/
|
|
|
|
static int setup_mcount_compiler_tramp(unsigned long tramp)
|
|
|
|
{
|
|
|
|
int i, op;
|
|
|
|
unsigned long ptr;
|
|
|
|
static unsigned long ftrace_plt_tramps[NUM_FTRACE_TRAMPS];
|
|
|
|
|
|
|
|
/* Is this a known long jump tramp? */
|
|
|
|
for (i = 0; i < NUM_FTRACE_TRAMPS; i++)
|
|
|
|
if (!ftrace_tramps[i])
|
|
|
|
break;
|
|
|
|
else if (ftrace_tramps[i] == tramp)
|
|
|
|
return 0;
|
|
|
|
|
|
|
|
/* Is this a known plt tramp? */
|
|
|
|
for (i = 0; i < NUM_FTRACE_TRAMPS; i++)
|
|
|
|
if (!ftrace_plt_tramps[i])
|
|
|
|
break;
|
|
|
|
else if (ftrace_plt_tramps[i] == tramp)
|
|
|
|
return -1;
|
|
|
|
|
|
|
|
/* New trampoline -- read where this goes */
|
|
|
|
if (probe_kernel_read(&op, (void *)tramp, sizeof(int))) {
|
|
|
|
pr_debug("Fetching opcode failed.\n");
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Is this a 24 bit branch? */
|
|
|
|
if (!is_b_op(op)) {
|
|
|
|
pr_debug("Trampoline is not a long branch tramp.\n");
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* lets find where the pointer goes */
|
|
|
|
ptr = find_bl_target(tramp, op);
|
|
|
|
|
|
|
|
if (ptr != ppc_global_function_entry((void *)_mcount)) {
|
|
|
|
pr_debug("Trampoline target %p is not _mcount\n", (void *)ptr);
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Let's re-write the tramp to go to ftrace_[regs_]caller */
|
|
|
|
#ifdef CONFIG_DYNAMIC_FTRACE_WITH_REGS
|
|
|
|
ptr = ppc_global_function_entry((void *)ftrace_regs_caller);
|
|
|
|
#else
|
|
|
|
ptr = ppc_global_function_entry((void *)ftrace_caller);
|
|
|
|
#endif
|
|
|
|
if (!create_branch((void *)tramp, ptr, 0)) {
|
|
|
|
pr_debug("%ps is not reachable from existing mcount tramp\n",
|
|
|
|
(void *)ptr);
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (patch_branch((unsigned int *)tramp, ptr, 0)) {
|
|
|
|
pr_debug("REL24 out of range!\n");
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (add_ftrace_tramp(tramp)) {
|
|
|
|
pr_debug("No tramp locations left\n");
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
static int __ftrace_make_nop_kernel(struct dyn_ftrace *rec, unsigned long addr)
|
|
|
|
{
|
|
|
|
unsigned long tramp, ip = rec->ip;
|
|
|
|
unsigned int op;
|
|
|
|
|
|
|
|
/* Read where this goes */
|
|
|
|
if (probe_kernel_read(&op, (void *)ip, sizeof(int))) {
|
|
|
|
pr_err("Fetching opcode failed.\n");
|
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Make sure that that this is still a 24bit jump */
|
|
|
|
if (!is_bl_op(op)) {
|
|
|
|
pr_err("Not expected bl: opcode is %x\n", op);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Let's find where the pointer goes */
|
|
|
|
tramp = find_bl_target(ip, op);
|
|
|
|
|
|
|
|
pr_devel("ip:%lx jumps to %lx", ip, tramp);
|
|
|
|
|
|
|
|
if (setup_mcount_compiler_tramp(tramp)) {
|
|
|
|
/* Are other trampolines reachable? */
|
|
|
|
if (!find_ftrace_tramp(ip)) {
|
|
|
|
pr_err("No ftrace trampolines reachable from %ps\n",
|
|
|
|
(void *)ip);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
if (patch_instruction((unsigned int *)ip, PPC_INST_NOP)) {
|
|
|
|
pr_err("Patching NOP failed.\n");
|
|
|
|
return -EPERM;
|
|
|
|
}
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2008-11-15 07:21:19 +07:00
|
|
|
int ftrace_make_nop(struct module *mod,
|
|
|
|
struct dyn_ftrace *rec, unsigned long addr)
|
|
|
|
{
|
2008-11-15 11:47:03 +07:00
|
|
|
unsigned long ip = rec->ip;
|
2009-02-13 21:31:39 +07:00
|
|
|
unsigned int old, new;
|
2008-11-15 07:21:19 +07:00
|
|
|
|
|
|
|
/*
|
|
|
|
* If the calling address is more that 24 bits away,
|
|
|
|
* then we had to use a trampoline to make the call.
|
|
|
|
* Otherwise just update the call site.
|
|
|
|
*/
|
2008-11-15 11:47:03 +07:00
|
|
|
if (test_24bit_addr(ip, addr)) {
|
2008-11-15 07:21:19 +07:00
|
|
|
/* within range */
|
2009-02-11 13:19:54 +07:00
|
|
|
old = ftrace_call_replace(ip, addr, 1);
|
2009-05-29 02:33:36 +07:00
|
|
|
new = PPC_INST_NOP;
|
2008-11-15 11:47:03 +07:00
|
|
|
return ftrace_modify_code(ip, old, new);
|
powerpc/ftrace: Handle large kernel configs
Currently, we expect to be able to reach ftrace_caller() from all
ftrace-enabled functions through a single relative branch. With large
kernel configs, we see functions outside of 32MB of ftrace_caller()
causing ftrace_init() to bail.
In such configurations, gcc/ld emits two types of trampolines for mcount():
1. A long_branch, which has a single branch to mcount() for functions that
are one hop away from mcount():
c0000000019e8544 <00031b56.long_branch._mcount>:
c0000000019e8544: 4a 69 3f ac b c00000000007c4f0 <._mcount>
2. A plt_branch, for functions that are farther away from mcount():
c0000000051f33f8 <0008ba04.plt_branch._mcount>:
c0000000051f33f8: 3d 82 ff a4 addis r12,r2,-92
c0000000051f33fc: e9 8c 04 20 ld r12,1056(r12)
c0000000051f3400: 7d 89 03 a6 mtctr r12
c0000000051f3404: 4e 80 04 20 bctr
We can reuse those trampolines for ftrace if we can have those
trampolines go to ftrace_caller() instead. However, with ABIv2, we
cannot depend on r2 being valid. As such, we use only the long_branch
trampolines by patching those to instead branch to ftrace_caller or
ftrace_regs_caller.
In addition, we add additional trampolines around .text and .init.text
to catch locations that are covered by the plt branches. This allows
ftrace to work with most large kernel configurations.
For now, we always patch the trampolines to go to ftrace_regs_caller,
which is slightly inefficient. This can be optimized further at a later
point.
Signed-off-by: Naveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2018-10-17 03:25:00 +07:00
|
|
|
} else if (core_kernel_text(ip))
|
|
|
|
return __ftrace_make_nop_kernel(rec, addr);
|
2008-11-15 11:47:03 +07:00
|
|
|
|
2009-02-06 12:33:09 +07:00
|
|
|
#ifdef CONFIG_MODULES
|
2008-11-15 11:47:03 +07:00
|
|
|
/*
|
|
|
|
* Out of range jumps are called from modules.
|
|
|
|
* We should either already have a pointer to the module
|
|
|
|
* or it has been passed in.
|
|
|
|
*/
|
|
|
|
if (!rec->arch.mod) {
|
|
|
|
if (!mod) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("No module loaded addr=%lx\n", addr);
|
2008-11-15 11:47:03 +07:00
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
rec->arch.mod = mod;
|
|
|
|
} else if (mod) {
|
|
|
|
if (mod != rec->arch.mod) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("Record mod %p not equal to passed in mod %p\n",
|
2008-11-15 11:47:03 +07:00
|
|
|
rec->arch.mod, mod);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
/* nothing to do if mod == rec->arch.mod */
|
|
|
|
} else
|
|
|
|
mod = rec->arch.mod;
|
|
|
|
|
|
|
|
return __ftrace_make_nop(mod, rec, addr);
|
2009-02-06 12:33:09 +07:00
|
|
|
#else
|
|
|
|
/* We should not get here without modules */
|
|
|
|
return -EINVAL;
|
|
|
|
#endif /* CONFIG_MODULES */
|
2008-11-15 11:47:03 +07:00
|
|
|
}
|
|
|
|
|
2009-02-06 12:33:09 +07:00
|
|
|
#ifdef CONFIG_MODULES
|
2008-11-15 11:47:03 +07:00
|
|
|
#ifdef CONFIG_PPC64
|
2016-03-03 11:26:59 +07:00
|
|
|
/*
|
|
|
|
* Examine the existing instructions for __ftrace_make_call.
|
|
|
|
* They should effectively be a NOP, and follow formal constraints,
|
|
|
|
* depending on the ABI. Return false if they don't.
|
|
|
|
*/
|
2018-05-30 19:19:22 +07:00
|
|
|
#ifndef CONFIG_MPROFILE_KERNEL
|
2008-11-15 11:47:03 +07:00
|
|
|
static int
|
2016-03-03 11:26:59 +07:00
|
|
|
expected_nop_sequence(void *ip, unsigned int op0, unsigned int op1)
|
2008-11-15 11:47:03 +07:00
|
|
|
{
|
|
|
|
/*
|
2014-04-04 12:54:04 +07:00
|
|
|
* We expect to see:
|
|
|
|
*
|
|
|
|
* b +8
|
|
|
|
* ld r2,XX(r1)
|
|
|
|
*
|
|
|
|
* The load offset is different depending on the ABI. For simplicity
|
|
|
|
* just mask it out when doing the compare.
|
2008-11-15 11:47:03 +07:00
|
|
|
*/
|
2016-03-03 11:26:59 +07:00
|
|
|
if ((op0 != 0x48000008) || ((op1 & 0xffff0000) != 0xe8410000))
|
|
|
|
return 0;
|
|
|
|
return 1;
|
|
|
|
}
|
|
|
|
#else
|
|
|
|
static int
|
|
|
|
expected_nop_sequence(void *ip, unsigned int op0, unsigned int op1)
|
|
|
|
{
|
|
|
|
/* look for patched "NOP" on ppc64 with -mprofile-kernel */
|
|
|
|
if (op0 != PPC_INST_NOP)
|
|
|
|
return 0;
|
|
|
|
return 1;
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
|
|
|
|
static int
|
|
|
|
__ftrace_make_call(struct dyn_ftrace *rec, unsigned long addr)
|
|
|
|
{
|
|
|
|
unsigned int op[2];
|
|
|
|
void *ip = (void *)rec->ip;
|
2018-04-19 14:04:09 +07:00
|
|
|
unsigned long entry, ptr, tramp;
|
|
|
|
struct module *mod = rec->arch.mod;
|
2016-03-03 11:26:59 +07:00
|
|
|
|
|
|
|
/* read where this goes */
|
|
|
|
if (probe_kernel_read(op, ip, sizeof(op)))
|
|
|
|
return -EFAULT;
|
|
|
|
|
|
|
|
if (!expected_nop_sequence(ip, op[0], op[1])) {
|
|
|
|
pr_err("Unexpected call sequence at %p: %x %x\n",
|
|
|
|
ip, op[0], op[1]);
|
2008-11-15 11:47:03 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2018-04-19 14:04:09 +07:00
|
|
|
/* If we never set up ftrace trampoline(s), then bail */
|
|
|
|
#ifdef CONFIG_DYNAMIC_FTRACE_WITH_REGS
|
|
|
|
if (!mod->arch.tramp || !mod->arch.tramp_regs) {
|
|
|
|
#else
|
|
|
|
if (!mod->arch.tramp) {
|
|
|
|
#endif
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("No ftrace trampoline\n");
|
2008-11-15 11:47:03 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2018-04-19 14:04:09 +07:00
|
|
|
#ifdef CONFIG_DYNAMIC_FTRACE_WITH_REGS
|
|
|
|
if (rec->flags & FTRACE_FL_REGS)
|
|
|
|
tramp = mod->arch.tramp_regs;
|
|
|
|
else
|
|
|
|
#endif
|
|
|
|
tramp = mod->arch.tramp;
|
|
|
|
|
|
|
|
if (module_trampoline_target(mod, tramp, &ptr)) {
|
|
|
|
pr_err("Failed to get trampoline target\n");
|
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
|
|
|
|
pr_devel("trampoline target %lx", ptr);
|
|
|
|
|
|
|
|
entry = ppc_global_function_entry((void *)addr);
|
|
|
|
/* This should match what was called */
|
|
|
|
if (ptr != entry) {
|
|
|
|
pr_err("addr %lx does not match expected %lx\n", ptr, entry);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2014-04-04 12:54:04 +07:00
|
|
|
/* Ensure branch is within 24 bits */
|
2018-04-19 14:04:09 +07:00
|
|
|
if (!create_branch(ip, tramp, BRANCH_SET_LINK)) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("Branch out of range\n");
|
2008-11-15 11:47:03 +07:00
|
|
|
return -EINVAL;
|
2008-11-15 07:21:19 +07:00
|
|
|
}
|
|
|
|
|
2018-04-19 14:04:09 +07:00
|
|
|
if (patch_branch(ip, tramp, BRANCH_SET_LINK)) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("REL24 out of range!\n");
|
2014-04-04 12:54:04 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
2008-11-26 01:22:48 +07:00
|
|
|
|
2008-11-15 07:21:19 +07:00
|
|
|
return 0;
|
|
|
|
}
|
2016-03-03 11:26:59 +07:00
|
|
|
|
|
|
|
#else /* !CONFIG_PPC64: */
|
2008-11-15 11:47:03 +07:00
|
|
|
static int
|
|
|
|
__ftrace_make_call(struct dyn_ftrace *rec, unsigned long addr)
|
|
|
|
{
|
2008-11-25 21:39:18 +07:00
|
|
|
unsigned int op;
|
2008-11-15 14:39:05 +07:00
|
|
|
unsigned long ip = rec->ip;
|
|
|
|
|
|
|
|
/* read where this goes */
|
2008-11-25 21:39:18 +07:00
|
|
|
if (probe_kernel_read(&op, (void *)ip, MCOUNT_INSN_SIZE))
|
2008-11-15 14:39:05 +07:00
|
|
|
return -EFAULT;
|
|
|
|
|
|
|
|
/* It should be pointing to a nop */
|
2009-02-11 03:10:44 +07:00
|
|
|
if (op != PPC_INST_NOP) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("Expected NOP but have %x\n", op);
|
2008-11-15 14:39:05 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* If we never set up a trampoline to ftrace_caller, then bail */
|
|
|
|
if (!rec->arch.mod->arch.tramp) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("No ftrace trampoline\n");
|
2008-11-15 14:39:05 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2008-11-26 05:06:19 +07:00
|
|
|
/* create the branch to the trampoline */
|
|
|
|
op = create_branch((unsigned int *)ip,
|
|
|
|
rec->arch.mod->arch.tramp, BRANCH_SET_LINK);
|
|
|
|
if (!op) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("REL24 out of range!\n");
|
2008-11-15 14:39:05 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2009-05-14 03:30:24 +07:00
|
|
|
pr_devel("write to %lx\n", rec->ip);
|
2008-11-15 14:39:05 +07:00
|
|
|
|
2012-04-26 15:31:19 +07:00
|
|
|
if (patch_instruction((unsigned int *)ip, op))
|
2008-11-15 14:39:05 +07:00
|
|
|
return -EPERM;
|
|
|
|
|
2008-11-15 11:47:03 +07:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
#endif /* CONFIG_PPC64 */
|
2009-02-06 12:33:09 +07:00
|
|
|
#endif /* CONFIG_MODULES */
|
2008-11-15 07:21:19 +07:00
|
|
|
|
powerpc/ftrace: Handle large kernel configs
Currently, we expect to be able to reach ftrace_caller() from all
ftrace-enabled functions through a single relative branch. With large
kernel configs, we see functions outside of 32MB of ftrace_caller()
causing ftrace_init() to bail.
In such configurations, gcc/ld emits two types of trampolines for mcount():
1. A long_branch, which has a single branch to mcount() for functions that
are one hop away from mcount():
c0000000019e8544 <00031b56.long_branch._mcount>:
c0000000019e8544: 4a 69 3f ac b c00000000007c4f0 <._mcount>
2. A plt_branch, for functions that are farther away from mcount():
c0000000051f33f8 <0008ba04.plt_branch._mcount>:
c0000000051f33f8: 3d 82 ff a4 addis r12,r2,-92
c0000000051f33fc: e9 8c 04 20 ld r12,1056(r12)
c0000000051f3400: 7d 89 03 a6 mtctr r12
c0000000051f3404: 4e 80 04 20 bctr
We can reuse those trampolines for ftrace if we can have those
trampolines go to ftrace_caller() instead. However, with ABIv2, we
cannot depend on r2 being valid. As such, we use only the long_branch
trampolines by patching those to instead branch to ftrace_caller or
ftrace_regs_caller.
In addition, we add additional trampolines around .text and .init.text
to catch locations that are covered by the plt branches. This allows
ftrace to work with most large kernel configurations.
For now, we always patch the trampolines to go to ftrace_regs_caller,
which is slightly inefficient. This can be optimized further at a later
point.
Signed-off-by: Naveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2018-10-17 03:25:00 +07:00
|
|
|
static int __ftrace_make_call_kernel(struct dyn_ftrace *rec, unsigned long addr)
|
|
|
|
{
|
|
|
|
unsigned int op;
|
|
|
|
void *ip = (void *)rec->ip;
|
|
|
|
unsigned long tramp, entry, ptr;
|
|
|
|
|
|
|
|
/* Make sure we're being asked to patch branch to a known ftrace addr */
|
|
|
|
entry = ppc_global_function_entry((void *)ftrace_caller);
|
|
|
|
ptr = ppc_global_function_entry((void *)addr);
|
|
|
|
|
|
|
|
if (ptr != entry) {
|
|
|
|
#ifdef CONFIG_DYNAMIC_FTRACE_WITH_REGS
|
|
|
|
entry = ppc_global_function_entry((void *)ftrace_regs_caller);
|
|
|
|
if (ptr != entry) {
|
|
|
|
#endif
|
|
|
|
pr_err("Unknown ftrace addr to patch: %ps\n", (void *)ptr);
|
|
|
|
return -EINVAL;
|
|
|
|
#ifdef CONFIG_DYNAMIC_FTRACE_WITH_REGS
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Make sure we have a nop */
|
|
|
|
if (probe_kernel_read(&op, ip, sizeof(op))) {
|
|
|
|
pr_err("Unable to read ftrace location %p\n", ip);
|
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (op != PPC_INST_NOP) {
|
|
|
|
pr_err("Unexpected call sequence at %p: %x\n", ip, op);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
tramp = find_ftrace_tramp((unsigned long)ip);
|
|
|
|
if (!tramp) {
|
|
|
|
pr_err("No ftrace trampolines reachable from %ps\n", ip);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (patch_branch(ip, tramp, BRANCH_SET_LINK)) {
|
|
|
|
pr_err("Error patching branch to ftrace tramp!\n");
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2008-11-15 07:21:19 +07:00
|
|
|
int ftrace_make_call(struct dyn_ftrace *rec, unsigned long addr)
|
|
|
|
{
|
2008-11-15 11:47:03 +07:00
|
|
|
unsigned long ip = rec->ip;
|
2009-02-13 21:31:39 +07:00
|
|
|
unsigned int old, new;
|
2008-11-15 07:21:19 +07:00
|
|
|
|
|
|
|
/*
|
|
|
|
* If the calling address is more that 24 bits away,
|
|
|
|
* then we had to use a trampoline to make the call.
|
|
|
|
* Otherwise just update the call site.
|
|
|
|
*/
|
2008-11-15 11:47:03 +07:00
|
|
|
if (test_24bit_addr(ip, addr)) {
|
2008-11-15 07:21:19 +07:00
|
|
|
/* within range */
|
2009-05-29 02:33:36 +07:00
|
|
|
old = PPC_INST_NOP;
|
2009-02-11 13:19:54 +07:00
|
|
|
new = ftrace_call_replace(ip, addr, 1);
|
2008-11-15 11:47:03 +07:00
|
|
|
return ftrace_modify_code(ip, old, new);
|
powerpc/ftrace: Handle large kernel configs
Currently, we expect to be able to reach ftrace_caller() from all
ftrace-enabled functions through a single relative branch. With large
kernel configs, we see functions outside of 32MB of ftrace_caller()
causing ftrace_init() to bail.
In such configurations, gcc/ld emits two types of trampolines for mcount():
1. A long_branch, which has a single branch to mcount() for functions that
are one hop away from mcount():
c0000000019e8544 <00031b56.long_branch._mcount>:
c0000000019e8544: 4a 69 3f ac b c00000000007c4f0 <._mcount>
2. A plt_branch, for functions that are farther away from mcount():
c0000000051f33f8 <0008ba04.plt_branch._mcount>:
c0000000051f33f8: 3d 82 ff a4 addis r12,r2,-92
c0000000051f33fc: e9 8c 04 20 ld r12,1056(r12)
c0000000051f3400: 7d 89 03 a6 mtctr r12
c0000000051f3404: 4e 80 04 20 bctr
We can reuse those trampolines for ftrace if we can have those
trampolines go to ftrace_caller() instead. However, with ABIv2, we
cannot depend on r2 being valid. As such, we use only the long_branch
trampolines by patching those to instead branch to ftrace_caller or
ftrace_regs_caller.
In addition, we add additional trampolines around .text and .init.text
to catch locations that are covered by the plt branches. This allows
ftrace to work with most large kernel configurations.
For now, we always patch the trampolines to go to ftrace_regs_caller,
which is slightly inefficient. This can be optimized further at a later
point.
Signed-off-by: Naveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2018-10-17 03:25:00 +07:00
|
|
|
} else if (core_kernel_text(ip))
|
|
|
|
return __ftrace_make_call_kernel(rec, addr);
|
2008-11-15 07:21:19 +07:00
|
|
|
|
2009-02-06 12:33:09 +07:00
|
|
|
#ifdef CONFIG_MODULES
|
2008-11-15 11:47:03 +07:00
|
|
|
/*
|
|
|
|
* Out of range jumps are called from modules.
|
|
|
|
* Being that we are converting from nop, it had better
|
|
|
|
* already have a module defined.
|
|
|
|
*/
|
|
|
|
if (!rec->arch.mod) {
|
2014-06-17 13:15:36 +07:00
|
|
|
pr_err("No module loaded\n");
|
2008-11-15 11:47:03 +07:00
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
return __ftrace_make_call(rec, addr);
|
2009-02-06 12:33:09 +07:00
|
|
|
#else
|
|
|
|
/* We should not get here without modules */
|
|
|
|
return -EINVAL;
|
|
|
|
#endif /* CONFIG_MODULES */
|
2008-11-15 07:21:19 +07:00
|
|
|
}
|
|
|
|
|
2018-04-19 14:04:09 +07:00
|
|
|
#ifdef CONFIG_DYNAMIC_FTRACE_WITH_REGS
|
|
|
|
#ifdef CONFIG_MODULES
|
|
|
|
static int
|
|
|
|
__ftrace_modify_call(struct dyn_ftrace *rec, unsigned long old_addr,
|
|
|
|
unsigned long addr)
|
|
|
|
{
|
|
|
|
unsigned int op;
|
|
|
|
unsigned long ip = rec->ip;
|
|
|
|
unsigned long entry, ptr, tramp;
|
|
|
|
struct module *mod = rec->arch.mod;
|
|
|
|
|
|
|
|
/* If we never set up ftrace trampolines, then bail */
|
|
|
|
if (!mod->arch.tramp || !mod->arch.tramp_regs) {
|
|
|
|
pr_err("No ftrace trampoline\n");
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* read where this goes */
|
|
|
|
if (probe_kernel_read(&op, (void *)ip, sizeof(int))) {
|
|
|
|
pr_err("Fetching opcode failed.\n");
|
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Make sure that that this is still a 24bit jump */
|
|
|
|
if (!is_bl_op(op)) {
|
|
|
|
pr_err("Not expected bl: opcode is %x\n", op);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* lets find where the pointer goes */
|
|
|
|
tramp = find_bl_target(ip, op);
|
|
|
|
entry = ppc_global_function_entry((void *)old_addr);
|
|
|
|
|
|
|
|
pr_devel("ip:%lx jumps to %lx", ip, tramp);
|
|
|
|
|
|
|
|
if (tramp != entry) {
|
|
|
|
/* old_addr is not within range, so we must have used a trampoline */
|
|
|
|
if (module_trampoline_target(mod, tramp, &ptr)) {
|
|
|
|
pr_err("Failed to get trampoline target\n");
|
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
|
|
|
|
pr_devel("trampoline target %lx", ptr);
|
|
|
|
|
|
|
|
/* This should match what was called */
|
|
|
|
if (ptr != entry) {
|
|
|
|
pr_err("addr %lx does not match expected %lx\n", ptr, entry);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
/* The new target may be within range */
|
|
|
|
if (test_24bit_addr(ip, addr)) {
|
|
|
|
/* within range */
|
|
|
|
if (patch_branch((unsigned int *)ip, addr, BRANCH_SET_LINK)) {
|
|
|
|
pr_err("REL24 out of range!\n");
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (rec->flags & FTRACE_FL_REGS)
|
|
|
|
tramp = mod->arch.tramp_regs;
|
|
|
|
else
|
|
|
|
tramp = mod->arch.tramp;
|
|
|
|
|
|
|
|
if (module_trampoline_target(mod, tramp, &ptr)) {
|
|
|
|
pr_err("Failed to get trampoline target\n");
|
|
|
|
return -EFAULT;
|
|
|
|
}
|
|
|
|
|
|
|
|
pr_devel("trampoline target %lx", ptr);
|
|
|
|
|
|
|
|
entry = ppc_global_function_entry((void *)addr);
|
|
|
|
/* This should match what was called */
|
|
|
|
if (ptr != entry) {
|
|
|
|
pr_err("addr %lx does not match expected %lx\n", ptr, entry);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Ensure branch is within 24 bits */
|
|
|
|
if (!create_branch((unsigned int *)ip, tramp, BRANCH_SET_LINK)) {
|
|
|
|
pr_err("Branch out of range\n");
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (patch_branch((unsigned int *)ip, tramp, BRANCH_SET_LINK)) {
|
|
|
|
pr_err("REL24 out of range!\n");
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
|
|
|
|
int ftrace_modify_call(struct dyn_ftrace *rec, unsigned long old_addr,
|
|
|
|
unsigned long addr)
|
|
|
|
{
|
|
|
|
unsigned long ip = rec->ip;
|
|
|
|
unsigned int old, new;
|
|
|
|
|
|
|
|
/*
|
|
|
|
* If the calling address is more that 24 bits away,
|
|
|
|
* then we had to use a trampoline to make the call.
|
|
|
|
* Otherwise just update the call site.
|
|
|
|
*/
|
|
|
|
if (test_24bit_addr(ip, addr) && test_24bit_addr(ip, old_addr)) {
|
|
|
|
/* within range */
|
|
|
|
old = ftrace_call_replace(ip, old_addr, 1);
|
|
|
|
new = ftrace_call_replace(ip, addr, 1);
|
|
|
|
return ftrace_modify_code(ip, old, new);
|
powerpc/ftrace: Handle large kernel configs
Currently, we expect to be able to reach ftrace_caller() from all
ftrace-enabled functions through a single relative branch. With large
kernel configs, we see functions outside of 32MB of ftrace_caller()
causing ftrace_init() to bail.
In such configurations, gcc/ld emits two types of trampolines for mcount():
1. A long_branch, which has a single branch to mcount() for functions that
are one hop away from mcount():
c0000000019e8544 <00031b56.long_branch._mcount>:
c0000000019e8544: 4a 69 3f ac b c00000000007c4f0 <._mcount>
2. A plt_branch, for functions that are farther away from mcount():
c0000000051f33f8 <0008ba04.plt_branch._mcount>:
c0000000051f33f8: 3d 82 ff a4 addis r12,r2,-92
c0000000051f33fc: e9 8c 04 20 ld r12,1056(r12)
c0000000051f3400: 7d 89 03 a6 mtctr r12
c0000000051f3404: 4e 80 04 20 bctr
We can reuse those trampolines for ftrace if we can have those
trampolines go to ftrace_caller() instead. However, with ABIv2, we
cannot depend on r2 being valid. As such, we use only the long_branch
trampolines by patching those to instead branch to ftrace_caller or
ftrace_regs_caller.
In addition, we add additional trampolines around .text and .init.text
to catch locations that are covered by the plt branches. This allows
ftrace to work with most large kernel configurations.
For now, we always patch the trampolines to go to ftrace_regs_caller,
which is slightly inefficient. This can be optimized further at a later
point.
Signed-off-by: Naveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2018-10-17 03:25:00 +07:00
|
|
|
} else if (core_kernel_text(ip)) {
|
|
|
|
/*
|
|
|
|
* We always patch out of range locations to go to the regs
|
|
|
|
* variant, so there is nothing to do here
|
|
|
|
*/
|
|
|
|
return 0;
|
2018-04-19 14:04:09 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
#ifdef CONFIG_MODULES
|
|
|
|
/*
|
|
|
|
* Out of range jumps are called from modules.
|
|
|
|
*/
|
|
|
|
if (!rec->arch.mod) {
|
|
|
|
pr_err("No module loaded\n");
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
|
|
|
return __ftrace_modify_call(rec, old_addr, addr);
|
|
|
|
#else
|
|
|
|
/* We should not get here without modules */
|
|
|
|
return -EINVAL;
|
|
|
|
#endif /* CONFIG_MODULES */
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
|
2008-10-23 20:33:08 +07:00
|
|
|
int ftrace_update_ftrace_func(ftrace_func_t func)
|
2008-05-15 10:49:44 +07:00
|
|
|
{
|
|
|
|
unsigned long ip = (unsigned long)(&ftrace_call);
|
2009-02-13 21:31:39 +07:00
|
|
|
unsigned int old, new;
|
2008-05-15 10:49:44 +07:00
|
|
|
int ret;
|
|
|
|
|
2009-02-13 21:31:39 +07:00
|
|
|
old = *(unsigned int *)&ftrace_call;
|
2009-02-11 13:19:54 +07:00
|
|
|
new = ftrace_call_replace(ip, (unsigned long)func, 1);
|
2008-05-15 10:49:44 +07:00
|
|
|
ret = ftrace_modify_code(ip, old, new);
|
|
|
|
|
2018-04-19 14:04:09 +07:00
|
|
|
#ifdef CONFIG_DYNAMIC_FTRACE_WITH_REGS
|
|
|
|
/* Also update the regs callback function */
|
|
|
|
if (!ret) {
|
|
|
|
ip = (unsigned long)(&ftrace_regs_call);
|
|
|
|
old = *(unsigned int *)&ftrace_regs_call;
|
|
|
|
new = ftrace_call_replace(ip, (unsigned long)func, 1);
|
|
|
|
ret = ftrace_modify_code(ip, old, new);
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
|
2008-05-15 10:49:44 +07:00
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
2016-03-03 11:26:57 +07:00
|
|
|
/*
|
|
|
|
* Use the default ftrace_modify_all_code, but without
|
|
|
|
* stop_machine().
|
|
|
|
*/
|
2012-04-26 15:31:17 +07:00
|
|
|
void arch_ftrace_update_code(int command)
|
|
|
|
{
|
2016-03-03 11:26:57 +07:00
|
|
|
ftrace_modify_all_code(command);
|
2012-04-26 15:31:17 +07:00
|
|
|
}
|
|
|
|
|
powerpc/ftrace: Handle large kernel configs
Currently, we expect to be able to reach ftrace_caller() from all
ftrace-enabled functions through a single relative branch. With large
kernel configs, we see functions outside of 32MB of ftrace_caller()
causing ftrace_init() to bail.
In such configurations, gcc/ld emits two types of trampolines for mcount():
1. A long_branch, which has a single branch to mcount() for functions that
are one hop away from mcount():
c0000000019e8544 <00031b56.long_branch._mcount>:
c0000000019e8544: 4a 69 3f ac b c00000000007c4f0 <._mcount>
2. A plt_branch, for functions that are farther away from mcount():
c0000000051f33f8 <0008ba04.plt_branch._mcount>:
c0000000051f33f8: 3d 82 ff a4 addis r12,r2,-92
c0000000051f33fc: e9 8c 04 20 ld r12,1056(r12)
c0000000051f3400: 7d 89 03 a6 mtctr r12
c0000000051f3404: 4e 80 04 20 bctr
We can reuse those trampolines for ftrace if we can have those
trampolines go to ftrace_caller() instead. However, with ABIv2, we
cannot depend on r2 being valid. As such, we use only the long_branch
trampolines by patching those to instead branch to ftrace_caller or
ftrace_regs_caller.
In addition, we add additional trampolines around .text and .init.text
to catch locations that are covered by the plt branches. This allows
ftrace to work with most large kernel configurations.
For now, we always patch the trampolines to go to ftrace_regs_caller,
which is slightly inefficient. This can be optimized further at a later
point.
Signed-off-by: Naveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2018-10-17 03:25:00 +07:00
|
|
|
#ifdef CONFIG_PPC64
|
|
|
|
#define PACATOC offsetof(struct paca_struct, kernel_toc)
|
|
|
|
|
|
|
|
#define PPC_LO(v) ((v) & 0xffff)
|
|
|
|
#define PPC_HI(v) (((v) >> 16) & 0xffff)
|
|
|
|
#define PPC_HA(v) PPC_HI ((v) + 0x8000)
|
|
|
|
|
|
|
|
extern unsigned int ftrace_tramp_text[], ftrace_tramp_init[];
|
|
|
|
|
|
|
|
int __init ftrace_dyn_arch_init(void)
|
|
|
|
{
|
|
|
|
int i;
|
|
|
|
unsigned int *tramp[] = { ftrace_tramp_text, ftrace_tramp_init };
|
|
|
|
u32 stub_insns[] = {
|
|
|
|
0xe98d0000 | PACATOC, /* ld r12,PACATOC(r13) */
|
|
|
|
0x3d8c0000, /* addis r12,r12,<high> */
|
|
|
|
0x398c0000, /* addi r12,r12,<low> */
|
|
|
|
0x7d8903a6, /* mtctr r12 */
|
|
|
|
0x4e800420, /* bctr */
|
|
|
|
};
|
|
|
|
#ifdef CONFIG_DYNAMIC_FTRACE_WITH_REGS
|
|
|
|
unsigned long addr = ppc_global_function_entry((void *)ftrace_regs_caller);
|
|
|
|
#else
|
|
|
|
unsigned long addr = ppc_global_function_entry((void *)ftrace_caller);
|
|
|
|
#endif
|
|
|
|
long reladdr = addr - kernel_toc_addr();
|
|
|
|
|
|
|
|
if (reladdr > 0x7FFFFFFF || reladdr < -(0x80000000L)) {
|
|
|
|
pr_err("Address of %ps out of range of kernel_toc.\n",
|
|
|
|
(void *)addr);
|
|
|
|
return -1;
|
|
|
|
}
|
|
|
|
|
|
|
|
for (i = 0; i < 2; i++) {
|
|
|
|
memcpy(tramp[i], stub_insns, sizeof(stub_insns));
|
|
|
|
tramp[i][1] |= PPC_HA(reladdr);
|
|
|
|
tramp[i][2] |= PPC_LO(reladdr);
|
|
|
|
add_ftrace_tramp((unsigned long)tramp[i]);
|
|
|
|
}
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
#else
|
2014-02-25 01:59:59 +07:00
|
|
|
int __init ftrace_dyn_arch_init(void)
|
2008-05-15 10:49:44 +07:00
|
|
|
{
|
|
|
|
return 0;
|
|
|
|
}
|
powerpc/ftrace: Handle large kernel configs
Currently, we expect to be able to reach ftrace_caller() from all
ftrace-enabled functions through a single relative branch. With large
kernel configs, we see functions outside of 32MB of ftrace_caller()
causing ftrace_init() to bail.
In such configurations, gcc/ld emits two types of trampolines for mcount():
1. A long_branch, which has a single branch to mcount() for functions that
are one hop away from mcount():
c0000000019e8544 <00031b56.long_branch._mcount>:
c0000000019e8544: 4a 69 3f ac b c00000000007c4f0 <._mcount>
2. A plt_branch, for functions that are farther away from mcount():
c0000000051f33f8 <0008ba04.plt_branch._mcount>:
c0000000051f33f8: 3d 82 ff a4 addis r12,r2,-92
c0000000051f33fc: e9 8c 04 20 ld r12,1056(r12)
c0000000051f3400: 7d 89 03 a6 mtctr r12
c0000000051f3404: 4e 80 04 20 bctr
We can reuse those trampolines for ftrace if we can have those
trampolines go to ftrace_caller() instead. However, with ABIv2, we
cannot depend on r2 being valid. As such, we use only the long_branch
trampolines by patching those to instead branch to ftrace_caller or
ftrace_regs_caller.
In addition, we add additional trampolines around .text and .init.text
to catch locations that are covered by the plt branches. This allows
ftrace to work with most large kernel configurations.
For now, we always patch the trampolines to go to ftrace_regs_caller,
which is slightly inefficient. This can be optimized further at a later
point.
Signed-off-by: Naveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
2018-10-17 03:25:00 +07:00
|
|
|
#endif
|
2009-02-10 12:10:27 +07:00
|
|
|
#endif /* CONFIG_DYNAMIC_FTRACE */
|
|
|
|
|
|
|
|
#ifdef CONFIG_FUNCTION_GRAPH_TRACER
|
|
|
|
|
2009-02-11 13:19:54 +07:00
|
|
|
extern void ftrace_graph_call(void);
|
|
|
|
extern void ftrace_graph_stub(void);
|
|
|
|
|
|
|
|
int ftrace_enable_ftrace_graph_caller(void)
|
|
|
|
{
|
|
|
|
unsigned long ip = (unsigned long)(&ftrace_graph_call);
|
|
|
|
unsigned long addr = (unsigned long)(&ftrace_graph_caller);
|
|
|
|
unsigned long stub = (unsigned long)(&ftrace_graph_stub);
|
2009-02-13 21:31:39 +07:00
|
|
|
unsigned int old, new;
|
2009-02-11 13:19:54 +07:00
|
|
|
|
2009-02-13 21:31:39 +07:00
|
|
|
old = ftrace_call_replace(ip, stub, 0);
|
2009-02-11 13:19:54 +07:00
|
|
|
new = ftrace_call_replace(ip, addr, 0);
|
|
|
|
|
|
|
|
return ftrace_modify_code(ip, old, new);
|
|
|
|
}
|
|
|
|
|
|
|
|
int ftrace_disable_ftrace_graph_caller(void)
|
|
|
|
{
|
|
|
|
unsigned long ip = (unsigned long)(&ftrace_graph_call);
|
|
|
|
unsigned long addr = (unsigned long)(&ftrace_graph_caller);
|
|
|
|
unsigned long stub = (unsigned long)(&ftrace_graph_stub);
|
2009-02-13 21:31:39 +07:00
|
|
|
unsigned int old, new;
|
2009-02-11 13:19:54 +07:00
|
|
|
|
2009-02-13 21:31:39 +07:00
|
|
|
old = ftrace_call_replace(ip, addr, 0);
|
2009-02-11 13:19:54 +07:00
|
|
|
new = ftrace_call_replace(ip, stub, 0);
|
|
|
|
|
|
|
|
return ftrace_modify_code(ip, old, new);
|
|
|
|
}
|
|
|
|
|
2009-02-10 12:10:27 +07:00
|
|
|
/*
|
|
|
|
* Hook the return address and push it in the stack of return addrs
|
2014-09-17 14:07:04 +07:00
|
|
|
* in current thread info. Return the address we want to divert to.
|
2009-02-10 12:10:27 +07:00
|
|
|
*/
|
2014-09-17 14:07:04 +07:00
|
|
|
unsigned long prepare_ftrace_return(unsigned long parent, unsigned long ip)
|
2009-02-10 12:10:27 +07:00
|
|
|
{
|
2014-09-17 14:07:03 +07:00
|
|
|
unsigned long return_hooker;
|
2009-02-10 12:10:27 +07:00
|
|
|
|
2014-06-25 21:27:30 +07:00
|
|
|
if (unlikely(ftrace_graph_is_dead()))
|
2014-09-17 14:07:04 +07:00
|
|
|
goto out;
|
2014-06-25 21:27:30 +07:00
|
|
|
|
2009-02-10 12:10:27 +07:00
|
|
|
if (unlikely(atomic_read(¤t->tracing_graph_pause)))
|
2014-09-17 14:07:04 +07:00
|
|
|
goto out;
|
2009-02-10 12:10:27 +07:00
|
|
|
|
2014-09-17 14:07:03 +07:00
|
|
|
return_hooker = ppc_function_entry(return_to_handler);
|
2009-02-10 12:10:27 +07:00
|
|
|
|
2018-11-19 05:28:53 +07:00
|
|
|
if (!function_graph_enter(parent, ip, 0, NULL))
|
|
|
|
parent = return_hooker;
|
2014-09-17 14:07:04 +07:00
|
|
|
out:
|
|
|
|
return parent;
|
2009-02-10 12:10:27 +07:00
|
|
|
}
|
|
|
|
#endif /* CONFIG_FUNCTION_GRAPH_TRACER */
|
2011-02-03 00:27:24 +07:00
|
|
|
|
2016-06-06 23:56:10 +07:00
|
|
|
#ifdef PPC64_ELF_ABI_v1
|
2016-04-26 04:56:14 +07:00
|
|
|
char *arch_ftrace_match_adjust(char *str, const char *search)
|
|
|
|
{
|
|
|
|
if (str[0] == '.' && search[0] != '.')
|
|
|
|
return str + 1;
|
|
|
|
else
|
|
|
|
return str;
|
|
|
|
}
|
2016-06-06 23:56:10 +07:00
|
|
|
#endif /* PPC64_ELF_ABI_v1 */
|