linux_dsm_epyc7002/arch/csky/include/asm
Al Viro 51bb38cb78 csky: Fixup raw_copy_from_user()
If raw_copy_from_user(to, from, N) returns K, callers expect
the first N - K bytes starting at to to have been replaced with
the contents of corresponding area starting at from and the last
K bytes of destination *left* *unmodified*.

What arch/sky/lib/usercopy.c is doing is broken - it can lead to e.g.
data corruption on write(2).

raw_copy_to_user() is inaccurate about return value, which is a bug,
but consequences are less drastic than for raw_copy_from_user().
And just what are those access_ok() doing in there?  I mean, look into
linux/uaccess.h; that's where we do that check (as well as zero tail
on failure in the callers that need zeroing).

AFAICS, all of that shouldn't be hard to fix; something like a patch
below might make a useful starting point.

I would suggest moving these macros into usercopy.c (they are never
used anywhere else) and possibly expanding them there; if you leave
them alive, please at least rename __copy_user_zeroing(). Again,
it must not zero anything on failed read.

Said that, I'm not sure we won't be better off simply turning
usercopy.c into usercopy.S - all that is left there is a couple of
functions, each consisting only of inline asm.

Guo Ren reply:

Yes, raw_copy_from_user is wrong, it's no need zeroing code.

unsigned long _copy_from_user(void *to, const void __user *from,
unsigned long n)
{
        unsigned long res = n;
        might_fault();
        if (likely(access_ok(from, n))) {
                kasan_check_write(to, n);
                res = raw_copy_from_user(to, from, n);
        }
        if (unlikely(res))
                memset(to + (n - res), 0, res);
        return res;
}
EXPORT_SYMBOL(_copy_from_user);

You are right and access_ok() should be removed.

but, how about:
do {
...
        "2:     stw     %3, (%1, 0)     \n"             \
+       "       subi    %0, 4          \n"               \
        "9:     stw     %4, (%1, 4)     \n"             \
+       "       subi    %0, 4          \n"               \
        "10:    stw     %5, (%1, 8)     \n"             \
+       "       subi    %0, 4          \n"               \
        "11:    stw     %6, (%1, 12)    \n"             \
+       "       subi    %0, 4          \n"               \
        "       addi    %2, 16          \n"             \
        "       addi    %1, 16          \n"             \

Don't expand __ex_table

AI Viro reply:

Hey, I've no idea about the instruction scheduling on csky -
if that doesn't slow the things down, all the better.  It's just
that copy_to_user() and friends are on fairly hot codepaths,
and in quite a few situations they will dominate the speed of
e.g. read(2).  So I tried to keep the fast path unchanged.
Up to the architecture maintainers, obviously.  Which would be
you...

As for the fixups size increase (__ex_table size is unchanged)...
You have each of those macros expanded exactly once.
So the size is not a serious argument, IMO - useless complexity
would be, if it is, in fact, useless; the size... not really,
especially since those extra subi will at least offset it.

Again, up to you - asm optimizations of (essentially)
memcpy()-style loops are tricky and can depend upon the
fairly subtle details of architecture.  So even on something
I know reasonably well I would resort to direct experiments
if I can't pass the buck to architecture maintainers.

It *is* worth optimizing - this is where read() from a file
that is already in page cache spends most of the time, etc.

Guo Ren reply:

Thx, after fixup some typo “sub %0, 4”, apply the patch.

TODO:
 - user copy/from codes are still need optimizing.

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
Signed-off-by: Guo Ren <guoren@linux.alibaba.com>
2020-05-15 00:16:30 +08:00
..
addrspace.h csky: MMU and page table management 2018-10-25 23:36:19 +08:00
asid.h csky: Add new asid lib code from arm 2019-07-19 14:21:36 +08:00
atomic.h csky: Atomic operations 2018-10-26 00:54:23 +08:00
barrier.h csky: Fixup mb() synchronization problem 2019-07-31 11:04:29 +08:00
bitops.h fls: change parameter to unsigned int 2019-01-04 13:13:46 -08:00
bug.h csky: Debug and Ptrace GDB 2018-10-26 00:54:25 +08:00
cache.h csky: Support icache flush without specific instructions 2020-02-21 15:43:24 +08:00
cacheflush.h csky: Add flush_icache_mm to defer flush icache all 2020-02-21 15:43:24 +08:00
checksum.h csky: Misc headers 2018-10-26 00:54:26 +08:00
cmpxchg.h csky: Atomic operations 2018-10-26 00:54:23 +08:00
elf.h csky: Add EM_CSKY_OLD 39 2018-12-31 23:17:24 +08:00
fixmap.h csky: Separate fixaddr_init from highmem 2020-02-21 15:43:24 +08:00
ftrace.h csky: Implement ftrace with regs 2020-03-08 20:55:14 +08:00
highmem.h csky: MMU and page table management 2018-10-25 23:36:19 +08:00
io.h csky: use generic ioremap 2019-11-12 11:37:52 +01:00
irqflags.h csky: IRQ handling 2018-10-26 00:54:22 +08:00
Kbuild csky: Add kprobes supported 2020-04-03 11:14:17 +08:00
kprobes.h csky: Add kprobes supported 2020-04-03 11:14:17 +08:00
memory.h csky: Separate fixaddr_init from highmem 2020-02-21 15:43:24 +08:00
mmu_context.h csky: Add flush_icache_mm to defer flush icache all 2020-02-21 15:43:24 +08:00
mmu.h csky: Add flush_icache_mm to defer flush icache all 2020-02-21 15:43:24 +08:00
page.h mm/vma: define a default value for VM_DATA_DEFAULT_FLAGS 2020-04-10 15:36:21 -07:00
pci.h csky: Add PCI support 2020-02-21 15:43:25 +08:00
perf_event.h csky: Add perf_arch_fetch_caller_regs support 2019-04-22 13:44:57 +08:00
pgalloc.h mm: treewide: clarify pgtable_page_{ctor,dtor}() naming 2019-09-26 10:10:44 -07:00
pgtable.h mm/special: create generic fallbacks for pte_special() and pte_mkspecial() 2020-04-10 15:36:21 -07:00
probes.h csky: Add kprobes supported 2020-04-03 11:14:17 +08:00
processor.h csky: Fixup gdbmacros.txt with name sp in thread_struct 2020-05-15 00:16:18 +08:00
ptrace.h csky: Fixup calltrace panic 2020-05-13 17:55:06 +08:00
reg_ops.h csky: Misc headers 2018-10-26 00:54:26 +08:00
segment.h get rid of legacy 'get_ds()' function 2019-03-04 10:50:14 -08:00
shmparam.h csky: MMU and page table management 2018-10-25 23:36:19 +08:00
smp.h csky: CPU-hotplug supported for SMP 2018-12-31 23:03:53 +08:00
spinlock_types.h csky: Atomic operations 2018-10-26 00:54:23 +08:00
spinlock.h csky: Atomic operations 2018-10-26 00:54:23 +08:00
stackprotector.h csky: Initial stack protector support 2020-02-21 15:43:24 +08:00
string.h csky: Library functions 2018-10-26 00:54:24 +08:00
switch_to.h csky: Process management and Signal 2018-10-26 00:54:13 +08:00
syscall.h arch/csky patches for 5.2-rc1 2019-05-08 11:41:08 -07:00
syscalls.h csky: System Call 2018-10-25 23:36:19 +08:00
tcm.h csky: Tightly-Coupled Memory or Sram support 2020-02-21 15:43:24 +08:00
thread_info.h csky: Fixup gdbmacros.txt with name sp in thread_struct 2020-05-15 00:16:18 +08:00
tlb.h csky: add missing brackets in a macro for tlb.h 2019-09-07 04:28:05 -03:00
tlbflush.h csky: Cache and TLB routines 2018-10-25 23:36:19 +08:00
traps.h csky: Exception handling and mm-fault 2018-10-25 23:36:19 +08:00
uaccess.h csky: Fixup raw_copy_from_user() 2020-05-15 00:16:30 +08:00
unistd.h csky: Update syscall_trace_enter/exit implementation 2019-04-22 13:44:57 +08:00
uprobes.h csky: Add uprobes support 2020-04-03 11:37:51 +08:00
vdso.h csky: VDSO and rt_sigreturn 2018-10-26 00:54:22 +08:00
vmalloc.h mm/vmalloc: Add empty <asm/vmalloc.h> headers and use them from <linux/vmalloc.h> 2019-12-10 10:12:55 +01:00