linux_dsm_epyc7002/arch/um/sys-i386
Paul Pluzhnikov 13e165baf6 uml: kernels on {i386,x86_64} produce bad coredumps
One of our users reported that when a user-level program SIGSEGVs under
UML kernel, the resulting core dump is not very usable.

I have reproduced that with the latest kernel:

  make ARCH=um defconfig; make ARCH=um

Run the resulting kernel, then "inside" run this program:

#include <pthread.h>

void *fn(void *p)
{
 abort();
}

int main()
{
 pthread_t tid;
 pthread_create(&tid, 0, fn, 0);
 pthread_join(tid, 0);
 return 0;
}

Analyze the coredump with GDB. Here is what you'll see:

sudo gdb -q -ex 'set solib-absolute-prefix ../root_fs' -ex 'file ../root_fs/var/tmp/mt-abort' -ex 'core ../root_fs/var/tmp/core.762'
Reading symbols from /usr/local/google/root_fs/var/tmp/mt-abort...done.
[New Thread 763]
[New Thread 762]
Core was generated by `./mt-abort'.
Program terminated with signal 6, Aborted.
#0  0x0000000040255250 in raise () from ../root_fs/lib64/libc.so.6
(gdb) info thread
  2 Thread 762  0x0000000000000000 in ?? ()
* 1 Thread 763  0x0000000040255250 in raise () from ../root_fs/lib64/libc.so.6

Note that thread#2 looks funny.

(gdb) thread 2
[Switching to thread 2 (Thread 762)]#0  0x0000000000000000 in ?? ()
(gdb) info reg
rax            0x0      0
rbx            0x0      0
rcx            0x0      0
rdx            0x0      0
rsi            0x0      0
rdi            0x0      0
rbp            0x0      0x0
rsp            0x0      0x0
r8             0x0      0
r9             0x0      0
r10            0x0      0
r11            0x0      0
r12            0x0      0
r13            0x0      0
r14            0x0      0
r15            0x0      0
rip            0x0      0
eflags         0x0      [ ]
cs             0x0      0
ss             0x0      0
ds             0x0      0
es             0x0      0
fs             0x0      0
gs             0x0      0

Examining the core shows that NT_PRSTATUS notes for all threads other than
the one that crashed are zeroed out.

I believe this is happening because neither ELF_CORE_COPY_TASK_REGS nor
task_pt_regs are defined under ARCH=um, and so elf_core_copy_task_regs()
becomes a no-op.

Attached patch fixes this for SUBARCH={x86_64,i386}.

Signed-off-by: Paul Pluzhnikov <ppluzhnikov@google.com>
Cc: Jeff Dike <jdike@addtoit.com>
Acked-by: WANG Cong <xiyou.wangcong@gmail.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2011-03-22 17:44:09 -07:00
..
asm uml: kernels on {i386,x86_64} produce bad coredumps 2011-03-22 17:44:09 -07:00
shared/sysdep Add generic sys_old_mmap() 2010-03-12 15:52:32 -08:00
bug.c uml: header untangling 2008-02-05 09:44:28 -08:00
bugs.c UML: make several more things static 2008-07-24 10:47:24 -07:00
checksum.S UML: make several more things static 2008-07-24 10:47:24 -07:00
delay.c [PATCH] uml: fix unreasonably long udelay 2007-04-02 10:06:08 -07:00
elfcore.c elf coredump: add extended numbering support 2010-03-06 11:26:46 -08:00
fault.c uml: style fixes pass 3 2007-10-16 09:43:05 -07:00
ksyms.c Generic semaphore implementation 2008-04-17 10:42:34 -04:00
ldt.c include cleanup: Update gfp.h and slab.h includes to prepare for breaking implicit slab.h inclusion from percpu.h 2010-03-30 22:02:32 +09:00
Makefile elf coredump: replace ELF_CORE_EXTRA_* macros by functions 2010-03-06 11:26:45 -08:00
ptrace_user.c uml: style fixes in arch/um/sys-x86_64 2008-02-05 09:44:31 -08:00
ptrace.c ptrace: change signature of arch_ptrace() 2010-10-27 18:03:10 -07:00
setjmp.S
signal.c uml: fix a compile error 2008-10-20 08:52:34 -07:00
stub_segv.c uml: simplify SIGSEGV handling 2008-02-05 09:44:28 -08:00
stub.S uml: fix a section warning 2009-06-16 19:47:47 -07:00
sys_call_table.S Add generic sys_old_mmap() 2010-03-12 15:52:32 -08:00
syscalls.c Add generic sys_ipc wrapper 2010-03-12 15:52:32 -08:00
sysrq.c
tls.c uml: host TLS diagnostics 2008-02-05 09:44:27 -08:00
user-offsets.c uml: tidy ptrace interface 2008-05-13 08:02:21 -07:00