[PATCH/7.10,2/2] gdbserver: Fix non-stop / fork / step-over issues

Message ID 1438362229-27653-3-git-send-email-palves@redhat.com
State New, archived
Headers

Commit Message

Pedro Alves July 31, 2015, 5:03 p.m. UTC
  Ref: https://sourceware.org/ml/gdb-patches/2015-07/msg00868.html

This adds a test that has a multithreaded program have several threads
continuously fork, while another thread continuously steps over a
breakpoint.

This exposes several intertwined issues, which this patch addresses:

 - When we're stopping and suspending threads, some thread may fork,
   and we missed setting the fork child's suspend count to 1, like we
   do when a new clone/thread is detected (see the model sketched
   after this list).  When we next unsuspend threads, the fork child's
   suspend count goes below 0, which is bogus and fails an assertion.

 - If a step-over is cancelled because a signal arrives, but gdb
   is not interested in the signal, we pass the signal straight back
   to the inferior.  However, we miss that we need to unsuspend and
   re-resume all the other threads that had been paused for the
   step-over.  As a result, those threads end up stuck stopped
   indefinitely.

 - OTOH, if a thread exits the whole process just while we're stopping
   threads to start a step-over, gdbserver crashes or hits several
   assertions.

 - If a detach request comes in just while gdbserver is handling a
   step-over (in the test at hand, this is GDB detaching the fork
   child), gdbserver internal-errors in linux_stabilize_threads's
   helpers, which assert that all threads' suspend counts are 0
   (otherwise we wouldn't be able to move threads out of the jump
   pads).  The suspend counts aren't 0 while a step-over is in
   progress, because all threads but the one stepping past the
   breakpoint must remain paused until the step-over finishes and the
   breakpoint can be reinserted.

 - Occasionally, we see "BAD - reinserting but not stepping." being
   output (from within linux_resume_one_lwp_throw).  That happened
   because GDB poking memory while gdbserver is busy with a step-over
   pauses all threads and then re-resumes them with proceed_one_lwp,
   which missed one reason to tell linux_resume_one_lwp that a thread
   must be set back to stepping: the thread is the one being stepped
   past the breakpoint (its bp_reinsert is set).

 - In a couple places, we were resuming threads that are meant to be
   suspended.  E.g., when a vCont;c/s request for thread B comes in
   just while gdbserver is stepping thread A past a breakpoint.  The
   resume for thread B must be deferred until the step-over finishes.

 - The test runs with both "set detach-on-fork" on and off.  When off,
   it exercises the case of GDB detaching the fork child explicitly.
   When on, it exercises the case of gdb resuming the child
   explicitly.  In the "off" case, gdb seems to become exponentially
   slower as new inferiors are created.  This is _very_ noticeable, as
   with only 100 inferiors gdb is already crawling, which makes the
   test take quite a while to run.  For that reason, I've disabled the
   "off" variant for now.

 - The test fails occasionally with the native target, because several
   code paths aren't expecting that a stopped thread may disappear
   (the test has the leader thread of the parent process exit the
   whole process, just while gdb is handling an event for a non-leader
   thread).  E.g.:

    [Thread 0x7ffff67bd700 (LWP 11210) exited]
    Cannot find user-level thread for LWP 11217: generic error
    (gdb) FAIL: gdb.threads/fork-plus-threads-2.exp: detach-on-fork=on: inferior 1 exited
    [Thread 0x7ffff7fc1740 (LWP 11203) exited]
    info threads
      Id   Target Id         Frame
      12   Thread 0x7ffff2fb6700 (LWP 11217) (running)

    The current thread <Thread ID 1> has terminated.  See `help thread'.
    (gdb) FAIL: gdb.threads/fork-plus-threads-2.exp: detach-on-fork=on: no threads left

   I fixed some of these issues recently, but there's a lot more to
   do.  Fixing that one just exposes other similar problems elsewhere.
   Meanwhile, I've filed PR18749 and kfailed the test for native.
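
For reference, below is a minimal stand-alone model (not gdbserver
code; names and structure are simplified) of the suspend-count
invariant the first bullet describes: a fork child detected while
we're suspending threads must start out suspended too, otherwise the
later matching unsuspend pushes its count below zero.  The real fixes
are in handle_extended_wait and the new lwp_suspended_inc /
lwp_suspended_decr helpers in the patch below.

/* Stand-alone model, NOT gdbserver code: suspend-count bookkeeping
   around a step-over, with a fork child appearing mid-stop.  */

#include <assert.h>
#include <stdio.h>

struct lwp { int suspended; };

static void
suspend (struct lwp *l)
{
  l->suspended++;
}

static void
unsuspend (struct lwp *l)
{
  l->suspended--;
  /* gdbserver asserts the count never goes negative.  */
  assert (l->suspended >= 0);
}

int
main (void)
{
  struct lwp other = { 0 };
  struct lwp fork_child = { 0 };

  /* A step-over starts: every other thread is stopped and suspended.  */
  suspend (&other);

  /* A fork event is reported while threads are being suspended.  The
     fix gives the new child a suspend count of 1 as well; without it
     the child would stay at 0 here.  */
  fork_child.suspended = 1;

  /* The step-over finishes: everything else is unsuspended.  */
  unsuspend (&other);
  unsuspend (&fork_child);	/* would abort if the child had started at 0 */

  printf ("other=%d, fork_child=%d\n", other.suspended, fork_child.suspended);
  return 0;
}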

gdb/ChangeLog:
2015-07-31  Pedro Alves  <palves@redhat.com>

	PR gdb/18749
	* target/waitstatus.h (enum target_stop_reason)
	<TARGET_STOPPED_BY_SINGLE_STEP>: New value.

gdb/gdbserver/ChangeLog:
2015-07-31  Pedro Alves  <palves@redhat.com>

	PR gdb/18749
	* linux-low.c (handle_extended_wait): Set the fork child's suspend
	count if stopping and suspending threads.
	(check_stopped_by_breakpoint): If stopped by trace, set the LWP's
	stop reason to TARGET_STOPPED_BY_SINGLE_STEP.
	(linux_detach): Complete an ongoing step-over.
	(lwp_suspended_inc, lwp_suspended_decr): New functions.  Use
	throughout.
	(resume_stopped_resumed_lwps): Don't resume a suspended thread.
	(linux_wait_1): If passing a signal to the inferior after
	finishing a step-over, unsuspend and re-resume all lwps.  If we
	see a single-step event but the thread should be continuing, don't
	pass the trap to gdb.
	(stuck_in_jump_pad_callback, move_out_of_jump_pad_callback): Use
	internal_error instead of gdb_assert.
	(enqueue_pending_signal): New function.
	(check_ptrace_stopped_lwp_gone): Add debug output.
	(start_step_over): Handle the case of the LWP we're about to
	step-over exiting.  Use internal_error instead of gdb_assert.
	(complete_ongoing_step_over): New function.
	(linux_resume_one_thread): Don't resume a suspended thread.
	(proceed_one_lwp): If the LWP is stepping over a breakpoint, reset
	it stepping.
	(proceed_all_lwps): If a step-over fails to start, look for
	another thread that might need a step-over.

gdb/testsuite/ChangeLog:
2015-07-31  Pedro Alves  <palves@redhat.com>

	PR gdb/18749
	* gdb.threads/fork-plus-threads-2.exp: New file.
	* gdb.threads/fork-plus-threads-2.c: New file.
---
 gdb/gdbserver/linux-low.c                         | 267 +++++++++++++++++++---
 gdb/target/waitstatus.h                           |   5 +-
 gdb/testsuite/gdb.threads/fork-plus-threads-2.c   | 129 +++++++++++
 gdb/testsuite/gdb.threads/fork-plus-threads-2.exp | 116 ++++++++++
 4 files changed, 480 insertions(+), 37 deletions(-)
 create mode 100644 gdb/testsuite/gdb.threads/fork-plus-threads-2.c
 create mode 100644 gdb/testsuite/gdb.threads/fork-plus-threads-2.exp
  

Comments

Don Breazeal July 31, 2015, 6:04 p.m. UTC | #1
On 7/31/2015 10:03 AM, Pedro Alves wrote:
> Ref: https://sourceware.org/ml/gdb-patches/2015-07/msg00868.html
> 
> This adds a test that has a multithreaded program have several threads
> continuously fork, while another thread continuously steps over a
> breakpoint.

Wow.

> 
> This exposes several intertwined issues, which this patch addresses:
> 
Thanks again for digging into these issues.

---snip---
> 
>  - The test runs with both "set detach-on-fork" on and off.  When off,
>    it exercises the case of GDB detaching the fork child explicitly.
>    When on, it exercises the case of gdb resuming the child
>    explicitly.  In the "off" case, gdb seems to exponentially become
>    slower as new inferiors are created.  This is _very_ noticeable as
>    with only 100 inferiors gdb is crawling already, which makes the
>    test take quite a bit to run.  For that reason, I've disabled the
>    "off" variant for now.

Bummer.  I was going to ask whether this use-case justifies disabling
the feature completely, but since the whole follow-fork mechanism is of
limited usefulness without exec events, the question is likely moot
anyway.

Do you have any thoughts about whether this slowdown is caused by the
fork event machinery or by some more general gdbserver multiple
inferior problem?

Are you planning to look at the slowdown?  Can I help out?  I have an
interest in having detach-on-fork 'off' enabled.  :-S

thanks
--Don
  
Pedro Alves July 31, 2015, 7:02 p.m. UTC | #2
On 07/31/2015 07:04 PM, Don Breazeal wrote:
> On 7/31/2015 10:03 AM, Pedro Alves wrote:
>> Ref: https://sourceware.org/ml/gdb-patches/2015-07/msg00868.html
>>
>> This adds a test that has a multithreaded program have several threads
>> continuously fork, while another thread continuously steps over a
>> breakpoint.
> 
> Wow.
> 

If gdb survives these stress tests, it can hold up to anything.  :-)

>>  - The test runs with both "set detach-on-fork" on and off.  When off,
>>    it exercises the case of GDB detaching the fork child explicitly.
>>    When on, it exercises the case of gdb resuming the child
>>    explicitly.  In the "off" case, gdb seems to exponentially become
>>    slower as new inferiors are created.  This is _very_ noticeable as
>>    with only 100 inferiors gdb is crawling already, which makes the
>>    test take quite a bit to run.  For that reason, I've disabled the
>>    "off" variant for now.
> 
> Bummer.  I was going to ask whether this use-case justifies disabling
> the feature completely, 

Note that this, being a stress test, may not be representative of a
real workload.  I'm assuming most real use cases won't be
so demanding.

> but since the whole follow-fork mechanism is of
> limited usefulness without exec events, the question is likely moot
> anyway.

Yeah.  There are use cases with fork alone, but combined with exec it
is much more useful.  I'll take a look at your exec patches soon; I'm
very much looking forward to having that in.

> 
> Do you have any thoughts about whether this slowdown is caused by the
> fork event machinery or by some more general gdbserver multiple
> inferior problem?

Not sure.

The number of forks live at a given time in the test is constant
-- each thread forks and waits for the child to exit before it forks
again.  But if you run the test, you see that the first
few inferiors are created quickly, and then, as the inferior number
grows, new inferiors are added more and more slowly.
I'd suspect the problem to be on the gdb side.  But the test
fails on native too, so it's not easy to get gdbserver out of
the picture for a quick check.

It feels like some data structures are leaking, but
still reachable, and then a bunch of linear walks end up costing
more and more.  I once added the prune_inferiors call at the end
of normal_stop to handle a slowdown like this.  It feels like
something similar to that.

With detach "on" alone, it takes under 2 seconds against gdbserver
for me.

If I remove the breakpoint from the test, and re-enable both detach on/off,
it ends in around 10-20 seconds.  That's still a lot slower
than "detach on" alone, but gdb has to insert/remove breakpoints in the
child and load its symbols (well, it could avoid that, given the
child is a clone of the parent, but we're not there yet), so
not entirely unexpected.

But pristine, with both detach on/off, it takes almost 2 minutes
here.  ( and each thread only spawns 10 forks, my first attempt
was shooting for 100 :-) )

I also suspected all the thread stopping/restarting gdbserver does,
both to step over breakpoints and to insert/remove breakpoints.
But then again, with detach on there are 12 threads, and with detach
off at most 22.  So that'd be odd.  Unless the data structure
leaks are on gdbserver's side.  But then I'd think that tests
like attach-many-short-lived-threads.exp or non-stop-fair-events.exp
would have already exposed something like that.

> 
> Are you planning to look at the slowdown?  

Nope, at least not in the immediate future.

> Can I help out?  I have an
> interest in having detach-on-fork 'off' enabled.  :-S

That'd be much appreciated.  :-)  At least identifying the
culprit would be very nice.  I too would love for our
multi-process support to be rock solid.

Thanks,
Pedro Alves
  
Yao Qi Aug. 3, 2015, 3:14 p.m. UTC | #3
Pedro Alves <palves@redhat.com> writes:

>    I fixed some of these issues recently, but there's a lot more to
>    do.  Fixing that one just exposes other similar problems elsewhere.
>    Meanwhile, I've filed PR18749 and kfailed the test for native.

This test case also exposes an issue on arm-linux with gdbserver:

(gdb) PASS: gdb.threads/fork-plus-threads-2.exp: detach-on-fork=on: continue &
[New Thread 29905]^M
[New Thread 29900]^M
[New Thread 29895]^M
[New Thread 29898]^M
[New Thread 29902]^M
[New Thread 29896]^M
[New Thread 29903]^M
[New Thread 29901]^M
[New Thread 29899]^M
[New Thread 29904]^M
[New Thread 29897]^M
Error in testing breakpoint condition:^M
Cannot access memory at address 0x11094^M
FAIL: gdb.threads/fork-plus-threads-2.exp: detach-on-fork=on: inferior 1 exited
Remote debugging from host 127.0.0.1^M
^M
Child exited with status 0^M
GDBserver exiting^M
../../binutils-gdb/gdb/thread.c:936: internal-error: finish_thread_state: Assertion `tp' failed.^M
A problem internal to GDB has been detected,^M
further debugging may prove unreliable.^M
Quit this debugging session? (y or n) monitor exit^M
Please answer y or n.^M
../../binutils-gdb/gdb/thread.c:936: internal-error: finish_thread_state: Assertion `tp' failed.^
  
Pedro Alves Aug. 3, 2015, 4:20 p.m. UTC | #4
On 08/03/2015 04:14 PM, Yao Qi wrote:
> Pedro Alves <palves@redhat.com> writes:
> 
>>    I fixed some of these issues recently, but there's a lot more to
>>    do.  Fixing that one just exposes other similar problems elsewhere.
>>    Meanwhile, I've filed PR18749 and kfailed the test for native.
> 
> This test case also exposes the issue on arm-linux with gdbserver,
> 
> (gdb) PASS: gdb.threads/fork-plus-threads-2.exp: detach-on-fork=on: continue &
> [New Thread 29905]^M
> [New Thread 29900]^M
> [New Thread 29895]^M
> [New Thread 29898]^M
> [New Thread 29902]^M
> [New Thread 29896]^M
> [New Thread 29903]^M
> [New Thread 29901]^M
> [New Thread 29899]^M
> [New Thread 29904]^M
> [New Thread 29897]^M
> Error in testing breakpoint condition:^M
> Cannot access memory at address 0x11094^M
> FAIL: gdb.threads/fork-plus-threads-2.exp: detach-on-fork=on: inferior 1 exited

Ah, OK, so I guess it'll fail on all targets where gdbserver
does not handle the breakpoint condition server-side.

> Remote debugging from host 127.0.0.1^M
> ^M
> Child exited with status 0^M
> GDBserver exiting^M
> ../../binutils-gdb/gdb/thread.c:936: internal-error: finish_thread_state: Assertion `tp' failed.^M
> A problem internal to GDB has been detected,^M
> further debugging may prove unreliable.^M
> Quit this debugging session? (y or n) monitor exit^M
> Please answer y or n.^M
> ../../binutils-gdb/gdb/thread.c:936: internal-error: finish_thread_state: Assertion `tp' failed.^

I've seen this too while developing the patch.  I saw it happen when the
connection is abruptly closed while there's a finish_thread_state cleanup
installed (while gdb is handling an event, within fetch_inferior_event).  The
connection close deletes all threads, and the cleanup then tries to finish the
state of a now-nonexistent thread.  See [palves/fork_stale_running] on my github.
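
If it helps, here's a tiny stand-alone illustration (NOT gdb source,
just a model of the sequence above): the cleanup remembers which
thread to finish, the connection close deletes all threads, and the
cleanup then fails to find it, mirroring the thread.c assertion.
Running it aborts at the assert.

/* Stand-alone illustration, NOT gdb source.  */

#include <assert.h>
#include <stddef.h>

struct thread_info { int num; int executing; };

static struct thread_info *thread_list[2];

static struct thread_info *
find_thread (int num)
{
  for (int i = 0; i < 2; i++)
    if (thread_list[i] != NULL && thread_list[i]->num == num)
      return thread_list[i];
  return NULL;
}

static void
connection_closed (void)
{
  /* Models the connection close deleting every thread.  */
  thread_list[0] = thread_list[1] = NULL;
}

static void
finish_thread_state (int num)
{
  struct thread_info *tp = find_thread (num);

  /* Models the assertion that fires in thread.c.  */
  assert (tp != NULL);
  tp->executing = 0;
}

int
main (void)
{
  struct thread_info t1 = { 1, 1 }, t2 = { 2, 1 };

  thread_list[0] = &t1;
  thread_list[1] = &t2;

  /* gdb starts handling an event for thread 2 and installs a cleanup
     that will call finish_thread_state (2) when done.  */
  int cleanup_for = 2;

  connection_closed ();			/* connection drops mid-event */
  finish_thread_state (cleanup_for);	/* cleanup runs: thread is gone */
  return 0;
}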

Thanks,
Pedro Alves
  
Don Breazeal Aug. 5, 2015, 10:19 p.m. UTC | #5
On 7/31/2015 12:02 PM, Pedro Alves wrote:
> On 07/31/2015 07:04 PM, Don Breazeal wrote:
>> On 7/31/2015 10:03 AM, Pedro Alves wrote:
>>> Ref: https://sourceware.org/ml/gdb-patches/2015-07/msg00868.html
>>>
>>> This adds a test that has a multithreaded program have several threads
>>> continuously fork, while another thread continuously steps over a
>>> breakpoint.
>>
>> Wow.
>>
> 
> If gdb survives these stress tests, it can hold up to anything.  :-)
> 
>>>  - The test runs with both "set detach-on-fork" on and off.  When off,
>>>    it exercises the case of GDB detaching the fork child explicitly.
>>>    When on, it exercises the case of gdb resuming the child
>>>    explicitly.  In the "off" case, gdb seems to exponentially become
>>>    slower as new inferiors are created.  This is _very_ noticeable as
>>>    with only 100 inferiors gdb is crawling already, which makes the
>>>    test take quite a bit to run.  For that reason, I've disabled the
>>>    "off" variant for now.
>>
>> Bummer.  I was going to ask whether this use-case justifies disabling
>> the feature completely, 
> 
> Note that this being a stress test, may not be representative of a
> real work load.  I'm assuming most real use cases won't be
> so demanding.
> 
>> but since the whole follow-fork mechanism is of
>> limited usefulness without exec events, the question is likely moot
>> anyway.
> 
> Yeah.  There are use cases with fork alone, but combined with exec is
> much more useful.  I'll take a look at your exec patches soon; I'm very
> much looking forward to have that in.
> 
>>
>> Do you have any thoughts about whether this slowdown is caused by the
>> fork event machinery or by some more general gdbserver multiple
>> inferior problem?
> 
> Not sure.
> 
> The number of forks live at a given time in the test is constant
> -- each thread forks and waits for the child to exit until it forks
> again.   But if you run the test, you see that the first
> few inferiors are created quickly, and then as the inferior number
> grows, new inferiors are added at a slower and slower.
> I'd suspect the problem to be on the gdb side.  But the test
> fails on native, so it's not easy to get gdbserver out of
> the picture for a quick check.
> 
> It feels like some data structures are leaking, but
> still reacheable, and then a bunch of linear walks end up costing
> more and more.  I once added the prune_inferiors call at the end
> of normal_stop to handle a slowdown like this.  It feels like
> something similar to that.
> 
> With detach "on" alone, it takes under 2 seconds against gdbserver
> for me.
> 
> If I remove the breakpoint from the test, and reenable both detach on/off,
> it ends in around 10-20 seconds.  That's still a lot slower
> than "detach on" along, but gdb has to insert/remove breakpoints in the
> child and load its symbols (well, it could avoid that, given the
> child is a clone of the parent, but we're not there yet), so
> not entirely unexpected.
> 
> But pristine, with both detach on/off, it takes almost 2 minutes
> here.  ( and each thread only spawns 10 forks, my first attempt
> was shooting for 100 :-) )
> 
> I also suspected all the thread stop/restarting gdbserver does
> both to step over breakpoints, and to insert/remove breakpoints.
> But then again with detach on, there are 12 threads, with detach
> off, at most 22.  So that'd be odd.  Unless the data structure
> leaks are on gdbserver's side.  But then I'd think that tests
> like attach-many-short-lived-threads.exp or non-stop-fair-events.exp
> would have already exposed something like that.
> 
>>
>> Are you planning to look at the slowdown?  
> 
> Nope, at least not in the immediate future.
> 
>> Can I help out?  I have an
>> interest in having detach-on-fork 'off' enabled.  :-S
> 
> That'd be much appreciated.  :-)  At least identifying the
> culprit would be very nice.  I too would love for our
> multi-process support to be rock solid.
> 

Hi Pedro,
I spent some time looking at this, and I found at least one of the
culprits affecting performance.  Without going through the details of
how I arrived at this conclusion, if I insert

    gdb_test_no_output "set sysroot /"

just before the call to runto_main, it cuts the wall clock time by at
least half.  Running with just the 'detach-on-fork=off' case, it went
from 41 secs to 20 secs on one system, and 1:21 to 0:27 and 1:50 to 0:41
on another.  Successive runs without set sysroot resulted in
successively decreasing run times, presumably due to filesystem caching.

I ran strace -cw to collect wall clock time (strace 4.9 and above
support '-w' for wall time), and saw this:

Without set sysroot /:
% time     seconds  usecs/call     calls    errors syscall^M
------ ----------- ----------- --------- --------- ----------------^M
 25.90   14.620339           4   3666141       202 ptrace^M
 25.21   14.229421          81    175135        57 select^M
 14.42    8.139715          13    641874         7 write^M
 10.65    6.012699           4   1397576    670469 read^M
  7.52    4.245209           4   1205014       104 wait4^M
  4.90    2.765111           3    847985           rt_sigprocmask^M

With set sysroot /:
% time     seconds  usecs/call     calls    errors syscall^M
------ ----------- ----------- --------- --------- ----------------^M
 32.91    6.885008         148     46665        43 select^M
 21.59    4.516311           4   1158530       202 ptrace^M
 11.15    2.332491          13    184229         2 write^M
  9.07    1.897401           4    422122    203552 read^M
  6.77    1.415918          42     34076        53 open^M
  6.27    1.312490           3    378702       103 wait4^M
  4.00    0.835731           3    262195           rt_sigprocmask^M

The number of calls and times for each case varied from run to run,
but the relative proportions stayed reasonably similar.  I'm not sure
why the unmodified case has so many more calls to ptrace, but it was
not an anomaly; I saw this in multiple runs.

Note that I used the original version of the test that you posted, not
the update on your branch.  Also, I didn't make the set sysroot command
conditional on running with a remote or gdbserver target, since it was
just an experiment.

Do you think there is more to the slowdown than this?  As you said
above, detach-on-fork 'off' is going to take longer than 'on'.  It may
be a little while before I can get back to this, so I thought I'd share
what I found. Let me know if you think this change will be sufficient.

thanks
--Don
  

Patch

diff --git a/gdb/gdbserver/linux-low.c b/gdb/gdbserver/linux-low.c
index 4cdedb4..66d57ec 100644
--- a/gdb/gdbserver/linux-low.c
+++ b/gdb/gdbserver/linux-low.c
@@ -268,6 +268,8 @@  static int lwp_is_marked_dead (struct lwp_info *lwp);
 static void proceed_all_lwps (void);
 static int finish_step_over (struct lwp_info *lwp);
 static int kill_lwp (unsigned long lwpid, int signo);
+static void enqueue_pending_signal (struct lwp_info *lwp, int signal, siginfo_t *info);
+static void complete_ongoing_step_over (void);
 
 /* When the event-loop is doing a step-over, this points at the thread
    being stepped.  */
@@ -486,6 +488,15 @@  handle_extended_wait (struct lwp_info *event_lwp, int wstat)
 	  child_thr->last_resume_kind = resume_stop;
 	  child_thr->last_status.kind = TARGET_WAITKIND_STOPPED;
 
+	  /* If we're suspending all threads, leave this one suspended
+	     too.  */
+	  if (stopping_threads == STOPPING_AND_SUSPENDING_THREADS)
+	    {
+	      if (debug_threads)
+		debug_printf ("HEW: leaving child suspended\n");
+	      child_lwp->suspended = 1;
+	    }
+
 	  parent_proc = get_thread_process (event_thr);
 	  child_proc->attached = parent_proc->attached;
 	  clone_all_breakpoints (&child_proc->breakpoints,
@@ -685,6 +696,8 @@  check_stopped_by_breakpoint (struct lwp_info *lwp)
 		  debug_printf ("CSBB: %s stopped by trace\n",
 				target_pid_to_str (ptid_of (thr)));
 		}
+
+	      lwp->stop_reason = TARGET_STOPPED_BY_SINGLE_STEP;
 	    }
 	}
     }
@@ -1315,6 +1328,11 @@  linux_detach (int pid)
   if (process == NULL)
     return -1;
 
+  /* As there's a step over already in progress, let it finish first,
+     otherwise nesting a stabilize_threads operation on top gets real
+     messy.  */
+  complete_ongoing_step_over ();
+
   /* Stop all threads before detaching.  First, ptrace requires that
      the thread is stopped to sucessfully detach.  Second, thread_db
      may need to uninstall thread event breakpoints from memory, which
@@ -1683,6 +1701,39 @@  not_stopped_callback (struct inferior_list_entry *entry, void *arg)
   return 0;
 }
 
+/* Increment LWP's suspend count.  */
+
+static void
+lwp_suspended_inc (struct lwp_info *lwp)
+{
+  lwp->suspended++;
+
+  if (debug_threads && lwp->suspended > 4)
+    {
+      struct thread_info *thread = get_lwp_thread (lwp);
+
+      debug_printf ("LWP %ld has a suspiciously high suspend count,"
+		    " suspended=%d\n", lwpid_of (thread), lwp->suspended);
+    }
+}
+
+/* Decrement LWP's suspend count.  */
+
+static void
+lwp_suspended_decr (struct lwp_info *lwp)
+{
+  lwp->suspended--;
+
+  if (lwp->suspended < 0)
+    {
+      struct thread_info *thread = get_lwp_thread (lwp);
+
+      internal_error (__FILE__, __LINE__,
+		      "unsuspend LWP %ld, suspended=%d\n", lwpid_of (thread),
+		      lwp->suspended);
+    }
+}
+
 /* This function should only be called if the LWP got a SIGTRAP.
 
    Handle any tracepoint steps or hits.  Return true if a tracepoint
@@ -1700,7 +1751,7 @@  handle_tracepoints (struct lwp_info *lwp)
      uninsert tracepoints.  To do this, we temporarily pause all
      threads, unpatch away, and then unpause threads.  We need to make
      sure the unpausing doesn't resume LWP too.  */
-  lwp->suspended++;
+  lwp_suspended_inc (lwp);
 
   /* And we need to be sure that any all-threads-stopping doesn't try
      to move threads out of the jump pads, as it could deadlock the
@@ -1716,7 +1767,7 @@  handle_tracepoints (struct lwp_info *lwp)
      actions.  */
   tpoint_related_event |= tracepoint_was_hit (tinfo, lwp->stop_pc);
 
-  lwp->suspended--;
+  lwp_suspended_decr (lwp);
 
   gdb_assert (lwp->suspended == 0);
   gdb_assert (!stabilizing_threads || lwp->collecting_fast_tracepoint);
@@ -2176,10 +2227,13 @@  linux_low_filter_event (int lwpid, int wstat)
 
   /* Note that TRAP_HWBKPT can indicate either a hardware breakpoint
      or hardware watchpoint.  Check which is which if we got
-     TARGET_STOPPED_BY_HW_BREAKPOINT.  */
+     TARGET_STOPPED_BY_HW_BREAKPOINT.  Likewise, we may have single
+     stepped an instruction that triggered a watchpoint.  In that
+     case, on some architectures (such as x86), instead of
+     TRAP_HWBKPT, si_code indicates TRAP_TRACE, and we need to check
+     the debug registers separately.  */
   if (WIFSTOPPED (wstat) && WSTOPSIG (wstat) == SIGTRAP
-      && (child->stop_reason == TARGET_STOPPED_BY_NO_REASON
-	  || child->stop_reason == TARGET_STOPPED_BY_HW_BREAKPOINT))
+      && child->stop_reason != TARGET_STOPPED_BY_SW_BREAKPOINT)
     check_stopped_by_watchpoint (child);
 
   if (!have_stop_pc)
@@ -2238,6 +2292,7 @@  resume_stopped_resumed_lwps (struct inferior_list_entry *entry)
   struct lwp_info *lp = get_thread_lwp (thread);
 
   if (lp->stopped
+      && !lp->suspended
       && !lp->status_pending_p
       && thread->last_resume_kind != resume_stop
       && thread->last_status.kind == TARGET_WAITKIND_IGNORE)
@@ -2608,9 +2663,7 @@  unsuspend_one_lwp (struct inferior_list_entry *entry, void *except)
   if (lwp == except)
     return 0;
 
-  lwp->suspended--;
-
-  gdb_assert (lwp->suspended >= 0);
+  lwp_suspended_decr (lwp);
   return 0;
 }
 
@@ -2703,7 +2756,7 @@  linux_stabilize_threads (void)
 	  lwp = get_thread_lwp (current_thread);
 
 	  /* Lock it.  */
-	  lwp->suspended++;
+	  lwp_suspended_inc (lwp);
 
 	  if (ourstatus.value.sig != GDB_SIGNAL_0
 	      || current_thread->last_resume_kind == resume_stop)
@@ -3089,8 +3142,25 @@  linux_wait_1 (ptid_t ptid,
 	info_p = &info;
       else
 	info_p = NULL;
-      linux_resume_one_lwp (event_child, event_child->stepping,
-			    WSTOPSIG (w), info_p);
+
+      if (step_over_finished)
+	{
+	  /* We cancelled this thread's step-over above.  We still
+	     need to unsuspend all other LWPs, and set then back
+	     running again while the signal handler runs.  */
+	  unsuspend_all_lwps (event_child);
+
+	  /* Enqueue the pending signal info so that proceed_all_lwps
+	     doesn't lose it.  */
+	  enqueue_pending_signal (event_child, WSTOPSIG (w), info_p);
+
+	  proceed_all_lwps ();
+	}
+      else
+	{
+	  linux_resume_one_lwp (event_child, event_child->stepping,
+				WSTOPSIG (w), info_p);
+	}
       return ignore_event (ourstatus);
     }
 
@@ -3111,8 +3181,15 @@  linux_wait_1 (ptid_t ptid,
 		   || (current_thread->last_resume_kind == resume_step
 		       && !in_step_range)
 		   || event_child->stop_reason == TARGET_STOPPED_BY_WATCHPOINT
-		   || (!step_over_finished && !in_step_range
-		       && !bp_explains_trap && !trace_event)
+		   || (!in_step_range
+		       && !bp_explains_trap
+		       && !trace_event
+		       /* A step-over was finished just now?  */
+		       && !step_over_finished
+		       /* A step-over had been finished previously,
+			  and the single-step was left pending?  */
+		       && !(current_thread->last_resume_kind == resume_continue
+			    && event_child->stop_reason == TARGET_STOPPED_BY_SINGLE_STEP))
 		   || (gdb_breakpoint_here (event_child->stop_pc)
 		       && gdb_condition_true_at_breakpoint (event_child->stop_pc)
 		       && gdb_no_commands_at_breakpoint (event_child->stop_pc))
@@ -3460,7 +3537,7 @@  suspend_and_send_sigstop_callback (struct inferior_list_entry *entry,
   if (lwp == except)
     return 0;
 
-  lwp->suspended++;
+  lwp_suspended_inc (lwp);
 
   return send_sigstop_callback (entry, except);
 }
@@ -3562,7 +3639,12 @@  stuck_in_jump_pad_callback (struct inferior_list_entry *entry, void *data)
   struct thread_info *thread = (struct thread_info *) entry;
   struct lwp_info *lwp = get_thread_lwp (thread);
 
-  gdb_assert (lwp->suspended == 0);
+  if (lwp->suspended != 0)
+    {
+      internal_error (__FILE__, __LINE__,
+		      "LWP %ld is suspended, suspended=%d\n",
+		      lwpid_of (thread), lwp->suspended);
+    }
   gdb_assert (lwp->stopped);
 
   /* Allow debugging the jump pad, gdb_collect, etc..  */
@@ -3581,7 +3663,12 @@  move_out_of_jump_pad_callback (struct inferior_list_entry *entry)
   struct lwp_info *lwp = get_thread_lwp (thread);
   int *wstat;
 
-  gdb_assert (lwp->suspended == 0);
+  if (lwp->suspended != 0)
+    {
+      internal_error (__FILE__, __LINE__,
+		      "LWP %ld is suspended, suspended=%d\n",
+		      lwpid_of (thread), lwp->suspended);
+    }
   gdb_assert (lwp->stopped);
 
   wstat = lwp->status_pending_p ? &lwp->status_pending : NULL;
@@ -3610,7 +3697,7 @@  move_out_of_jump_pad_callback (struct inferior_list_entry *entry)
       linux_resume_one_lwp (lwp, 0, 0, NULL);
     }
   else
-    lwp->suspended++;
+    lwp_suspended_inc (lwp);
 }
 
 static int
@@ -3665,6 +3752,24 @@  stop_all_lwps (int suspend, struct lwp_info *except)
     }
 }
 
+/* Enqueue one signal in the chain of signals which need to be
+   delivered to this process on next resume.  */
+
+static void
+enqueue_pending_signal (struct lwp_info *lwp, int signal, siginfo_t *info)
+{
+  struct pending_signals *p_sig;
+
+  p_sig = xmalloc (sizeof (*p_sig));
+  p_sig->prev = lwp->pending_signals;
+  p_sig->signal = signal;
+  if (info == NULL)
+    memset (&p_sig->info, 0, sizeof (siginfo_t));
+  else
+    memcpy (&p_sig->info, info, sizeof (siginfo_t));
+  lwp->pending_signals = p_sig;
+}
+
 /* Resume execution of LWP.  If STEP is nonzero, single-step it.  If
    SIGNAL is nonzero, give it that signal.  */
 
@@ -3906,6 +4011,10 @@  check_ptrace_stopped_lwp_gone (struct lwp_info *lp)
   /* Don't assume anything if /proc/PID/status can't be read.  */
   if (linux_proc_pid_is_trace_stopped_nowarn (lwpid_of (thread)) == 0)
     {
+      if (debug_threads)
+	debug_printf ("lwp %ld exited while being resumed\n",
+		      lwpid_of (thread));
+
       lp->stop_reason = TARGET_STOPPED_BY_NO_REASON;
       lp->status_pending_p = 0;
       return 1;
@@ -4189,16 +4298,36 @@  static int
 start_step_over (struct lwp_info *lwp)
 {
   struct thread_info *thread = get_lwp_thread (lwp);
+  ptid_t thread_ptid;
   struct thread_info *saved_thread;
   CORE_ADDR pc;
   int step;
 
+  thread_ptid = ptid_of (thread);
+
   if (debug_threads)
     debug_printf ("Starting step-over on LWP %ld.  Stopping all threads\n",
 		  lwpid_of (thread));
 
   stop_all_lwps (1, lwp);
-  gdb_assert (lwp->suspended == 0);
+
+  /* Re-find the LWP as it may have exited.  */
+  lwp = find_lwp_pid (thread_ptid);
+  if (lwp == NULL || lwp_is_marked_dead (lwp))
+    {
+      if (debug_threads)
+	debug_printf ("Step-over thread died "
+		      "(another thread exited the process?).\n");
+      unstop_all_lwps (1, lwp);
+      return 0;
+    }
+
+  if (lwp->suspended != 0)
+    {
+      internal_error (__FILE__, __LINE__,
+		      "LWP %ld suspended=%d\n", lwpid_of (thread),
+		      lwp->suspended);
+    }
 
   if (debug_threads)
     debug_printf ("Done stopping all threads for step-over.\n");
@@ -4229,7 +4358,19 @@  start_step_over (struct lwp_info *lwp)
 
   current_thread = saved_thread;
 
-  linux_resume_one_lwp (lwp, step, 0, NULL);
+  TRY
+    {
+      linux_resume_one_lwp_throw (lwp, step, 0, NULL);
+    }
+  CATCH (ex, RETURN_MASK_ERROR)
+    {
+      unstop_all_lwps (1, lwp);
+
+      if (!check_ptrace_stopped_lwp_gone (lwp))
+	throw_exception (ex);
+      return 0;
+    }
+  END_CATCH
 
   /* Require next event from this LWP.  */
   step_over_bkpt = thread->entry.id;
@@ -4270,6 +4411,39 @@  finish_step_over (struct lwp_info *lwp)
     return 0;
 }
 
+/* If there's a step over in progress, wait until all threads stop
+   (that is, until the stepping thread finishes its step), and
+   unsuspend all lwps.  The stepping thread ends with its status
+   pending, which is processed later when we get back to processing
+   events.  */
+
+static void
+complete_ongoing_step_over (void)
+{
+  if (!ptid_equal (step_over_bkpt, null_ptid))
+    {
+      struct lwp_info *lwp;
+      int wstat;
+      int ret;
+
+      if (debug_threads)
+	debug_printf ("detach: step over in progress, finish it first\n");
+
+      /* Passing NULL_PTID as filter indicates we want all events to
+	 be left pending.  Eventually this returns when there are no
+	 unwaited-for children left.  */
+      ret = linux_wait_for_event_filtered (minus_one_ptid, null_ptid,
+					   &wstat, __WALL);
+      gdb_assert (ret == -1);
+
+      lwp = find_lwp_pid (step_over_bkpt);
+      if (lwp != NULL)
+	finish_step_over (lwp);
+      step_over_bkpt = null_ptid;
+      unsuspend_all_lwps (lwp);
+    }
+}
+
 /* This function is called once per thread.  We check the thread's resume
    request, which will tell us whether to resume, step, or leave the thread
    stopped; and what signal, if any, it should be sent.
@@ -4344,13 +4518,16 @@  linux_resume_one_thread (struct inferior_list_entry *entry, void *arg)
     }
 
   /* If this thread which is about to be resumed has a pending status,
-     then don't resume any threads - we can just report the pending
-     status.  Make sure to queue any signals that would otherwise be
-     sent.  In all-stop mode, we do this decision based on if *any*
-     thread has a pending status.  If there's a thread that needs the
-     step-over-breakpoint dance, then don't resume any other thread
-     but that particular one.  */
-  leave_pending = (lwp->status_pending_p || leave_all_stopped);
+     then don't resume it - we can just report the pending status.
+     Likewise if it is suspended, because e.g., another thread is
+     stepping past a breakpoint.  Make sure to queue any signals that
+     would otherwise be sent.  In all-stop mode, we do this decision
+     based on if *any* thread has a pending status.  If there's a
+     thread that needs the step-over-breakpoint dance, then don't
+     resume any other thread but that particular one.  */
+  leave_pending = (lwp->suspended
+		   || lwp->status_pending_p
+		   || leave_all_stopped);
 
   if (!leave_pending)
     {
@@ -4533,7 +4710,23 @@  proceed_one_lwp (struct inferior_list_entry *entry, void *except)
       send_sigstop (lwp);
     }
 
-  step = thread->last_resume_kind == resume_step;
+  if (thread->last_resume_kind == resume_step)
+    {
+      if (debug_threads)
+	debug_printf ("   stepping LWP %ld, client wants it stepping\n",
+		      lwpid_of (thread));
+      step = 1;
+    }
+  else if (lwp->bp_reinsert != 0)
+    {
+      if (debug_threads)
+	debug_printf ("   stepping LWP %ld, reinsert set\n",
+		      lwpid_of (thread));
+      step = 1;
+    }
+  else
+    step = 0;
+
   linux_resume_one_lwp (lwp, step, 0, NULL);
   return 0;
 }
@@ -4547,8 +4740,7 @@  unsuspend_and_proceed_one_lwp (struct inferior_list_entry *entry, void *except)
   if (lwp == except)
     return 0;
 
-  lwp->suspended--;
-  gdb_assert (lwp->suspended >= 0);
+  lwp_suspended_decr (lwp);
 
   return proceed_one_lwp (entry, except);
 }
@@ -4569,19 +4761,22 @@  proceed_all_lwps (void)
 
   if (supports_breakpoints ())
     {
-      need_step_over
-	= (struct thread_info *) find_inferior (&all_threads,
-						need_step_over_p, NULL);
-
-      if (need_step_over != NULL)
+      while (1)
 	{
+	  need_step_over
+	    = (struct thread_info *) find_inferior (&all_threads,
+						    need_step_over_p, NULL);
+
+	  if (need_step_over == NULL)
+	    break;
+
 	  if (debug_threads)
 	    debug_printf ("proceed_all_lwps: found "
 			  "thread %ld needing a step-over\n",
 			  lwpid_of (need_step_over));
 
-	  start_step_over (get_thread_lwp (need_step_over));
-	  return;
+	  if (start_step_over (get_thread_lwp (need_step_over)))
+	    return;
 	}
     }
 
diff --git a/gdb/target/waitstatus.h b/gdb/target/waitstatus.h
index d4ef3b8..ffaddc1 100644
--- a/gdb/target/waitstatus.h
+++ b/gdb/target/waitstatus.h
@@ -131,7 +131,10 @@  enum target_stop_reason
   TARGET_STOPPED_BY_HW_BREAKPOINT,
 
   /* Stopped by a watchpoint.  */
-  TARGET_STOPPED_BY_WATCHPOINT
+  TARGET_STOPPED_BY_WATCHPOINT,
+
+  /* Stopped by a single step finishing.  */
+  TARGET_STOPPED_BY_SINGLE_STEP
 };
 
 /* Prototypes */
diff --git a/gdb/testsuite/gdb.threads/fork-plus-threads-2.c b/gdb/testsuite/gdb.threads/fork-plus-threads-2.c
new file mode 100644
index 0000000..b66d17eb7
--- /dev/null
+++ b/gdb/testsuite/gdb.threads/fork-plus-threads-2.c
@@ -0,0 +1,129 @@ 
+/* This testcase is part of GDB, the GNU debugger.
+
+   Copyright 2015 Free Software Foundation, Inc.
+
+   This program is free software; you can redistribute it and/or modify
+   it under the terms of the GNU General Public License as published by
+   the Free Software Foundation; either version 3 of the License, or
+   (at your option) any later version.
+
+   This program is distributed in the hope that it will be useful,
+   but WITHOUT ANY WARRANTY; without even the implied warranty of
+   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+   GNU General Public License for more details.
+
+   You should have received a copy of the GNU General Public License
+   along with this program.  If not, see <http://www.gnu.org/licenses/>.  */
+
+#include <assert.h>
+#include <pthread.h>
+#include <unistd.h>
+#include <stdio.h>
+#include <sys/types.h>
+#include <sys/wait.h>
+#include <stdlib.h>
+
+/* Number of threads.  Each thread continuously spawns a fork and wait
+   for it.  If we have another thread continuously start a step over,
+   gdbserver should end up finding new forks while suspending
+   threads.  */
+#define NTHREADS 10
+
+pthread_t threads[NTHREADS];
+
+pthread_barrier_t barrier;
+
+#define NFORKS 10
+
+/* Used to create a conditional breakpoint that always fails.  */
+volatile int zero;
+
+static void *
+thread_forks (void *arg)
+{
+  int i;
+
+  pthread_barrier_wait (&barrier);
+
+  for (i = 0; i < NFORKS; i++)
+    {
+      pid_t pid;
+
+      pid = fork ();
+
+      if (pid > 0)
+	{
+	  int status;
+
+	  /* Parent.  */
+	  pid = waitpid (pid, &status, 0);
+	  if (pid == -1)
+	    {
+	      perror ("wait");
+	      exit (1);
+	    }
+
+	  if (!WIFEXITED (status))
+	    {
+	      printf ("Unexpected wait status 0x%x from child %d\n",
+		      status, pid);
+	    }
+	}
+      else if (pid == 0)
+	{
+	  /* Child.  */
+	  exit (0);
+	}
+      else
+	{
+	  perror ("fork");
+	  exit (1);
+	}
+    }
+}
+
+static void *
+thread_breakpoint (void *arg)
+{
+  pthread_barrier_wait (&barrier);
+
+  while (1)
+    {
+      usleep (1); /* set break here */
+    }
+}
+
+pthread_barrier_t barrier;
+
+int
+main (void)
+{
+  int i;
+  int ret;
+
+  /* Don't run forever.  */
+  alarm (180);
+
+  pthread_barrier_init (&barrier, NULL, NTHREADS + 1);
+
+  /* Start the threads that constantly fork.  */
+  for (i = 0; i < NTHREADS; i++)
+    {
+      ret = pthread_create (&threads[i], NULL, thread_forks, NULL);
+      assert (ret == 0);
+    }
+
+  /* Start the thread that constantly hit a conditional breakpoint
+     that needs to be stepped over.  */
+  ret = pthread_create (&threads[i], NULL, thread_breakpoint, NULL);
+  assert (ret == 0);
+
+  /* Wait for forking to stop.  */
+  for (i = 0; i < NTHREADS; i++)
+    {
+      ret = pthread_join (threads[i], NULL);
+      assert (ret == 0);
+    }
+
+  return 0;
+}
diff --git a/gdb/testsuite/gdb.threads/fork-plus-threads-2.exp b/gdb/testsuite/gdb.threads/fork-plus-threads-2.exp
new file mode 100644
index 0000000..dc4e119
--- /dev/null
+++ b/gdb/testsuite/gdb.threads/fork-plus-threads-2.exp
@@ -0,0 +1,116 @@ 
+# Copyright (C) 2015 Free Software Foundation, Inc.
+
+# This program is free software; you can redistribute it and/or modify
+# it under the terms of the GNU General Public License as published by
+# the Free Software Foundation; either version 3 of the License, or
+# (at your option) any later version.
+#
+# This program is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+# GNU General Public License for more details.
+#
+# You should have received a copy of the GNU General Public License
+# along with this program.  If not, see <http://www.gnu.org/licenses/>.
+
+# This test verifies that several threads forking while another thread
+# is constantly stepping over a breakpoint is properly handled.
+
+standard_testfile
+
+set linenum [gdb_get_line_number "set break here"]
+
+proc do_test { detach_on_fork } {
+    global GDBFLAGS
+    global srcfile testfile
+    global decimal gdb_prompt
+    global linenum
+    global is_remote_target
+
+    set saved_gdbflags $GDBFLAGS
+    set GDBFLAGS [concat $GDBFLAGS " -ex \"set non-stop on\""]
+
+    if {[prepare_for_testing "failed to prepare" $testfile $srcfile {debug pthreads}] == -1} {
+	set GDBFLAGS $saved_gdbflags
+	return -1
+    }
+
+    set GDBFLAGS $saved_gdbflags
+
+    if ![runto_main] then {
+	fail "Can't run to main"
+	return 0
+    }
+
+    set is_remote_target [gdb_is_target_remote]
+
+    gdb_test_no_output "set detach-on-fork $detach_on_fork"
+
+    gdb_test "break $linenum if zero == 1" \
+	"Breakpoint .*" \
+	"set breakpoint that evals false"
+
+    set test "continue &"
+    gdb_test_multiple $test $test {
+	-re "$gdb_prompt " {
+	    pass $test
+	}
+    }
+
+    set fork_count 0
+    set ok 0
+
+    set test "inferior 1 exited"
+    gdb_test_multiple "" $test {
+	-re "Inferior 1 \(\[^\r\n\]+\) exited normally" {
+	    pass $test
+	    set ok 1
+	}
+	-re "Inferior $decimal \(\[^\r\n\]+\) exited normally" {
+	    incr fork_count
+	    if {$fork_count <= 100} {
+		exp_continue
+	    } else {
+		fail "$test (too many forks)"
+	    }
+	}
+
+	-re "$gdb_prompt " {
+	    # Several errors end up at the top level, and printing the
+	    # prompt.
+	    if {!$is_remote_target} {
+		setup_kfail "gdb/18749" "*-*-linux*"
+	    }
+	    fail $test
+	}
+	-re "Cannot access memory" {
+	    if {!$is_remote_target} {
+		setup_kfail "gdb/18749" "*-*-linux*"
+	    }
+	    fail $test
+	}
+    }
+
+    if {!$ok} {
+	# No use testing further.
+	return
+    }
+
+    gdb_test "info threads" "No threads\." \
+	"no threads left"
+
+    gdb_test "info inferiors" \
+	"Num\[ \t\]+Description\[ \t\]+Executable\[ \t\]+\r\n\\* 1 \[^\r\n\]+" \
+	"only inferior 1 left"
+}
+
+foreach detach_on_fork {"on" "off"} {
+    with_test_prefix "detach-on-fork=$detach_on_fork" {
+	do_test $detach_on_fork
+    }
+
+    # The test passes with detach-on-fork off, but gdb seems to slow
+    # down quadratically as inferiors are created, and then the test
+    # takes annoyingly long to complete...
+    break
+}