[FYI/pushed,v4,03/25] Step over clone syscall w/ breakpoint, TARGET_WAITKIND_THREAD_CLONED

  (A good chunk of the problem statement in the commit log below is
Andrew's, adjusted for a different solution, and for covering
displaced stepping too.  The testcase is mostly Andrew's too.)

This commit addresses bugs gdb/19675 and gdb/27830, which are about
stepping over a breakpoint set at a clone syscall instruction.  One is
about displaced stepping, and the other about in-line stepping.

Currently, when a new thread is created through a clone syscall, GDB
sets the new thread running.  With 'continue' this makes sense
(assuming no schedlock):

 - all-stop mode, user issues 'continue', all threads are set running,
   a newly created thread should also be set running.

 - non-stop mode, user issues 'continue', other pre-existing threads
   are not affected, but as the new thread is (sort-of) a child of the
   thread the user asked to run, it makes sense that the new thread
   should be created in the running state.

Similarly, if we are stopped at the clone syscall, and there's no
software breakpoint at this address, then the current behaviour is
fine:

 - all-stop mode, user issues 'stepi', stepping will be done in place
   (as there's no breakpoint to step over).  While stepping the thread
   of interest all the other threads will be allowed to continue.  A
   newly created thread will be set running, and then stopped once the
   thread of interest has completed its step.

 - non-stop mode, user issues 'stepi', stepping will be done in place
   (as there's no breakpoint to step over).  Other threads might be
   running or stopped, but as with the continue case above, the new
   thread will be created running.  The only possible issue here is
   that the new thread will be left running after the initial thread
   has completed its stepi.  The user would need to manually select
   the thread and interrupt it, which might not be what the user
   expects.  However, this is not something this commit tries to
   change.

The problem then is what happens when we try to step over a clone
syscall if there is a breakpoint at the syscall address.

- For both all-stop and non-stop modes, with in-line stepping:

   + user issues 'stepi',
   + [non-stop mode only] GDB stops all threads.  In all-stop mode all
     threads are already stopped.
   + GDB removes s/w breakpoint at syscall address,
   + GDB single steps just the thread of interest, all other threads
     are left stopped,
   + New thread is created running,
   + Initial thread completes its step,
   + [non-stop mode only] GDB resumes all threads that it previously
     stopped.

There are two problems in the in-line stepping scenario above:

  1. The new thread might pass through the same code that the initial
     thread is in (i.e. the clone syscall code), in which case it will
     fail to hit the breakpoint in clone as this was removed so the
     first thread can single step,

  2. The new thread might trigger some other stop event before the
     initial thread reports its step completion.  If this happens we
     end up triggering an assertion as GDB assumes that only the
     thread being stepped should stop.  The assert looks like this:

     infrun.c:5899: internal-error: int finish_step_over(execution_control_state*): Assertion `ecs->event_thread->control.trap_expected' failed.

- For both all-stop and non-stop modes, with displaced stepping:

   + user issues 'stepi',
   + GDB starts the displaced step, moves thread's PC to the
     out-of-line scratch pad, maybe adjusts registers,
   + GDB single steps the thread of interest, [non-stop mode only] all
     other threads are left as they were, either running or stopped.
     In all-stop, all other threads are left stopped.
   + New thread is created running,
   + Initial thread completes its step, GDB re-adjusts its PC,
     restores/releases scratchpad,
   + [non-stop mode only] GDB resumes the thread, now past its
     breakpoint.
   + [all-stop mode only] GDB resumes all threads.

There is one problem with the displaced stepping scenario above:

  3. When the parent thread completed its step, GDB adjusted its PC,
     but did not adjust the child's PC, thus that new child thread
     will continue execution in the scratch pad, invoking undefined
     behavior.  If you're lucky, you see a crash.  If unlucky, the
     inferior gets silently corrupted.
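
To illustrate issue #3, here is a minimal sketch of the PC fixup that
follows a displaced step.  The displaced_step struct and fixup_pc
function are hypothetical names made up for this example, not GDB's
actual code, and the real fixup is architecture-specific and may touch
other registers too.  The point is only that parent and child both
return from the syscall with a PC inside the scratch pad, so the same
mapping has to be applied to both.

```cpp
#include <cassert>
#include <cstdint>

using addr_t = std::uint64_t;

// Hypothetical, simplified record of one displaced step.
struct displaced_step
{
  addr_t original_pc;   // address of the breakpointed instruction
  addr_t scratch_pc;    // address of its copy in the scratch pad
};

// Map a PC that points into the scratch pad back to the matching
// address in the original code.
addr_t
fixup_pc (const displaced_step &ds, addr_t pc_after_step)
{
  assert (pc_after_step >= ds.scratch_pc);
  return ds.original_pc + (pc_after_step - ds.scratch_pc);
}
```

With a 2-byte syscall instruction copied from 0x401000 to 0x7000, both
threads stop at 0x7002 and both need to be moved to 0x401002.  If only
the parent is fixed up, the child keeps executing whatever follows in
the scratch pad.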

What is needed is for GDB to have more control over whether the new
thread is created running or not.  Issue #1 above requires that the
new thread not be allowed to run until the breakpoint has been
reinserted.  The only way to guarantee this is if the new thread is
held in a stopped state until the single step has completed.  Issue #3
above requires that GDB is informed of when a thread clones itself,
and of what the child's ptid is, so that GDB can fix up both the parent
and the child.
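
The requirement for issue #1 can be illustrated with a toy model.
Everything below (inferior_model, would_trap, child_hits_breakpoint)
is made up for this sketch and is not GDB code.  It assumes the child
executes the breakpoint address as soon as it is allowed to run.

```cpp
#include <cassert>
#include <set>

using addr_t = unsigned long;

// Toy stand-in for the set of breakpoints currently inserted in the
// inferior.  Hypothetical; not a GDB data structure.
struct inferior_model
{
  std::set<addr_t> inserted;
};

// True if a thread executing the instruction at PC would trap.
bool
would_trap (const inferior_model &inf, addr_t pc)
{
  return inf.inserted.count (pc) != 0;
}

// Model an in-line step over the breakpoint at BP_ADDR, during which
// the stepped thread clones.  Returns true if the child hits the
// breakpoint.
bool
child_hits_breakpoint (inferior_model &inf, addr_t bp_addr,
                       bool child_held_stopped)
{
  assert (would_trap (inf, bp_addr));

  // The breakpoint is removed so the parent can single-step.
  inf.inserted.erase (bp_addr);

  bool hit = false;

  // A child created running executes while the breakpoint is out.
  if (!child_held_stopped)
    hit = would_trap (inf, bp_addr);

  // The parent's step completes; the breakpoint is reinserted.
  inf.inserted.insert (bp_addr);

  // A child held stopped is only resumed at this point.
  if (child_held_stopped)
    hit = would_trap (inf, bp_addr);

  return hit;
}
```

In this model, a child created running misses the breakpoint, while a
child held stopped until the reinsertion hits it.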

When looking for solutions to this problem I considered how GDB
handles fork/vfork as these have some of the same issues.  The main
difference between fork/vfork and clone is that the clone events are
not reported back to core GDB.  Instead, the clone event is handled
automatically in the target code and the child thread is immediately
set running.

Note we have support for requesting thread creation events out of the
target (TARGET_WAITKIND_THREAD_CREATED).  However, those are reported
for the new/child thread.  That would be sufficient to address in-line
stepping (issue #1), but not for displaced-stepping (issue #3).  To
handle displaced-stepping, we need an event that is reported to the
_parent_ of the clone, as the information about the displaced step is
associated with the clone parent.  TARGET_WAITKIND_THREAD_CREATED
includes no indication of which thread is the parent that spawned the
new child.  In fact, for some targets, like e.g., Windows, it would be
impossible to know which thread that was, as thread creation there
doesn't work by "cloning".

The solution implemented here is to model clone on fork/vfork, and
introduce a new TARGET_WAITKIND_THREAD_CLONED event.  This event is
similar to TARGET_WAITKIND_FORKED and TARGET_WAITKIND_VFORKED, except
that we end up with a new thread in the same process, instead of a new
thread of a new process.  Like FORKED and VFORKED, THREAD_CLONED
waitstatuses have a child_ptid property, and the child is held stopped
until GDB explicitly resumes it.  This addresses the in-line stepping
case (issues #1 and #2).
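
As a rough sketch of the event model described above, the following
uses simplified stand-in types.  wait_kind, ptid and wait_status here
are hypothetical and much smaller than GDB's real target_waitstatus
and ptid_t; they only show which event kinds carry a child ptid, and
how a clone child differs from a fork child.

```cpp
#include <cassert>

// Simplified stand-ins, loosely modeled on the event kinds named
// above.  These are not GDB's real definitions.
enum class wait_kind
{
  stopped,
  forked,
  vforked,
  thread_cloned,
  thread_created,
};

struct ptid
{
  int pid;
  long lwp;
};

struct wait_status
{
  wait_kind kind;
  ptid child;   // only meaningful when has_child () is true
};

// Events reported to the parent that also name the new child.
bool
has_child (const wait_status &ws)
{
  return (ws.kind == wait_kind::forked
          || ws.kind == wait_kind::vforked
          || ws.kind == wait_kind::thread_cloned);
}

// A clone child lives in the parent's process; a fork child does not.
bool
child_shares_process (const ptid &parent, const wait_status &ws)
{
  assert (has_child (ws));
  return ws.child.pid == parent.pid;
}
```

Note that thread_created does not count as having a child here,
matching the point above that such events are reported for the new
thread itself and say nothing about its parent.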

The infrun code that handles displaced stepping fixup for the child
after a fork/vfork event is thus reused for THREAD_CLONED, with some
minimal conditions added, addressing the displaced stepping case
(issue #3).

The native Linux backend is adjusted to unconditionally report
TARGET_WAITKIND_THREAD_CLONED events to the core.

Following the follow_fork model in core GDB, we introduce a
target_follow_clone target method, which is responsible for making the
new clone child visible to the rest of GDB.

Subsequent patches will add clone events support to the remote
protocol and gdbserver.

displaced_step_in_progress_thread becomes unused with this patch, but
a new use will reappear later in the series.  To avoid deleting it and
re-adding it, this patch marks it with attribute unused, and the
later patch removes the attribute again.  We need to do this because
the function is static, and with no callers, the compiler would warn
(error with -Werror), breaking the build.
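
For reference, a minimal sketch of the technique, using the GCC/Clang
attribute spelling directly.  The function name is made up for the
example, and the exact spelling used in the patch itself may differ
(e.g., a wrapper macro).

```cpp
#include <cassert>

// A static function with no callers normally triggers
// -Wunused-function under -Wall, which becomes a build failure with
// -Werror.  The attribute suppresses the warning until a caller is
// added back.
static int __attribute__ ((unused))
helper_without_callers (int x)
{
  assert (x >= 0);
  return x + 1;
}
```

Removing the attribute once a caller exists again restores the
compiler's ability to flag the function if it ever becomes dead code.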

This adds a new gdb.threads/stepi-over-clone.exp testcase, which
exercises stepping over a clone syscall, with displaced stepping vs
inline stepping, and all-stop vs non-stop.  We already test stepping
over clone syscalls with gdb.base/step-over-syscall.exp, but this test
uses pthreads, while the other test uses raw clone, and this one is
more thorough.  The testcase passes on native GNU/Linux, but fails
against GDBserver.  GDBserver will be fixed by a later patch in the
series.

Co-authored-by: Andrew Burgess <aburgess@redhat.com>
Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=19675
Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=27830
Change-Id: I95c06024736384ae8542a67ed9fdf6534c325c8e
Reviewed-By: Andrew Burgess <aburgess@redhat.com>
---
 gdb/infrun.c                                  | 158 +++----
 gdb/linux-nat.c                               | 252 +++++------
 gdb/linux-nat.h                               |   2 +
 gdb/target-delegates.c                        |  24 ++
 gdb/target.c                                  |   7 +
 gdb/target.h                                  |   7 +
 gdb/target/waitstatus.c                       |   1 +
 gdb/target/waitstatus.h                       |  31 +-
 gdb/testsuite/gdb.threads/stepi-over-clone.c  |  90 ++++
 .../gdb.threads/stepi-over-clone.exp          | 395 ++++++++++++++++++
 10 files changed, 775 insertions(+), 192 deletions(-)
 create mode 100644 gdb/testsuite/gdb.threads/stepi-over-clone.c
 create mode 100644 gdb/testsuite/gdb.threads/stepi-over-clone.exp

Message ID	20231113150427.477431-4-pedro@palves.net
State	New
Headers	From: Pedro Alves <pedro@palves.net>
	To: gdb-patches@sourceware.org
	Cc: Andrew Burgess <aburgess@redhat.com>
	Date: Mon, 13 Nov 2023 15:04:05 +0000
	In-Reply-To: <20231113150427.477431-1-pedro@palves.net>
Series	Step over thread clone and thread exit (patch 03 of 25)
