Make `linux_info_proc` prefer using the LWP over the PID

Message ID 20240106024512.14270-1-dark.ryu.550@gmail.com
State New
Headers
Series Make `linux_info_proc` prefer using the LWP over the PID |

Checks

Context Check Description
linaro-tcwg-bot/tcwg_gdb_build--master-aarch64 success Testing passed
linaro-tcwg-bot/tcwg_gdb_build--master-arm success Testing passed
linaro-tcwg-bot/tcwg_gdb_check--master-aarch64 success Testing passed
linaro-tcwg-bot/tcwg_gdb_check--master-arm success Testing passed

Commit Message

Matheus Branco Borella Jan. 6, 2024, 2:45 a.m. UTC
  Fixes: https://sourceware.org/bugzilla/show_bug.cgi?id=31207

Normally, `linux_info_proc` would use the PID to determine which subfolder in
`/proc` to read information from. While this is usually fine, it breaks down
after the main thread exits, at which point the information in `/proc/$pid`
becomes become unreliable, if it is available at all. While it is the case
that most programs terminate after their main thread exits, some may continue
running from detached threads, in which case `info proc` will start misbehaving.

This patch addresses this by making it so that the LWP - the Lightweight Process
ID, that, in the case of GNU/Linux is the number of the process backing up the
thread[1] - is prefered over the PID. By doing this, `linux_info_proc` will
always access valid procfs information, even after the main thread exits.

[1]: https://man7.org/linux/man-pages/man2/clone.2.html
---
 gdb/linux-tdep.c | 9 ++++++++-
 1 file changed, 8 insertions(+), 1 deletion(-)
  

Comments

Simon Marchi Jan. 8, 2024, 3:50 p.m. UTC | #1
On 2024-01-05 21:45, Matheus Branco Borella wrote:
> Fixes: https://sourceware.org/bugzilla/show_bug.cgi?id=31207

We use `Bug:` for these.  Also, move it at the end of the commit
message, like standard git trailers
(https://git-scm.com/docs/git-interpret-trailers).

> Normally, `linux_info_proc` would use the PID to determine which subfolder in
> `/proc` to read information from. While this is usually fine, it breaks down
> after the main thread exits, at which point the information in `/proc/$pid`
> becomes become unreliable, if it is available at all. While it is the case
> that most programs terminate after their main thread exits, some may continue
> running from detached threads, in which case `info proc` will start misbehaving.
> 
> This patch addresses this by making it so that the LWP - the Lightweight Process
> ID, that, in the case of GNU/Linux is the number of the process backing up the
> thread[1] - is prefered over the PID. By doing this, `linux_info_proc` will
> always access valid procfs information, even after the main thread exits.
> 
> [1]: https://man7.org/linux/man-pages/man2/clone.2.html
> ---
>  gdb/linux-tdep.c | 9 ++++++++-
>  1 file changed, 8 insertions(+), 1 deletion(-)
> 
> diff --git a/gdb/linux-tdep.c b/gdb/linux-tdep.c
> index 82e8bc3db3c..2c91e298d45 100644
> --- a/gdb/linux-tdep.c
> +++ b/gdb/linux-tdep.c
> @@ -840,7 +840,14 @@ linux_info_proc (struct gdbarch *gdbarch, const char *args,
>        if (current_inferior ()->fake_pid_p)
>  	error (_("Can't determine the current process's PID: you must name one."));
>  
> -      pid = current_inferior ()->pid;
> +      /* Seeing as, when the main thread exits, the information in /proc/$pid
> +       * becomes unreliable, we should prefer using the current TID, whenever
> +       * possible. */
> +      pid = inferior_ptid.lwp ();
> +
> +      /* And fall back to the actual PID only when the TID is not available. */
> +      if (pid == 0)
> +	pid = current_inferior ()->pid;

I would suggest trying to use the any_live_thread_of_inferior function
to get a non-exited thread.   This way, if the current thread has
exited, it will find another that should be suitable for reading the
proc information.

I can imagine another case where thing would go wrong.  There might be
threads which have exited, but for which we have not processed the
"exited" event yet.  The exited state will not yet be reflected in the
thread_info structure, so we might pick it thinking it's a live thread.
But I would ignore that problem for now, what you propose is already a
good improvement over the current state.

Simon
  
Matheus Branco Borella Jan. 19, 2024, 4:52 p.m. UTC | #2
I've sent in a v2 that should address your points. Thanks for your time.
  

Patch

diff --git a/gdb/linux-tdep.c b/gdb/linux-tdep.c
index 82e8bc3db3c..2c91e298d45 100644
--- a/gdb/linux-tdep.c
+++ b/gdb/linux-tdep.c
@@ -840,7 +840,14 @@  linux_info_proc (struct gdbarch *gdbarch, const char *args,
       if (current_inferior ()->fake_pid_p)
 	error (_("Can't determine the current process's PID: you must name one."));
 
-      pid = current_inferior ()->pid;
+      /* Seeing as, when the main thread exits, the information in /proc/$pid
+       * becomes unreliable, we should prefer using the current TID, whenever
+       * possible. */
+      pid = inferior_ptid.lwp ();
+
+      /* And fall back to the actual PID only when the TID is not available. */
+      if (pid == 0)
+	pid = current_inferior ()->pid;
     }
 
   args = skip_spaces (args);