ping: [patch] aarch64: PR 19806: watchpoints: false negatives + PR 20207 contiguous ones

From: Pedro Alves <palves@redhat.com>

  Hi,

On 03/21/2018 07:03 PM, Jan Kratochvil wrote:

> gdb/ChangeLog
> 2017-03-27  Jan Kratochvil  <jan.kratochvil@redhat.com>
> 
> 	PR breakpoints/19806 and support for PR external/20207.
> 	* NEWS (Changes since GDB 8.0): Mention unaligned hardware watchpoints.
> 	* aarch64-linux-nat.c (aarch64_linux_stopped_data_address): Fix missed
> 	watchpoints and PR external/20207 watchpoints.
> 	* gdbserver/linux-aarch64-low.c (aarch64_stopped_data_address):
> 	Likewise.
> 	* nat/aarch64-linux-hw-point.c (have_any_contiguous): New.
> 	(aarch64_watchpoint_offset): New.
> 	(aarch64_watchpoint_length): Support PR external/20207 watchpoints.
> 	(aarch64_point_encode_ctrl_reg): New parameter offset, new asserts.
> 	(aarch64_point_is_aligned): Support PR external/20207 watchpoints.
> 	(aarch64_align_watchpoint): New parameters aligned_offset_p and
> 	next_addr_orig_p.  Support PR external/20207 watchpoints.
> 	(aarch64_downgrade_regs): New.
> 	(aarch64_dr_state_insert_one_point): New parameters offset and
> 	addr_orig.
> 	(aarch64_dr_state_remove_one_point): Likewise.
> 	(aarch64_handle_breakpoint): Update caller.
> 	(aarch64_handle_aligned_watchpoint): Likewise.
> 	(aarch64_handle_unaligned_watchpoint): Support addr_orig and
> 	aligned_offset.
> 	(aarch64_linux_set_debug_regs): Remove const from state.  Call
> 	aarch64_downgrade_regs.
> 	(aarch64_show_debug_reg_state): Print also dr_addr_orig_wp.
> 	* nat/aarch64-linux-hw-point.h (DR_CONTROL_LENGTH): Rename to ...
> 	(DR_CONTROL_MASK): ... here.
> 	(struct aarch64_debug_reg_state): New field dr_addr_orig_wp.
> 	(unsigned int aarch64_watchpoint_offset): New prototype.
> 	(aarch64_linux_set_debug_regs): Remove const from state.
> 
> gdb/testsuite/ChangeLog
> 2017-03-27  Jan Kratochvil  <jan.kratochvil@redhat.com>
> 
> 	PR breakpoints/19806 and support for PR external/20207.
> 	* gdb.base/watchpoint-unaligned.c: New file.
> 	* gdb.base/watchpoint-unaligned.exp: New file.

So I spent a couple days this week working through this.

The patch looks largely good to me, though as I was going
through this, I was having trouble understanding some of the
details, and kept wishing for clearer comments, so I tried to
clarify and copy/edit the comments as I went.  I also noticed a
few more places where we should update the comments to match the
new reality, like the comments on top of aarch64_align_watchpoint.

Other copy/edits:

- gdbserver has its own ChangeLog file
- move align_down (and align_up) to common/, so we
  can use it in gdbserver too, and use it to make
  code a little bit clearer.
- rename the have_any_contiguous global as it wasn't
  immediately clear what it meant when I started
  reading this.  Use copy-init while at it.
- // -> /**/ comments.
- Misc formatting.
- Expanded/clarified the git commit log entry.

Here's the patch that I intend to squash with yours before
merging (keeping you as --author).

Could you double-check to see if I missed something?

I have one question though.  In aarch64_linux_stopped_data_address,
where we match ADDR_TRAP from siginfo to a watchpoint, we check
whether that address is within the aligned watch regions:

      if (state->dr_ref_count_wp[i]
	  && DR_CONTROL_ENABLED (state->dr_ctrl_wp[i])
	  && addr_trap >= addr_watch_aligned
	  && addr_trap < addr_watch + len)
	{

However, by reading
<http://lkml.iu.edu/hypermail/linux/kernel/1611.1/05226.html>:

~~~~~~~~~~~~
Previously, when the hardware reported a watchpoint hit on an address
that did not match our watchpoint (this happens in case of instructions
which access large chunks of memory such as "stp") the process would
enter a loop where we would be continually resuming it (because we did
not recognise that watchpoint hit) and it would keep hitting the
watchpoint again and again. The tracing process would never get
notified of the watchpoint hit.
~~~~~~~~~~~~

... I'm left with the impression that ADDR_TRAP could be even
lower than addr_watch_aligned, in which case we'll still miss
watchpoints.  I'm wondering whether GDB should be using a similar
trick as that kernel patch does.  I may well be missing something,
though, as I only tried the patch on an older kernel.  WDYT?
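
To make the concern concrete, here is a rough sketch of one way to
read "a similar trick": instead of requiring ADDR_TRAP to fall inside
a watched range, pick the watchpoint whose range is closest to it.
This is only an illustration under that assumption; the struct and
the sketch_* names are made up, and I have not verified that it
mirrors the kernel patch in detail.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical description of one watched range.  */
struct sketch_range
{
  uint64_t start;    /* first watched byte */
  unsigned int len;  /* number of watched bytes, must be > 0 */
};

/* Return the distance in bytes from ADDR to range R, or 0 if ADDR
   lies inside R.  */
static uint64_t
sketch_distance (uint64_t addr, const struct sketch_range *r)
{
  assert (r->len > 0);
  if (addr < r->start)
    return r->start - addr;
  if (addr >= r->start + r->len)
    return addr - (r->start + r->len - 1);
  return 0;
}

/* Return the index of the range in R[0..N) closest to ADDR, or -1 if
   N is zero.  Ties go to the lowest index.  */
static int
sketch_closest_watchpoint (const struct sketch_range *r, size_t n,
                           uint64_t addr)
{
  int best = -1;
  uint64_t best_dist = UINT64_MAX;

  for (size_t i = 0; i < n; i++)
    {
      uint64_t d = sketch_distance (addr, &r[i]);
      if (d < best_dist)
        {
          best_dist = d;
          best = (int) i;
        }
    }
  return best;
}
```

With a 16-byte store starting 8 bytes below a watched byte, the
reported address is below the aligned watch address, yet the closest
range is still the one that was hit.  The obvious cost is that a
stray trap always gets attributed to some watchpoint.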

From f265a8e13984aaa40a5ed59913ed14923bc67d9d Mon Sep 17 00:00:00 2001
From: Pedro Alves <palves@redhat.com>
Date: Fri, 20 Apr 2018 14:55:03 +0100
Subject: [PATCH] Aarch64: Fix watchpoints set on non-8-byte-aligned addresses
 are always missed (PR 19806)

Ref: https://sourceware.org/bugzilla/show_bug.cgi?id=19806

As described in detail on the bug report, on Aarch64, some unaligned
watchpoints are currently missed.  For example, with:

 union
 {
   char buf[4];
   unsigned int ul;
 } u;

 int
 main ()
 {
   u.ul = 0xffffffff;
   return 0;
 }

on x86-64, we get:

 (gdb) watch u.buf[1]
 Hardware watchpoint 1: u.buf[1]
 (gdb) c
 Continuing.
 Hardware watchpoint 1: u.buf[1]

 Old value = 0 '\000'
 New value = -1 '\377'
 main () at watch.c:11
 11              return 0;
 (gdb)

While on Aarch64, GDB misses the watchpoint hit.

Actually, the kernel reports the hit to gdb, and linux-nat.c forwards
the event to infrun.c.  However, it doesn't work as expected because
the Aarch64 backend code aligns the inserted watchpoint's address to
an 8-byte boundary, and then when the watchpoint triggers, the backend
reports that aligned address as the watchpoint stop address.  Since
that address falls out of any memory range covered by watchpoints that
the core of GDB knows about, the watchpoint hit is not reported to the
user (it's considered a spurious/moribund watchpoint trap, and
ignored).
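
The arithmetic behind the miss can be sketched as follows.  This is
an illustration only: the sketch_* helpers are hypothetical stand-ins
(not the GDB functions), and the addresses used with them are made
up.

```c
#include <assert.h>
#include <stdint.h>

/* Round V down to a multiple of ALIGN, which must be a power of
   two.  */
static uint64_t
sketch_align_down (uint64_t v, unsigned int align)
{
  assert (align != 0 && (align & (align - 1)) == 0);
  return v & ~(uint64_t) (align - 1);
}

/* Return nonzero if ADDR lies in [START, START + LEN).  */
static int
sketch_in_range (uint64_t addr, uint64_t start, unsigned int len)
{
  return addr >= start && addr < start + len;
}

/* Return nonzero if reporting the 8-byte-aligned address for a
   watchpoint on [WATCH_ADDR, WATCH_ADDR + WATCH_LEN) would land
   outside the range the core of GDB knows about.  */
static int
sketch_aligned_report_is_missed (uint64_t watch_addr,
                                 unsigned int watch_len)
{
  uint64_t reported = sketch_align_down (watch_addr, 8);
  return !sketch_in_range (reported, watch_addr, watch_len);
}
```

For a 1-byte watch at a hypothetical address 0x420031 (think
&u.buf[1]), the aligned address is 0x420030, which is outside
[0x420031, 0x420032), so the hit is dropped.  A watch that already
starts on an 8-byte boundary is unaffected.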

This patch fixes it, by trying to match the kernel-reported trapped
address (ADDR_TRAP) with a watched region, so that the core of gdb
figures out which watchpoint triggered.
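
A simplified sketch of that matching step, under stated assumptions:
struct sketch_watch and sketch_stopped_data_address are hypothetical
and only model the idea (compare ADDR_TRAP against each enabled
watch region, then report the address the user originally asked
for).  The real code works on struct aarch64_debug_reg_state.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, flattened view of one hardware watchpoint.  */
struct sketch_watch
{
  int enabled;
  uint64_t addr_aligned;  /* 8-byte-aligned address in the register */
  unsigned int offset;    /* first watched byte within the 8 bytes */
  unsigned int len;       /* number of watched bytes */
  uint64_t addr_orig;     /* address the user asked to watch */
};

/* If ADDR_TRAP falls within the region of an enabled watchpoint in
   WP[0..N), store that watchpoint's original address in *ADDR_P and
   return 1.  Otherwise return 0 and leave *ADDR_P untouched.  */
static int
sketch_stopped_data_address (const struct sketch_watch *wp, size_t n,
                             uint64_t addr_trap, uint64_t *addr_p)
{
  assert (addr_p != NULL);

  for (size_t i = 0; i < n; i++)
    {
      uint64_t addr_watch = wp[i].addr_aligned + wp[i].offset;

      if (wp[i].enabled
          && addr_trap >= wp[i].addr_aligned
          && addr_trap < addr_watch + wp[i].len)
        {
          *addr_p = wp[i].addr_orig;
          return 1;
        }
    }
  return 0;
}
```

Returning the original address rather than the aligned one is the
point: it is the value that lies inside a range the core of gdb
knows about.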

Additionally this patch makes GDB support any watchpoint masks as
described here:

 kernel RFE: aarch64: ptrace: BAS: Support any contiguous range
 https://sourceware.org/bugzilla/show_bug.cgi?id=20207

The above is already fixed in current Linux kernels.
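
For illustration, a contiguous byte mask within an 8-byte window can
be built from an offset and a length, and taken apart again, roughly
like this.  The sketch_bas_* names are hypothetical and the sketch
deliberately ignores where the mask sits in the control register.

```c
#include <assert.h>
#include <stdint.h>

/* Build an 8-bit mask selecting LEN contiguous bytes starting at
   byte OFFSET of an 8-byte-aligned window.  */
static uint8_t
sketch_bas_mask (unsigned int offset, unsigned int len)
{
  assert (len >= 1 && offset + len <= 8);
  return (uint8_t) (((1u << len) - 1) << offset);
}

/* Return the offset of the first selected byte in MASK.  */
static unsigned int
sketch_bas_offset (uint8_t mask)
{
  unsigned int offset = 0;

  assert (mask != 0);
  while ((mask & 1) == 0)
    {
      mask >>= 1;
      offset++;
    }
  return offset;
}

/* Return the number of selected bytes in MASK, which must select one
   contiguous run of bytes.  */
static unsigned int
sketch_bas_length (uint8_t mask)
{
  unsigned int len = 0;

  assert (mask != 0);
  mask >>= sketch_bas_offset (mask);
  while (mask & 1)
    {
      mask >>= 1;
      len++;
    }
  assert (mask == 0);  /* No second run of set bits.  */
  return len;
}
```

A 1-byte watch at offset 1 gives mask 0x02, and a 4-byte watch at
offset 2 gives 0x3c; neither starts at byte 0, which is the kind of
mask the PR is about.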

The patch trades missing watchpoints (false negatives) for the
occasional rwatch/awatch false positive on unfixed kernels.  The
latter can happen if you have watchpoints set at nearby addresses;
gdb may report the wrong watchpoint as being hit.

Note the change causes the aarch64 backend to merge fewer watchpoints,
since it now only considers the requested ranges for merging instead
of the enlarged/aligned ranges.  I.e., with multiple overlapping
watchpoints, gdb may run out of the 4 hardware watchpoint registers
earlier than before.  While we could make gdb consider the enlarged
ranges when running on an older kernel, I do not think it's worth the
bother, since older kernels will eventually be phased out.

Tested on RHEL-7.{3,4} for no regressions on:
	kernel-4.10.0-6.el7.aarch64 (contiguous watchpoints supported)
	kernel-4.5.0-15.el7.aarch64 (contiguous watchpoints unsupported)

gdb/ChangeLog:
yyyy-mm-dd  Jan Kratochvil  <jan.kratochvil@redhat.com>

	PR breakpoints/19806 and support for PR external/20207.

	* NEWS: Mention Aarch64 watchpoint improvements.

	* aarch64-linux-nat.c (aarch64_linux_stopped_data_address): Fix missed
	watchpoints and PR external/20207 watchpoints.
	* nat/aarch64-linux-hw-point.c
	(kernel_supports_any_contiguous_range): New.
	(aarch64_watchpoint_offset): New.
	(aarch64_watchpoint_length): Support PR external/20207 watchpoints.
	(aarch64_point_encode_ctrl_reg): New parameter offset, new asserts.
	(aarch64_point_is_aligned): Support PR external/20207 watchpoints.
	(aarch64_align_watchpoint): New parameters aligned_offset_p and
	next_addr_orig_p.  Support PR external/20207 watchpoints.
	(aarch64_downgrade_regs): New.
	(aarch64_dr_state_insert_one_point): New parameters offset and
	addr_orig.
	(aarch64_dr_state_remove_one_point): Likewise.
	(aarch64_handle_breakpoint): Update caller.
	(aarch64_handle_aligned_watchpoint): Likewise.
	(aarch64_handle_unaligned_watchpoint): Support addr_orig and
	aligned_offset.
	(aarch64_linux_set_debug_regs): Remove const from state.  Call
	aarch64_downgrade_regs.
	(aarch64_show_debug_reg_state): Print also dr_addr_orig_wp.
	* nat/aarch64-linux-hw-point.h (DR_CONTROL_LENGTH): Rename to ...
	(DR_CONTROL_MASK): ... this.
	(struct aarch64_debug_reg_state): New field dr_addr_orig_wp.
	(unsigned int aarch64_watchpoint_offset): New prototype.
	(aarch64_linux_set_debug_regs): Remove const from state.
	* utils.c (align_up, align_down): Move to ...
	* common/common-utils.c (align_up, align_down): ... here.
	* utils.h (align_up, align_down): Move to ...
	* common/common-utils.h (align_up, align_down): ... here.

gdb/gdbserver/ChangeLog:
yyyy-mm-dd  Jan Kratochvil  <jan.kratochvil@redhat.com>

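	PR breakpoints/19806 and support for PR external/20207.
	* linux-aarch64-low.c (aarch64_stopped_data_address): Fix missed
	watchpoints and PR external/20207 watchpoints.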

gdb/testsuite/ChangeLog:
yyyy-mm-dd  Jan Kratochvil  <jan.kratochvil@redhat.com>

	PR breakpoints/19806 and support for PR external/20207.
	* gdb.base/watchpoint-unaligned.c: New file.
	* gdb.base/watchpoint-unaligned.exp: New file.
---
 gdb/NEWS                                        |  15 ++-
 gdb/aarch64-linux-nat.c                         |  33 +++---
 gdb/common/common-utils.c                       |  20 ++++
 gdb/common/common-utils.h                       |  32 ++++++
 gdb/gdbserver/linux-aarch64-low.c               |  33 +++---
 gdb/nat/aarch64-linux-hw-point.c                | 129 +++++++++++++-----------
 gdb/nat/aarch64-linux-hw-point.h                |   4 +-
 gdb/testsuite/gdb.base/watchpoint-unaligned.c   |   4 +-
 gdb/testsuite/gdb.base/watchpoint-unaligned.exp |  41 ++++----
 gdb/utils.c                                     |  16 ---
 gdb/utils.h                                     |  32 ------
 11 files changed, 198 insertions(+), 161 deletions(-)
