From patchwork Tue Feb 14 03:00:00 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Tom Tromey X-Patchwork-Id: 64937 Return-Path: X-Original-To: patchwork@sourceware.org Delivered-To: patchwork@sourceware.org Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 421313858033 for ; Tue, 14 Feb 2023 03:01:38 +0000 (GMT) X-Original-To: gdb-patches@sourceware.org Delivered-To: gdb-patches@sourceware.org Received: from qproxy2-pub.mail.unifiedlayer.com (qproxy2-pub.mail.unifiedlayer.com [69.89.16.161]) by sourceware.org (Postfix) with ESMTPS id 1ABD63858D1E for ; Tue, 14 Feb 2023 03:01:20 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 1ABD63858D1E Authentication-Results: sourceware.org; dmarc=none (p=none dis=none) header.from=tromey.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=tromey.com Received: from outbound-ss-761.bluehost.com (outbound-ss-761.bluehost.com [74.220.211.250]) by qproxy2.mail.unifiedlayer.com (Postfix) with ESMTP id AB5C68047BBC for ; Tue, 14 Feb 2023 03:01:12 +0000 (UTC) Received: from cmgw10.mail.unifiedlayer.com (unknown [10.0.90.125]) by progateway8.mail.pro1.eigbox.com (Postfix) with ESMTP id 9629F1003FCD2 for ; Tue, 14 Feb 2023 03:00:12 +0000 (UTC) Received: from box5379.bluehost.com ([162.241.216.53]) by cmsmtp with ESMTP id RlYCpDsPxA1zNRlYCp0UvK; Tue, 14 Feb 2023 03:00:12 +0000 X-Authority-Reason: nr=8 X-Authority-Analysis: v=2.4 cv=CqN6zl0D c=1 sm=1 tr=0 ts=63eaf93c a=ApxJNpeYhEAb1aAlGBBbmA==:117 a=ApxJNpeYhEAb1aAlGBBbmA==:17 a=dLZJa+xiwSxG16/P+YVxDGlgEgI=:19 a=m04uMKEZRckA:10:nop_rcvd_month_year a=Qbun_eYptAEA:10:endurance_base64_authed_username_1 a=CCpqsmhAAAAA:8 a=mDV3o1hIAAAA:8 a=qrUNAVCnFug3_m-HlWQA:9 a=ul9cdbp4aOFLsgKbc677:22 a=_FVE-zBwftR9WsbkzFJk:22 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=tromey.com; s=default; h=Content-Transfer-Encoding:MIME-Version:Message-Id:Date:Subject: Cc:To:From:Sender:Reply-To:Content-Type:Content-ID:Content-Description: Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID: In-Reply-To:References:List-Id:List-Help:List-Unsubscribe:List-Subscribe: List-Post:List-Owner:List-Archive; bh=bymAfGdU6jO9vAw7gAqCwx9h8dmDmnpq57EPERNCneE=; b=MvdG+yrHEEVkFxMCtL4IHQWahJ 5SISnas93sLVH/VW6TQ9UMLhuxuwV+8IXWeg2zqzizXbLoQzxBTpEPYKVr3Qqyvh93/U/ozjPV6/e sPh3cQ7QpzLKxpgc74uvE4eiA; Received: from 75-166-130-93.hlrn.qwest.net ([75.166.130.93]:57280 helo=localhost.localdomain) by box5379.bluehost.com with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.95) (envelope-from ) id 1pRlYC-000p9L-7A; Mon, 13 Feb 2023 20:00:12 -0700 From: Tom Tromey To: gdb-patches@sourceware.org Cc: Tom Tromey Subject: [PATCH] Fix "start" for D, Rust, etc Date: Mon, 13 Feb 2023 20:00:00 -0700 Message-Id: <20230214030000.1982722-1-tom@tromey.com> X-Mailer: git-send-email 2.39.1 MIME-Version: 1.0 X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - box5379.bluehost.com X-AntiAbuse: Original Domain - sourceware.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - tromey.com X-BWhitelist: no X-Source-IP: 75.166.130.93 X-Source-L: No X-Exim-ID: 1pRlYC-000p9L-7A X-Source: X-Source-Args: X-Source-Dir: X-Source-Sender: 75-166-130-93.hlrn.qwest.net (localhost.localdomain) [75.166.130.93]:57280 X-Source-Auth: tom+tromey.com X-Email-Count: 1 X-Source-Cap: ZWx5bnJvYmk7ZWx5bnJvYmk7Ym94NTM3OS5ibHVlaG9zdC5jb20= X-Local-Domain: yes X-Spam-Status: No, score=-3027.3 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, GIT_PATCH_0, JMQ_SPF_NEUTRAL, KAM_SHORT, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gdb-patches@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gdb-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: gdb-patches-bounces+patchwork=sourceware.org@sourceware.org Sender: "Gdb-patches" The new DWARF indexer broke "start" for some languages. For D, it is broken because, while the code in cooked_index_shard::add specifically excludes Ada, it fails to exclude D. This means that the C "main" will be detected as "main" here -- whereas what is intended is for the code in find_main_name to use d_main_name to find the name. The Rust compiler, on the other hand, uses DW_AT_main_subprogram. However, the code in dwarf2_build_psymtabs_hard fails to create a fully-qualified name, so the name always ends up as plain "main". For D and Ada, a very simple approach suffices: remove the check against "main" from cooked_index_shard::add. This also has the benefit of slightly speeding up DWARF indexing. I assume this approach will work for Pascal and Modula-2 as well, but I don't have a way to test those at present. For Rust, though, this is not sufficient. And, computing the fully-qualified name in dwarf2_build_psymtabs_hard will crash, because cooked_index_entry::full_name uses the canonical name -- and that is not computed until after canonicalization. However, we don't want to wait for canonicalization to be done before computing the main name. That would remove any benefit from doing canonicalization is the background. This patch solves this dilemma by noticing that languages using DW_AT_main_subprogram are, currently, disjoint from languages requiring canonicalization. Because of this, we can add a parameter to full_name to let us avoid crashes, slowdowns, and races here. This is kind of tricky and ugly, so I've tried to comment it sufficiently. While doing this, I had to change gdb.dwarf2/main-subprogram.exp. A different possibility here would be to ignore the canonicalization needs of C in this situation, because those only affect certain types. However, I chose this approach because the test case is artificial anyhow. A long time ago, in an earlier threading attempt, I changed the global current_language to be a function (hidden behind a macro) to let us attempt lazily computing the current language. Perhaps this approach could still be made to work. However, that also seemed rather tricky, more so than this patch. Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=30116 Reviewed-By: Andrew Burgess --- gdb/dwarf2/cooked-index.c | 50 +++++++++++++------- gdb/dwarf2/cooked-index.h | 18 +++++-- gdb/dwarf2/read.c | 13 ++++- gdb/testsuite/gdb.dlang/dlang-start.exp | 38 +++++++++++++++ gdb/testsuite/gdb.dlang/simple.d | 17 +++++++ gdb/testsuite/gdb.dwarf2/main-subprogram.exp | 3 +- gdb/testsuite/gdb.rust/rust-start.exp | 38 +++++++++++++++ 7 files changed, 154 insertions(+), 23 deletions(-) create mode 100644 gdb/testsuite/gdb.dlang/dlang-start.exp create mode 100644 gdb/testsuite/gdb.dlang/simple.d create mode 100644 gdb/testsuite/gdb.rust/rust-start.exp diff --git a/gdb/dwarf2/cooked-index.c b/gdb/dwarf2/cooked-index.c index 3d23a65ad8f..d465028add4 100644 --- a/gdb/dwarf2/cooked-index.c +++ b/gdb/dwarf2/cooked-index.c @@ -48,6 +48,16 @@ to_string (cooked_index_flag flags) /* See cooked-index.h. */ +bool +language_requires_canonicalization (enum language lang) +{ + return (lang == language_ada + || lang == language_c + || lang == language_cplus); +} + +/* See cooked-index.h. */ + int cooked_index_entry::compare (const char *stra, const char *strb, comparison_mode mode) @@ -162,10 +172,12 @@ test_compare () /* See cooked-index.h. */ const char * -cooked_index_entry::full_name (struct obstack *storage) const +cooked_index_entry::full_name (struct obstack *storage, bool for_main) const { + const char *local_name = for_main ? name : canonical; + if ((flags & IS_LINKAGE) != 0 || parent_entry == nullptr) - return canonical; + return local_name; const char *sep = nullptr; switch (per_cu->lang ()) @@ -182,11 +194,11 @@ cooked_index_entry::full_name (struct obstack *storage) const break; default: - return canonical; + return local_name; } - parent_entry->write_scope (storage, sep); - obstack_grow0 (storage, canonical, strlen (canonical)); + parent_entry->write_scope (storage, sep, for_main); + obstack_grow0 (storage, local_name, strlen (local_name)); return (const char *) obstack_finish (storage); } @@ -194,11 +206,13 @@ cooked_index_entry::full_name (struct obstack *storage) const void cooked_index_entry::write_scope (struct obstack *storage, - const char *sep) const + const char *sep, + bool for_main) const { if (parent_entry != nullptr) - parent_entry->write_scope (storage, sep); - obstack_grow (storage, canonical, strlen (canonical)); + parent_entry->write_scope (storage, sep, for_main); + const char *local_name = for_main ? name : canonical; + obstack_grow (storage, local_name, strlen (local_name)); obstack_grow (storage, sep, strlen (sep)); } @@ -218,10 +232,6 @@ cooked_index_shard::add (sect_offset die_offset, enum dwarf_tag tag, implicit "main" discovery. */ if ((flags & IS_MAIN) != 0) m_main = result; - else if (per_cu->lang () != language_ada - && m_main == nullptr - && strcmp (name, "main") == 0) - m_main = result; return result; } @@ -323,6 +333,8 @@ cooked_index_shard::do_finalize () for (cooked_index_entry *entry : m_entries) { + /* Note that this code must be kept in sync with + language_requires_canonicalization. */ gdb_assert (entry->canonical == nullptr); if ((entry->flags & IS_LINKAGE) != 0) entry->canonical = entry->name; @@ -474,11 +486,15 @@ cooked_index::get_main () const for (const auto &index : m_vector) { const cooked_index_entry *entry = index->get_main (); - if (result == nullptr - || ((result->flags & IS_MAIN) == 0 - && entry != nullptr - && (entry->flags & IS_MAIN) != 0)) - result = entry; + /* Choose the first "main" we see. The choice among several is + arbitrary. See the comment by the sole caller to understand + the rationale for filtering by language. */ + if (entry != nullptr + && !language_requires_canonicalization (entry->per_cu->lang ())) + { + result = entry; + break; + } } return result; diff --git a/gdb/dwarf2/cooked-index.h b/gdb/dwarf2/cooked-index.h index 7fa78d5e87e..e90544f7906 100644 --- a/gdb/dwarf2/cooked-index.h +++ b/gdb/dwarf2/cooked-index.h @@ -58,6 +58,13 @@ DEF_ENUM_FLAGS_TYPE (enum cooked_index_flag_enum, cooked_index_flag); std::string to_string (cooked_index_flag flags); +/* Return true if LANG requires canonicalization. This is used + primarily to work around an issue computing the name of "main". + This function must be kept in sync with + cooked_index_shard::do_finalize. */ + +extern bool language_requires_canonicalization (enum language lang); + /* A cooked_index_entry represents a single item in the index. Note that two entries can be created for the same DIE -- one using the name, and another one using the linkage name, if any. @@ -144,8 +151,12 @@ struct cooked_index_entry : public allocate_on_obstack /* Construct the fully-qualified name of this entry and return a pointer to it. If allocation is needed, it will be done on - STORAGE. */ - const char *full_name (struct obstack *storage) const; + STORAGE. FOR_MAIN is true if we are computing the name of the + "main" entry -- one marked DW_AT_main_subprogram. This matters + for avoiding name canonicalization (see comments about this + elsewhere) and also a related race (if "main" computation is done + during finalization). */ + const char *full_name (struct obstack *storage, bool for_main = false) const; /* Comparison modes for the 'compare' function. See the function for a description. */ @@ -220,7 +231,8 @@ struct cooked_index_entry : public allocate_on_obstack private: - void write_scope (struct obstack *storage, const char *sep) const; + void write_scope (struct obstack *storage, const char *sep, + bool for_name) const; }; class cooked_index; diff --git a/gdb/dwarf2/read.c b/gdb/dwarf2/read.c index 470ff125c5b..382603c2936 100644 --- a/gdb/dwarf2/read.c +++ b/gdb/dwarf2/read.c @@ -7167,8 +7167,17 @@ dwarf2_build_psymtabs_hard (dwarf2_per_objfile *per_objfile) const cooked_index_entry *main_entry = vec->get_main (); if (main_entry != nullptr) - set_objfile_main_name (objfile, main_entry->name, - main_entry->per_cu->lang ()); + { + /* We only do this for names not requiring canonicalization. At + this point in the process, names have not been canonicalized. + However, currently, languages that require this step also do + not use DW_AT_main_subprogram. An assert is appropriate here + because this filtering is done in get_main. */ + enum language lang = main_entry->per_cu->lang (); + gdb_assert (!language_requires_canonicalization (lang)); + const char *full_name = main_entry->full_name (&per_bfd->obstack, true); + set_objfile_main_name (objfile, full_name, lang); + } dwarf_read_debug_printf ("Done building psymtabs of %s", objfile_name (objfile)); diff --git a/gdb/testsuite/gdb.dlang/dlang-start.exp b/gdb/testsuite/gdb.dlang/dlang-start.exp new file mode 100644 index 00000000000..fd4688b0635 --- /dev/null +++ b/gdb/testsuite/gdb.dlang/dlang-start.exp @@ -0,0 +1,38 @@ +# Copyright (C) 2023 Free Software Foundation, Inc. + +# This program is free software; you can redistribute it and/or modify +# it under the terms of the GNU General Public License as published by +# the Free Software Foundation; either version 3 of the License, or +# (at your option) any later version. +# +# This program is distributed in the hope that it will be useful, +# but WITHOUT ANY WARRANTY; without even the implied warranty of +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +# GNU General Public License for more details. +# +# You should have received a copy of the GNU General Public License +# along with this program. If not, see . + +# Test "start" for D. + +load_lib d-support.exp +require allow_d_tests + +# This testcase verifies the behavior of the `start' command, which +# does not work when we use the gdb stub... +require !use_gdb_stub + +standard_testfile simple.d +if {[prepare_for_testing "failed to prepare" $testfile $srcfile {debug d}]} { + return -1 +} + +# Verify that "start" lands inside the right procedure. +if {[gdb_start_cmd] < 0} { + unsupported "start failed" + return -1 +} + +gdb_test "" \ + "main \\(\\) at .*simple.d.*" \ + "start" diff --git a/gdb/testsuite/gdb.dlang/simple.d b/gdb/testsuite/gdb.dlang/simple.d new file mode 100644 index 00000000000..b00884b1b9f --- /dev/null +++ b/gdb/testsuite/gdb.dlang/simple.d @@ -0,0 +1,17 @@ +// Copyright (C) 2023 Free Software Foundation, Inc. + +// This program is free software; you can redistribute it and/or modify +// it under the terms of the GNU General Public License as published by +// the Free Software Foundation; either version 3 of the License, or +// (at your option) any later version. +// +// This program is distributed in the hope that it will be useful, +// but WITHOUT ANY WARRANTY; without even the implied warranty of +// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +// GNU General Public License for more details. +// +// You should have received a copy of the GNU General Public License +// along with this program. If not, see . + +void main() { +} diff --git a/gdb/testsuite/gdb.dwarf2/main-subprogram.exp b/gdb/testsuite/gdb.dwarf2/main-subprogram.exp index 23f02df8513..149b7a801be 100644 --- a/gdb/testsuite/gdb.dwarf2/main-subprogram.exp +++ b/gdb/testsuite/gdb.dwarf2/main-subprogram.exp @@ -27,8 +27,9 @@ Dwarf::assemble $asm_file { global srcfile cu {} { + # Note we don't want C here as that requires canonicalization. DW_TAG_compile_unit { - {DW_AT_language @DW_LANG_C} + {DW_AT_language @DW_LANG_PLI} {DW_AT_name $srcfile} {DW_AT_comp_dir /tmp} } { diff --git a/gdb/testsuite/gdb.rust/rust-start.exp b/gdb/testsuite/gdb.rust/rust-start.exp new file mode 100644 index 00000000000..96ba2ae3ac8 --- /dev/null +++ b/gdb/testsuite/gdb.rust/rust-start.exp @@ -0,0 +1,38 @@ +# Copyright (C) 2023 Free Software Foundation, Inc. + +# This program is free software; you can redistribute it and/or modify +# it under the terms of the GNU General Public License as published by +# the Free Software Foundation; either version 3 of the License, or +# (at your option) any later version. +# +# This program is distributed in the hope that it will be useful, +# but WITHOUT ANY WARRANTY; without even the implied warranty of +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +# GNU General Public License for more details. +# +# You should have received a copy of the GNU General Public License +# along with this program. If not, see . + +# Test "start" for Rust. + +load_lib rust-support.exp +require allow_rust_tests + +# This testcase verifies the behavior of the `start' command, which +# does not work when we use the gdb stub... +require !use_gdb_stub + +standard_testfile simple.rs +if {[prepare_for_testing "failed to prepare" $testfile $srcfile {debug rust}]} { + return -1 +} + +# Verify that "start" lands inside the right procedure. +if {[gdb_start_cmd] < 0} { + unsupported "start failed" + return -1 +} + +gdb_test "" \ + "simple::main \\(\\) at .*simple.rs.*" \ + "start"