From patchwork Mon Jun 7 02:07:44 2021 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Tom Honermann X-Patchwork-Id: 43727 Return-Path: X-Original-To: patchwork@sourceware.org Delivered-To: patchwork@sourceware.org Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 426DD3857807 for ; Mon, 7 Jun 2021 02:08:16 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 426DD3857807 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sourceware.org; s=default; t=1623031696; bh=vOKEwwuKfs4yNE31OAjbyQPGlGsjXFVTRKXxQPX/ato=; h=Subject:To:Date:List-Id:List-Unsubscribe:List-Archive:List-Post: List-Help:List-Subscribe:From:Reply-To:From; b=U2PiRm83INW3OCVZ2MRDqjLVDtTqXLF308fa1uqiO/QitcUOfCRvZuGnGk+MfbH4i H3P0ldCniyAvelMXYtf+nEeyGReebetbWATQMSSuvB7FxRYwnSMLmHSXGWoXgkGxNH yX5rq9cgr77nwj6IdQl+FaKZEHqAi4ELm1v+ZSsw= X-Original-To: libc-alpha@sourceware.org Delivered-To: libc-alpha@sourceware.org Received: from smtp107.ord1d.emailsrvr.com (smtp107.ord1d.emailsrvr.com [184.106.54.107]) by sourceware.org (Postfix) with ESMTPS id C06B13858C27 for ; Mon, 7 Jun 2021 02:07:45 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org C06B13858C27 X-Auth-ID: tom@honermann.net Received: by smtp14.relay.ord1d.emailsrvr.com (Authenticated sender: tom-AT-honermann.net) with ESMTPSA id 0B9B6400AC for ; Sun, 6 Jun 2021 22:07:44 -0400 (EDT) Subject: [PATCH 0/3]: C++20 P0482R6 and C2X N2653: support for char8_t, mbrtoc8(), and c8rtomb(). To: libc-alpha Message-ID: <8c4dcb0d-c2d9-0182-034d-b217e7881d77@honermann.net> Date: Sun, 6 Jun 2021 22:07:44 -0400 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0 MIME-Version: 1.0 Content-Language: en-US X-Classification-ID: d182bef8-b957-4292-9ba7-3f7eef7961af-1-1 X-Spam-Status: No, score=-5.1 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, KAM_SHORT, RCVD_IN_DNSWL_NONE, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=ham autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: Tom Honermann via Libc-alpha From: Tom Honermann Reply-To: Tom Honermann Errors-To: libc-alpha-bounces+patchwork=sourceware.org@sourceware.org Sender: "Libc-alpha" This series of patches provides the following: - A fix for bug 25744 [1]. - Implementations of the mbrtoc8 and c8rtomb functions adopted for C++20 via WG21 P0482R6 [2] and proposed for C2X in WG14 N2653 [3]. - A char8_t typedef as proposed for C2X in WG14 N2653 [3]. - A _CHAR8_T_SOURCE feature test macro to be used to opt-in to support for the char8_t typedef and mbrtoc8/c8rtomb functions in pre-C++20 or C source code. Patch 1: A fix and test for bug 25744 [1]. Patch 2: Definitions of the mbrtoc8 and c8rtomb functions, the char8_t typedef, and the _CHAR8_T_SOURCE feature test macro. Patch 3: Tests for the mbrtoc8 and c8rtomb functions and the char8_t typedef. The fix for bug 25744 [1] is included in this patch series because the tests for mbrtoc8 and c8rtomb depend on it for exercising the special case where a pair of Unicode code points is converted to/from a single double byte sequence. Such conversion cases exist for Big5-HKSCS. Tom. [1]: Bug 25744 "mbrtowc with Big5-HKSCS returns 2 instead of 1 when consuming the second byte of certain double byte characters" https://sourceware.org/bugzilla/show_bug.cgi?id=25744 [2]: WG21 P0482R6 "char8_t: A type for UTF-8 characters and strings (Revision 6)" https://wg21.link/p0482r6 [3]: WG14 N2653 "char8_t: A type for UTF-8 characters and strings (Revision 1)" http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2653.htm