[0/2] C.UTF-8
Message ID | 20210726041828.4180653-1-carlos@redhat.com |
---|---|
Headers |
Return-Path: <libc-alpha-bounces+patchwork=sourceware.org@sourceware.org> X-Original-To: patchwork@sourceware.org Delivered-To: patchwork@sourceware.org Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id C6E72384840F for <patchwork@sourceware.org>; Mon, 26 Jul 2021 04:18:59 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org C6E72384840F DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sourceware.org; s=default; t=1627273139; bh=1Kra1FxupuGbEeQS89KcsHCkwxUFCiylkT5BmDcHpFo=; h=To:Subject:Date:List-Id:List-Unsubscribe:List-Archive:List-Post: List-Help:List-Subscribe:From:Reply-To:From; b=UJ8uR0oYBzFYZFgoBokqJuI1NU6uNMfnXW7fyZr8z87ev/QswrPfz3bI+wuCD22mq DpVmGBDKzjmqk9Xtn0hhLhuvUCyu0cDq68ddtCHyjEY+NPnGxqIWE9ZJqNXBjOwWqe W+uiZ0VbXdBd2JiWwMjHLbKKHjILzl2p5BcT8h6c= X-Original-To: libc-alpha@sourceware.org Delivered-To: libc-alpha@sourceware.org Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by sourceware.org (Postfix) with ESMTP id 4B2C2385AC1F for <libc-alpha@sourceware.org>; Mon, 26 Jul 2021 04:18:38 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 4B2C2385AC1F Received: from mail-qt1-f198.google.com (mail-qt1-f198.google.com [209.85.160.198]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-410-Y794N9_JMWaRHi4_otj9wA-1; Mon, 26 Jul 2021 00:18:36 -0400 X-MC-Unique: Y794N9_JMWaRHi4_otj9wA-1 Received: by mail-qt1-f198.google.com with SMTP id v7-20020ac874870000b029024e8ccfcd07so3554042qtq.11 for <libc-alpha@sourceware.org>; Sun, 25 Jul 2021 21:18:36 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=1Kra1FxupuGbEeQS89KcsHCkwxUFCiylkT5BmDcHpFo=; b=UoQ6/6prRfA8bABsLp7IyLuURUXF4IxFx5/zOCZhD+ViQ+/NTAMbxOEofSb3j+V5tu gzzXc1sQeCUByZ7aAJ2OAHaUnpkdxEfEU2k2HrpAWv4oGD8/weeV8n7001B3oRcU+AAa DRw5JNT6Mj0zZh8bvM2Kjk7bqApWZuokSPWXRNP1GHiGk5qxYwfJD97XWyo+jzZmKWT6 DYTy3PwqJqycLkr3tduOc3kwYlaCWYliiXmb4F4FqMonzk66Uxi4EprCtVff/ng3VTXv 4vef9h4DjboEo37VIPdnrhCmae7DlMscs42kcyBCuTLhOuQ53x0mtw5doo2/64od966H Qd6g== X-Gm-Message-State: AOAM532HQY2d2ub9IkPH4Doz9Ia+lnMT8UuL651Q0IGX89OBM64WZ3SR HAr0jfn173fsOMX2BViupU0NjY7zopAgfLB+B4BbhbYBoJXrlqEXIRTteV8j4pGXGXGKPyzNU0k fkEupDjTCXg4WwvVWJ2UniovGojsmYfLzt929c9gUZmcED8w2uHerswJZyyOe+ul1O5Kccg== X-Received: by 2002:a37:a5c9:: with SMTP id o192mr15749762qke.58.1627273115938; Sun, 25 Jul 2021 21:18:35 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxXX9wnqYJWTDd0NjNFNe1MsTVVJJgdRyzImLsU5BLah7krSmbbt2XQ2AVJeCqxm+UjGAT4sg== X-Received: by 2002:a37:a5c9:: with SMTP id o192mr15749753qke.58.1627273115716; Sun, 25 Jul 2021 21:18:35 -0700 (PDT) Received: from athas.redhat.com (198-84-214-74.cpe.teksavvy.com. [198.84.214.74]) by smtp.gmail.com with ESMTPSA id a5sm10186992qkf.88.2021.07.25.21.18.34 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 25 Jul 2021 21:18:34 -0700 (PDT) To: libc-alpha@sourceware.org Subject: [PATCH 0/2] C.UTF-8 Date: Mon, 26 Jul 2021 00:18:26 -0400 Message-Id: <20210726041828.4180653-1-carlos@redhat.com> X-Mailer: git-send-email 2.31.1 MIME-Version: 1.0 X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset="US-ASCII" X-Spam-Status: No, score=-5.8 required=5.0 tests=BAYES_00, DKIMWL_WL_HIGH, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, KAM_NUMSUBJECT, RCVD_IN_DNSWL_LOW, RCVD_IN_MSPIKE_H4, RCVD_IN_MSPIKE_WL, SPF_HELO_NONE, SPF_NONE, SUBJ_ALL_CAPS, TXREP autolearn=ham autolearn_force=no version=3.4.4 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list <libc-alpha.sourceware.org> List-Unsubscribe: <https://sourceware.org/mailman/options/libc-alpha>, <mailto:libc-alpha-request@sourceware.org?subject=unsubscribe> List-Archive: <https://sourceware.org/pipermail/libc-alpha/> List-Post: <mailto:libc-alpha@sourceware.org> List-Help: <mailto:libc-alpha-request@sourceware.org?subject=help> List-Subscribe: <https://sourceware.org/mailman/listinfo/libc-alpha>, <mailto:libc-alpha-request@sourceware.org?subject=subscribe> From: Carlos O'Donell via Libc-alpha <libc-alpha@sourceware.org> Reply-To: Carlos O'Donell <carlos@redhat.com> Errors-To: libc-alpha-bounces+patchwork=sourceware.org@sourceware.org Sender: "Libc-alpha" <libc-alpha-bounces+patchwork=sourceware.org@sourceware.org> |
Message
Carlos O'Donell
July 26, 2021, 4:18 a.m. UTC
The following changes implement a minimally sized C.UTF-8. First we implement the 'strcmp_collation' directive. Then we implement C.UTF-8 with an LC_COLLATE that uses the 'strcmp_collation' directive to support using strcmp for collation i.e. code point sorting. The final C.UTF-8 is only ~396KiB with the largest ~346KiB in LC_CTYPE for all of Unicode. Carlos O'Donell (2): Add 'strcmp_collation' support for LC_COLLATE. Add generic C.UTF-8 locale (Bug 17318) iconv/Makefile | 22 ++- iconv/tst-iconv9.c | 87 +++++++++ locale/programs/ld-collate.c | 24 ++- locale/programs/locfile-kw.gperf | 1 + locale/programs/locfile-kw.h | 306 ++++++++++++++++--------------- locale/programs/locfile-token.h | 1 + localedata/C.UTF-8.in | 157 ++++++++++++++++ localedata/Makefile | 2 + localedata/SUPPORTED | 1 + localedata/locales/C | 194 ++++++++++++++++++++ 10 files changed, 637 insertions(+), 158 deletions(-) create mode 100644 iconv/tst-iconv9.c create mode 100644 localedata/C.UTF-8.in create mode 100644 localedata/locales/C