From patchwork Fri Jan 17 20:07:38 2020 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: Mike FABIAN X-Patchwork-Id: 37428 Received: (qmail 96926 invoked by alias); 17 Jan 2020 20:07:51 -0000 Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: libc-alpha-owner@sourceware.org Delivered-To: mailing list libc-alpha@sourceware.org Received: (qmail 96916 invoked by uid 89); 17 Jan 2020 20:07:51 -0000 Authentication-Results: sourceware.org; auth=none X-Spam-SWARE-Status: No, score=-16.6 required=5.0 tests=AWL, BAYES_00, BODY_8BITS, GARBLED_BODY, GIT_PATCH_0, GIT_PATCH_1, GIT_PATCH_2, GIT_PATCH_3, RCVD_IN_DNSWL_NONE autolearn=ham version=3.3.1 spammy=H*M:site, H*MI:site, SIGN, letter X-HELO: us-smtp-1.mimecast.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1579291666; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type; bh=SokeNu5ehCEfUrJT1NeQ9Lzj9b6r9LIEhGq5Dw51aa8=; b=PDnpoxVaolfKY1vvC3rc8j+HJxAZfNehn0JyYARowO6Vv5bGt8+5rGsvIYKE6uwVl7QI9a JFfMZXNem2iIQECf2psrVb9I0OiX5k/yAlRBwGwXZ75HOJ/vQAg10bYFKzPcQJ++LK/fyc C1N1ULX68I+SrRjCtMcBpH2mkZzFkFs= Return-Path: From: Mike FABIAN To: libc-alpha@sourceware.org Cc: Ankit Behera Subject: [PATCH] Update or_IN collation [BZ #22525] Date: Fri, 17 Jan 2020 21:07:38 +0100 Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux) MIME-Version: 1.0 X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Update or_IN collation [BZ #22525] - Add a test file or_IN.UTF-8.in. - Make the collation agree with CLDR. Ankit Behera (native speaker of Odia) was so kind to check that the order in the test file or_IN.UTF-8.in is correct. From 9fa73af31aa96bd2d91c2aea4822aab5771952b4 Mon Sep 17 00:00:00 2001 From: Mike FABIAN Date: Thu, 5 Dec 2019 14:21:21 +0100 Subject: [PATCH] Update or_IN collation [BZ #22525] - Add a test file or_IN.UTF-8.in. - Make the collation agree with CLDR. --- localedata/Makefile | 2 + localedata/locales/or_IN | 565 +++----------------------------------- localedata/or_IN.UTF-8.in | 145 ++++++++++ 3 files changed, 186 insertions(+), 526 deletions(-) create mode 100644 localedata/or_IN.UTF-8.in diff --git a/localedata/Makefile b/localedata/Makefile index 75da312579..e89bacc1aa 100644 --- a/localedata/Makefile +++ b/localedata/Makefile @@ -92,6 +92,7 @@ test-input := \ nan_TW.UTF-8@latin \ nb_NO.UTF-8 \ om_KE.UTF-8 \ + or_IN.UTF-8 \ os_RU.UTF-8 \ pl_PL.UTF-8 \ ps_AF.UTF-8 \ @@ -258,6 +259,7 @@ LOCALES := \ nl_NL.UTF-8 \ nn_NO.ISO-8859-1 \ om_KE.UTF-8 \ + or_IN.UTF-8 \ os_RU.UTF-8 \ pl_PL.UTF-8 \ ps_AF.UTF-8 \ diff --git a/localedata/locales/or_IN b/localedata/locales/or_IN index 67434238a0..8c521512dc 100644 --- a/localedata/locales/or_IN +++ b/localedata/locales/or_IN @@ -68,541 +68,54 @@ END LC_CTYPE %%%%%%%%%%%%% LC_COLLATE -collating-element from "" -collating-element from "" -collating-element from "" +% CLDR collation rules for Odia: +% (see: https://github.com/unicode-org/cldr/blob/master/common/collation/or.xml) +% +% # The following tailoring is an adjustment of the +% # DUCET collation order for CANDRABINDU, ANUSVARA, +% # and VISARGA. Instead of being sorted with secondary +% # weights, they are collated on primary level between +% # the independent vowels and the consonants. +% # +% &ଔ<ଁ<ଂ<ଃ +% # KSSA is sorted after HA. +% &ହ<କ୍ଷ +% # YYA is secondary different from YA. +% &ଯ<<ୟ +% +% The following rules implement the same order for glibc. + +copy "iso14651_t1" -% Define collation weights as symbols +collating-symbol +collating-symbol +collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol -collating-symbol +collating-element from "" % କ ୍ ଷ = କ୍ଷ +collating-element from "" % ଡ ଼ = ଡ଼ +collating-element from "" % ଢ ଼ = ଢ଼ -order_start forward;forward;forward;forward +collating-symbol +% &ଔ<ଁ<ଂ<ଃ +reorder-after % ORIYA LETTER AU + + + -% collation weights in order + ;;; % ORIYA SIGN CANDRABINDU + ;;; % ORIYA SIGN ANUSVARA + ;;; % ORIYA SIGN VISARGA - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - +% &ହ<କ୍ଷ +reorder-after % ORIYA LETTER HA + -% assignment of characters to weights + ;;;"" % କ ୍ ଷ = କ୍ଷ - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - IGNORE;IGNORE;IGNORE; - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;"";"";IGNORE - ;;;IGNORE - ;"";"";IGNORE - ;;;IGNORE - ;"";"";IGNORE - ;;;IGNORE - ;"";"";IGNORE - ;;;IGNORE - ;"";"";IGNORE - ;;;IGNORE - ;"";"";IGNORE - ;;;IGNORE - ;"";"";IGNORE - ;;;IGNORE - ;"";"";IGNORE - ;;;IGNORE - ;"";"";IGNORE - ;;;IGNORE - ;"";"";IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - "";;"";IGNORE - "";;"";"" - "";;"";"" - "";;"";"" - "";;"";"" - "";;"";IGNORE -% FIXME ORIYA_LETTER_WA "";;"";IGNORE -% FIXME ORIYA_LETTER_VA "";;"";IGNORE -% ORIYA_LETTER_WA and ORIYA_LETTER_VA are not defined in unicode 3.0 - "";;"";IGNORE - "";;"";IGNORE - "";;"";IGNORE - "";;"";IGNORE - "";;"";IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE - ;;;IGNORE -UNDEFINED IGNORE;IGNORE;IGNORE;%... +% &ଯ<<ୟ + ;"";; % ORIYA LETTER YYA -order_end END LC_COLLATE %%%%%%%%%%%%% diff --git a/localedata/or_IN.UTF-8.in b/localedata/or_IN.UTF-8.in new file mode 100644 index 0000000000..c1f6afb347 --- /dev/null +++ b/localedata/or_IN.UTF-8.in @@ -0,0 +1,145 @@ +0 +୦ +1 +୧ +2 +୨ +3 +୩ +4 +୪ +5 +୫ +6 +୬ +7 +୭ +8 +୮ +9 +୯ +a +A +b +B +c +C +d +D +e +E +f +F +g +G +h +H +i +I +j +J +k +K +l +L +m +M +n +N +o +O +p +P +q +Q +r +R +s +S +t +T +u +U +v +V +w +W +x +X +y +Y +z +Z +ଅ +ଆ +ଇ +ଈ +ଉ +ଊ +ଋ +ୠ +ଌ +ୡ +ଏ +ଐ +ଓ +ଔ +ଁ +ଂ +ଃ +କ +ଖ +ଗ +ଘ +ଙ +ଚ +ଛ +ଜ +ଝ +ଞ +ଟ +ଠ +ଡ +ଡ଼ +ଢ +ଢ଼ +ଢ଼ +ଣ +ତ +ଥ +ଦ +ଧ +ନ +ପ +ଫ +ବ +ଭ +ମ +ଯ +ୟ +ଯଯ +ୟଯ +ର +ଲ +ଳ +ଵ +ୱ +ଶ +ଷ +ସ +ହ +କ୍ଷ +ଽ +ା +ି +ୀ +ୁ +ୂ +ୃ +େ +ୈ +ୋ +ୌ +୍ +ୖ +ୗ -- 2.24.1