New locale: Yakut (Sakha) for Russia (sah_RU) [BZ #22241]

Message ID 1464907072.188232.1531768848214@poczta.nazwa.pl
State Committed
Headers

Commit Message

Rafal Luzynski July 16, 2018, 7:20 p.m. UTC
  This is approved by Mike [1] but as we are in a slushy freeze period
I'd like to have more approvals before pushing.  OK for 2.28?  Carlos?
Anybody?

Regards,

Rafal


[1] https://sourceware.org/bugzilla/show_bug.cgi?id=22241#c17
  

Comments

Carlos O'Donell July 18, 2018, 12:24 a.m. UTC | #1
On 07/16/2018 03:20 PM, Rafal Luzynski wrote:
> This is approved by Mike [1] but as we are in a slushy freeze period
> I'd like to have more approvals before pushing.  OK for 2.28?  Carlos?
> Anybody?
> 
> Regards,
> 
> Rafal
> 
> 
> [1] https://sourceware.org/bugzilla/show_bug.cgi?id=22241#c17
> 

OK for 2.28. Please commit.

Adds a new collation test. And approval from Mike is good for me.

Reviewed-by: Carlos O'Donell <carlos@redhat.com>
  
Rafal Luzynski July 18, 2018, 9:51 a.m. UTC | #2
18.07.2018 02:24 Carlos O'Donell <carlos@redhat.com> wrote:
>
> On 07/16/2018 03:20 PM, Rafal Luzynski wrote:
> > This is approved by Mike [1] but as we are in a slushy freeze period
> > I'd like to have more approvals before pushing. OK for 2.28? Carlos?
> > Anybody?
> >
> > Regards,
> >
> > Rafal
> >
> >
> > [1] https://sourceware.org/bugzilla/show_bug.cgi?id=22241#c17
> >
>
> OK for 2.28. Please commit.

Thank you.  This patch has been committed now.

> Adds a new collation test. And approval from Mike is good for me.

Yes, I had some problems with collation in sah_RU and having similar
problems with dsb_DE recently I tested and consulted this with Mike
multiple times.

Regards,

Rafal
  

Patch

From 6d533c1e72f76dd5b602948f57f9d32cec05c0fd Mon Sep 17 00:00:00 2001
From: Valery Timiriliyev <timiriliyev@gmail.com>
Date: Fri, 6 Jul 2018 14:46:13 +0900
Subject: [PATCH] New locale: Yakut (Sakha) for Russia (sah_RU) [BZ #22241]

	* localedata/Makefile (test-input): Add sah_RU.UTF-8.
	(LOCALES): Likewise.
	* localedata/SUPPORTED (sah_RU/UTF-8): New entry.
	* localedata/locales/sah_RU: New file.
	* localedata/sah_RU.UTF-8.in: New file.
---
 NEWS                       |   4 +-
 localedata/Makefile        |   2 +
 localedata/SUPPORTED       |   1 +
 localedata/locales/sah_RU  | 291 +++++++++++++++++++++++++++++++++++++++++++++
 localedata/sah_RU.UTF-8.in |  84 +++++++++++++
 5 files changed, 380 insertions(+), 2 deletions(-)
 create mode 100644 localedata/locales/sah_RU
 create mode 100644 localedata/sah_RU.UTF-8.in

diff --git a/NEWS b/NEWS
index 5de2c28..325158a 100644
--- a/NEWS
+++ b/NEWS
@@ -40,8 +40,8 @@  Major new features:
   Upper Sorbian, and Walloon.  The following languages now support
   abbreviated alternative month names: Catalan, Greek, and Kashubian.
 
-* New locale added: Lower Sorbian (dsb_DE), including the support of the
-  nominative and genitive month names.
+* Newly added locales: Lower Sorbian (dsb_DE) and Yakut (sah_RU) also
+  include the support of the nominative and genitive month names.
 
 * Building and running on GNU/Hurd systems now works without out-of-tree
   patches.
diff --git a/localedata/Makefile b/localedata/Makefile
index 5ac4f74..43d2d37 100644
--- a/localedata/Makefile
+++ b/localedata/Makefile
@@ -95,6 +95,7 @@  test-input := \
 	ps_AF.UTF-8 \
 	ro_RO.UTF-8 \
 	ru_RU.UTF-8 \
+	sah_RU.UTF-8 \
 	sc_IT.UTF-8 \
 	se_NO.UTF-8 \
 	si_LK.UTF-8 \
@@ -250,6 +251,7 @@  LOCALES := \
 	ps_AF.UTF-8 \
 	ro_RO.UTF-8 \
 	ru_RU.UTF-8 \
+	sah_RU.UTF-8 \
 	sc_IT.UTF-8 \
 	se_NO.UTF-8 \
 	si_LK.UTF-8 \
diff --git a/localedata/SUPPORTED b/localedata/SUPPORTED
index 74aa15d..05de4e3 100644
--- a/localedata/SUPPORTED
+++ b/localedata/SUPPORTED
@@ -384,6 +384,7 @@  ru_UA.UTF-8/UTF-8 \
 ru_UA/KOI8-U \
 rw_RW/UTF-8 \
 sa_IN/UTF-8 \
+sah_RU/UTF-8 \
 sat_IN/UTF-8 \
 sc_IT/UTF-8 \
 sd_IN/UTF-8 \
diff --git a/localedata/locales/sah_RU b/localedata/locales/sah_RU
new file mode 100644
index 0000000..65af029
--- /dev/null
+++ b/localedata/locales/sah_RU
@@ -0,0 +1,291 @@ 
+escape_char  /
+comment_char  %
+
+% Yakut (Sakha) locale for Russian Federation
+% Source: Valery Timiriliyev
+% Email: timiriliyev@gmail.com
+% Tel:
+% Fax:
+% Language: sah
+% Territory: RU
+% Revision: 1.1.0
+% Date: 2018-07-06
+% Users: general
+%
+
+LC_IDENTIFICATION
+title      "Yakut (Sakha) locale for Russian Federation"
+source     "Valery Timiriliyev"
+address    ""
+contact    "Valery Timiriliyev"
+email      "timiriliyev@gmail.com"
+tel        ""
+fax        ""
+language   "Yakut"
+territory  "Russian Federation"
+revision   "1.1.0"
+date       "2018-07-06"
+%
+category  "i18n:2012";LC_IDENTIFICATION
+category  "i18n:2012";LC_CTYPE
+category  "i18n:2012";LC_COLLATE
+category  "i18n:2012";LC_TIME
+category  "i18n:2012";LC_NUMERIC
+category  "i18n:2012";LC_MONETARY
+category  "i18n:2012";LC_MESSAGES
+category  "i18n:2012";LC_PAPER
+category  "i18n:2012";LC_MEASUREMENT
+category  "i18n:2012";LC_NAME
+category  "i18n:2012";LC_ADDRESS
+category  "i18n:2012";LC_TELEPHONE
+END LC_IDENTIFICATION
+
+LC_CTYPE
+copy "ru_RU"
+END LC_CTYPE
+
+LC_COLLATE
+copy "iso14651_t1"
+
+% As of July 2018, the Yakut collating rules are not provided by CLDR.
+% This content is based on the alphabet from Wikipedia:
+% https://en.wikipedia.org/wiki/Yakut_language#Writing_system
+%
+% А Б В Г Ҕ Д Дь Е Ё Ж З И Й К Л М Н Ҥ Нь О Ө П Р С Һ Т У Ү Ф Х Ц Ч Ш Щ
+% Ъ Ы Ь Э Ю Я
+
+collating-element <d-soft-sign> from "<U0434><U044C>"
+collating-element <d-Soft-sign> from "<U0434><U042C>"
+collating-element <D-soft-sign> from "<U0414><U044C>"
+collating-element <D-Soft-sign> from "<U0414><U042C>"
+collating-element <n-soft-sign> from "<U043D><U044C>"
+collating-element <n-Soft-sign> from "<U043D><U042C>"
+collating-element <N-soft-sign> from "<U041D><U044C>"
+collating-element <N-Soft-sign> from "<U041D><U042C>"
+
+collating-symbol <yak-ghe>
+collating-symbol <yak-eng>
+collating-symbol <yak-oe>
+collating-symbol <yak-he>
+collating-symbol <yak-ue>
+
+collating-symbol <d-soft-sign-digraph>
+collating-symbol <n-soft-sign-digraph>
+
+% Ҕ after Г and after Ӷ
+reorder-after <S04F7> % CYRILLIC SMALL LETTER GHE WITH DESCENDER
+<yak-ghe>
+
+% Дь after Д and after Ԃ
+reorder-after <S0503> % CYRILLIC SMALL LETTER KOMI DJE
+<d-soft-sign-digraph>
+
+% Ҥ after Н and after Ԋ
+reorder-after <S050B> % CYRILLIC SMALL LETTER KOMI NJE
+<yak-eng>
+% Followed by Нь
+<n-soft-sign-digraph>
+
+% Ө after О
+% This is already default in iso14651_t1_common, no reorder needed.
+
+% Һ after С and after Ҫ
+reorder-after <S04AB> % CYRILLIC SMALL LETTER ES WITH DESCENDER
+<yak-he>
+
+% Ү after У and after Ӯ but before Ұ, Ꙋ, and Ѹ
+% This is already default in iso14651_t1_common, better leave unchanged.
+
+<U0495> <yak-ghe>;<BASE>;<MIN>;IGNORE % Ò•
+<U0494> <yak-ghe>;<BASE>;<CAP>;IGNORE % Ò”
+<U04A5> <yak-eng>;<BASE>;<MIN>;IGNORE % Ò¥
+<U04A4> <yak-eng>;<BASE>;<CAP>;IGNORE % Ò¤
+<U04BB> <yak-he>;<BASE>;<MIN>;IGNORE % Ò»
+<U04BA> <yak-he>;<BASE>;<CAP>;IGNORE % Òº
+
+<d-soft-sign> <d-soft-sign-digraph>;<BASE>;"<MIN><MIN>";IGNORE % дь
+<d-Soft-sign> <d-soft-sign-digraph>;<BASE>;"<MIN><CAP>";IGNORE % дЬ
+<D-soft-sign> <d-soft-sign-digraph>;<BASE>;"<CAP><MIN>";IGNORE % Дь
+<D-Soft-sign> <d-soft-sign-digraph>;<BASE>;"<CAP><CAP>";IGNORE % ДЬ
+<n-soft-sign> <n-soft-sign-digraph>;<BASE>;"<MIN><MIN>";IGNORE % нь
+<n-Soft-sign> <n-soft-sign-digraph>;<BASE>;"<MIN><CAP>";IGNORE % нЬ
+<N-soft-sign> <n-soft-sign-digraph>;<BASE>;"<CAP><MIN>";IGNORE % Нь
+<N-Soft-sign> <n-soft-sign-digraph>;<BASE>;"<CAP><CAP>";IGNORE % НЬ
+
+reorder-end
+
+END LC_COLLATE
+
+LC_MONETARY
+copy "ru_RU"
+END LC_MONETARY
+
+LC_NUMERIC
+copy "ru_RU"
+END LC_NUMERIC
+
+LC_TIME
+% abday - The abbreviations for the week days:
+% - бс, бн, оп, ср, чп, бт, сб
+abday       "<U0431><U0441>";"<U0431><U043D>";/
+            "<U043E><U043F>";"<U0441><U0440>";/
+            "<U0447><U043F>";"<U0431><U0442>";/
+            "<U0441><U0431>"
+
+% day - The full names of the week days:
+% - баскыһыанньа, бэнидиэнньик, оптуорунньук,
+%   сэрэдэ, чэппиэр, бээтинсэ, субуота
+day         "<U0431><U0430><U0441><U043A><U044B><U04BB><U044B><U0430><U043D><U043D><U044C><U0430>";/
+            "<U0431><U044D><U043D><U0438><U0434><U0438><U044D><U043D><U043D><U044C><U0438><U043A>";/
+            "<U043E><U043F><U0442><U0443><U043E><U0440><U0443><U043D><U043D><U044C><U0443><U043A>";/
+            "<U0441><U044D><U0440><U044D><U0434><U044D>";/
+            "<U0447><U044D><U043F><U043F><U0438><U044D><U0440>";/
+            "<U0431><U044D><U044D><U0442><U0438><U043D><U0441><U044D>";/
+            "<U0441><U0443><U0431><U0443><U043E><U0442><U0430>"
+
+% abmon - The abbreviations for the months
+% - тохс, олун, кул, муус, ыам, бэс, от, атыр, бал, алт, сэт, ахс
+abmon       "<U0442><U043E><U0445><U0441>";"<U043E><U043B><U0443><U043D>";/
+            "<U043A><U0443><U043B>";"<U043C><U0443><U0443><U0441>";/
+            "<U044B><U0430><U043C>";"<U0431><U044D><U0441>";/
+            "<U043E><U0442>";"<U0430><U0442><U044B><U0440>";/
+            "<U0431><U0430><U043B>";"<U0430><U043B><U0442>";/
+            "<U0441><U044D><U0442>";"<U0430><U0445><U0441>"
+
+% mon - The full names of the months (genitive case) -
+% - тохсунньу, олунньу, кулун тутар, муус устар, ыам ыйын,
+%   бэс ыйын, от ыйын, атырдьах ыйын, балаҕан ыйын, алтынньы,
+%   сэтинньи, ахсынньы
+mon         "<U0442><U043E><U0445><U0441><U0443><U043D><U043D><U044C><U0443>";/
+            "<U043E><U043B><U0443><U043D><U043D><U044C><U0443>";/
+            "<U043A><U0443><U043B><U0443><U043D> <U0442><U0443><U0442><U0430><U0440>";/
+            "<U043C><U0443><U0443><U0441> <U0443><U0441><U0442><U0430><U0440>";/
+            "<U044B><U0430><U043C> <U044B><U0439><U044B><U043D>";/
+            "<U0431><U044D><U0441> <U044B><U0439><U044B><U043D>";/
+            "<U043E><U0442> <U044B><U0439><U044B><U043D>";/
+            "<U0430><U0442><U044B><U0440><U0434><U044C><U0430><U0445> <U044B><U0439><U044B><U043D>";/
+            "<U0431><U0430><U043B><U0430><U0495><U0430><U043D> <U044B><U0439><U044B><U043D>";/
+            "<U0430><U043B><U0442><U044B><U043D><U043D><U044C><U044B>";/
+            "<U0441><U044D><U0442><U0438><U043D><U043D><U044C><U0438>";/
+            "<U0430><U0445><U0441><U044B><U043D><U043D><U044C><U044B>"
+
+% alt_mon - The full names of the months (nominative case) -
+% - тохсунньу, олунньу, кулун тутар, муус устар, ыам ыйа,
+%   бэс ыйа, от ыйа, атырдьах ыйа, балаҕан ыйа, алтынньы,
+%   сэтинньи, ахсынньы
+alt_mon     "<U0442><U043E><U0445><U0441><U0443><U043D><U043D><U044C><U0443>";/
+            "<U043E><U043B><U0443><U043D><U043D><U044C><U0443>";/
+            "<U043A><U0443><U043B><U0443><U043D> <U0442><U0443><U0442><U0430><U0440>";/
+            "<U043C><U0443><U0443><U0441> <U0443><U0441><U0442><U0430><U0440>";/
+            "<U044B><U0430><U043C> <U044B><U0439><U0430>";/
+            "<U0431><U044D><U0441> <U044B><U0439><U0430>";/
+            "<U043E><U0442> <U044B><U0439><U0430>";/
+            "<U0430><U0442><U044B><U0440><U0434><U044C><U0430><U0445> <U044B><U0439><U0430>";/
+            "<U0431><U0430><U043B><U0430><U0495><U0430><U043D> <U044B><U0439><U0430>";/
+            "<U0430><U043B><U0442><U044B><U043D><U043D><U044C><U044B>";/
+            "<U0441><U044D><U0442><U0438><U043D><U043D><U044C><U0438>";/
+            "<U0430><U0445><U0441><U044B><U043D><U043D><U044C><U044B>"
+
+% Abbreviated date and time representation to be referenced by the "%c" field
+% descriptor -
+%
+% "%a" (short weekday name),
+% "%Y" (year with century as a decimal number),
+% "%b" (short month name),
+% "%d" (day of month as a decimal number),
+% "%T" (24-hour clock time in format HH:MM:SS),
+% "%Z" (Time zone name)
+d_t_fmt "%a %Y %b %d %T (%Z)"
+
+% Date representation to be referenced by the "%x" field descriptor -
+% "%d/%m/%Y", day/month/year as decimal numbers (01/01/2000).
+d_fmt       "%Y.%m.%d"
+
+% Time representation to be referenced by the "%X" field descriptor -
+% "%T" (24-hour clock time in format HH:MM:SS)
+t_fmt       "%T"
+
+% Define representation of ante meridiem and post meridiem strings -
+% The "" mean default to "AM" and "PM".
+am_pm       "";""
+
+% Define time representation in 12-hour format with "am_pm", to be referenced by the "%r"
+% The "" means that this format is not supported.
+t_fmt_ampm  ""
+
+% Date representation not described in ISO/IEC 14652. Comes out as -
+% "%a %b %e %H:%M:%S %Z %Y" which is default "date" command output
+%
+% %a - abbreviated weekday name,
+% %Y - year with century as a decimal number,e.g. 2001.
+% %e - day of month as a decimal number with leading space (1 to 31),
+% %B - full month name,
+% %T - (24-hour clock time in format HH:MM:SS),
+% %Z - time-zone name,
+% "%a %Y с. %B %e к. %T (%Z)"
+date_fmt "%a %Y <U0441>. %B %e <U043A>. %T (%Z)"
+
+week    7;19971130;1
+first_weekday 2
+first_workday 2
+END LC_TIME
+
+LC_MESSAGES
+% The affirmative response -
+% '^[+1yYдДэЭ]'
+yesexpr "^[+1yY<U0434><U0414><U044D><U042D>]"
+
+% The negative response -
+% '^[-0nNнНсС]'
+noexpr "^[-0nN<U043D><U041D><U0441><U0421>]"
+
+% 'ээх'
+yesstr "<U044D><U044D><U0445>"
+
+% 'суох'
+nostr "<U0441><U0443><U043E><U0445>"
+END LC_MESSAGES
+
+LC_PAPER
+copy "ru_RU"
+END LC_PAPER
+
+LC_TELEPHONE
+copy "ru_RU"
+END LC_TELEPHONE
+
+LC_MEASUREMENT
+copy "ru_RU"
+END LC_MEASUREMENT
+
+LC_NAME
+copy "ru_RU"
+END LC_NAME
+
+LC_ADDRESS
+postal_fmt    "%f%N%a%N%d%N%b%N%s %h %e %r%N%z %T%N%c%N"
+
+% Россия
+country_name  "<U0420><U043E><U0441><U0441><U0438><U044F>"
+
+% Саха тыла
+lang_name     "<U0421><U0430><U0445><U0430> <U0442><U044B><U043B><U0430>"
+
+% UN Geneve 1949:68 Distinguishing signs of vehicles in international traffic
+% RUS
+country_car    "RUS"
+
+% ISO 639 language abbreviations:
+% 639-1 2 letter, 639-2 3 letter terminology
+% (empty), sah, sah
+lang_ab       ""
+lang_term     "sah"
+lang_lib      "sah"
+
+% ISO 3166 country number and 2 and 3 letter abreviations
+% RU, RUS
+country_ab2   "RU"
+country_ab3   "RUS"
+country_num   643
+
+END LC_ADDRESS
diff --git a/localedata/sah_RU.UTF-8.in b/localedata/sah_RU.UTF-8.in
new file mode 100644
index 0000000..324aaf9
--- /dev/null
+++ b/localedata/sah_RU.UTF-8.in
@@ -0,0 +1,84 @@ 
+а
+А
+б
+Б
+в
+Ð’
+г
+Г
+Ò•
+Ò”
+д
+Д
+дэ
+дя
+дь
+Дь
+е
+Е
+Ñ‘
+Ё
+ж
+Ж
+з
+З
+и
+И
+й
+Й
+к
+К
+л
+Л
+м
+М
+н
+Н
+нэ
+ня
+Ò¥
+Ò¤
+нь
+Нь
+о
+О
+Ó©
+Ó¨
+п
+П
+Ñ€
+Р
+с
+С
+Ò»
+Òº
+Ñ‚
+Т
+у
+У
+Ò¯
+Ò®
+Ñ„
+Ф
+Ñ…
+Ð¥
+ц
+Ц
+ч
+Ч
+ш
+Ш
+щ
+Щ
+ÑŠ
+Ъ
+Ñ‹
+Ы
+ь
+Ь
+э
+Э
+ÑŽ
+Ю
+я
+Я
-- 
2.7.5