[v3] localedata: Fix several issues with the set of characters considered 0-width [BZ #31370]
Checks
Context |
Check |
Description |
redhat-pt-bot/TryBot-apply_patch |
success
|
Patch applied to master at the time it was sent
|
linaro-tcwg-bot/tcwg_glibc_build--master-arm |
success
|
Testing passed
|
redhat-pt-bot/TryBot-32bit |
success
|
Build for i686
|
linaro-tcwg-bot/tcwg_glibc_check--master-arm |
success
|
Testing passed
|
linaro-tcwg-bot/tcwg_glibc_build--master-aarch64 |
success
|
Testing passed
|
linaro-tcwg-bot/tcwg_glibc_check--master-aarch64 |
success
|
Testing passed
|
Commit Message
= `Default_Ignorable_Code_Point`s should have width 0 =
Unicode specifies (https://www.unicode.org/faq/unsup_char.html#3) that characters
with the `Default_Ignorable_Code_Point` property
> should be rendered as completely invisible (and non advancing, i.e. “zero width”),
if not explicitly supported in rendering.
Hence, `wcwidth()` should give them all a width of 0, with two exceptions:
- the soft hyphen (U+00AD SOFT HYPHEN) is assigned width 1 by longstanding precedent
- U+115F HANGUL CHOSEONG FILLER needs a carveout
due to the unique behavior of the conjoining Korean jamo characters.
One composed Hangul "syllable block" like 퓛
is made up of two to three individual component characters, or "jamo".
These are all assigned an `East_Asian_Width` of `Wide`
by Unicode, which would normally mean they would all be assigned
width 2 by glibc; a combination of (leading choseong jamo) +
(medial jungseong jamo) + (trailing jongseong jamo) would then have width 2 + 2 + 2 = 6.
However, glibc (and other wcwidth implementations) special-cases jungseong and jongseong,
assigning them all width 0,
to ensure that the complete block has width 2 + 0 + 0 = 2 as it should.
U+115F is meant for use in syllable blocks
that are intentionally missing a leading jamo;
it must be assigned a width of 2 even though it has no visible display
to ensure that the complete block has width 2.
However, `wcwidth()` currently (before this patch)
incorrectly assigns non-zero width to
U+3164 HANGUL FILLER and U+FFA0 HALFWIDTH HANGUL FILLER;
this commit fixes that.
Unicode spec references:
- Hangul: §3.12 https://www.unicode.org/versions/Unicode15.0.0/ch03.pdf#G24646 and
§18.6 https://www.unicode.org/versions/Unicode15.0.0/ch18.pdf#G31028
- `Default_Ignorable_Code_Point`: §5.21 https://www.unicode.org/versions/Unicode15.0.0/ch05.pdf#G40095.
= Non-`Default_Ignorable_Code_Point` format controls should be visible =
The Unicode Standard, §5.21 - Characters Ignored for Display
(https://www.unicode.org/versions/Unicode15.0.0/ch05.pdf#G40095)
says the following:
> A small number of format characters (General_Category = Cf )
> are also not given the Default_Ignorable_Code_Point property.
> This may surprise implementers, who often assume
> that all format characters are generally ignored in fallback display.
> The exact list of these exceptional format characters
> can be found in the Unicode Character Database.
> There are, however, three important sets of such format characters to note:
>
> - prepended concatenation marks
> - interlinear annotation characters
> - Egyptian hieroglyph format controls
>
> The prepended concatenation marks always have a visible display.
> See “Prepended Concatenation Marks” in [*Section 23.2, Layout Controls*](https://www.unicode.org/versions/Unicode15.1.0/ch23.pdf#M9.35858.HeadingBreak.132.Layout.Controls)
> for more discussion of the use and display of these signs.
>
> The other two notable sets of format characters that exceptionally are not ignored
> in fallback display consist of the interlinear annotation characters,
> U+FFF9 INTERLINEAR ANNOTATION ANCHOR through
> U+FFFB INTERLINEAR ANNOTATION TERMINATOR,
> and the Egyptian hieroglyph format controls,
> U+13430 EGYPTIAN HIEROGLYPH VERTICAL JOINER through
> U+1343F EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE.
> These characters should have a visible glyph display for fallback rendering,
> because if they are not displayed,
> it is too easy to misread the resulting displayed text.
> See “Annotation Characters” in [*Section 23.8, Specials*](https://www.unicode.org/versions/Unicode15.1.0/ch23.pdf#M9.21335.Heading.133.Specials),
> as well as [*Section 11.4, Egyptian Hieroglyphs*](https://www.unicode.org/versions/Unicode15.1.0/ch11.pdf#M9.73291.Heading.1418.Egyptian.Hieroglyphs)
> for more discussion of the use and display of these characters.
glibc currently correctly assigns non-zero width to the prepended concatenation marks,
but it incorrectly gives zero width to the interlinear annotation characters
(which a generic terminal cannot interpret)
and the Egyptian hieroglyph format controls
(which are not widely supported in rendering implementations at present).
This commit fixes both these issues as well.
= Derive Hangul syllable type from Unicode data =
Previosuly, the jungseong and jongseong jamo ranges
were hard-coded into the script. With this commit, they are instead parsed
from the HangulSyllableType.txt data file published by Unicode.
This does not affect the end result.
Signed-off-by: Jules Bertholet <julesbertholet@quoi.xyz>
---
SHARED-FILES | 2 +-
localedata/charmaps/UTF-8 | 34 +-
localedata/unicode-gen/HangulSyllableType.txt | 858 ++++++++
localedata/unicode-gen/Makefile | 9 +-
localedata/unicode-gen/PropList.txt | 1827 -----------------
localedata/unicode-gen/utf8_gen.py | 145 +-
6 files changed, 978 insertions(+), 1897 deletions(-)
create mode 100644 localedata/unicode-gen/HangulSyllableType.txt
delete mode 100644 localedata/unicode-gen/PropList.txt
Comments
This patch looks good to me. Thank ou very much!
Jules Bertholet via Libc-locales <libc-locales@sourceware.org> さんはかきました:
> = `Default_Ignorable_Code_Point`s should have width 0 =
>
> Unicode specifies (https://www.unicode.org/faq/unsup_char.html#3) that characters
> with the `Default_Ignorable_Code_Point` property
>
>> should be rendered as completely invisible (and non advancing, i.e. “zero width”),
> if not explicitly supported in rendering.
>
> Hence, `wcwidth()` should give them all a width of 0, with two exceptions:
>
> - the soft hyphen (U+00AD SOFT HYPHEN) is assigned width 1 by longstanding precedent
> - U+115F HANGUL CHOSEONG FILLER needs a carveout
> due to the unique behavior of the conjoining Korean jamo characters.
> One composed Hangul "syllable block" like 퓛
> is made up of two to three individual component characters, or "jamo".
> These are all assigned an `East_Asian_Width` of `Wide`
> by Unicode, which would normally mean they would all be assigned
> width 2 by glibc; a combination of (leading choseong jamo) +
> (medial jungseong jamo) + (trailing jongseong jamo) would then have width 2 + 2 + 2 = 6.
> However, glibc (and other wcwidth implementations) special-cases jungseong and jongseong,
> assigning them all width 0,
> to ensure that the complete block has width 2 + 0 + 0 = 2 as it should.
> U+115F is meant for use in syllable blocks
> that are intentionally missing a leading jamo;
> it must be assigned a width of 2 even though it has no visible display
> to ensure that the complete block has width 2.
>
> However, `wcwidth()` currently (before this patch)
> incorrectly assigns non-zero width to
> U+3164 HANGUL FILLER and U+FFA0 HALFWIDTH HANGUL FILLER;
> this commit fixes that.
>
> Unicode spec references:
> - Hangul: §3.12 https://www.unicode.org/versions/Unicode15.0.0/ch03.pdf#G24646 and
> §18.6 https://www.unicode.org/versions/Unicode15.0.0/ch18.pdf#G31028
> - `Default_Ignorable_Code_Point`: §5.21 https://www.unicode.org/versions/Unicode15.0.0/ch05.pdf#G40095.
>
> = Non-`Default_Ignorable_Code_Point` format controls should be visible =
>
> The Unicode Standard, §5.21 - Characters Ignored for Display
> (https://www.unicode.org/versions/Unicode15.0.0/ch05.pdf#G40095)
> says the following:
>
>> A small number of format characters (General_Category = Cf )
>> are also not given the Default_Ignorable_Code_Point property.
>> This may surprise implementers, who often assume
>> that all format characters are generally ignored in fallback display.
>> The exact list of these exceptional format characters
>> can be found in the Unicode Character Database.
>> There are, however, three important sets of such format characters to note:
>>
>> - prepended concatenation marks
>> - interlinear annotation characters
>> - Egyptian hieroglyph format controls
>>
>> The prepended concatenation marks always have a visible display.
>> See “Prepended Concatenation Marks” in [*Section 23.2, Layout Controls*](https://www.unicode.org/versions/Unicode15.1.0/ch23.pdf#M9.35858.HeadingBreak.132.Layout.Controls)
>> for more discussion of the use and display of these signs.
>>
>> The other two notable sets of format characters that exceptionally are not ignored
>> in fallback display consist of the interlinear annotation characters,
>> U+FFF9 INTERLINEAR ANNOTATION ANCHOR through
>> U+FFFB INTERLINEAR ANNOTATION TERMINATOR,
>> and the Egyptian hieroglyph format controls,
>> U+13430 EGYPTIAN HIEROGLYPH VERTICAL JOINER through
>> U+1343F EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE.
>> These characters should have a visible glyph display for fallback rendering,
>> because if they are not displayed,
>> it is too easy to misread the resulting displayed text.
>> See “Annotation Characters” in [*Section 23.8, Specials*](https://www.unicode.org/versions/Unicode15.1.0/ch23.pdf#M9.21335.Heading.133.Specials),
>> as well as [*Section 11.4, Egyptian Hieroglyphs*](https://www.unicode.org/versions/Unicode15.1.0/ch11.pdf#M9.73291.Heading.1418.Egyptian.Hieroglyphs)
>> for more discussion of the use and display of these characters.
>
> glibc currently correctly assigns non-zero width to the prepended concatenation marks,
> but it incorrectly gives zero width to the interlinear annotation characters
> (which a generic terminal cannot interpret)
> and the Egyptian hieroglyph format controls
> (which are not widely supported in rendering implementations at present).
> This commit fixes both these issues as well.
>
> = Derive Hangul syllable type from Unicode data =
>
> Previosuly, the jungseong and jongseong jamo ranges
> were hard-coded into the script. With this commit, they are instead parsed
> from the HangulSyllableType.txt data file published by Unicode.
> This does not affect the end result.
>
> Signed-off-by: Jules Bertholet <julesbertholet@quoi.xyz>
> ---
> SHARED-FILES | 2 +-
> localedata/charmaps/UTF-8 | 34 +-
> localedata/unicode-gen/HangulSyllableType.txt | 858 ++++++++
> localedata/unicode-gen/Makefile | 9 +-
> localedata/unicode-gen/PropList.txt | 1827 -----------------
> localedata/unicode-gen/utf8_gen.py | 145 +-
> 6 files changed, 978 insertions(+), 1897 deletions(-)
> create mode 100644 localedata/unicode-gen/HangulSyllableType.txt
> delete mode 100644 localedata/unicode-gen/PropList.txt
>
> diff --git a/SHARED-FILES b/SHARED-FILES
> index 2ff44427bc..2edb80341b 100644
> --- a/SHARED-FILES
> +++ b/SHARED-FILES
> @@ -172,7 +172,7 @@ unicode:
> localedata/unicode-gen/unicode-license.txt
> localedata/unicode-gen/DerivedCoreProperties.txt
> localedata/unicode-gen/EastAsianWidth.txt
> - localedata/unicode-gen/PropList.txt
> + localedata/unicode-gen/HangulSyllableType.txt
>
> # The following files are shared with the upstream tzcode project and must be
> # updated regularly to stay in sync with the upstream releases.
> diff --git a/localedata/charmaps/UTF-8 b/localedata/charmaps/UTF-8
> index c9e89f03da..bc694ec4c8 100644
> --- a/localedata/charmaps/UTF-8
> +++ b/localedata/charmaps/UTF-8
> @@ -49856,13 +49856,27 @@ CHARMAP
> END CHARMAP
>
> % Character width according to Unicode 15.1.0.
> -% - Default width is 1.
> +% Width is determined by the following rules, in order of decreasing precedence:
> +% - U+00AD SOFT HYPHEN has width 1, as a special case for compatibility (https://archive.is/b5Ck).
> +% - U+115F HANGUL CHOSEONG FILLER has width 2.
> +% This character stands in for an intentionally omitted leading consonant
> +% in a Hangul syllable block; as such it must be assigned width 2 despite its lack
> +% of visible display to ensure that the complete block has the correct width.
> +% (See below for more information on Hangul syllables.)
> +% - Combining jungseong and jongseong Hangul jamo have width 0; generated from
> +% "grep '^[^;]*;[VT]' HangulSyllableType.txt".
> +% One composed Hangul "syllable block" like 퓛 is made up of
> +% two to three individual component characters called "jamo".
> +% The complete block must have total width 2;
> +% to achieve this, we assign a width of 2 to leading "choseong" jamo,
> +% and of 0 to medial vowel "jungseong" and trailing "jongseong" jamo.
> +% - Non-spacing and enclosing marks have width 0; generated from
> +% "grep -E '^[^;]*;[^;]*;(Mn|Me);' UnicodeData.txt".
> +% - "Default_Ignorable_Code_Point"s have width 0; generated from
> +% "grep '^[^;]*;\s*Default_Ignorable_Code_Point' DerivedCoreProperties.txt".
> % - Double-width characters have width 2; generated from
> -% "grep '^[^;]*;\s*[WF]' EastAsianWidth.txt"
> -% - Non-spacing characters have width 0; generated from PropList.txt or
> -% "grep '^[^;]*;[^;]*;[^;]*;[^;]*;NSM;' UnicodeData.txt"
> -% - Format control characters have width 0; generated from
> -% "grep '^[^;]*;[^;]*;Cf;' UnicodeData.txt"
> +% "grep '^[^;]*;[WF]' EastAsianWidth.txt".
> +% - Default width for all other characters is 1.
> WIDTH
> <U0300>...<U036F> 0
> <U0483>...<U0489> 0
> @@ -50083,7 +50097,9 @@ WIDTH
> <U3099>...<U309A> 0
> <U309B>...<U30FF> 2
> <U3105>...<U312F> 2
> -<U3131>...<U318E> 2
> +<U3131>...<U3163> 2
> +<U3164> 0
> +<U3165>...<U318E> 2
> <U3190>...<U31E3> 2
> <U31EF>...<U321E> 2
> <U3220>...<UA48C> 2
> @@ -50138,8 +50154,8 @@ WIDTH
> <UFE68>...<UFE6B> 2
> <UFEFF> 0
> <UFF01>...<UFF60> 2
> +<UFFA0> 0
> <UFFE0>...<UFFE6> 2
> -<UFFF9>...<UFFFB> 0
> <U000101FD> 0
> <U000102E0> 0
> <U00010376>...<U0001037A> 0
> @@ -50240,7 +50256,7 @@ WIDTH
> <U00011F36>...<U00011F3A> 0
> <U00011F40> 0
> <U00011F42> 0
> -<U00013430>...<U00013440> 0
> +<U00013440> 0
> <U00013447>...<U00013455> 0
> <U00016AF0>...<U00016AF4> 0
> <U00016B30>...<U00016B36> 0
> diff --git a/localedata/unicode-gen/HangulSyllableType.txt b/localedata/unicode-gen/HangulSyllableType.txt
> new file mode 100644
> index 0000000000..98dafb10f1
> --- /dev/null
> +++ b/localedata/unicode-gen/HangulSyllableType.txt
> @@ -0,0 +1,858 @@
> +# HangulSyllableType-15.1.0.txt
> +# Date: 2023-01-05, 20:34:42 GMT
> +# © 2023 Unicode®, Inc.
> +# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> +# For terms of use, see https://www.unicode.org/terms_of_use.html
> +#
> +# Unicode Character Database
> +# For documentation, see https://www.unicode.org/reports/tr44/
> +
> +# ================================================
> +
> +# Property: Hangul_Syllable_Type
> +
> +# All code points not explicitly listed for Hangul_Syllable_Type
> +# have the value Not_Applicable (NA).
> +
> +# @missing: 0000..10FFFF; Not_Applicable
> +
> +# ================================================
> +
> +# Hangul_Syllable_Type=Leading_Jamo
> +
> +1100..115F ; L # Lo [96] HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG FILLER
> +A960..A97C ; L # Lo [29] HANGUL CHOSEONG TIKEUT-MIEUM..HANGUL CHOSEONG SSANGYEORINHIEUH
> +
> +# Total code points: 125
> +
> +# ================================================
> +
> +# Hangul_Syllable_Type=Vowel_Jamo
> +
> +1160..11A7 ; V # Lo [72] HANGUL JUNGSEONG FILLER..HANGUL JUNGSEONG O-YAE
> +D7B0..D7C6 ; V # Lo [23] HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARAEA-E
> +
> +# Total code points: 95
> +
> +# ================================================
> +
> +# Hangul_Syllable_Type=Trailing_Jamo
> +
> +11A8..11FF ; T # Lo [88] HANGUL JONGSEONG KIYEOK..HANGUL JONGSEONG SSANGNIEUN
> +D7CB..D7FB ; T # Lo [49] HANGUL JONGSEONG NIEUN-RIEUL..HANGUL JONGSEONG PHIEUPH-THIEUTH
> +
> +# Total code points: 137
> +
> +# ================================================
> +
> +# Hangul_Syllable_Type=LV_Syllable
> +
> +AC00 ; LV # Lo HANGUL SYLLABLE GA
> +AC1C ; LV # Lo HANGUL SYLLABLE GAE
> +AC38 ; LV # Lo HANGUL SYLLABLE GYA
> +AC54 ; LV # Lo HANGUL SYLLABLE GYAE
> +AC70 ; LV # Lo HANGUL SYLLABLE GEO
> +AC8C ; LV # Lo HANGUL SYLLABLE GE
> +ACA8 ; LV # Lo HANGUL SYLLABLE GYEO
> +ACC4 ; LV # Lo HANGUL SYLLABLE GYE
> +ACE0 ; LV # Lo HANGUL SYLLABLE GO
> +ACFC ; LV # Lo HANGUL SYLLABLE GWA
> +AD18 ; LV # Lo HANGUL SYLLABLE GWAE
> +AD34 ; LV # Lo HANGUL SYLLABLE GOE
> +AD50 ; LV # Lo HANGUL SYLLABLE GYO
> +AD6C ; LV # Lo HANGUL SYLLABLE GU
> +AD88 ; LV # Lo HANGUL SYLLABLE GWEO
> +ADA4 ; LV # Lo HANGUL SYLLABLE GWE
> +ADC0 ; LV # Lo HANGUL SYLLABLE GWI
> +ADDC ; LV # Lo HANGUL SYLLABLE GYU
> +ADF8 ; LV # Lo HANGUL SYLLABLE GEU
> +AE14 ; LV # Lo HANGUL SYLLABLE GYI
> +AE30 ; LV # Lo HANGUL SYLLABLE GI
> +AE4C ; LV # Lo HANGUL SYLLABLE GGA
> +AE68 ; LV # Lo HANGUL SYLLABLE GGAE
> +AE84 ; LV # Lo HANGUL SYLLABLE GGYA
> +AEA0 ; LV # Lo HANGUL SYLLABLE GGYAE
> +AEBC ; LV # Lo HANGUL SYLLABLE GGEO
> +AED8 ; LV # Lo HANGUL SYLLABLE GGE
> +AEF4 ; LV # Lo HANGUL SYLLABLE GGYEO
> +AF10 ; LV # Lo HANGUL SYLLABLE GGYE
> +AF2C ; LV # Lo HANGUL SYLLABLE GGO
> +AF48 ; LV # Lo HANGUL SYLLABLE GGWA
> +AF64 ; LV # Lo HANGUL SYLLABLE GGWAE
> +AF80 ; LV # Lo HANGUL SYLLABLE GGOE
> +AF9C ; LV # Lo HANGUL SYLLABLE GGYO
> +AFB8 ; LV # Lo HANGUL SYLLABLE GGU
> +AFD4 ; LV # Lo HANGUL SYLLABLE GGWEO
> +AFF0 ; LV # Lo HANGUL SYLLABLE GGWE
> +B00C ; LV # Lo HANGUL SYLLABLE GGWI
> +B028 ; LV # Lo HANGUL SYLLABLE GGYU
> +B044 ; LV # Lo HANGUL SYLLABLE GGEU
> +B060 ; LV # Lo HANGUL SYLLABLE GGYI
> +B07C ; LV # Lo HANGUL SYLLABLE GGI
> +B098 ; LV # Lo HANGUL SYLLABLE NA
> +B0B4 ; LV # Lo HANGUL SYLLABLE NAE
> +B0D0 ; LV # Lo HANGUL SYLLABLE NYA
> +B0EC ; LV # Lo HANGUL SYLLABLE NYAE
> +B108 ; LV # Lo HANGUL SYLLABLE NEO
> +B124 ; LV # Lo HANGUL SYLLABLE NE
> +B140 ; LV # Lo HANGUL SYLLABLE NYEO
> +B15C ; LV # Lo HANGUL SYLLABLE NYE
> +B178 ; LV # Lo HANGUL SYLLABLE NO
> +B194 ; LV # Lo HANGUL SYLLABLE NWA
> +B1B0 ; LV # Lo HANGUL SYLLABLE NWAE
> +B1CC ; LV # Lo HANGUL SYLLABLE NOE
> +B1E8 ; LV # Lo HANGUL SYLLABLE NYO
> +B204 ; LV # Lo HANGUL SYLLABLE NU
> +B220 ; LV # Lo HANGUL SYLLABLE NWEO
> +B23C ; LV # Lo HANGUL SYLLABLE NWE
> +B258 ; LV # Lo HANGUL SYLLABLE NWI
> +B274 ; LV # Lo HANGUL SYLLABLE NYU
> +B290 ; LV # Lo HANGUL SYLLABLE NEU
> +B2AC ; LV # Lo HANGUL SYLLABLE NYI
> +B2C8 ; LV # Lo HANGUL SYLLABLE NI
> +B2E4 ; LV # Lo HANGUL SYLLABLE DA
> +B300 ; LV # Lo HANGUL SYLLABLE DAE
> +B31C ; LV # Lo HANGUL SYLLABLE DYA
> +B338 ; LV # Lo HANGUL SYLLABLE DYAE
> +B354 ; LV # Lo HANGUL SYLLABLE DEO
> +B370 ; LV # Lo HANGUL SYLLABLE DE
> +B38C ; LV # Lo HANGUL SYLLABLE DYEO
> +B3A8 ; LV # Lo HANGUL SYLLABLE DYE
> +B3C4 ; LV # Lo HANGUL SYLLABLE DO
> +B3E0 ; LV # Lo HANGUL SYLLABLE DWA
> +B3FC ; LV # Lo HANGUL SYLLABLE DWAE
> +B418 ; LV # Lo HANGUL SYLLABLE DOE
> +B434 ; LV # Lo HANGUL SYLLABLE DYO
> +B450 ; LV # Lo HANGUL SYLLABLE DU
> +B46C ; LV # Lo HANGUL SYLLABLE DWEO
> +B488 ; LV # Lo HANGUL SYLLABLE DWE
> +B4A4 ; LV # Lo HANGUL SYLLABLE DWI
> +B4C0 ; LV # Lo HANGUL SYLLABLE DYU
> +B4DC ; LV # Lo HANGUL SYLLABLE DEU
> +B4F8 ; LV # Lo HANGUL SYLLABLE DYI
> +B514 ; LV # Lo HANGUL SYLLABLE DI
> +B530 ; LV # Lo HANGUL SYLLABLE DDA
> +B54C ; LV # Lo HANGUL SYLLABLE DDAE
> +B568 ; LV # Lo HANGUL SYLLABLE DDYA
> +B584 ; LV # Lo HANGUL SYLLABLE DDYAE
> +B5A0 ; LV # Lo HANGUL SYLLABLE DDEO
> +B5BC ; LV # Lo HANGUL SYLLABLE DDE
> +B5D8 ; LV # Lo HANGUL SYLLABLE DDYEO
> +B5F4 ; LV # Lo HANGUL SYLLABLE DDYE
> +B610 ; LV # Lo HANGUL SYLLABLE DDO
> +B62C ; LV # Lo HANGUL SYLLABLE DDWA
> +B648 ; LV # Lo HANGUL SYLLABLE DDWAE
> +B664 ; LV # Lo HANGUL SYLLABLE DDOE
> +B680 ; LV # Lo HANGUL SYLLABLE DDYO
> +B69C ; LV # Lo HANGUL SYLLABLE DDU
> +B6B8 ; LV # Lo HANGUL SYLLABLE DDWEO
> +B6D4 ; LV # Lo HANGUL SYLLABLE DDWE
> +B6F0 ; LV # Lo HANGUL SYLLABLE DDWI
> +B70C ; LV # Lo HANGUL SYLLABLE DDYU
> +B728 ; LV # Lo HANGUL SYLLABLE DDEU
> +B744 ; LV # Lo HANGUL SYLLABLE DDYI
> +B760 ; LV # Lo HANGUL SYLLABLE DDI
> +B77C ; LV # Lo HANGUL SYLLABLE RA
> +B798 ; LV # Lo HANGUL SYLLABLE RAE
> +B7B4 ; LV # Lo HANGUL SYLLABLE RYA
> +B7D0 ; LV # Lo HANGUL SYLLABLE RYAE
> +B7EC ; LV # Lo HANGUL SYLLABLE REO
> +B808 ; LV # Lo HANGUL SYLLABLE RE
> +B824 ; LV # Lo HANGUL SYLLABLE RYEO
> +B840 ; LV # Lo HANGUL SYLLABLE RYE
> +B85C ; LV # Lo HANGUL SYLLABLE RO
> +B878 ; LV # Lo HANGUL SYLLABLE RWA
> +B894 ; LV # Lo HANGUL SYLLABLE RWAE
> +B8B0 ; LV # Lo HANGUL SYLLABLE ROE
> +B8CC ; LV # Lo HANGUL SYLLABLE RYO
> +B8E8 ; LV # Lo HANGUL SYLLABLE RU
> +B904 ; LV # Lo HANGUL SYLLABLE RWEO
> +B920 ; LV # Lo HANGUL SYLLABLE RWE
> +B93C ; LV # Lo HANGUL SYLLABLE RWI
> +B958 ; LV # Lo HANGUL SYLLABLE RYU
> +B974 ; LV # Lo HANGUL SYLLABLE REU
> +B990 ; LV # Lo HANGUL SYLLABLE RYI
> +B9AC ; LV # Lo HANGUL SYLLABLE RI
> +B9C8 ; LV # Lo HANGUL SYLLABLE MA
> +B9E4 ; LV # Lo HANGUL SYLLABLE MAE
> +BA00 ; LV # Lo HANGUL SYLLABLE MYA
> +BA1C ; LV # Lo HANGUL SYLLABLE MYAE
> +BA38 ; LV # Lo HANGUL SYLLABLE MEO
> +BA54 ; LV # Lo HANGUL SYLLABLE ME
> +BA70 ; LV # Lo HANGUL SYLLABLE MYEO
> +BA8C ; LV # Lo HANGUL SYLLABLE MYE
> +BAA8 ; LV # Lo HANGUL SYLLABLE MO
> +BAC4 ; LV # Lo HANGUL SYLLABLE MWA
> +BAE0 ; LV # Lo HANGUL SYLLABLE MWAE
> +BAFC ; LV # Lo HANGUL SYLLABLE MOE
> +BB18 ; LV # Lo HANGUL SYLLABLE MYO
> +BB34 ; LV # Lo HANGUL SYLLABLE MU
> +BB50 ; LV # Lo HANGUL SYLLABLE MWEO
> +BB6C ; LV # Lo HANGUL SYLLABLE MWE
> +BB88 ; LV # Lo HANGUL SYLLABLE MWI
> +BBA4 ; LV # Lo HANGUL SYLLABLE MYU
> +BBC0 ; LV # Lo HANGUL SYLLABLE MEU
> +BBDC ; LV # Lo HANGUL SYLLABLE MYI
> +BBF8 ; LV # Lo HANGUL SYLLABLE MI
> +BC14 ; LV # Lo HANGUL SYLLABLE BA
> +BC30 ; LV # Lo HANGUL SYLLABLE BAE
> +BC4C ; LV # Lo HANGUL SYLLABLE BYA
> +BC68 ; LV # Lo HANGUL SYLLABLE BYAE
> +BC84 ; LV # Lo HANGUL SYLLABLE BEO
> +BCA0 ; LV # Lo HANGUL SYLLABLE BE
> +BCBC ; LV # Lo HANGUL SYLLABLE BYEO
> +BCD8 ; LV # Lo HANGUL SYLLABLE BYE
> +BCF4 ; LV # Lo HANGUL SYLLABLE BO
> +BD10 ; LV # Lo HANGUL SYLLABLE BWA
> +BD2C ; LV # Lo HANGUL SYLLABLE BWAE
> +BD48 ; LV # Lo HANGUL SYLLABLE BOE
> +BD64 ; LV # Lo HANGUL SYLLABLE BYO
> +BD80 ; LV # Lo HANGUL SYLLABLE BU
> +BD9C ; LV # Lo HANGUL SYLLABLE BWEO
> +BDB8 ; LV # Lo HANGUL SYLLABLE BWE
> +BDD4 ; LV # Lo HANGUL SYLLABLE BWI
> +BDF0 ; LV # Lo HANGUL SYLLABLE BYU
> +BE0C ; LV # Lo HANGUL SYLLABLE BEU
> +BE28 ; LV # Lo HANGUL SYLLABLE BYI
> +BE44 ; LV # Lo HANGUL SYLLABLE BI
> +BE60 ; LV # Lo HANGUL SYLLABLE BBA
> +BE7C ; LV # Lo HANGUL SYLLABLE BBAE
> +BE98 ; LV # Lo HANGUL SYLLABLE BBYA
> +BEB4 ; LV # Lo HANGUL SYLLABLE BBYAE
> +BED0 ; LV # Lo HANGUL SYLLABLE BBEO
> +BEEC ; LV # Lo HANGUL SYLLABLE BBE
> +BF08 ; LV # Lo HANGUL SYLLABLE BBYEO
> +BF24 ; LV # Lo HANGUL SYLLABLE BBYE
> +BF40 ; LV # Lo HANGUL SYLLABLE BBO
> +BF5C ; LV # Lo HANGUL SYLLABLE BBWA
> +BF78 ; LV # Lo HANGUL SYLLABLE BBWAE
> +BF94 ; LV # Lo HANGUL SYLLABLE BBOE
> +BFB0 ; LV # Lo HANGUL SYLLABLE BBYO
> +BFCC ; LV # Lo HANGUL SYLLABLE BBU
> +BFE8 ; LV # Lo HANGUL SYLLABLE BBWEO
> +C004 ; LV # Lo HANGUL SYLLABLE BBWE
> +C020 ; LV # Lo HANGUL SYLLABLE BBWI
> +C03C ; LV # Lo HANGUL SYLLABLE BBYU
> +C058 ; LV # Lo HANGUL SYLLABLE BBEU
> +C074 ; LV # Lo HANGUL SYLLABLE BBYI
> +C090 ; LV # Lo HANGUL SYLLABLE BBI
> +C0AC ; LV # Lo HANGUL SYLLABLE SA
> +C0C8 ; LV # Lo HANGUL SYLLABLE SAE
> +C0E4 ; LV # Lo HANGUL SYLLABLE SYA
> +C100 ; LV # Lo HANGUL SYLLABLE SYAE
> +C11C ; LV # Lo HANGUL SYLLABLE SEO
> +C138 ; LV # Lo HANGUL SYLLABLE SE
> +C154 ; LV # Lo HANGUL SYLLABLE SYEO
> +C170 ; LV # Lo HANGUL SYLLABLE SYE
> +C18C ; LV # Lo HANGUL SYLLABLE SO
> +C1A8 ; LV # Lo HANGUL SYLLABLE SWA
> +C1C4 ; LV # Lo HANGUL SYLLABLE SWAE
> +C1E0 ; LV # Lo HANGUL SYLLABLE SOE
> +C1FC ; LV # Lo HANGUL SYLLABLE SYO
> +C218 ; LV # Lo HANGUL SYLLABLE SU
> +C234 ; LV # Lo HANGUL SYLLABLE SWEO
> +C250 ; LV # Lo HANGUL SYLLABLE SWE
> +C26C ; LV # Lo HANGUL SYLLABLE SWI
> +C288 ; LV # Lo HANGUL SYLLABLE SYU
> +C2A4 ; LV # Lo HANGUL SYLLABLE SEU
> +C2C0 ; LV # Lo HANGUL SYLLABLE SYI
> +C2DC ; LV # Lo HANGUL SYLLABLE SI
> +C2F8 ; LV # Lo HANGUL SYLLABLE SSA
> +C314 ; LV # Lo HANGUL SYLLABLE SSAE
> +C330 ; LV # Lo HANGUL SYLLABLE SSYA
> +C34C ; LV # Lo HANGUL SYLLABLE SSYAE
> +C368 ; LV # Lo HANGUL SYLLABLE SSEO
> +C384 ; LV # Lo HANGUL SYLLABLE SSE
> +C3A0 ; LV # Lo HANGUL SYLLABLE SSYEO
> +C3BC ; LV # Lo HANGUL SYLLABLE SSYE
> +C3D8 ; LV # Lo HANGUL SYLLABLE SSO
> +C3F4 ; LV # Lo HANGUL SYLLABLE SSWA
> +C410 ; LV # Lo HANGUL SYLLABLE SSWAE
> +C42C ; LV # Lo HANGUL SYLLABLE SSOE
> +C448 ; LV # Lo HANGUL SYLLABLE SSYO
> +C464 ; LV # Lo HANGUL SYLLABLE SSU
> +C480 ; LV # Lo HANGUL SYLLABLE SSWEO
> +C49C ; LV # Lo HANGUL SYLLABLE SSWE
> +C4B8 ; LV # Lo HANGUL SYLLABLE SSWI
> +C4D4 ; LV # Lo HANGUL SYLLABLE SSYU
> +C4F0 ; LV # Lo HANGUL SYLLABLE SSEU
> +C50C ; LV # Lo HANGUL SYLLABLE SSYI
> +C528 ; LV # Lo HANGUL SYLLABLE SSI
> +C544 ; LV # Lo HANGUL SYLLABLE A
> +C560 ; LV # Lo HANGUL SYLLABLE AE
> +C57C ; LV # Lo HANGUL SYLLABLE YA
> +C598 ; LV # Lo HANGUL SYLLABLE YAE
> +C5B4 ; LV # Lo HANGUL SYLLABLE EO
> +C5D0 ; LV # Lo HANGUL SYLLABLE E
> +C5EC ; LV # Lo HANGUL SYLLABLE YEO
> +C608 ; LV # Lo HANGUL SYLLABLE YE
> +C624 ; LV # Lo HANGUL SYLLABLE O
> +C640 ; LV # Lo HANGUL SYLLABLE WA
> +C65C ; LV # Lo HANGUL SYLLABLE WAE
> +C678 ; LV # Lo HANGUL SYLLABLE OE
> +C694 ; LV # Lo HANGUL SYLLABLE YO
> +C6B0 ; LV # Lo HANGUL SYLLABLE U
> +C6CC ; LV # Lo HANGUL SYLLABLE WEO
> +C6E8 ; LV # Lo HANGUL SYLLABLE WE
> +C704 ; LV # Lo HANGUL SYLLABLE WI
> +C720 ; LV # Lo HANGUL SYLLABLE YU
> +C73C ; LV # Lo HANGUL SYLLABLE EU
> +C758 ; LV # Lo HANGUL SYLLABLE YI
> +C774 ; LV # Lo HANGUL SYLLABLE I
> +C790 ; LV # Lo HANGUL SYLLABLE JA
> +C7AC ; LV # Lo HANGUL SYLLABLE JAE
> +C7C8 ; LV # Lo HANGUL SYLLABLE JYA
> +C7E4 ; LV # Lo HANGUL SYLLABLE JYAE
> +C800 ; LV # Lo HANGUL SYLLABLE JEO
> +C81C ; LV # Lo HANGUL SYLLABLE JE
> +C838 ; LV # Lo HANGUL SYLLABLE JYEO
> +C854 ; LV # Lo HANGUL SYLLABLE JYE
> +C870 ; LV # Lo HANGUL SYLLABLE JO
> +C88C ; LV # Lo HANGUL SYLLABLE JWA
> +C8A8 ; LV # Lo HANGUL SYLLABLE JWAE
> +C8C4 ; LV # Lo HANGUL SYLLABLE JOE
> +C8E0 ; LV # Lo HANGUL SYLLABLE JYO
> +C8FC ; LV # Lo HANGUL SYLLABLE JU
> +C918 ; LV # Lo HANGUL SYLLABLE JWEO
> +C934 ; LV # Lo HANGUL SYLLABLE JWE
> +C950 ; LV # Lo HANGUL SYLLABLE JWI
> +C96C ; LV # Lo HANGUL SYLLABLE JYU
> +C988 ; LV # Lo HANGUL SYLLABLE JEU
> +C9A4 ; LV # Lo HANGUL SYLLABLE JYI
> +C9C0 ; LV # Lo HANGUL SYLLABLE JI
> +C9DC ; LV # Lo HANGUL SYLLABLE JJA
> +C9F8 ; LV # Lo HANGUL SYLLABLE JJAE
> +CA14 ; LV # Lo HANGUL SYLLABLE JJYA
> +CA30 ; LV # Lo HANGUL SYLLABLE JJYAE
> +CA4C ; LV # Lo HANGUL SYLLABLE JJEO
> +CA68 ; LV # Lo HANGUL SYLLABLE JJE
> +CA84 ; LV # Lo HANGUL SYLLABLE JJYEO
> +CAA0 ; LV # Lo HANGUL SYLLABLE JJYE
> +CABC ; LV # Lo HANGUL SYLLABLE JJO
> +CAD8 ; LV # Lo HANGUL SYLLABLE JJWA
> +CAF4 ; LV # Lo HANGUL SYLLABLE JJWAE
> +CB10 ; LV # Lo HANGUL SYLLABLE JJOE
> +CB2C ; LV # Lo HANGUL SYLLABLE JJYO
> +CB48 ; LV # Lo HANGUL SYLLABLE JJU
> +CB64 ; LV # Lo HANGUL SYLLABLE JJWEO
> +CB80 ; LV # Lo HANGUL SYLLABLE JJWE
> +CB9C ; LV # Lo HANGUL SYLLABLE JJWI
> +CBB8 ; LV # Lo HANGUL SYLLABLE JJYU
> +CBD4 ; LV # Lo HANGUL SYLLABLE JJEU
> +CBF0 ; LV # Lo HANGUL SYLLABLE JJYI
> +CC0C ; LV # Lo HANGUL SYLLABLE JJI
> +CC28 ; LV # Lo HANGUL SYLLABLE CA
> +CC44 ; LV # Lo HANGUL SYLLABLE CAE
> +CC60 ; LV # Lo HANGUL SYLLABLE CYA
> +CC7C ; LV # Lo HANGUL SYLLABLE CYAE
> +CC98 ; LV # Lo HANGUL SYLLABLE CEO
> +CCB4 ; LV # Lo HANGUL SYLLABLE CE
> +CCD0 ; LV # Lo HANGUL SYLLABLE CYEO
> +CCEC ; LV # Lo HANGUL SYLLABLE CYE
> +CD08 ; LV # Lo HANGUL SYLLABLE CO
> +CD24 ; LV # Lo HANGUL SYLLABLE CWA
> +CD40 ; LV # Lo HANGUL SYLLABLE CWAE
> +CD5C ; LV # Lo HANGUL SYLLABLE COE
> +CD78 ; LV # Lo HANGUL SYLLABLE CYO
> +CD94 ; LV # Lo HANGUL SYLLABLE CU
> +CDB0 ; LV # Lo HANGUL SYLLABLE CWEO
> +CDCC ; LV # Lo HANGUL SYLLABLE CWE
> +CDE8 ; LV # Lo HANGUL SYLLABLE CWI
> +CE04 ; LV # Lo HANGUL SYLLABLE CYU
> +CE20 ; LV # Lo HANGUL SYLLABLE CEU
> +CE3C ; LV # Lo HANGUL SYLLABLE CYI
> +CE58 ; LV # Lo HANGUL SYLLABLE CI
> +CE74 ; LV # Lo HANGUL SYLLABLE KA
> +CE90 ; LV # Lo HANGUL SYLLABLE KAE
> +CEAC ; LV # Lo HANGUL SYLLABLE KYA
> +CEC8 ; LV # Lo HANGUL SYLLABLE KYAE
> +CEE4 ; LV # Lo HANGUL SYLLABLE KEO
> +CF00 ; LV # Lo HANGUL SYLLABLE KE
> +CF1C ; LV # Lo HANGUL SYLLABLE KYEO
> +CF38 ; LV # Lo HANGUL SYLLABLE KYE
> +CF54 ; LV # Lo HANGUL SYLLABLE KO
> +CF70 ; LV # Lo HANGUL SYLLABLE KWA
> +CF8C ; LV # Lo HANGUL SYLLABLE KWAE
> +CFA8 ; LV # Lo HANGUL SYLLABLE KOE
> +CFC4 ; LV # Lo HANGUL SYLLABLE KYO
> +CFE0 ; LV # Lo HANGUL SYLLABLE KU
> +CFFC ; LV # Lo HANGUL SYLLABLE KWEO
> +D018 ; LV # Lo HANGUL SYLLABLE KWE
> +D034 ; LV # Lo HANGUL SYLLABLE KWI
> +D050 ; LV # Lo HANGUL SYLLABLE KYU
> +D06C ; LV # Lo HANGUL SYLLABLE KEU
> +D088 ; LV # Lo HANGUL SYLLABLE KYI
> +D0A4 ; LV # Lo HANGUL SYLLABLE KI
> +D0C0 ; LV # Lo HANGUL SYLLABLE TA
> +D0DC ; LV # Lo HANGUL SYLLABLE TAE
> +D0F8 ; LV # Lo HANGUL SYLLABLE TYA
> +D114 ; LV # Lo HANGUL SYLLABLE TYAE
> +D130 ; LV # Lo HANGUL SYLLABLE TEO
> +D14C ; LV # Lo HANGUL SYLLABLE TE
> +D168 ; LV # Lo HANGUL SYLLABLE TYEO
> +D184 ; LV # Lo HANGUL SYLLABLE TYE
> +D1A0 ; LV # Lo HANGUL SYLLABLE TO
> +D1BC ; LV # Lo HANGUL SYLLABLE TWA
> +D1D8 ; LV # Lo HANGUL SYLLABLE TWAE
> +D1F4 ; LV # Lo HANGUL SYLLABLE TOE
> +D210 ; LV # Lo HANGUL SYLLABLE TYO
> +D22C ; LV # Lo HANGUL SYLLABLE TU
> +D248 ; LV # Lo HANGUL SYLLABLE TWEO
> +D264 ; LV # Lo HANGUL SYLLABLE TWE
> +D280 ; LV # Lo HANGUL SYLLABLE TWI
> +D29C ; LV # Lo HANGUL SYLLABLE TYU
> +D2B8 ; LV # Lo HANGUL SYLLABLE TEU
> +D2D4 ; LV # Lo HANGUL SYLLABLE TYI
> +D2F0 ; LV # Lo HANGUL SYLLABLE TI
> +D30C ; LV # Lo HANGUL SYLLABLE PA
> +D328 ; LV # Lo HANGUL SYLLABLE PAE
> +D344 ; LV # Lo HANGUL SYLLABLE PYA
> +D360 ; LV # Lo HANGUL SYLLABLE PYAE
> +D37C ; LV # Lo HANGUL SYLLABLE PEO
> +D398 ; LV # Lo HANGUL SYLLABLE PE
> +D3B4 ; LV # Lo HANGUL SYLLABLE PYEO
> +D3D0 ; LV # Lo HANGUL SYLLABLE PYE
> +D3EC ; LV # Lo HANGUL SYLLABLE PO
> +D408 ; LV # Lo HANGUL SYLLABLE PWA
> +D424 ; LV # Lo HANGUL SYLLABLE PWAE
> +D440 ; LV # Lo HANGUL SYLLABLE POE
> +D45C ; LV # Lo HANGUL SYLLABLE PYO
> +D478 ; LV # Lo HANGUL SYLLABLE PU
> +D494 ; LV # Lo HANGUL SYLLABLE PWEO
> +D4B0 ; LV # Lo HANGUL SYLLABLE PWE
> +D4CC ; LV # Lo HANGUL SYLLABLE PWI
> +D4E8 ; LV # Lo HANGUL SYLLABLE PYU
> +D504 ; LV # Lo HANGUL SYLLABLE PEU
> +D520 ; LV # Lo HANGUL SYLLABLE PYI
> +D53C ; LV # Lo HANGUL SYLLABLE PI
> +D558 ; LV # Lo HANGUL SYLLABLE HA
> +D574 ; LV # Lo HANGUL SYLLABLE HAE
> +D590 ; LV # Lo HANGUL SYLLABLE HYA
> +D5AC ; LV # Lo HANGUL SYLLABLE HYAE
> +D5C8 ; LV # Lo HANGUL SYLLABLE HEO
> +D5E4 ; LV # Lo HANGUL SYLLABLE HE
> +D600 ; LV # Lo HANGUL SYLLABLE HYEO
> +D61C ; LV # Lo HANGUL SYLLABLE HYE
> +D638 ; LV # Lo HANGUL SYLLABLE HO
> +D654 ; LV # Lo HANGUL SYLLABLE HWA
> +D670 ; LV # Lo HANGUL SYLLABLE HWAE
> +D68C ; LV # Lo HANGUL SYLLABLE HOE
> +D6A8 ; LV # Lo HANGUL SYLLABLE HYO
> +D6C4 ; LV # Lo HANGUL SYLLABLE HU
> +D6E0 ; LV # Lo HANGUL SYLLABLE HWEO
> +D6FC ; LV # Lo HANGUL SYLLABLE HWE
> +D718 ; LV # Lo HANGUL SYLLABLE HWI
> +D734 ; LV # Lo HANGUL SYLLABLE HYU
> +D750 ; LV # Lo HANGUL SYLLABLE HEU
> +D76C ; LV # Lo HANGUL SYLLABLE HYI
> +D788 ; LV # Lo HANGUL SYLLABLE HI
> +
> +# Total code points: 399
> +
> +# ================================================
> +
> +# Hangul_Syllable_Type=LVT_Syllable
> +
> +AC01..AC1B ; LVT # Lo [27] HANGUL SYLLABLE GAG..HANGUL SYLLABLE GAH
> +AC1D..AC37 ; LVT # Lo [27] HANGUL SYLLABLE GAEG..HANGUL SYLLABLE GAEH
> +AC39..AC53 ; LVT # Lo [27] HANGUL SYLLABLE GYAG..HANGUL SYLLABLE GYAH
> +AC55..AC6F ; LVT # Lo [27] HANGUL SYLLABLE GYAEG..HANGUL SYLLABLE GYAEH
> +AC71..AC8B ; LVT # Lo [27] HANGUL SYLLABLE GEOG..HANGUL SYLLABLE GEOH
> +AC8D..ACA7 ; LVT # Lo [27] HANGUL SYLLABLE GEG..HANGUL SYLLABLE GEH
> +ACA9..ACC3 ; LVT # Lo [27] HANGUL SYLLABLE GYEOG..HANGUL SYLLABLE GYEOH
> +ACC5..ACDF ; LVT # Lo [27] HANGUL SYLLABLE GYEG..HANGUL SYLLABLE GYEH
> +ACE1..ACFB ; LVT # Lo [27] HANGUL SYLLABLE GOG..HANGUL SYLLABLE GOH
> +ACFD..AD17 ; LVT # Lo [27] HANGUL SYLLABLE GWAG..HANGUL SYLLABLE GWAH
> +AD19..AD33 ; LVT # Lo [27] HANGUL SYLLABLE GWAEG..HANGUL SYLLABLE GWAEH
> +AD35..AD4F ; LVT # Lo [27] HANGUL SYLLABLE GOEG..HANGUL SYLLABLE GOEH
> +AD51..AD6B ; LVT # Lo [27] HANGUL SYLLABLE GYOG..HANGUL SYLLABLE GYOH
> +AD6D..AD87 ; LVT # Lo [27] HANGUL SYLLABLE GUG..HANGUL SYLLABLE GUH
> +AD89..ADA3 ; LVT # Lo [27] HANGUL SYLLABLE GWEOG..HANGUL SYLLABLE GWEOH
> +ADA5..ADBF ; LVT # Lo [27] HANGUL SYLLABLE GWEG..HANGUL SYLLABLE GWEH
> +ADC1..ADDB ; LVT # Lo [27] HANGUL SYLLABLE GWIG..HANGUL SYLLABLE GWIH
> +ADDD..ADF7 ; LVT # Lo [27] HANGUL SYLLABLE GYUG..HANGUL SYLLABLE GYUH
> +ADF9..AE13 ; LVT # Lo [27] HANGUL SYLLABLE GEUG..HANGUL SYLLABLE GEUH
> +AE15..AE2F ; LVT # Lo [27] HANGUL SYLLABLE GYIG..HANGUL SYLLABLE GYIH
> +AE31..AE4B ; LVT # Lo [27] HANGUL SYLLABLE GIG..HANGUL SYLLABLE GIH
> +AE4D..AE67 ; LVT # Lo [27] HANGUL SYLLABLE GGAG..HANGUL SYLLABLE GGAH
> +AE69..AE83 ; LVT # Lo [27] HANGUL SYLLABLE GGAEG..HANGUL SYLLABLE GGAEH
> +AE85..AE9F ; LVT # Lo [27] HANGUL SYLLABLE GGYAG..HANGUL SYLLABLE GGYAH
> +AEA1..AEBB ; LVT # Lo [27] HANGUL SYLLABLE GGYAEG..HANGUL SYLLABLE GGYAEH
> +AEBD..AED7 ; LVT # Lo [27] HANGUL SYLLABLE GGEOG..HANGUL SYLLABLE GGEOH
> +AED9..AEF3 ; LVT # Lo [27] HANGUL SYLLABLE GGEG..HANGUL SYLLABLE GGEH
> +AEF5..AF0F ; LVT # Lo [27] HANGUL SYLLABLE GGYEOG..HANGUL SYLLABLE GGYEOH
> +AF11..AF2B ; LVT # Lo [27] HANGUL SYLLABLE GGYEG..HANGUL SYLLABLE GGYEH
> +AF2D..AF47 ; LVT # Lo [27] HANGUL SYLLABLE GGOG..HANGUL SYLLABLE GGOH
> +AF49..AF63 ; LVT # Lo [27] HANGUL SYLLABLE GGWAG..HANGUL SYLLABLE GGWAH
> +AF65..AF7F ; LVT # Lo [27] HANGUL SYLLABLE GGWAEG..HANGUL SYLLABLE GGWAEH
> +AF81..AF9B ; LVT # Lo [27] HANGUL SYLLABLE GGOEG..HANGUL SYLLABLE GGOEH
> +AF9D..AFB7 ; LVT # Lo [27] HANGUL SYLLABLE GGYOG..HANGUL SYLLABLE GGYOH
> +AFB9..AFD3 ; LVT # Lo [27] HANGUL SYLLABLE GGUG..HANGUL SYLLABLE GGUH
> +AFD5..AFEF ; LVT # Lo [27] HANGUL SYLLABLE GGWEOG..HANGUL SYLLABLE GGWEOH
> +AFF1..B00B ; LVT # Lo [27] HANGUL SYLLABLE GGWEG..HANGUL SYLLABLE GGWEH
> +B00D..B027 ; LVT # Lo [27] HANGUL SYLLABLE GGWIG..HANGUL SYLLABLE GGWIH
> +B029..B043 ; LVT # Lo [27] HANGUL SYLLABLE GGYUG..HANGUL SYLLABLE GGYUH
> +B045..B05F ; LVT # Lo [27] HANGUL SYLLABLE GGEUG..HANGUL SYLLABLE GGEUH
> +B061..B07B ; LVT # Lo [27] HANGUL SYLLABLE GGYIG..HANGUL SYLLABLE GGYIH
> +B07D..B097 ; LVT # Lo [27] HANGUL SYLLABLE GGIG..HANGUL SYLLABLE GGIH
> +B099..B0B3 ; LVT # Lo [27] HANGUL SYLLABLE NAG..HANGUL SYLLABLE NAH
> +B0B5..B0CF ; LVT # Lo [27] HANGUL SYLLABLE NAEG..HANGUL SYLLABLE NAEH
> +B0D1..B0EB ; LVT # Lo [27] HANGUL SYLLABLE NYAG..HANGUL SYLLABLE NYAH
> +B0ED..B107 ; LVT # Lo [27] HANGUL SYLLABLE NYAEG..HANGUL SYLLABLE NYAEH
> +B109..B123 ; LVT # Lo [27] HANGUL SYLLABLE NEOG..HANGUL SYLLABLE NEOH
> +B125..B13F ; LVT # Lo [27] HANGUL SYLLABLE NEG..HANGUL SYLLABLE NEH
> +B141..B15B ; LVT # Lo [27] HANGUL SYLLABLE NYEOG..HANGUL SYLLABLE NYEOH
> +B15D..B177 ; LVT # Lo [27] HANGUL SYLLABLE NYEG..HANGUL SYLLABLE NYEH
> +B179..B193 ; LVT # Lo [27] HANGUL SYLLABLE NOG..HANGUL SYLLABLE NOH
> +B195..B1AF ; LVT # Lo [27] HANGUL SYLLABLE NWAG..HANGUL SYLLABLE NWAH
> +B1B1..B1CB ; LVT # Lo [27] HANGUL SYLLABLE NWAEG..HANGUL SYLLABLE NWAEH
> +B1CD..B1E7 ; LVT # Lo [27] HANGUL SYLLABLE NOEG..HANGUL SYLLABLE NOEH
> +B1E9..B203 ; LVT # Lo [27] HANGUL SYLLABLE NYOG..HANGUL SYLLABLE NYOH
> +B205..B21F ; LVT # Lo [27] HANGUL SYLLABLE NUG..HANGUL SYLLABLE NUH
> +B221..B23B ; LVT # Lo [27] HANGUL SYLLABLE NWEOG..HANGUL SYLLABLE NWEOH
> +B23D..B257 ; LVT # Lo [27] HANGUL SYLLABLE NWEG..HANGUL SYLLABLE NWEH
> +B259..B273 ; LVT # Lo [27] HANGUL SYLLABLE NWIG..HANGUL SYLLABLE NWIH
> +B275..B28F ; LVT # Lo [27] HANGUL SYLLABLE NYUG..HANGUL SYLLABLE NYUH
> +B291..B2AB ; LVT # Lo [27] HANGUL SYLLABLE NEUG..HANGUL SYLLABLE NEUH
> +B2AD..B2C7 ; LVT # Lo [27] HANGUL SYLLABLE NYIG..HANGUL SYLLABLE NYIH
> +B2C9..B2E3 ; LVT # Lo [27] HANGUL SYLLABLE NIG..HANGUL SYLLABLE NIH
> +B2E5..B2FF ; LVT # Lo [27] HANGUL SYLLABLE DAG..HANGUL SYLLABLE DAH
> +B301..B31B ; LVT # Lo [27] HANGUL SYLLABLE DAEG..HANGUL SYLLABLE DAEH
> +B31D..B337 ; LVT # Lo [27] HANGUL SYLLABLE DYAG..HANGUL SYLLABLE DYAH
> +B339..B353 ; LVT # Lo [27] HANGUL SYLLABLE DYAEG..HANGUL SYLLABLE DYAEH
> +B355..B36F ; LVT # Lo [27] HANGUL SYLLABLE DEOG..HANGUL SYLLABLE DEOH
> +B371..B38B ; LVT # Lo [27] HANGUL SYLLABLE DEG..HANGUL SYLLABLE DEH
> +B38D..B3A7 ; LVT # Lo [27] HANGUL SYLLABLE DYEOG..HANGUL SYLLABLE DYEOH
> +B3A9..B3C3 ; LVT # Lo [27] HANGUL SYLLABLE DYEG..HANGUL SYLLABLE DYEH
> +B3C5..B3DF ; LVT # Lo [27] HANGUL SYLLABLE DOG..HANGUL SYLLABLE DOH
> +B3E1..B3FB ; LVT # Lo [27] HANGUL SYLLABLE DWAG..HANGUL SYLLABLE DWAH
> +B3FD..B417 ; LVT # Lo [27] HANGUL SYLLABLE DWAEG..HANGUL SYLLABLE DWAEH
> +B419..B433 ; LVT # Lo [27] HANGUL SYLLABLE DOEG..HANGUL SYLLABLE DOEH
> +B435..B44F ; LVT # Lo [27] HANGUL SYLLABLE DYOG..HANGUL SYLLABLE DYOH
> +B451..B46B ; LVT # Lo [27] HANGUL SYLLABLE DUG..HANGUL SYLLABLE DUH
> +B46D..B487 ; LVT # Lo [27] HANGUL SYLLABLE DWEOG..HANGUL SYLLABLE DWEOH
> +B489..B4A3 ; LVT # Lo [27] HANGUL SYLLABLE DWEG..HANGUL SYLLABLE DWEH
> +B4A5..B4BF ; LVT # Lo [27] HANGUL SYLLABLE DWIG..HANGUL SYLLABLE DWIH
> +B4C1..B4DB ; LVT # Lo [27] HANGUL SYLLABLE DYUG..HANGUL SYLLABLE DYUH
> +B4DD..B4F7 ; LVT # Lo [27] HANGUL SYLLABLE DEUG..HANGUL SYLLABLE DEUH
> +B4F9..B513 ; LVT # Lo [27] HANGUL SYLLABLE DYIG..HANGUL SYLLABLE DYIH
> +B515..B52F ; LVT # Lo [27] HANGUL SYLLABLE DIG..HANGUL SYLLABLE DIH
> +B531..B54B ; LVT # Lo [27] HANGUL SYLLABLE DDAG..HANGUL SYLLABLE DDAH
> +B54D..B567 ; LVT # Lo [27] HANGUL SYLLABLE DDAEG..HANGUL SYLLABLE DDAEH
> +B569..B583 ; LVT # Lo [27] HANGUL SYLLABLE DDYAG..HANGUL SYLLABLE DDYAH
> +B585..B59F ; LVT # Lo [27] HANGUL SYLLABLE DDYAEG..HANGUL SYLLABLE DDYAEH
> +B5A1..B5BB ; LVT # Lo [27] HANGUL SYLLABLE DDEOG..HANGUL SYLLABLE DDEOH
> +B5BD..B5D7 ; LVT # Lo [27] HANGUL SYLLABLE DDEG..HANGUL SYLLABLE DDEH
> +B5D9..B5F3 ; LVT # Lo [27] HANGUL SYLLABLE DDYEOG..HANGUL SYLLABLE DDYEOH
> +B5F5..B60F ; LVT # Lo [27] HANGUL SYLLABLE DDYEG..HANGUL SYLLABLE DDYEH
> +B611..B62B ; LVT # Lo [27] HANGUL SYLLABLE DDOG..HANGUL SYLLABLE DDOH
> +B62D..B647 ; LVT # Lo [27] HANGUL SYLLABLE DDWAG..HANGUL SYLLABLE DDWAH
> +B649..B663 ; LVT # Lo [27] HANGUL SYLLABLE DDWAEG..HANGUL SYLLABLE DDWAEH
> +B665..B67F ; LVT # Lo [27] HANGUL SYLLABLE DDOEG..HANGUL SYLLABLE DDOEH
> +B681..B69B ; LVT # Lo [27] HANGUL SYLLABLE DDYOG..HANGUL SYLLABLE DDYOH
> +B69D..B6B7 ; LVT # Lo [27] HANGUL SYLLABLE DDUG..HANGUL SYLLABLE DDUH
> +B6B9..B6D3 ; LVT # Lo [27] HANGUL SYLLABLE DDWEOG..HANGUL SYLLABLE DDWEOH
> +B6D5..B6EF ; LVT # Lo [27] HANGUL SYLLABLE DDWEG..HANGUL SYLLABLE DDWEH
> +B6F1..B70B ; LVT # Lo [27] HANGUL SYLLABLE DDWIG..HANGUL SYLLABLE DDWIH
> +B70D..B727 ; LVT # Lo [27] HANGUL SYLLABLE DDYUG..HANGUL SYLLABLE DDYUH
> +B729..B743 ; LVT # Lo [27] HANGUL SYLLABLE DDEUG..HANGUL SYLLABLE DDEUH
> +B745..B75F ; LVT # Lo [27] HANGUL SYLLABLE DDYIG..HANGUL SYLLABLE DDYIH
> +B761..B77B ; LVT # Lo [27] HANGUL SYLLABLE DDIG..HANGUL SYLLABLE DDIH
> +B77D..B797 ; LVT # Lo [27] HANGUL SYLLABLE RAG..HANGUL SYLLABLE RAH
> +B799..B7B3 ; LVT # Lo [27] HANGUL SYLLABLE RAEG..HANGUL SYLLABLE RAEH
> +B7B5..B7CF ; LVT # Lo [27] HANGUL SYLLABLE RYAG..HANGUL SYLLABLE RYAH
> +B7D1..B7EB ; LVT # Lo [27] HANGUL SYLLABLE RYAEG..HANGUL SYLLABLE RYAEH
> +B7ED..B807 ; LVT # Lo [27] HANGUL SYLLABLE REOG..HANGUL SYLLABLE REOH
> +B809..B823 ; LVT # Lo [27] HANGUL SYLLABLE REG..HANGUL SYLLABLE REH
> +B825..B83F ; LVT # Lo [27] HANGUL SYLLABLE RYEOG..HANGUL SYLLABLE RYEOH
> +B841..B85B ; LVT # Lo [27] HANGUL SYLLABLE RYEG..HANGUL SYLLABLE RYEH
> +B85D..B877 ; LVT # Lo [27] HANGUL SYLLABLE ROG..HANGUL SYLLABLE ROH
> +B879..B893 ; LVT # Lo [27] HANGUL SYLLABLE RWAG..HANGUL SYLLABLE RWAH
> +B895..B8AF ; LVT # Lo [27] HANGUL SYLLABLE RWAEG..HANGUL SYLLABLE RWAEH
> +B8B1..B8CB ; LVT # Lo [27] HANGUL SYLLABLE ROEG..HANGUL SYLLABLE ROEH
> +B8CD..B8E7 ; LVT # Lo [27] HANGUL SYLLABLE RYOG..HANGUL SYLLABLE RYOH
> +B8E9..B903 ; LVT # Lo [27] HANGUL SYLLABLE RUG..HANGUL SYLLABLE RUH
> +B905..B91F ; LVT # Lo [27] HANGUL SYLLABLE RWEOG..HANGUL SYLLABLE RWEOH
> +B921..B93B ; LVT # Lo [27] HANGUL SYLLABLE RWEG..HANGUL SYLLABLE RWEH
> +B93D..B957 ; LVT # Lo [27] HANGUL SYLLABLE RWIG..HANGUL SYLLABLE RWIH
> +B959..B973 ; LVT # Lo [27] HANGUL SYLLABLE RYUG..HANGUL SYLLABLE RYUH
> +B975..B98F ; LVT # Lo [27] HANGUL SYLLABLE REUG..HANGUL SYLLABLE REUH
> +B991..B9AB ; LVT # Lo [27] HANGUL SYLLABLE RYIG..HANGUL SYLLABLE RYIH
> +B9AD..B9C7 ; LVT # Lo [27] HANGUL SYLLABLE RIG..HANGUL SYLLABLE RIH
> +B9C9..B9E3 ; LVT # Lo [27] HANGUL SYLLABLE MAG..HANGUL SYLLABLE MAH
> +B9E5..B9FF ; LVT # Lo [27] HANGUL SYLLABLE MAEG..HANGUL SYLLABLE MAEH
> +BA01..BA1B ; LVT # Lo [27] HANGUL SYLLABLE MYAG..HANGUL SYLLABLE MYAH
> +BA1D..BA37 ; LVT # Lo [27] HANGUL SYLLABLE MYAEG..HANGUL SYLLABLE MYAEH
> +BA39..BA53 ; LVT # Lo [27] HANGUL SYLLABLE MEOG..HANGUL SYLLABLE MEOH
> +BA55..BA6F ; LVT # Lo [27] HANGUL SYLLABLE MEG..HANGUL SYLLABLE MEH
> +BA71..BA8B ; LVT # Lo [27] HANGUL SYLLABLE MYEOG..HANGUL SYLLABLE MYEOH
> +BA8D..BAA7 ; LVT # Lo [27] HANGUL SYLLABLE MYEG..HANGUL SYLLABLE MYEH
> +BAA9..BAC3 ; LVT # Lo [27] HANGUL SYLLABLE MOG..HANGUL SYLLABLE MOH
> +BAC5..BADF ; LVT # Lo [27] HANGUL SYLLABLE MWAG..HANGUL SYLLABLE MWAH
> +BAE1..BAFB ; LVT # Lo [27] HANGUL SYLLABLE MWAEG..HANGUL SYLLABLE MWAEH
> +BAFD..BB17 ; LVT # Lo [27] HANGUL SYLLABLE MOEG..HANGUL SYLLABLE MOEH
> +BB19..BB33 ; LVT # Lo [27] HANGUL SYLLABLE MYOG..HANGUL SYLLABLE MYOH
> +BB35..BB4F ; LVT # Lo [27] HANGUL SYLLABLE MUG..HANGUL SYLLABLE MUH
> +BB51..BB6B ; LVT # Lo [27] HANGUL SYLLABLE MWEOG..HANGUL SYLLABLE MWEOH
> +BB6D..BB87 ; LVT # Lo [27] HANGUL SYLLABLE MWEG..HANGUL SYLLABLE MWEH
> +BB89..BBA3 ; LVT # Lo [27] HANGUL SYLLABLE MWIG..HANGUL SYLLABLE MWIH
> +BBA5..BBBF ; LVT # Lo [27] HANGUL SYLLABLE MYUG..HANGUL SYLLABLE MYUH
> +BBC1..BBDB ; LVT # Lo [27] HANGUL SYLLABLE MEUG..HANGUL SYLLABLE MEUH
> +BBDD..BBF7 ; LVT # Lo [27] HANGUL SYLLABLE MYIG..HANGUL SYLLABLE MYIH
> +BBF9..BC13 ; LVT # Lo [27] HANGUL SYLLABLE MIG..HANGUL SYLLABLE MIH
> +BC15..BC2F ; LVT # Lo [27] HANGUL SYLLABLE BAG..HANGUL SYLLABLE BAH
> +BC31..BC4B ; LVT # Lo [27] HANGUL SYLLABLE BAEG..HANGUL SYLLABLE BAEH
> +BC4D..BC67 ; LVT # Lo [27] HANGUL SYLLABLE BYAG..HANGUL SYLLABLE BYAH
> +BC69..BC83 ; LVT # Lo [27] HANGUL SYLLABLE BYAEG..HANGUL SYLLABLE BYAEH
> +BC85..BC9F ; LVT # Lo [27] HANGUL SYLLABLE BEOG..HANGUL SYLLABLE BEOH
> +BCA1..BCBB ; LVT # Lo [27] HANGUL SYLLABLE BEG..HANGUL SYLLABLE BEH
> +BCBD..BCD7 ; LVT # Lo [27] HANGUL SYLLABLE BYEOG..HANGUL SYLLABLE BYEOH
> +BCD9..BCF3 ; LVT # Lo [27] HANGUL SYLLABLE BYEG..HANGUL SYLLABLE BYEH
> +BCF5..BD0F ; LVT # Lo [27] HANGUL SYLLABLE BOG..HANGUL SYLLABLE BOH
> +BD11..BD2B ; LVT # Lo [27] HANGUL SYLLABLE BWAG..HANGUL SYLLABLE BWAH
> +BD2D..BD47 ; LVT # Lo [27] HANGUL SYLLABLE BWAEG..HANGUL SYLLABLE BWAEH
> +BD49..BD63 ; LVT # Lo [27] HANGUL SYLLABLE BOEG..HANGUL SYLLABLE BOEH
> +BD65..BD7F ; LVT # Lo [27] HANGUL SYLLABLE BYOG..HANGUL SYLLABLE BYOH
> +BD81..BD9B ; LVT # Lo [27] HANGUL SYLLABLE BUG..HANGUL SYLLABLE BUH
> +BD9D..BDB7 ; LVT # Lo [27] HANGUL SYLLABLE BWEOG..HANGUL SYLLABLE BWEOH
> +BDB9..BDD3 ; LVT # Lo [27] HANGUL SYLLABLE BWEG..HANGUL SYLLABLE BWEH
> +BDD5..BDEF ; LVT # Lo [27] HANGUL SYLLABLE BWIG..HANGUL SYLLABLE BWIH
> +BDF1..BE0B ; LVT # Lo [27] HANGUL SYLLABLE BYUG..HANGUL SYLLABLE BYUH
> +BE0D..BE27 ; LVT # Lo [27] HANGUL SYLLABLE BEUG..HANGUL SYLLABLE BEUH
> +BE29..BE43 ; LVT # Lo [27] HANGUL SYLLABLE BYIG..HANGUL SYLLABLE BYIH
> +BE45..BE5F ; LVT # Lo [27] HANGUL SYLLABLE BIG..HANGUL SYLLABLE BIH
> +BE61..BE7B ; LVT # Lo [27] HANGUL SYLLABLE BBAG..HANGUL SYLLABLE BBAH
> +BE7D..BE97 ; LVT # Lo [27] HANGUL SYLLABLE BBAEG..HANGUL SYLLABLE BBAEH
> +BE99..BEB3 ; LVT # Lo [27] HANGUL SYLLABLE BBYAG..HANGUL SYLLABLE BBYAH
> +BEB5..BECF ; LVT # Lo [27] HANGUL SYLLABLE BBYAEG..HANGUL SYLLABLE BBYAEH
> +BED1..BEEB ; LVT # Lo [27] HANGUL SYLLABLE BBEOG..HANGUL SYLLABLE BBEOH
> +BEED..BF07 ; LVT # Lo [27] HANGUL SYLLABLE BBEG..HANGUL SYLLABLE BBEH
> +BF09..BF23 ; LVT # Lo [27] HANGUL SYLLABLE BBYEOG..HANGUL SYLLABLE BBYEOH
> +BF25..BF3F ; LVT # Lo [27] HANGUL SYLLABLE BBYEG..HANGUL SYLLABLE BBYEH
> +BF41..BF5B ; LVT # Lo [27] HANGUL SYLLABLE BBOG..HANGUL SYLLABLE BBOH
> +BF5D..BF77 ; LVT # Lo [27] HANGUL SYLLABLE BBWAG..HANGUL SYLLABLE BBWAH
> +BF79..BF93 ; LVT # Lo [27] HANGUL SYLLABLE BBWAEG..HANGUL SYLLABLE BBWAEH
> +BF95..BFAF ; LVT # Lo [27] HANGUL SYLLABLE BBOEG..HANGUL SYLLABLE BBOEH
> +BFB1..BFCB ; LVT # Lo [27] HANGUL SYLLABLE BBYOG..HANGUL SYLLABLE BBYOH
> +BFCD..BFE7 ; LVT # Lo [27] HANGUL SYLLABLE BBUG..HANGUL SYLLABLE BBUH
> +BFE9..C003 ; LVT # Lo [27] HANGUL SYLLABLE BBWEOG..HANGUL SYLLABLE BBWEOH
> +C005..C01F ; LVT # Lo [27] HANGUL SYLLABLE BBWEG..HANGUL SYLLABLE BBWEH
> +C021..C03B ; LVT # Lo [27] HANGUL SYLLABLE BBWIG..HANGUL SYLLABLE BBWIH
> +C03D..C057 ; LVT # Lo [27] HANGUL SYLLABLE BBYUG..HANGUL SYLLABLE BBYUH
> +C059..C073 ; LVT # Lo [27] HANGUL SYLLABLE BBEUG..HANGUL SYLLABLE BBEUH
> +C075..C08F ; LVT # Lo [27] HANGUL SYLLABLE BBYIG..HANGUL SYLLABLE BBYIH
> +C091..C0AB ; LVT # Lo [27] HANGUL SYLLABLE BBIG..HANGUL SYLLABLE BBIH
> +C0AD..C0C7 ; LVT # Lo [27] HANGUL SYLLABLE SAG..HANGUL SYLLABLE SAH
> +C0C9..C0E3 ; LVT # Lo [27] HANGUL SYLLABLE SAEG..HANGUL SYLLABLE SAEH
> +C0E5..C0FF ; LVT # Lo [27] HANGUL SYLLABLE SYAG..HANGUL SYLLABLE SYAH
> +C101..C11B ; LVT # Lo [27] HANGUL SYLLABLE SYAEG..HANGUL SYLLABLE SYAEH
> +C11D..C137 ; LVT # Lo [27] HANGUL SYLLABLE SEOG..HANGUL SYLLABLE SEOH
> +C139..C153 ; LVT # Lo [27] HANGUL SYLLABLE SEG..HANGUL SYLLABLE SEH
> +C155..C16F ; LVT # Lo [27] HANGUL SYLLABLE SYEOG..HANGUL SYLLABLE SYEOH
> +C171..C18B ; LVT # Lo [27] HANGUL SYLLABLE SYEG..HANGUL SYLLABLE SYEH
> +C18D..C1A7 ; LVT # Lo [27] HANGUL SYLLABLE SOG..HANGUL SYLLABLE SOH
> +C1A9..C1C3 ; LVT # Lo [27] HANGUL SYLLABLE SWAG..HANGUL SYLLABLE SWAH
> +C1C5..C1DF ; LVT # Lo [27] HANGUL SYLLABLE SWAEG..HANGUL SYLLABLE SWAEH
> +C1E1..C1FB ; LVT # Lo [27] HANGUL SYLLABLE SOEG..HANGUL SYLLABLE SOEH
> +C1FD..C217 ; LVT # Lo [27] HANGUL SYLLABLE SYOG..HANGUL SYLLABLE SYOH
> +C219..C233 ; LVT # Lo [27] HANGUL SYLLABLE SUG..HANGUL SYLLABLE SUH
> +C235..C24F ; LVT # Lo [27] HANGUL SYLLABLE SWEOG..HANGUL SYLLABLE SWEOH
> +C251..C26B ; LVT # Lo [27] HANGUL SYLLABLE SWEG..HANGUL SYLLABLE SWEH
> +C26D..C287 ; LVT # Lo [27] HANGUL SYLLABLE SWIG..HANGUL SYLLABLE SWIH
> +C289..C2A3 ; LVT # Lo [27] HANGUL SYLLABLE SYUG..HANGUL SYLLABLE SYUH
> +C2A5..C2BF ; LVT # Lo [27] HANGUL SYLLABLE SEUG..HANGUL SYLLABLE SEUH
> +C2C1..C2DB ; LVT # Lo [27] HANGUL SYLLABLE SYIG..HANGUL SYLLABLE SYIH
> +C2DD..C2F7 ; LVT # Lo [27] HANGUL SYLLABLE SIG..HANGUL SYLLABLE SIH
> +C2F9..C313 ; LVT # Lo [27] HANGUL SYLLABLE SSAG..HANGUL SYLLABLE SSAH
> +C315..C32F ; LVT # Lo [27] HANGUL SYLLABLE SSAEG..HANGUL SYLLABLE SSAEH
> +C331..C34B ; LVT # Lo [27] HANGUL SYLLABLE SSYAG..HANGUL SYLLABLE SSYAH
> +C34D..C367 ; LVT # Lo [27] HANGUL SYLLABLE SSYAEG..HANGUL SYLLABLE SSYAEH
> +C369..C383 ; LVT # Lo [27] HANGUL SYLLABLE SSEOG..HANGUL SYLLABLE SSEOH
> +C385..C39F ; LVT # Lo [27] HANGUL SYLLABLE SSEG..HANGUL SYLLABLE SSEH
> +C3A1..C3BB ; LVT # Lo [27] HANGUL SYLLABLE SSYEOG..HANGUL SYLLABLE SSYEOH
> +C3BD..C3D7 ; LVT # Lo [27] HANGUL SYLLABLE SSYEG..HANGUL SYLLABLE SSYEH
> +C3D9..C3F3 ; LVT # Lo [27] HANGUL SYLLABLE SSOG..HANGUL SYLLABLE SSOH
> +C3F5..C40F ; LVT # Lo [27] HANGUL SYLLABLE SSWAG..HANGUL SYLLABLE SSWAH
> +C411..C42B ; LVT # Lo [27] HANGUL SYLLABLE SSWAEG..HANGUL SYLLABLE SSWAEH
> +C42D..C447 ; LVT # Lo [27] HANGUL SYLLABLE SSOEG..HANGUL SYLLABLE SSOEH
> +C449..C463 ; LVT # Lo [27] HANGUL SYLLABLE SSYOG..HANGUL SYLLABLE SSYOH
> +C465..C47F ; LVT # Lo [27] HANGUL SYLLABLE SSUG..HANGUL SYLLABLE SSUH
> +C481..C49B ; LVT # Lo [27] HANGUL SYLLABLE SSWEOG..HANGUL SYLLABLE SSWEOH
> +C49D..C4B7 ; LVT # Lo [27] HANGUL SYLLABLE SSWEG..HANGUL SYLLABLE SSWEH
> +C4B9..C4D3 ; LVT # Lo [27] HANGUL SYLLABLE SSWIG..HANGUL SYLLABLE SSWIH
> +C4D5..C4EF ; LVT # Lo [27] HANGUL SYLLABLE SSYUG..HANGUL SYLLABLE SSYUH
> +C4F1..C50B ; LVT # Lo [27] HANGUL SYLLABLE SSEUG..HANGUL SYLLABLE SSEUH
> +C50D..C527 ; LVT # Lo [27] HANGUL SYLLABLE SSYIG..HANGUL SYLLABLE SSYIH
> +C529..C543 ; LVT # Lo [27] HANGUL SYLLABLE SSIG..HANGUL SYLLABLE SSIH
> +C545..C55F ; LVT # Lo [27] HANGUL SYLLABLE AG..HANGUL SYLLABLE AH
> +C561..C57B ; LVT # Lo [27] HANGUL SYLLABLE AEG..HANGUL SYLLABLE AEH
> +C57D..C597 ; LVT # Lo [27] HANGUL SYLLABLE YAG..HANGUL SYLLABLE YAH
> +C599..C5B3 ; LVT # Lo [27] HANGUL SYLLABLE YAEG..HANGUL SYLLABLE YAEH
> +C5B5..C5CF ; LVT # Lo [27] HANGUL SYLLABLE EOG..HANGUL SYLLABLE EOH
> +C5D1..C5EB ; LVT # Lo [27] HANGUL SYLLABLE EG..HANGUL SYLLABLE EH
> +C5ED..C607 ; LVT # Lo [27] HANGUL SYLLABLE YEOG..HANGUL SYLLABLE YEOH
> +C609..C623 ; LVT # Lo [27] HANGUL SYLLABLE YEG..HANGUL SYLLABLE YEH
> +C625..C63F ; LVT # Lo [27] HANGUL SYLLABLE OG..HANGUL SYLLABLE OH
> +C641..C65B ; LVT # Lo [27] HANGUL SYLLABLE WAG..HANGUL SYLLABLE WAH
> +C65D..C677 ; LVT # Lo [27] HANGUL SYLLABLE WAEG..HANGUL SYLLABLE WAEH
> +C679..C693 ; LVT # Lo [27] HANGUL SYLLABLE OEG..HANGUL SYLLABLE OEH
> +C695..C6AF ; LVT # Lo [27] HANGUL SYLLABLE YOG..HANGUL SYLLABLE YOH
> +C6B1..C6CB ; LVT # Lo [27] HANGUL SYLLABLE UG..HANGUL SYLLABLE UH
> +C6CD..C6E7 ; LVT # Lo [27] HANGUL SYLLABLE WEOG..HANGUL SYLLABLE WEOH
> +C6E9..C703 ; LVT # Lo [27] HANGUL SYLLABLE WEG..HANGUL SYLLABLE WEH
> +C705..C71F ; LVT # Lo [27] HANGUL SYLLABLE WIG..HANGUL SYLLABLE WIH
> +C721..C73B ; LVT # Lo [27] HANGUL SYLLABLE YUG..HANGUL SYLLABLE YUH
> +C73D..C757 ; LVT # Lo [27] HANGUL SYLLABLE EUG..HANGUL SYLLABLE EUH
> +C759..C773 ; LVT # Lo [27] HANGUL SYLLABLE YIG..HANGUL SYLLABLE YIH
> +C775..C78F ; LVT # Lo [27] HANGUL SYLLABLE IG..HANGUL SYLLABLE IH
> +C791..C7AB ; LVT # Lo [27] HANGUL SYLLABLE JAG..HANGUL SYLLABLE JAH
> +C7AD..C7C7 ; LVT # Lo [27] HANGUL SYLLABLE JAEG..HANGUL SYLLABLE JAEH
> +C7C9..C7E3 ; LVT # Lo [27] HANGUL SYLLABLE JYAG..HANGUL SYLLABLE JYAH
> +C7E5..C7FF ; LVT # Lo [27] HANGUL SYLLABLE JYAEG..HANGUL SYLLABLE JYAEH
> +C801..C81B ; LVT # Lo [27] HANGUL SYLLABLE JEOG..HANGUL SYLLABLE JEOH
> +C81D..C837 ; LVT # Lo [27] HANGUL SYLLABLE JEG..HANGUL SYLLABLE JEH
> +C839..C853 ; LVT # Lo [27] HANGUL SYLLABLE JYEOG..HANGUL SYLLABLE JYEOH
> +C855..C86F ; LVT # Lo [27] HANGUL SYLLABLE JYEG..HANGUL SYLLABLE JYEH
> +C871..C88B ; LVT # Lo [27] HANGUL SYLLABLE JOG..HANGUL SYLLABLE JOH
> +C88D..C8A7 ; LVT # Lo [27] HANGUL SYLLABLE JWAG..HANGUL SYLLABLE JWAH
> +C8A9..C8C3 ; LVT # Lo [27] HANGUL SYLLABLE JWAEG..HANGUL SYLLABLE JWAEH
> +C8C5..C8DF ; LVT # Lo [27] HANGUL SYLLABLE JOEG..HANGUL SYLLABLE JOEH
> +C8E1..C8FB ; LVT # Lo [27] HANGUL SYLLABLE JYOG..HANGUL SYLLABLE JYOH
> +C8FD..C917 ; LVT # Lo [27] HANGUL SYLLABLE JUG..HANGUL SYLLABLE JUH
> +C919..C933 ; LVT # Lo [27] HANGUL SYLLABLE JWEOG..HANGUL SYLLABLE JWEOH
> +C935..C94F ; LVT # Lo [27] HANGUL SYLLABLE JWEG..HANGUL SYLLABLE JWEH
> +C951..C96B ; LVT # Lo [27] HANGUL SYLLABLE JWIG..HANGUL SYLLABLE JWIH
> +C96D..C987 ; LVT # Lo [27] HANGUL SYLLABLE JYUG..HANGUL SYLLABLE JYUH
> +C989..C9A3 ; LVT # Lo [27] HANGUL SYLLABLE JEUG..HANGUL SYLLABLE JEUH
> +C9A5..C9BF ; LVT # Lo [27] HANGUL SYLLABLE JYIG..HANGUL SYLLABLE JYIH
> +C9C1..C9DB ; LVT # Lo [27] HANGUL SYLLABLE JIG..HANGUL SYLLABLE JIH
> +C9DD..C9F7 ; LVT # Lo [27] HANGUL SYLLABLE JJAG..HANGUL SYLLABLE JJAH
> +C9F9..CA13 ; LVT # Lo [27] HANGUL SYLLABLE JJAEG..HANGUL SYLLABLE JJAEH
> +CA15..CA2F ; LVT # Lo [27] HANGUL SYLLABLE JJYAG..HANGUL SYLLABLE JJYAH
> +CA31..CA4B ; LVT # Lo [27] HANGUL SYLLABLE JJYAEG..HANGUL SYLLABLE JJYAEH
> +CA4D..CA67 ; LVT # Lo [27] HANGUL SYLLABLE JJEOG..HANGUL SYLLABLE JJEOH
> +CA69..CA83 ; LVT # Lo [27] HANGUL SYLLABLE JJEG..HANGUL SYLLABLE JJEH
> +CA85..CA9F ; LVT # Lo [27] HANGUL SYLLABLE JJYEOG..HANGUL SYLLABLE JJYEOH
> +CAA1..CABB ; LVT # Lo [27] HANGUL SYLLABLE JJYEG..HANGUL SYLLABLE JJYEH
> +CABD..CAD7 ; LVT # Lo [27] HANGUL SYLLABLE JJOG..HANGUL SYLLABLE JJOH
> +CAD9..CAF3 ; LVT # Lo [27] HANGUL SYLLABLE JJWAG..HANGUL SYLLABLE JJWAH
> +CAF5..CB0F ; LVT # Lo [27] HANGUL SYLLABLE JJWAEG..HANGUL SYLLABLE JJWAEH
> +CB11..CB2B ; LVT # Lo [27] HANGUL SYLLABLE JJOEG..HANGUL SYLLABLE JJOEH
> +CB2D..CB47 ; LVT # Lo [27] HANGUL SYLLABLE JJYOG..HANGUL SYLLABLE JJYOH
> +CB49..CB63 ; LVT # Lo [27] HANGUL SYLLABLE JJUG..HANGUL SYLLABLE JJUH
> +CB65..CB7F ; LVT # Lo [27] HANGUL SYLLABLE JJWEOG..HANGUL SYLLABLE JJWEOH
> +CB81..CB9B ; LVT # Lo [27] HANGUL SYLLABLE JJWEG..HANGUL SYLLABLE JJWEH
> +CB9D..CBB7 ; LVT # Lo [27] HANGUL SYLLABLE JJWIG..HANGUL SYLLABLE JJWIH
> +CBB9..CBD3 ; LVT # Lo [27] HANGUL SYLLABLE JJYUG..HANGUL SYLLABLE JJYUH
> +CBD5..CBEF ; LVT # Lo [27] HANGUL SYLLABLE JJEUG..HANGUL SYLLABLE JJEUH
> +CBF1..CC0B ; LVT # Lo [27] HANGUL SYLLABLE JJYIG..HANGUL SYLLABLE JJYIH
> +CC0D..CC27 ; LVT # Lo [27] HANGUL SYLLABLE JJIG..HANGUL SYLLABLE JJIH
> +CC29..CC43 ; LVT # Lo [27] HANGUL SYLLABLE CAG..HANGUL SYLLABLE CAH
> +CC45..CC5F ; LVT # Lo [27] HANGUL SYLLABLE CAEG..HANGUL SYLLABLE CAEH
> +CC61..CC7B ; LVT # Lo [27] HANGUL SYLLABLE CYAG..HANGUL SYLLABLE CYAH
> +CC7D..CC97 ; LVT # Lo [27] HANGUL SYLLABLE CYAEG..HANGUL SYLLABLE CYAEH
> +CC99..CCB3 ; LVT # Lo [27] HANGUL SYLLABLE CEOG..HANGUL SYLLABLE CEOH
> +CCB5..CCCF ; LVT # Lo [27] HANGUL SYLLABLE CEG..HANGUL SYLLABLE CEH
> +CCD1..CCEB ; LVT # Lo [27] HANGUL SYLLABLE CYEOG..HANGUL SYLLABLE CYEOH
> +CCED..CD07 ; LVT # Lo [27] HANGUL SYLLABLE CYEG..HANGUL SYLLABLE CYEH
> +CD09..CD23 ; LVT # Lo [27] HANGUL SYLLABLE COG..HANGUL SYLLABLE COH
> +CD25..CD3F ; LVT # Lo [27] HANGUL SYLLABLE CWAG..HANGUL SYLLABLE CWAH
> +CD41..CD5B ; LVT # Lo [27] HANGUL SYLLABLE CWAEG..HANGUL SYLLABLE CWAEH
> +CD5D..CD77 ; LVT # Lo [27] HANGUL SYLLABLE COEG..HANGUL SYLLABLE COEH
> +CD79..CD93 ; LVT # Lo [27] HANGUL SYLLABLE CYOG..HANGUL SYLLABLE CYOH
> +CD95..CDAF ; LVT # Lo [27] HANGUL SYLLABLE CUG..HANGUL SYLLABLE CUH
> +CDB1..CDCB ; LVT # Lo [27] HANGUL SYLLABLE CWEOG..HANGUL SYLLABLE CWEOH
> +CDCD..CDE7 ; LVT # Lo [27] HANGUL SYLLABLE CWEG..HANGUL SYLLABLE CWEH
> +CDE9..CE03 ; LVT # Lo [27] HANGUL SYLLABLE CWIG..HANGUL SYLLABLE CWIH
> +CE05..CE1F ; LVT # Lo [27] HANGUL SYLLABLE CYUG..HANGUL SYLLABLE CYUH
> +CE21..CE3B ; LVT # Lo [27] HANGUL SYLLABLE CEUG..HANGUL SYLLABLE CEUH
> +CE3D..CE57 ; LVT # Lo [27] HANGUL SYLLABLE CYIG..HANGUL SYLLABLE CYIH
> +CE59..CE73 ; LVT # Lo [27] HANGUL SYLLABLE CIG..HANGUL SYLLABLE CIH
> +CE75..CE8F ; LVT # Lo [27] HANGUL SYLLABLE KAG..HANGUL SYLLABLE KAH
> +CE91..CEAB ; LVT # Lo [27] HANGUL SYLLABLE KAEG..HANGUL SYLLABLE KAEH
> +CEAD..CEC7 ; LVT # Lo [27] HANGUL SYLLABLE KYAG..HANGUL SYLLABLE KYAH
> +CEC9..CEE3 ; LVT # Lo [27] HANGUL SYLLABLE KYAEG..HANGUL SYLLABLE KYAEH
> +CEE5..CEFF ; LVT # Lo [27] HANGUL SYLLABLE KEOG..HANGUL SYLLABLE KEOH
> +CF01..CF1B ; LVT # Lo [27] HANGUL SYLLABLE KEG..HANGUL SYLLABLE KEH
> +CF1D..CF37 ; LVT # Lo [27] HANGUL SYLLABLE KYEOG..HANGUL SYLLABLE KYEOH
> +CF39..CF53 ; LVT # Lo [27] HANGUL SYLLABLE KYEG..HANGUL SYLLABLE KYEH
> +CF55..CF6F ; LVT # Lo [27] HANGUL SYLLABLE KOG..HANGUL SYLLABLE KOH
> +CF71..CF8B ; LVT # Lo [27] HANGUL SYLLABLE KWAG..HANGUL SYLLABLE KWAH
> +CF8D..CFA7 ; LVT # Lo [27] HANGUL SYLLABLE KWAEG..HANGUL SYLLABLE KWAEH
> +CFA9..CFC3 ; LVT # Lo [27] HANGUL SYLLABLE KOEG..HANGUL SYLLABLE KOEH
> +CFC5..CFDF ; LVT # Lo [27] HANGUL SYLLABLE KYOG..HANGUL SYLLABLE KYOH
> +CFE1..CFFB ; LVT # Lo [27] HANGUL SYLLABLE KUG..HANGUL SYLLABLE KUH
> +CFFD..D017 ; LVT # Lo [27] HANGUL SYLLABLE KWEOG..HANGUL SYLLABLE KWEOH
> +D019..D033 ; LVT # Lo [27] HANGUL SYLLABLE KWEG..HANGUL SYLLABLE KWEH
> +D035..D04F ; LVT # Lo [27] HANGUL SYLLABLE KWIG..HANGUL SYLLABLE KWIH
> +D051..D06B ; LVT # Lo [27] HANGUL SYLLABLE KYUG..HANGUL SYLLABLE KYUH
> +D06D..D087 ; LVT # Lo [27] HANGUL SYLLABLE KEUG..HANGUL SYLLABLE KEUH
> +D089..D0A3 ; LVT # Lo [27] HANGUL SYLLABLE KYIG..HANGUL SYLLABLE KYIH
> +D0A5..D0BF ; LVT # Lo [27] HANGUL SYLLABLE KIG..HANGUL SYLLABLE KIH
> +D0C1..D0DB ; LVT # Lo [27] HANGUL SYLLABLE TAG..HANGUL SYLLABLE TAH
> +D0DD..D0F7 ; LVT # Lo [27] HANGUL SYLLABLE TAEG..HANGUL SYLLABLE TAEH
> +D0F9..D113 ; LVT # Lo [27] HANGUL SYLLABLE TYAG..HANGUL SYLLABLE TYAH
> +D115..D12F ; LVT # Lo [27] HANGUL SYLLABLE TYAEG..HANGUL SYLLABLE TYAEH
> +D131..D14B ; LVT # Lo [27] HANGUL SYLLABLE TEOG..HANGUL SYLLABLE TEOH
> +D14D..D167 ; LVT # Lo [27] HANGUL SYLLABLE TEG..HANGUL SYLLABLE TEH
> +D169..D183 ; LVT # Lo [27] HANGUL SYLLABLE TYEOG..HANGUL SYLLABLE TYEOH
> +D185..D19F ; LVT # Lo [27] HANGUL SYLLABLE TYEG..HANGUL SYLLABLE TYEH
> +D1A1..D1BB ; LVT # Lo [27] HANGUL SYLLABLE TOG..HANGUL SYLLABLE TOH
> +D1BD..D1D7 ; LVT # Lo [27] HANGUL SYLLABLE TWAG..HANGUL SYLLABLE TWAH
> +D1D9..D1F3 ; LVT # Lo [27] HANGUL SYLLABLE TWAEG..HANGUL SYLLABLE TWAEH
> +D1F5..D20F ; LVT # Lo [27] HANGUL SYLLABLE TOEG..HANGUL SYLLABLE TOEH
> +D211..D22B ; LVT # Lo [27] HANGUL SYLLABLE TYOG..HANGUL SYLLABLE TYOH
> +D22D..D247 ; LVT # Lo [27] HANGUL SYLLABLE TUG..HANGUL SYLLABLE TUH
> +D249..D263 ; LVT # Lo [27] HANGUL SYLLABLE TWEOG..HANGUL SYLLABLE TWEOH
> +D265..D27F ; LVT # Lo [27] HANGUL SYLLABLE TWEG..HANGUL SYLLABLE TWEH
> +D281..D29B ; LVT # Lo [27] HANGUL SYLLABLE TWIG..HANGUL SYLLABLE TWIH
> +D29D..D2B7 ; LVT # Lo [27] HANGUL SYLLABLE TYUG..HANGUL SYLLABLE TYUH
> +D2B9..D2D3 ; LVT # Lo [27] HANGUL SYLLABLE TEUG..HANGUL SYLLABLE TEUH
> +D2D5..D2EF ; LVT # Lo [27] HANGUL SYLLABLE TYIG..HANGUL SYLLABLE TYIH
> +D2F1..D30B ; LVT # Lo [27] HANGUL SYLLABLE TIG..HANGUL SYLLABLE TIH
> +D30D..D327 ; LVT # Lo [27] HANGUL SYLLABLE PAG..HANGUL SYLLABLE PAH
> +D329..D343 ; LVT # Lo [27] HANGUL SYLLABLE PAEG..HANGUL SYLLABLE PAEH
> +D345..D35F ; LVT # Lo [27] HANGUL SYLLABLE PYAG..HANGUL SYLLABLE PYAH
> +D361..D37B ; LVT # Lo [27] HANGUL SYLLABLE PYAEG..HANGUL SYLLABLE PYAEH
> +D37D..D397 ; LVT # Lo [27] HANGUL SYLLABLE PEOG..HANGUL SYLLABLE PEOH
> +D399..D3B3 ; LVT # Lo [27] HANGUL SYLLABLE PEG..HANGUL SYLLABLE PEH
> +D3B5..D3CF ; LVT # Lo [27] HANGUL SYLLABLE PYEOG..HANGUL SYLLABLE PYEOH
> +D3D1..D3EB ; LVT # Lo [27] HANGUL SYLLABLE PYEG..HANGUL SYLLABLE PYEH
> +D3ED..D407 ; LVT # Lo [27] HANGUL SYLLABLE POG..HANGUL SYLLABLE POH
> +D409..D423 ; LVT # Lo [27] HANGUL SYLLABLE PWAG..HANGUL SYLLABLE PWAH
> +D425..D43F ; LVT # Lo [27] HANGUL SYLLABLE PWAEG..HANGUL SYLLABLE PWAEH
> +D441..D45B ; LVT # Lo [27] HANGUL SYLLABLE POEG..HANGUL SYLLABLE POEH
> +D45D..D477 ; LVT # Lo [27] HANGUL SYLLABLE PYOG..HANGUL SYLLABLE PYOH
> +D479..D493 ; LVT # Lo [27] HANGUL SYLLABLE PUG..HANGUL SYLLABLE PUH
> +D495..D4AF ; LVT # Lo [27] HANGUL SYLLABLE PWEOG..HANGUL SYLLABLE PWEOH
> +D4B1..D4CB ; LVT # Lo [27] HANGUL SYLLABLE PWEG..HANGUL SYLLABLE PWEH
> +D4CD..D4E7 ; LVT # Lo [27] HANGUL SYLLABLE PWIG..HANGUL SYLLABLE PWIH
> +D4E9..D503 ; LVT # Lo [27] HANGUL SYLLABLE PYUG..HANGUL SYLLABLE PYUH
> +D505..D51F ; LVT # Lo [27] HANGUL SYLLABLE PEUG..HANGUL SYLLABLE PEUH
> +D521..D53B ; LVT # Lo [27] HANGUL SYLLABLE PYIG..HANGUL SYLLABLE PYIH
> +D53D..D557 ; LVT # Lo [27] HANGUL SYLLABLE PIG..HANGUL SYLLABLE PIH
> +D559..D573 ; LVT # Lo [27] HANGUL SYLLABLE HAG..HANGUL SYLLABLE HAH
> +D575..D58F ; LVT # Lo [27] HANGUL SYLLABLE HAEG..HANGUL SYLLABLE HAEH
> +D591..D5AB ; LVT # Lo [27] HANGUL SYLLABLE HYAG..HANGUL SYLLABLE HYAH
> +D5AD..D5C7 ; LVT # Lo [27] HANGUL SYLLABLE HYAEG..HANGUL SYLLABLE HYAEH
> +D5C9..D5E3 ; LVT # Lo [27] HANGUL SYLLABLE HEOG..HANGUL SYLLABLE HEOH
> +D5E5..D5FF ; LVT # Lo [27] HANGUL SYLLABLE HEG..HANGUL SYLLABLE HEH
> +D601..D61B ; LVT # Lo [27] HANGUL SYLLABLE HYEOG..HANGUL SYLLABLE HYEOH
> +D61D..D637 ; LVT # Lo [27] HANGUL SYLLABLE HYEG..HANGUL SYLLABLE HYEH
> +D639..D653 ; LVT # Lo [27] HANGUL SYLLABLE HOG..HANGUL SYLLABLE HOH
> +D655..D66F ; LVT # Lo [27] HANGUL SYLLABLE HWAG..HANGUL SYLLABLE HWAH
> +D671..D68B ; LVT # Lo [27] HANGUL SYLLABLE HWAEG..HANGUL SYLLABLE HWAEH
> +D68D..D6A7 ; LVT # Lo [27] HANGUL SYLLABLE HOEG..HANGUL SYLLABLE HOEH
> +D6A9..D6C3 ; LVT # Lo [27] HANGUL SYLLABLE HYOG..HANGUL SYLLABLE HYOH
> +D6C5..D6DF ; LVT # Lo [27] HANGUL SYLLABLE HUG..HANGUL SYLLABLE HUH
> +D6E1..D6FB ; LVT # Lo [27] HANGUL SYLLABLE HWEOG..HANGUL SYLLABLE HWEOH
> +D6FD..D717 ; LVT # Lo [27] HANGUL SYLLABLE HWEG..HANGUL SYLLABLE HWEH
> +D719..D733 ; LVT # Lo [27] HANGUL SYLLABLE HWIG..HANGUL SYLLABLE HWIH
> +D735..D74F ; LVT # Lo [27] HANGUL SYLLABLE HYUG..HANGUL SYLLABLE HYUH
> +D751..D76B ; LVT # Lo [27] HANGUL SYLLABLE HEUG..HANGUL SYLLABLE HEUH
> +D76D..D787 ; LVT # Lo [27] HANGUL SYLLABLE HYIG..HANGUL SYLLABLE HYIH
> +D789..D7A3 ; LVT # Lo [27] HANGUL SYLLABLE HIG..HANGUL SYLLABLE HIH
> +
> +# Total code points: 10773
> +
> +# EOF
> diff --git a/localedata/unicode-gen/Makefile b/localedata/unicode-gen/Makefile
> index 3004d4a792..eef53e1307 100644
> --- a/localedata/unicode-gen/Makefile
> +++ b/localedata/unicode-gen/Makefile
> @@ -1,4 +1,5 @@
> # Copyright (C) 2015-2024 Free Software Foundation, Inc.
> +# Copyright (C) 2024 The GNU Toolchain Authors.
> # This file is part of the GNU C Library.
>
> # The GNU C Library is free software; you can redistribute it and/or
> @@ -40,7 +41,7 @@ UNICODE_VERSION = 15.1.0
> PYTHON3 = python3
> WGET = wget
>
> -DOWNLOADS = UnicodeData.txt DerivedCoreProperties.txt EastAsianWidth.txt PropList.txt
> +DOWNLOADS = UnicodeData.txt DerivedCoreProperties.txt EastAsianWidth.txt HangulSyllableType.txt
> GENERATED = i18n_ctype tr_TR UTF-8 translit_combining translit_compat translit_circle translit_cjk_compat translit_font translit_fraction
> REPORTS = i18n_ctype-report UTF-8-report
>
> @@ -90,10 +91,12 @@ tr_TR: gen_unicode_ctype.py
> -d DerivedCoreProperties.txt -i ../locales/tr_TR -o $@ \
> --unicode_version $(UNICODE_VERSION) --turkish
>
> -UTF-8: UnicodeData.txt EastAsianWidth.txt
> +UTF-8: UnicodeData.txt DerivedCoreProperties.txt EastAsianWidth.txt HangulSyllableType.txt
> UTF-8: utf8_gen.py
> $(PYTHON3) utf8_gen.py -u UnicodeData.txt \
> - -e EastAsianWidth.txt -p PropList.txt \
> + -d DerivedCoreProperties.txt \
> + -e EastAsianWidth.txt \
> + -k HangulSyllableType.txt \
> --unicode_version $(UNICODE_VERSION)
>
> UTF-8-report: UTF-8 ../charmaps/UTF-8
> diff --git a/localedata/unicode-gen/PropList.txt b/localedata/unicode-gen/PropList.txt
> deleted file mode 100644
> index 777e8a2881..0000000000
> --- a/localedata/unicode-gen/PropList.txt
> +++ /dev/null
> @@ -1,1827 +0,0 @@
> -# PropList-15.1.0.txt
> -# Date: 2023-08-01, 21:56:53 GMT
> -# © 2023 Unicode®, Inc.
> -# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
> -# For terms of use, see https://www.unicode.org/terms_of_use.html
> -#
> -# Unicode Character Database
> -# For documentation, see https://www.unicode.org/reports/tr44/
> -
> -# ================================================
> -
> -0009..000D ; White_Space # Cc [5] <control-0009>..<control-000D>
> -0020 ; White_Space # Zs SPACE
> -0085 ; White_Space # Cc <control-0085>
> -00A0 ; White_Space # Zs NO-BREAK SPACE
> -1680 ; White_Space # Zs OGHAM SPACE MARK
> -2000..200A ; White_Space # Zs [11] EN QUAD..HAIR SPACE
> -2028 ; White_Space # Zl LINE SEPARATOR
> -2029 ; White_Space # Zp PARAGRAPH SEPARATOR
> -202F ; White_Space # Zs NARROW NO-BREAK SPACE
> -205F ; White_Space # Zs MEDIUM MATHEMATICAL SPACE
> -3000 ; White_Space # Zs IDEOGRAPHIC SPACE
> -
> -# Total code points: 25
> -
> -# ================================================
> -
> -061C ; Bidi_Control # Cf ARABIC LETTER MARK
> -200E..200F ; Bidi_Control # Cf [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
> -202A..202E ; Bidi_Control # Cf [5] LEFT-TO-RIGHT EMBEDDING..RIGHT-TO-LEFT OVERRIDE
> -2066..2069 ; Bidi_Control # Cf [4] LEFT-TO-RIGHT ISOLATE..POP DIRECTIONAL ISOLATE
> -
> -# Total code points: 12
> -
> -# ================================================
> -
> -200C..200D ; Join_Control # Cf [2] ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER
> -
> -# Total code points: 2
> -
> -# ================================================
> -
> -002D ; Dash # Pd HYPHEN-MINUS
> -058A ; Dash # Pd ARMENIAN HYPHEN
> -05BE ; Dash # Pd HEBREW PUNCTUATION MAQAF
> -1400 ; Dash # Pd CANADIAN SYLLABICS HYPHEN
> -1806 ; Dash # Pd MONGOLIAN TODO SOFT HYPHEN
> -2010..2015 ; Dash # Pd [6] HYPHEN..HORIZONTAL BAR
> -2053 ; Dash # Po SWUNG DASH
> -207B ; Dash # Sm SUPERSCRIPT MINUS
> -208B ; Dash # Sm SUBSCRIPT MINUS
> -2212 ; Dash # Sm MINUS SIGN
> -2E17 ; Dash # Pd DOUBLE OBLIQUE HYPHEN
> -2E1A ; Dash # Pd HYPHEN WITH DIAERESIS
> -2E3A..2E3B ; Dash # Pd [2] TWO-EM DASH..THREE-EM DASH
> -2E40 ; Dash # Pd DOUBLE HYPHEN
> -2E5D ; Dash # Pd OBLIQUE HYPHEN
> -301C ; Dash # Pd WAVE DASH
> -3030 ; Dash # Pd WAVY DASH
> -30A0 ; Dash # Pd KATAKANA-HIRAGANA DOUBLE HYPHEN
> -FE31..FE32 ; Dash # Pd [2] PRESENTATION FORM FOR VERTICAL EM DASH..PRESENTATION FORM FOR VERTICAL EN DASH
> -FE58 ; Dash # Pd SMALL EM DASH
> -FE63 ; Dash # Pd SMALL HYPHEN-MINUS
> -FF0D ; Dash # Pd FULLWIDTH HYPHEN-MINUS
> -10EAD ; Dash # Pd YEZIDI HYPHENATION MARK
> -
> -# Total code points: 30
> -
> -# ================================================
> -
> -002D ; Hyphen # Pd HYPHEN-MINUS
> -00AD ; Hyphen # Cf SOFT HYPHEN
> -058A ; Hyphen # Pd ARMENIAN HYPHEN
> -1806 ; Hyphen # Pd MONGOLIAN TODO SOFT HYPHEN
> -2010..2011 ; Hyphen # Pd [2] HYPHEN..NON-BREAKING HYPHEN
> -2E17 ; Hyphen # Pd DOUBLE OBLIQUE HYPHEN
> -30FB ; Hyphen # Po KATAKANA MIDDLE DOT
> -FE63 ; Hyphen # Pd SMALL HYPHEN-MINUS
> -FF0D ; Hyphen # Pd FULLWIDTH HYPHEN-MINUS
> -FF65 ; Hyphen # Po HALFWIDTH KATAKANA MIDDLE DOT
> -
> -# Total code points: 11
> -
> -# ================================================
> -
> -0022 ; Quotation_Mark # Po QUOTATION MARK
> -0027 ; Quotation_Mark # Po APOSTROPHE
> -00AB ; Quotation_Mark # Pi LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
> -00BB ; Quotation_Mark # Pf RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
> -2018 ; Quotation_Mark # Pi LEFT SINGLE QUOTATION MARK
> -2019 ; Quotation_Mark # Pf RIGHT SINGLE QUOTATION MARK
> -201A ; Quotation_Mark # Ps SINGLE LOW-9 QUOTATION MARK
> -201B..201C ; Quotation_Mark # Pi [2] SINGLE HIGH-REVERSED-9 QUOTATION MARK..LEFT DOUBLE QUOTATION MARK
> -201D ; Quotation_Mark # Pf RIGHT DOUBLE QUOTATION MARK
> -201E ; Quotation_Mark # Ps DOUBLE LOW-9 QUOTATION MARK
> -201F ; Quotation_Mark # Pi DOUBLE HIGH-REVERSED-9 QUOTATION MARK
> -2039 ; Quotation_Mark # Pi SINGLE LEFT-POINTING ANGLE QUOTATION MARK
> -203A ; Quotation_Mark # Pf SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
> -2E42 ; Quotation_Mark # Ps DOUBLE LOW-REVERSED-9 QUOTATION MARK
> -300C ; Quotation_Mark # Ps LEFT CORNER BRACKET
> -300D ; Quotation_Mark # Pe RIGHT CORNER BRACKET
> -300E ; Quotation_Mark # Ps LEFT WHITE CORNER BRACKET
> -300F ; Quotation_Mark # Pe RIGHT WHITE CORNER BRACKET
> -301D ; Quotation_Mark # Ps REVERSED DOUBLE PRIME QUOTATION MARK
> -301E..301F ; Quotation_Mark # Pe [2] DOUBLE PRIME QUOTATION MARK..LOW DOUBLE PRIME QUOTATION MARK
> -FE41 ; Quotation_Mark # Ps PRESENTATION FORM FOR VERTICAL LEFT CORNER BRACKET
> -FE42 ; Quotation_Mark # Pe PRESENTATION FORM FOR VERTICAL RIGHT CORNER BRACKET
> -FE43 ; Quotation_Mark # Ps PRESENTATION FORM FOR VERTICAL LEFT WHITE CORNER BRACKET
> -FE44 ; Quotation_Mark # Pe PRESENTATION FORM FOR VERTICAL RIGHT WHITE CORNER BRACKET
> -FF02 ; Quotation_Mark # Po FULLWIDTH QUOTATION MARK
> -FF07 ; Quotation_Mark # Po FULLWIDTH APOSTROPHE
> -FF62 ; Quotation_Mark # Ps HALFWIDTH LEFT CORNER BRACKET
> -FF63 ; Quotation_Mark # Pe HALFWIDTH RIGHT CORNER BRACKET
> -
> -# Total code points: 30
> -
> -# ================================================
> -
> -0021 ; Terminal_Punctuation # Po EXCLAMATION MARK
> -002C ; Terminal_Punctuation # Po COMMA
> -002E ; Terminal_Punctuation # Po FULL STOP
> -003A..003B ; Terminal_Punctuation # Po [2] COLON..SEMICOLON
> -003F ; Terminal_Punctuation # Po QUESTION MARK
> -037E ; Terminal_Punctuation # Po GREEK QUESTION MARK
> -0387 ; Terminal_Punctuation # Po GREEK ANO TELEIA
> -0589 ; Terminal_Punctuation # Po ARMENIAN FULL STOP
> -05C3 ; Terminal_Punctuation # Po HEBREW PUNCTUATION SOF PASUQ
> -060C ; Terminal_Punctuation # Po ARABIC COMMA
> -061B ; Terminal_Punctuation # Po ARABIC SEMICOLON
> -061D..061F ; Terminal_Punctuation # Po [3] ARABIC END OF TEXT MARK..ARABIC QUESTION MARK
> -06D4 ; Terminal_Punctuation # Po ARABIC FULL STOP
> -0700..070A ; Terminal_Punctuation # Po [11] SYRIAC END OF PARAGRAPH..SYRIAC CONTRACTION
> -070C ; Terminal_Punctuation # Po SYRIAC HARKLEAN METOBELUS
> -07F8..07F9 ; Terminal_Punctuation # Po [2] NKO COMMA..NKO EXCLAMATION MARK
> -0830..083E ; Terminal_Punctuation # Po [15] SAMARITAN PUNCTUATION NEQUDAA..SAMARITAN PUNCTUATION ANNAAU
> -085E ; Terminal_Punctuation # Po MANDAIC PUNCTUATION
> -0964..0965 ; Terminal_Punctuation # Po [2] DEVANAGARI DANDA..DEVANAGARI DOUBLE DANDA
> -0E5A..0E5B ; Terminal_Punctuation # Po [2] THAI CHARACTER ANGKHANKHU..THAI CHARACTER KHOMUT
> -0F08 ; Terminal_Punctuation # Po TIBETAN MARK SBRUL SHAD
> -0F0D..0F12 ; Terminal_Punctuation # Po [6] TIBETAN MARK SHAD..TIBETAN MARK RGYA GRAM SHAD
> -104A..104B ; Terminal_Punctuation # Po [2] MYANMAR SIGN LITTLE SECTION..MYANMAR SIGN SECTION
> -1361..1368 ; Terminal_Punctuation # Po [8] ETHIOPIC WORDSPACE..ETHIOPIC PARAGRAPH SEPARATOR
> -166E ; Terminal_Punctuation # Po CANADIAN SYLLABICS FULL STOP
> -16EB..16ED ; Terminal_Punctuation # Po [3] RUNIC SINGLE PUNCTUATION..RUNIC CROSS PUNCTUATION
> -1735..1736 ; Terminal_Punctuation # Po [2] PHILIPPINE SINGLE PUNCTUATION..PHILIPPINE DOUBLE PUNCTUATION
> -17D4..17D6 ; Terminal_Punctuation # Po [3] KHMER SIGN KHAN..KHMER SIGN CAMNUC PII KUUH
> -17DA ; Terminal_Punctuation # Po KHMER SIGN KOOMUUT
> -1802..1805 ; Terminal_Punctuation # Po [4] MONGOLIAN COMMA..MONGOLIAN FOUR DOTS
> -1808..1809 ; Terminal_Punctuation # Po [2] MONGOLIAN MANCHU COMMA..MONGOLIAN MANCHU FULL STOP
> -1944..1945 ; Terminal_Punctuation # Po [2] LIMBU EXCLAMATION MARK..LIMBU QUESTION MARK
> -1AA8..1AAB ; Terminal_Punctuation # Po [4] TAI THAM SIGN KAAN..TAI THAM SIGN SATKAANKUU
> -1B5A..1B5B ; Terminal_Punctuation # Po [2] BALINESE PANTI..BALINESE PAMADA
> -1B5D..1B5F ; Terminal_Punctuation # Po [3] BALINESE CARIK PAMUNGKAH..BALINESE CARIK PAREREN
> -1B7D..1B7E ; Terminal_Punctuation # Po [2] BALINESE PANTI LANTANG..BALINESE PAMADA LANTANG
> -1C3B..1C3F ; Terminal_Punctuation # Po [5] LEPCHA PUNCTUATION TA-ROL..LEPCHA PUNCTUATION TSHOOK
> -1C7E..1C7F ; Terminal_Punctuation # Po [2] OL CHIKI PUNCTUATION MUCAAD..OL CHIKI PUNCTUATION DOUBLE MUCAAD
> -203C..203D ; Terminal_Punctuation # Po [2] DOUBLE EXCLAMATION MARK..INTERROBANG
> -2047..2049 ; Terminal_Punctuation # Po [3] DOUBLE QUESTION MARK..EXCLAMATION QUESTION MARK
> -2E2E ; Terminal_Punctuation # Po REVERSED QUESTION MARK
> -2E3C ; Terminal_Punctuation # Po STENOGRAPHIC FULL STOP
> -2E41 ; Terminal_Punctuation # Po REVERSED COMMA
> -2E4C ; Terminal_Punctuation # Po MEDIEVAL COMMA
> -2E4E..2E4F ; Terminal_Punctuation # Po [2] PUNCTUS ELEVATUS MARK..CORNISH VERSE DIVIDER
> -2E53..2E54 ; Terminal_Punctuation # Po [2] MEDIEVAL EXCLAMATION MARK..MEDIEVAL QUESTION MARK
> -3001..3002 ; Terminal_Punctuation # Po [2] IDEOGRAPHIC COMMA..IDEOGRAPHIC FULL STOP
> -A4FE..A4FF ; Terminal_Punctuation # Po [2] LISU PUNCTUATION COMMA..LISU PUNCTUATION FULL STOP
> -A60D..A60F ; Terminal_Punctuation # Po [3] VAI COMMA..VAI QUESTION MARK
> -A6F3..A6F7 ; Terminal_Punctuation # Po [5] BAMUM FULL STOP..BAMUM QUESTION MARK
> -A876..A877 ; Terminal_Punctuation # Po [2] PHAGS-PA MARK SHAD..PHAGS-PA MARK DOUBLE SHAD
> -A8CE..A8CF ; Terminal_Punctuation # Po [2] SAURASHTRA DANDA..SAURASHTRA DOUBLE DANDA
> -A92F ; Terminal_Punctuation # Po KAYAH LI SIGN SHYA
> -A9C7..A9C9 ; Terminal_Punctuation # Po [3] JAVANESE PADA PANGKAT..JAVANESE PADA LUNGSI
> -AA5D..AA5F ; Terminal_Punctuation # Po [3] CHAM PUNCTUATION DANDA..CHAM PUNCTUATION TRIPLE DANDA
> -AADF ; Terminal_Punctuation # Po TAI VIET SYMBOL KOI KOI
> -AAF0..AAF1 ; Terminal_Punctuation # Po [2] MEETEI MAYEK CHEIKHAN..MEETEI MAYEK AHANG KHUDAM
> -ABEB ; Terminal_Punctuation # Po MEETEI MAYEK CHEIKHEI
> -FE50..FE52 ; Terminal_Punctuation # Po [3] SMALL COMMA..SMALL FULL STOP
> -FE54..FE57 ; Terminal_Punctuation # Po [4] SMALL SEMICOLON..SMALL EXCLAMATION MARK
> -FF01 ; Terminal_Punctuation # Po FULLWIDTH EXCLAMATION MARK
> -FF0C ; Terminal_Punctuation # Po FULLWIDTH COMMA
> -FF0E ; Terminal_Punctuation # Po FULLWIDTH FULL STOP
> -FF1A..FF1B ; Terminal_Punctuation # Po [2] FULLWIDTH COLON..FULLWIDTH SEMICOLON
> -FF1F ; Terminal_Punctuation # Po FULLWIDTH QUESTION MARK
> -FF61 ; Terminal_Punctuation # Po HALFWIDTH IDEOGRAPHIC FULL STOP
> -FF64 ; Terminal_Punctuation # Po HALFWIDTH IDEOGRAPHIC COMMA
> -1039F ; Terminal_Punctuation # Po UGARITIC WORD DIVIDER
> -103D0 ; Terminal_Punctuation # Po OLD PERSIAN WORD DIVIDER
> -10857 ; Terminal_Punctuation # Po IMPERIAL ARAMAIC SECTION SIGN
> -1091F ; Terminal_Punctuation # Po PHOENICIAN WORD SEPARATOR
> -10A56..10A57 ; Terminal_Punctuation # Po [2] KHAROSHTHI PUNCTUATION DANDA..KHAROSHTHI PUNCTUATION DOUBLE DANDA
> -10AF0..10AF5 ; Terminal_Punctuation # Po [6] MANICHAEAN PUNCTUATION STAR..MANICHAEAN PUNCTUATION TWO DOTS
> -10B3A..10B3F ; Terminal_Punctuation # Po [6] TINY TWO DOTS OVER ONE DOT PUNCTUATION..LARGE ONE RING OVER TWO RINGS PUNCTUATION
> -10B99..10B9C ; Terminal_Punctuation # Po [4] PSALTER PAHLAVI SECTION MARK..PSALTER PAHLAVI FOUR DOTS WITH DOT
> -10F55..10F59 ; Terminal_Punctuation # Po [5] SOGDIAN PUNCTUATION TWO VERTICAL BARS..SOGDIAN PUNCTUATION HALF CIRCLE WITH DOT
> -10F86..10F89 ; Terminal_Punctuation # Po [4] OLD UYGHUR PUNCTUATION BAR..OLD UYGHUR PUNCTUATION FOUR DOTS
> -11047..1104D ; Terminal_Punctuation # Po [7] BRAHMI DANDA..BRAHMI PUNCTUATION LOTUS
> -110BE..110C1 ; Terminal_Punctuation # Po [4] KAITHI SECTION MARK..KAITHI DOUBLE DANDA
> -11141..11143 ; Terminal_Punctuation # Po [3] CHAKMA DANDA..CHAKMA QUESTION MARK
> -111C5..111C6 ; Terminal_Punctuation # Po [2] SHARADA DANDA..SHARADA DOUBLE DANDA
> -111CD ; Terminal_Punctuation # Po SHARADA SUTRA MARK
> -111DE..111DF ; Terminal_Punctuation # Po [2] SHARADA SECTION MARK-1..SHARADA SECTION MARK-2
> -11238..1123C ; Terminal_Punctuation # Po [5] KHOJKI DANDA..KHOJKI DOUBLE SECTION MARK
> -112A9 ; Terminal_Punctuation # Po MULTANI SECTION MARK
> -1144B..1144D ; Terminal_Punctuation # Po [3] NEWA DANDA..NEWA COMMA
> -1145A..1145B ; Terminal_Punctuation # Po [2] NEWA DOUBLE COMMA..NEWA PLACEHOLDER MARK
> -115C2..115C5 ; Terminal_Punctuation # Po [4] SIDDHAM DANDA..SIDDHAM SEPARATOR BAR
> -115C9..115D7 ; Terminal_Punctuation # Po [15] SIDDHAM END OF TEXT MARK..SIDDHAM SECTION MARK WITH CIRCLES AND FOUR ENCLOSURES
> -11641..11642 ; Terminal_Punctuation # Po [2] MODI DANDA..MODI DOUBLE DANDA
> -1173C..1173E ; Terminal_Punctuation # Po [3] AHOM SIGN SMALL SECTION..AHOM SIGN RULAI
> -11944 ; Terminal_Punctuation # Po DIVES AKURU DOUBLE DANDA
> -11946 ; Terminal_Punctuation # Po DIVES AKURU END OF TEXT MARK
> -11A42..11A43 ; Terminal_Punctuation # Po [2] ZANABAZAR SQUARE MARK SHAD..ZANABAZAR SQUARE MARK DOUBLE SHAD
> -11A9B..11A9C ; Terminal_Punctuation # Po [2] SOYOMBO MARK SHAD..SOYOMBO MARK DOUBLE SHAD
> -11AA1..11AA2 ; Terminal_Punctuation # Po [2] SOYOMBO TERMINAL MARK-1..SOYOMBO TERMINAL MARK-2
> -11C41..11C43 ; Terminal_Punctuation # Po [3] BHAIKSUKI DANDA..BHAIKSUKI WORD SEPARATOR
> -11C71 ; Terminal_Punctuation # Po MARCHEN MARK SHAD
> -11EF7..11EF8 ; Terminal_Punctuation # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
> -11F43..11F44 ; Terminal_Punctuation # Po [2] KAWI DANDA..KAWI DOUBLE DANDA
> -12470..12474 ; Terminal_Punctuation # Po [5] CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD DIVIDER..CUNEIFORM PUNCTUATION SIGN DIAGONAL QUADCOLON
> -16A6E..16A6F ; Terminal_Punctuation # Po [2] MRO DANDA..MRO DOUBLE DANDA
> -16AF5 ; Terminal_Punctuation # Po BASSA VAH FULL STOP
> -16B37..16B39 ; Terminal_Punctuation # Po [3] PAHAWH HMONG SIGN VOS THOM..PAHAWH HMONG SIGN CIM CHEEM
> -16B44 ; Terminal_Punctuation # Po PAHAWH HMONG SIGN XAUS
> -16E97..16E98 ; Terminal_Punctuation # Po [2] MEDEFAIDRIN COMMA..MEDEFAIDRIN FULL STOP
> -1BC9F ; Terminal_Punctuation # Po DUPLOYAN PUNCTUATION CHINOOK FULL STOP
> -1DA87..1DA8A ; Terminal_Punctuation # Po [4] SIGNWRITING COMMA..SIGNWRITING COLON
> -
> -# Total code points: 278
> -
> -# ================================================
> -
> -005E ; Other_Math # Sk CIRCUMFLEX ACCENT
> -03D0..03D2 ; Other_Math # L& [3] GREEK BETA SYMBOL..GREEK UPSILON WITH HOOK SYMBOL
> -03D5 ; Other_Math # L& GREEK PHI SYMBOL
> -03F0..03F1 ; Other_Math # L& [2] GREEK KAPPA SYMBOL..GREEK RHO SYMBOL
> -03F4..03F5 ; Other_Math # L& [2] GREEK CAPITAL THETA SYMBOL..GREEK LUNATE EPSILON SYMBOL
> -2016 ; Other_Math # Po DOUBLE VERTICAL LINE
> -2032..2034 ; Other_Math # Po [3] PRIME..TRIPLE PRIME
> -2040 ; Other_Math # Pc CHARACTER TIE
> -2061..2064 ; Other_Math # Cf [4] FUNCTION APPLICATION..INVISIBLE PLUS
> -207D ; Other_Math # Ps SUPERSCRIPT LEFT PARENTHESIS
> -207E ; Other_Math # Pe SUPERSCRIPT RIGHT PARENTHESIS
> -208D ; Other_Math # Ps SUBSCRIPT LEFT PARENTHESIS
> -208E ; Other_Math # Pe SUBSCRIPT RIGHT PARENTHESIS
> -20D0..20DC ; Other_Math # Mn [13] COMBINING LEFT HARPOON ABOVE..COMBINING FOUR DOTS ABOVE
> -20E1 ; Other_Math # Mn COMBINING LEFT RIGHT ARROW ABOVE
> -20E5..20E6 ; Other_Math # Mn [2] COMBINING REVERSE SOLIDUS OVERLAY..COMBINING DOUBLE VERTICAL STROKE OVERLAY
> -20EB..20EF ; Other_Math # Mn [5] COMBINING LONG DOUBLE SOLIDUS OVERLAY..COMBINING RIGHT ARROW BELOW
> -2102 ; Other_Math # L& DOUBLE-STRUCK CAPITAL C
> -2107 ; Other_Math # L& EULER CONSTANT
> -210A..2113 ; Other_Math # L& [10] SCRIPT SMALL G..SCRIPT SMALL L
> -2115 ; Other_Math # L& DOUBLE-STRUCK CAPITAL N
> -2119..211D ; Other_Math # L& [5] DOUBLE-STRUCK CAPITAL P..DOUBLE-STRUCK CAPITAL R
> -2124 ; Other_Math # L& DOUBLE-STRUCK CAPITAL Z
> -2128 ; Other_Math # L& BLACK-LETTER CAPITAL Z
> -2129 ; Other_Math # So TURNED GREEK SMALL LETTER IOTA
> -212C..212D ; Other_Math # L& [2] SCRIPT CAPITAL B..BLACK-LETTER CAPITAL C
> -212F..2131 ; Other_Math # L& [3] SCRIPT SMALL E..SCRIPT CAPITAL F
> -2133..2134 ; Other_Math # L& [2] SCRIPT CAPITAL M..SCRIPT SMALL O
> -2135..2138 ; Other_Math # Lo [4] ALEF SYMBOL..DALET SYMBOL
> -213C..213F ; Other_Math # L& [4] DOUBLE-STRUCK SMALL PI..DOUBLE-STRUCK CAPITAL PI
> -2145..2149 ; Other_Math # L& [5] DOUBLE-STRUCK ITALIC CAPITAL D..DOUBLE-STRUCK ITALIC SMALL J
> -2195..2199 ; Other_Math # So [5] UP DOWN ARROW..SOUTH WEST ARROW
> -219C..219F ; Other_Math # So [4] LEFTWARDS WAVE ARROW..UPWARDS TWO HEADED ARROW
> -21A1..21A2 ; Other_Math # So [2] DOWNWARDS TWO HEADED ARROW..LEFTWARDS ARROW WITH TAIL
> -21A4..21A5 ; Other_Math # So [2] LEFTWARDS ARROW FROM BAR..UPWARDS ARROW FROM BAR
> -21A7 ; Other_Math # So DOWNWARDS ARROW FROM BAR
> -21A9..21AD ; Other_Math # So [5] LEFTWARDS ARROW WITH HOOK..LEFT RIGHT WAVE ARROW
> -21B0..21B1 ; Other_Math # So [2] UPWARDS ARROW WITH TIP LEFTWARDS..UPWARDS ARROW WITH TIP RIGHTWARDS
> -21B6..21B7 ; Other_Math # So [2] ANTICLOCKWISE TOP SEMICIRCLE ARROW..CLOCKWISE TOP SEMICIRCLE ARROW
> -21BC..21CD ; Other_Math # So [18] LEFTWARDS HARPOON WITH BARB UPWARDS..LEFTWARDS DOUBLE ARROW WITH STROKE
> -21D0..21D1 ; Other_Math # So [2] LEFTWARDS DOUBLE ARROW..UPWARDS DOUBLE ARROW
> -21D3 ; Other_Math # So DOWNWARDS DOUBLE ARROW
> -21D5..21DB ; Other_Math # So [7] UP DOWN DOUBLE ARROW..RIGHTWARDS TRIPLE ARROW
> -21DD ; Other_Math # So RIGHTWARDS SQUIGGLE ARROW
> -21E4..21E5 ; Other_Math # So [2] LEFTWARDS ARROW TO BAR..RIGHTWARDS ARROW TO BAR
> -2308 ; Other_Math # Ps LEFT CEILING
> -2309 ; Other_Math # Pe RIGHT CEILING
> -230A ; Other_Math # Ps LEFT FLOOR
> -230B ; Other_Math # Pe RIGHT FLOOR
> -23B4..23B5 ; Other_Math # So [2] TOP SQUARE BRACKET..BOTTOM SQUARE BRACKET
> -23B7 ; Other_Math # So RADICAL SYMBOL BOTTOM
> -23D0 ; Other_Math # So VERTICAL LINE EXTENSION
> -23E2 ; Other_Math # So WHITE TRAPEZIUM
> -25A0..25A1 ; Other_Math # So [2] BLACK SQUARE..WHITE SQUARE
> -25AE..25B6 ; Other_Math # So [9] BLACK VERTICAL RECTANGLE..BLACK RIGHT-POINTING TRIANGLE
> -25BC..25C0 ; Other_Math # So [5] BLACK DOWN-POINTING TRIANGLE..BLACK LEFT-POINTING TRIANGLE
> -25C6..25C7 ; Other_Math # So [2] BLACK DIAMOND..WHITE DIAMOND
> -25CA..25CB ; Other_Math # So [2] LOZENGE..WHITE CIRCLE
> -25CF..25D3 ; Other_Math # So [5] BLACK CIRCLE..CIRCLE WITH UPPER HALF BLACK
> -25E2 ; Other_Math # So BLACK LOWER RIGHT TRIANGLE
> -25E4 ; Other_Math # So BLACK UPPER LEFT TRIANGLE
> -25E7..25EC ; Other_Math # So [6] SQUARE WITH LEFT HALF BLACK..WHITE UP-POINTING TRIANGLE WITH DOT
> -2605..2606 ; Other_Math # So [2] BLACK STAR..WHITE STAR
> -2640 ; Other_Math # So FEMALE SIGN
> -2642 ; Other_Math # So MALE SIGN
> -2660..2663 ; Other_Math # So [4] BLACK SPADE SUIT..BLACK CLUB SUIT
> -266D..266E ; Other_Math # So [2] MUSIC FLAT SIGN..MUSIC NATURAL SIGN
> -27C5 ; Other_Math # Ps LEFT S-SHAPED BAG DELIMITER
> -27C6 ; Other_Math # Pe RIGHT S-SHAPED BAG DELIMITER
> -27E6 ; Other_Math # Ps MATHEMATICAL LEFT WHITE SQUARE BRACKET
> -27E7 ; Other_Math # Pe MATHEMATICAL RIGHT WHITE SQUARE BRACKET
> -27E8 ; Other_Math # Ps MATHEMATICAL LEFT ANGLE BRACKET
> -27E9 ; Other_Math # Pe MATHEMATICAL RIGHT ANGLE BRACKET
> -27EA ; Other_Math # Ps MATHEMATICAL LEFT DOUBLE ANGLE BRACKET
> -27EB ; Other_Math # Pe MATHEMATICAL RIGHT DOUBLE ANGLE BRACKET
> -27EC ; Other_Math # Ps MATHEMATICAL LEFT WHITE TORTOISE SHELL BRACKET
> -27ED ; Other_Math # Pe MATHEMATICAL RIGHT WHITE TORTOISE SHELL BRACKET
> -27EE ; Other_Math # Ps MATHEMATICAL LEFT FLATTENED PARENTHESIS
> -27EF ; Other_Math # Pe MATHEMATICAL RIGHT FLATTENED PARENTHESIS
> -2983 ; Other_Math # Ps LEFT WHITE CURLY BRACKET
> -2984 ; Other_Math # Pe RIGHT WHITE CURLY BRACKET
> -2985 ; Other_Math # Ps LEFT WHITE PARENTHESIS
> -2986 ; Other_Math # Pe RIGHT WHITE PARENTHESIS
> -2987 ; Other_Math # Ps Z NOTATION LEFT IMAGE BRACKET
> -2988 ; Other_Math # Pe Z NOTATION RIGHT IMAGE BRACKET
> -2989 ; Other_Math # Ps Z NOTATION LEFT BINDING BRACKET
> -298A ; Other_Math # Pe Z NOTATION RIGHT BINDING BRACKET
> -298B ; Other_Math # Ps LEFT SQUARE BRACKET WITH UNDERBAR
> -298C ; Other_Math # Pe RIGHT SQUARE BRACKET WITH UNDERBAR
> -298D ; Other_Math # Ps LEFT SQUARE BRACKET WITH TICK IN TOP CORNER
> -298E ; Other_Math # Pe RIGHT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
> -298F ; Other_Math # Ps LEFT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
> -2990 ; Other_Math # Pe RIGHT SQUARE BRACKET WITH TICK IN TOP CORNER
> -2991 ; Other_Math # Ps LEFT ANGLE BRACKET WITH DOT
> -2992 ; Other_Math # Pe RIGHT ANGLE BRACKET WITH DOT
> -2993 ; Other_Math # Ps LEFT ARC LESS-THAN BRACKET
> -2994 ; Other_Math # Pe RIGHT ARC GREATER-THAN BRACKET
> -2995 ; Other_Math # Ps DOUBLE LEFT ARC GREATER-THAN BRACKET
> -2996 ; Other_Math # Pe DOUBLE RIGHT ARC LESS-THAN BRACKET
> -2997 ; Other_Math # Ps LEFT BLACK TORTOISE SHELL BRACKET
> -2998 ; Other_Math # Pe RIGHT BLACK TORTOISE SHELL BRACKET
> -29D8 ; Other_Math # Ps LEFT WIGGLY FENCE
> -29D9 ; Other_Math # Pe RIGHT WIGGLY FENCE
> -29DA ; Other_Math # Ps LEFT DOUBLE WIGGLY FENCE
> -29DB ; Other_Math # Pe RIGHT DOUBLE WIGGLY FENCE
> -29FC ; Other_Math # Ps LEFT-POINTING CURVED ANGLE BRACKET
> -29FD ; Other_Math # Pe RIGHT-POINTING CURVED ANGLE BRACKET
> -FE61 ; Other_Math # Po SMALL ASTERISK
> -FE63 ; Other_Math # Pd SMALL HYPHEN-MINUS
> -FE68 ; Other_Math # Po SMALL REVERSE SOLIDUS
> -FF3C ; Other_Math # Po FULLWIDTH REVERSE SOLIDUS
> -FF3E ; Other_Math # Sk FULLWIDTH CIRCUMFLEX ACCENT
> -1D400..1D454 ; Other_Math # L& [85] MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL ITALIC SMALL G
> -1D456..1D49C ; Other_Math # L& [71] MATHEMATICAL ITALIC SMALL I..MATHEMATICAL SCRIPT CAPITAL A
> -1D49E..1D49F ; Other_Math # L& [2] MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL SCRIPT CAPITAL D
> -1D4A2 ; Other_Math # L& MATHEMATICAL SCRIPT CAPITAL G
> -1D4A5..1D4A6 ; Other_Math # L& [2] MATHEMATICAL SCRIPT CAPITAL J..MATHEMATICAL SCRIPT CAPITAL K
> -1D4A9..1D4AC ; Other_Math # L& [4] MATHEMATICAL SCRIPT CAPITAL N..MATHEMATICAL SCRIPT CAPITAL Q
> -1D4AE..1D4B9 ; Other_Math # L& [12] MATHEMATICAL SCRIPT CAPITAL S..MATHEMATICAL SCRIPT SMALL D
> -1D4BB ; Other_Math # L& MATHEMATICAL SCRIPT SMALL F
> -1D4BD..1D4C3 ; Other_Math # L& [7] MATHEMATICAL SCRIPT SMALL H..MATHEMATICAL SCRIPT SMALL N
> -1D4C5..1D505 ; Other_Math # L& [65] MATHEMATICAL SCRIPT SMALL P..MATHEMATICAL FRAKTUR CAPITAL B
> -1D507..1D50A ; Other_Math # L& [4] MATHEMATICAL FRAKTUR CAPITAL D..MATHEMATICAL FRAKTUR CAPITAL G
> -1D50D..1D514 ; Other_Math # L& [8] MATHEMATICAL FRAKTUR CAPITAL J..MATHEMATICAL FRAKTUR CAPITAL Q
> -1D516..1D51C ; Other_Math # L& [7] MATHEMATICAL FRAKTUR CAPITAL S..MATHEMATICAL FRAKTUR CAPITAL Y
> -1D51E..1D539 ; Other_Math # L& [28] MATHEMATICAL FRAKTUR SMALL A..MATHEMATICAL DOUBLE-STRUCK CAPITAL B
> -1D53B..1D53E ; Other_Math # L& [4] MATHEMATICAL DOUBLE-STRUCK CAPITAL D..MATHEMATICAL DOUBLE-STRUCK CAPITAL G
> -1D540..1D544 ; Other_Math # L& [5] MATHEMATICAL DOUBLE-STRUCK CAPITAL I..MATHEMATICAL DOUBLE-STRUCK CAPITAL M
> -1D546 ; Other_Math # L& MATHEMATICAL DOUBLE-STRUCK CAPITAL O
> -1D54A..1D550 ; Other_Math # L& [7] MATHEMATICAL DOUBLE-STRUCK CAPITAL S..MATHEMATICAL DOUBLE-STRUCK CAPITAL Y
> -1D552..1D6A5 ; Other_Math # L& [340] MATHEMATICAL DOUBLE-STRUCK SMALL A..MATHEMATICAL ITALIC SMALL DOTLESS J
> -1D6A8..1D6C0 ; Other_Math # L& [25] MATHEMATICAL BOLD CAPITAL ALPHA..MATHEMATICAL BOLD CAPITAL OMEGA
> -1D6C2..1D6DA ; Other_Math # L& [25] MATHEMATICAL BOLD SMALL ALPHA..MATHEMATICAL BOLD SMALL OMEGA
> -1D6DC..1D6FA ; Other_Math # L& [31] MATHEMATICAL BOLD EPSILON SYMBOL..MATHEMATICAL ITALIC CAPITAL OMEGA
> -1D6FC..1D714 ; Other_Math # L& [25] MATHEMATICAL ITALIC SMALL ALPHA..MATHEMATICAL ITALIC SMALL OMEGA
> -1D716..1D734 ; Other_Math # L& [31] MATHEMATICAL ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD ITALIC CAPITAL OMEGA
> -1D736..1D74E ; Other_Math # L& [25] MATHEMATICAL BOLD ITALIC SMALL ALPHA..MATHEMATICAL BOLD ITALIC SMALL OMEGA
> -1D750..1D76E ; Other_Math # L& [31] MATHEMATICAL BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL SANS-SERIF BOLD CAPITAL OMEGA
> -1D770..1D788 ; Other_Math # L& [25] MATHEMATICAL SANS-SERIF BOLD SMALL ALPHA..MATHEMATICAL SANS-SERIF BOLD SMALL OMEGA
> -1D78A..1D7A8 ; Other_Math # L& [31] MATHEMATICAL SANS-SERIF BOLD EPSILON SYMBOL..MATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL OMEGA
> -1D7AA..1D7C2 ; Other_Math # L& [25] MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL ALPHA..MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL OMEGA
> -1D7C4..1D7CB ; Other_Math # L& [8] MATHEMATICAL SANS-SERIF BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD SMALL DIGAMMA
> -1D7CE..1D7FF ; Other_Math # Nd [50] MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL MONOSPACE DIGIT NINE
> -1EE00..1EE03 ; Other_Math # Lo [4] ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL DAL
> -1EE05..1EE1F ; Other_Math # Lo [27] ARABIC MATHEMATICAL WAW..ARABIC MATHEMATICAL DOTLESS QAF
> -1EE21..1EE22 ; Other_Math # Lo [2] ARABIC MATHEMATICAL INITIAL BEH..ARABIC MATHEMATICAL INITIAL JEEM
> -1EE24 ; Other_Math # Lo ARABIC MATHEMATICAL INITIAL HEH
> -1EE27 ; Other_Math # Lo ARABIC MATHEMATICAL INITIAL HAH
> -1EE29..1EE32 ; Other_Math # Lo [10] ARABIC MATHEMATICAL INITIAL YEH..ARABIC MATHEMATICAL INITIAL QAF
> -1EE34..1EE37 ; Other_Math # Lo [4] ARABIC MATHEMATICAL INITIAL SHEEN..ARABIC MATHEMATICAL INITIAL KHAH
> -1EE39 ; Other_Math # Lo ARABIC MATHEMATICAL INITIAL DAD
> -1EE3B ; Other_Math # Lo ARABIC MATHEMATICAL INITIAL GHAIN
> -1EE42 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED JEEM
> -1EE47 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED HAH
> -1EE49 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED YEH
> -1EE4B ; Other_Math # Lo ARABIC MATHEMATICAL TAILED LAM
> -1EE4D..1EE4F ; Other_Math # Lo [3] ARABIC MATHEMATICAL TAILED NOON..ARABIC MATHEMATICAL TAILED AIN
> -1EE51..1EE52 ; Other_Math # Lo [2] ARABIC MATHEMATICAL TAILED SAD..ARABIC MATHEMATICAL TAILED QAF
> -1EE54 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED SHEEN
> -1EE57 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED KHAH
> -1EE59 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED DAD
> -1EE5B ; Other_Math # Lo ARABIC MATHEMATICAL TAILED GHAIN
> -1EE5D ; Other_Math # Lo ARABIC MATHEMATICAL TAILED DOTLESS NOON
> -1EE5F ; Other_Math # Lo ARABIC MATHEMATICAL TAILED DOTLESS QAF
> -1EE61..1EE62 ; Other_Math # Lo [2] ARABIC MATHEMATICAL STRETCHED BEH..ARABIC MATHEMATICAL STRETCHED JEEM
> -1EE64 ; Other_Math # Lo ARABIC MATHEMATICAL STRETCHED HEH
> -1EE67..1EE6A ; Other_Math # Lo [4] ARABIC MATHEMATICAL STRETCHED HAH..ARABIC MATHEMATICAL STRETCHED KAF
> -1EE6C..1EE72 ; Other_Math # Lo [7] ARABIC MATHEMATICAL STRETCHED MEEM..ARABIC MATHEMATICAL STRETCHED QAF
> -1EE74..1EE77 ; Other_Math # Lo [4] ARABIC MATHEMATICAL STRETCHED SHEEN..ARABIC MATHEMATICAL STRETCHED KHAH
> -1EE79..1EE7C ; Other_Math # Lo [4] ARABIC MATHEMATICAL STRETCHED DAD..ARABIC MATHEMATICAL STRETCHED DOTLESS BEH
> -1EE7E ; Other_Math # Lo ARABIC MATHEMATICAL STRETCHED DOTLESS FEH
> -1EE80..1EE89 ; Other_Math # Lo [10] ARABIC MATHEMATICAL LOOPED ALEF..ARABIC MATHEMATICAL LOOPED YEH
> -1EE8B..1EE9B ; Other_Math # Lo [17] ARABIC MATHEMATICAL LOOPED LAM..ARABIC MATHEMATICAL LOOPED GHAIN
> -1EEA1..1EEA3 ; Other_Math # Lo [3] ARABIC MATHEMATICAL DOUBLE-STRUCK BEH..ARABIC MATHEMATICAL DOUBLE-STRUCK DAL
> -1EEA5..1EEA9 ; Other_Math # Lo [5] ARABIC MATHEMATICAL DOUBLE-STRUCK WAW..ARABIC MATHEMATICAL DOUBLE-STRUCK YEH
> -1EEAB..1EEBB ; Other_Math # Lo [17] ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC MATHEMATICAL DOUBLE-STRUCK GHAIN
> -
> -# Total code points: 1362
> -
> -# ================================================
> -
> -0030..0039 ; Hex_Digit # Nd [10] DIGIT ZERO..DIGIT NINE
> -0041..0046 ; Hex_Digit # L& [6] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER F
> -0061..0066 ; Hex_Digit # L& [6] LATIN SMALL LETTER A..LATIN SMALL LETTER F
> -FF10..FF19 ; Hex_Digit # Nd [10] FULLWIDTH DIGIT ZERO..FULLWIDTH DIGIT NINE
> -FF21..FF26 ; Hex_Digit # L& [6] FULLWIDTH LATIN CAPITAL LETTER A..FULLWIDTH LATIN CAPITAL LETTER F
> -FF41..FF46 ; Hex_Digit # L& [6] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH LATIN SMALL LETTER F
> -
> -# Total code points: 44
> -
> -# ================================================
> -
> -0030..0039 ; ASCII_Hex_Digit # Nd [10] DIGIT ZERO..DIGIT NINE
> -0041..0046 ; ASCII_Hex_Digit # L& [6] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER F
> -0061..0066 ; ASCII_Hex_Digit # L& [6] LATIN SMALL LETTER A..LATIN SMALL LETTER F
> -
> -# Total code points: 22
> -
> -# ================================================
> -
> -0345 ; Other_Alphabetic # Mn COMBINING GREEK YPOGEGRAMMENI
> -05B0..05BD ; Other_Alphabetic # Mn [14] HEBREW POINT SHEVA..HEBREW POINT METEG
> -05BF ; Other_Alphabetic # Mn HEBREW POINT RAFE
> -05C1..05C2 ; Other_Alphabetic # Mn [2] HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
> -05C4..05C5 ; Other_Alphabetic # Mn [2] HEBREW MARK UPPER DOT..HEBREW MARK LOWER DOT
> -05C7 ; Other_Alphabetic # Mn HEBREW POINT QAMATS QATAN
> -0610..061A ; Other_Alphabetic # Mn [11] ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..ARABIC SMALL KASRA
> -064B..0657 ; Other_Alphabetic # Mn [13] ARABIC FATHATAN..ARABIC INVERTED DAMMA
> -0659..065F ; Other_Alphabetic # Mn [7] ARABIC ZWARAKAY..ARABIC WAVY HAMZA BELOW
> -0670 ; Other_Alphabetic # Mn ARABIC LETTER SUPERSCRIPT ALEF
> -06D6..06DC ; Other_Alphabetic # Mn [7] ARABIC SMALL HIGH LIGATURE SAD WITH LAM WITH ALEF MAKSURA..ARABIC SMALL HIGH SEEN
> -06E1..06E4 ; Other_Alphabetic # Mn [4] ARABIC SMALL HIGH DOTLESS HEAD OF KHAH..ARABIC SMALL HIGH MADDA
> -06E7..06E8 ; Other_Alphabetic # Mn [2] ARABIC SMALL HIGH YEH..ARABIC SMALL HIGH NOON
> -06ED ; Other_Alphabetic # Mn ARABIC SMALL LOW MEEM
> -0711 ; Other_Alphabetic # Mn SYRIAC LETTER SUPERSCRIPT ALAPH
> -0730..073F ; Other_Alphabetic # Mn [16] SYRIAC PTHAHA ABOVE..SYRIAC RWAHA
> -07A6..07B0 ; Other_Alphabetic # Mn [11] THAANA ABAFILI..THAANA SUKUN
> -0816..0817 ; Other_Alphabetic # Mn [2] SAMARITAN MARK IN..SAMARITAN MARK IN-ALAF
> -081B..0823 ; Other_Alphabetic # Mn [9] SAMARITAN MARK EPENTHETIC YUT..SAMARITAN VOWEL SIGN A
> -0825..0827 ; Other_Alphabetic # Mn [3] SAMARITAN VOWEL SIGN SHORT A..SAMARITAN VOWEL SIGN U
> -0829..082C ; Other_Alphabetic # Mn [4] SAMARITAN VOWEL SIGN LONG I..SAMARITAN VOWEL SIGN SUKUN
> -08D4..08DF ; Other_Alphabetic # Mn [12] ARABIC SMALL HIGH WORD AR-RUB..ARABIC SMALL HIGH WORD WAQFA
> -08E3..08E9 ; Other_Alphabetic # Mn [7] ARABIC TURNED DAMMA BELOW..ARABIC CURLY KASRATAN
> -08F0..0902 ; Other_Alphabetic # Mn [19] ARABIC OPEN FATHATAN..DEVANAGARI SIGN ANUSVARA
> -0903 ; Other_Alphabetic # Mc DEVANAGARI SIGN VISARGA
> -093A ; Other_Alphabetic # Mn DEVANAGARI VOWEL SIGN OE
> -093B ; Other_Alphabetic # Mc DEVANAGARI VOWEL SIGN OOE
> -093E..0940 ; Other_Alphabetic # Mc [3] DEVANAGARI VOWEL SIGN AA..DEVANAGARI VOWEL SIGN II
> -0941..0948 ; Other_Alphabetic # Mn [8] DEVANAGARI VOWEL SIGN U..DEVANAGARI VOWEL SIGN AI
> -0949..094C ; Other_Alphabetic # Mc [4] DEVANAGARI VOWEL SIGN CANDRA O..DEVANAGARI VOWEL SIGN AU
> -094E..094F ; Other_Alphabetic # Mc [2] DEVANAGARI VOWEL SIGN PRISHTHAMATRA E..DEVANAGARI VOWEL SIGN AW
> -0955..0957 ; Other_Alphabetic # Mn [3] DEVANAGARI VOWEL SIGN CANDRA LONG E..DEVANAGARI VOWEL SIGN UUE
> -0962..0963 ; Other_Alphabetic # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
> -0981 ; Other_Alphabetic # Mn BENGALI SIGN CANDRABINDU
> -0982..0983 ; Other_Alphabetic # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
> -09BE..09C0 ; Other_Alphabetic # Mc [3] BENGALI VOWEL SIGN AA..BENGALI VOWEL SIGN II
> -09C1..09C4 ; Other_Alphabetic # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
> -09C7..09C8 ; Other_Alphabetic # Mc [2] BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
> -09CB..09CC ; Other_Alphabetic # Mc [2] BENGALI VOWEL SIGN O..BENGALI VOWEL SIGN AU
> -09D7 ; Other_Alphabetic # Mc BENGALI AU LENGTH MARK
> -09E2..09E3 ; Other_Alphabetic # Mn [2] BENGALI VOWEL SIGN VOCALIC L..BENGALI VOWEL SIGN VOCALIC LL
> -0A01..0A02 ; Other_Alphabetic # Mn [2] GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN BINDI
> -0A03 ; Other_Alphabetic # Mc GURMUKHI SIGN VISARGA
> -0A3E..0A40 ; Other_Alphabetic # Mc [3] GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN II
> -0A41..0A42 ; Other_Alphabetic # Mn [2] GURMUKHI VOWEL SIGN U..GURMUKHI VOWEL SIGN UU
> -0A47..0A48 ; Other_Alphabetic # Mn [2] GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN AI
> -0A4B..0A4C ; Other_Alphabetic # Mn [2] GURMUKHI VOWEL SIGN OO..GURMUKHI VOWEL SIGN AU
> -0A51 ; Other_Alphabetic # Mn GURMUKHI SIGN UDAAT
> -0A70..0A71 ; Other_Alphabetic # Mn [2] GURMUKHI TIPPI..GURMUKHI ADDAK
> -0A75 ; Other_Alphabetic # Mn GURMUKHI SIGN YAKASH
> -0A81..0A82 ; Other_Alphabetic # Mn [2] GUJARATI SIGN CANDRABINDU..GUJARATI SIGN ANUSVARA
> -0A83 ; Other_Alphabetic # Mc GUJARATI SIGN VISARGA
> -0ABE..0AC0 ; Other_Alphabetic # Mc [3] GUJARATI VOWEL SIGN AA..GUJARATI VOWEL SIGN II
> -0AC1..0AC5 ; Other_Alphabetic # Mn [5] GUJARATI VOWEL SIGN U..GUJARATI VOWEL SIGN CANDRA E
> -0AC7..0AC8 ; Other_Alphabetic # Mn [2] GUJARATI VOWEL SIGN E..GUJARATI VOWEL SIGN AI
> -0AC9 ; Other_Alphabetic # Mc GUJARATI VOWEL SIGN CANDRA O
> -0ACB..0ACC ; Other_Alphabetic # Mc [2] GUJARATI VOWEL SIGN O..GUJARATI VOWEL SIGN AU
> -0AE2..0AE3 ; Other_Alphabetic # Mn [2] GUJARATI VOWEL SIGN VOCALIC L..GUJARATI VOWEL SIGN VOCALIC LL
> -0AFA..0AFC ; Other_Alphabetic # Mn [3] GUJARATI SIGN SUKUN..GUJARATI SIGN MADDAH
> -0B01 ; Other_Alphabetic # Mn ORIYA SIGN CANDRABINDU
> -0B02..0B03 ; Other_Alphabetic # Mc [2] ORIYA SIGN ANUSVARA..ORIYA SIGN VISARGA
> -0B3E ; Other_Alphabetic # Mc ORIYA VOWEL SIGN AA
> -0B3F ; Other_Alphabetic # Mn ORIYA VOWEL SIGN I
> -0B40 ; Other_Alphabetic # Mc ORIYA VOWEL SIGN II
> -0B41..0B44 ; Other_Alphabetic # Mn [4] ORIYA VOWEL SIGN U..ORIYA VOWEL SIGN VOCALIC RR
> -0B47..0B48 ; Other_Alphabetic # Mc [2] ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
> -0B4B..0B4C ; Other_Alphabetic # Mc [2] ORIYA VOWEL SIGN O..ORIYA VOWEL SIGN AU
> -0B56 ; Other_Alphabetic # Mn ORIYA AI LENGTH MARK
> -0B57 ; Other_Alphabetic # Mc ORIYA AU LENGTH MARK
> -0B62..0B63 ; Other_Alphabetic # Mn [2] ORIYA VOWEL SIGN VOCALIC L..ORIYA VOWEL SIGN VOCALIC LL
> -0B82 ; Other_Alphabetic # Mn TAMIL SIGN ANUSVARA
> -0BBE..0BBF ; Other_Alphabetic # Mc [2] TAMIL VOWEL SIGN AA..TAMIL VOWEL SIGN I
> -0BC0 ; Other_Alphabetic # Mn TAMIL VOWEL SIGN II
> -0BC1..0BC2 ; Other_Alphabetic # Mc [2] TAMIL VOWEL SIGN U..TAMIL VOWEL SIGN UU
> -0BC6..0BC8 ; Other_Alphabetic # Mc [3] TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
> -0BCA..0BCC ; Other_Alphabetic # Mc [3] TAMIL VOWEL SIGN O..TAMIL VOWEL SIGN AU
> -0BD7 ; Other_Alphabetic # Mc TAMIL AU LENGTH MARK
> -0C00 ; Other_Alphabetic # Mn TELUGU SIGN COMBINING CANDRABINDU ABOVE
> -0C01..0C03 ; Other_Alphabetic # Mc [3] TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
> -0C04 ; Other_Alphabetic # Mn TELUGU SIGN COMBINING ANUSVARA ABOVE
> -0C3E..0C40 ; Other_Alphabetic # Mn [3] TELUGU VOWEL SIGN AA..TELUGU VOWEL SIGN II
> -0C41..0C44 ; Other_Alphabetic # Mc [4] TELUGU VOWEL SIGN U..TELUGU VOWEL SIGN VOCALIC RR
> -0C46..0C48 ; Other_Alphabetic # Mn [3] TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI
> -0C4A..0C4C ; Other_Alphabetic # Mn [3] TELUGU VOWEL SIGN O..TELUGU VOWEL SIGN AU
> -0C55..0C56 ; Other_Alphabetic # Mn [2] TELUGU LENGTH MARK..TELUGU AI LENGTH MARK
> -0C62..0C63 ; Other_Alphabetic # Mn [2] TELUGU VOWEL SIGN VOCALIC L..TELUGU VOWEL SIGN VOCALIC LL
> -0C81 ; Other_Alphabetic # Mn KANNADA SIGN CANDRABINDU
> -0C82..0C83 ; Other_Alphabetic # Mc [2] KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
> -0CBE ; Other_Alphabetic # Mc KANNADA VOWEL SIGN AA
> -0CBF ; Other_Alphabetic # Mn KANNADA VOWEL SIGN I
> -0CC0..0CC4 ; Other_Alphabetic # Mc [5] KANNADA VOWEL SIGN II..KANNADA VOWEL SIGN VOCALIC RR
> -0CC6 ; Other_Alphabetic # Mn KANNADA VOWEL SIGN E
> -0CC7..0CC8 ; Other_Alphabetic # Mc [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI
> -0CCA..0CCB ; Other_Alphabetic # Mc [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO
> -0CCC ; Other_Alphabetic # Mn KANNADA VOWEL SIGN AU
> -0CD5..0CD6 ; Other_Alphabetic # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
> -0CE2..0CE3 ; Other_Alphabetic # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
> -0CF3 ; Other_Alphabetic # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
> -0D00..0D01 ; Other_Alphabetic # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
> -0D02..0D03 ; Other_Alphabetic # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
> -0D3E..0D40 ; Other_Alphabetic # Mc [3] MALAYALAM VOWEL SIGN AA..MALAYALAM VOWEL SIGN II
> -0D41..0D44 ; Other_Alphabetic # Mn [4] MALAYALAM VOWEL SIGN U..MALAYALAM VOWEL SIGN VOCALIC RR
> -0D46..0D48 ; Other_Alphabetic # Mc [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI
> -0D4A..0D4C ; Other_Alphabetic # Mc [3] MALAYALAM VOWEL SIGN O..MALAYALAM VOWEL SIGN AU
> -0D57 ; Other_Alphabetic # Mc MALAYALAM AU LENGTH MARK
> -0D62..0D63 ; Other_Alphabetic # Mn [2] MALAYALAM VOWEL SIGN VOCALIC L..MALAYALAM VOWEL SIGN VOCALIC LL
> -0D81 ; Other_Alphabetic # Mn SINHALA SIGN CANDRABINDU
> -0D82..0D83 ; Other_Alphabetic # Mc [2] SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARGAYA
> -0DCF..0DD1 ; Other_Alphabetic # Mc [3] SINHALA VOWEL SIGN AELA-PILLA..SINHALA VOWEL SIGN DIGA AEDA-PILLA
> -0DD2..0DD4 ; Other_Alphabetic # Mn [3] SINHALA VOWEL SIGN KETTI IS-PILLA..SINHALA VOWEL SIGN KETTI PAA-PILLA
> -0DD6 ; Other_Alphabetic # Mn SINHALA VOWEL SIGN DIGA PAA-PILLA
> -0DD8..0DDF ; Other_Alphabetic # Mc [8] SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOWEL SIGN GAYANUKITTA
> -0DF2..0DF3 ; Other_Alphabetic # Mc [2] SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHALA VOWEL SIGN DIGA GAYANUKITTA
> -0E31 ; Other_Alphabetic # Mn THAI CHARACTER MAI HAN-AKAT
> -0E34..0E3A ; Other_Alphabetic # Mn [7] THAI CHARACTER SARA I..THAI CHARACTER PHINTHU
> -0E4D ; Other_Alphabetic # Mn THAI CHARACTER NIKHAHIT
> -0EB1 ; Other_Alphabetic # Mn LAO VOWEL SIGN MAI KAN
> -0EB4..0EB9 ; Other_Alphabetic # Mn [6] LAO VOWEL SIGN I..LAO VOWEL SIGN UU
> -0EBB..0EBC ; Other_Alphabetic # Mn [2] LAO VOWEL SIGN MAI KON..LAO SEMIVOWEL SIGN LO
> -0ECD ; Other_Alphabetic # Mn LAO NIGGAHITA
> -0F71..0F7E ; Other_Alphabetic # Mn [14] TIBETAN VOWEL SIGN AA..TIBETAN SIGN RJES SU NGA RO
> -0F7F ; Other_Alphabetic # Mc TIBETAN SIGN RNAM BCAD
> -0F80..0F83 ; Other_Alphabetic # Mn [4] TIBETAN VOWEL SIGN REVERSED I..TIBETAN SIGN SNA LDAN
> -0F8D..0F97 ; Other_Alphabetic # Mn [11] TIBETAN SUBJOINED SIGN LCE TSA CAN..TIBETAN SUBJOINED LETTER JA
> -0F99..0FBC ; Other_Alphabetic # Mn [36] TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOINED LETTER FIXED-FORM RA
> -102B..102C ; Other_Alphabetic # Mc [2] MYANMAR VOWEL SIGN TALL AA..MYANMAR VOWEL SIGN AA
> -102D..1030 ; Other_Alphabetic # Mn [4] MYANMAR VOWEL SIGN I..MYANMAR VOWEL SIGN UU
> -1031 ; Other_Alphabetic # Mc MYANMAR VOWEL SIGN E
> -1032..1036 ; Other_Alphabetic # Mn [5] MYANMAR VOWEL SIGN AI..MYANMAR SIGN ANUSVARA
> -1038 ; Other_Alphabetic # Mc MYANMAR SIGN VISARGA
> -103B..103C ; Other_Alphabetic # Mc [2] MYANMAR CONSONANT SIGN MEDIAL YA..MYANMAR CONSONANT SIGN MEDIAL RA
> -103D..103E ; Other_Alphabetic # Mn [2] MYANMAR CONSONANT SIGN MEDIAL WA..MYANMAR CONSONANT SIGN MEDIAL HA
> -1056..1057 ; Other_Alphabetic # Mc [2] MYANMAR VOWEL SIGN VOCALIC R..MYANMAR VOWEL SIGN VOCALIC RR
> -1058..1059 ; Other_Alphabetic # Mn [2] MYANMAR VOWEL SIGN VOCALIC L..MYANMAR VOWEL SIGN VOCALIC LL
> -105E..1060 ; Other_Alphabetic # Mn [3] MYANMAR CONSONANT SIGN MON MEDIAL NA..MYANMAR CONSONANT SIGN MON MEDIAL LA
> -1062..1064 ; Other_Alphabetic # Mc [3] MYANMAR VOWEL SIGN SGAW KAREN EU..MYANMAR TONE MARK SGAW KAREN KE PHO
> -1067..106D ; Other_Alphabetic # Mc [7] MYANMAR VOWEL SIGN WESTERN PWO KAREN EU..MYANMAR SIGN WESTERN PWO KAREN TONE-5
> -1071..1074 ; Other_Alphabetic # Mn [4] MYANMAR VOWEL SIGN GEBA KAREN I..MYANMAR VOWEL SIGN KAYAH EE
> -1082 ; Other_Alphabetic # Mn MYANMAR CONSONANT SIGN SHAN MEDIAL WA
> -1083..1084 ; Other_Alphabetic # Mc [2] MYANMAR VOWEL SIGN SHAN AA..MYANMAR VOWEL SIGN SHAN E
> -1085..1086 ; Other_Alphabetic # Mn [2] MYANMAR VOWEL SIGN SHAN E ABOVE..MYANMAR VOWEL SIGN SHAN FINAL Y
> -1087..108C ; Other_Alphabetic # Mc [6] MYANMAR SIGN SHAN TONE-2..MYANMAR SIGN SHAN COUNCIL TONE-3
> -108D ; Other_Alphabetic # Mn MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE
> -108F ; Other_Alphabetic # Mc MYANMAR SIGN RUMAI PALAUNG TONE-5
> -109A..109C ; Other_Alphabetic # Mc [3] MYANMAR SIGN KHAMTI TONE-1..MYANMAR VOWEL SIGN AITON A
> -109D ; Other_Alphabetic # Mn MYANMAR VOWEL SIGN AITON AI
> -1712..1713 ; Other_Alphabetic # Mn [2] TAGALOG VOWEL SIGN I..TAGALOG VOWEL SIGN U
> -1732..1733 ; Other_Alphabetic # Mn [2] HANUNOO VOWEL SIGN I..HANUNOO VOWEL SIGN U
> -1752..1753 ; Other_Alphabetic # Mn [2] BUHID VOWEL SIGN I..BUHID VOWEL SIGN U
> -1772..1773 ; Other_Alphabetic # Mn [2] TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U
> -17B6 ; Other_Alphabetic # Mc KHMER VOWEL SIGN AA
> -17B7..17BD ; Other_Alphabetic # Mn [7] KHMER VOWEL SIGN I..KHMER VOWEL SIGN UA
> -17BE..17C5 ; Other_Alphabetic # Mc [8] KHMER VOWEL SIGN OE..KHMER VOWEL SIGN AU
> -17C6 ; Other_Alphabetic # Mn KHMER SIGN NIKAHIT
> -17C7..17C8 ; Other_Alphabetic # Mc [2] KHMER SIGN REAHMUK..KHMER SIGN YUUKALEAPINTU
> -1885..1886 ; Other_Alphabetic # Mn [2] MONGOLIAN LETTER ALI GALI BALUDA..MONGOLIAN LETTER ALI GALI THREE BALUDA
> -18A9 ; Other_Alphabetic # Mn MONGOLIAN LETTER ALI GALI DAGALGA
> -1920..1922 ; Other_Alphabetic # Mn [3] LIMBU VOWEL SIGN A..LIMBU VOWEL SIGN U
> -1923..1926 ; Other_Alphabetic # Mc [4] LIMBU VOWEL SIGN EE..LIMBU VOWEL SIGN AU
> -1927..1928 ; Other_Alphabetic # Mn [2] LIMBU VOWEL SIGN E..LIMBU VOWEL SIGN O
> -1929..192B ; Other_Alphabetic # Mc [3] LIMBU SUBJOINED LETTER YA..LIMBU SUBJOINED LETTER WA
> -1930..1931 ; Other_Alphabetic # Mc [2] LIMBU SMALL LETTER KA..LIMBU SMALL LETTER NGA
> -1932 ; Other_Alphabetic # Mn LIMBU SMALL LETTER ANUSVARA
> -1933..1938 ; Other_Alphabetic # Mc [6] LIMBU SMALL LETTER TA..LIMBU SMALL LETTER LA
> -1A17..1A18 ; Other_Alphabetic # Mn [2] BUGINESE VOWEL SIGN I..BUGINESE VOWEL SIGN U
> -1A19..1A1A ; Other_Alphabetic # Mc [2] BUGINESE VOWEL SIGN E..BUGINESE VOWEL SIGN O
> -1A1B ; Other_Alphabetic # Mn BUGINESE VOWEL SIGN AE
> -1A55 ; Other_Alphabetic # Mc TAI THAM CONSONANT SIGN MEDIAL RA
> -1A56 ; Other_Alphabetic # Mn TAI THAM CONSONANT SIGN MEDIAL LA
> -1A57 ; Other_Alphabetic # Mc TAI THAM CONSONANT SIGN LA TANG LAI
> -1A58..1A5E ; Other_Alphabetic # Mn [7] TAI THAM SIGN MAI KANG LAI..TAI THAM CONSONANT SIGN SA
> -1A61 ; Other_Alphabetic # Mc TAI THAM VOWEL SIGN A
> -1A62 ; Other_Alphabetic # Mn TAI THAM VOWEL SIGN MAI SAT
> -1A63..1A64 ; Other_Alphabetic # Mc [2] TAI THAM VOWEL SIGN AA..TAI THAM VOWEL SIGN TALL AA
> -1A65..1A6C ; Other_Alphabetic # Mn [8] TAI THAM VOWEL SIGN I..TAI THAM VOWEL SIGN OA BELOW
> -1A6D..1A72 ; Other_Alphabetic # Mc [6] TAI THAM VOWEL SIGN OY..TAI THAM VOWEL SIGN THAM AI
> -1A73..1A74 ; Other_Alphabetic # Mn [2] TAI THAM VOWEL SIGN OA ABOVE..TAI THAM SIGN MAI KANG
> -1ABF..1AC0 ; Other_Alphabetic # Mn [2] COMBINING LATIN SMALL LETTER W BELOW..COMBINING LATIN SMALL LETTER TURNED W BELOW
> -1ACC..1ACE ; Other_Alphabetic # Mn [3] COMBINING LATIN SMALL LETTER INSULAR G..COMBINING LATIN SMALL LETTER INSULAR T
> -1B00..1B03 ; Other_Alphabetic # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
> -1B04 ; Other_Alphabetic # Mc BALINESE SIGN BISAH
> -1B35 ; Other_Alphabetic # Mc BALINESE VOWEL SIGN TEDUNG
> -1B36..1B3A ; Other_Alphabetic # Mn [5] BALINESE VOWEL SIGN ULU..BALINESE VOWEL SIGN RA REPA
> -1B3B ; Other_Alphabetic # Mc BALINESE VOWEL SIGN RA REPA TEDUNG
> -1B3C ; Other_Alphabetic # Mn BALINESE VOWEL SIGN LA LENGA
> -1B3D..1B41 ; Other_Alphabetic # Mc [5] BALINESE VOWEL SIGN LA LENGA TEDUNG..BALINESE VOWEL SIGN TALING REPA TEDUNG
> -1B42 ; Other_Alphabetic # Mn BALINESE VOWEL SIGN PEPET
> -1B43 ; Other_Alphabetic # Mc BALINESE VOWEL SIGN PEPET TEDUNG
> -1B80..1B81 ; Other_Alphabetic # Mn [2] SUNDANESE SIGN PANYECEK..SUNDANESE SIGN PANGLAYAR
> -1B82 ; Other_Alphabetic # Mc SUNDANESE SIGN PANGWISAD
> -1BA1 ; Other_Alphabetic # Mc SUNDANESE CONSONANT SIGN PAMINGKAL
> -1BA2..1BA5 ; Other_Alphabetic # Mn [4] SUNDANESE CONSONANT SIGN PANYAKRA..SUNDANESE VOWEL SIGN PANYUKU
> -1BA6..1BA7 ; Other_Alphabetic # Mc [2] SUNDANESE VOWEL SIGN PANAELAENG..SUNDANESE VOWEL SIGN PANOLONG
> -1BA8..1BA9 ; Other_Alphabetic # Mn [2] SUNDANESE VOWEL SIGN PAMEPET..SUNDANESE VOWEL SIGN PANEULEUNG
> -1BAC..1BAD ; Other_Alphabetic # Mn [2] SUNDANESE CONSONANT SIGN PASANGAN MA..SUNDANESE CONSONANT SIGN PASANGAN WA
> -1BE7 ; Other_Alphabetic # Mc BATAK VOWEL SIGN E
> -1BE8..1BE9 ; Other_Alphabetic # Mn [2] BATAK VOWEL SIGN PAKPAK E..BATAK VOWEL SIGN EE
> -1BEA..1BEC ; Other_Alphabetic # Mc [3] BATAK VOWEL SIGN I..BATAK VOWEL SIGN O
> -1BED ; Other_Alphabetic # Mn BATAK VOWEL SIGN KARO O
> -1BEE ; Other_Alphabetic # Mc BATAK VOWEL SIGN U
> -1BEF..1BF1 ; Other_Alphabetic # Mn [3] BATAK VOWEL SIGN U FOR SIMALUNGUN SA..BATAK CONSONANT SIGN H
> -1C24..1C2B ; Other_Alphabetic # Mc [8] LEPCHA SUBJOINED LETTER YA..LEPCHA VOWEL SIGN UU
> -1C2C..1C33 ; Other_Alphabetic # Mn [8] LEPCHA VOWEL SIGN E..LEPCHA CONSONANT SIGN T
> -1C34..1C35 ; Other_Alphabetic # Mc [2] LEPCHA CONSONANT SIGN NYIN-DO..LEPCHA CONSONANT SIGN KANG
> -1C36 ; Other_Alphabetic # Mn LEPCHA SIGN RAN
> -1DE7..1DF4 ; Other_Alphabetic # Mn [14] COMBINING LATIN SMALL LETTER ALPHA..COMBINING LATIN SMALL LETTER U WITH DIAERESIS
> -24B6..24E9 ; Other_Alphabetic # So [52] CIRCLED LATIN CAPITAL LETTER A..CIRCLED LATIN SMALL LETTER Z
> -2DE0..2DFF ; Other_Alphabetic # Mn [32] COMBINING CYRILLIC LETTER BE..COMBINING CYRILLIC LETTER IOTIFIED BIG YUS
> -A674..A67B ; Other_Alphabetic # Mn [8] COMBINING CYRILLIC LETTER UKRAINIAN IE..COMBINING CYRILLIC LETTER OMEGA
> -A69E..A69F ; Other_Alphabetic # Mn [2] COMBINING CYRILLIC LETTER EF..COMBINING CYRILLIC LETTER IOTIFIED E
> -A802 ; Other_Alphabetic # Mn SYLOTI NAGRI SIGN DVISVARA
> -A80B ; Other_Alphabetic # Mn SYLOTI NAGRI SIGN ANUSVARA
> -A823..A824 ; Other_Alphabetic # Mc [2] SYLOTI NAGRI VOWEL SIGN A..SYLOTI NAGRI VOWEL SIGN I
> -A825..A826 ; Other_Alphabetic # Mn [2] SYLOTI NAGRI VOWEL SIGN U..SYLOTI NAGRI VOWEL SIGN E
> -A827 ; Other_Alphabetic # Mc SYLOTI NAGRI VOWEL SIGN OO
> -A880..A881 ; Other_Alphabetic # Mc [2] SAURASHTRA SIGN ANUSVARA..SAURASHTRA SIGN VISARGA
> -A8B4..A8C3 ; Other_Alphabetic # Mc [16] SAURASHTRA CONSONANT SIGN HAARU..SAURASHTRA VOWEL SIGN AU
> -A8C5 ; Other_Alphabetic # Mn SAURASHTRA SIGN CANDRABINDU
> -A8FF ; Other_Alphabetic # Mn DEVANAGARI VOWEL SIGN AY
> -A926..A92A ; Other_Alphabetic # Mn [5] KAYAH LI VOWEL UE..KAYAH LI VOWEL O
> -A947..A951 ; Other_Alphabetic # Mn [11] REJANG VOWEL SIGN I..REJANG CONSONANT SIGN R
> -A952 ; Other_Alphabetic # Mc REJANG CONSONANT SIGN H
> -A980..A982 ; Other_Alphabetic # Mn [3] JAVANESE SIGN PANYANGGA..JAVANESE SIGN LAYAR
> -A983 ; Other_Alphabetic # Mc JAVANESE SIGN WIGNYAN
> -A9B4..A9B5 ; Other_Alphabetic # Mc [2] JAVANESE VOWEL SIGN TARUNG..JAVANESE VOWEL SIGN TOLONG
> -A9B6..A9B9 ; Other_Alphabetic # Mn [4] JAVANESE VOWEL SIGN WULU..JAVANESE VOWEL SIGN SUKU MENDUT
> -A9BA..A9BB ; Other_Alphabetic # Mc [2] JAVANESE VOWEL SIGN TALING..JAVANESE VOWEL SIGN DIRGA MURE
> -A9BC..A9BD ; Other_Alphabetic # Mn [2] JAVANESE VOWEL SIGN PEPET..JAVANESE CONSONANT SIGN KERET
> -A9BE..A9BF ; Other_Alphabetic # Mc [2] JAVANESE CONSONANT SIGN PENGKAL..JAVANESE CONSONANT SIGN CAKRA
> -A9E5 ; Other_Alphabetic # Mn MYANMAR SIGN SHAN SAW
> -AA29..AA2E ; Other_Alphabetic # Mn [6] CHAM VOWEL SIGN AA..CHAM VOWEL SIGN OE
> -AA2F..AA30 ; Other_Alphabetic # Mc [2] CHAM VOWEL SIGN O..CHAM VOWEL SIGN AI
> -AA31..AA32 ; Other_Alphabetic # Mn [2] CHAM VOWEL SIGN AU..CHAM VOWEL SIGN UE
> -AA33..AA34 ; Other_Alphabetic # Mc [2] CHAM CONSONANT SIGN YA..CHAM CONSONANT SIGN RA
> -AA35..AA36 ; Other_Alphabetic # Mn [2] CHAM CONSONANT SIGN LA..CHAM CONSONANT SIGN WA
> -AA43 ; Other_Alphabetic # Mn CHAM CONSONANT SIGN FINAL NG
> -AA4C ; Other_Alphabetic # Mn CHAM CONSONANT SIGN FINAL M
> -AA4D ; Other_Alphabetic # Mc CHAM CONSONANT SIGN FINAL H
> -AA7B ; Other_Alphabetic # Mc MYANMAR SIGN PAO KAREN TONE
> -AA7C ; Other_Alphabetic # Mn MYANMAR SIGN TAI LAING TONE-2
> -AA7D ; Other_Alphabetic # Mc MYANMAR SIGN TAI LAING TONE-5
> -AAB0 ; Other_Alphabetic # Mn TAI VIET MAI KANG
> -AAB2..AAB4 ; Other_Alphabetic # Mn [3] TAI VIET VOWEL I..TAI VIET VOWEL U
> -AAB7..AAB8 ; Other_Alphabetic # Mn [2] TAI VIET MAI KHIT..TAI VIET VOWEL IA
> -AABE ; Other_Alphabetic # Mn TAI VIET VOWEL AM
> -AAEB ; Other_Alphabetic # Mc MEETEI MAYEK VOWEL SIGN II
> -AAEC..AAED ; Other_Alphabetic # Mn [2] MEETEI MAYEK VOWEL SIGN UU..MEETEI MAYEK VOWEL SIGN AAI
> -AAEE..AAEF ; Other_Alphabetic # Mc [2] MEETEI MAYEK VOWEL SIGN AU..MEETEI MAYEK VOWEL SIGN AAU
> -AAF5 ; Other_Alphabetic # Mc MEETEI MAYEK VOWEL SIGN VISARGA
> -ABE3..ABE4 ; Other_Alphabetic # Mc [2] MEETEI MAYEK VOWEL SIGN ONAP..MEETEI MAYEK VOWEL SIGN INAP
> -ABE5 ; Other_Alphabetic # Mn MEETEI MAYEK VOWEL SIGN ANAP
> -ABE6..ABE7 ; Other_Alphabetic # Mc [2] MEETEI MAYEK VOWEL SIGN YENAP..MEETEI MAYEK VOWEL SIGN SOUNAP
> -ABE8 ; Other_Alphabetic # Mn MEETEI MAYEK VOWEL SIGN UNAP
> -ABE9..ABEA ; Other_Alphabetic # Mc [2] MEETEI MAYEK VOWEL SIGN CHEINAP..MEETEI MAYEK VOWEL SIGN NUNG
> -FB1E ; Other_Alphabetic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
> -10376..1037A ; Other_Alphabetic # Mn [5] COMBINING OLD PERMIC LETTER AN..COMBINING OLD PERMIC LETTER SII
> -10A01..10A03 ; Other_Alphabetic # Mn [3] KHAROSHTHI VOWEL SIGN I..KHAROSHTHI VOWEL SIGN VOCALIC R
> -10A05..10A06 ; Other_Alphabetic # Mn [2] KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SIGN O
> -10A0C..10A0F ; Other_Alphabetic # Mn [4] KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI SIGN VISARGA
> -10D24..10D27 ; Other_Alphabetic # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
> -10EAB..10EAC ; Other_Alphabetic # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
> -11000 ; Other_Alphabetic # Mc BRAHMI SIGN CANDRABINDU
> -11001 ; Other_Alphabetic # Mn BRAHMI SIGN ANUSVARA
> -11002 ; Other_Alphabetic # Mc BRAHMI SIGN VISARGA
> -11038..11045 ; Other_Alphabetic # Mn [14] BRAHMI VOWEL SIGN AA..BRAHMI VOWEL SIGN AU
> -11073..11074 ; Other_Alphabetic # Mn [2] BRAHMI VOWEL SIGN OLD TAMIL SHORT E..BRAHMI VOWEL SIGN OLD TAMIL SHORT O
> -11080..11081 ; Other_Alphabetic # Mn [2] KAITHI SIGN CANDRABINDU..KAITHI SIGN ANUSVARA
> -11082 ; Other_Alphabetic # Mc KAITHI SIGN VISARGA
> -110B0..110B2 ; Other_Alphabetic # Mc [3] KAITHI VOWEL SIGN AA..KAITHI VOWEL SIGN II
> -110B3..110B6 ; Other_Alphabetic # Mn [4] KAITHI VOWEL SIGN U..KAITHI VOWEL SIGN AI
> -110B7..110B8 ; Other_Alphabetic # Mc [2] KAITHI VOWEL SIGN O..KAITHI VOWEL SIGN AU
> -110C2 ; Other_Alphabetic # Mn KAITHI VOWEL SIGN VOCALIC R
> -11100..11102 ; Other_Alphabetic # Mn [3] CHAKMA SIGN CANDRABINDU..CHAKMA SIGN VISARGA
> -11127..1112B ; Other_Alphabetic # Mn [5] CHAKMA VOWEL SIGN A..CHAKMA VOWEL SIGN UU
> -1112C ; Other_Alphabetic # Mc CHAKMA VOWEL SIGN E
> -1112D..11132 ; Other_Alphabetic # Mn [6] CHAKMA VOWEL SIGN AI..CHAKMA AU MARK
> -11145..11146 ; Other_Alphabetic # Mc [2] CHAKMA VOWEL SIGN AA..CHAKMA VOWEL SIGN EI
> -11180..11181 ; Other_Alphabetic # Mn [2] SHARADA SIGN CANDRABINDU..SHARADA SIGN ANUSVARA
> -11182 ; Other_Alphabetic # Mc SHARADA SIGN VISARGA
> -111B3..111B5 ; Other_Alphabetic # Mc [3] SHARADA VOWEL SIGN AA..SHARADA VOWEL SIGN II
> -111B6..111BE ; Other_Alphabetic # Mn [9] SHARADA VOWEL SIGN U..SHARADA VOWEL SIGN O
> -111BF ; Other_Alphabetic # Mc SHARADA VOWEL SIGN AU
> -111CE ; Other_Alphabetic # Mc SHARADA VOWEL SIGN PRISHTHAMATRA E
> -111CF ; Other_Alphabetic # Mn SHARADA SIGN INVERTED CANDRABINDU
> -1122C..1122E ; Other_Alphabetic # Mc [3] KHOJKI VOWEL SIGN AA..KHOJKI VOWEL SIGN II
> -1122F..11231 ; Other_Alphabetic # Mn [3] KHOJKI VOWEL SIGN U..KHOJKI VOWEL SIGN AI
> -11232..11233 ; Other_Alphabetic # Mc [2] KHOJKI VOWEL SIGN O..KHOJKI VOWEL SIGN AU
> -11234 ; Other_Alphabetic # Mn KHOJKI SIGN ANUSVARA
> -11237 ; Other_Alphabetic # Mn KHOJKI SIGN SHADDA
> -1123E ; Other_Alphabetic # Mn KHOJKI SIGN SUKUN
> -11241 ; Other_Alphabetic # Mn KHOJKI VOWEL SIGN VOCALIC R
> -112DF ; Other_Alphabetic # Mn KHUDAWADI SIGN ANUSVARA
> -112E0..112E2 ; Other_Alphabetic # Mc [3] KHUDAWADI VOWEL SIGN AA..KHUDAWADI VOWEL SIGN II
> -112E3..112E8 ; Other_Alphabetic # Mn [6] KHUDAWADI VOWEL SIGN U..KHUDAWADI VOWEL SIGN AU
> -11300..11301 ; Other_Alphabetic # Mn [2] GRANTHA SIGN COMBINING ANUSVARA ABOVE..GRANTHA SIGN CANDRABINDU
> -11302..11303 ; Other_Alphabetic # Mc [2] GRANTHA SIGN ANUSVARA..GRANTHA SIGN VISARGA
> -1133E..1133F ; Other_Alphabetic # Mc [2] GRANTHA VOWEL SIGN AA..GRANTHA VOWEL SIGN I
> -11340 ; Other_Alphabetic # Mn GRANTHA VOWEL SIGN II
> -11341..11344 ; Other_Alphabetic # Mc [4] GRANTHA VOWEL SIGN U..GRANTHA VOWEL SIGN VOCALIC RR
> -11347..11348 ; Other_Alphabetic # Mc [2] GRANTHA VOWEL SIGN EE..GRANTHA VOWEL SIGN AI
> -1134B..1134C ; Other_Alphabetic # Mc [2] GRANTHA VOWEL SIGN OO..GRANTHA VOWEL SIGN AU
> -11357 ; Other_Alphabetic # Mc GRANTHA AU LENGTH MARK
> -11362..11363 ; Other_Alphabetic # Mc [2] GRANTHA VOWEL SIGN VOCALIC L..GRANTHA VOWEL SIGN VOCALIC LL
> -11435..11437 ; Other_Alphabetic # Mc [3] NEWA VOWEL SIGN AA..NEWA VOWEL SIGN II
> -11438..1143F ; Other_Alphabetic # Mn [8] NEWA VOWEL SIGN U..NEWA VOWEL SIGN AI
> -11440..11441 ; Other_Alphabetic # Mc [2] NEWA VOWEL SIGN O..NEWA VOWEL SIGN AU
> -11443..11444 ; Other_Alphabetic # Mn [2] NEWA SIGN CANDRABINDU..NEWA SIGN ANUSVARA
> -11445 ; Other_Alphabetic # Mc NEWA SIGN VISARGA
> -114B0..114B2 ; Other_Alphabetic # Mc [3] TIRHUTA VOWEL SIGN AA..TIRHUTA VOWEL SIGN II
> -114B3..114B8 ; Other_Alphabetic # Mn [6] TIRHUTA VOWEL SIGN U..TIRHUTA VOWEL SIGN VOCALIC LL
> -114B9 ; Other_Alphabetic # Mc TIRHUTA VOWEL SIGN E
> -114BA ; Other_Alphabetic # Mn TIRHUTA VOWEL SIGN SHORT E
> -114BB..114BE ; Other_Alphabetic # Mc [4] TIRHUTA VOWEL SIGN AI..TIRHUTA VOWEL SIGN AU
> -114BF..114C0 ; Other_Alphabetic # Mn [2] TIRHUTA SIGN CANDRABINDU..TIRHUTA SIGN ANUSVARA
> -114C1 ; Other_Alphabetic # Mc TIRHUTA SIGN VISARGA
> -115AF..115B1 ; Other_Alphabetic # Mc [3] SIDDHAM VOWEL SIGN AA..SIDDHAM VOWEL SIGN II
> -115B2..115B5 ; Other_Alphabetic # Mn [4] SIDDHAM VOWEL SIGN U..SIDDHAM VOWEL SIGN VOCALIC RR
> -115B8..115BB ; Other_Alphabetic # Mc [4] SIDDHAM VOWEL SIGN E..SIDDHAM VOWEL SIGN AU
> -115BC..115BD ; Other_Alphabetic # Mn [2] SIDDHAM SIGN CANDRABINDU..SIDDHAM SIGN ANUSVARA
> -115BE ; Other_Alphabetic # Mc SIDDHAM SIGN VISARGA
> -115DC..115DD ; Other_Alphabetic # Mn [2] SIDDHAM VOWEL SIGN ALTERNATE U..SIDDHAM VOWEL SIGN ALTERNATE UU
> -11630..11632 ; Other_Alphabetic # Mc [3] MODI VOWEL SIGN AA..MODI VOWEL SIGN II
> -11633..1163A ; Other_Alphabetic # Mn [8] MODI VOWEL SIGN U..MODI VOWEL SIGN AI
> -1163B..1163C ; Other_Alphabetic # Mc [2] MODI VOWEL SIGN O..MODI VOWEL SIGN AU
> -1163D ; Other_Alphabetic # Mn MODI SIGN ANUSVARA
> -1163E ; Other_Alphabetic # Mc MODI SIGN VISARGA
> -11640 ; Other_Alphabetic # Mn MODI SIGN ARDHACANDRA
> -116AB ; Other_Alphabetic # Mn TAKRI SIGN ANUSVARA
> -116AC ; Other_Alphabetic # Mc TAKRI SIGN VISARGA
> -116AD ; Other_Alphabetic # Mn TAKRI VOWEL SIGN AA
> -116AE..116AF ; Other_Alphabetic # Mc [2] TAKRI VOWEL SIGN I..TAKRI VOWEL SIGN II
> -116B0..116B5 ; Other_Alphabetic # Mn [6] TAKRI VOWEL SIGN U..TAKRI VOWEL SIGN AU
> -1171D..1171F ; Other_Alphabetic # Mn [3] AHOM CONSONANT SIGN MEDIAL LA..AHOM CONSONANT SIGN MEDIAL LIGATING RA
> -11720..11721 ; Other_Alphabetic # Mc [2] AHOM VOWEL SIGN A..AHOM VOWEL SIGN AA
> -11722..11725 ; Other_Alphabetic # Mn [4] AHOM VOWEL SIGN I..AHOM VOWEL SIGN UU
> -11726 ; Other_Alphabetic # Mc AHOM VOWEL SIGN E
> -11727..1172A ; Other_Alphabetic # Mn [4] AHOM VOWEL SIGN AW..AHOM VOWEL SIGN AM
> -1182C..1182E ; Other_Alphabetic # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
> -1182F..11837 ; Other_Alphabetic # Mn [9] DOGRA VOWEL SIGN U..DOGRA SIGN ANUSVARA
> -11838 ; Other_Alphabetic # Mc DOGRA SIGN VISARGA
> -11930..11935 ; Other_Alphabetic # Mc [6] DIVES AKURU VOWEL SIGN AA..DIVES AKURU VOWEL SIGN E
> -11937..11938 ; Other_Alphabetic # Mc [2] DIVES AKURU VOWEL SIGN AI..DIVES AKURU VOWEL SIGN O
> -1193B..1193C ; Other_Alphabetic # Mn [2] DIVES AKURU SIGN ANUSVARA..DIVES AKURU SIGN CANDRABINDU
> -11940 ; Other_Alphabetic # Mc DIVES AKURU MEDIAL YA
> -11942 ; Other_Alphabetic # Mc DIVES AKURU MEDIAL RA
> -119D1..119D3 ; Other_Alphabetic # Mc [3] NANDINAGARI VOWEL SIGN AA..NANDINAGARI VOWEL SIGN II
> -119D4..119D7 ; Other_Alphabetic # Mn [4] NANDINAGARI VOWEL SIGN U..NANDINAGARI VOWEL SIGN VOCALIC RR
> -119DA..119DB ; Other_Alphabetic # Mn [2] NANDINAGARI VOWEL SIGN E..NANDINAGARI VOWEL SIGN AI
> -119DC..119DF ; Other_Alphabetic # Mc [4] NANDINAGARI VOWEL SIGN O..NANDINAGARI SIGN VISARGA
> -119E4 ; Other_Alphabetic # Mc NANDINAGARI VOWEL SIGN PRISHTHAMATRA E
> -11A01..11A0A ; Other_Alphabetic # Mn [10] ZANABAZAR SQUARE VOWEL SIGN I..ZANABAZAR SQUARE VOWEL LENGTH MARK
> -11A35..11A38 ; Other_Alphabetic # Mn [4] ZANABAZAR SQUARE SIGN CANDRABINDU..ZANABAZAR SQUARE SIGN ANUSVARA
> -11A39 ; Other_Alphabetic # Mc ZANABAZAR SQUARE SIGN VISARGA
> -11A3B..11A3E ; Other_Alphabetic # Mn [4] ZANABAZAR SQUARE CLUSTER-FINAL LETTER YA..ZANABAZAR SQUARE CLUSTER-FINAL LETTER VA
> -11A51..11A56 ; Other_Alphabetic # Mn [6] SOYOMBO VOWEL SIGN I..SOYOMBO VOWEL SIGN OE
> -11A57..11A58 ; Other_Alphabetic # Mc [2] SOYOMBO VOWEL SIGN AI..SOYOMBO VOWEL SIGN AU
> -11A59..11A5B ; Other_Alphabetic # Mn [3] SOYOMBO VOWEL SIGN VOCALIC R..SOYOMBO VOWEL LENGTH MARK
> -11A8A..11A96 ; Other_Alphabetic # Mn [13] SOYOMBO FINAL CONSONANT SIGN G..SOYOMBO SIGN ANUSVARA
> -11A97 ; Other_Alphabetic # Mc SOYOMBO SIGN VISARGA
> -11C2F ; Other_Alphabetic # Mc BHAIKSUKI VOWEL SIGN AA
> -11C30..11C36 ; Other_Alphabetic # Mn [7] BHAIKSUKI VOWEL SIGN I..BHAIKSUKI VOWEL SIGN VOCALIC L
> -11C38..11C3D ; Other_Alphabetic # Mn [6] BHAIKSUKI VOWEL SIGN E..BHAIKSUKI SIGN ANUSVARA
> -11C3E ; Other_Alphabetic # Mc BHAIKSUKI SIGN VISARGA
> -11C92..11CA7 ; Other_Alphabetic # Mn [22] MARCHEN SUBJOINED LETTER KA..MARCHEN SUBJOINED LETTER ZA
> -11CA9 ; Other_Alphabetic # Mc MARCHEN SUBJOINED LETTER YA
> -11CAA..11CB0 ; Other_Alphabetic # Mn [7] MARCHEN SUBJOINED LETTER RA..MARCHEN VOWEL SIGN AA
> -11CB1 ; Other_Alphabetic # Mc MARCHEN VOWEL SIGN I
> -11CB2..11CB3 ; Other_Alphabetic # Mn [2] MARCHEN VOWEL SIGN U..MARCHEN VOWEL SIGN E
> -11CB4 ; Other_Alphabetic # Mc MARCHEN VOWEL SIGN O
> -11CB5..11CB6 ; Other_Alphabetic # Mn [2] MARCHEN SIGN ANUSVARA..MARCHEN SIGN CANDRABINDU
> -11D31..11D36 ; Other_Alphabetic # Mn [6] MASARAM GONDI VOWEL SIGN AA..MASARAM GONDI VOWEL SIGN VOCALIC R
> -11D3A ; Other_Alphabetic # Mn MASARAM GONDI VOWEL SIGN E
> -11D3C..11D3D ; Other_Alphabetic # Mn [2] MASARAM GONDI VOWEL SIGN AI..MASARAM GONDI VOWEL SIGN O
> -11D3F..11D41 ; Other_Alphabetic # Mn [3] MASARAM GONDI VOWEL SIGN AU..MASARAM GONDI SIGN VISARGA
> -11D43 ; Other_Alphabetic # Mn MASARAM GONDI SIGN CANDRA
> -11D47 ; Other_Alphabetic # Mn MASARAM GONDI RA-KARA
> -11D8A..11D8E ; Other_Alphabetic # Mc [5] GUNJALA GONDI VOWEL SIGN AA..GUNJALA GONDI VOWEL SIGN UU
> -11D90..11D91 ; Other_Alphabetic # Mn [2] GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI VOWEL SIGN AI
> -11D93..11D94 ; Other_Alphabetic # Mc [2] GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI VOWEL SIGN AU
> -11D95 ; Other_Alphabetic # Mn GUNJALA GONDI SIGN ANUSVARA
> -11D96 ; Other_Alphabetic # Mc GUNJALA GONDI SIGN VISARGA
> -11EF3..11EF4 ; Other_Alphabetic # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
> -11EF5..11EF6 ; Other_Alphabetic # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
> -11F00..11F01 ; Other_Alphabetic # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
> -11F03 ; Other_Alphabetic # Mc KAWI SIGN VISARGA
> -11F34..11F35 ; Other_Alphabetic # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
> -11F36..11F3A ; Other_Alphabetic # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
> -11F3E..11F3F ; Other_Alphabetic # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
> -11F40 ; Other_Alphabetic # Mn KAWI VOWEL SIGN EU
> -16F4F ; Other_Alphabetic # Mn MIAO SIGN CONSONANT MODIFIER BAR
> -16F51..16F87 ; Other_Alphabetic # Mc [55] MIAO SIGN ASPIRATION..MIAO VOWEL SIGN UI
> -16F8F..16F92 ; Other_Alphabetic # Mn [4] MIAO TONE RIGHT..MIAO TONE BELOW
> -16FF0..16FF1 ; Other_Alphabetic # Mc [2] VIETNAMESE ALTERNATE READING MARK CA..VIETNAMESE ALTERNATE READING MARK NHAY
> -1BC9E ; Other_Alphabetic # Mn DUPLOYAN DOUBLE MARK
> -1E000..1E006 ; Other_Alphabetic # Mn [7] COMBINING GLAGOLITIC LETTER AZU..COMBINING GLAGOLITIC LETTER ZHIVETE
> -1E008..1E018 ; Other_Alphabetic # Mn [17] COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING GLAGOLITIC LETTER HERU
> -1E01B..1E021 ; Other_Alphabetic # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
> -1E023..1E024 ; Other_Alphabetic # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
> -1E026..1E02A ; Other_Alphabetic # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
> -1E08F ; Other_Alphabetic # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> -1E947 ; Other_Alphabetic # Mn ADLAM HAMZA
> -1F130..1F149 ; Other_Alphabetic # So [26] SQUARED LATIN CAPITAL LETTER A..SQUARED LATIN CAPITAL LETTER Z
> -1F150..1F169 ; Other_Alphabetic # So [26] NEGATIVE CIRCLED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
> -1F170..1F189 ; Other_Alphabetic # So [26] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED LATIN CAPITAL LETTER Z
> -
> -# Total code points: 1425
> -
> -# ================================================
> -
> -3006 ; Ideographic # Lo IDEOGRAPHIC CLOSING MARK
> -3007 ; Ideographic # Nl IDEOGRAPHIC NUMBER ZERO
> -3021..3029 ; Ideographic # Nl [9] HANGZHOU NUMERAL ONE..HANGZHOU NUMERAL NINE
> -3038..303A ; Ideographic # Nl [3] HANGZHOU NUMERAL TEN..HANGZHOU NUMERAL THIRTY
> -3400..4DBF ; Ideographic # Lo [6592] CJK UNIFIED IDEOGRAPH-3400..CJK UNIFIED IDEOGRAPH-4DBF
> -4E00..9FFF ; Ideographic # Lo [20992] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FFF
> -F900..FA6D ; Ideographic # Lo [366] CJK COMPATIBILITY IDEOGRAPH-F900..CJK COMPATIBILITY IDEOGRAPH-FA6D
> -FA70..FAD9 ; Ideographic # Lo [106] CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COMPATIBILITY IDEOGRAPH-FAD9
> -16FE4 ; Ideographic # Mn KHITAN SMALL SCRIPT FILLER
> -17000..187F7 ; Ideographic # Lo [6136] TANGUT IDEOGRAPH-17000..TANGUT IDEOGRAPH-187F7
> -18800..18CD5 ; Ideographic # Lo [1238] TANGUT COMPONENT-001..KHITAN SMALL SCRIPT CHARACTER-18CD5
> -18D00..18D08 ; Ideographic # Lo [9] TANGUT IDEOGRAPH-18D00..TANGUT IDEOGRAPH-18D08
> -1B170..1B2FB ; Ideographic # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
> -20000..2A6DF ; Ideographic # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> -2A700..2B739 ; Ideographic # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> -2B740..2B81D ; Ideographic # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> -2B820..2CEA1 ; Ideographic # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> -2CEB0..2EBE0 ; Ideographic # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
> -2EBF0..2EE5D ; Ideographic # Lo [622] CJK UNIFIED IDEOGRAPH-2EBF0..CJK UNIFIED IDEOGRAPH-2EE5D
> -2F800..2FA1D ; Ideographic # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
> -30000..3134A ; Ideographic # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> -31350..323AF ; Ideographic # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
> -
> -# Total code points: 106476
> -
> -# ================================================
> -
> -005E ; Diacritic # Sk CIRCUMFLEX ACCENT
> -0060 ; Diacritic # Sk GRAVE ACCENT
> -00A8 ; Diacritic # Sk DIAERESIS
> -00AF ; Diacritic # Sk MACRON
> -00B4 ; Diacritic # Sk ACUTE ACCENT
> -00B7 ; Diacritic # Po MIDDLE DOT
> -00B8 ; Diacritic # Sk CEDILLA
> -02B0..02C1 ; Diacritic # Lm [18] MODIFIER LETTER SMALL H..MODIFIER LETTER REVERSED GLOTTAL STOP
> -02C2..02C5 ; Diacritic # Sk [4] MODIFIER LETTER LEFT ARROWHEAD..MODIFIER LETTER DOWN ARROWHEAD
> -02C6..02D1 ; Diacritic # Lm [12] MODIFIER LETTER CIRCUMFLEX ACCENT..MODIFIER LETTER HALF TRIANGULAR COLON
> -02D2..02DF ; Diacritic # Sk [14] MODIFIER LETTER CENTRED RIGHT HALF RING..MODIFIER LETTER CROSS ACCENT
> -02E0..02E4 ; Diacritic # Lm [5] MODIFIER LETTER SMALL GAMMA..MODIFIER LETTER SMALL REVERSED GLOTTAL STOP
> -02E5..02EB ; Diacritic # Sk [7] MODIFIER LETTER EXTRA-HIGH TONE BAR..MODIFIER LETTER YANG DEPARTING TONE MARK
> -02EC ; Diacritic # Lm MODIFIER LETTER VOICING
> -02ED ; Diacritic # Sk MODIFIER LETTER UNASPIRATED
> -02EE ; Diacritic # Lm MODIFIER LETTER DOUBLE APOSTROPHE
> -02EF..02FF ; Diacritic # Sk [17] MODIFIER LETTER LOW DOWN ARROWHEAD..MODIFIER LETTER LOW LEFT ARROW
> -0300..034E ; Diacritic # Mn [79] COMBINING GRAVE ACCENT..COMBINING UPWARDS ARROW BELOW
> -0350..0357 ; Diacritic # Mn [8] COMBINING RIGHT ARROWHEAD ABOVE..COMBINING RIGHT HALF RING ABOVE
> -035D..0362 ; Diacritic # Mn [6] COMBINING DOUBLE BREVE..COMBINING DOUBLE RIGHTWARDS ARROW BELOW
> -0374 ; Diacritic # Lm GREEK NUMERAL SIGN
> -0375 ; Diacritic # Sk GREEK LOWER NUMERAL SIGN
> -037A ; Diacritic # Lm GREEK YPOGEGRAMMENI
> -0384..0385 ; Diacritic # Sk [2] GREEK TONOS..GREEK DIALYTIKA TONOS
> -0483..0487 ; Diacritic # Mn [5] COMBINING CYRILLIC TITLO..COMBINING CYRILLIC POKRYTIE
> -0559 ; Diacritic # Lm ARMENIAN MODIFIER LETTER LEFT HALF RING
> -0591..05A1 ; Diacritic # Mn [17] HEBREW ACCENT ETNAHTA..HEBREW ACCENT PAZER
> -05A3..05BD ; Diacritic # Mn [27] HEBREW ACCENT MUNAH..HEBREW POINT METEG
> -05BF ; Diacritic # Mn HEBREW POINT RAFE
> -05C1..05C2 ; Diacritic # Mn [2] HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
> -05C4 ; Diacritic # Mn HEBREW MARK UPPER DOT
> -064B..0652 ; Diacritic # Mn [8] ARABIC FATHATAN..ARABIC SUKUN
> -0657..0658 ; Diacritic # Mn [2] ARABIC INVERTED DAMMA..ARABIC MARK NOON GHUNNA
> -06DF..06E0 ; Diacritic # Mn [2] ARABIC SMALL HIGH ROUNDED ZERO..ARABIC SMALL HIGH UPRIGHT RECTANGULAR ZERO
> -06E5..06E6 ; Diacritic # Lm [2] ARABIC SMALL WAW..ARABIC SMALL YEH
> -06EA..06EC ; Diacritic # Mn [3] ARABIC EMPTY CENTRE LOW STOP..ARABIC ROUNDED HIGH STOP WITH FILLED CENTRE
> -0730..074A ; Diacritic # Mn [27] SYRIAC PTHAHA ABOVE..SYRIAC BARREKH
> -07A6..07B0 ; Diacritic # Mn [11] THAANA ABAFILI..THAANA SUKUN
> -07EB..07F3 ; Diacritic # Mn [9] NKO COMBINING SHORT HIGH TONE..NKO COMBINING DOUBLE DOT ABOVE
> -07F4..07F5 ; Diacritic # Lm [2] NKO HIGH TONE APOSTROPHE..NKO LOW TONE APOSTROPHE
> -0818..0819 ; Diacritic # Mn [2] SAMARITAN MARK OCCLUSION..SAMARITAN MARK DAGESH
> -0898..089F ; Diacritic # Mn [8] ARABIC SMALL HIGH WORD AL-JUZ..ARABIC HALF MADDA OVER MADDA
> -08C9 ; Diacritic # Lm ARABIC SMALL FARSI YEH
> -08CA..08D2 ; Diacritic # Mn [9] ARABIC SMALL HIGH FARSI YEH..ARABIC LARGE ROUND DOT INSIDE CIRCLE BELOW
> -08E3..08FE ; Diacritic # Mn [28] ARABIC TURNED DAMMA BELOW..ARABIC DAMMA WITH DOT
> -093C ; Diacritic # Mn DEVANAGARI SIGN NUKTA
> -094D ; Diacritic # Mn DEVANAGARI SIGN VIRAMA
> -0951..0954 ; Diacritic # Mn [4] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI ACUTE ACCENT
> -0971 ; Diacritic # Lm DEVANAGARI SIGN HIGH SPACING DOT
> -09BC ; Diacritic # Mn BENGALI SIGN NUKTA
> -09CD ; Diacritic # Mn BENGALI SIGN VIRAMA
> -0A3C ; Diacritic # Mn GURMUKHI SIGN NUKTA
> -0A4D ; Diacritic # Mn GURMUKHI SIGN VIRAMA
> -0ABC ; Diacritic # Mn GUJARATI SIGN NUKTA
> -0ACD ; Diacritic # Mn GUJARATI SIGN VIRAMA
> -0AFD..0AFF ; Diacritic # Mn [3] GUJARATI SIGN THREE-DOT NUKTA ABOVE..GUJARATI SIGN TWO-CIRCLE NUKTA ABOVE
> -0B3C ; Diacritic # Mn ORIYA SIGN NUKTA
> -0B4D ; Diacritic # Mn ORIYA SIGN VIRAMA
> -0B55 ; Diacritic # Mn ORIYA SIGN OVERLINE
> -0BCD ; Diacritic # Mn TAMIL SIGN VIRAMA
> -0C3C ; Diacritic # Mn TELUGU SIGN NUKTA
> -0C4D ; Diacritic # Mn TELUGU SIGN VIRAMA
> -0CBC ; Diacritic # Mn KANNADA SIGN NUKTA
> -0CCD ; Diacritic # Mn KANNADA SIGN VIRAMA
> -0D3B..0D3C ; Diacritic # Mn [2] MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALAM SIGN CIRCULAR VIRAMA
> -0D4D ; Diacritic # Mn MALAYALAM SIGN VIRAMA
> -0DCA ; Diacritic # Mn SINHALA SIGN AL-LAKUNA
> -0E47..0E4C ; Diacritic # Mn [6] THAI CHARACTER MAITAIKHU..THAI CHARACTER THANTHAKHAT
> -0E4E ; Diacritic # Mn THAI CHARACTER YAMAKKAN
> -0EBA ; Diacritic # Mn LAO SIGN PALI VIRAMA
> -0EC8..0ECC ; Diacritic # Mn [5] LAO TONE MAI EK..LAO CANCELLATION MARK
> -0F18..0F19 ; Diacritic # Mn [2] TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN ASTROLOGICAL SIGN SDONG TSHUGS
> -0F35 ; Diacritic # Mn TIBETAN MARK NGAS BZUNG NYI ZLA
> -0F37 ; Diacritic # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS
> -0F39 ; Diacritic # Mn TIBETAN MARK TSA -PHRU
> -0F3E..0F3F ; Diacritic # Mc [2] TIBETAN SIGN YAR TSHES..TIBETAN SIGN MAR TSHES
> -0F82..0F84 ; Diacritic # Mn [3] TIBETAN SIGN NYI ZLA NAA DA..TIBETAN MARK HALANTA
> -0F86..0F87 ; Diacritic # Mn [2] TIBETAN SIGN LCI RTAGS..TIBETAN SIGN YANG RTAGS
> -0FC6 ; Diacritic # Mn TIBETAN SYMBOL PADMA GDAN
> -1037 ; Diacritic # Mn MYANMAR SIGN DOT BELOW
> -1039..103A ; Diacritic # Mn [2] MYANMAR SIGN VIRAMA..MYANMAR SIGN ASAT
> -1063..1064 ; Diacritic # Mc [2] MYANMAR TONE MARK SGAW KAREN HATHI..MYANMAR TONE MARK SGAW KAREN KE PHO
> -1069..106D ; Diacritic # Mc [5] MYANMAR SIGN WESTERN PWO KAREN TONE-1..MYANMAR SIGN WESTERN PWO KAREN TONE-5
> -1087..108C ; Diacritic # Mc [6] MYANMAR SIGN SHAN TONE-2..MYANMAR SIGN SHAN COUNCIL TONE-3
> -108D ; Diacritic # Mn MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE
> -108F ; Diacritic # Mc MYANMAR SIGN RUMAI PALAUNG TONE-5
> -109A..109B ; Diacritic # Mc [2] MYANMAR SIGN KHAMTI TONE-1..MYANMAR SIGN KHAMTI TONE-3
> -135D..135F ; Diacritic # Mn [3] ETHIOPIC COMBINING GEMINATION AND VOWEL LENGTH MARK..ETHIOPIC COMBINING GEMINATION MARK
> -1714 ; Diacritic # Mn TAGALOG SIGN VIRAMA
> -1715 ; Diacritic # Mc TAGALOG SIGN PAMUDPOD
> -17C9..17D3 ; Diacritic # Mn [11] KHMER SIGN MUUSIKATOAN..KHMER SIGN BATHAMASAT
> -17DD ; Diacritic # Mn KHMER SIGN ATTHACAN
> -1939..193B ; Diacritic # Mn [3] LIMBU SIGN MUKPHRENG..LIMBU SIGN SA-I
> -1A75..1A7C ; Diacritic # Mn [8] TAI THAM SIGN TONE-1..TAI THAM SIGN KHUEN-LUE KARAN
> -1A7F ; Diacritic # Mn TAI THAM COMBINING CRYPTOGRAMMIC DOT
> -1AB0..1ABD ; Diacritic # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
> -1ABE ; Diacritic # Me COMBINING PARENTHESES OVERLAY
> -1AC1..1ACB ; Diacritic # Mn [11] COMBINING LEFT PARENTHESIS ABOVE LEFT..COMBINING TRIPLE ACUTE ACCENT
> -1B34 ; Diacritic # Mn BALINESE SIGN REREKAN
> -1B44 ; Diacritic # Mc BALINESE ADEG ADEG
> -1B6B..1B73 ; Diacritic # Mn [9] BALINESE MUSICAL SYMBOL COMBINING TEGEH..BALINESE MUSICAL SYMBOL COMBINING GONG
> -1BAA ; Diacritic # Mc SUNDANESE SIGN PAMAAEH
> -1BAB ; Diacritic # Mn SUNDANESE SIGN VIRAMA
> -1C36..1C37 ; Diacritic # Mn [2] LEPCHA SIGN RAN..LEPCHA SIGN NUKTA
> -1C78..1C7D ; Diacritic # Lm [6] OL CHIKI MU TTUDDAG..OL CHIKI AHAD
> -1CD0..1CD2 ; Diacritic # Mn [3] VEDIC TONE KARSHANA..VEDIC TONE PRENKHA
> -1CD3 ; Diacritic # Po VEDIC SIGN NIHSHVASA
> -1CD4..1CE0 ; Diacritic # Mn [13] VEDIC SIGN YAJURVEDIC MIDLINE SVARITA..VEDIC TONE RIGVEDIC KASHMIRI INDEPENDENT SVARITA
> -1CE1 ; Diacritic # Mc VEDIC TONE ATHARVAVEDIC INDEPENDENT SVARITA
> -1CE2..1CE8 ; Diacritic # Mn [7] VEDIC SIGN VISARGA SVARITA..VEDIC SIGN VISARGA ANUDATTA WITH TAIL
> -1CED ; Diacritic # Mn VEDIC SIGN TIRYAK
> -1CF4 ; Diacritic # Mn VEDIC TONE CANDRA ABOVE
> -1CF7 ; Diacritic # Mc VEDIC SIGN ATIKRAMA
> -1CF8..1CF9 ; Diacritic # Mn [2] VEDIC TONE RING ABOVE..VEDIC TONE DOUBLE RING ABOVE
> -1D2C..1D6A ; Diacritic # Lm [63] MODIFIER LETTER CAPITAL A..GREEK SUBSCRIPT SMALL LETTER CHI
> -1DC4..1DCF ; Diacritic # Mn [12] COMBINING MACRON-ACUTE..COMBINING ZIGZAG BELOW
> -1DF5..1DFF ; Diacritic # Mn [11] COMBINING UP TACK ABOVE..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
> -1FBD ; Diacritic # Sk GREEK KORONIS
> -1FBF..1FC1 ; Diacritic # Sk [3] GREEK PSILI..GREEK DIALYTIKA AND PERISPOMENI
> -1FCD..1FCF ; Diacritic # Sk [3] GREEK PSILI AND VARIA..GREEK PSILI AND PERISPOMENI
> -1FDD..1FDF ; Diacritic # Sk [3] GREEK DASIA AND VARIA..GREEK DASIA AND PERISPOMENI
> -1FED..1FEF ; Diacritic # Sk [3] GREEK DIALYTIKA AND VARIA..GREEK VARIA
> -1FFD..1FFE ; Diacritic # Sk [2] GREEK OXIA..GREEK DASIA
> -2CEF..2CF1 ; Diacritic # Mn [3] COPTIC COMBINING NI ABOVE..COPTIC COMBINING SPIRITUS LENIS
> -2E2F ; Diacritic # Lm VERTICAL TILDE
> -302A..302D ; Diacritic # Mn [4] IDEOGRAPHIC LEVEL TONE MARK..IDEOGRAPHIC ENTERING TONE MARK
> -302E..302F ; Diacritic # Mc [2] HANGUL SINGLE DOT TONE MARK..HANGUL DOUBLE DOT TONE MARK
> -3099..309A ; Diacritic # Mn [2] COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK..COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
> -309B..309C ; Diacritic # Sk [2] KATAKANA-HIRAGANA VOICED SOUND MARK..KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
> -30FC ; Diacritic # Lm KATAKANA-HIRAGANA PROLONGED SOUND MARK
> -A66F ; Diacritic # Mn COMBINING CYRILLIC VZMET
> -A67C..A67D ; Diacritic # Mn [2] COMBINING CYRILLIC KAVYKA..COMBINING CYRILLIC PAYEROK
> -A67F ; Diacritic # Lm CYRILLIC PAYEROK
> -A69C..A69D ; Diacritic # Lm [2] MODIFIER LETTER CYRILLIC HARD SIGN..MODIFIER LETTER CYRILLIC SOFT SIGN
> -A6F0..A6F1 ; Diacritic # Mn [2] BAMUM COMBINING MARK KOQNDON..BAMUM COMBINING MARK TUKWENTIS
> -A700..A716 ; Diacritic # Sk [23] MODIFIER LETTER CHINESE TONE YIN PING..MODIFIER LETTER EXTRA-LOW LEFT-STEM TONE BAR
> -A717..A71F ; Diacritic # Lm [9] MODIFIER LETTER DOT VERTICAL BAR..MODIFIER LETTER LOW INVERTED EXCLAMATION MARK
> -A720..A721 ; Diacritic # Sk [2] MODIFIER LETTER STRESS AND HIGH TONE..MODIFIER LETTER STRESS AND LOW TONE
> -A788 ; Diacritic # Lm MODIFIER LETTER LOW CIRCUMFLEX ACCENT
> -A789..A78A ; Diacritic # Sk [2] MODIFIER LETTER COLON..MODIFIER LETTER SHORT EQUALS SIGN
> -A7F8..A7F9 ; Diacritic # Lm [2] MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIER LETTER SMALL LIGATURE OE
> -A8C4 ; Diacritic # Mn SAURASHTRA SIGN VIRAMA
> -A8E0..A8F1 ; Diacritic # Mn [18] COMBINING DEVANAGARI DIGIT ZERO..COMBINING DEVANAGARI SIGN AVAGRAHA
> -A92B..A92D ; Diacritic # Mn [3] KAYAH LI TONE PLOPHU..KAYAH LI TONE CALYA PLOPHU
> -A92E ; Diacritic # Po KAYAH LI SIGN CWI
> -A953 ; Diacritic # Mc REJANG VIRAMA
> -A9B3 ; Diacritic # Mn JAVANESE SIGN CECAK TELU
> -A9C0 ; Diacritic # Mc JAVANESE PANGKON
> -A9E5 ; Diacritic # Mn MYANMAR SIGN SHAN SAW
> -AA7B ; Diacritic # Mc MYANMAR SIGN PAO KAREN TONE
> -AA7C ; Diacritic # Mn MYANMAR SIGN TAI LAING TONE-2
> -AA7D ; Diacritic # Mc MYANMAR SIGN TAI LAING TONE-5
> -AABF ; Diacritic # Mn TAI VIET TONE MAI EK
> -AAC0 ; Diacritic # Lo TAI VIET TONE MAI NUENG
> -AAC1 ; Diacritic # Mn TAI VIET TONE MAI THO
> -AAC2 ; Diacritic # Lo TAI VIET TONE MAI SONG
> -AAF6 ; Diacritic # Mn MEETEI MAYEK VIRAMA
> -AB5B ; Diacritic # Sk MODIFIER BREVE WITH INVERTED BREVE
> -AB5C..AB5F ; Diacritic # Lm [4] MODIFIER LETTER SMALL HENG..MODIFIER LETTER SMALL U WITH LEFT HOOK
> -AB69 ; Diacritic # Lm MODIFIER LETTER SMALL TURNED W
> -AB6A..AB6B ; Diacritic # Sk [2] MODIFIER LETTER LEFT TACK..MODIFIER LETTER RIGHT TACK
> -ABEC ; Diacritic # Mc MEETEI MAYEK LUM IYEK
> -ABED ; Diacritic # Mn MEETEI MAYEK APUN IYEK
> -FB1E ; Diacritic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
> -FE20..FE2F ; Diacritic # Mn [16] COMBINING LIGATURE LEFT HALF..COMBINING CYRILLIC TITLO RIGHT HALF
> -FF3E ; Diacritic # Sk FULLWIDTH CIRCUMFLEX ACCENT
> -FF40 ; Diacritic # Sk FULLWIDTH GRAVE ACCENT
> -FF70 ; Diacritic # Lm HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
> -FF9E..FF9F ; Diacritic # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
> -FFE3 ; Diacritic # Sk FULLWIDTH MACRON
> -102E0 ; Diacritic # Mn COPTIC EPACT THOUSANDS MARK
> -10780..10785 ; Diacritic # Lm [6] MODIFIER LETTER SMALL CAPITAL AA..MODIFIER LETTER SMALL B WITH HOOK
> -10787..107B0 ; Diacritic # Lm [42] MODIFIER LETTER SMALL DZ DIGRAPH..MODIFIER LETTER SMALL V WITH RIGHT HOOK
> -107B2..107BA ; Diacritic # Lm [9] MODIFIER LETTER SMALL CAPITAL Y..MODIFIER LETTER SMALL S WITH CURL
> -10AE5..10AE6 ; Diacritic # Mn [2] MANICHAEAN ABBREVIATION MARK ABOVE..MANICHAEAN ABBREVIATION MARK BELOW
> -10D22..10D23 ; Diacritic # Lo [2] HANIFI ROHINGYA MARK SAKIN..HANIFI ROHINGYA MARK NA KHONNA
> -10D24..10D27 ; Diacritic # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
> -10EFD..10EFF ; Diacritic # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
> -10F46..10F50 ; Diacritic # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
> -10F82..10F85 ; Diacritic # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
> -11046 ; Diacritic # Mn BRAHMI VIRAMA
> -11070 ; Diacritic # Mn BRAHMI SIGN OLD TAMIL VIRAMA
> -110B9..110BA ; Diacritic # Mn [2] KAITHI SIGN VIRAMA..KAITHI SIGN NUKTA
> -11133..11134 ; Diacritic # Mn [2] CHAKMA VIRAMA..CHAKMA MAAYYAA
> -11173 ; Diacritic # Mn MAHAJANI SIGN NUKTA
> -111C0 ; Diacritic # Mc SHARADA SIGN VIRAMA
> -111CA..111CC ; Diacritic # Mn [3] SHARADA SIGN NUKTA..SHARADA EXTRA SHORT VOWEL MARK
> -11235 ; Diacritic # Mc KHOJKI SIGN VIRAMA
> -11236 ; Diacritic # Mn KHOJKI SIGN NUKTA
> -112E9..112EA ; Diacritic # Mn [2] KHUDAWADI SIGN NUKTA..KHUDAWADI SIGN VIRAMA
> -1133C ; Diacritic # Mn GRANTHA SIGN NUKTA
> -1134D ; Diacritic # Mc GRANTHA SIGN VIRAMA
> -11366..1136C ; Diacritic # Mn [7] COMBINING GRANTHA DIGIT ZERO..COMBINING GRANTHA DIGIT SIX
> -11370..11374 ; Diacritic # Mn [5] COMBINING GRANTHA LETTER A..COMBINING GRANTHA LETTER PA
> -11442 ; Diacritic # Mn NEWA SIGN VIRAMA
> -11446 ; Diacritic # Mn NEWA SIGN NUKTA
> -114C2..114C3 ; Diacritic # Mn [2] TIRHUTA SIGN VIRAMA..TIRHUTA SIGN NUKTA
> -115BF..115C0 ; Diacritic # Mn [2] SIDDHAM SIGN VIRAMA..SIDDHAM SIGN NUKTA
> -1163F ; Diacritic # Mn MODI SIGN VIRAMA
> -116B6 ; Diacritic # Mc TAKRI SIGN VIRAMA
> -116B7 ; Diacritic # Mn TAKRI SIGN NUKTA
> -1172B ; Diacritic # Mn AHOM SIGN KILLER
> -11839..1183A ; Diacritic # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
> -1193D ; Diacritic # Mc DIVES AKURU SIGN HALANTA
> -1193E ; Diacritic # Mn DIVES AKURU VIRAMA
> -11943 ; Diacritic # Mn DIVES AKURU SIGN NUKTA
> -119E0 ; Diacritic # Mn NANDINAGARI SIGN VIRAMA
> -11A34 ; Diacritic # Mn ZANABAZAR SQUARE SIGN VIRAMA
> -11A47 ; Diacritic # Mn ZANABAZAR SQUARE SUBJOINER
> -11A99 ; Diacritic # Mn SOYOMBO SUBJOINER
> -11C3F ; Diacritic # Mn BHAIKSUKI SIGN VIRAMA
> -11D42 ; Diacritic # Mn MASARAM GONDI SIGN NUKTA
> -11D44..11D45 ; Diacritic # Mn [2] MASARAM GONDI SIGN HALANTA..MASARAM GONDI VIRAMA
> -11D97 ; Diacritic # Mn GUNJALA GONDI VIRAMA
> -13447..13455 ; Diacritic # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
> -16AF0..16AF4 ; Diacritic # Mn [5] BASSA VAH COMBINING HIGH TONE..BASSA VAH COMBINING HIGH-LOW TONE
> -16B30..16B36 ; Diacritic # Mn [7] PAHAWH HMONG MARK CIM TUB..PAHAWH HMONG MARK CIM TAUM
> -16F8F..16F92 ; Diacritic # Mn [4] MIAO TONE RIGHT..MIAO TONE BELOW
> -16F93..16F9F ; Diacritic # Lm [13] MIAO LETTER TONE-2..MIAO LETTER REFORMED TONE-8
> -16FF0..16FF1 ; Diacritic # Mc [2] VIETNAMESE ALTERNATE READING MARK CA..VIETNAMESE ALTERNATE READING MARK NHAY
> -1AFF0..1AFF3 ; Diacritic # Lm [4] KATAKANA LETTER MINNAN TONE-2..KATAKANA LETTER MINNAN TONE-5
> -1AFF5..1AFFB ; Diacritic # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
> -1AFFD..1AFFE ; Diacritic # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
> -1CF00..1CF2D ; Diacritic # Mn [46] ZNAMENNY COMBINING MARK GORAZDO NIZKO S KRYZHEM ON LEFT..ZNAMENNY COMBINING MARK KRYZH ON LEFT
> -1CF30..1CF46 ; Diacritic # Mn [23] ZNAMENNY COMBINING TONAL RANGE MARK MRACHNO..ZNAMENNY PRIZNAK MODIFIER ROG
> -1D167..1D169 ; Diacritic # Mn [3] MUSICAL SYMBOL COMBINING TREMOLO-1..MUSICAL SYMBOL COMBINING TREMOLO-3
> -1D16D..1D172 ; Diacritic # Mc [6] MUSICAL SYMBOL COMBINING AUGMENTATION DOT..MUSICAL SYMBOL COMBINING FLAG-5
> -1D17B..1D182 ; Diacritic # Mn [8] MUSICAL SYMBOL COMBINING ACCENT..MUSICAL SYMBOL COMBINING LOURE
> -1D185..1D18B ; Diacritic # Mn [7] MUSICAL SYMBOL COMBINING DOIT..MUSICAL SYMBOL COMBINING TRIPLE TONGUE
> -1D1AA..1D1AD ; Diacritic # Mn [4] MUSICAL SYMBOL COMBINING DOWN BOW..MUSICAL SYMBOL COMBINING SNAP PIZZICATO
> -1E030..1E06D ; Diacritic # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> -1E130..1E136 ; Diacritic # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
> -1E2AE ; Diacritic # Mn TOTO SIGN RISING TONE
> -1E2EC..1E2EF ; Diacritic # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
> -1E8D0..1E8D6 ; Diacritic # Mn [7] MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE KIKAKUI COMBINING NUMBER MILLIONS
> -1E944..1E946 ; Diacritic # Mn [3] ADLAM ALIF LENGTHENER..ADLAM GEMINATION MARK
> -1E948..1E94A ; Diacritic # Mn [3] ADLAM CONSONANT MODIFIER..ADLAM NUKTA
> -
> -# Total code points: 1144
> -
> -# ================================================
> -
> -00B7 ; Extender # Po MIDDLE DOT
> -02D0..02D1 ; Extender # Lm [2] MODIFIER LETTER TRIANGULAR COLON..MODIFIER LETTER HALF TRIANGULAR COLON
> -0640 ; Extender # Lm ARABIC TATWEEL
> -07FA ; Extender # Lm NKO LAJANYALAN
> -0B55 ; Extender # Mn ORIYA SIGN OVERLINE
> -0E46 ; Extender # Lm THAI CHARACTER MAIYAMOK
> -0EC6 ; Extender # Lm LAO KO LA
> -180A ; Extender # Po MONGOLIAN NIRUGU
> -1843 ; Extender # Lm MONGOLIAN LETTER TODO LONG VOWEL SIGN
> -1AA7 ; Extender # Lm TAI THAM SIGN MAI YAMOK
> -1C36 ; Extender # Mn LEPCHA SIGN RAN
> -1C7B ; Extender # Lm OL CHIKI RELAA
> -3005 ; Extender # Lm IDEOGRAPHIC ITERATION MARK
> -3031..3035 ; Extender # Lm [5] VERTICAL KANA REPEAT MARK..VERTICAL KANA REPEAT MARK LOWER HALF
> -309D..309E ; Extender # Lm [2] HIRAGANA ITERATION MARK..HIRAGANA VOICED ITERATION MARK
> -30FC..30FE ; Extender # Lm [3] KATAKANA-HIRAGANA PROLONGED SOUND MARK..KATAKANA VOICED ITERATION MARK
> -A015 ; Extender # Lm YI SYLLABLE WU
> -A60C ; Extender # Lm VAI SYLLABLE LENGTHENER
> -A9CF ; Extender # Lm JAVANESE PANGRANGKEP
> -A9E6 ; Extender # Lm MYANMAR MODIFIER LETTER SHAN REDUPLICATION
> -AA70 ; Extender # Lm MYANMAR MODIFIER LETTER KHAMTI REDUPLICATION
> -AADD ; Extender # Lm TAI VIET SYMBOL SAM
> -AAF3..AAF4 ; Extender # Lm [2] MEETEI MAYEK SYLLABLE REPETITION MARK..MEETEI MAYEK WORD REPETITION MARK
> -FF70 ; Extender # Lm HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
> -10781..10782 ; Extender # Lm [2] MODIFIER LETTER SUPERSCRIPT TRIANGULAR COLON..MODIFIER LETTER SUPERSCRIPT HALF TRIANGULAR COLON
> -1135D ; Extender # Lo GRANTHA SIGN PLUTA
> -115C6..115C8 ; Extender # Po [3] SIDDHAM REPETITION MARK-1..SIDDHAM REPETITION MARK-3
> -11A98 ; Extender # Mn SOYOMBO GEMINATION MARK
> -16B42..16B43 ; Extender # Lm [2] PAHAWH HMONG SIGN VOS NRUA..PAHAWH HMONG SIGN IB YAM
> -16FE0..16FE1 ; Extender # Lm [2] TANGUT ITERATION MARK..NUSHU ITERATION MARK
> -16FE3 ; Extender # Lm OLD CHINESE ITERATION MARK
> -1E13C..1E13D ; Extender # Lm [2] NYIAKENG PUACHUE HMONG SIGN XW XW..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
> -1E944..1E946 ; Extender # Mn [3] ADLAM ALIF LENGTHENER..ADLAM GEMINATION MARK
> -
> -# Total code points: 50
> -
> -# ================================================
> -
> -00AA ; Other_Lowercase # Lo FEMININE ORDINAL INDICATOR
> -00BA ; Other_Lowercase # Lo MASCULINE ORDINAL INDICATOR
> -02B0..02B8 ; Other_Lowercase # Lm [9] MODIFIER LETTER SMALL H..MODIFIER LETTER SMALL Y
> -02C0..02C1 ; Other_Lowercase # Lm [2] MODIFIER LETTER GLOTTAL STOP..MODIFIER LETTER REVERSED GLOTTAL STOP
> -02E0..02E4 ; Other_Lowercase # Lm [5] MODIFIER LETTER SMALL GAMMA..MODIFIER LETTER SMALL REVERSED GLOTTAL STOP
> -0345 ; Other_Lowercase # Mn COMBINING GREEK YPOGEGRAMMENI
> -037A ; Other_Lowercase # Lm GREEK YPOGEGRAMMENI
> -10FC ; Other_Lowercase # Lm MODIFIER LETTER GEORGIAN NAR
> -1D2C..1D6A ; Other_Lowercase # Lm [63] MODIFIER LETTER CAPITAL A..GREEK SUBSCRIPT SMALL LETTER CHI
> -1D78 ; Other_Lowercase # Lm MODIFIER LETTER CYRILLIC EN
> -1D9B..1DBF ; Other_Lowercase # Lm [37] MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER LETTER SMALL THETA
> -2071 ; Other_Lowercase # Lm SUPERSCRIPT LATIN SMALL LETTER I
> -207F ; Other_Lowercase # Lm SUPERSCRIPT LATIN SMALL LETTER N
> -2090..209C ; Other_Lowercase # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
> -2170..217F ; Other_Lowercase # Nl [16] SMALL ROMAN NUMERAL ONE..SMALL ROMAN NUMERAL ONE THOUSAND
> -24D0..24E9 ; Other_Lowercase # So [26] CIRCLED LATIN SMALL LETTER A..CIRCLED LATIN SMALL LETTER Z
> -2C7C..2C7D ; Other_Lowercase # Lm [2] LATIN SUBSCRIPT SMALL LETTER J..MODIFIER LETTER CAPITAL V
> -A69C..A69D ; Other_Lowercase # Lm [2] MODIFIER LETTER CYRILLIC HARD SIGN..MODIFIER LETTER CYRILLIC SOFT SIGN
> -A770 ; Other_Lowercase # Lm MODIFIER LETTER US
> -A7F2..A7F4 ; Other_Lowercase # Lm [3] MODIFIER LETTER CAPITAL C..MODIFIER LETTER CAPITAL Q
> -A7F8..A7F9 ; Other_Lowercase # Lm [2] MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIER LETTER SMALL LIGATURE OE
> -AB5C..AB5F ; Other_Lowercase # Lm [4] MODIFIER LETTER SMALL HENG..MODIFIER LETTER SMALL U WITH LEFT HOOK
> -AB69 ; Other_Lowercase # Lm MODIFIER LETTER SMALL TURNED W
> -10780 ; Other_Lowercase # Lm MODIFIER LETTER SMALL CAPITAL AA
> -10783..10785 ; Other_Lowercase # Lm [3] MODIFIER LETTER SMALL AE..MODIFIER LETTER SMALL B WITH HOOK
> -10787..107B0 ; Other_Lowercase # Lm [42] MODIFIER LETTER SMALL DZ DIGRAPH..MODIFIER LETTER SMALL V WITH RIGHT HOOK
> -107B2..107BA ; Other_Lowercase # Lm [9] MODIFIER LETTER SMALL CAPITAL Y..MODIFIER LETTER SMALL S WITH CURL
> -1E030..1E06D ; Other_Lowercase # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
> -
> -# Total code points: 311
> -
> -# ================================================
> -
> -2160..216F ; Other_Uppercase # Nl [16] ROMAN NUMERAL ONE..ROMAN NUMERAL ONE THOUSAND
> -24B6..24CF ; Other_Uppercase # So [26] CIRCLED LATIN CAPITAL LETTER A..CIRCLED LATIN CAPITAL LETTER Z
> -1F130..1F149 ; Other_Uppercase # So [26] SQUARED LATIN CAPITAL LETTER A..SQUARED LATIN CAPITAL LETTER Z
> -1F150..1F169 ; Other_Uppercase # So [26] NEGATIVE CIRCLED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
> -1F170..1F189 ; Other_Uppercase # So [26] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED LATIN CAPITAL LETTER Z
> -
> -# Total code points: 120
> -
> -# ================================================
> -
> -FDD0..FDEF ; Noncharacter_Code_Point # Cn [32] <noncharacter-FDD0>..<noncharacter-FDEF>
> -FFFE..FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-FFFE>..<noncharacter-FFFF>
> -1FFFE..1FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-1FFFE>..<noncharacter-1FFFF>
> -2FFFE..2FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-2FFFE>..<noncharacter-2FFFF>
> -3FFFE..3FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-3FFFE>..<noncharacter-3FFFF>
> -4FFFE..4FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-4FFFE>..<noncharacter-4FFFF>
> -5FFFE..5FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-5FFFE>..<noncharacter-5FFFF>
> -6FFFE..6FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-6FFFE>..<noncharacter-6FFFF>
> -7FFFE..7FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-7FFFE>..<noncharacter-7FFFF>
> -8FFFE..8FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-8FFFE>..<noncharacter-8FFFF>
> -9FFFE..9FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-9FFFE>..<noncharacter-9FFFF>
> -AFFFE..AFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-AFFFE>..<noncharacter-AFFFF>
> -BFFFE..BFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-BFFFE>..<noncharacter-BFFFF>
> -CFFFE..CFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-CFFFE>..<noncharacter-CFFFF>
> -DFFFE..DFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-DFFFE>..<noncharacter-DFFFF>
> -EFFFE..EFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-EFFFE>..<noncharacter-EFFFF>
> -FFFFE..FFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-FFFFE>..<noncharacter-FFFFF>
> -10FFFE..10FFFF; Noncharacter_Code_Point # Cn [2] <noncharacter-10FFFE>..<noncharacter-10FFFF>
> -
> -# Total code points: 66
> -
> -# ================================================
> -
> -09BE ; Other_Grapheme_Extend # Mc BENGALI VOWEL SIGN AA
> -09D7 ; Other_Grapheme_Extend # Mc BENGALI AU LENGTH MARK
> -0B3E ; Other_Grapheme_Extend # Mc ORIYA VOWEL SIGN AA
> -0B57 ; Other_Grapheme_Extend # Mc ORIYA AU LENGTH MARK
> -0BBE ; Other_Grapheme_Extend # Mc TAMIL VOWEL SIGN AA
> -0BD7 ; Other_Grapheme_Extend # Mc TAMIL AU LENGTH MARK
> -0CC2 ; Other_Grapheme_Extend # Mc KANNADA VOWEL SIGN UU
> -0CD5..0CD6 ; Other_Grapheme_Extend # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
> -0D3E ; Other_Grapheme_Extend # Mc MALAYALAM VOWEL SIGN AA
> -0D57 ; Other_Grapheme_Extend # Mc MALAYALAM AU LENGTH MARK
> -0DCF ; Other_Grapheme_Extend # Mc SINHALA VOWEL SIGN AELA-PILLA
> -0DDF ; Other_Grapheme_Extend # Mc SINHALA VOWEL SIGN GAYANUKITTA
> -1B35 ; Other_Grapheme_Extend # Mc BALINESE VOWEL SIGN TEDUNG
> -200C ; Other_Grapheme_Extend # Cf ZERO WIDTH NON-JOINER
> -302E..302F ; Other_Grapheme_Extend # Mc [2] HANGUL SINGLE DOT TONE MARK..HANGUL DOUBLE DOT TONE MARK
> -FF9E..FF9F ; Other_Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
> -1133E ; Other_Grapheme_Extend # Mc GRANTHA VOWEL SIGN AA
> -11357 ; Other_Grapheme_Extend # Mc GRANTHA AU LENGTH MARK
> -114B0 ; Other_Grapheme_Extend # Mc TIRHUTA VOWEL SIGN AA
> -114BD ; Other_Grapheme_Extend # Mc TIRHUTA VOWEL SIGN SHORT O
> -115AF ; Other_Grapheme_Extend # Mc SIDDHAM VOWEL SIGN AA
> -11930 ; Other_Grapheme_Extend # Mc DIVES AKURU VOWEL SIGN AA
> -1D165 ; Other_Grapheme_Extend # Mc MUSICAL SYMBOL COMBINING STEM
> -1D16E..1D172 ; Other_Grapheme_Extend # Mc [5] MUSICAL SYMBOL COMBINING FLAG-1..MUSICAL SYMBOL COMBINING FLAG-5
> -E0020..E007F ; Other_Grapheme_Extend # Cf [96] TAG SPACE..CANCEL TAG
> -
> -# Total code points: 127
> -
> -# ================================================
> -
> -2FF0..2FF1 ; IDS_Binary_Operator # So [2] IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO RIGHT..IDEOGRAPHIC DESCRIPTION CHARACTER ABOVE TO BELOW
> -2FF4..2FFD ; IDS_Binary_Operator # So [10] IDEOGRAPHIC DESCRIPTION CHARACTER FULL SURROUND..IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM LOWER RIGHT
> -31EF ; IDS_Binary_Operator # So IDEOGRAPHIC DESCRIPTION CHARACTER SUBTRACTION
> -
> -# Total code points: 13
> -
> -# ================================================
> -
> -2FF2..2FF3 ; IDS_Trinary_Operator # So [2] IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO MIDDLE AND RIGHT..IDEOGRAPHIC DESCRIPTION CHARACTER ABOVE TO MIDDLE AND BELOW
> -
> -# Total code points: 2
> -
> -# ================================================
> -
> -2FFE..2FFF ; IDS_Unary_Operator # So [2] IDEOGRAPHIC DESCRIPTION CHARACTER HORIZONTAL REFLECTION..IDEOGRAPHIC DESCRIPTION CHARACTER ROTATION
> -
> -# Total code points: 2
> -
> -# ================================================
> -
> -2E80..2E99 ; Radical # So [26] CJK RADICAL REPEAT..CJK RADICAL RAP
> -2E9B..2EF3 ; Radical # So [89] CJK RADICAL CHOKE..CJK RADICAL C-SIMPLIFIED TURTLE
> -2F00..2FD5 ; Radical # So [214] KANGXI RADICAL ONE..KANGXI RADICAL FLUTE
> -
> -# Total code points: 329
> -
> -# ================================================
> -
> -3400..4DBF ; Unified_Ideograph # Lo [6592] CJK UNIFIED IDEOGRAPH-3400..CJK UNIFIED IDEOGRAPH-4DBF
> -4E00..9FFF ; Unified_Ideograph # Lo [20992] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FFF
> -FA0E..FA0F ; Unified_Ideograph # Lo [2] CJK COMPATIBILITY IDEOGRAPH-FA0E..CJK COMPATIBILITY IDEOGRAPH-FA0F
> -FA11 ; Unified_Ideograph # Lo CJK COMPATIBILITY IDEOGRAPH-FA11
> -FA13..FA14 ; Unified_Ideograph # Lo [2] CJK COMPATIBILITY IDEOGRAPH-FA13..CJK COMPATIBILITY IDEOGRAPH-FA14
> -FA1F ; Unified_Ideograph # Lo CJK COMPATIBILITY IDEOGRAPH-FA1F
> -FA21 ; Unified_Ideograph # Lo CJK COMPATIBILITY IDEOGRAPH-FA21
> -FA23..FA24 ; Unified_Ideograph # Lo [2] CJK COMPATIBILITY IDEOGRAPH-FA23..CJK COMPATIBILITY IDEOGRAPH-FA24
> -FA27..FA29 ; Unified_Ideograph # Lo [3] CJK COMPATIBILITY IDEOGRAPH-FA27..CJK COMPATIBILITY IDEOGRAPH-FA29
> -20000..2A6DF ; Unified_Ideograph # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
> -2A700..2B739 ; Unified_Ideograph # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
> -2B740..2B81D ; Unified_Ideograph # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
> -2B820..2CEA1 ; Unified_Ideograph # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
> -2CEB0..2EBE0 ; Unified_Ideograph # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
> -2EBF0..2EE5D ; Unified_Ideograph # Lo [622] CJK UNIFIED IDEOGRAPH-2EBF0..CJK UNIFIED IDEOGRAPH-2EE5D
> -30000..3134A ; Unified_Ideograph # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
> -31350..323AF ; Unified_Ideograph # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
> -
> -# Total code points: 97680
> -
> -# ================================================
> -
> -034F ; Other_Default_Ignorable_Code_Point # Mn COMBINING GRAPHEME JOINER
> -115F..1160 ; Other_Default_Ignorable_Code_Point # Lo [2] HANGUL CHOSEONG FILLER..HANGUL JUNGSEONG FILLER
> -17B4..17B5 ; Other_Default_Ignorable_Code_Point # Mn [2] KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT AA
> -2065 ; Other_Default_Ignorable_Code_Point # Cn <reserved-2065>
> -3164 ; Other_Default_Ignorable_Code_Point # Lo HANGUL FILLER
> -FFA0 ; Other_Default_Ignorable_Code_Point # Lo HALFWIDTH HANGUL FILLER
> -FFF0..FFF8 ; Other_Default_Ignorable_Code_Point # Cn [9] <reserved-FFF0>..<reserved-FFF8>
> -E0000 ; Other_Default_Ignorable_Code_Point # Cn <reserved-E0000>
> -E0002..E001F ; Other_Default_Ignorable_Code_Point # Cn [30] <reserved-E0002>..<reserved-E001F>
> -E0080..E00FF ; Other_Default_Ignorable_Code_Point # Cn [128] <reserved-E0080>..<reserved-E00FF>
> -E01F0..E0FFF ; Other_Default_Ignorable_Code_Point # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
> -
> -# Total code points: 3776
> -
> -# ================================================
> -
> -0149 ; Deprecated # L& LATIN SMALL LETTER N PRECEDED BY APOSTROPHE
> -0673 ; Deprecated # Lo ARABIC LETTER ALEF WITH WAVY HAMZA BELOW
> -0F77 ; Deprecated # Mn TIBETAN VOWEL SIGN VOCALIC RR
> -0F79 ; Deprecated # Mn TIBETAN VOWEL SIGN VOCALIC LL
> -17A3..17A4 ; Deprecated # Lo [2] KHMER INDEPENDENT VOWEL QAQ..KHMER INDEPENDENT VOWEL QAA
> -206A..206F ; Deprecated # Cf [6] INHIBIT SYMMETRIC SWAPPING..NOMINAL DIGIT SHAPES
> -2329 ; Deprecated # Ps LEFT-POINTING ANGLE BRACKET
> -232A ; Deprecated # Pe RIGHT-POINTING ANGLE BRACKET
> -E0001 ; Deprecated # Cf LANGUAGE TAG
> -
> -# Total code points: 15
> -
> -# ================================================
> -
> -0069..006A ; Soft_Dotted # L& [2] LATIN SMALL LETTER I..LATIN SMALL LETTER J
> -012F ; Soft_Dotted # L& LATIN SMALL LETTER I WITH OGONEK
> -0249 ; Soft_Dotted # L& LATIN SMALL LETTER J WITH STROKE
> -0268 ; Soft_Dotted # L& LATIN SMALL LETTER I WITH STROKE
> -029D ; Soft_Dotted # L& LATIN SMALL LETTER J WITH CROSSED-TAIL
> -02B2 ; Soft_Dotted # Lm MODIFIER LETTER SMALL J
> -03F3 ; Soft_Dotted # L& GREEK LETTER YOT
> -0456 ; Soft_Dotted # L& CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> -0458 ; Soft_Dotted # L& CYRILLIC SMALL LETTER JE
> -1D62 ; Soft_Dotted # Lm LATIN SUBSCRIPT SMALL LETTER I
> -1D96 ; Soft_Dotted # L& LATIN SMALL LETTER I WITH RETROFLEX HOOK
> -1DA4 ; Soft_Dotted # Lm MODIFIER LETTER SMALL I WITH STROKE
> -1DA8 ; Soft_Dotted # Lm MODIFIER LETTER SMALL J WITH CROSSED-TAIL
> -1E2D ; Soft_Dotted # L& LATIN SMALL LETTER I WITH TILDE BELOW
> -1ECB ; Soft_Dotted # L& LATIN SMALL LETTER I WITH DOT BELOW
> -2071 ; Soft_Dotted # Lm SUPERSCRIPT LATIN SMALL LETTER I
> -2148..2149 ; Soft_Dotted # L& [2] DOUBLE-STRUCK ITALIC SMALL I..DOUBLE-STRUCK ITALIC SMALL J
> -2C7C ; Soft_Dotted # Lm LATIN SUBSCRIPT SMALL LETTER J
> -1D422..1D423 ; Soft_Dotted # L& [2] MATHEMATICAL BOLD SMALL I..MATHEMATICAL BOLD SMALL J
> -1D456..1D457 ; Soft_Dotted # L& [2] MATHEMATICAL ITALIC SMALL I..MATHEMATICAL ITALIC SMALL J
> -1D48A..1D48B ; Soft_Dotted # L& [2] MATHEMATICAL BOLD ITALIC SMALL I..MATHEMATICAL BOLD ITALIC SMALL J
> -1D4BE..1D4BF ; Soft_Dotted # L& [2] MATHEMATICAL SCRIPT SMALL I..MATHEMATICAL SCRIPT SMALL J
> -1D4F2..1D4F3 ; Soft_Dotted # L& [2] MATHEMATICAL BOLD SCRIPT SMALL I..MATHEMATICAL BOLD SCRIPT SMALL J
> -1D526..1D527 ; Soft_Dotted # L& [2] MATHEMATICAL FRAKTUR SMALL I..MATHEMATICAL FRAKTUR SMALL J
> -1D55A..1D55B ; Soft_Dotted # L& [2] MATHEMATICAL DOUBLE-STRUCK SMALL I..MATHEMATICAL DOUBLE-STRUCK SMALL J
> -1D58E..1D58F ; Soft_Dotted # L& [2] MATHEMATICAL BOLD FRAKTUR SMALL I..MATHEMATICAL BOLD FRAKTUR SMALL J
> -1D5C2..1D5C3 ; Soft_Dotted # L& [2] MATHEMATICAL SANS-SERIF SMALL I..MATHEMATICAL SANS-SERIF SMALL J
> -1D5F6..1D5F7 ; Soft_Dotted # L& [2] MATHEMATICAL SANS-SERIF BOLD SMALL I..MATHEMATICAL SANS-SERIF BOLD SMALL J
> -1D62A..1D62B ; Soft_Dotted # L& [2] MATHEMATICAL SANS-SERIF ITALIC SMALL I..MATHEMATICAL SANS-SERIF ITALIC SMALL J
> -1D65E..1D65F ; Soft_Dotted # L& [2] MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL I..MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL J
> -1D692..1D693 ; Soft_Dotted # L& [2] MATHEMATICAL MONOSPACE SMALL I..MATHEMATICAL MONOSPACE SMALL J
> -1DF1A ; Soft_Dotted # L& LATIN SMALL LETTER I WITH STROKE AND RETROFLEX HOOK
> -1E04C..1E04D ; Soft_Dotted # Lm [2] MODIFIER LETTER CYRILLIC SMALL BYELORUSSIAN-UKRAINIAN I..MODIFIER LETTER CYRILLIC SMALL JE
> -1E068 ; Soft_Dotted # Lm CYRILLIC SUBSCRIPT SMALL LETTER BYELORUSSIAN-UKRAINIAN I
> -
> -# Total code points: 50
> -
> -# ================================================
> -
> -0E40..0E44 ; Logical_Order_Exception # Lo [5] THAI CHARACTER SARA E..THAI CHARACTER SARA AI MAIMALAI
> -0EC0..0EC4 ; Logical_Order_Exception # Lo [5] LAO VOWEL SIGN E..LAO VOWEL SIGN AI
> -19B5..19B7 ; Logical_Order_Exception # Lo [3] NEW TAI LUE VOWEL SIGN E..NEW TAI LUE VOWEL SIGN O
> -19BA ; Logical_Order_Exception # Lo NEW TAI LUE VOWEL SIGN AY
> -AAB5..AAB6 ; Logical_Order_Exception # Lo [2] TAI VIET VOWEL E..TAI VIET VOWEL O
> -AAB9 ; Logical_Order_Exception # Lo TAI VIET VOWEL UEA
> -AABB..AABC ; Logical_Order_Exception # Lo [2] TAI VIET VOWEL AUE..TAI VIET VOWEL AY
> -
> -# Total code points: 19
> -
> -# ================================================
> -
> -1885..1886 ; Other_ID_Start # Mn [2] MONGOLIAN LETTER ALI GALI BALUDA..MONGOLIAN LETTER ALI GALI THREE BALUDA
> -2118 ; Other_ID_Start # Sm SCRIPT CAPITAL P
> -212E ; Other_ID_Start # So ESTIMATED SYMBOL
> -309B..309C ; Other_ID_Start # Sk [2] KATAKANA-HIRAGANA VOICED SOUND MARK..KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
> -
> -# Total code points: 6
> -
> -# ================================================
> -
> -00B7 ; Other_ID_Continue # Po MIDDLE DOT
> -0387 ; Other_ID_Continue # Po GREEK ANO TELEIA
> -1369..1371 ; Other_ID_Continue # No [9] ETHIOPIC DIGIT ONE..ETHIOPIC DIGIT NINE
> -19DA ; Other_ID_Continue # No NEW TAI LUE THAM DIGIT ONE
> -200C..200D ; Other_ID_Continue # Cf [2] ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER
> -30FB ; Other_ID_Continue # Po KATAKANA MIDDLE DOT
> -FF65 ; Other_ID_Continue # Po HALFWIDTH KATAKANA MIDDLE DOT
> -
> -# Total code points: 16
> -
> -# ================================================
> -
> -00B2..00B3 ; ID_Compat_Math_Continue # No [2] SUPERSCRIPT TWO..SUPERSCRIPT THREE
> -00B9 ; ID_Compat_Math_Continue # No SUPERSCRIPT ONE
> -2070 ; ID_Compat_Math_Continue # No SUPERSCRIPT ZERO
> -2074..2079 ; ID_Compat_Math_Continue # No [6] SUPERSCRIPT FOUR..SUPERSCRIPT NINE
> -207A..207C ; ID_Compat_Math_Continue # Sm [3] SUPERSCRIPT PLUS SIGN..SUPERSCRIPT EQUALS SIGN
> -207D ; ID_Compat_Math_Continue # Ps SUPERSCRIPT LEFT PARENTHESIS
> -207E ; ID_Compat_Math_Continue # Pe SUPERSCRIPT RIGHT PARENTHESIS
> -2080..2089 ; ID_Compat_Math_Continue # No [10] SUBSCRIPT ZERO..SUBSCRIPT NINE
> -208A..208C ; ID_Compat_Math_Continue # Sm [3] SUBSCRIPT PLUS SIGN..SUBSCRIPT EQUALS SIGN
> -208D ; ID_Compat_Math_Continue # Ps SUBSCRIPT LEFT PARENTHESIS
> -208E ; ID_Compat_Math_Continue # Pe SUBSCRIPT RIGHT PARENTHESIS
> -2202 ; ID_Compat_Math_Continue # Sm PARTIAL DIFFERENTIAL
> -2207 ; ID_Compat_Math_Continue # Sm NABLA
> -221E ; ID_Compat_Math_Continue # Sm INFINITY
> -1D6C1 ; ID_Compat_Math_Continue # Sm MATHEMATICAL BOLD NABLA
> -1D6DB ; ID_Compat_Math_Continue # Sm MATHEMATICAL BOLD PARTIAL DIFFERENTIAL
> -1D6FB ; ID_Compat_Math_Continue # Sm MATHEMATICAL ITALIC NABLA
> -1D715 ; ID_Compat_Math_Continue # Sm MATHEMATICAL ITALIC PARTIAL DIFFERENTIAL
> -1D735 ; ID_Compat_Math_Continue # Sm MATHEMATICAL BOLD ITALIC NABLA
> -1D74F ; ID_Compat_Math_Continue # Sm MATHEMATICAL BOLD ITALIC PARTIAL DIFFERENTIAL
> -1D76F ; ID_Compat_Math_Continue # Sm MATHEMATICAL SANS-SERIF BOLD NABLA
> -1D789 ; ID_Compat_Math_Continue # Sm MATHEMATICAL SANS-SERIF BOLD PARTIAL DIFFERENTIAL
> -1D7A9 ; ID_Compat_Math_Continue # Sm MATHEMATICAL SANS-SERIF BOLD ITALIC NABLA
> -1D7C3 ; ID_Compat_Math_Continue # Sm MATHEMATICAL SANS-SERIF BOLD ITALIC PARTIAL DIFFERENTIAL
> -
> -# Total code points: 43
> -
> -# ================================================
> -
> -2202 ; ID_Compat_Math_Start # Sm PARTIAL DIFFERENTIAL
> -2207 ; ID_Compat_Math_Start # Sm NABLA
> -221E ; ID_Compat_Math_Start # Sm INFINITY
> -1D6C1 ; ID_Compat_Math_Start # Sm MATHEMATICAL BOLD NABLA
> -1D6DB ; ID_Compat_Math_Start # Sm MATHEMATICAL BOLD PARTIAL DIFFERENTIAL
> -1D6FB ; ID_Compat_Math_Start # Sm MATHEMATICAL ITALIC NABLA
> -1D715 ; ID_Compat_Math_Start # Sm MATHEMATICAL ITALIC PARTIAL DIFFERENTIAL
> -1D735 ; ID_Compat_Math_Start # Sm MATHEMATICAL BOLD ITALIC NABLA
> -1D74F ; ID_Compat_Math_Start # Sm MATHEMATICAL BOLD ITALIC PARTIAL DIFFERENTIAL
> -1D76F ; ID_Compat_Math_Start # Sm MATHEMATICAL SANS-SERIF BOLD NABLA
> -1D789 ; ID_Compat_Math_Start # Sm MATHEMATICAL SANS-SERIF BOLD PARTIAL DIFFERENTIAL
> -1D7A9 ; ID_Compat_Math_Start # Sm MATHEMATICAL SANS-SERIF BOLD ITALIC NABLA
> -1D7C3 ; ID_Compat_Math_Start # Sm MATHEMATICAL SANS-SERIF BOLD ITALIC PARTIAL DIFFERENTIAL
> -
> -# Total code points: 13
> -
> -# ================================================
> -
> -0021 ; Sentence_Terminal # Po EXCLAMATION MARK
> -002E ; Sentence_Terminal # Po FULL STOP
> -003F ; Sentence_Terminal # Po QUESTION MARK
> -0589 ; Sentence_Terminal # Po ARMENIAN FULL STOP
> -061D..061F ; Sentence_Terminal # Po [3] ARABIC END OF TEXT MARK..ARABIC QUESTION MARK
> -06D4 ; Sentence_Terminal # Po ARABIC FULL STOP
> -0700..0702 ; Sentence_Terminal # Po [3] SYRIAC END OF PARAGRAPH..SYRIAC SUBLINEAR FULL STOP
> -07F9 ; Sentence_Terminal # Po NKO EXCLAMATION MARK
> -0837 ; Sentence_Terminal # Po SAMARITAN PUNCTUATION MELODIC QITSA
> -0839 ; Sentence_Terminal # Po SAMARITAN PUNCTUATION QITSA
> -083D..083E ; Sentence_Terminal # Po [2] SAMARITAN PUNCTUATION SOF MASHFAAT..SAMARITAN PUNCTUATION ANNAAU
> -0964..0965 ; Sentence_Terminal # Po [2] DEVANAGARI DANDA..DEVANAGARI DOUBLE DANDA
> -104A..104B ; Sentence_Terminal # Po [2] MYANMAR SIGN LITTLE SECTION..MYANMAR SIGN SECTION
> -1362 ; Sentence_Terminal # Po ETHIOPIC FULL STOP
> -1367..1368 ; Sentence_Terminal # Po [2] ETHIOPIC QUESTION MARK..ETHIOPIC PARAGRAPH SEPARATOR
> -166E ; Sentence_Terminal # Po CANADIAN SYLLABICS FULL STOP
> -1735..1736 ; Sentence_Terminal # Po [2] PHILIPPINE SINGLE PUNCTUATION..PHILIPPINE DOUBLE PUNCTUATION
> -17D4..17D5 ; Sentence_Terminal # Po [2] KHMER SIGN KHAN..KHMER SIGN BARIYOOSAN
> -1803 ; Sentence_Terminal # Po MONGOLIAN FULL STOP
> -1809 ; Sentence_Terminal # Po MONGOLIAN MANCHU FULL STOP
> -1944..1945 ; Sentence_Terminal # Po [2] LIMBU EXCLAMATION MARK..LIMBU QUESTION MARK
> -1AA8..1AAB ; Sentence_Terminal # Po [4] TAI THAM SIGN KAAN..TAI THAM SIGN SATKAANKUU
> -1B5A..1B5B ; Sentence_Terminal # Po [2] BALINESE PANTI..BALINESE PAMADA
> -1B5E..1B5F ; Sentence_Terminal # Po [2] BALINESE CARIK SIKI..BALINESE CARIK PAREREN
> -1B7D..1B7E ; Sentence_Terminal # Po [2] BALINESE PANTI LANTANG..BALINESE PAMADA LANTANG
> -1C3B..1C3C ; Sentence_Terminal # Po [2] LEPCHA PUNCTUATION TA-ROL..LEPCHA PUNCTUATION NYET THYOOM TA-ROL
> -1C7E..1C7F ; Sentence_Terminal # Po [2] OL CHIKI PUNCTUATION MUCAAD..OL CHIKI PUNCTUATION DOUBLE MUCAAD
> -203C..203D ; Sentence_Terminal # Po [2] DOUBLE EXCLAMATION MARK..INTERROBANG
> -2047..2049 ; Sentence_Terminal # Po [3] DOUBLE QUESTION MARK..EXCLAMATION QUESTION MARK
> -2E2E ; Sentence_Terminal # Po REVERSED QUESTION MARK
> -2E3C ; Sentence_Terminal # Po STENOGRAPHIC FULL STOP
> -2E53..2E54 ; Sentence_Terminal # Po [2] MEDIEVAL EXCLAMATION MARK..MEDIEVAL QUESTION MARK
> -3002 ; Sentence_Terminal # Po IDEOGRAPHIC FULL STOP
> -A4FF ; Sentence_Terminal # Po LISU PUNCTUATION FULL STOP
> -A60E..A60F ; Sentence_Terminal # Po [2] VAI FULL STOP..VAI QUESTION MARK
> -A6F3 ; Sentence_Terminal # Po BAMUM FULL STOP
> -A6F7 ; Sentence_Terminal # Po BAMUM QUESTION MARK
> -A876..A877 ; Sentence_Terminal # Po [2] PHAGS-PA MARK SHAD..PHAGS-PA MARK DOUBLE SHAD
> -A8CE..A8CF ; Sentence_Terminal # Po [2] SAURASHTRA DANDA..SAURASHTRA DOUBLE DANDA
> -A92F ; Sentence_Terminal # Po KAYAH LI SIGN SHYA
> -A9C8..A9C9 ; Sentence_Terminal # Po [2] JAVANESE PADA LINGSA..JAVANESE PADA LUNGSI
> -AA5D..AA5F ; Sentence_Terminal # Po [3] CHAM PUNCTUATION DANDA..CHAM PUNCTUATION TRIPLE DANDA
> -AAF0..AAF1 ; Sentence_Terminal # Po [2] MEETEI MAYEK CHEIKHAN..MEETEI MAYEK AHANG KHUDAM
> -ABEB ; Sentence_Terminal # Po MEETEI MAYEK CHEIKHEI
> -FE52 ; Sentence_Terminal # Po SMALL FULL STOP
> -FE56..FE57 ; Sentence_Terminal # Po [2] SMALL QUESTION MARK..SMALL EXCLAMATION MARK
> -FF01 ; Sentence_Terminal # Po FULLWIDTH EXCLAMATION MARK
> -FF0E ; Sentence_Terminal # Po FULLWIDTH FULL STOP
> -FF1F ; Sentence_Terminal # Po FULLWIDTH QUESTION MARK
> -FF61 ; Sentence_Terminal # Po HALFWIDTH IDEOGRAPHIC FULL STOP
> -10A56..10A57 ; Sentence_Terminal # Po [2] KHAROSHTHI PUNCTUATION DANDA..KHAROSHTHI PUNCTUATION DOUBLE DANDA
> -10F55..10F59 ; Sentence_Terminal # Po [5] SOGDIAN PUNCTUATION TWO VERTICAL BARS..SOGDIAN PUNCTUATION HALF CIRCLE WITH DOT
> -10F86..10F89 ; Sentence_Terminal # Po [4] OLD UYGHUR PUNCTUATION BAR..OLD UYGHUR PUNCTUATION FOUR DOTS
> -11047..11048 ; Sentence_Terminal # Po [2] BRAHMI DANDA..BRAHMI DOUBLE DANDA
> -110BE..110C1 ; Sentence_Terminal # Po [4] KAITHI SECTION MARK..KAITHI DOUBLE DANDA
> -11141..11143 ; Sentence_Terminal # Po [3] CHAKMA DANDA..CHAKMA QUESTION MARK
> -111C5..111C6 ; Sentence_Terminal # Po [2] SHARADA DANDA..SHARADA DOUBLE DANDA
> -111CD ; Sentence_Terminal # Po SHARADA SUTRA MARK
> -111DE..111DF ; Sentence_Terminal # Po [2] SHARADA SECTION MARK-1..SHARADA SECTION MARK-2
> -11238..11239 ; Sentence_Terminal # Po [2] KHOJKI DANDA..KHOJKI DOUBLE DANDA
> -1123B..1123C ; Sentence_Terminal # Po [2] KHOJKI SECTION MARK..KHOJKI DOUBLE SECTION MARK
> -112A9 ; Sentence_Terminal # Po MULTANI SECTION MARK
> -1144B..1144C ; Sentence_Terminal # Po [2] NEWA DANDA..NEWA DOUBLE DANDA
> -115C2..115C3 ; Sentence_Terminal # Po [2] SIDDHAM DANDA..SIDDHAM DOUBLE DANDA
> -115C9..115D7 ; Sentence_Terminal # Po [15] SIDDHAM END OF TEXT MARK..SIDDHAM SECTION MARK WITH CIRCLES AND FOUR ENCLOSURES
> -11641..11642 ; Sentence_Terminal # Po [2] MODI DANDA..MODI DOUBLE DANDA
> -1173C..1173E ; Sentence_Terminal # Po [3] AHOM SIGN SMALL SECTION..AHOM SIGN RULAI
> -11944 ; Sentence_Terminal # Po DIVES AKURU DOUBLE DANDA
> -11946 ; Sentence_Terminal # Po DIVES AKURU END OF TEXT MARK
> -11A42..11A43 ; Sentence_Terminal # Po [2] ZANABAZAR SQUARE MARK SHAD..ZANABAZAR SQUARE MARK DOUBLE SHAD
> -11A9B..11A9C ; Sentence_Terminal # Po [2] SOYOMBO MARK SHAD..SOYOMBO MARK DOUBLE SHAD
> -11C41..11C42 ; Sentence_Terminal # Po [2] BHAIKSUKI DANDA..BHAIKSUKI DOUBLE DANDA
> -11EF7..11EF8 ; Sentence_Terminal # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
> -11F43..11F44 ; Sentence_Terminal # Po [2] KAWI DANDA..KAWI DOUBLE DANDA
> -16A6E..16A6F ; Sentence_Terminal # Po [2] MRO DANDA..MRO DOUBLE DANDA
> -16AF5 ; Sentence_Terminal # Po BASSA VAH FULL STOP
> -16B37..16B38 ; Sentence_Terminal # Po [2] PAHAWH HMONG SIGN VOS THOM..PAHAWH HMONG SIGN VOS TSHAB CEEB
> -16B44 ; Sentence_Terminal # Po PAHAWH HMONG SIGN XAUS
> -16E98 ; Sentence_Terminal # Po MEDEFAIDRIN FULL STOP
> -1BC9F ; Sentence_Terminal # Po DUPLOYAN PUNCTUATION CHINOOK FULL STOP
> -1DA88 ; Sentence_Terminal # Po SIGNWRITING FULL STOP
> -
> -# Total code points: 156
> -
> -# ================================================
> -
> -180B..180D ; Variation_Selector # Mn [3] MONGOLIAN FREE VARIATION SELECTOR ONE..MONGOLIAN FREE VARIATION SELECTOR THREE
> -180F ; Variation_Selector # Mn MONGOLIAN FREE VARIATION SELECTOR FOUR
> -FE00..FE0F ; Variation_Selector # Mn [16] VARIATION SELECTOR-1..VARIATION SELECTOR-16
> -E0100..E01EF ; Variation_Selector # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
> -
> -# Total code points: 260
> -
> -# ================================================
> -
> -0009..000D ; Pattern_White_Space # Cc [5] <control-0009>..<control-000D>
> -0020 ; Pattern_White_Space # Zs SPACE
> -0085 ; Pattern_White_Space # Cc <control-0085>
> -200E..200F ; Pattern_White_Space # Cf [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
> -2028 ; Pattern_White_Space # Zl LINE SEPARATOR
> -2029 ; Pattern_White_Space # Zp PARAGRAPH SEPARATOR
> -
> -# Total code points: 11
> -
> -# ================================================
> -
> -0021..0023 ; Pattern_Syntax # Po [3] EXCLAMATION MARK..NUMBER SIGN
> -0024 ; Pattern_Syntax # Sc DOLLAR SIGN
> -0025..0027 ; Pattern_Syntax # Po [3] PERCENT SIGN..APOSTROPHE
> -0028 ; Pattern_Syntax # Ps LEFT PARENTHESIS
> -0029 ; Pattern_Syntax # Pe RIGHT PARENTHESIS
> -002A ; Pattern_Syntax # Po ASTERISK
> -002B ; Pattern_Syntax # Sm PLUS SIGN
> -002C ; Pattern_Syntax # Po COMMA
> -002D ; Pattern_Syntax # Pd HYPHEN-MINUS
> -002E..002F ; Pattern_Syntax # Po [2] FULL STOP..SOLIDUS
> -003A..003B ; Pattern_Syntax # Po [2] COLON..SEMICOLON
> -003C..003E ; Pattern_Syntax # Sm [3] LESS-THAN SIGN..GREATER-THAN SIGN
> -003F..0040 ; Pattern_Syntax # Po [2] QUESTION MARK..COMMERCIAL AT
> -005B ; Pattern_Syntax # Ps LEFT SQUARE BRACKET
> -005C ; Pattern_Syntax # Po REVERSE SOLIDUS
> -005D ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET
> -005E ; Pattern_Syntax # Sk CIRCUMFLEX ACCENT
> -0060 ; Pattern_Syntax # Sk GRAVE ACCENT
> -007B ; Pattern_Syntax # Ps LEFT CURLY BRACKET
> -007C ; Pattern_Syntax # Sm VERTICAL LINE
> -007D ; Pattern_Syntax # Pe RIGHT CURLY BRACKET
> -007E ; Pattern_Syntax # Sm TILDE
> -00A1 ; Pattern_Syntax # Po INVERTED EXCLAMATION MARK
> -00A2..00A5 ; Pattern_Syntax # Sc [4] CENT SIGN..YEN SIGN
> -00A6 ; Pattern_Syntax # So BROKEN BAR
> -00A7 ; Pattern_Syntax # Po SECTION SIGN
> -00A9 ; Pattern_Syntax # So COPYRIGHT SIGN
> -00AB ; Pattern_Syntax # Pi LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
> -00AC ; Pattern_Syntax # Sm NOT SIGN
> -00AE ; Pattern_Syntax # So REGISTERED SIGN
> -00B0 ; Pattern_Syntax # So DEGREE SIGN
> -00B1 ; Pattern_Syntax # Sm PLUS-MINUS SIGN
> -00B6 ; Pattern_Syntax # Po PILCROW SIGN
> -00BB ; Pattern_Syntax # Pf RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
> -00BF ; Pattern_Syntax # Po INVERTED QUESTION MARK
> -00D7 ; Pattern_Syntax # Sm MULTIPLICATION SIGN
> -00F7 ; Pattern_Syntax # Sm DIVISION SIGN
> -2010..2015 ; Pattern_Syntax # Pd [6] HYPHEN..HORIZONTAL BAR
> -2016..2017 ; Pattern_Syntax # Po [2] DOUBLE VERTICAL LINE..DOUBLE LOW LINE
> -2018 ; Pattern_Syntax # Pi LEFT SINGLE QUOTATION MARK
> -2019 ; Pattern_Syntax # Pf RIGHT SINGLE QUOTATION MARK
> -201A ; Pattern_Syntax # Ps SINGLE LOW-9 QUOTATION MARK
> -201B..201C ; Pattern_Syntax # Pi [2] SINGLE HIGH-REVERSED-9 QUOTATION MARK..LEFT DOUBLE QUOTATION MARK
> -201D ; Pattern_Syntax # Pf RIGHT DOUBLE QUOTATION MARK
> -201E ; Pattern_Syntax # Ps DOUBLE LOW-9 QUOTATION MARK
> -201F ; Pattern_Syntax # Pi DOUBLE HIGH-REVERSED-9 QUOTATION MARK
> -2020..2027 ; Pattern_Syntax # Po [8] DAGGER..HYPHENATION POINT
> -2030..2038 ; Pattern_Syntax # Po [9] PER MILLE SIGN..CARET
> -2039 ; Pattern_Syntax # Pi SINGLE LEFT-POINTING ANGLE QUOTATION MARK
> -203A ; Pattern_Syntax # Pf SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
> -203B..203E ; Pattern_Syntax # Po [4] REFERENCE MARK..OVERLINE
> -2041..2043 ; Pattern_Syntax # Po [3] CARET INSERTION POINT..HYPHEN BULLET
> -2044 ; Pattern_Syntax # Sm FRACTION SLASH
> -2045 ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH QUILL
> -2046 ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH QUILL
> -2047..2051 ; Pattern_Syntax # Po [11] DOUBLE QUESTION MARK..TWO ASTERISKS ALIGNED VERTICALLY
> -2052 ; Pattern_Syntax # Sm COMMERCIAL MINUS SIGN
> -2053 ; Pattern_Syntax # Po SWUNG DASH
> -2055..205E ; Pattern_Syntax # Po [10] FLOWER PUNCTUATION MARK..VERTICAL FOUR DOTS
> -2190..2194 ; Pattern_Syntax # Sm [5] LEFTWARDS ARROW..LEFT RIGHT ARROW
> -2195..2199 ; Pattern_Syntax # So [5] UP DOWN ARROW..SOUTH WEST ARROW
> -219A..219B ; Pattern_Syntax # Sm [2] LEFTWARDS ARROW WITH STROKE..RIGHTWARDS ARROW WITH STROKE
> -219C..219F ; Pattern_Syntax # So [4] LEFTWARDS WAVE ARROW..UPWARDS TWO HEADED ARROW
> -21A0 ; Pattern_Syntax # Sm RIGHTWARDS TWO HEADED ARROW
> -21A1..21A2 ; Pattern_Syntax # So [2] DOWNWARDS TWO HEADED ARROW..LEFTWARDS ARROW WITH TAIL
> -21A3 ; Pattern_Syntax # Sm RIGHTWARDS ARROW WITH TAIL
> -21A4..21A5 ; Pattern_Syntax # So [2] LEFTWARDS ARROW FROM BAR..UPWARDS ARROW FROM BAR
> -21A6 ; Pattern_Syntax # Sm RIGHTWARDS ARROW FROM BAR
> -21A7..21AD ; Pattern_Syntax # So [7] DOWNWARDS ARROW FROM BAR..LEFT RIGHT WAVE ARROW
> -21AE ; Pattern_Syntax # Sm LEFT RIGHT ARROW WITH STROKE
> -21AF..21CD ; Pattern_Syntax # So [31] DOWNWARDS ZIGZAG ARROW..LEFTWARDS DOUBLE ARROW WITH STROKE
> -21CE..21CF ; Pattern_Syntax # Sm [2] LEFT RIGHT DOUBLE ARROW WITH STROKE..RIGHTWARDS DOUBLE ARROW WITH STROKE
> -21D0..21D1 ; Pattern_Syntax # So [2] LEFTWARDS DOUBLE ARROW..UPWARDS DOUBLE ARROW
> -21D2 ; Pattern_Syntax # Sm RIGHTWARDS DOUBLE ARROW
> -21D3 ; Pattern_Syntax # So DOWNWARDS DOUBLE ARROW
> -21D4 ; Pattern_Syntax # Sm LEFT RIGHT DOUBLE ARROW
> -21D5..21F3 ; Pattern_Syntax # So [31] UP DOWN DOUBLE ARROW..UP DOWN WHITE ARROW
> -21F4..22FF ; Pattern_Syntax # Sm [268] RIGHT ARROW WITH SMALL CIRCLE..Z NOTATION BAG MEMBERSHIP
> -2300..2307 ; Pattern_Syntax # So [8] DIAMETER SIGN..WAVY LINE
> -2308 ; Pattern_Syntax # Ps LEFT CEILING
> -2309 ; Pattern_Syntax # Pe RIGHT CEILING
> -230A ; Pattern_Syntax # Ps LEFT FLOOR
> -230B ; Pattern_Syntax # Pe RIGHT FLOOR
> -230C..231F ; Pattern_Syntax # So [20] BOTTOM RIGHT CROP..BOTTOM RIGHT CORNER
> -2320..2321 ; Pattern_Syntax # Sm [2] TOP HALF INTEGRAL..BOTTOM HALF INTEGRAL
> -2322..2328 ; Pattern_Syntax # So [7] FROWN..KEYBOARD
> -2329 ; Pattern_Syntax # Ps LEFT-POINTING ANGLE BRACKET
> -232A ; Pattern_Syntax # Pe RIGHT-POINTING ANGLE BRACKET
> -232B..237B ; Pattern_Syntax # So [81] ERASE TO THE LEFT..NOT CHECK MARK
> -237C ; Pattern_Syntax # Sm RIGHT ANGLE WITH DOWNWARDS ZIGZAG ARROW
> -237D..239A ; Pattern_Syntax # So [30] SHOULDERED OPEN BOX..CLEAR SCREEN SYMBOL
> -239B..23B3 ; Pattern_Syntax # Sm [25] LEFT PARENTHESIS UPPER HOOK..SUMMATION BOTTOM
> -23B4..23DB ; Pattern_Syntax # So [40] TOP SQUARE BRACKET..FUSE
> -23DC..23E1 ; Pattern_Syntax # Sm [6] TOP PARENTHESIS..BOTTOM TORTOISE SHELL BRACKET
> -23E2..2426 ; Pattern_Syntax # So [69] WHITE TRAPEZIUM..SYMBOL FOR SUBSTITUTE FORM TWO
> -2427..243F ; Pattern_Syntax # Cn [25] <reserved-2427>..<reserved-243F>
> -2440..244A ; Pattern_Syntax # So [11] OCR HOOK..OCR DOUBLE BACKSLASH
> -244B..245F ; Pattern_Syntax # Cn [21] <reserved-244B>..<reserved-245F>
> -2500..25B6 ; Pattern_Syntax # So [183] BOX DRAWINGS LIGHT HORIZONTAL..BLACK RIGHT-POINTING TRIANGLE
> -25B7 ; Pattern_Syntax # Sm WHITE RIGHT-POINTING TRIANGLE
> -25B8..25C0 ; Pattern_Syntax # So [9] BLACK RIGHT-POINTING SMALL TRIANGLE..BLACK LEFT-POINTING TRIANGLE
> -25C1 ; Pattern_Syntax # Sm WHITE LEFT-POINTING TRIANGLE
> -25C2..25F7 ; Pattern_Syntax # So [54] BLACK LEFT-POINTING SMALL TRIANGLE..WHITE CIRCLE WITH UPPER RIGHT QUADRANT
> -25F8..25FF ; Pattern_Syntax # Sm [8] UPPER LEFT TRIANGLE..LOWER RIGHT TRIANGLE
> -2600..266E ; Pattern_Syntax # So [111] BLACK SUN WITH RAYS..MUSIC NATURAL SIGN
> -266F ; Pattern_Syntax # Sm MUSIC SHARP SIGN
> -2670..2767 ; Pattern_Syntax # So [248] WEST SYRIAC CROSS..ROTATED FLORAL HEART BULLET
> -2768 ; Pattern_Syntax # Ps MEDIUM LEFT PARENTHESIS ORNAMENT
> -2769 ; Pattern_Syntax # Pe MEDIUM RIGHT PARENTHESIS ORNAMENT
> -276A ; Pattern_Syntax # Ps MEDIUM FLATTENED LEFT PARENTHESIS ORNAMENT
> -276B ; Pattern_Syntax # Pe MEDIUM FLATTENED RIGHT PARENTHESIS ORNAMENT
> -276C ; Pattern_Syntax # Ps MEDIUM LEFT-POINTING ANGLE BRACKET ORNAMENT
> -276D ; Pattern_Syntax # Pe MEDIUM RIGHT-POINTING ANGLE BRACKET ORNAMENT
> -276E ; Pattern_Syntax # Ps HEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENT
> -276F ; Pattern_Syntax # Pe HEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENT
> -2770 ; Pattern_Syntax # Ps HEAVY LEFT-POINTING ANGLE BRACKET ORNAMENT
> -2771 ; Pattern_Syntax # Pe HEAVY RIGHT-POINTING ANGLE BRACKET ORNAMENT
> -2772 ; Pattern_Syntax # Ps LIGHT LEFT TORTOISE SHELL BRACKET ORNAMENT
> -2773 ; Pattern_Syntax # Pe LIGHT RIGHT TORTOISE SHELL BRACKET ORNAMENT
> -2774 ; Pattern_Syntax # Ps MEDIUM LEFT CURLY BRACKET ORNAMENT
> -2775 ; Pattern_Syntax # Pe MEDIUM RIGHT CURLY BRACKET ORNAMENT
> -2794..27BF ; Pattern_Syntax # So [44] HEAVY WIDE-HEADED RIGHTWARDS ARROW..DOUBLE CURLY LOOP
> -27C0..27C4 ; Pattern_Syntax # Sm [5] THREE DIMENSIONAL ANGLE..OPEN SUPERSET
> -27C5 ; Pattern_Syntax # Ps LEFT S-SHAPED BAG DELIMITER
> -27C6 ; Pattern_Syntax # Pe RIGHT S-SHAPED BAG DELIMITER
> -27C7..27E5 ; Pattern_Syntax # Sm [31] OR WITH DOT INSIDE..WHITE SQUARE WITH RIGHTWARDS TICK
> -27E6 ; Pattern_Syntax # Ps MATHEMATICAL LEFT WHITE SQUARE BRACKET
> -27E7 ; Pattern_Syntax # Pe MATHEMATICAL RIGHT WHITE SQUARE BRACKET
> -27E8 ; Pattern_Syntax # Ps MATHEMATICAL LEFT ANGLE BRACKET
> -27E9 ; Pattern_Syntax # Pe MATHEMATICAL RIGHT ANGLE BRACKET
> -27EA ; Pattern_Syntax # Ps MATHEMATICAL LEFT DOUBLE ANGLE BRACKET
> -27EB ; Pattern_Syntax # Pe MATHEMATICAL RIGHT DOUBLE ANGLE BRACKET
> -27EC ; Pattern_Syntax # Ps MATHEMATICAL LEFT WHITE TORTOISE SHELL BRACKET
> -27ED ; Pattern_Syntax # Pe MATHEMATICAL RIGHT WHITE TORTOISE SHELL BRACKET
> -27EE ; Pattern_Syntax # Ps MATHEMATICAL LEFT FLATTENED PARENTHESIS
> -27EF ; Pattern_Syntax # Pe MATHEMATICAL RIGHT FLATTENED PARENTHESIS
> -27F0..27FF ; Pattern_Syntax # Sm [16] UPWARDS QUADRUPLE ARROW..LONG RIGHTWARDS SQUIGGLE ARROW
> -2800..28FF ; Pattern_Syntax # So [256] BRAILLE PATTERN BLANK..BRAILLE PATTERN DOTS-12345678
> -2900..2982 ; Pattern_Syntax # Sm [131] RIGHTWARDS TWO-HEADED ARROW WITH VERTICAL STROKE..Z NOTATION TYPE COLON
> -2983 ; Pattern_Syntax # Ps LEFT WHITE CURLY BRACKET
> -2984 ; Pattern_Syntax # Pe RIGHT WHITE CURLY BRACKET
> -2985 ; Pattern_Syntax # Ps LEFT WHITE PARENTHESIS
> -2986 ; Pattern_Syntax # Pe RIGHT WHITE PARENTHESIS
> -2987 ; Pattern_Syntax # Ps Z NOTATION LEFT IMAGE BRACKET
> -2988 ; Pattern_Syntax # Pe Z NOTATION RIGHT IMAGE BRACKET
> -2989 ; Pattern_Syntax # Ps Z NOTATION LEFT BINDING BRACKET
> -298A ; Pattern_Syntax # Pe Z NOTATION RIGHT BINDING BRACKET
> -298B ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH UNDERBAR
> -298C ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH UNDERBAR
> -298D ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH TICK IN TOP CORNER
> -298E ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
> -298F ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
> -2990 ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH TICK IN TOP CORNER
> -2991 ; Pattern_Syntax # Ps LEFT ANGLE BRACKET WITH DOT
> -2992 ; Pattern_Syntax # Pe RIGHT ANGLE BRACKET WITH DOT
> -2993 ; Pattern_Syntax # Ps LEFT ARC LESS-THAN BRACKET
> -2994 ; Pattern_Syntax # Pe RIGHT ARC GREATER-THAN BRACKET
> -2995 ; Pattern_Syntax # Ps DOUBLE LEFT ARC GREATER-THAN BRACKET
> -2996 ; Pattern_Syntax # Pe DOUBLE RIGHT ARC LESS-THAN BRACKET
> -2997 ; Pattern_Syntax # Ps LEFT BLACK TORTOISE SHELL BRACKET
> -2998 ; Pattern_Syntax # Pe RIGHT BLACK TORTOISE SHELL BRACKET
> -2999..29D7 ; Pattern_Syntax # Sm [63] DOTTED FENCE..BLACK HOURGLASS
> -29D8 ; Pattern_Syntax # Ps LEFT WIGGLY FENCE
> -29D9 ; Pattern_Syntax # Pe RIGHT WIGGLY FENCE
> -29DA ; Pattern_Syntax # Ps LEFT DOUBLE WIGGLY FENCE
> -29DB ; Pattern_Syntax # Pe RIGHT DOUBLE WIGGLY FENCE
> -29DC..29FB ; Pattern_Syntax # Sm [32] INCOMPLETE INFINITY..TRIPLE PLUS
> -29FC ; Pattern_Syntax # Ps LEFT-POINTING CURVED ANGLE BRACKET
> -29FD ; Pattern_Syntax # Pe RIGHT-POINTING CURVED ANGLE BRACKET
> -29FE..2AFF ; Pattern_Syntax # Sm [258] TINY..N-ARY WHITE VERTICAL BAR
> -2B00..2B2F ; Pattern_Syntax # So [48] NORTH EAST WHITE ARROW..WHITE VERTICAL ELLIPSE
> -2B30..2B44 ; Pattern_Syntax # Sm [21] LEFT ARROW WITH SMALL CIRCLE..RIGHTWARDS ARROW THROUGH SUPERSET
> -2B45..2B46 ; Pattern_Syntax # So [2] LEFTWARDS QUADRUPLE ARROW..RIGHTWARDS QUADRUPLE ARROW
> -2B47..2B4C ; Pattern_Syntax # Sm [6] REVERSE TILDE OPERATOR ABOVE RIGHTWARDS ARROW..RIGHTWARDS ARROW ABOVE REVERSE TILDE OPERATOR
> -2B4D..2B73 ; Pattern_Syntax # So [39] DOWNWARDS TRIANGLE-HEADED ZIGZAG ARROW..DOWNWARDS TRIANGLE-HEADED ARROW TO BAR
> -2B74..2B75 ; Pattern_Syntax # Cn [2] <reserved-2B74>..<reserved-2B75>
> -2B76..2B95 ; Pattern_Syntax # So [32] NORTH WEST TRIANGLE-HEADED ARROW TO BAR..RIGHTWARDS BLACK ARROW
> -2B96 ; Pattern_Syntax # Cn <reserved-2B96>
> -2B97..2BFF ; Pattern_Syntax # So [105] SYMBOL FOR TYPE A ELECTRONICS..HELLSCHREIBER PAUSE SYMBOL
> -2E00..2E01 ; Pattern_Syntax # Po [2] RIGHT ANGLE SUBSTITUTION MARKER..RIGHT ANGLE DOTTED SUBSTITUTION MARKER
> -2E02 ; Pattern_Syntax # Pi LEFT SUBSTITUTION BRACKET
> -2E03 ; Pattern_Syntax # Pf RIGHT SUBSTITUTION BRACKET
> -2E04 ; Pattern_Syntax # Pi LEFT DOTTED SUBSTITUTION BRACKET
> -2E05 ; Pattern_Syntax # Pf RIGHT DOTTED SUBSTITUTION BRACKET
> -2E06..2E08 ; Pattern_Syntax # Po [3] RAISED INTERPOLATION MARKER..DOTTED TRANSPOSITION MARKER
> -2E09 ; Pattern_Syntax # Pi LEFT TRANSPOSITION BRACKET
> -2E0A ; Pattern_Syntax # Pf RIGHT TRANSPOSITION BRACKET
> -2E0B ; Pattern_Syntax # Po RAISED SQUARE
> -2E0C ; Pattern_Syntax # Pi LEFT RAISED OMISSION BRACKET
> -2E0D ; Pattern_Syntax # Pf RIGHT RAISED OMISSION BRACKET
> -2E0E..2E16 ; Pattern_Syntax # Po [9] EDITORIAL CORONIS..DOTTED RIGHT-POINTING ANGLE
> -2E17 ; Pattern_Syntax # Pd DOUBLE OBLIQUE HYPHEN
> -2E18..2E19 ; Pattern_Syntax # Po [2] INVERTED INTERROBANG..PALM BRANCH
> -2E1A ; Pattern_Syntax # Pd HYPHEN WITH DIAERESIS
> -2E1B ; Pattern_Syntax # Po TILDE WITH RING ABOVE
> -2E1C ; Pattern_Syntax # Pi LEFT LOW PARAPHRASE BRACKET
> -2E1D ; Pattern_Syntax # Pf RIGHT LOW PARAPHRASE BRACKET
> -2E1E..2E1F ; Pattern_Syntax # Po [2] TILDE WITH DOT ABOVE..TILDE WITH DOT BELOW
> -2E20 ; Pattern_Syntax # Pi LEFT VERTICAL BAR WITH QUILL
> -2E21 ; Pattern_Syntax # Pf RIGHT VERTICAL BAR WITH QUILL
> -2E22 ; Pattern_Syntax # Ps TOP LEFT HALF BRACKET
> -2E23 ; Pattern_Syntax # Pe TOP RIGHT HALF BRACKET
> -2E24 ; Pattern_Syntax # Ps BOTTOM LEFT HALF BRACKET
> -2E25 ; Pattern_Syntax # Pe BOTTOM RIGHT HALF BRACKET
> -2E26 ; Pattern_Syntax # Ps LEFT SIDEWAYS U BRACKET
> -2E27 ; Pattern_Syntax # Pe RIGHT SIDEWAYS U BRACKET
> -2E28 ; Pattern_Syntax # Ps LEFT DOUBLE PARENTHESIS
> -2E29 ; Pattern_Syntax # Pe RIGHT DOUBLE PARENTHESIS
> -2E2A..2E2E ; Pattern_Syntax # Po [5] TWO DOTS OVER ONE DOT PUNCTUATION..REVERSED QUESTION MARK
> -2E2F ; Pattern_Syntax # Lm VERTICAL TILDE
> -2E30..2E39 ; Pattern_Syntax # Po [10] RING POINT..TOP HALF SECTION SIGN
> -2E3A..2E3B ; Pattern_Syntax # Pd [2] TWO-EM DASH..THREE-EM DASH
> -2E3C..2E3F ; Pattern_Syntax # Po [4] STENOGRAPHIC FULL STOP..CAPITULUM
> -2E40 ; Pattern_Syntax # Pd DOUBLE HYPHEN
> -2E41 ; Pattern_Syntax # Po REVERSED COMMA
> -2E42 ; Pattern_Syntax # Ps DOUBLE LOW-REVERSED-9 QUOTATION MARK
> -2E43..2E4F ; Pattern_Syntax # Po [13] DASH WITH LEFT UPTURN..CORNISH VERSE DIVIDER
> -2E50..2E51 ; Pattern_Syntax # So [2] CROSS PATTY WITH RIGHT CROSSBAR..CROSS PATTY WITH LEFT CROSSBAR
> -2E52..2E54 ; Pattern_Syntax # Po [3] TIRONIAN SIGN CAPITAL ET..MEDIEVAL QUESTION MARK
> -2E55 ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH STROKE
> -2E56 ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH STROKE
> -2E57 ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH DOUBLE STROKE
> -2E58 ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH DOUBLE STROKE
> -2E59 ; Pattern_Syntax # Ps TOP HALF LEFT PARENTHESIS
> -2E5A ; Pattern_Syntax # Pe TOP HALF RIGHT PARENTHESIS
> -2E5B ; Pattern_Syntax # Ps BOTTOM HALF LEFT PARENTHESIS
> -2E5C ; Pattern_Syntax # Pe BOTTOM HALF RIGHT PARENTHESIS
> -2E5D ; Pattern_Syntax # Pd OBLIQUE HYPHEN
> -2E5E..2E7F ; Pattern_Syntax # Cn [34] <reserved-2E5E>..<reserved-2E7F>
> -3001..3003 ; Pattern_Syntax # Po [3] IDEOGRAPHIC COMMA..DITTO MARK
> -3008 ; Pattern_Syntax # Ps LEFT ANGLE BRACKET
> -3009 ; Pattern_Syntax # Pe RIGHT ANGLE BRACKET
> -300A ; Pattern_Syntax # Ps LEFT DOUBLE ANGLE BRACKET
> -300B ; Pattern_Syntax # Pe RIGHT DOUBLE ANGLE BRACKET
> -300C ; Pattern_Syntax # Ps LEFT CORNER BRACKET
> -300D ; Pattern_Syntax # Pe RIGHT CORNER BRACKET
> -300E ; Pattern_Syntax # Ps LEFT WHITE CORNER BRACKET
> -300F ; Pattern_Syntax # Pe RIGHT WHITE CORNER BRACKET
> -3010 ; Pattern_Syntax # Ps LEFT BLACK LENTICULAR BRACKET
> -3011 ; Pattern_Syntax # Pe RIGHT BLACK LENTICULAR BRACKET
> -3012..3013 ; Pattern_Syntax # So [2] POSTAL MARK..GETA MARK
> -3014 ; Pattern_Syntax # Ps LEFT TORTOISE SHELL BRACKET
> -3015 ; Pattern_Syntax # Pe RIGHT TORTOISE SHELL BRACKET
> -3016 ; Pattern_Syntax # Ps LEFT WHITE LENTICULAR BRACKET
> -3017 ; Pattern_Syntax # Pe RIGHT WHITE LENTICULAR BRACKET
> -3018 ; Pattern_Syntax # Ps LEFT WHITE TORTOISE SHELL BRACKET
> -3019 ; Pattern_Syntax # Pe RIGHT WHITE TORTOISE SHELL BRACKET
> -301A ; Pattern_Syntax # Ps LEFT WHITE SQUARE BRACKET
> -301B ; Pattern_Syntax # Pe RIGHT WHITE SQUARE BRACKET
> -301C ; Pattern_Syntax # Pd WAVE DASH
> -301D ; Pattern_Syntax # Ps REVERSED DOUBLE PRIME QUOTATION MARK
> -301E..301F ; Pattern_Syntax # Pe [2] DOUBLE PRIME QUOTATION MARK..LOW DOUBLE PRIME QUOTATION MARK
> -3020 ; Pattern_Syntax # So POSTAL MARK FACE
> -3030 ; Pattern_Syntax # Pd WAVY DASH
> -FD3E ; Pattern_Syntax # Pe ORNATE LEFT PARENTHESIS
> -FD3F ; Pattern_Syntax # Ps ORNATE RIGHT PARENTHESIS
> -FE45..FE46 ; Pattern_Syntax # Po [2] SESAME DOT..WHITE SESAME DOT
> -
> -# Total code points: 2760
> -
> -# ================================================
> -
> -0600..0605 ; Prepended_Concatenation_Mark # Cf [6] ARABIC NUMBER SIGN..ARABIC NUMBER MARK ABOVE
> -06DD ; Prepended_Concatenation_Mark # Cf ARABIC END OF AYAH
> -070F ; Prepended_Concatenation_Mark # Cf SYRIAC ABBREVIATION MARK
> -0890..0891 ; Prepended_Concatenation_Mark # Cf [2] ARABIC POUND MARK ABOVE..ARABIC PIASTRE MARK ABOVE
> -08E2 ; Prepended_Concatenation_Mark # Cf ARABIC DISPUTED END OF AYAH
> -110BD ; Prepended_Concatenation_Mark # Cf KAITHI NUMBER SIGN
> -110CD ; Prepended_Concatenation_Mark # Cf KAITHI NUMBER SIGN ABOVE
> -
> -# Total code points: 13
> -
> -# ================================================
> -
> -1F1E6..1F1FF ; Regional_Indicator # So [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z
> -
> -# Total code points: 26
> -
> -# EOF
> diff --git a/localedata/unicode-gen/utf8_gen.py b/localedata/unicode-gen/utf8_gen.py
> index e273607b67..15ed7ad113 100755
> --- a/localedata/unicode-gen/utf8_gen.py
> +++ b/localedata/unicode-gen/utf8_gen.py
> @@ -1,6 +1,7 @@
> #!/usr/bin/python3
> # -*- coding: utf-8 -*-
> # Copyright (C) 2014-2024 Free Software Foundation, Inc.
> +# Copyright (C) 2024 The GNU Toolchain Authors.
> # This file is part of the GNU C Library.
> #
> # The GNU C Library is free software; you can redistribute it and/or
> @@ -28,7 +29,6 @@ It will output UTF-8 file
> '''
>
> import argparse
> -import sys
> import re
> import unicode_utils
>
> @@ -200,30 +200,40 @@ def write_header_charmap(outfile):
>
> def write_header_width(outfile, unicode_version):
> '''Writes the header on top of the WIDTH section to the output file'''
> - outfile.write('% Character width according to Unicode '
> - + '{:s}.\n'.format(unicode_version))
> - outfile.write('% - Default width is 1.\n')
> + outfile.write('% Character width according to Unicode {:s}.\n'.format(unicode_version))
> + outfile.write('% Width is determined by the following rules, in order of decreasing precedence:\n')
> + outfile.write('% - U+00AD SOFT HYPHEN has width 1, as a special case for compatibility (https://archive.is/b5Ck).\n')
> + outfile.write('% - U+115F HANGUL CHOSEONG FILLER has width 2.\n')
> + outfile.write('% This character stands in for an intentionally omitted leading consonant\n')
> + outfile.write('% in a Hangul syllable block; as such it must be assigned width 2 despite its lack\n')
> + outfile.write('% of visible display to ensure that the complete block has the correct width.\n')
> + outfile.write('% (See below for more information on Hangul syllables.)\n')
> + outfile.write('% - Combining jungseong and jongseong Hangul jamo have width 0; generated from\n')
> + outfile.write('% "grep \'^[^;]*;[VT]\' HangulSyllableType.txt".\n')
> + outfile.write('% One composed Hangul "syllable block" like 퓛 is made up of\n')
> + outfile.write('% two to three individual component characters called "jamo".\n')
> + outfile.write('% The complete block must have total width 2;\n')
> + outfile.write('% to achieve this, we assign a width of 2 to leading "choseong" jamo,\n')
> + outfile.write('% and of 0 to medial vowel "jungseong" and trailing "jongseong" jamo.\n')
> + outfile.write('% - Non-spacing and enclosing marks have width 0; generated from\n')
> + outfile.write('% "grep -E \'^[^;]*;[^;]*;(Mn|Me);\' UnicodeData.txt".\n')
> + outfile.write('% - "Default_Ignorable_Code_Point"s have width 0; generated from\n')
> + outfile.write('% "grep \'^[^;]*;\\s*Default_Ignorable_Code_Point\' DerivedCoreProperties.txt".\n')
> outfile.write('% - Double-width characters have width 2; generated from\n')
> - outfile.write('% "grep \'^[^;]*;\\s*[WF]\' EastAsianWidth.txt"\n')
> - outfile.write('% - Non-spacing characters have width 0; '
> - + 'generated from PropList.txt or\n')
> - outfile.write('% "grep \'^[^;]*;[^;]*;[^;]*;[^;]*;NSM;\' '
> - + 'UnicodeData.txt"\n')
> - outfile.write('% - Format control characters have width 0; '
> - + 'generated from\n')
> - outfile.write("% \"grep '^[^;]*;[^;]*;Cf;' UnicodeData.txt\"\n")
> -# Not needed covered by Cf
> -# outfile.write("% - Zero width characters have width 0; generated from\n")
> -# outfile.write("% \"grep '^[^;]*;ZERO WIDTH ' UnicodeData.txt\"\n")
> + outfile.write('% "grep \'^[^;]*;[WF]\' EastAsianWidth.txt".\n')
> + outfile.write('% - Default width for all other characters is 1.\n')
> outfile.write("WIDTH\n")
>
> -def process_width(outfile, ulines, elines, plines):
> - '''ulines are lines from UnicodeData.txt, elines are lines from
> - EastAsianWidth.txt containing characters with width “W” or “F”,
> - plines are lines from PropList.txt which contain characters
> - with the property “Prepended_Concatenation_Mark”.
> -
> +def process_width(outfile, ulines, dlines, elines, klines):
> + '''ulines are lines from UnicodeData.txt.
> + elines are lines from EastAsianWidth.txt containing characters with width
> + “W” or “F”.
> + dlines are lines from DerivedCoreProperties.txt which contain
> + characters with the property “Default_Ignorable_Code_Point”.
> + klines are lines from HangulSyllableType.txt which contain characters
> + with syllable type “V” or “T”.
> '''
> + # Wide and fullwidth characters have width 1
> width_dict = {}
> for line in elines:
> fields = line.split(";")
> @@ -235,14 +245,14 @@ def process_width(outfile, ulines, elines, plines):
> int(code_points[1], 16)+1):
> width_dict[key] = 2
>
> + # Nonspacing and enclosing marks have width 0
> for line in ulines:
> fields = line.split(";")
> - if fields[4] == "NSM" or fields[2] in ("Cf", "Me", "Mn"):
> + if fields[4] == "NSM" or fields[2] in ("Me", "Mn"):
> width_dict[int(fields[0], 16)] = 0
>
> - for line in plines:
> - # Characters with the property “Prepended_Concatenation_Mark”
> - # should have the width 1:
> + # Conjoining vowel and trailing jamo have width 0
> + for line in klines:
> fields = line.split(";")
> if not '..' in fields[0]:
> code_points = (fields[0], fields[0])
> @@ -250,21 +260,26 @@ def process_width(outfile, ulines, elines, plines):
> code_points = fields[0].split("..")
> for key in range(int(code_points[0], 16),
> int(code_points[1], 16)+1):
> - del width_dict[key] # default width is 1
> -
> - # handle special cases for compatibility
> - for key in list((0x00AD,)):
> - # https://www.cs.tut.fi/~jkorpela/shy.html
> - if key in width_dict:
> - del width_dict[key] # default width is 1
> - for key in list(range(0x1160, 0x1200)):
> - # Hangul jungseong and jongseong:
> - if key in unicode_utils.UNICODE_ATTRIBUTES:
> - width_dict[key] = 0
> - for key in list(range(0xD7B0, 0xD800)):
> - # Hangul jungseong and jongseong:
> - if key in unicode_utils.UNICODE_ATTRIBUTES:
> width_dict[key] = 0
> +
> + # “Default_Ignorable_Code_Point”s have width 0
> + for line in dlines:
> + fields = line.split(";")
> + if not '..' in fields[0]:
> + code_points = (fields[0], fields[0])
> + else:
> + code_points = fields[0].split("..")
> + for key in range(int(code_points[0], 16),
> + int(code_points[1], 16)+1):
> + width_dict[key] = 0 # default width is 1
> +
> +
> + # Special case: U+00AD SOFT HYPHEN
> + del width_dict[0x00AD]
> +
> + # Special case: U+115F HANGUL CHOSEONG FILLER
> + width_dict[0x115F] = 2
> +
> for key in list(range(0x3248, 0x3250)):
> # These are “A” which means we can decide whether to treat them
> # as “W” or “N” based on context:
> @@ -302,7 +317,7 @@ def process_width(outfile, ulines, elines, plines):
> if __name__ == "__main__":
> PARSER = argparse.ArgumentParser(
> description='''
> - Generate a UTF-8 file from UnicodeData.txt, EastAsianWidth.txt, and PropList.txt.
> + Generate a UTF-8 file from UnicodeData.txt, DerivedCoreProperties.txt, EastAsianWidth.txt, and HangulSyllableType.txt
> ''')
> PARSER.add_argument(
> '-u', '--unicode_data_file',
> @@ -311,6 +326,13 @@ if __name__ == "__main__":
> default='UnicodeData.txt',
> help=('The UnicodeData.txt file to read, '
> + 'default: %(default)s'))
> + PARSER.add_argument(
> + '-d', '--derived_core_properties_file',
> + nargs='?',
> + type=str,
> + default='DerivedCoreProperties.txt',
> + help=('The DerivedCoreProperties.txt file to read, '
> + + 'default: %(default)s'))
> PARSER.add_argument(
> '-e', '--east_asian_with_file',
> nargs='?',
> @@ -319,11 +341,11 @@ if __name__ == "__main__":
> help=('The EastAsianWidth.txt file to read, '
> + 'default: %(default)s'))
> PARSER.add_argument(
> - '-p', '--prop_list_file',
> + '-k', '--hangul_syllable_type_file',
> nargs='?',
> type=str,
> - default='PropList.txt',
> - help=('The PropList.txt file to read, '
> + default='HangulSyllableType.txt',
> + help=('The HangulSyllableType.txt file to read, '
> + 'default: %(default)s'))
> PARSER.add_argument(
> '--unicode_version',
> @@ -336,27 +358,35 @@ if __name__ == "__main__":
> unicode_utils.fill_attributes(ARGS.unicode_data_file)
> with open(ARGS.unicode_data_file, mode='r') as UNIDATA_FILE:
> UNICODE_DATA_LINES = UNIDATA_FILE.readlines()
> - with open(ARGS.east_asian_with_file, mode='r') as EAST_ASIAN_WIDTH_FILE:
> - EAST_ASIAN_WIDTH_LINES = []
> - for LINE in EAST_ASIAN_WIDTH_FILE:
> - # If characters from EastAsianWidth.txt which are from
> - # reserved ranges (i.e. not yet assigned code points)
> + with open(ARGS.derived_core_properties_file, mode='r') as DERIVED_CORE_PROPERTIES_FILE:
> + DERIVED_CORE_PROPERTIES_LINES = []
> + for LINE in DERIVED_CORE_PROPERTIES_FILE:
> + # If characters which are from reserved ranges
> + # (i.e. not yet assigned code points)
> # are added to the WIDTH section of the UTF-8 file, then
> # “make check” produces “Unknown Character” errors for
> # these code points because such unassigned code points
> # are not in the CHARMAP section of the UTF-8 file.
> #
> - # Therefore, we skip all reserved code points when reading
> - # the EastAsianWidth.txt file.
> - if re.match(r'.*<reserved-.+>\.\.<reserved-.+>.*', LINE):
> + # Therefore, we skip all reserved code points.
> + if re.match(r'.*<reserved-.+>', LINE):
> + continue
> + if re.match(r'^[^;]*;\s*Default_Ignorable_Code_Point', LINE):
> + DERIVED_CORE_PROPERTIES_LINES.append(LINE.strip())
> + with open(ARGS.east_asian_with_file, mode='r') as EAST_ASIAN_WIDTH_FILE:
> + EAST_ASIAN_WIDTH_LINES = []
> + for LINE in EAST_ASIAN_WIDTH_FILE:
> + if re.match(r'.*<reserved-.+>', LINE):
> continue
> if re.match(r'^[^;]*;\s*[WF]', LINE):
> EAST_ASIAN_WIDTH_LINES.append(LINE.strip())
> - with open(ARGS.prop_list_file, mode='r') as PROP_LIST_FILE:
> - PROP_LIST_LINES = []
> - for LINE in PROP_LIST_FILE:
> - if re.match(r'^[^;]*;[\s]*Prepended_Concatenation_Mark', LINE):
> - PROP_LIST_LINES.append(LINE.strip())
> + with open(ARGS.hangul_syllable_type_file, mode='r') as HANGUL_SYLLABLE_TYPE_FILE:
> + HANGUL_SYLLABLE_TYPE_LINES = []
> + for LINE in HANGUL_SYLLABLE_TYPE_FILE:
> + if re.match(r'.*<reserved-.+>', LINE):
> + continue
> + if re.match(r'^[^;]*;\s*[VT]', LINE):
> + HANGUL_SYLLABLE_TYPE_LINES.append(LINE.strip())
> with open('UTF-8', mode='w') as OUTFILE:
> # Processing UnicodeData.txt and write CHARMAP to UTF-8 file
> write_header_charmap(OUTFILE)
> @@ -366,6 +396,7 @@ if __name__ == "__main__":
> write_header_width(OUTFILE, ARGS.unicode_version)
> process_width(OUTFILE,
> UNICODE_DATA_LINES,
> + DERIVED_CORE_PROPERTIES_LINES,
> EAST_ASIAN_WIDTH_LINES,
> - PROP_LIST_LINES)
> + HANGUL_SYLLABLE_TYPE_LINES)
> OUTFILE.write("END WIDTH\n")
@@ -172,7 +172,7 @@ unicode:
localedata/unicode-gen/unicode-license.txt
localedata/unicode-gen/DerivedCoreProperties.txt
localedata/unicode-gen/EastAsianWidth.txt
- localedata/unicode-gen/PropList.txt
+ localedata/unicode-gen/HangulSyllableType.txt
# The following files are shared with the upstream tzcode project and must be
# updated regularly to stay in sync with the upstream releases.
@@ -49856,13 +49856,27 @@ CHARMAP
END CHARMAP
% Character width according to Unicode 15.1.0.
-% - Default width is 1.
+% Width is determined by the following rules, in order of decreasing precedence:
+% - U+00AD SOFT HYPHEN has width 1, as a special case for compatibility (https://archive.is/b5Ck).
+% - U+115F HANGUL CHOSEONG FILLER has width 2.
+% This character stands in for an intentionally omitted leading consonant
+% in a Hangul syllable block; as such it must be assigned width 2 despite its lack
+% of visible display to ensure that the complete block has the correct width.
+% (See below for more information on Hangul syllables.)
+% - Combining jungseong and jongseong Hangul jamo have width 0; generated from
+% "grep '^[^;]*;[VT]' HangulSyllableType.txt".
+% One composed Hangul "syllable block" like 퓛 is made up of
+% two to three individual component characters called "jamo".
+% The complete block must have total width 2;
+% to achieve this, we assign a width of 2 to leading "choseong" jamo,
+% and of 0 to medial vowel "jungseong" and trailing "jongseong" jamo.
+% - Non-spacing and enclosing marks have width 0; generated from
+% "grep -E '^[^;]*;[^;]*;(Mn|Me);' UnicodeData.txt".
+% - "Default_Ignorable_Code_Point"s have width 0; generated from
+% "grep '^[^;]*;\s*Default_Ignorable_Code_Point' DerivedCoreProperties.txt".
% - Double-width characters have width 2; generated from
-% "grep '^[^;]*;\s*[WF]' EastAsianWidth.txt"
-% - Non-spacing characters have width 0; generated from PropList.txt or
-% "grep '^[^;]*;[^;]*;[^;]*;[^;]*;NSM;' UnicodeData.txt"
-% - Format control characters have width 0; generated from
-% "grep '^[^;]*;[^;]*;Cf;' UnicodeData.txt"
+% "grep '^[^;]*;[WF]' EastAsianWidth.txt".
+% - Default width for all other characters is 1.
WIDTH
<U0300>...<U036F> 0
<U0483>...<U0489> 0
@@ -50083,7 +50097,9 @@ WIDTH
<U3099>...<U309A> 0
<U309B>...<U30FF> 2
<U3105>...<U312F> 2
-<U3131>...<U318E> 2
+<U3131>...<U3163> 2
+<U3164> 0
+<U3165>...<U318E> 2
<U3190>...<U31E3> 2
<U31EF>...<U321E> 2
<U3220>...<UA48C> 2
@@ -50138,8 +50154,8 @@ WIDTH
<UFE68>...<UFE6B> 2
<UFEFF> 0
<UFF01>...<UFF60> 2
+<UFFA0> 0
<UFFE0>...<UFFE6> 2
-<UFFF9>...<UFFFB> 0
<U000101FD> 0
<U000102E0> 0
<U00010376>...<U0001037A> 0
@@ -50240,7 +50256,7 @@ WIDTH
<U00011F36>...<U00011F3A> 0
<U00011F40> 0
<U00011F42> 0
-<U00013430>...<U00013440> 0
+<U00013440> 0
<U00013447>...<U00013455> 0
<U00016AF0>...<U00016AF4> 0
<U00016B30>...<U00016B36> 0
new file mode 100644
@@ -0,0 +1,858 @@
+# HangulSyllableType-15.1.0.txt
+# Date: 2023-01-05, 20:34:42 GMT
+# © 2023 Unicode®, Inc.
+# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
+# For terms of use, see https://www.unicode.org/terms_of_use.html
+#
+# Unicode Character Database
+# For documentation, see https://www.unicode.org/reports/tr44/
+
+# ================================================
+
+# Property: Hangul_Syllable_Type
+
+# All code points not explicitly listed for Hangul_Syllable_Type
+# have the value Not_Applicable (NA).
+
+# @missing: 0000..10FFFF; Not_Applicable
+
+# ================================================
+
+# Hangul_Syllable_Type=Leading_Jamo
+
+1100..115F ; L # Lo [96] HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG FILLER
+A960..A97C ; L # Lo [29] HANGUL CHOSEONG TIKEUT-MIEUM..HANGUL CHOSEONG SSANGYEORINHIEUH
+
+# Total code points: 125
+
+# ================================================
+
+# Hangul_Syllable_Type=Vowel_Jamo
+
+1160..11A7 ; V # Lo [72] HANGUL JUNGSEONG FILLER..HANGUL JUNGSEONG O-YAE
+D7B0..D7C6 ; V # Lo [23] HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARAEA-E
+
+# Total code points: 95
+
+# ================================================
+
+# Hangul_Syllable_Type=Trailing_Jamo
+
+11A8..11FF ; T # Lo [88] HANGUL JONGSEONG KIYEOK..HANGUL JONGSEONG SSANGNIEUN
+D7CB..D7FB ; T # Lo [49] HANGUL JONGSEONG NIEUN-RIEUL..HANGUL JONGSEONG PHIEUPH-THIEUTH
+
+# Total code points: 137
+
+# ================================================
+
+# Hangul_Syllable_Type=LV_Syllable
+
+AC00 ; LV # Lo HANGUL SYLLABLE GA
+AC1C ; LV # Lo HANGUL SYLLABLE GAE
+AC38 ; LV # Lo HANGUL SYLLABLE GYA
+AC54 ; LV # Lo HANGUL SYLLABLE GYAE
+AC70 ; LV # Lo HANGUL SYLLABLE GEO
+AC8C ; LV # Lo HANGUL SYLLABLE GE
+ACA8 ; LV # Lo HANGUL SYLLABLE GYEO
+ACC4 ; LV # Lo HANGUL SYLLABLE GYE
+ACE0 ; LV # Lo HANGUL SYLLABLE GO
+ACFC ; LV # Lo HANGUL SYLLABLE GWA
+AD18 ; LV # Lo HANGUL SYLLABLE GWAE
+AD34 ; LV # Lo HANGUL SYLLABLE GOE
+AD50 ; LV # Lo HANGUL SYLLABLE GYO
+AD6C ; LV # Lo HANGUL SYLLABLE GU
+AD88 ; LV # Lo HANGUL SYLLABLE GWEO
+ADA4 ; LV # Lo HANGUL SYLLABLE GWE
+ADC0 ; LV # Lo HANGUL SYLLABLE GWI
+ADDC ; LV # Lo HANGUL SYLLABLE GYU
+ADF8 ; LV # Lo HANGUL SYLLABLE GEU
+AE14 ; LV # Lo HANGUL SYLLABLE GYI
+AE30 ; LV # Lo HANGUL SYLLABLE GI
+AE4C ; LV # Lo HANGUL SYLLABLE GGA
+AE68 ; LV # Lo HANGUL SYLLABLE GGAE
+AE84 ; LV # Lo HANGUL SYLLABLE GGYA
+AEA0 ; LV # Lo HANGUL SYLLABLE GGYAE
+AEBC ; LV # Lo HANGUL SYLLABLE GGEO
+AED8 ; LV # Lo HANGUL SYLLABLE GGE
+AEF4 ; LV # Lo HANGUL SYLLABLE GGYEO
+AF10 ; LV # Lo HANGUL SYLLABLE GGYE
+AF2C ; LV # Lo HANGUL SYLLABLE GGO
+AF48 ; LV # Lo HANGUL SYLLABLE GGWA
+AF64 ; LV # Lo HANGUL SYLLABLE GGWAE
+AF80 ; LV # Lo HANGUL SYLLABLE GGOE
+AF9C ; LV # Lo HANGUL SYLLABLE GGYO
+AFB8 ; LV # Lo HANGUL SYLLABLE GGU
+AFD4 ; LV # Lo HANGUL SYLLABLE GGWEO
+AFF0 ; LV # Lo HANGUL SYLLABLE GGWE
+B00C ; LV # Lo HANGUL SYLLABLE GGWI
+B028 ; LV # Lo HANGUL SYLLABLE GGYU
+B044 ; LV # Lo HANGUL SYLLABLE GGEU
+B060 ; LV # Lo HANGUL SYLLABLE GGYI
+B07C ; LV # Lo HANGUL SYLLABLE GGI
+B098 ; LV # Lo HANGUL SYLLABLE NA
+B0B4 ; LV # Lo HANGUL SYLLABLE NAE
+B0D0 ; LV # Lo HANGUL SYLLABLE NYA
+B0EC ; LV # Lo HANGUL SYLLABLE NYAE
+B108 ; LV # Lo HANGUL SYLLABLE NEO
+B124 ; LV # Lo HANGUL SYLLABLE NE
+B140 ; LV # Lo HANGUL SYLLABLE NYEO
+B15C ; LV # Lo HANGUL SYLLABLE NYE
+B178 ; LV # Lo HANGUL SYLLABLE NO
+B194 ; LV # Lo HANGUL SYLLABLE NWA
+B1B0 ; LV # Lo HANGUL SYLLABLE NWAE
+B1CC ; LV # Lo HANGUL SYLLABLE NOE
+B1E8 ; LV # Lo HANGUL SYLLABLE NYO
+B204 ; LV # Lo HANGUL SYLLABLE NU
+B220 ; LV # Lo HANGUL SYLLABLE NWEO
+B23C ; LV # Lo HANGUL SYLLABLE NWE
+B258 ; LV # Lo HANGUL SYLLABLE NWI
+B274 ; LV # Lo HANGUL SYLLABLE NYU
+B290 ; LV # Lo HANGUL SYLLABLE NEU
+B2AC ; LV # Lo HANGUL SYLLABLE NYI
+B2C8 ; LV # Lo HANGUL SYLLABLE NI
+B2E4 ; LV # Lo HANGUL SYLLABLE DA
+B300 ; LV # Lo HANGUL SYLLABLE DAE
+B31C ; LV # Lo HANGUL SYLLABLE DYA
+B338 ; LV # Lo HANGUL SYLLABLE DYAE
+B354 ; LV # Lo HANGUL SYLLABLE DEO
+B370 ; LV # Lo HANGUL SYLLABLE DE
+B38C ; LV # Lo HANGUL SYLLABLE DYEO
+B3A8 ; LV # Lo HANGUL SYLLABLE DYE
+B3C4 ; LV # Lo HANGUL SYLLABLE DO
+B3E0 ; LV # Lo HANGUL SYLLABLE DWA
+B3FC ; LV # Lo HANGUL SYLLABLE DWAE
+B418 ; LV # Lo HANGUL SYLLABLE DOE
+B434 ; LV # Lo HANGUL SYLLABLE DYO
+B450 ; LV # Lo HANGUL SYLLABLE DU
+B46C ; LV # Lo HANGUL SYLLABLE DWEO
+B488 ; LV # Lo HANGUL SYLLABLE DWE
+B4A4 ; LV # Lo HANGUL SYLLABLE DWI
+B4C0 ; LV # Lo HANGUL SYLLABLE DYU
+B4DC ; LV # Lo HANGUL SYLLABLE DEU
+B4F8 ; LV # Lo HANGUL SYLLABLE DYI
+B514 ; LV # Lo HANGUL SYLLABLE DI
+B530 ; LV # Lo HANGUL SYLLABLE DDA
+B54C ; LV # Lo HANGUL SYLLABLE DDAE
+B568 ; LV # Lo HANGUL SYLLABLE DDYA
+B584 ; LV # Lo HANGUL SYLLABLE DDYAE
+B5A0 ; LV # Lo HANGUL SYLLABLE DDEO
+B5BC ; LV # Lo HANGUL SYLLABLE DDE
+B5D8 ; LV # Lo HANGUL SYLLABLE DDYEO
+B5F4 ; LV # Lo HANGUL SYLLABLE DDYE
+B610 ; LV # Lo HANGUL SYLLABLE DDO
+B62C ; LV # Lo HANGUL SYLLABLE DDWA
+B648 ; LV # Lo HANGUL SYLLABLE DDWAE
+B664 ; LV # Lo HANGUL SYLLABLE DDOE
+B680 ; LV # Lo HANGUL SYLLABLE DDYO
+B69C ; LV # Lo HANGUL SYLLABLE DDU
+B6B8 ; LV # Lo HANGUL SYLLABLE DDWEO
+B6D4 ; LV # Lo HANGUL SYLLABLE DDWE
+B6F0 ; LV # Lo HANGUL SYLLABLE DDWI
+B70C ; LV # Lo HANGUL SYLLABLE DDYU
+B728 ; LV # Lo HANGUL SYLLABLE DDEU
+B744 ; LV # Lo HANGUL SYLLABLE DDYI
+B760 ; LV # Lo HANGUL SYLLABLE DDI
+B77C ; LV # Lo HANGUL SYLLABLE RA
+B798 ; LV # Lo HANGUL SYLLABLE RAE
+B7B4 ; LV # Lo HANGUL SYLLABLE RYA
+B7D0 ; LV # Lo HANGUL SYLLABLE RYAE
+B7EC ; LV # Lo HANGUL SYLLABLE REO
+B808 ; LV # Lo HANGUL SYLLABLE RE
+B824 ; LV # Lo HANGUL SYLLABLE RYEO
+B840 ; LV # Lo HANGUL SYLLABLE RYE
+B85C ; LV # Lo HANGUL SYLLABLE RO
+B878 ; LV # Lo HANGUL SYLLABLE RWA
+B894 ; LV # Lo HANGUL SYLLABLE RWAE
+B8B0 ; LV # Lo HANGUL SYLLABLE ROE
+B8CC ; LV # Lo HANGUL SYLLABLE RYO
+B8E8 ; LV # Lo HANGUL SYLLABLE RU
+B904 ; LV # Lo HANGUL SYLLABLE RWEO
+B920 ; LV # Lo HANGUL SYLLABLE RWE
+B93C ; LV # Lo HANGUL SYLLABLE RWI
+B958 ; LV # Lo HANGUL SYLLABLE RYU
+B974 ; LV # Lo HANGUL SYLLABLE REU
+B990 ; LV # Lo HANGUL SYLLABLE RYI
+B9AC ; LV # Lo HANGUL SYLLABLE RI
+B9C8 ; LV # Lo HANGUL SYLLABLE MA
+B9E4 ; LV # Lo HANGUL SYLLABLE MAE
+BA00 ; LV # Lo HANGUL SYLLABLE MYA
+BA1C ; LV # Lo HANGUL SYLLABLE MYAE
+BA38 ; LV # Lo HANGUL SYLLABLE MEO
+BA54 ; LV # Lo HANGUL SYLLABLE ME
+BA70 ; LV # Lo HANGUL SYLLABLE MYEO
+BA8C ; LV # Lo HANGUL SYLLABLE MYE
+BAA8 ; LV # Lo HANGUL SYLLABLE MO
+BAC4 ; LV # Lo HANGUL SYLLABLE MWA
+BAE0 ; LV # Lo HANGUL SYLLABLE MWAE
+BAFC ; LV # Lo HANGUL SYLLABLE MOE
+BB18 ; LV # Lo HANGUL SYLLABLE MYO
+BB34 ; LV # Lo HANGUL SYLLABLE MU
+BB50 ; LV # Lo HANGUL SYLLABLE MWEO
+BB6C ; LV # Lo HANGUL SYLLABLE MWE
+BB88 ; LV # Lo HANGUL SYLLABLE MWI
+BBA4 ; LV # Lo HANGUL SYLLABLE MYU
+BBC0 ; LV # Lo HANGUL SYLLABLE MEU
+BBDC ; LV # Lo HANGUL SYLLABLE MYI
+BBF8 ; LV # Lo HANGUL SYLLABLE MI
+BC14 ; LV # Lo HANGUL SYLLABLE BA
+BC30 ; LV # Lo HANGUL SYLLABLE BAE
+BC4C ; LV # Lo HANGUL SYLLABLE BYA
+BC68 ; LV # Lo HANGUL SYLLABLE BYAE
+BC84 ; LV # Lo HANGUL SYLLABLE BEO
+BCA0 ; LV # Lo HANGUL SYLLABLE BE
+BCBC ; LV # Lo HANGUL SYLLABLE BYEO
+BCD8 ; LV # Lo HANGUL SYLLABLE BYE
+BCF4 ; LV # Lo HANGUL SYLLABLE BO
+BD10 ; LV # Lo HANGUL SYLLABLE BWA
+BD2C ; LV # Lo HANGUL SYLLABLE BWAE
+BD48 ; LV # Lo HANGUL SYLLABLE BOE
+BD64 ; LV # Lo HANGUL SYLLABLE BYO
+BD80 ; LV # Lo HANGUL SYLLABLE BU
+BD9C ; LV # Lo HANGUL SYLLABLE BWEO
+BDB8 ; LV # Lo HANGUL SYLLABLE BWE
+BDD4 ; LV # Lo HANGUL SYLLABLE BWI
+BDF0 ; LV # Lo HANGUL SYLLABLE BYU
+BE0C ; LV # Lo HANGUL SYLLABLE BEU
+BE28 ; LV # Lo HANGUL SYLLABLE BYI
+BE44 ; LV # Lo HANGUL SYLLABLE BI
+BE60 ; LV # Lo HANGUL SYLLABLE BBA
+BE7C ; LV # Lo HANGUL SYLLABLE BBAE
+BE98 ; LV # Lo HANGUL SYLLABLE BBYA
+BEB4 ; LV # Lo HANGUL SYLLABLE BBYAE
+BED0 ; LV # Lo HANGUL SYLLABLE BBEO
+BEEC ; LV # Lo HANGUL SYLLABLE BBE
+BF08 ; LV # Lo HANGUL SYLLABLE BBYEO
+BF24 ; LV # Lo HANGUL SYLLABLE BBYE
+BF40 ; LV # Lo HANGUL SYLLABLE BBO
+BF5C ; LV # Lo HANGUL SYLLABLE BBWA
+BF78 ; LV # Lo HANGUL SYLLABLE BBWAE
+BF94 ; LV # Lo HANGUL SYLLABLE BBOE
+BFB0 ; LV # Lo HANGUL SYLLABLE BBYO
+BFCC ; LV # Lo HANGUL SYLLABLE BBU
+BFE8 ; LV # Lo HANGUL SYLLABLE BBWEO
+C004 ; LV # Lo HANGUL SYLLABLE BBWE
+C020 ; LV # Lo HANGUL SYLLABLE BBWI
+C03C ; LV # Lo HANGUL SYLLABLE BBYU
+C058 ; LV # Lo HANGUL SYLLABLE BBEU
+C074 ; LV # Lo HANGUL SYLLABLE BBYI
+C090 ; LV # Lo HANGUL SYLLABLE BBI
+C0AC ; LV # Lo HANGUL SYLLABLE SA
+C0C8 ; LV # Lo HANGUL SYLLABLE SAE
+C0E4 ; LV # Lo HANGUL SYLLABLE SYA
+C100 ; LV # Lo HANGUL SYLLABLE SYAE
+C11C ; LV # Lo HANGUL SYLLABLE SEO
+C138 ; LV # Lo HANGUL SYLLABLE SE
+C154 ; LV # Lo HANGUL SYLLABLE SYEO
+C170 ; LV # Lo HANGUL SYLLABLE SYE
+C18C ; LV # Lo HANGUL SYLLABLE SO
+C1A8 ; LV # Lo HANGUL SYLLABLE SWA
+C1C4 ; LV # Lo HANGUL SYLLABLE SWAE
+C1E0 ; LV # Lo HANGUL SYLLABLE SOE
+C1FC ; LV # Lo HANGUL SYLLABLE SYO
+C218 ; LV # Lo HANGUL SYLLABLE SU
+C234 ; LV # Lo HANGUL SYLLABLE SWEO
+C250 ; LV # Lo HANGUL SYLLABLE SWE
+C26C ; LV # Lo HANGUL SYLLABLE SWI
+C288 ; LV # Lo HANGUL SYLLABLE SYU
+C2A4 ; LV # Lo HANGUL SYLLABLE SEU
+C2C0 ; LV # Lo HANGUL SYLLABLE SYI
+C2DC ; LV # Lo HANGUL SYLLABLE SI
+C2F8 ; LV # Lo HANGUL SYLLABLE SSA
+C314 ; LV # Lo HANGUL SYLLABLE SSAE
+C330 ; LV # Lo HANGUL SYLLABLE SSYA
+C34C ; LV # Lo HANGUL SYLLABLE SSYAE
+C368 ; LV # Lo HANGUL SYLLABLE SSEO
+C384 ; LV # Lo HANGUL SYLLABLE SSE
+C3A0 ; LV # Lo HANGUL SYLLABLE SSYEO
+C3BC ; LV # Lo HANGUL SYLLABLE SSYE
+C3D8 ; LV # Lo HANGUL SYLLABLE SSO
+C3F4 ; LV # Lo HANGUL SYLLABLE SSWA
+C410 ; LV # Lo HANGUL SYLLABLE SSWAE
+C42C ; LV # Lo HANGUL SYLLABLE SSOE
+C448 ; LV # Lo HANGUL SYLLABLE SSYO
+C464 ; LV # Lo HANGUL SYLLABLE SSU
+C480 ; LV # Lo HANGUL SYLLABLE SSWEO
+C49C ; LV # Lo HANGUL SYLLABLE SSWE
+C4B8 ; LV # Lo HANGUL SYLLABLE SSWI
+C4D4 ; LV # Lo HANGUL SYLLABLE SSYU
+C4F0 ; LV # Lo HANGUL SYLLABLE SSEU
+C50C ; LV # Lo HANGUL SYLLABLE SSYI
+C528 ; LV # Lo HANGUL SYLLABLE SSI
+C544 ; LV # Lo HANGUL SYLLABLE A
+C560 ; LV # Lo HANGUL SYLLABLE AE
+C57C ; LV # Lo HANGUL SYLLABLE YA
+C598 ; LV # Lo HANGUL SYLLABLE YAE
+C5B4 ; LV # Lo HANGUL SYLLABLE EO
+C5D0 ; LV # Lo HANGUL SYLLABLE E
+C5EC ; LV # Lo HANGUL SYLLABLE YEO
+C608 ; LV # Lo HANGUL SYLLABLE YE
+C624 ; LV # Lo HANGUL SYLLABLE O
+C640 ; LV # Lo HANGUL SYLLABLE WA
+C65C ; LV # Lo HANGUL SYLLABLE WAE
+C678 ; LV # Lo HANGUL SYLLABLE OE
+C694 ; LV # Lo HANGUL SYLLABLE YO
+C6B0 ; LV # Lo HANGUL SYLLABLE U
+C6CC ; LV # Lo HANGUL SYLLABLE WEO
+C6E8 ; LV # Lo HANGUL SYLLABLE WE
+C704 ; LV # Lo HANGUL SYLLABLE WI
+C720 ; LV # Lo HANGUL SYLLABLE YU
+C73C ; LV # Lo HANGUL SYLLABLE EU
+C758 ; LV # Lo HANGUL SYLLABLE YI
+C774 ; LV # Lo HANGUL SYLLABLE I
+C790 ; LV # Lo HANGUL SYLLABLE JA
+C7AC ; LV # Lo HANGUL SYLLABLE JAE
+C7C8 ; LV # Lo HANGUL SYLLABLE JYA
+C7E4 ; LV # Lo HANGUL SYLLABLE JYAE
+C800 ; LV # Lo HANGUL SYLLABLE JEO
+C81C ; LV # Lo HANGUL SYLLABLE JE
+C838 ; LV # Lo HANGUL SYLLABLE JYEO
+C854 ; LV # Lo HANGUL SYLLABLE JYE
+C870 ; LV # Lo HANGUL SYLLABLE JO
+C88C ; LV # Lo HANGUL SYLLABLE JWA
+C8A8 ; LV # Lo HANGUL SYLLABLE JWAE
+C8C4 ; LV # Lo HANGUL SYLLABLE JOE
+C8E0 ; LV # Lo HANGUL SYLLABLE JYO
+C8FC ; LV # Lo HANGUL SYLLABLE JU
+C918 ; LV # Lo HANGUL SYLLABLE JWEO
+C934 ; LV # Lo HANGUL SYLLABLE JWE
+C950 ; LV # Lo HANGUL SYLLABLE JWI
+C96C ; LV # Lo HANGUL SYLLABLE JYU
+C988 ; LV # Lo HANGUL SYLLABLE JEU
+C9A4 ; LV # Lo HANGUL SYLLABLE JYI
+C9C0 ; LV # Lo HANGUL SYLLABLE JI
+C9DC ; LV # Lo HANGUL SYLLABLE JJA
+C9F8 ; LV # Lo HANGUL SYLLABLE JJAE
+CA14 ; LV # Lo HANGUL SYLLABLE JJYA
+CA30 ; LV # Lo HANGUL SYLLABLE JJYAE
+CA4C ; LV # Lo HANGUL SYLLABLE JJEO
+CA68 ; LV # Lo HANGUL SYLLABLE JJE
+CA84 ; LV # Lo HANGUL SYLLABLE JJYEO
+CAA0 ; LV # Lo HANGUL SYLLABLE JJYE
+CABC ; LV # Lo HANGUL SYLLABLE JJO
+CAD8 ; LV # Lo HANGUL SYLLABLE JJWA
+CAF4 ; LV # Lo HANGUL SYLLABLE JJWAE
+CB10 ; LV # Lo HANGUL SYLLABLE JJOE
+CB2C ; LV # Lo HANGUL SYLLABLE JJYO
+CB48 ; LV # Lo HANGUL SYLLABLE JJU
+CB64 ; LV # Lo HANGUL SYLLABLE JJWEO
+CB80 ; LV # Lo HANGUL SYLLABLE JJWE
+CB9C ; LV # Lo HANGUL SYLLABLE JJWI
+CBB8 ; LV # Lo HANGUL SYLLABLE JJYU
+CBD4 ; LV # Lo HANGUL SYLLABLE JJEU
+CBF0 ; LV # Lo HANGUL SYLLABLE JJYI
+CC0C ; LV # Lo HANGUL SYLLABLE JJI
+CC28 ; LV # Lo HANGUL SYLLABLE CA
+CC44 ; LV # Lo HANGUL SYLLABLE CAE
+CC60 ; LV # Lo HANGUL SYLLABLE CYA
+CC7C ; LV # Lo HANGUL SYLLABLE CYAE
+CC98 ; LV # Lo HANGUL SYLLABLE CEO
+CCB4 ; LV # Lo HANGUL SYLLABLE CE
+CCD0 ; LV # Lo HANGUL SYLLABLE CYEO
+CCEC ; LV # Lo HANGUL SYLLABLE CYE
+CD08 ; LV # Lo HANGUL SYLLABLE CO
+CD24 ; LV # Lo HANGUL SYLLABLE CWA
+CD40 ; LV # Lo HANGUL SYLLABLE CWAE
+CD5C ; LV # Lo HANGUL SYLLABLE COE
+CD78 ; LV # Lo HANGUL SYLLABLE CYO
+CD94 ; LV # Lo HANGUL SYLLABLE CU
+CDB0 ; LV # Lo HANGUL SYLLABLE CWEO
+CDCC ; LV # Lo HANGUL SYLLABLE CWE
+CDE8 ; LV # Lo HANGUL SYLLABLE CWI
+CE04 ; LV # Lo HANGUL SYLLABLE CYU
+CE20 ; LV # Lo HANGUL SYLLABLE CEU
+CE3C ; LV # Lo HANGUL SYLLABLE CYI
+CE58 ; LV # Lo HANGUL SYLLABLE CI
+CE74 ; LV # Lo HANGUL SYLLABLE KA
+CE90 ; LV # Lo HANGUL SYLLABLE KAE
+CEAC ; LV # Lo HANGUL SYLLABLE KYA
+CEC8 ; LV # Lo HANGUL SYLLABLE KYAE
+CEE4 ; LV # Lo HANGUL SYLLABLE KEO
+CF00 ; LV # Lo HANGUL SYLLABLE KE
+CF1C ; LV # Lo HANGUL SYLLABLE KYEO
+CF38 ; LV # Lo HANGUL SYLLABLE KYE
+CF54 ; LV # Lo HANGUL SYLLABLE KO
+CF70 ; LV # Lo HANGUL SYLLABLE KWA
+CF8C ; LV # Lo HANGUL SYLLABLE KWAE
+CFA8 ; LV # Lo HANGUL SYLLABLE KOE
+CFC4 ; LV # Lo HANGUL SYLLABLE KYO
+CFE0 ; LV # Lo HANGUL SYLLABLE KU
+CFFC ; LV # Lo HANGUL SYLLABLE KWEO
+D018 ; LV # Lo HANGUL SYLLABLE KWE
+D034 ; LV # Lo HANGUL SYLLABLE KWI
+D050 ; LV # Lo HANGUL SYLLABLE KYU
+D06C ; LV # Lo HANGUL SYLLABLE KEU
+D088 ; LV # Lo HANGUL SYLLABLE KYI
+D0A4 ; LV # Lo HANGUL SYLLABLE KI
+D0C0 ; LV # Lo HANGUL SYLLABLE TA
+D0DC ; LV # Lo HANGUL SYLLABLE TAE
+D0F8 ; LV # Lo HANGUL SYLLABLE TYA
+D114 ; LV # Lo HANGUL SYLLABLE TYAE
+D130 ; LV # Lo HANGUL SYLLABLE TEO
+D14C ; LV # Lo HANGUL SYLLABLE TE
+D168 ; LV # Lo HANGUL SYLLABLE TYEO
+D184 ; LV # Lo HANGUL SYLLABLE TYE
+D1A0 ; LV # Lo HANGUL SYLLABLE TO
+D1BC ; LV # Lo HANGUL SYLLABLE TWA
+D1D8 ; LV # Lo HANGUL SYLLABLE TWAE
+D1F4 ; LV # Lo HANGUL SYLLABLE TOE
+D210 ; LV # Lo HANGUL SYLLABLE TYO
+D22C ; LV # Lo HANGUL SYLLABLE TU
+D248 ; LV # Lo HANGUL SYLLABLE TWEO
+D264 ; LV # Lo HANGUL SYLLABLE TWE
+D280 ; LV # Lo HANGUL SYLLABLE TWI
+D29C ; LV # Lo HANGUL SYLLABLE TYU
+D2B8 ; LV # Lo HANGUL SYLLABLE TEU
+D2D4 ; LV # Lo HANGUL SYLLABLE TYI
+D2F0 ; LV # Lo HANGUL SYLLABLE TI
+D30C ; LV # Lo HANGUL SYLLABLE PA
+D328 ; LV # Lo HANGUL SYLLABLE PAE
+D344 ; LV # Lo HANGUL SYLLABLE PYA
+D360 ; LV # Lo HANGUL SYLLABLE PYAE
+D37C ; LV # Lo HANGUL SYLLABLE PEO
+D398 ; LV # Lo HANGUL SYLLABLE PE
+D3B4 ; LV # Lo HANGUL SYLLABLE PYEO
+D3D0 ; LV # Lo HANGUL SYLLABLE PYE
+D3EC ; LV # Lo HANGUL SYLLABLE PO
+D408 ; LV # Lo HANGUL SYLLABLE PWA
+D424 ; LV # Lo HANGUL SYLLABLE PWAE
+D440 ; LV # Lo HANGUL SYLLABLE POE
+D45C ; LV # Lo HANGUL SYLLABLE PYO
+D478 ; LV # Lo HANGUL SYLLABLE PU
+D494 ; LV # Lo HANGUL SYLLABLE PWEO
+D4B0 ; LV # Lo HANGUL SYLLABLE PWE
+D4CC ; LV # Lo HANGUL SYLLABLE PWI
+D4E8 ; LV # Lo HANGUL SYLLABLE PYU
+D504 ; LV # Lo HANGUL SYLLABLE PEU
+D520 ; LV # Lo HANGUL SYLLABLE PYI
+D53C ; LV # Lo HANGUL SYLLABLE PI
+D558 ; LV # Lo HANGUL SYLLABLE HA
+D574 ; LV # Lo HANGUL SYLLABLE HAE
+D590 ; LV # Lo HANGUL SYLLABLE HYA
+D5AC ; LV # Lo HANGUL SYLLABLE HYAE
+D5C8 ; LV # Lo HANGUL SYLLABLE HEO
+D5E4 ; LV # Lo HANGUL SYLLABLE HE
+D600 ; LV # Lo HANGUL SYLLABLE HYEO
+D61C ; LV # Lo HANGUL SYLLABLE HYE
+D638 ; LV # Lo HANGUL SYLLABLE HO
+D654 ; LV # Lo HANGUL SYLLABLE HWA
+D670 ; LV # Lo HANGUL SYLLABLE HWAE
+D68C ; LV # Lo HANGUL SYLLABLE HOE
+D6A8 ; LV # Lo HANGUL SYLLABLE HYO
+D6C4 ; LV # Lo HANGUL SYLLABLE HU
+D6E0 ; LV # Lo HANGUL SYLLABLE HWEO
+D6FC ; LV # Lo HANGUL SYLLABLE HWE
+D718 ; LV # Lo HANGUL SYLLABLE HWI
+D734 ; LV # Lo HANGUL SYLLABLE HYU
+D750 ; LV # Lo HANGUL SYLLABLE HEU
+D76C ; LV # Lo HANGUL SYLLABLE HYI
+D788 ; LV # Lo HANGUL SYLLABLE HI
+
+# Total code points: 399
+
+# ================================================
+
+# Hangul_Syllable_Type=LVT_Syllable
+
+AC01..AC1B ; LVT # Lo [27] HANGUL SYLLABLE GAG..HANGUL SYLLABLE GAH
+AC1D..AC37 ; LVT # Lo [27] HANGUL SYLLABLE GAEG..HANGUL SYLLABLE GAEH
+AC39..AC53 ; LVT # Lo [27] HANGUL SYLLABLE GYAG..HANGUL SYLLABLE GYAH
+AC55..AC6F ; LVT # Lo [27] HANGUL SYLLABLE GYAEG..HANGUL SYLLABLE GYAEH
+AC71..AC8B ; LVT # Lo [27] HANGUL SYLLABLE GEOG..HANGUL SYLLABLE GEOH
+AC8D..ACA7 ; LVT # Lo [27] HANGUL SYLLABLE GEG..HANGUL SYLLABLE GEH
+ACA9..ACC3 ; LVT # Lo [27] HANGUL SYLLABLE GYEOG..HANGUL SYLLABLE GYEOH
+ACC5..ACDF ; LVT # Lo [27] HANGUL SYLLABLE GYEG..HANGUL SYLLABLE GYEH
+ACE1..ACFB ; LVT # Lo [27] HANGUL SYLLABLE GOG..HANGUL SYLLABLE GOH
+ACFD..AD17 ; LVT # Lo [27] HANGUL SYLLABLE GWAG..HANGUL SYLLABLE GWAH
+AD19..AD33 ; LVT # Lo [27] HANGUL SYLLABLE GWAEG..HANGUL SYLLABLE GWAEH
+AD35..AD4F ; LVT # Lo [27] HANGUL SYLLABLE GOEG..HANGUL SYLLABLE GOEH
+AD51..AD6B ; LVT # Lo [27] HANGUL SYLLABLE GYOG..HANGUL SYLLABLE GYOH
+AD6D..AD87 ; LVT # Lo [27] HANGUL SYLLABLE GUG..HANGUL SYLLABLE GUH
+AD89..ADA3 ; LVT # Lo [27] HANGUL SYLLABLE GWEOG..HANGUL SYLLABLE GWEOH
+ADA5..ADBF ; LVT # Lo [27] HANGUL SYLLABLE GWEG..HANGUL SYLLABLE GWEH
+ADC1..ADDB ; LVT # Lo [27] HANGUL SYLLABLE GWIG..HANGUL SYLLABLE GWIH
+ADDD..ADF7 ; LVT # Lo [27] HANGUL SYLLABLE GYUG..HANGUL SYLLABLE GYUH
+ADF9..AE13 ; LVT # Lo [27] HANGUL SYLLABLE GEUG..HANGUL SYLLABLE GEUH
+AE15..AE2F ; LVT # Lo [27] HANGUL SYLLABLE GYIG..HANGUL SYLLABLE GYIH
+AE31..AE4B ; LVT # Lo [27] HANGUL SYLLABLE GIG..HANGUL SYLLABLE GIH
+AE4D..AE67 ; LVT # Lo [27] HANGUL SYLLABLE GGAG..HANGUL SYLLABLE GGAH
+AE69..AE83 ; LVT # Lo [27] HANGUL SYLLABLE GGAEG..HANGUL SYLLABLE GGAEH
+AE85..AE9F ; LVT # Lo [27] HANGUL SYLLABLE GGYAG..HANGUL SYLLABLE GGYAH
+AEA1..AEBB ; LVT # Lo [27] HANGUL SYLLABLE GGYAEG..HANGUL SYLLABLE GGYAEH
+AEBD..AED7 ; LVT # Lo [27] HANGUL SYLLABLE GGEOG..HANGUL SYLLABLE GGEOH
+AED9..AEF3 ; LVT # Lo [27] HANGUL SYLLABLE GGEG..HANGUL SYLLABLE GGEH
+AEF5..AF0F ; LVT # Lo [27] HANGUL SYLLABLE GGYEOG..HANGUL SYLLABLE GGYEOH
+AF11..AF2B ; LVT # Lo [27] HANGUL SYLLABLE GGYEG..HANGUL SYLLABLE GGYEH
+AF2D..AF47 ; LVT # Lo [27] HANGUL SYLLABLE GGOG..HANGUL SYLLABLE GGOH
+AF49..AF63 ; LVT # Lo [27] HANGUL SYLLABLE GGWAG..HANGUL SYLLABLE GGWAH
+AF65..AF7F ; LVT # Lo [27] HANGUL SYLLABLE GGWAEG..HANGUL SYLLABLE GGWAEH
+AF81..AF9B ; LVT # Lo [27] HANGUL SYLLABLE GGOEG..HANGUL SYLLABLE GGOEH
+AF9D..AFB7 ; LVT # Lo [27] HANGUL SYLLABLE GGYOG..HANGUL SYLLABLE GGYOH
+AFB9..AFD3 ; LVT # Lo [27] HANGUL SYLLABLE GGUG..HANGUL SYLLABLE GGUH
+AFD5..AFEF ; LVT # Lo [27] HANGUL SYLLABLE GGWEOG..HANGUL SYLLABLE GGWEOH
+AFF1..B00B ; LVT # Lo [27] HANGUL SYLLABLE GGWEG..HANGUL SYLLABLE GGWEH
+B00D..B027 ; LVT # Lo [27] HANGUL SYLLABLE GGWIG..HANGUL SYLLABLE GGWIH
+B029..B043 ; LVT # Lo [27] HANGUL SYLLABLE GGYUG..HANGUL SYLLABLE GGYUH
+B045..B05F ; LVT # Lo [27] HANGUL SYLLABLE GGEUG..HANGUL SYLLABLE GGEUH
+B061..B07B ; LVT # Lo [27] HANGUL SYLLABLE GGYIG..HANGUL SYLLABLE GGYIH
+B07D..B097 ; LVT # Lo [27] HANGUL SYLLABLE GGIG..HANGUL SYLLABLE GGIH
+B099..B0B3 ; LVT # Lo [27] HANGUL SYLLABLE NAG..HANGUL SYLLABLE NAH
+B0B5..B0CF ; LVT # Lo [27] HANGUL SYLLABLE NAEG..HANGUL SYLLABLE NAEH
+B0D1..B0EB ; LVT # Lo [27] HANGUL SYLLABLE NYAG..HANGUL SYLLABLE NYAH
+B0ED..B107 ; LVT # Lo [27] HANGUL SYLLABLE NYAEG..HANGUL SYLLABLE NYAEH
+B109..B123 ; LVT # Lo [27] HANGUL SYLLABLE NEOG..HANGUL SYLLABLE NEOH
+B125..B13F ; LVT # Lo [27] HANGUL SYLLABLE NEG..HANGUL SYLLABLE NEH
+B141..B15B ; LVT # Lo [27] HANGUL SYLLABLE NYEOG..HANGUL SYLLABLE NYEOH
+B15D..B177 ; LVT # Lo [27] HANGUL SYLLABLE NYEG..HANGUL SYLLABLE NYEH
+B179..B193 ; LVT # Lo [27] HANGUL SYLLABLE NOG..HANGUL SYLLABLE NOH
+B195..B1AF ; LVT # Lo [27] HANGUL SYLLABLE NWAG..HANGUL SYLLABLE NWAH
+B1B1..B1CB ; LVT # Lo [27] HANGUL SYLLABLE NWAEG..HANGUL SYLLABLE NWAEH
+B1CD..B1E7 ; LVT # Lo [27] HANGUL SYLLABLE NOEG..HANGUL SYLLABLE NOEH
+B1E9..B203 ; LVT # Lo [27] HANGUL SYLLABLE NYOG..HANGUL SYLLABLE NYOH
+B205..B21F ; LVT # Lo [27] HANGUL SYLLABLE NUG..HANGUL SYLLABLE NUH
+B221..B23B ; LVT # Lo [27] HANGUL SYLLABLE NWEOG..HANGUL SYLLABLE NWEOH
+B23D..B257 ; LVT # Lo [27] HANGUL SYLLABLE NWEG..HANGUL SYLLABLE NWEH
+B259..B273 ; LVT # Lo [27] HANGUL SYLLABLE NWIG..HANGUL SYLLABLE NWIH
+B275..B28F ; LVT # Lo [27] HANGUL SYLLABLE NYUG..HANGUL SYLLABLE NYUH
+B291..B2AB ; LVT # Lo [27] HANGUL SYLLABLE NEUG..HANGUL SYLLABLE NEUH
+B2AD..B2C7 ; LVT # Lo [27] HANGUL SYLLABLE NYIG..HANGUL SYLLABLE NYIH
+B2C9..B2E3 ; LVT # Lo [27] HANGUL SYLLABLE NIG..HANGUL SYLLABLE NIH
+B2E5..B2FF ; LVT # Lo [27] HANGUL SYLLABLE DAG..HANGUL SYLLABLE DAH
+B301..B31B ; LVT # Lo [27] HANGUL SYLLABLE DAEG..HANGUL SYLLABLE DAEH
+B31D..B337 ; LVT # Lo [27] HANGUL SYLLABLE DYAG..HANGUL SYLLABLE DYAH
+B339..B353 ; LVT # Lo [27] HANGUL SYLLABLE DYAEG..HANGUL SYLLABLE DYAEH
+B355..B36F ; LVT # Lo [27] HANGUL SYLLABLE DEOG..HANGUL SYLLABLE DEOH
+B371..B38B ; LVT # Lo [27] HANGUL SYLLABLE DEG..HANGUL SYLLABLE DEH
+B38D..B3A7 ; LVT # Lo [27] HANGUL SYLLABLE DYEOG..HANGUL SYLLABLE DYEOH
+B3A9..B3C3 ; LVT # Lo [27] HANGUL SYLLABLE DYEG..HANGUL SYLLABLE DYEH
+B3C5..B3DF ; LVT # Lo [27] HANGUL SYLLABLE DOG..HANGUL SYLLABLE DOH
+B3E1..B3FB ; LVT # Lo [27] HANGUL SYLLABLE DWAG..HANGUL SYLLABLE DWAH
+B3FD..B417 ; LVT # Lo [27] HANGUL SYLLABLE DWAEG..HANGUL SYLLABLE DWAEH
+B419..B433 ; LVT # Lo [27] HANGUL SYLLABLE DOEG..HANGUL SYLLABLE DOEH
+B435..B44F ; LVT # Lo [27] HANGUL SYLLABLE DYOG..HANGUL SYLLABLE DYOH
+B451..B46B ; LVT # Lo [27] HANGUL SYLLABLE DUG..HANGUL SYLLABLE DUH
+B46D..B487 ; LVT # Lo [27] HANGUL SYLLABLE DWEOG..HANGUL SYLLABLE DWEOH
+B489..B4A3 ; LVT # Lo [27] HANGUL SYLLABLE DWEG..HANGUL SYLLABLE DWEH
+B4A5..B4BF ; LVT # Lo [27] HANGUL SYLLABLE DWIG..HANGUL SYLLABLE DWIH
+B4C1..B4DB ; LVT # Lo [27] HANGUL SYLLABLE DYUG..HANGUL SYLLABLE DYUH
+B4DD..B4F7 ; LVT # Lo [27] HANGUL SYLLABLE DEUG..HANGUL SYLLABLE DEUH
+B4F9..B513 ; LVT # Lo [27] HANGUL SYLLABLE DYIG..HANGUL SYLLABLE DYIH
+B515..B52F ; LVT # Lo [27] HANGUL SYLLABLE DIG..HANGUL SYLLABLE DIH
+B531..B54B ; LVT # Lo [27] HANGUL SYLLABLE DDAG..HANGUL SYLLABLE DDAH
+B54D..B567 ; LVT # Lo [27] HANGUL SYLLABLE DDAEG..HANGUL SYLLABLE DDAEH
+B569..B583 ; LVT # Lo [27] HANGUL SYLLABLE DDYAG..HANGUL SYLLABLE DDYAH
+B585..B59F ; LVT # Lo [27] HANGUL SYLLABLE DDYAEG..HANGUL SYLLABLE DDYAEH
+B5A1..B5BB ; LVT # Lo [27] HANGUL SYLLABLE DDEOG..HANGUL SYLLABLE DDEOH
+B5BD..B5D7 ; LVT # Lo [27] HANGUL SYLLABLE DDEG..HANGUL SYLLABLE DDEH
+B5D9..B5F3 ; LVT # Lo [27] HANGUL SYLLABLE DDYEOG..HANGUL SYLLABLE DDYEOH
+B5F5..B60F ; LVT # Lo [27] HANGUL SYLLABLE DDYEG..HANGUL SYLLABLE DDYEH
+B611..B62B ; LVT # Lo [27] HANGUL SYLLABLE DDOG..HANGUL SYLLABLE DDOH
+B62D..B647 ; LVT # Lo [27] HANGUL SYLLABLE DDWAG..HANGUL SYLLABLE DDWAH
+B649..B663 ; LVT # Lo [27] HANGUL SYLLABLE DDWAEG..HANGUL SYLLABLE DDWAEH
+B665..B67F ; LVT # Lo [27] HANGUL SYLLABLE DDOEG..HANGUL SYLLABLE DDOEH
+B681..B69B ; LVT # Lo [27] HANGUL SYLLABLE DDYOG..HANGUL SYLLABLE DDYOH
+B69D..B6B7 ; LVT # Lo [27] HANGUL SYLLABLE DDUG..HANGUL SYLLABLE DDUH
+B6B9..B6D3 ; LVT # Lo [27] HANGUL SYLLABLE DDWEOG..HANGUL SYLLABLE DDWEOH
+B6D5..B6EF ; LVT # Lo [27] HANGUL SYLLABLE DDWEG..HANGUL SYLLABLE DDWEH
+B6F1..B70B ; LVT # Lo [27] HANGUL SYLLABLE DDWIG..HANGUL SYLLABLE DDWIH
+B70D..B727 ; LVT # Lo [27] HANGUL SYLLABLE DDYUG..HANGUL SYLLABLE DDYUH
+B729..B743 ; LVT # Lo [27] HANGUL SYLLABLE DDEUG..HANGUL SYLLABLE DDEUH
+B745..B75F ; LVT # Lo [27] HANGUL SYLLABLE DDYIG..HANGUL SYLLABLE DDYIH
+B761..B77B ; LVT # Lo [27] HANGUL SYLLABLE DDIG..HANGUL SYLLABLE DDIH
+B77D..B797 ; LVT # Lo [27] HANGUL SYLLABLE RAG..HANGUL SYLLABLE RAH
+B799..B7B3 ; LVT # Lo [27] HANGUL SYLLABLE RAEG..HANGUL SYLLABLE RAEH
+B7B5..B7CF ; LVT # Lo [27] HANGUL SYLLABLE RYAG..HANGUL SYLLABLE RYAH
+B7D1..B7EB ; LVT # Lo [27] HANGUL SYLLABLE RYAEG..HANGUL SYLLABLE RYAEH
+B7ED..B807 ; LVT # Lo [27] HANGUL SYLLABLE REOG..HANGUL SYLLABLE REOH
+B809..B823 ; LVT # Lo [27] HANGUL SYLLABLE REG..HANGUL SYLLABLE REH
+B825..B83F ; LVT # Lo [27] HANGUL SYLLABLE RYEOG..HANGUL SYLLABLE RYEOH
+B841..B85B ; LVT # Lo [27] HANGUL SYLLABLE RYEG..HANGUL SYLLABLE RYEH
+B85D..B877 ; LVT # Lo [27] HANGUL SYLLABLE ROG..HANGUL SYLLABLE ROH
+B879..B893 ; LVT # Lo [27] HANGUL SYLLABLE RWAG..HANGUL SYLLABLE RWAH
+B895..B8AF ; LVT # Lo [27] HANGUL SYLLABLE RWAEG..HANGUL SYLLABLE RWAEH
+B8B1..B8CB ; LVT # Lo [27] HANGUL SYLLABLE ROEG..HANGUL SYLLABLE ROEH
+B8CD..B8E7 ; LVT # Lo [27] HANGUL SYLLABLE RYOG..HANGUL SYLLABLE RYOH
+B8E9..B903 ; LVT # Lo [27] HANGUL SYLLABLE RUG..HANGUL SYLLABLE RUH
+B905..B91F ; LVT # Lo [27] HANGUL SYLLABLE RWEOG..HANGUL SYLLABLE RWEOH
+B921..B93B ; LVT # Lo [27] HANGUL SYLLABLE RWEG..HANGUL SYLLABLE RWEH
+B93D..B957 ; LVT # Lo [27] HANGUL SYLLABLE RWIG..HANGUL SYLLABLE RWIH
+B959..B973 ; LVT # Lo [27] HANGUL SYLLABLE RYUG..HANGUL SYLLABLE RYUH
+B975..B98F ; LVT # Lo [27] HANGUL SYLLABLE REUG..HANGUL SYLLABLE REUH
+B991..B9AB ; LVT # Lo [27] HANGUL SYLLABLE RYIG..HANGUL SYLLABLE RYIH
+B9AD..B9C7 ; LVT # Lo [27] HANGUL SYLLABLE RIG..HANGUL SYLLABLE RIH
+B9C9..B9E3 ; LVT # Lo [27] HANGUL SYLLABLE MAG..HANGUL SYLLABLE MAH
+B9E5..B9FF ; LVT # Lo [27] HANGUL SYLLABLE MAEG..HANGUL SYLLABLE MAEH
+BA01..BA1B ; LVT # Lo [27] HANGUL SYLLABLE MYAG..HANGUL SYLLABLE MYAH
+BA1D..BA37 ; LVT # Lo [27] HANGUL SYLLABLE MYAEG..HANGUL SYLLABLE MYAEH
+BA39..BA53 ; LVT # Lo [27] HANGUL SYLLABLE MEOG..HANGUL SYLLABLE MEOH
+BA55..BA6F ; LVT # Lo [27] HANGUL SYLLABLE MEG..HANGUL SYLLABLE MEH
+BA71..BA8B ; LVT # Lo [27] HANGUL SYLLABLE MYEOG..HANGUL SYLLABLE MYEOH
+BA8D..BAA7 ; LVT # Lo [27] HANGUL SYLLABLE MYEG..HANGUL SYLLABLE MYEH
+BAA9..BAC3 ; LVT # Lo [27] HANGUL SYLLABLE MOG..HANGUL SYLLABLE MOH
+BAC5..BADF ; LVT # Lo [27] HANGUL SYLLABLE MWAG..HANGUL SYLLABLE MWAH
+BAE1..BAFB ; LVT # Lo [27] HANGUL SYLLABLE MWAEG..HANGUL SYLLABLE MWAEH
+BAFD..BB17 ; LVT # Lo [27] HANGUL SYLLABLE MOEG..HANGUL SYLLABLE MOEH
+BB19..BB33 ; LVT # Lo [27] HANGUL SYLLABLE MYOG..HANGUL SYLLABLE MYOH
+BB35..BB4F ; LVT # Lo [27] HANGUL SYLLABLE MUG..HANGUL SYLLABLE MUH
+BB51..BB6B ; LVT # Lo [27] HANGUL SYLLABLE MWEOG..HANGUL SYLLABLE MWEOH
+BB6D..BB87 ; LVT # Lo [27] HANGUL SYLLABLE MWEG..HANGUL SYLLABLE MWEH
+BB89..BBA3 ; LVT # Lo [27] HANGUL SYLLABLE MWIG..HANGUL SYLLABLE MWIH
+BBA5..BBBF ; LVT # Lo [27] HANGUL SYLLABLE MYUG..HANGUL SYLLABLE MYUH
+BBC1..BBDB ; LVT # Lo [27] HANGUL SYLLABLE MEUG..HANGUL SYLLABLE MEUH
+BBDD..BBF7 ; LVT # Lo [27] HANGUL SYLLABLE MYIG..HANGUL SYLLABLE MYIH
+BBF9..BC13 ; LVT # Lo [27] HANGUL SYLLABLE MIG..HANGUL SYLLABLE MIH
+BC15..BC2F ; LVT # Lo [27] HANGUL SYLLABLE BAG..HANGUL SYLLABLE BAH
+BC31..BC4B ; LVT # Lo [27] HANGUL SYLLABLE BAEG..HANGUL SYLLABLE BAEH
+BC4D..BC67 ; LVT # Lo [27] HANGUL SYLLABLE BYAG..HANGUL SYLLABLE BYAH
+BC69..BC83 ; LVT # Lo [27] HANGUL SYLLABLE BYAEG..HANGUL SYLLABLE BYAEH
+BC85..BC9F ; LVT # Lo [27] HANGUL SYLLABLE BEOG..HANGUL SYLLABLE BEOH
+BCA1..BCBB ; LVT # Lo [27] HANGUL SYLLABLE BEG..HANGUL SYLLABLE BEH
+BCBD..BCD7 ; LVT # Lo [27] HANGUL SYLLABLE BYEOG..HANGUL SYLLABLE BYEOH
+BCD9..BCF3 ; LVT # Lo [27] HANGUL SYLLABLE BYEG..HANGUL SYLLABLE BYEH
+BCF5..BD0F ; LVT # Lo [27] HANGUL SYLLABLE BOG..HANGUL SYLLABLE BOH
+BD11..BD2B ; LVT # Lo [27] HANGUL SYLLABLE BWAG..HANGUL SYLLABLE BWAH
+BD2D..BD47 ; LVT # Lo [27] HANGUL SYLLABLE BWAEG..HANGUL SYLLABLE BWAEH
+BD49..BD63 ; LVT # Lo [27] HANGUL SYLLABLE BOEG..HANGUL SYLLABLE BOEH
+BD65..BD7F ; LVT # Lo [27] HANGUL SYLLABLE BYOG..HANGUL SYLLABLE BYOH
+BD81..BD9B ; LVT # Lo [27] HANGUL SYLLABLE BUG..HANGUL SYLLABLE BUH
+BD9D..BDB7 ; LVT # Lo [27] HANGUL SYLLABLE BWEOG..HANGUL SYLLABLE BWEOH
+BDB9..BDD3 ; LVT # Lo [27] HANGUL SYLLABLE BWEG..HANGUL SYLLABLE BWEH
+BDD5..BDEF ; LVT # Lo [27] HANGUL SYLLABLE BWIG..HANGUL SYLLABLE BWIH
+BDF1..BE0B ; LVT # Lo [27] HANGUL SYLLABLE BYUG..HANGUL SYLLABLE BYUH
+BE0D..BE27 ; LVT # Lo [27] HANGUL SYLLABLE BEUG..HANGUL SYLLABLE BEUH
+BE29..BE43 ; LVT # Lo [27] HANGUL SYLLABLE BYIG..HANGUL SYLLABLE BYIH
+BE45..BE5F ; LVT # Lo [27] HANGUL SYLLABLE BIG..HANGUL SYLLABLE BIH
+BE61..BE7B ; LVT # Lo [27] HANGUL SYLLABLE BBAG..HANGUL SYLLABLE BBAH
+BE7D..BE97 ; LVT # Lo [27] HANGUL SYLLABLE BBAEG..HANGUL SYLLABLE BBAEH
+BE99..BEB3 ; LVT # Lo [27] HANGUL SYLLABLE BBYAG..HANGUL SYLLABLE BBYAH
+BEB5..BECF ; LVT # Lo [27] HANGUL SYLLABLE BBYAEG..HANGUL SYLLABLE BBYAEH
+BED1..BEEB ; LVT # Lo [27] HANGUL SYLLABLE BBEOG..HANGUL SYLLABLE BBEOH
+BEED..BF07 ; LVT # Lo [27] HANGUL SYLLABLE BBEG..HANGUL SYLLABLE BBEH
+BF09..BF23 ; LVT # Lo [27] HANGUL SYLLABLE BBYEOG..HANGUL SYLLABLE BBYEOH
+BF25..BF3F ; LVT # Lo [27] HANGUL SYLLABLE BBYEG..HANGUL SYLLABLE BBYEH
+BF41..BF5B ; LVT # Lo [27] HANGUL SYLLABLE BBOG..HANGUL SYLLABLE BBOH
+BF5D..BF77 ; LVT # Lo [27] HANGUL SYLLABLE BBWAG..HANGUL SYLLABLE BBWAH
+BF79..BF93 ; LVT # Lo [27] HANGUL SYLLABLE BBWAEG..HANGUL SYLLABLE BBWAEH
+BF95..BFAF ; LVT # Lo [27] HANGUL SYLLABLE BBOEG..HANGUL SYLLABLE BBOEH
+BFB1..BFCB ; LVT # Lo [27] HANGUL SYLLABLE BBYOG..HANGUL SYLLABLE BBYOH
+BFCD..BFE7 ; LVT # Lo [27] HANGUL SYLLABLE BBUG..HANGUL SYLLABLE BBUH
+BFE9..C003 ; LVT # Lo [27] HANGUL SYLLABLE BBWEOG..HANGUL SYLLABLE BBWEOH
+C005..C01F ; LVT # Lo [27] HANGUL SYLLABLE BBWEG..HANGUL SYLLABLE BBWEH
+C021..C03B ; LVT # Lo [27] HANGUL SYLLABLE BBWIG..HANGUL SYLLABLE BBWIH
+C03D..C057 ; LVT # Lo [27] HANGUL SYLLABLE BBYUG..HANGUL SYLLABLE BBYUH
+C059..C073 ; LVT # Lo [27] HANGUL SYLLABLE BBEUG..HANGUL SYLLABLE BBEUH
+C075..C08F ; LVT # Lo [27] HANGUL SYLLABLE BBYIG..HANGUL SYLLABLE BBYIH
+C091..C0AB ; LVT # Lo [27] HANGUL SYLLABLE BBIG..HANGUL SYLLABLE BBIH
+C0AD..C0C7 ; LVT # Lo [27] HANGUL SYLLABLE SAG..HANGUL SYLLABLE SAH
+C0C9..C0E3 ; LVT # Lo [27] HANGUL SYLLABLE SAEG..HANGUL SYLLABLE SAEH
+C0E5..C0FF ; LVT # Lo [27] HANGUL SYLLABLE SYAG..HANGUL SYLLABLE SYAH
+C101..C11B ; LVT # Lo [27] HANGUL SYLLABLE SYAEG..HANGUL SYLLABLE SYAEH
+C11D..C137 ; LVT # Lo [27] HANGUL SYLLABLE SEOG..HANGUL SYLLABLE SEOH
+C139..C153 ; LVT # Lo [27] HANGUL SYLLABLE SEG..HANGUL SYLLABLE SEH
+C155..C16F ; LVT # Lo [27] HANGUL SYLLABLE SYEOG..HANGUL SYLLABLE SYEOH
+C171..C18B ; LVT # Lo [27] HANGUL SYLLABLE SYEG..HANGUL SYLLABLE SYEH
+C18D..C1A7 ; LVT # Lo [27] HANGUL SYLLABLE SOG..HANGUL SYLLABLE SOH
+C1A9..C1C3 ; LVT # Lo [27] HANGUL SYLLABLE SWAG..HANGUL SYLLABLE SWAH
+C1C5..C1DF ; LVT # Lo [27] HANGUL SYLLABLE SWAEG..HANGUL SYLLABLE SWAEH
+C1E1..C1FB ; LVT # Lo [27] HANGUL SYLLABLE SOEG..HANGUL SYLLABLE SOEH
+C1FD..C217 ; LVT # Lo [27] HANGUL SYLLABLE SYOG..HANGUL SYLLABLE SYOH
+C219..C233 ; LVT # Lo [27] HANGUL SYLLABLE SUG..HANGUL SYLLABLE SUH
+C235..C24F ; LVT # Lo [27] HANGUL SYLLABLE SWEOG..HANGUL SYLLABLE SWEOH
+C251..C26B ; LVT # Lo [27] HANGUL SYLLABLE SWEG..HANGUL SYLLABLE SWEH
+C26D..C287 ; LVT # Lo [27] HANGUL SYLLABLE SWIG..HANGUL SYLLABLE SWIH
+C289..C2A3 ; LVT # Lo [27] HANGUL SYLLABLE SYUG..HANGUL SYLLABLE SYUH
+C2A5..C2BF ; LVT # Lo [27] HANGUL SYLLABLE SEUG..HANGUL SYLLABLE SEUH
+C2C1..C2DB ; LVT # Lo [27] HANGUL SYLLABLE SYIG..HANGUL SYLLABLE SYIH
+C2DD..C2F7 ; LVT # Lo [27] HANGUL SYLLABLE SIG..HANGUL SYLLABLE SIH
+C2F9..C313 ; LVT # Lo [27] HANGUL SYLLABLE SSAG..HANGUL SYLLABLE SSAH
+C315..C32F ; LVT # Lo [27] HANGUL SYLLABLE SSAEG..HANGUL SYLLABLE SSAEH
+C331..C34B ; LVT # Lo [27] HANGUL SYLLABLE SSYAG..HANGUL SYLLABLE SSYAH
+C34D..C367 ; LVT # Lo [27] HANGUL SYLLABLE SSYAEG..HANGUL SYLLABLE SSYAEH
+C369..C383 ; LVT # Lo [27] HANGUL SYLLABLE SSEOG..HANGUL SYLLABLE SSEOH
+C385..C39F ; LVT # Lo [27] HANGUL SYLLABLE SSEG..HANGUL SYLLABLE SSEH
+C3A1..C3BB ; LVT # Lo [27] HANGUL SYLLABLE SSYEOG..HANGUL SYLLABLE SSYEOH
+C3BD..C3D7 ; LVT # Lo [27] HANGUL SYLLABLE SSYEG..HANGUL SYLLABLE SSYEH
+C3D9..C3F3 ; LVT # Lo [27] HANGUL SYLLABLE SSOG..HANGUL SYLLABLE SSOH
+C3F5..C40F ; LVT # Lo [27] HANGUL SYLLABLE SSWAG..HANGUL SYLLABLE SSWAH
+C411..C42B ; LVT # Lo [27] HANGUL SYLLABLE SSWAEG..HANGUL SYLLABLE SSWAEH
+C42D..C447 ; LVT # Lo [27] HANGUL SYLLABLE SSOEG..HANGUL SYLLABLE SSOEH
+C449..C463 ; LVT # Lo [27] HANGUL SYLLABLE SSYOG..HANGUL SYLLABLE SSYOH
+C465..C47F ; LVT # Lo [27] HANGUL SYLLABLE SSUG..HANGUL SYLLABLE SSUH
+C481..C49B ; LVT # Lo [27] HANGUL SYLLABLE SSWEOG..HANGUL SYLLABLE SSWEOH
+C49D..C4B7 ; LVT # Lo [27] HANGUL SYLLABLE SSWEG..HANGUL SYLLABLE SSWEH
+C4B9..C4D3 ; LVT # Lo [27] HANGUL SYLLABLE SSWIG..HANGUL SYLLABLE SSWIH
+C4D5..C4EF ; LVT # Lo [27] HANGUL SYLLABLE SSYUG..HANGUL SYLLABLE SSYUH
+C4F1..C50B ; LVT # Lo [27] HANGUL SYLLABLE SSEUG..HANGUL SYLLABLE SSEUH
+C50D..C527 ; LVT # Lo [27] HANGUL SYLLABLE SSYIG..HANGUL SYLLABLE SSYIH
+C529..C543 ; LVT # Lo [27] HANGUL SYLLABLE SSIG..HANGUL SYLLABLE SSIH
+C545..C55F ; LVT # Lo [27] HANGUL SYLLABLE AG..HANGUL SYLLABLE AH
+C561..C57B ; LVT # Lo [27] HANGUL SYLLABLE AEG..HANGUL SYLLABLE AEH
+C57D..C597 ; LVT # Lo [27] HANGUL SYLLABLE YAG..HANGUL SYLLABLE YAH
+C599..C5B3 ; LVT # Lo [27] HANGUL SYLLABLE YAEG..HANGUL SYLLABLE YAEH
+C5B5..C5CF ; LVT # Lo [27] HANGUL SYLLABLE EOG..HANGUL SYLLABLE EOH
+C5D1..C5EB ; LVT # Lo [27] HANGUL SYLLABLE EG..HANGUL SYLLABLE EH
+C5ED..C607 ; LVT # Lo [27] HANGUL SYLLABLE YEOG..HANGUL SYLLABLE YEOH
+C609..C623 ; LVT # Lo [27] HANGUL SYLLABLE YEG..HANGUL SYLLABLE YEH
+C625..C63F ; LVT # Lo [27] HANGUL SYLLABLE OG..HANGUL SYLLABLE OH
+C641..C65B ; LVT # Lo [27] HANGUL SYLLABLE WAG..HANGUL SYLLABLE WAH
+C65D..C677 ; LVT # Lo [27] HANGUL SYLLABLE WAEG..HANGUL SYLLABLE WAEH
+C679..C693 ; LVT # Lo [27] HANGUL SYLLABLE OEG..HANGUL SYLLABLE OEH
+C695..C6AF ; LVT # Lo [27] HANGUL SYLLABLE YOG..HANGUL SYLLABLE YOH
+C6B1..C6CB ; LVT # Lo [27] HANGUL SYLLABLE UG..HANGUL SYLLABLE UH
+C6CD..C6E7 ; LVT # Lo [27] HANGUL SYLLABLE WEOG..HANGUL SYLLABLE WEOH
+C6E9..C703 ; LVT # Lo [27] HANGUL SYLLABLE WEG..HANGUL SYLLABLE WEH
+C705..C71F ; LVT # Lo [27] HANGUL SYLLABLE WIG..HANGUL SYLLABLE WIH
+C721..C73B ; LVT # Lo [27] HANGUL SYLLABLE YUG..HANGUL SYLLABLE YUH
+C73D..C757 ; LVT # Lo [27] HANGUL SYLLABLE EUG..HANGUL SYLLABLE EUH
+C759..C773 ; LVT # Lo [27] HANGUL SYLLABLE YIG..HANGUL SYLLABLE YIH
+C775..C78F ; LVT # Lo [27] HANGUL SYLLABLE IG..HANGUL SYLLABLE IH
+C791..C7AB ; LVT # Lo [27] HANGUL SYLLABLE JAG..HANGUL SYLLABLE JAH
+C7AD..C7C7 ; LVT # Lo [27] HANGUL SYLLABLE JAEG..HANGUL SYLLABLE JAEH
+C7C9..C7E3 ; LVT # Lo [27] HANGUL SYLLABLE JYAG..HANGUL SYLLABLE JYAH
+C7E5..C7FF ; LVT # Lo [27] HANGUL SYLLABLE JYAEG..HANGUL SYLLABLE JYAEH
+C801..C81B ; LVT # Lo [27] HANGUL SYLLABLE JEOG..HANGUL SYLLABLE JEOH
+C81D..C837 ; LVT # Lo [27] HANGUL SYLLABLE JEG..HANGUL SYLLABLE JEH
+C839..C853 ; LVT # Lo [27] HANGUL SYLLABLE JYEOG..HANGUL SYLLABLE JYEOH
+C855..C86F ; LVT # Lo [27] HANGUL SYLLABLE JYEG..HANGUL SYLLABLE JYEH
+C871..C88B ; LVT # Lo [27] HANGUL SYLLABLE JOG..HANGUL SYLLABLE JOH
+C88D..C8A7 ; LVT # Lo [27] HANGUL SYLLABLE JWAG..HANGUL SYLLABLE JWAH
+C8A9..C8C3 ; LVT # Lo [27] HANGUL SYLLABLE JWAEG..HANGUL SYLLABLE JWAEH
+C8C5..C8DF ; LVT # Lo [27] HANGUL SYLLABLE JOEG..HANGUL SYLLABLE JOEH
+C8E1..C8FB ; LVT # Lo [27] HANGUL SYLLABLE JYOG..HANGUL SYLLABLE JYOH
+C8FD..C917 ; LVT # Lo [27] HANGUL SYLLABLE JUG..HANGUL SYLLABLE JUH
+C919..C933 ; LVT # Lo [27] HANGUL SYLLABLE JWEOG..HANGUL SYLLABLE JWEOH
+C935..C94F ; LVT # Lo [27] HANGUL SYLLABLE JWEG..HANGUL SYLLABLE JWEH
+C951..C96B ; LVT # Lo [27] HANGUL SYLLABLE JWIG..HANGUL SYLLABLE JWIH
+C96D..C987 ; LVT # Lo [27] HANGUL SYLLABLE JYUG..HANGUL SYLLABLE JYUH
+C989..C9A3 ; LVT # Lo [27] HANGUL SYLLABLE JEUG..HANGUL SYLLABLE JEUH
+C9A5..C9BF ; LVT # Lo [27] HANGUL SYLLABLE JYIG..HANGUL SYLLABLE JYIH
+C9C1..C9DB ; LVT # Lo [27] HANGUL SYLLABLE JIG..HANGUL SYLLABLE JIH
+C9DD..C9F7 ; LVT # Lo [27] HANGUL SYLLABLE JJAG..HANGUL SYLLABLE JJAH
+C9F9..CA13 ; LVT # Lo [27] HANGUL SYLLABLE JJAEG..HANGUL SYLLABLE JJAEH
+CA15..CA2F ; LVT # Lo [27] HANGUL SYLLABLE JJYAG..HANGUL SYLLABLE JJYAH
+CA31..CA4B ; LVT # Lo [27] HANGUL SYLLABLE JJYAEG..HANGUL SYLLABLE JJYAEH
+CA4D..CA67 ; LVT # Lo [27] HANGUL SYLLABLE JJEOG..HANGUL SYLLABLE JJEOH
+CA69..CA83 ; LVT # Lo [27] HANGUL SYLLABLE JJEG..HANGUL SYLLABLE JJEH
+CA85..CA9F ; LVT # Lo [27] HANGUL SYLLABLE JJYEOG..HANGUL SYLLABLE JJYEOH
+CAA1..CABB ; LVT # Lo [27] HANGUL SYLLABLE JJYEG..HANGUL SYLLABLE JJYEH
+CABD..CAD7 ; LVT # Lo [27] HANGUL SYLLABLE JJOG..HANGUL SYLLABLE JJOH
+CAD9..CAF3 ; LVT # Lo [27] HANGUL SYLLABLE JJWAG..HANGUL SYLLABLE JJWAH
+CAF5..CB0F ; LVT # Lo [27] HANGUL SYLLABLE JJWAEG..HANGUL SYLLABLE JJWAEH
+CB11..CB2B ; LVT # Lo [27] HANGUL SYLLABLE JJOEG..HANGUL SYLLABLE JJOEH
+CB2D..CB47 ; LVT # Lo [27] HANGUL SYLLABLE JJYOG..HANGUL SYLLABLE JJYOH
+CB49..CB63 ; LVT # Lo [27] HANGUL SYLLABLE JJUG..HANGUL SYLLABLE JJUH
+CB65..CB7F ; LVT # Lo [27] HANGUL SYLLABLE JJWEOG..HANGUL SYLLABLE JJWEOH
+CB81..CB9B ; LVT # Lo [27] HANGUL SYLLABLE JJWEG..HANGUL SYLLABLE JJWEH
+CB9D..CBB7 ; LVT # Lo [27] HANGUL SYLLABLE JJWIG..HANGUL SYLLABLE JJWIH
+CBB9..CBD3 ; LVT # Lo [27] HANGUL SYLLABLE JJYUG..HANGUL SYLLABLE JJYUH
+CBD5..CBEF ; LVT # Lo [27] HANGUL SYLLABLE JJEUG..HANGUL SYLLABLE JJEUH
+CBF1..CC0B ; LVT # Lo [27] HANGUL SYLLABLE JJYIG..HANGUL SYLLABLE JJYIH
+CC0D..CC27 ; LVT # Lo [27] HANGUL SYLLABLE JJIG..HANGUL SYLLABLE JJIH
+CC29..CC43 ; LVT # Lo [27] HANGUL SYLLABLE CAG..HANGUL SYLLABLE CAH
+CC45..CC5F ; LVT # Lo [27] HANGUL SYLLABLE CAEG..HANGUL SYLLABLE CAEH
+CC61..CC7B ; LVT # Lo [27] HANGUL SYLLABLE CYAG..HANGUL SYLLABLE CYAH
+CC7D..CC97 ; LVT # Lo [27] HANGUL SYLLABLE CYAEG..HANGUL SYLLABLE CYAEH
+CC99..CCB3 ; LVT # Lo [27] HANGUL SYLLABLE CEOG..HANGUL SYLLABLE CEOH
+CCB5..CCCF ; LVT # Lo [27] HANGUL SYLLABLE CEG..HANGUL SYLLABLE CEH
+CCD1..CCEB ; LVT # Lo [27] HANGUL SYLLABLE CYEOG..HANGUL SYLLABLE CYEOH
+CCED..CD07 ; LVT # Lo [27] HANGUL SYLLABLE CYEG..HANGUL SYLLABLE CYEH
+CD09..CD23 ; LVT # Lo [27] HANGUL SYLLABLE COG..HANGUL SYLLABLE COH
+CD25..CD3F ; LVT # Lo [27] HANGUL SYLLABLE CWAG..HANGUL SYLLABLE CWAH
+CD41..CD5B ; LVT # Lo [27] HANGUL SYLLABLE CWAEG..HANGUL SYLLABLE CWAEH
+CD5D..CD77 ; LVT # Lo [27] HANGUL SYLLABLE COEG..HANGUL SYLLABLE COEH
+CD79..CD93 ; LVT # Lo [27] HANGUL SYLLABLE CYOG..HANGUL SYLLABLE CYOH
+CD95..CDAF ; LVT # Lo [27] HANGUL SYLLABLE CUG..HANGUL SYLLABLE CUH
+CDB1..CDCB ; LVT # Lo [27] HANGUL SYLLABLE CWEOG..HANGUL SYLLABLE CWEOH
+CDCD..CDE7 ; LVT # Lo [27] HANGUL SYLLABLE CWEG..HANGUL SYLLABLE CWEH
+CDE9..CE03 ; LVT # Lo [27] HANGUL SYLLABLE CWIG..HANGUL SYLLABLE CWIH
+CE05..CE1F ; LVT # Lo [27] HANGUL SYLLABLE CYUG..HANGUL SYLLABLE CYUH
+CE21..CE3B ; LVT # Lo [27] HANGUL SYLLABLE CEUG..HANGUL SYLLABLE CEUH
+CE3D..CE57 ; LVT # Lo [27] HANGUL SYLLABLE CYIG..HANGUL SYLLABLE CYIH
+CE59..CE73 ; LVT # Lo [27] HANGUL SYLLABLE CIG..HANGUL SYLLABLE CIH
+CE75..CE8F ; LVT # Lo [27] HANGUL SYLLABLE KAG..HANGUL SYLLABLE KAH
+CE91..CEAB ; LVT # Lo [27] HANGUL SYLLABLE KAEG..HANGUL SYLLABLE KAEH
+CEAD..CEC7 ; LVT # Lo [27] HANGUL SYLLABLE KYAG..HANGUL SYLLABLE KYAH
+CEC9..CEE3 ; LVT # Lo [27] HANGUL SYLLABLE KYAEG..HANGUL SYLLABLE KYAEH
+CEE5..CEFF ; LVT # Lo [27] HANGUL SYLLABLE KEOG..HANGUL SYLLABLE KEOH
+CF01..CF1B ; LVT # Lo [27] HANGUL SYLLABLE KEG..HANGUL SYLLABLE KEH
+CF1D..CF37 ; LVT # Lo [27] HANGUL SYLLABLE KYEOG..HANGUL SYLLABLE KYEOH
+CF39..CF53 ; LVT # Lo [27] HANGUL SYLLABLE KYEG..HANGUL SYLLABLE KYEH
+CF55..CF6F ; LVT # Lo [27] HANGUL SYLLABLE KOG..HANGUL SYLLABLE KOH
+CF71..CF8B ; LVT # Lo [27] HANGUL SYLLABLE KWAG..HANGUL SYLLABLE KWAH
+CF8D..CFA7 ; LVT # Lo [27] HANGUL SYLLABLE KWAEG..HANGUL SYLLABLE KWAEH
+CFA9..CFC3 ; LVT # Lo [27] HANGUL SYLLABLE KOEG..HANGUL SYLLABLE KOEH
+CFC5..CFDF ; LVT # Lo [27] HANGUL SYLLABLE KYOG..HANGUL SYLLABLE KYOH
+CFE1..CFFB ; LVT # Lo [27] HANGUL SYLLABLE KUG..HANGUL SYLLABLE KUH
+CFFD..D017 ; LVT # Lo [27] HANGUL SYLLABLE KWEOG..HANGUL SYLLABLE KWEOH
+D019..D033 ; LVT # Lo [27] HANGUL SYLLABLE KWEG..HANGUL SYLLABLE KWEH
+D035..D04F ; LVT # Lo [27] HANGUL SYLLABLE KWIG..HANGUL SYLLABLE KWIH
+D051..D06B ; LVT # Lo [27] HANGUL SYLLABLE KYUG..HANGUL SYLLABLE KYUH
+D06D..D087 ; LVT # Lo [27] HANGUL SYLLABLE KEUG..HANGUL SYLLABLE KEUH
+D089..D0A3 ; LVT # Lo [27] HANGUL SYLLABLE KYIG..HANGUL SYLLABLE KYIH
+D0A5..D0BF ; LVT # Lo [27] HANGUL SYLLABLE KIG..HANGUL SYLLABLE KIH
+D0C1..D0DB ; LVT # Lo [27] HANGUL SYLLABLE TAG..HANGUL SYLLABLE TAH
+D0DD..D0F7 ; LVT # Lo [27] HANGUL SYLLABLE TAEG..HANGUL SYLLABLE TAEH
+D0F9..D113 ; LVT # Lo [27] HANGUL SYLLABLE TYAG..HANGUL SYLLABLE TYAH
+D115..D12F ; LVT # Lo [27] HANGUL SYLLABLE TYAEG..HANGUL SYLLABLE TYAEH
+D131..D14B ; LVT # Lo [27] HANGUL SYLLABLE TEOG..HANGUL SYLLABLE TEOH
+D14D..D167 ; LVT # Lo [27] HANGUL SYLLABLE TEG..HANGUL SYLLABLE TEH
+D169..D183 ; LVT # Lo [27] HANGUL SYLLABLE TYEOG..HANGUL SYLLABLE TYEOH
+D185..D19F ; LVT # Lo [27] HANGUL SYLLABLE TYEG..HANGUL SYLLABLE TYEH
+D1A1..D1BB ; LVT # Lo [27] HANGUL SYLLABLE TOG..HANGUL SYLLABLE TOH
+D1BD..D1D7 ; LVT # Lo [27] HANGUL SYLLABLE TWAG..HANGUL SYLLABLE TWAH
+D1D9..D1F3 ; LVT # Lo [27] HANGUL SYLLABLE TWAEG..HANGUL SYLLABLE TWAEH
+D1F5..D20F ; LVT # Lo [27] HANGUL SYLLABLE TOEG..HANGUL SYLLABLE TOEH
+D211..D22B ; LVT # Lo [27] HANGUL SYLLABLE TYOG..HANGUL SYLLABLE TYOH
+D22D..D247 ; LVT # Lo [27] HANGUL SYLLABLE TUG..HANGUL SYLLABLE TUH
+D249..D263 ; LVT # Lo [27] HANGUL SYLLABLE TWEOG..HANGUL SYLLABLE TWEOH
+D265..D27F ; LVT # Lo [27] HANGUL SYLLABLE TWEG..HANGUL SYLLABLE TWEH
+D281..D29B ; LVT # Lo [27] HANGUL SYLLABLE TWIG..HANGUL SYLLABLE TWIH
+D29D..D2B7 ; LVT # Lo [27] HANGUL SYLLABLE TYUG..HANGUL SYLLABLE TYUH
+D2B9..D2D3 ; LVT # Lo [27] HANGUL SYLLABLE TEUG..HANGUL SYLLABLE TEUH
+D2D5..D2EF ; LVT # Lo [27] HANGUL SYLLABLE TYIG..HANGUL SYLLABLE TYIH
+D2F1..D30B ; LVT # Lo [27] HANGUL SYLLABLE TIG..HANGUL SYLLABLE TIH
+D30D..D327 ; LVT # Lo [27] HANGUL SYLLABLE PAG..HANGUL SYLLABLE PAH
+D329..D343 ; LVT # Lo [27] HANGUL SYLLABLE PAEG..HANGUL SYLLABLE PAEH
+D345..D35F ; LVT # Lo [27] HANGUL SYLLABLE PYAG..HANGUL SYLLABLE PYAH
+D361..D37B ; LVT # Lo [27] HANGUL SYLLABLE PYAEG..HANGUL SYLLABLE PYAEH
+D37D..D397 ; LVT # Lo [27] HANGUL SYLLABLE PEOG..HANGUL SYLLABLE PEOH
+D399..D3B3 ; LVT # Lo [27] HANGUL SYLLABLE PEG..HANGUL SYLLABLE PEH
+D3B5..D3CF ; LVT # Lo [27] HANGUL SYLLABLE PYEOG..HANGUL SYLLABLE PYEOH
+D3D1..D3EB ; LVT # Lo [27] HANGUL SYLLABLE PYEG..HANGUL SYLLABLE PYEH
+D3ED..D407 ; LVT # Lo [27] HANGUL SYLLABLE POG..HANGUL SYLLABLE POH
+D409..D423 ; LVT # Lo [27] HANGUL SYLLABLE PWAG..HANGUL SYLLABLE PWAH
+D425..D43F ; LVT # Lo [27] HANGUL SYLLABLE PWAEG..HANGUL SYLLABLE PWAEH
+D441..D45B ; LVT # Lo [27] HANGUL SYLLABLE POEG..HANGUL SYLLABLE POEH
+D45D..D477 ; LVT # Lo [27] HANGUL SYLLABLE PYOG..HANGUL SYLLABLE PYOH
+D479..D493 ; LVT # Lo [27] HANGUL SYLLABLE PUG..HANGUL SYLLABLE PUH
+D495..D4AF ; LVT # Lo [27] HANGUL SYLLABLE PWEOG..HANGUL SYLLABLE PWEOH
+D4B1..D4CB ; LVT # Lo [27] HANGUL SYLLABLE PWEG..HANGUL SYLLABLE PWEH
+D4CD..D4E7 ; LVT # Lo [27] HANGUL SYLLABLE PWIG..HANGUL SYLLABLE PWIH
+D4E9..D503 ; LVT # Lo [27] HANGUL SYLLABLE PYUG..HANGUL SYLLABLE PYUH
+D505..D51F ; LVT # Lo [27] HANGUL SYLLABLE PEUG..HANGUL SYLLABLE PEUH
+D521..D53B ; LVT # Lo [27] HANGUL SYLLABLE PYIG..HANGUL SYLLABLE PYIH
+D53D..D557 ; LVT # Lo [27] HANGUL SYLLABLE PIG..HANGUL SYLLABLE PIH
+D559..D573 ; LVT # Lo [27] HANGUL SYLLABLE HAG..HANGUL SYLLABLE HAH
+D575..D58F ; LVT # Lo [27] HANGUL SYLLABLE HAEG..HANGUL SYLLABLE HAEH
+D591..D5AB ; LVT # Lo [27] HANGUL SYLLABLE HYAG..HANGUL SYLLABLE HYAH
+D5AD..D5C7 ; LVT # Lo [27] HANGUL SYLLABLE HYAEG..HANGUL SYLLABLE HYAEH
+D5C9..D5E3 ; LVT # Lo [27] HANGUL SYLLABLE HEOG..HANGUL SYLLABLE HEOH
+D5E5..D5FF ; LVT # Lo [27] HANGUL SYLLABLE HEG..HANGUL SYLLABLE HEH
+D601..D61B ; LVT # Lo [27] HANGUL SYLLABLE HYEOG..HANGUL SYLLABLE HYEOH
+D61D..D637 ; LVT # Lo [27] HANGUL SYLLABLE HYEG..HANGUL SYLLABLE HYEH
+D639..D653 ; LVT # Lo [27] HANGUL SYLLABLE HOG..HANGUL SYLLABLE HOH
+D655..D66F ; LVT # Lo [27] HANGUL SYLLABLE HWAG..HANGUL SYLLABLE HWAH
+D671..D68B ; LVT # Lo [27] HANGUL SYLLABLE HWAEG..HANGUL SYLLABLE HWAEH
+D68D..D6A7 ; LVT # Lo [27] HANGUL SYLLABLE HOEG..HANGUL SYLLABLE HOEH
+D6A9..D6C3 ; LVT # Lo [27] HANGUL SYLLABLE HYOG..HANGUL SYLLABLE HYOH
+D6C5..D6DF ; LVT # Lo [27] HANGUL SYLLABLE HUG..HANGUL SYLLABLE HUH
+D6E1..D6FB ; LVT # Lo [27] HANGUL SYLLABLE HWEOG..HANGUL SYLLABLE HWEOH
+D6FD..D717 ; LVT # Lo [27] HANGUL SYLLABLE HWEG..HANGUL SYLLABLE HWEH
+D719..D733 ; LVT # Lo [27] HANGUL SYLLABLE HWIG..HANGUL SYLLABLE HWIH
+D735..D74F ; LVT # Lo [27] HANGUL SYLLABLE HYUG..HANGUL SYLLABLE HYUH
+D751..D76B ; LVT # Lo [27] HANGUL SYLLABLE HEUG..HANGUL SYLLABLE HEUH
+D76D..D787 ; LVT # Lo [27] HANGUL SYLLABLE HYIG..HANGUL SYLLABLE HYIH
+D789..D7A3 ; LVT # Lo [27] HANGUL SYLLABLE HIG..HANGUL SYLLABLE HIH
+
+# Total code points: 10773
+
+# EOF
@@ -1,4 +1,5 @@
# Copyright (C) 2015-2024 Free Software Foundation, Inc.
+# Copyright (C) 2024 The GNU Toolchain Authors.
# This file is part of the GNU C Library.
# The GNU C Library is free software; you can redistribute it and/or
@@ -40,7 +41,7 @@ UNICODE_VERSION = 15.1.0
PYTHON3 = python3
WGET = wget
-DOWNLOADS = UnicodeData.txt DerivedCoreProperties.txt EastAsianWidth.txt PropList.txt
+DOWNLOADS = UnicodeData.txt DerivedCoreProperties.txt EastAsianWidth.txt HangulSyllableType.txt
GENERATED = i18n_ctype tr_TR UTF-8 translit_combining translit_compat translit_circle translit_cjk_compat translit_font translit_fraction
REPORTS = i18n_ctype-report UTF-8-report
@@ -90,10 +91,12 @@ tr_TR: gen_unicode_ctype.py
-d DerivedCoreProperties.txt -i ../locales/tr_TR -o $@ \
--unicode_version $(UNICODE_VERSION) --turkish
-UTF-8: UnicodeData.txt EastAsianWidth.txt
+UTF-8: UnicodeData.txt DerivedCoreProperties.txt EastAsianWidth.txt HangulSyllableType.txt
UTF-8: utf8_gen.py
$(PYTHON3) utf8_gen.py -u UnicodeData.txt \
- -e EastAsianWidth.txt -p PropList.txt \
+ -d DerivedCoreProperties.txt \
+ -e EastAsianWidth.txt \
+ -k HangulSyllableType.txt \
--unicode_version $(UNICODE_VERSION)
UTF-8-report: UTF-8 ../charmaps/UTF-8
deleted file mode 100644
@@ -1,1827 +0,0 @@
-# PropList-15.1.0.txt
-# Date: 2023-08-01, 21:56:53 GMT
-# © 2023 Unicode®, Inc.
-# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
-# For terms of use, see https://www.unicode.org/terms_of_use.html
-#
-# Unicode Character Database
-# For documentation, see https://www.unicode.org/reports/tr44/
-
-# ================================================
-
-0009..000D ; White_Space # Cc [5] <control-0009>..<control-000D>
-0020 ; White_Space # Zs SPACE
-0085 ; White_Space # Cc <control-0085>
-00A0 ; White_Space # Zs NO-BREAK SPACE
-1680 ; White_Space # Zs OGHAM SPACE MARK
-2000..200A ; White_Space # Zs [11] EN QUAD..HAIR SPACE
-2028 ; White_Space # Zl LINE SEPARATOR
-2029 ; White_Space # Zp PARAGRAPH SEPARATOR
-202F ; White_Space # Zs NARROW NO-BREAK SPACE
-205F ; White_Space # Zs MEDIUM MATHEMATICAL SPACE
-3000 ; White_Space # Zs IDEOGRAPHIC SPACE
-
-# Total code points: 25
-
-# ================================================
-
-061C ; Bidi_Control # Cf ARABIC LETTER MARK
-200E..200F ; Bidi_Control # Cf [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
-202A..202E ; Bidi_Control # Cf [5] LEFT-TO-RIGHT EMBEDDING..RIGHT-TO-LEFT OVERRIDE
-2066..2069 ; Bidi_Control # Cf [4] LEFT-TO-RIGHT ISOLATE..POP DIRECTIONAL ISOLATE
-
-# Total code points: 12
-
-# ================================================
-
-200C..200D ; Join_Control # Cf [2] ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER
-
-# Total code points: 2
-
-# ================================================
-
-002D ; Dash # Pd HYPHEN-MINUS
-058A ; Dash # Pd ARMENIAN HYPHEN
-05BE ; Dash # Pd HEBREW PUNCTUATION MAQAF
-1400 ; Dash # Pd CANADIAN SYLLABICS HYPHEN
-1806 ; Dash # Pd MONGOLIAN TODO SOFT HYPHEN
-2010..2015 ; Dash # Pd [6] HYPHEN..HORIZONTAL BAR
-2053 ; Dash # Po SWUNG DASH
-207B ; Dash # Sm SUPERSCRIPT MINUS
-208B ; Dash # Sm SUBSCRIPT MINUS
-2212 ; Dash # Sm MINUS SIGN
-2E17 ; Dash # Pd DOUBLE OBLIQUE HYPHEN
-2E1A ; Dash # Pd HYPHEN WITH DIAERESIS
-2E3A..2E3B ; Dash # Pd [2] TWO-EM DASH..THREE-EM DASH
-2E40 ; Dash # Pd DOUBLE HYPHEN
-2E5D ; Dash # Pd OBLIQUE HYPHEN
-301C ; Dash # Pd WAVE DASH
-3030 ; Dash # Pd WAVY DASH
-30A0 ; Dash # Pd KATAKANA-HIRAGANA DOUBLE HYPHEN
-FE31..FE32 ; Dash # Pd [2] PRESENTATION FORM FOR VERTICAL EM DASH..PRESENTATION FORM FOR VERTICAL EN DASH
-FE58 ; Dash # Pd SMALL EM DASH
-FE63 ; Dash # Pd SMALL HYPHEN-MINUS
-FF0D ; Dash # Pd FULLWIDTH HYPHEN-MINUS
-10EAD ; Dash # Pd YEZIDI HYPHENATION MARK
-
-# Total code points: 30
-
-# ================================================
-
-002D ; Hyphen # Pd HYPHEN-MINUS
-00AD ; Hyphen # Cf SOFT HYPHEN
-058A ; Hyphen # Pd ARMENIAN HYPHEN
-1806 ; Hyphen # Pd MONGOLIAN TODO SOFT HYPHEN
-2010..2011 ; Hyphen # Pd [2] HYPHEN..NON-BREAKING HYPHEN
-2E17 ; Hyphen # Pd DOUBLE OBLIQUE HYPHEN
-30FB ; Hyphen # Po KATAKANA MIDDLE DOT
-FE63 ; Hyphen # Pd SMALL HYPHEN-MINUS
-FF0D ; Hyphen # Pd FULLWIDTH HYPHEN-MINUS
-FF65 ; Hyphen # Po HALFWIDTH KATAKANA MIDDLE DOT
-
-# Total code points: 11
-
-# ================================================
-
-0022 ; Quotation_Mark # Po QUOTATION MARK
-0027 ; Quotation_Mark # Po APOSTROPHE
-00AB ; Quotation_Mark # Pi LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
-00BB ; Quotation_Mark # Pf RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
-2018 ; Quotation_Mark # Pi LEFT SINGLE QUOTATION MARK
-2019 ; Quotation_Mark # Pf RIGHT SINGLE QUOTATION MARK
-201A ; Quotation_Mark # Ps SINGLE LOW-9 QUOTATION MARK
-201B..201C ; Quotation_Mark # Pi [2] SINGLE HIGH-REVERSED-9 QUOTATION MARK..LEFT DOUBLE QUOTATION MARK
-201D ; Quotation_Mark # Pf RIGHT DOUBLE QUOTATION MARK
-201E ; Quotation_Mark # Ps DOUBLE LOW-9 QUOTATION MARK
-201F ; Quotation_Mark # Pi DOUBLE HIGH-REVERSED-9 QUOTATION MARK
-2039 ; Quotation_Mark # Pi SINGLE LEFT-POINTING ANGLE QUOTATION MARK
-203A ; Quotation_Mark # Pf SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
-2E42 ; Quotation_Mark # Ps DOUBLE LOW-REVERSED-9 QUOTATION MARK
-300C ; Quotation_Mark # Ps LEFT CORNER BRACKET
-300D ; Quotation_Mark # Pe RIGHT CORNER BRACKET
-300E ; Quotation_Mark # Ps LEFT WHITE CORNER BRACKET
-300F ; Quotation_Mark # Pe RIGHT WHITE CORNER BRACKET
-301D ; Quotation_Mark # Ps REVERSED DOUBLE PRIME QUOTATION MARK
-301E..301F ; Quotation_Mark # Pe [2] DOUBLE PRIME QUOTATION MARK..LOW DOUBLE PRIME QUOTATION MARK
-FE41 ; Quotation_Mark # Ps PRESENTATION FORM FOR VERTICAL LEFT CORNER BRACKET
-FE42 ; Quotation_Mark # Pe PRESENTATION FORM FOR VERTICAL RIGHT CORNER BRACKET
-FE43 ; Quotation_Mark # Ps PRESENTATION FORM FOR VERTICAL LEFT WHITE CORNER BRACKET
-FE44 ; Quotation_Mark # Pe PRESENTATION FORM FOR VERTICAL RIGHT WHITE CORNER BRACKET
-FF02 ; Quotation_Mark # Po FULLWIDTH QUOTATION MARK
-FF07 ; Quotation_Mark # Po FULLWIDTH APOSTROPHE
-FF62 ; Quotation_Mark # Ps HALFWIDTH LEFT CORNER BRACKET
-FF63 ; Quotation_Mark # Pe HALFWIDTH RIGHT CORNER BRACKET
-
-# Total code points: 30
-
-# ================================================
-
-0021 ; Terminal_Punctuation # Po EXCLAMATION MARK
-002C ; Terminal_Punctuation # Po COMMA
-002E ; Terminal_Punctuation # Po FULL STOP
-003A..003B ; Terminal_Punctuation # Po [2] COLON..SEMICOLON
-003F ; Terminal_Punctuation # Po QUESTION MARK
-037E ; Terminal_Punctuation # Po GREEK QUESTION MARK
-0387 ; Terminal_Punctuation # Po GREEK ANO TELEIA
-0589 ; Terminal_Punctuation # Po ARMENIAN FULL STOP
-05C3 ; Terminal_Punctuation # Po HEBREW PUNCTUATION SOF PASUQ
-060C ; Terminal_Punctuation # Po ARABIC COMMA
-061B ; Terminal_Punctuation # Po ARABIC SEMICOLON
-061D..061F ; Terminal_Punctuation # Po [3] ARABIC END OF TEXT MARK..ARABIC QUESTION MARK
-06D4 ; Terminal_Punctuation # Po ARABIC FULL STOP
-0700..070A ; Terminal_Punctuation # Po [11] SYRIAC END OF PARAGRAPH..SYRIAC CONTRACTION
-070C ; Terminal_Punctuation # Po SYRIAC HARKLEAN METOBELUS
-07F8..07F9 ; Terminal_Punctuation # Po [2] NKO COMMA..NKO EXCLAMATION MARK
-0830..083E ; Terminal_Punctuation # Po [15] SAMARITAN PUNCTUATION NEQUDAA..SAMARITAN PUNCTUATION ANNAAU
-085E ; Terminal_Punctuation # Po MANDAIC PUNCTUATION
-0964..0965 ; Terminal_Punctuation # Po [2] DEVANAGARI DANDA..DEVANAGARI DOUBLE DANDA
-0E5A..0E5B ; Terminal_Punctuation # Po [2] THAI CHARACTER ANGKHANKHU..THAI CHARACTER KHOMUT
-0F08 ; Terminal_Punctuation # Po TIBETAN MARK SBRUL SHAD
-0F0D..0F12 ; Terminal_Punctuation # Po [6] TIBETAN MARK SHAD..TIBETAN MARK RGYA GRAM SHAD
-104A..104B ; Terminal_Punctuation # Po [2] MYANMAR SIGN LITTLE SECTION..MYANMAR SIGN SECTION
-1361..1368 ; Terminal_Punctuation # Po [8] ETHIOPIC WORDSPACE..ETHIOPIC PARAGRAPH SEPARATOR
-166E ; Terminal_Punctuation # Po CANADIAN SYLLABICS FULL STOP
-16EB..16ED ; Terminal_Punctuation # Po [3] RUNIC SINGLE PUNCTUATION..RUNIC CROSS PUNCTUATION
-1735..1736 ; Terminal_Punctuation # Po [2] PHILIPPINE SINGLE PUNCTUATION..PHILIPPINE DOUBLE PUNCTUATION
-17D4..17D6 ; Terminal_Punctuation # Po [3] KHMER SIGN KHAN..KHMER SIGN CAMNUC PII KUUH
-17DA ; Terminal_Punctuation # Po KHMER SIGN KOOMUUT
-1802..1805 ; Terminal_Punctuation # Po [4] MONGOLIAN COMMA..MONGOLIAN FOUR DOTS
-1808..1809 ; Terminal_Punctuation # Po [2] MONGOLIAN MANCHU COMMA..MONGOLIAN MANCHU FULL STOP
-1944..1945 ; Terminal_Punctuation # Po [2] LIMBU EXCLAMATION MARK..LIMBU QUESTION MARK
-1AA8..1AAB ; Terminal_Punctuation # Po [4] TAI THAM SIGN KAAN..TAI THAM SIGN SATKAANKUU
-1B5A..1B5B ; Terminal_Punctuation # Po [2] BALINESE PANTI..BALINESE PAMADA
-1B5D..1B5F ; Terminal_Punctuation # Po [3] BALINESE CARIK PAMUNGKAH..BALINESE CARIK PAREREN
-1B7D..1B7E ; Terminal_Punctuation # Po [2] BALINESE PANTI LANTANG..BALINESE PAMADA LANTANG
-1C3B..1C3F ; Terminal_Punctuation # Po [5] LEPCHA PUNCTUATION TA-ROL..LEPCHA PUNCTUATION TSHOOK
-1C7E..1C7F ; Terminal_Punctuation # Po [2] OL CHIKI PUNCTUATION MUCAAD..OL CHIKI PUNCTUATION DOUBLE MUCAAD
-203C..203D ; Terminal_Punctuation # Po [2] DOUBLE EXCLAMATION MARK..INTERROBANG
-2047..2049 ; Terminal_Punctuation # Po [3] DOUBLE QUESTION MARK..EXCLAMATION QUESTION MARK
-2E2E ; Terminal_Punctuation # Po REVERSED QUESTION MARK
-2E3C ; Terminal_Punctuation # Po STENOGRAPHIC FULL STOP
-2E41 ; Terminal_Punctuation # Po REVERSED COMMA
-2E4C ; Terminal_Punctuation # Po MEDIEVAL COMMA
-2E4E..2E4F ; Terminal_Punctuation # Po [2] PUNCTUS ELEVATUS MARK..CORNISH VERSE DIVIDER
-2E53..2E54 ; Terminal_Punctuation # Po [2] MEDIEVAL EXCLAMATION MARK..MEDIEVAL QUESTION MARK
-3001..3002 ; Terminal_Punctuation # Po [2] IDEOGRAPHIC COMMA..IDEOGRAPHIC FULL STOP
-A4FE..A4FF ; Terminal_Punctuation # Po [2] LISU PUNCTUATION COMMA..LISU PUNCTUATION FULL STOP
-A60D..A60F ; Terminal_Punctuation # Po [3] VAI COMMA..VAI QUESTION MARK
-A6F3..A6F7 ; Terminal_Punctuation # Po [5] BAMUM FULL STOP..BAMUM QUESTION MARK
-A876..A877 ; Terminal_Punctuation # Po [2] PHAGS-PA MARK SHAD..PHAGS-PA MARK DOUBLE SHAD
-A8CE..A8CF ; Terminal_Punctuation # Po [2] SAURASHTRA DANDA..SAURASHTRA DOUBLE DANDA
-A92F ; Terminal_Punctuation # Po KAYAH LI SIGN SHYA
-A9C7..A9C9 ; Terminal_Punctuation # Po [3] JAVANESE PADA PANGKAT..JAVANESE PADA LUNGSI
-AA5D..AA5F ; Terminal_Punctuation # Po [3] CHAM PUNCTUATION DANDA..CHAM PUNCTUATION TRIPLE DANDA
-AADF ; Terminal_Punctuation # Po TAI VIET SYMBOL KOI KOI
-AAF0..AAF1 ; Terminal_Punctuation # Po [2] MEETEI MAYEK CHEIKHAN..MEETEI MAYEK AHANG KHUDAM
-ABEB ; Terminal_Punctuation # Po MEETEI MAYEK CHEIKHEI
-FE50..FE52 ; Terminal_Punctuation # Po [3] SMALL COMMA..SMALL FULL STOP
-FE54..FE57 ; Terminal_Punctuation # Po [4] SMALL SEMICOLON..SMALL EXCLAMATION MARK
-FF01 ; Terminal_Punctuation # Po FULLWIDTH EXCLAMATION MARK
-FF0C ; Terminal_Punctuation # Po FULLWIDTH COMMA
-FF0E ; Terminal_Punctuation # Po FULLWIDTH FULL STOP
-FF1A..FF1B ; Terminal_Punctuation # Po [2] FULLWIDTH COLON..FULLWIDTH SEMICOLON
-FF1F ; Terminal_Punctuation # Po FULLWIDTH QUESTION MARK
-FF61 ; Terminal_Punctuation # Po HALFWIDTH IDEOGRAPHIC FULL STOP
-FF64 ; Terminal_Punctuation # Po HALFWIDTH IDEOGRAPHIC COMMA
-1039F ; Terminal_Punctuation # Po UGARITIC WORD DIVIDER
-103D0 ; Terminal_Punctuation # Po OLD PERSIAN WORD DIVIDER
-10857 ; Terminal_Punctuation # Po IMPERIAL ARAMAIC SECTION SIGN
-1091F ; Terminal_Punctuation # Po PHOENICIAN WORD SEPARATOR
-10A56..10A57 ; Terminal_Punctuation # Po [2] KHAROSHTHI PUNCTUATION DANDA..KHAROSHTHI PUNCTUATION DOUBLE DANDA
-10AF0..10AF5 ; Terminal_Punctuation # Po [6] MANICHAEAN PUNCTUATION STAR..MANICHAEAN PUNCTUATION TWO DOTS
-10B3A..10B3F ; Terminal_Punctuation # Po [6] TINY TWO DOTS OVER ONE DOT PUNCTUATION..LARGE ONE RING OVER TWO RINGS PUNCTUATION
-10B99..10B9C ; Terminal_Punctuation # Po [4] PSALTER PAHLAVI SECTION MARK..PSALTER PAHLAVI FOUR DOTS WITH DOT
-10F55..10F59 ; Terminal_Punctuation # Po [5] SOGDIAN PUNCTUATION TWO VERTICAL BARS..SOGDIAN PUNCTUATION HALF CIRCLE WITH DOT
-10F86..10F89 ; Terminal_Punctuation # Po [4] OLD UYGHUR PUNCTUATION BAR..OLD UYGHUR PUNCTUATION FOUR DOTS
-11047..1104D ; Terminal_Punctuation # Po [7] BRAHMI DANDA..BRAHMI PUNCTUATION LOTUS
-110BE..110C1 ; Terminal_Punctuation # Po [4] KAITHI SECTION MARK..KAITHI DOUBLE DANDA
-11141..11143 ; Terminal_Punctuation # Po [3] CHAKMA DANDA..CHAKMA QUESTION MARK
-111C5..111C6 ; Terminal_Punctuation # Po [2] SHARADA DANDA..SHARADA DOUBLE DANDA
-111CD ; Terminal_Punctuation # Po SHARADA SUTRA MARK
-111DE..111DF ; Terminal_Punctuation # Po [2] SHARADA SECTION MARK-1..SHARADA SECTION MARK-2
-11238..1123C ; Terminal_Punctuation # Po [5] KHOJKI DANDA..KHOJKI DOUBLE SECTION MARK
-112A9 ; Terminal_Punctuation # Po MULTANI SECTION MARK
-1144B..1144D ; Terminal_Punctuation # Po [3] NEWA DANDA..NEWA COMMA
-1145A..1145B ; Terminal_Punctuation # Po [2] NEWA DOUBLE COMMA..NEWA PLACEHOLDER MARK
-115C2..115C5 ; Terminal_Punctuation # Po [4] SIDDHAM DANDA..SIDDHAM SEPARATOR BAR
-115C9..115D7 ; Terminal_Punctuation # Po [15] SIDDHAM END OF TEXT MARK..SIDDHAM SECTION MARK WITH CIRCLES AND FOUR ENCLOSURES
-11641..11642 ; Terminal_Punctuation # Po [2] MODI DANDA..MODI DOUBLE DANDA
-1173C..1173E ; Terminal_Punctuation # Po [3] AHOM SIGN SMALL SECTION..AHOM SIGN RULAI
-11944 ; Terminal_Punctuation # Po DIVES AKURU DOUBLE DANDA
-11946 ; Terminal_Punctuation # Po DIVES AKURU END OF TEXT MARK
-11A42..11A43 ; Terminal_Punctuation # Po [2] ZANABAZAR SQUARE MARK SHAD..ZANABAZAR SQUARE MARK DOUBLE SHAD
-11A9B..11A9C ; Terminal_Punctuation # Po [2] SOYOMBO MARK SHAD..SOYOMBO MARK DOUBLE SHAD
-11AA1..11AA2 ; Terminal_Punctuation # Po [2] SOYOMBO TERMINAL MARK-1..SOYOMBO TERMINAL MARK-2
-11C41..11C43 ; Terminal_Punctuation # Po [3] BHAIKSUKI DANDA..BHAIKSUKI WORD SEPARATOR
-11C71 ; Terminal_Punctuation # Po MARCHEN MARK SHAD
-11EF7..11EF8 ; Terminal_Punctuation # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
-11F43..11F44 ; Terminal_Punctuation # Po [2] KAWI DANDA..KAWI DOUBLE DANDA
-12470..12474 ; Terminal_Punctuation # Po [5] CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD DIVIDER..CUNEIFORM PUNCTUATION SIGN DIAGONAL QUADCOLON
-16A6E..16A6F ; Terminal_Punctuation # Po [2] MRO DANDA..MRO DOUBLE DANDA
-16AF5 ; Terminal_Punctuation # Po BASSA VAH FULL STOP
-16B37..16B39 ; Terminal_Punctuation # Po [3] PAHAWH HMONG SIGN VOS THOM..PAHAWH HMONG SIGN CIM CHEEM
-16B44 ; Terminal_Punctuation # Po PAHAWH HMONG SIGN XAUS
-16E97..16E98 ; Terminal_Punctuation # Po [2] MEDEFAIDRIN COMMA..MEDEFAIDRIN FULL STOP
-1BC9F ; Terminal_Punctuation # Po DUPLOYAN PUNCTUATION CHINOOK FULL STOP
-1DA87..1DA8A ; Terminal_Punctuation # Po [4] SIGNWRITING COMMA..SIGNWRITING COLON
-
-# Total code points: 278
-
-# ================================================
-
-005E ; Other_Math # Sk CIRCUMFLEX ACCENT
-03D0..03D2 ; Other_Math # L& [3] GREEK BETA SYMBOL..GREEK UPSILON WITH HOOK SYMBOL
-03D5 ; Other_Math # L& GREEK PHI SYMBOL
-03F0..03F1 ; Other_Math # L& [2] GREEK KAPPA SYMBOL..GREEK RHO SYMBOL
-03F4..03F5 ; Other_Math # L& [2] GREEK CAPITAL THETA SYMBOL..GREEK LUNATE EPSILON SYMBOL
-2016 ; Other_Math # Po DOUBLE VERTICAL LINE
-2032..2034 ; Other_Math # Po [3] PRIME..TRIPLE PRIME
-2040 ; Other_Math # Pc CHARACTER TIE
-2061..2064 ; Other_Math # Cf [4] FUNCTION APPLICATION..INVISIBLE PLUS
-207D ; Other_Math # Ps SUPERSCRIPT LEFT PARENTHESIS
-207E ; Other_Math # Pe SUPERSCRIPT RIGHT PARENTHESIS
-208D ; Other_Math # Ps SUBSCRIPT LEFT PARENTHESIS
-208E ; Other_Math # Pe SUBSCRIPT RIGHT PARENTHESIS
-20D0..20DC ; Other_Math # Mn [13] COMBINING LEFT HARPOON ABOVE..COMBINING FOUR DOTS ABOVE
-20E1 ; Other_Math # Mn COMBINING LEFT RIGHT ARROW ABOVE
-20E5..20E6 ; Other_Math # Mn [2] COMBINING REVERSE SOLIDUS OVERLAY..COMBINING DOUBLE VERTICAL STROKE OVERLAY
-20EB..20EF ; Other_Math # Mn [5] COMBINING LONG DOUBLE SOLIDUS OVERLAY..COMBINING RIGHT ARROW BELOW
-2102 ; Other_Math # L& DOUBLE-STRUCK CAPITAL C
-2107 ; Other_Math # L& EULER CONSTANT
-210A..2113 ; Other_Math # L& [10] SCRIPT SMALL G..SCRIPT SMALL L
-2115 ; Other_Math # L& DOUBLE-STRUCK CAPITAL N
-2119..211D ; Other_Math # L& [5] DOUBLE-STRUCK CAPITAL P..DOUBLE-STRUCK CAPITAL R
-2124 ; Other_Math # L& DOUBLE-STRUCK CAPITAL Z
-2128 ; Other_Math # L& BLACK-LETTER CAPITAL Z
-2129 ; Other_Math # So TURNED GREEK SMALL LETTER IOTA
-212C..212D ; Other_Math # L& [2] SCRIPT CAPITAL B..BLACK-LETTER CAPITAL C
-212F..2131 ; Other_Math # L& [3] SCRIPT SMALL E..SCRIPT CAPITAL F
-2133..2134 ; Other_Math # L& [2] SCRIPT CAPITAL M..SCRIPT SMALL O
-2135..2138 ; Other_Math # Lo [4] ALEF SYMBOL..DALET SYMBOL
-213C..213F ; Other_Math # L& [4] DOUBLE-STRUCK SMALL PI..DOUBLE-STRUCK CAPITAL PI
-2145..2149 ; Other_Math # L& [5] DOUBLE-STRUCK ITALIC CAPITAL D..DOUBLE-STRUCK ITALIC SMALL J
-2195..2199 ; Other_Math # So [5] UP DOWN ARROW..SOUTH WEST ARROW
-219C..219F ; Other_Math # So [4] LEFTWARDS WAVE ARROW..UPWARDS TWO HEADED ARROW
-21A1..21A2 ; Other_Math # So [2] DOWNWARDS TWO HEADED ARROW..LEFTWARDS ARROW WITH TAIL
-21A4..21A5 ; Other_Math # So [2] LEFTWARDS ARROW FROM BAR..UPWARDS ARROW FROM BAR
-21A7 ; Other_Math # So DOWNWARDS ARROW FROM BAR
-21A9..21AD ; Other_Math # So [5] LEFTWARDS ARROW WITH HOOK..LEFT RIGHT WAVE ARROW
-21B0..21B1 ; Other_Math # So [2] UPWARDS ARROW WITH TIP LEFTWARDS..UPWARDS ARROW WITH TIP RIGHTWARDS
-21B6..21B7 ; Other_Math # So [2] ANTICLOCKWISE TOP SEMICIRCLE ARROW..CLOCKWISE TOP SEMICIRCLE ARROW
-21BC..21CD ; Other_Math # So [18] LEFTWARDS HARPOON WITH BARB UPWARDS..LEFTWARDS DOUBLE ARROW WITH STROKE
-21D0..21D1 ; Other_Math # So [2] LEFTWARDS DOUBLE ARROW..UPWARDS DOUBLE ARROW
-21D3 ; Other_Math # So DOWNWARDS DOUBLE ARROW
-21D5..21DB ; Other_Math # So [7] UP DOWN DOUBLE ARROW..RIGHTWARDS TRIPLE ARROW
-21DD ; Other_Math # So RIGHTWARDS SQUIGGLE ARROW
-21E4..21E5 ; Other_Math # So [2] LEFTWARDS ARROW TO BAR..RIGHTWARDS ARROW TO BAR
-2308 ; Other_Math # Ps LEFT CEILING
-2309 ; Other_Math # Pe RIGHT CEILING
-230A ; Other_Math # Ps LEFT FLOOR
-230B ; Other_Math # Pe RIGHT FLOOR
-23B4..23B5 ; Other_Math # So [2] TOP SQUARE BRACKET..BOTTOM SQUARE BRACKET
-23B7 ; Other_Math # So RADICAL SYMBOL BOTTOM
-23D0 ; Other_Math # So VERTICAL LINE EXTENSION
-23E2 ; Other_Math # So WHITE TRAPEZIUM
-25A0..25A1 ; Other_Math # So [2] BLACK SQUARE..WHITE SQUARE
-25AE..25B6 ; Other_Math # So [9] BLACK VERTICAL RECTANGLE..BLACK RIGHT-POINTING TRIANGLE
-25BC..25C0 ; Other_Math # So [5] BLACK DOWN-POINTING TRIANGLE..BLACK LEFT-POINTING TRIANGLE
-25C6..25C7 ; Other_Math # So [2] BLACK DIAMOND..WHITE DIAMOND
-25CA..25CB ; Other_Math # So [2] LOZENGE..WHITE CIRCLE
-25CF..25D3 ; Other_Math # So [5] BLACK CIRCLE..CIRCLE WITH UPPER HALF BLACK
-25E2 ; Other_Math # So BLACK LOWER RIGHT TRIANGLE
-25E4 ; Other_Math # So BLACK UPPER LEFT TRIANGLE
-25E7..25EC ; Other_Math # So [6] SQUARE WITH LEFT HALF BLACK..WHITE UP-POINTING TRIANGLE WITH DOT
-2605..2606 ; Other_Math # So [2] BLACK STAR..WHITE STAR
-2640 ; Other_Math # So FEMALE SIGN
-2642 ; Other_Math # So MALE SIGN
-2660..2663 ; Other_Math # So [4] BLACK SPADE SUIT..BLACK CLUB SUIT
-266D..266E ; Other_Math # So [2] MUSIC FLAT SIGN..MUSIC NATURAL SIGN
-27C5 ; Other_Math # Ps LEFT S-SHAPED BAG DELIMITER
-27C6 ; Other_Math # Pe RIGHT S-SHAPED BAG DELIMITER
-27E6 ; Other_Math # Ps MATHEMATICAL LEFT WHITE SQUARE BRACKET
-27E7 ; Other_Math # Pe MATHEMATICAL RIGHT WHITE SQUARE BRACKET
-27E8 ; Other_Math # Ps MATHEMATICAL LEFT ANGLE BRACKET
-27E9 ; Other_Math # Pe MATHEMATICAL RIGHT ANGLE BRACKET
-27EA ; Other_Math # Ps MATHEMATICAL LEFT DOUBLE ANGLE BRACKET
-27EB ; Other_Math # Pe MATHEMATICAL RIGHT DOUBLE ANGLE BRACKET
-27EC ; Other_Math # Ps MATHEMATICAL LEFT WHITE TORTOISE SHELL BRACKET
-27ED ; Other_Math # Pe MATHEMATICAL RIGHT WHITE TORTOISE SHELL BRACKET
-27EE ; Other_Math # Ps MATHEMATICAL LEFT FLATTENED PARENTHESIS
-27EF ; Other_Math # Pe MATHEMATICAL RIGHT FLATTENED PARENTHESIS
-2983 ; Other_Math # Ps LEFT WHITE CURLY BRACKET
-2984 ; Other_Math # Pe RIGHT WHITE CURLY BRACKET
-2985 ; Other_Math # Ps LEFT WHITE PARENTHESIS
-2986 ; Other_Math # Pe RIGHT WHITE PARENTHESIS
-2987 ; Other_Math # Ps Z NOTATION LEFT IMAGE BRACKET
-2988 ; Other_Math # Pe Z NOTATION RIGHT IMAGE BRACKET
-2989 ; Other_Math # Ps Z NOTATION LEFT BINDING BRACKET
-298A ; Other_Math # Pe Z NOTATION RIGHT BINDING BRACKET
-298B ; Other_Math # Ps LEFT SQUARE BRACKET WITH UNDERBAR
-298C ; Other_Math # Pe RIGHT SQUARE BRACKET WITH UNDERBAR
-298D ; Other_Math # Ps LEFT SQUARE BRACKET WITH TICK IN TOP CORNER
-298E ; Other_Math # Pe RIGHT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
-298F ; Other_Math # Ps LEFT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
-2990 ; Other_Math # Pe RIGHT SQUARE BRACKET WITH TICK IN TOP CORNER
-2991 ; Other_Math # Ps LEFT ANGLE BRACKET WITH DOT
-2992 ; Other_Math # Pe RIGHT ANGLE BRACKET WITH DOT
-2993 ; Other_Math # Ps LEFT ARC LESS-THAN BRACKET
-2994 ; Other_Math # Pe RIGHT ARC GREATER-THAN BRACKET
-2995 ; Other_Math # Ps DOUBLE LEFT ARC GREATER-THAN BRACKET
-2996 ; Other_Math # Pe DOUBLE RIGHT ARC LESS-THAN BRACKET
-2997 ; Other_Math # Ps LEFT BLACK TORTOISE SHELL BRACKET
-2998 ; Other_Math # Pe RIGHT BLACK TORTOISE SHELL BRACKET
-29D8 ; Other_Math # Ps LEFT WIGGLY FENCE
-29D9 ; Other_Math # Pe RIGHT WIGGLY FENCE
-29DA ; Other_Math # Ps LEFT DOUBLE WIGGLY FENCE
-29DB ; Other_Math # Pe RIGHT DOUBLE WIGGLY FENCE
-29FC ; Other_Math # Ps LEFT-POINTING CURVED ANGLE BRACKET
-29FD ; Other_Math # Pe RIGHT-POINTING CURVED ANGLE BRACKET
-FE61 ; Other_Math # Po SMALL ASTERISK
-FE63 ; Other_Math # Pd SMALL HYPHEN-MINUS
-FE68 ; Other_Math # Po SMALL REVERSE SOLIDUS
-FF3C ; Other_Math # Po FULLWIDTH REVERSE SOLIDUS
-FF3E ; Other_Math # Sk FULLWIDTH CIRCUMFLEX ACCENT
-1D400..1D454 ; Other_Math # L& [85] MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL ITALIC SMALL G
-1D456..1D49C ; Other_Math # L& [71] MATHEMATICAL ITALIC SMALL I..MATHEMATICAL SCRIPT CAPITAL A
-1D49E..1D49F ; Other_Math # L& [2] MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL SCRIPT CAPITAL D
-1D4A2 ; Other_Math # L& MATHEMATICAL SCRIPT CAPITAL G
-1D4A5..1D4A6 ; Other_Math # L& [2] MATHEMATICAL SCRIPT CAPITAL J..MATHEMATICAL SCRIPT CAPITAL K
-1D4A9..1D4AC ; Other_Math # L& [4] MATHEMATICAL SCRIPT CAPITAL N..MATHEMATICAL SCRIPT CAPITAL Q
-1D4AE..1D4B9 ; Other_Math # L& [12] MATHEMATICAL SCRIPT CAPITAL S..MATHEMATICAL SCRIPT SMALL D
-1D4BB ; Other_Math # L& MATHEMATICAL SCRIPT SMALL F
-1D4BD..1D4C3 ; Other_Math # L& [7] MATHEMATICAL SCRIPT SMALL H..MATHEMATICAL SCRIPT SMALL N
-1D4C5..1D505 ; Other_Math # L& [65] MATHEMATICAL SCRIPT SMALL P..MATHEMATICAL FRAKTUR CAPITAL B
-1D507..1D50A ; Other_Math # L& [4] MATHEMATICAL FRAKTUR CAPITAL D..MATHEMATICAL FRAKTUR CAPITAL G
-1D50D..1D514 ; Other_Math # L& [8] MATHEMATICAL FRAKTUR CAPITAL J..MATHEMATICAL FRAKTUR CAPITAL Q
-1D516..1D51C ; Other_Math # L& [7] MATHEMATICAL FRAKTUR CAPITAL S..MATHEMATICAL FRAKTUR CAPITAL Y
-1D51E..1D539 ; Other_Math # L& [28] MATHEMATICAL FRAKTUR SMALL A..MATHEMATICAL DOUBLE-STRUCK CAPITAL B
-1D53B..1D53E ; Other_Math # L& [4] MATHEMATICAL DOUBLE-STRUCK CAPITAL D..MATHEMATICAL DOUBLE-STRUCK CAPITAL G
-1D540..1D544 ; Other_Math # L& [5] MATHEMATICAL DOUBLE-STRUCK CAPITAL I..MATHEMATICAL DOUBLE-STRUCK CAPITAL M
-1D546 ; Other_Math # L& MATHEMATICAL DOUBLE-STRUCK CAPITAL O
-1D54A..1D550 ; Other_Math # L& [7] MATHEMATICAL DOUBLE-STRUCK CAPITAL S..MATHEMATICAL DOUBLE-STRUCK CAPITAL Y
-1D552..1D6A5 ; Other_Math # L& [340] MATHEMATICAL DOUBLE-STRUCK SMALL A..MATHEMATICAL ITALIC SMALL DOTLESS J
-1D6A8..1D6C0 ; Other_Math # L& [25] MATHEMATICAL BOLD CAPITAL ALPHA..MATHEMATICAL BOLD CAPITAL OMEGA
-1D6C2..1D6DA ; Other_Math # L& [25] MATHEMATICAL BOLD SMALL ALPHA..MATHEMATICAL BOLD SMALL OMEGA
-1D6DC..1D6FA ; Other_Math # L& [31] MATHEMATICAL BOLD EPSILON SYMBOL..MATHEMATICAL ITALIC CAPITAL OMEGA
-1D6FC..1D714 ; Other_Math # L& [25] MATHEMATICAL ITALIC SMALL ALPHA..MATHEMATICAL ITALIC SMALL OMEGA
-1D716..1D734 ; Other_Math # L& [31] MATHEMATICAL ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD ITALIC CAPITAL OMEGA
-1D736..1D74E ; Other_Math # L& [25] MATHEMATICAL BOLD ITALIC SMALL ALPHA..MATHEMATICAL BOLD ITALIC SMALL OMEGA
-1D750..1D76E ; Other_Math # L& [31] MATHEMATICAL BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL SANS-SERIF BOLD CAPITAL OMEGA
-1D770..1D788 ; Other_Math # L& [25] MATHEMATICAL SANS-SERIF BOLD SMALL ALPHA..MATHEMATICAL SANS-SERIF BOLD SMALL OMEGA
-1D78A..1D7A8 ; Other_Math # L& [31] MATHEMATICAL SANS-SERIF BOLD EPSILON SYMBOL..MATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL OMEGA
-1D7AA..1D7C2 ; Other_Math # L& [25] MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL ALPHA..MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL OMEGA
-1D7C4..1D7CB ; Other_Math # L& [8] MATHEMATICAL SANS-SERIF BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD SMALL DIGAMMA
-1D7CE..1D7FF ; Other_Math # Nd [50] MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL MONOSPACE DIGIT NINE
-1EE00..1EE03 ; Other_Math # Lo [4] ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL DAL
-1EE05..1EE1F ; Other_Math # Lo [27] ARABIC MATHEMATICAL WAW..ARABIC MATHEMATICAL DOTLESS QAF
-1EE21..1EE22 ; Other_Math # Lo [2] ARABIC MATHEMATICAL INITIAL BEH..ARABIC MATHEMATICAL INITIAL JEEM
-1EE24 ; Other_Math # Lo ARABIC MATHEMATICAL INITIAL HEH
-1EE27 ; Other_Math # Lo ARABIC MATHEMATICAL INITIAL HAH
-1EE29..1EE32 ; Other_Math # Lo [10] ARABIC MATHEMATICAL INITIAL YEH..ARABIC MATHEMATICAL INITIAL QAF
-1EE34..1EE37 ; Other_Math # Lo [4] ARABIC MATHEMATICAL INITIAL SHEEN..ARABIC MATHEMATICAL INITIAL KHAH
-1EE39 ; Other_Math # Lo ARABIC MATHEMATICAL INITIAL DAD
-1EE3B ; Other_Math # Lo ARABIC MATHEMATICAL INITIAL GHAIN
-1EE42 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED JEEM
-1EE47 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED HAH
-1EE49 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED YEH
-1EE4B ; Other_Math # Lo ARABIC MATHEMATICAL TAILED LAM
-1EE4D..1EE4F ; Other_Math # Lo [3] ARABIC MATHEMATICAL TAILED NOON..ARABIC MATHEMATICAL TAILED AIN
-1EE51..1EE52 ; Other_Math # Lo [2] ARABIC MATHEMATICAL TAILED SAD..ARABIC MATHEMATICAL TAILED QAF
-1EE54 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED SHEEN
-1EE57 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED KHAH
-1EE59 ; Other_Math # Lo ARABIC MATHEMATICAL TAILED DAD
-1EE5B ; Other_Math # Lo ARABIC MATHEMATICAL TAILED GHAIN
-1EE5D ; Other_Math # Lo ARABIC MATHEMATICAL TAILED DOTLESS NOON
-1EE5F ; Other_Math # Lo ARABIC MATHEMATICAL TAILED DOTLESS QAF
-1EE61..1EE62 ; Other_Math # Lo [2] ARABIC MATHEMATICAL STRETCHED BEH..ARABIC MATHEMATICAL STRETCHED JEEM
-1EE64 ; Other_Math # Lo ARABIC MATHEMATICAL STRETCHED HEH
-1EE67..1EE6A ; Other_Math # Lo [4] ARABIC MATHEMATICAL STRETCHED HAH..ARABIC MATHEMATICAL STRETCHED KAF
-1EE6C..1EE72 ; Other_Math # Lo [7] ARABIC MATHEMATICAL STRETCHED MEEM..ARABIC MATHEMATICAL STRETCHED QAF
-1EE74..1EE77 ; Other_Math # Lo [4] ARABIC MATHEMATICAL STRETCHED SHEEN..ARABIC MATHEMATICAL STRETCHED KHAH
-1EE79..1EE7C ; Other_Math # Lo [4] ARABIC MATHEMATICAL STRETCHED DAD..ARABIC MATHEMATICAL STRETCHED DOTLESS BEH
-1EE7E ; Other_Math # Lo ARABIC MATHEMATICAL STRETCHED DOTLESS FEH
-1EE80..1EE89 ; Other_Math # Lo [10] ARABIC MATHEMATICAL LOOPED ALEF..ARABIC MATHEMATICAL LOOPED YEH
-1EE8B..1EE9B ; Other_Math # Lo [17] ARABIC MATHEMATICAL LOOPED LAM..ARABIC MATHEMATICAL LOOPED GHAIN
-1EEA1..1EEA3 ; Other_Math # Lo [3] ARABIC MATHEMATICAL DOUBLE-STRUCK BEH..ARABIC MATHEMATICAL DOUBLE-STRUCK DAL
-1EEA5..1EEA9 ; Other_Math # Lo [5] ARABIC MATHEMATICAL DOUBLE-STRUCK WAW..ARABIC MATHEMATICAL DOUBLE-STRUCK YEH
-1EEAB..1EEBB ; Other_Math # Lo [17] ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC MATHEMATICAL DOUBLE-STRUCK GHAIN
-
-# Total code points: 1362
-
-# ================================================
-
-0030..0039 ; Hex_Digit # Nd [10] DIGIT ZERO..DIGIT NINE
-0041..0046 ; Hex_Digit # L& [6] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER F
-0061..0066 ; Hex_Digit # L& [6] LATIN SMALL LETTER A..LATIN SMALL LETTER F
-FF10..FF19 ; Hex_Digit # Nd [10] FULLWIDTH DIGIT ZERO..FULLWIDTH DIGIT NINE
-FF21..FF26 ; Hex_Digit # L& [6] FULLWIDTH LATIN CAPITAL LETTER A..FULLWIDTH LATIN CAPITAL LETTER F
-FF41..FF46 ; Hex_Digit # L& [6] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH LATIN SMALL LETTER F
-
-# Total code points: 44
-
-# ================================================
-
-0030..0039 ; ASCII_Hex_Digit # Nd [10] DIGIT ZERO..DIGIT NINE
-0041..0046 ; ASCII_Hex_Digit # L& [6] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER F
-0061..0066 ; ASCII_Hex_Digit # L& [6] LATIN SMALL LETTER A..LATIN SMALL LETTER F
-
-# Total code points: 22
-
-# ================================================
-
-0345 ; Other_Alphabetic # Mn COMBINING GREEK YPOGEGRAMMENI
-05B0..05BD ; Other_Alphabetic # Mn [14] HEBREW POINT SHEVA..HEBREW POINT METEG
-05BF ; Other_Alphabetic # Mn HEBREW POINT RAFE
-05C1..05C2 ; Other_Alphabetic # Mn [2] HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
-05C4..05C5 ; Other_Alphabetic # Mn [2] HEBREW MARK UPPER DOT..HEBREW MARK LOWER DOT
-05C7 ; Other_Alphabetic # Mn HEBREW POINT QAMATS QATAN
-0610..061A ; Other_Alphabetic # Mn [11] ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..ARABIC SMALL KASRA
-064B..0657 ; Other_Alphabetic # Mn [13] ARABIC FATHATAN..ARABIC INVERTED DAMMA
-0659..065F ; Other_Alphabetic # Mn [7] ARABIC ZWARAKAY..ARABIC WAVY HAMZA BELOW
-0670 ; Other_Alphabetic # Mn ARABIC LETTER SUPERSCRIPT ALEF
-06D6..06DC ; Other_Alphabetic # Mn [7] ARABIC SMALL HIGH LIGATURE SAD WITH LAM WITH ALEF MAKSURA..ARABIC SMALL HIGH SEEN
-06E1..06E4 ; Other_Alphabetic # Mn [4] ARABIC SMALL HIGH DOTLESS HEAD OF KHAH..ARABIC SMALL HIGH MADDA
-06E7..06E8 ; Other_Alphabetic # Mn [2] ARABIC SMALL HIGH YEH..ARABIC SMALL HIGH NOON
-06ED ; Other_Alphabetic # Mn ARABIC SMALL LOW MEEM
-0711 ; Other_Alphabetic # Mn SYRIAC LETTER SUPERSCRIPT ALAPH
-0730..073F ; Other_Alphabetic # Mn [16] SYRIAC PTHAHA ABOVE..SYRIAC RWAHA
-07A6..07B0 ; Other_Alphabetic # Mn [11] THAANA ABAFILI..THAANA SUKUN
-0816..0817 ; Other_Alphabetic # Mn [2] SAMARITAN MARK IN..SAMARITAN MARK IN-ALAF
-081B..0823 ; Other_Alphabetic # Mn [9] SAMARITAN MARK EPENTHETIC YUT..SAMARITAN VOWEL SIGN A
-0825..0827 ; Other_Alphabetic # Mn [3] SAMARITAN VOWEL SIGN SHORT A..SAMARITAN VOWEL SIGN U
-0829..082C ; Other_Alphabetic # Mn [4] SAMARITAN VOWEL SIGN LONG I..SAMARITAN VOWEL SIGN SUKUN
-08D4..08DF ; Other_Alphabetic # Mn [12] ARABIC SMALL HIGH WORD AR-RUB..ARABIC SMALL HIGH WORD WAQFA
-08E3..08E9 ; Other_Alphabetic # Mn [7] ARABIC TURNED DAMMA BELOW..ARABIC CURLY KASRATAN
-08F0..0902 ; Other_Alphabetic # Mn [19] ARABIC OPEN FATHATAN..DEVANAGARI SIGN ANUSVARA
-0903 ; Other_Alphabetic # Mc DEVANAGARI SIGN VISARGA
-093A ; Other_Alphabetic # Mn DEVANAGARI VOWEL SIGN OE
-093B ; Other_Alphabetic # Mc DEVANAGARI VOWEL SIGN OOE
-093E..0940 ; Other_Alphabetic # Mc [3] DEVANAGARI VOWEL SIGN AA..DEVANAGARI VOWEL SIGN II
-0941..0948 ; Other_Alphabetic # Mn [8] DEVANAGARI VOWEL SIGN U..DEVANAGARI VOWEL SIGN AI
-0949..094C ; Other_Alphabetic # Mc [4] DEVANAGARI VOWEL SIGN CANDRA O..DEVANAGARI VOWEL SIGN AU
-094E..094F ; Other_Alphabetic # Mc [2] DEVANAGARI VOWEL SIGN PRISHTHAMATRA E..DEVANAGARI VOWEL SIGN AW
-0955..0957 ; Other_Alphabetic # Mn [3] DEVANAGARI VOWEL SIGN CANDRA LONG E..DEVANAGARI VOWEL SIGN UUE
-0962..0963 ; Other_Alphabetic # Mn [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
-0981 ; Other_Alphabetic # Mn BENGALI SIGN CANDRABINDU
-0982..0983 ; Other_Alphabetic # Mc [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
-09BE..09C0 ; Other_Alphabetic # Mc [3] BENGALI VOWEL SIGN AA..BENGALI VOWEL SIGN II
-09C1..09C4 ; Other_Alphabetic # Mn [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
-09C7..09C8 ; Other_Alphabetic # Mc [2] BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
-09CB..09CC ; Other_Alphabetic # Mc [2] BENGALI VOWEL SIGN O..BENGALI VOWEL SIGN AU
-09D7 ; Other_Alphabetic # Mc BENGALI AU LENGTH MARK
-09E2..09E3 ; Other_Alphabetic # Mn [2] BENGALI VOWEL SIGN VOCALIC L..BENGALI VOWEL SIGN VOCALIC LL
-0A01..0A02 ; Other_Alphabetic # Mn [2] GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN BINDI
-0A03 ; Other_Alphabetic # Mc GURMUKHI SIGN VISARGA
-0A3E..0A40 ; Other_Alphabetic # Mc [3] GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN II
-0A41..0A42 ; Other_Alphabetic # Mn [2] GURMUKHI VOWEL SIGN U..GURMUKHI VOWEL SIGN UU
-0A47..0A48 ; Other_Alphabetic # Mn [2] GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN AI
-0A4B..0A4C ; Other_Alphabetic # Mn [2] GURMUKHI VOWEL SIGN OO..GURMUKHI VOWEL SIGN AU
-0A51 ; Other_Alphabetic # Mn GURMUKHI SIGN UDAAT
-0A70..0A71 ; Other_Alphabetic # Mn [2] GURMUKHI TIPPI..GURMUKHI ADDAK
-0A75 ; Other_Alphabetic # Mn GURMUKHI SIGN YAKASH
-0A81..0A82 ; Other_Alphabetic # Mn [2] GUJARATI SIGN CANDRABINDU..GUJARATI SIGN ANUSVARA
-0A83 ; Other_Alphabetic # Mc GUJARATI SIGN VISARGA
-0ABE..0AC0 ; Other_Alphabetic # Mc [3] GUJARATI VOWEL SIGN AA..GUJARATI VOWEL SIGN II
-0AC1..0AC5 ; Other_Alphabetic # Mn [5] GUJARATI VOWEL SIGN U..GUJARATI VOWEL SIGN CANDRA E
-0AC7..0AC8 ; Other_Alphabetic # Mn [2] GUJARATI VOWEL SIGN E..GUJARATI VOWEL SIGN AI
-0AC9 ; Other_Alphabetic # Mc GUJARATI VOWEL SIGN CANDRA O
-0ACB..0ACC ; Other_Alphabetic # Mc [2] GUJARATI VOWEL SIGN O..GUJARATI VOWEL SIGN AU
-0AE2..0AE3 ; Other_Alphabetic # Mn [2] GUJARATI VOWEL SIGN VOCALIC L..GUJARATI VOWEL SIGN VOCALIC LL
-0AFA..0AFC ; Other_Alphabetic # Mn [3] GUJARATI SIGN SUKUN..GUJARATI SIGN MADDAH
-0B01 ; Other_Alphabetic # Mn ORIYA SIGN CANDRABINDU
-0B02..0B03 ; Other_Alphabetic # Mc [2] ORIYA SIGN ANUSVARA..ORIYA SIGN VISARGA
-0B3E ; Other_Alphabetic # Mc ORIYA VOWEL SIGN AA
-0B3F ; Other_Alphabetic # Mn ORIYA VOWEL SIGN I
-0B40 ; Other_Alphabetic # Mc ORIYA VOWEL SIGN II
-0B41..0B44 ; Other_Alphabetic # Mn [4] ORIYA VOWEL SIGN U..ORIYA VOWEL SIGN VOCALIC RR
-0B47..0B48 ; Other_Alphabetic # Mc [2] ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
-0B4B..0B4C ; Other_Alphabetic # Mc [2] ORIYA VOWEL SIGN O..ORIYA VOWEL SIGN AU
-0B56 ; Other_Alphabetic # Mn ORIYA AI LENGTH MARK
-0B57 ; Other_Alphabetic # Mc ORIYA AU LENGTH MARK
-0B62..0B63 ; Other_Alphabetic # Mn [2] ORIYA VOWEL SIGN VOCALIC L..ORIYA VOWEL SIGN VOCALIC LL
-0B82 ; Other_Alphabetic # Mn TAMIL SIGN ANUSVARA
-0BBE..0BBF ; Other_Alphabetic # Mc [2] TAMIL VOWEL SIGN AA..TAMIL VOWEL SIGN I
-0BC0 ; Other_Alphabetic # Mn TAMIL VOWEL SIGN II
-0BC1..0BC2 ; Other_Alphabetic # Mc [2] TAMIL VOWEL SIGN U..TAMIL VOWEL SIGN UU
-0BC6..0BC8 ; Other_Alphabetic # Mc [3] TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
-0BCA..0BCC ; Other_Alphabetic # Mc [3] TAMIL VOWEL SIGN O..TAMIL VOWEL SIGN AU
-0BD7 ; Other_Alphabetic # Mc TAMIL AU LENGTH MARK
-0C00 ; Other_Alphabetic # Mn TELUGU SIGN COMBINING CANDRABINDU ABOVE
-0C01..0C03 ; Other_Alphabetic # Mc [3] TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
-0C04 ; Other_Alphabetic # Mn TELUGU SIGN COMBINING ANUSVARA ABOVE
-0C3E..0C40 ; Other_Alphabetic # Mn [3] TELUGU VOWEL SIGN AA..TELUGU VOWEL SIGN II
-0C41..0C44 ; Other_Alphabetic # Mc [4] TELUGU VOWEL SIGN U..TELUGU VOWEL SIGN VOCALIC RR
-0C46..0C48 ; Other_Alphabetic # Mn [3] TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI
-0C4A..0C4C ; Other_Alphabetic # Mn [3] TELUGU VOWEL SIGN O..TELUGU VOWEL SIGN AU
-0C55..0C56 ; Other_Alphabetic # Mn [2] TELUGU LENGTH MARK..TELUGU AI LENGTH MARK
-0C62..0C63 ; Other_Alphabetic # Mn [2] TELUGU VOWEL SIGN VOCALIC L..TELUGU VOWEL SIGN VOCALIC LL
-0C81 ; Other_Alphabetic # Mn KANNADA SIGN CANDRABINDU
-0C82..0C83 ; Other_Alphabetic # Mc [2] KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
-0CBE ; Other_Alphabetic # Mc KANNADA VOWEL SIGN AA
-0CBF ; Other_Alphabetic # Mn KANNADA VOWEL SIGN I
-0CC0..0CC4 ; Other_Alphabetic # Mc [5] KANNADA VOWEL SIGN II..KANNADA VOWEL SIGN VOCALIC RR
-0CC6 ; Other_Alphabetic # Mn KANNADA VOWEL SIGN E
-0CC7..0CC8 ; Other_Alphabetic # Mc [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI
-0CCA..0CCB ; Other_Alphabetic # Mc [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO
-0CCC ; Other_Alphabetic # Mn KANNADA VOWEL SIGN AU
-0CD5..0CD6 ; Other_Alphabetic # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
-0CE2..0CE3 ; Other_Alphabetic # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
-0CF3 ; Other_Alphabetic # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
-0D00..0D01 ; Other_Alphabetic # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
-0D02..0D03 ; Other_Alphabetic # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
-0D3E..0D40 ; Other_Alphabetic # Mc [3] MALAYALAM VOWEL SIGN AA..MALAYALAM VOWEL SIGN II
-0D41..0D44 ; Other_Alphabetic # Mn [4] MALAYALAM VOWEL SIGN U..MALAYALAM VOWEL SIGN VOCALIC RR
-0D46..0D48 ; Other_Alphabetic # Mc [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI
-0D4A..0D4C ; Other_Alphabetic # Mc [3] MALAYALAM VOWEL SIGN O..MALAYALAM VOWEL SIGN AU
-0D57 ; Other_Alphabetic # Mc MALAYALAM AU LENGTH MARK
-0D62..0D63 ; Other_Alphabetic # Mn [2] MALAYALAM VOWEL SIGN VOCALIC L..MALAYALAM VOWEL SIGN VOCALIC LL
-0D81 ; Other_Alphabetic # Mn SINHALA SIGN CANDRABINDU
-0D82..0D83 ; Other_Alphabetic # Mc [2] SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARGAYA
-0DCF..0DD1 ; Other_Alphabetic # Mc [3] SINHALA VOWEL SIGN AELA-PILLA..SINHALA VOWEL SIGN DIGA AEDA-PILLA
-0DD2..0DD4 ; Other_Alphabetic # Mn [3] SINHALA VOWEL SIGN KETTI IS-PILLA..SINHALA VOWEL SIGN KETTI PAA-PILLA
-0DD6 ; Other_Alphabetic # Mn SINHALA VOWEL SIGN DIGA PAA-PILLA
-0DD8..0DDF ; Other_Alphabetic # Mc [8] SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOWEL SIGN GAYANUKITTA
-0DF2..0DF3 ; Other_Alphabetic # Mc [2] SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHALA VOWEL SIGN DIGA GAYANUKITTA
-0E31 ; Other_Alphabetic # Mn THAI CHARACTER MAI HAN-AKAT
-0E34..0E3A ; Other_Alphabetic # Mn [7] THAI CHARACTER SARA I..THAI CHARACTER PHINTHU
-0E4D ; Other_Alphabetic # Mn THAI CHARACTER NIKHAHIT
-0EB1 ; Other_Alphabetic # Mn LAO VOWEL SIGN MAI KAN
-0EB4..0EB9 ; Other_Alphabetic # Mn [6] LAO VOWEL SIGN I..LAO VOWEL SIGN UU
-0EBB..0EBC ; Other_Alphabetic # Mn [2] LAO VOWEL SIGN MAI KON..LAO SEMIVOWEL SIGN LO
-0ECD ; Other_Alphabetic # Mn LAO NIGGAHITA
-0F71..0F7E ; Other_Alphabetic # Mn [14] TIBETAN VOWEL SIGN AA..TIBETAN SIGN RJES SU NGA RO
-0F7F ; Other_Alphabetic # Mc TIBETAN SIGN RNAM BCAD
-0F80..0F83 ; Other_Alphabetic # Mn [4] TIBETAN VOWEL SIGN REVERSED I..TIBETAN SIGN SNA LDAN
-0F8D..0F97 ; Other_Alphabetic # Mn [11] TIBETAN SUBJOINED SIGN LCE TSA CAN..TIBETAN SUBJOINED LETTER JA
-0F99..0FBC ; Other_Alphabetic # Mn [36] TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOINED LETTER FIXED-FORM RA
-102B..102C ; Other_Alphabetic # Mc [2] MYANMAR VOWEL SIGN TALL AA..MYANMAR VOWEL SIGN AA
-102D..1030 ; Other_Alphabetic # Mn [4] MYANMAR VOWEL SIGN I..MYANMAR VOWEL SIGN UU
-1031 ; Other_Alphabetic # Mc MYANMAR VOWEL SIGN E
-1032..1036 ; Other_Alphabetic # Mn [5] MYANMAR VOWEL SIGN AI..MYANMAR SIGN ANUSVARA
-1038 ; Other_Alphabetic # Mc MYANMAR SIGN VISARGA
-103B..103C ; Other_Alphabetic # Mc [2] MYANMAR CONSONANT SIGN MEDIAL YA..MYANMAR CONSONANT SIGN MEDIAL RA
-103D..103E ; Other_Alphabetic # Mn [2] MYANMAR CONSONANT SIGN MEDIAL WA..MYANMAR CONSONANT SIGN MEDIAL HA
-1056..1057 ; Other_Alphabetic # Mc [2] MYANMAR VOWEL SIGN VOCALIC R..MYANMAR VOWEL SIGN VOCALIC RR
-1058..1059 ; Other_Alphabetic # Mn [2] MYANMAR VOWEL SIGN VOCALIC L..MYANMAR VOWEL SIGN VOCALIC LL
-105E..1060 ; Other_Alphabetic # Mn [3] MYANMAR CONSONANT SIGN MON MEDIAL NA..MYANMAR CONSONANT SIGN MON MEDIAL LA
-1062..1064 ; Other_Alphabetic # Mc [3] MYANMAR VOWEL SIGN SGAW KAREN EU..MYANMAR TONE MARK SGAW KAREN KE PHO
-1067..106D ; Other_Alphabetic # Mc [7] MYANMAR VOWEL SIGN WESTERN PWO KAREN EU..MYANMAR SIGN WESTERN PWO KAREN TONE-5
-1071..1074 ; Other_Alphabetic # Mn [4] MYANMAR VOWEL SIGN GEBA KAREN I..MYANMAR VOWEL SIGN KAYAH EE
-1082 ; Other_Alphabetic # Mn MYANMAR CONSONANT SIGN SHAN MEDIAL WA
-1083..1084 ; Other_Alphabetic # Mc [2] MYANMAR VOWEL SIGN SHAN AA..MYANMAR VOWEL SIGN SHAN E
-1085..1086 ; Other_Alphabetic # Mn [2] MYANMAR VOWEL SIGN SHAN E ABOVE..MYANMAR VOWEL SIGN SHAN FINAL Y
-1087..108C ; Other_Alphabetic # Mc [6] MYANMAR SIGN SHAN TONE-2..MYANMAR SIGN SHAN COUNCIL TONE-3
-108D ; Other_Alphabetic # Mn MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE
-108F ; Other_Alphabetic # Mc MYANMAR SIGN RUMAI PALAUNG TONE-5
-109A..109C ; Other_Alphabetic # Mc [3] MYANMAR SIGN KHAMTI TONE-1..MYANMAR VOWEL SIGN AITON A
-109D ; Other_Alphabetic # Mn MYANMAR VOWEL SIGN AITON AI
-1712..1713 ; Other_Alphabetic # Mn [2] TAGALOG VOWEL SIGN I..TAGALOG VOWEL SIGN U
-1732..1733 ; Other_Alphabetic # Mn [2] HANUNOO VOWEL SIGN I..HANUNOO VOWEL SIGN U
-1752..1753 ; Other_Alphabetic # Mn [2] BUHID VOWEL SIGN I..BUHID VOWEL SIGN U
-1772..1773 ; Other_Alphabetic # Mn [2] TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U
-17B6 ; Other_Alphabetic # Mc KHMER VOWEL SIGN AA
-17B7..17BD ; Other_Alphabetic # Mn [7] KHMER VOWEL SIGN I..KHMER VOWEL SIGN UA
-17BE..17C5 ; Other_Alphabetic # Mc [8] KHMER VOWEL SIGN OE..KHMER VOWEL SIGN AU
-17C6 ; Other_Alphabetic # Mn KHMER SIGN NIKAHIT
-17C7..17C8 ; Other_Alphabetic # Mc [2] KHMER SIGN REAHMUK..KHMER SIGN YUUKALEAPINTU
-1885..1886 ; Other_Alphabetic # Mn [2] MONGOLIAN LETTER ALI GALI BALUDA..MONGOLIAN LETTER ALI GALI THREE BALUDA
-18A9 ; Other_Alphabetic # Mn MONGOLIAN LETTER ALI GALI DAGALGA
-1920..1922 ; Other_Alphabetic # Mn [3] LIMBU VOWEL SIGN A..LIMBU VOWEL SIGN U
-1923..1926 ; Other_Alphabetic # Mc [4] LIMBU VOWEL SIGN EE..LIMBU VOWEL SIGN AU
-1927..1928 ; Other_Alphabetic # Mn [2] LIMBU VOWEL SIGN E..LIMBU VOWEL SIGN O
-1929..192B ; Other_Alphabetic # Mc [3] LIMBU SUBJOINED LETTER YA..LIMBU SUBJOINED LETTER WA
-1930..1931 ; Other_Alphabetic # Mc [2] LIMBU SMALL LETTER KA..LIMBU SMALL LETTER NGA
-1932 ; Other_Alphabetic # Mn LIMBU SMALL LETTER ANUSVARA
-1933..1938 ; Other_Alphabetic # Mc [6] LIMBU SMALL LETTER TA..LIMBU SMALL LETTER LA
-1A17..1A18 ; Other_Alphabetic # Mn [2] BUGINESE VOWEL SIGN I..BUGINESE VOWEL SIGN U
-1A19..1A1A ; Other_Alphabetic # Mc [2] BUGINESE VOWEL SIGN E..BUGINESE VOWEL SIGN O
-1A1B ; Other_Alphabetic # Mn BUGINESE VOWEL SIGN AE
-1A55 ; Other_Alphabetic # Mc TAI THAM CONSONANT SIGN MEDIAL RA
-1A56 ; Other_Alphabetic # Mn TAI THAM CONSONANT SIGN MEDIAL LA
-1A57 ; Other_Alphabetic # Mc TAI THAM CONSONANT SIGN LA TANG LAI
-1A58..1A5E ; Other_Alphabetic # Mn [7] TAI THAM SIGN MAI KANG LAI..TAI THAM CONSONANT SIGN SA
-1A61 ; Other_Alphabetic # Mc TAI THAM VOWEL SIGN A
-1A62 ; Other_Alphabetic # Mn TAI THAM VOWEL SIGN MAI SAT
-1A63..1A64 ; Other_Alphabetic # Mc [2] TAI THAM VOWEL SIGN AA..TAI THAM VOWEL SIGN TALL AA
-1A65..1A6C ; Other_Alphabetic # Mn [8] TAI THAM VOWEL SIGN I..TAI THAM VOWEL SIGN OA BELOW
-1A6D..1A72 ; Other_Alphabetic # Mc [6] TAI THAM VOWEL SIGN OY..TAI THAM VOWEL SIGN THAM AI
-1A73..1A74 ; Other_Alphabetic # Mn [2] TAI THAM VOWEL SIGN OA ABOVE..TAI THAM SIGN MAI KANG
-1ABF..1AC0 ; Other_Alphabetic # Mn [2] COMBINING LATIN SMALL LETTER W BELOW..COMBINING LATIN SMALL LETTER TURNED W BELOW
-1ACC..1ACE ; Other_Alphabetic # Mn [3] COMBINING LATIN SMALL LETTER INSULAR G..COMBINING LATIN SMALL LETTER INSULAR T
-1B00..1B03 ; Other_Alphabetic # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
-1B04 ; Other_Alphabetic # Mc BALINESE SIGN BISAH
-1B35 ; Other_Alphabetic # Mc BALINESE VOWEL SIGN TEDUNG
-1B36..1B3A ; Other_Alphabetic # Mn [5] BALINESE VOWEL SIGN ULU..BALINESE VOWEL SIGN RA REPA
-1B3B ; Other_Alphabetic # Mc BALINESE VOWEL SIGN RA REPA TEDUNG
-1B3C ; Other_Alphabetic # Mn BALINESE VOWEL SIGN LA LENGA
-1B3D..1B41 ; Other_Alphabetic # Mc [5] BALINESE VOWEL SIGN LA LENGA TEDUNG..BALINESE VOWEL SIGN TALING REPA TEDUNG
-1B42 ; Other_Alphabetic # Mn BALINESE VOWEL SIGN PEPET
-1B43 ; Other_Alphabetic # Mc BALINESE VOWEL SIGN PEPET TEDUNG
-1B80..1B81 ; Other_Alphabetic # Mn [2] SUNDANESE SIGN PANYECEK..SUNDANESE SIGN PANGLAYAR
-1B82 ; Other_Alphabetic # Mc SUNDANESE SIGN PANGWISAD
-1BA1 ; Other_Alphabetic # Mc SUNDANESE CONSONANT SIGN PAMINGKAL
-1BA2..1BA5 ; Other_Alphabetic # Mn [4] SUNDANESE CONSONANT SIGN PANYAKRA..SUNDANESE VOWEL SIGN PANYUKU
-1BA6..1BA7 ; Other_Alphabetic # Mc [2] SUNDANESE VOWEL SIGN PANAELAENG..SUNDANESE VOWEL SIGN PANOLONG
-1BA8..1BA9 ; Other_Alphabetic # Mn [2] SUNDANESE VOWEL SIGN PAMEPET..SUNDANESE VOWEL SIGN PANEULEUNG
-1BAC..1BAD ; Other_Alphabetic # Mn [2] SUNDANESE CONSONANT SIGN PASANGAN MA..SUNDANESE CONSONANT SIGN PASANGAN WA
-1BE7 ; Other_Alphabetic # Mc BATAK VOWEL SIGN E
-1BE8..1BE9 ; Other_Alphabetic # Mn [2] BATAK VOWEL SIGN PAKPAK E..BATAK VOWEL SIGN EE
-1BEA..1BEC ; Other_Alphabetic # Mc [3] BATAK VOWEL SIGN I..BATAK VOWEL SIGN O
-1BED ; Other_Alphabetic # Mn BATAK VOWEL SIGN KARO O
-1BEE ; Other_Alphabetic # Mc BATAK VOWEL SIGN U
-1BEF..1BF1 ; Other_Alphabetic # Mn [3] BATAK VOWEL SIGN U FOR SIMALUNGUN SA..BATAK CONSONANT SIGN H
-1C24..1C2B ; Other_Alphabetic # Mc [8] LEPCHA SUBJOINED LETTER YA..LEPCHA VOWEL SIGN UU
-1C2C..1C33 ; Other_Alphabetic # Mn [8] LEPCHA VOWEL SIGN E..LEPCHA CONSONANT SIGN T
-1C34..1C35 ; Other_Alphabetic # Mc [2] LEPCHA CONSONANT SIGN NYIN-DO..LEPCHA CONSONANT SIGN KANG
-1C36 ; Other_Alphabetic # Mn LEPCHA SIGN RAN
-1DE7..1DF4 ; Other_Alphabetic # Mn [14] COMBINING LATIN SMALL LETTER ALPHA..COMBINING LATIN SMALL LETTER U WITH DIAERESIS
-24B6..24E9 ; Other_Alphabetic # So [52] CIRCLED LATIN CAPITAL LETTER A..CIRCLED LATIN SMALL LETTER Z
-2DE0..2DFF ; Other_Alphabetic # Mn [32] COMBINING CYRILLIC LETTER BE..COMBINING CYRILLIC LETTER IOTIFIED BIG YUS
-A674..A67B ; Other_Alphabetic # Mn [8] COMBINING CYRILLIC LETTER UKRAINIAN IE..COMBINING CYRILLIC LETTER OMEGA
-A69E..A69F ; Other_Alphabetic # Mn [2] COMBINING CYRILLIC LETTER EF..COMBINING CYRILLIC LETTER IOTIFIED E
-A802 ; Other_Alphabetic # Mn SYLOTI NAGRI SIGN DVISVARA
-A80B ; Other_Alphabetic # Mn SYLOTI NAGRI SIGN ANUSVARA
-A823..A824 ; Other_Alphabetic # Mc [2] SYLOTI NAGRI VOWEL SIGN A..SYLOTI NAGRI VOWEL SIGN I
-A825..A826 ; Other_Alphabetic # Mn [2] SYLOTI NAGRI VOWEL SIGN U..SYLOTI NAGRI VOWEL SIGN E
-A827 ; Other_Alphabetic # Mc SYLOTI NAGRI VOWEL SIGN OO
-A880..A881 ; Other_Alphabetic # Mc [2] SAURASHTRA SIGN ANUSVARA..SAURASHTRA SIGN VISARGA
-A8B4..A8C3 ; Other_Alphabetic # Mc [16] SAURASHTRA CONSONANT SIGN HAARU..SAURASHTRA VOWEL SIGN AU
-A8C5 ; Other_Alphabetic # Mn SAURASHTRA SIGN CANDRABINDU
-A8FF ; Other_Alphabetic # Mn DEVANAGARI VOWEL SIGN AY
-A926..A92A ; Other_Alphabetic # Mn [5] KAYAH LI VOWEL UE..KAYAH LI VOWEL O
-A947..A951 ; Other_Alphabetic # Mn [11] REJANG VOWEL SIGN I..REJANG CONSONANT SIGN R
-A952 ; Other_Alphabetic # Mc REJANG CONSONANT SIGN H
-A980..A982 ; Other_Alphabetic # Mn [3] JAVANESE SIGN PANYANGGA..JAVANESE SIGN LAYAR
-A983 ; Other_Alphabetic # Mc JAVANESE SIGN WIGNYAN
-A9B4..A9B5 ; Other_Alphabetic # Mc [2] JAVANESE VOWEL SIGN TARUNG..JAVANESE VOWEL SIGN TOLONG
-A9B6..A9B9 ; Other_Alphabetic # Mn [4] JAVANESE VOWEL SIGN WULU..JAVANESE VOWEL SIGN SUKU MENDUT
-A9BA..A9BB ; Other_Alphabetic # Mc [2] JAVANESE VOWEL SIGN TALING..JAVANESE VOWEL SIGN DIRGA MURE
-A9BC..A9BD ; Other_Alphabetic # Mn [2] JAVANESE VOWEL SIGN PEPET..JAVANESE CONSONANT SIGN KERET
-A9BE..A9BF ; Other_Alphabetic # Mc [2] JAVANESE CONSONANT SIGN PENGKAL..JAVANESE CONSONANT SIGN CAKRA
-A9E5 ; Other_Alphabetic # Mn MYANMAR SIGN SHAN SAW
-AA29..AA2E ; Other_Alphabetic # Mn [6] CHAM VOWEL SIGN AA..CHAM VOWEL SIGN OE
-AA2F..AA30 ; Other_Alphabetic # Mc [2] CHAM VOWEL SIGN O..CHAM VOWEL SIGN AI
-AA31..AA32 ; Other_Alphabetic # Mn [2] CHAM VOWEL SIGN AU..CHAM VOWEL SIGN UE
-AA33..AA34 ; Other_Alphabetic # Mc [2] CHAM CONSONANT SIGN YA..CHAM CONSONANT SIGN RA
-AA35..AA36 ; Other_Alphabetic # Mn [2] CHAM CONSONANT SIGN LA..CHAM CONSONANT SIGN WA
-AA43 ; Other_Alphabetic # Mn CHAM CONSONANT SIGN FINAL NG
-AA4C ; Other_Alphabetic # Mn CHAM CONSONANT SIGN FINAL M
-AA4D ; Other_Alphabetic # Mc CHAM CONSONANT SIGN FINAL H
-AA7B ; Other_Alphabetic # Mc MYANMAR SIGN PAO KAREN TONE
-AA7C ; Other_Alphabetic # Mn MYANMAR SIGN TAI LAING TONE-2
-AA7D ; Other_Alphabetic # Mc MYANMAR SIGN TAI LAING TONE-5
-AAB0 ; Other_Alphabetic # Mn TAI VIET MAI KANG
-AAB2..AAB4 ; Other_Alphabetic # Mn [3] TAI VIET VOWEL I..TAI VIET VOWEL U
-AAB7..AAB8 ; Other_Alphabetic # Mn [2] TAI VIET MAI KHIT..TAI VIET VOWEL IA
-AABE ; Other_Alphabetic # Mn TAI VIET VOWEL AM
-AAEB ; Other_Alphabetic # Mc MEETEI MAYEK VOWEL SIGN II
-AAEC..AAED ; Other_Alphabetic # Mn [2] MEETEI MAYEK VOWEL SIGN UU..MEETEI MAYEK VOWEL SIGN AAI
-AAEE..AAEF ; Other_Alphabetic # Mc [2] MEETEI MAYEK VOWEL SIGN AU..MEETEI MAYEK VOWEL SIGN AAU
-AAF5 ; Other_Alphabetic # Mc MEETEI MAYEK VOWEL SIGN VISARGA
-ABE3..ABE4 ; Other_Alphabetic # Mc [2] MEETEI MAYEK VOWEL SIGN ONAP..MEETEI MAYEK VOWEL SIGN INAP
-ABE5 ; Other_Alphabetic # Mn MEETEI MAYEK VOWEL SIGN ANAP
-ABE6..ABE7 ; Other_Alphabetic # Mc [2] MEETEI MAYEK VOWEL SIGN YENAP..MEETEI MAYEK VOWEL SIGN SOUNAP
-ABE8 ; Other_Alphabetic # Mn MEETEI MAYEK VOWEL SIGN UNAP
-ABE9..ABEA ; Other_Alphabetic # Mc [2] MEETEI MAYEK VOWEL SIGN CHEINAP..MEETEI MAYEK VOWEL SIGN NUNG
-FB1E ; Other_Alphabetic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
-10376..1037A ; Other_Alphabetic # Mn [5] COMBINING OLD PERMIC LETTER AN..COMBINING OLD PERMIC LETTER SII
-10A01..10A03 ; Other_Alphabetic # Mn [3] KHAROSHTHI VOWEL SIGN I..KHAROSHTHI VOWEL SIGN VOCALIC R
-10A05..10A06 ; Other_Alphabetic # Mn [2] KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SIGN O
-10A0C..10A0F ; Other_Alphabetic # Mn [4] KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI SIGN VISARGA
-10D24..10D27 ; Other_Alphabetic # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
-10EAB..10EAC ; Other_Alphabetic # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
-11000 ; Other_Alphabetic # Mc BRAHMI SIGN CANDRABINDU
-11001 ; Other_Alphabetic # Mn BRAHMI SIGN ANUSVARA
-11002 ; Other_Alphabetic # Mc BRAHMI SIGN VISARGA
-11038..11045 ; Other_Alphabetic # Mn [14] BRAHMI VOWEL SIGN AA..BRAHMI VOWEL SIGN AU
-11073..11074 ; Other_Alphabetic # Mn [2] BRAHMI VOWEL SIGN OLD TAMIL SHORT E..BRAHMI VOWEL SIGN OLD TAMIL SHORT O
-11080..11081 ; Other_Alphabetic # Mn [2] KAITHI SIGN CANDRABINDU..KAITHI SIGN ANUSVARA
-11082 ; Other_Alphabetic # Mc KAITHI SIGN VISARGA
-110B0..110B2 ; Other_Alphabetic # Mc [3] KAITHI VOWEL SIGN AA..KAITHI VOWEL SIGN II
-110B3..110B6 ; Other_Alphabetic # Mn [4] KAITHI VOWEL SIGN U..KAITHI VOWEL SIGN AI
-110B7..110B8 ; Other_Alphabetic # Mc [2] KAITHI VOWEL SIGN O..KAITHI VOWEL SIGN AU
-110C2 ; Other_Alphabetic # Mn KAITHI VOWEL SIGN VOCALIC R
-11100..11102 ; Other_Alphabetic # Mn [3] CHAKMA SIGN CANDRABINDU..CHAKMA SIGN VISARGA
-11127..1112B ; Other_Alphabetic # Mn [5] CHAKMA VOWEL SIGN A..CHAKMA VOWEL SIGN UU
-1112C ; Other_Alphabetic # Mc CHAKMA VOWEL SIGN E
-1112D..11132 ; Other_Alphabetic # Mn [6] CHAKMA VOWEL SIGN AI..CHAKMA AU MARK
-11145..11146 ; Other_Alphabetic # Mc [2] CHAKMA VOWEL SIGN AA..CHAKMA VOWEL SIGN EI
-11180..11181 ; Other_Alphabetic # Mn [2] SHARADA SIGN CANDRABINDU..SHARADA SIGN ANUSVARA
-11182 ; Other_Alphabetic # Mc SHARADA SIGN VISARGA
-111B3..111B5 ; Other_Alphabetic # Mc [3] SHARADA VOWEL SIGN AA..SHARADA VOWEL SIGN II
-111B6..111BE ; Other_Alphabetic # Mn [9] SHARADA VOWEL SIGN U..SHARADA VOWEL SIGN O
-111BF ; Other_Alphabetic # Mc SHARADA VOWEL SIGN AU
-111CE ; Other_Alphabetic # Mc SHARADA VOWEL SIGN PRISHTHAMATRA E
-111CF ; Other_Alphabetic # Mn SHARADA SIGN INVERTED CANDRABINDU
-1122C..1122E ; Other_Alphabetic # Mc [3] KHOJKI VOWEL SIGN AA..KHOJKI VOWEL SIGN II
-1122F..11231 ; Other_Alphabetic # Mn [3] KHOJKI VOWEL SIGN U..KHOJKI VOWEL SIGN AI
-11232..11233 ; Other_Alphabetic # Mc [2] KHOJKI VOWEL SIGN O..KHOJKI VOWEL SIGN AU
-11234 ; Other_Alphabetic # Mn KHOJKI SIGN ANUSVARA
-11237 ; Other_Alphabetic # Mn KHOJKI SIGN SHADDA
-1123E ; Other_Alphabetic # Mn KHOJKI SIGN SUKUN
-11241 ; Other_Alphabetic # Mn KHOJKI VOWEL SIGN VOCALIC R
-112DF ; Other_Alphabetic # Mn KHUDAWADI SIGN ANUSVARA
-112E0..112E2 ; Other_Alphabetic # Mc [3] KHUDAWADI VOWEL SIGN AA..KHUDAWADI VOWEL SIGN II
-112E3..112E8 ; Other_Alphabetic # Mn [6] KHUDAWADI VOWEL SIGN U..KHUDAWADI VOWEL SIGN AU
-11300..11301 ; Other_Alphabetic # Mn [2] GRANTHA SIGN COMBINING ANUSVARA ABOVE..GRANTHA SIGN CANDRABINDU
-11302..11303 ; Other_Alphabetic # Mc [2] GRANTHA SIGN ANUSVARA..GRANTHA SIGN VISARGA
-1133E..1133F ; Other_Alphabetic # Mc [2] GRANTHA VOWEL SIGN AA..GRANTHA VOWEL SIGN I
-11340 ; Other_Alphabetic # Mn GRANTHA VOWEL SIGN II
-11341..11344 ; Other_Alphabetic # Mc [4] GRANTHA VOWEL SIGN U..GRANTHA VOWEL SIGN VOCALIC RR
-11347..11348 ; Other_Alphabetic # Mc [2] GRANTHA VOWEL SIGN EE..GRANTHA VOWEL SIGN AI
-1134B..1134C ; Other_Alphabetic # Mc [2] GRANTHA VOWEL SIGN OO..GRANTHA VOWEL SIGN AU
-11357 ; Other_Alphabetic # Mc GRANTHA AU LENGTH MARK
-11362..11363 ; Other_Alphabetic # Mc [2] GRANTHA VOWEL SIGN VOCALIC L..GRANTHA VOWEL SIGN VOCALIC LL
-11435..11437 ; Other_Alphabetic # Mc [3] NEWA VOWEL SIGN AA..NEWA VOWEL SIGN II
-11438..1143F ; Other_Alphabetic # Mn [8] NEWA VOWEL SIGN U..NEWA VOWEL SIGN AI
-11440..11441 ; Other_Alphabetic # Mc [2] NEWA VOWEL SIGN O..NEWA VOWEL SIGN AU
-11443..11444 ; Other_Alphabetic # Mn [2] NEWA SIGN CANDRABINDU..NEWA SIGN ANUSVARA
-11445 ; Other_Alphabetic # Mc NEWA SIGN VISARGA
-114B0..114B2 ; Other_Alphabetic # Mc [3] TIRHUTA VOWEL SIGN AA..TIRHUTA VOWEL SIGN II
-114B3..114B8 ; Other_Alphabetic # Mn [6] TIRHUTA VOWEL SIGN U..TIRHUTA VOWEL SIGN VOCALIC LL
-114B9 ; Other_Alphabetic # Mc TIRHUTA VOWEL SIGN E
-114BA ; Other_Alphabetic # Mn TIRHUTA VOWEL SIGN SHORT E
-114BB..114BE ; Other_Alphabetic # Mc [4] TIRHUTA VOWEL SIGN AI..TIRHUTA VOWEL SIGN AU
-114BF..114C0 ; Other_Alphabetic # Mn [2] TIRHUTA SIGN CANDRABINDU..TIRHUTA SIGN ANUSVARA
-114C1 ; Other_Alphabetic # Mc TIRHUTA SIGN VISARGA
-115AF..115B1 ; Other_Alphabetic # Mc [3] SIDDHAM VOWEL SIGN AA..SIDDHAM VOWEL SIGN II
-115B2..115B5 ; Other_Alphabetic # Mn [4] SIDDHAM VOWEL SIGN U..SIDDHAM VOWEL SIGN VOCALIC RR
-115B8..115BB ; Other_Alphabetic # Mc [4] SIDDHAM VOWEL SIGN E..SIDDHAM VOWEL SIGN AU
-115BC..115BD ; Other_Alphabetic # Mn [2] SIDDHAM SIGN CANDRABINDU..SIDDHAM SIGN ANUSVARA
-115BE ; Other_Alphabetic # Mc SIDDHAM SIGN VISARGA
-115DC..115DD ; Other_Alphabetic # Mn [2] SIDDHAM VOWEL SIGN ALTERNATE U..SIDDHAM VOWEL SIGN ALTERNATE UU
-11630..11632 ; Other_Alphabetic # Mc [3] MODI VOWEL SIGN AA..MODI VOWEL SIGN II
-11633..1163A ; Other_Alphabetic # Mn [8] MODI VOWEL SIGN U..MODI VOWEL SIGN AI
-1163B..1163C ; Other_Alphabetic # Mc [2] MODI VOWEL SIGN O..MODI VOWEL SIGN AU
-1163D ; Other_Alphabetic # Mn MODI SIGN ANUSVARA
-1163E ; Other_Alphabetic # Mc MODI SIGN VISARGA
-11640 ; Other_Alphabetic # Mn MODI SIGN ARDHACANDRA
-116AB ; Other_Alphabetic # Mn TAKRI SIGN ANUSVARA
-116AC ; Other_Alphabetic # Mc TAKRI SIGN VISARGA
-116AD ; Other_Alphabetic # Mn TAKRI VOWEL SIGN AA
-116AE..116AF ; Other_Alphabetic # Mc [2] TAKRI VOWEL SIGN I..TAKRI VOWEL SIGN II
-116B0..116B5 ; Other_Alphabetic # Mn [6] TAKRI VOWEL SIGN U..TAKRI VOWEL SIGN AU
-1171D..1171F ; Other_Alphabetic # Mn [3] AHOM CONSONANT SIGN MEDIAL LA..AHOM CONSONANT SIGN MEDIAL LIGATING RA
-11720..11721 ; Other_Alphabetic # Mc [2] AHOM VOWEL SIGN A..AHOM VOWEL SIGN AA
-11722..11725 ; Other_Alphabetic # Mn [4] AHOM VOWEL SIGN I..AHOM VOWEL SIGN UU
-11726 ; Other_Alphabetic # Mc AHOM VOWEL SIGN E
-11727..1172A ; Other_Alphabetic # Mn [4] AHOM VOWEL SIGN AW..AHOM VOWEL SIGN AM
-1182C..1182E ; Other_Alphabetic # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
-1182F..11837 ; Other_Alphabetic # Mn [9] DOGRA VOWEL SIGN U..DOGRA SIGN ANUSVARA
-11838 ; Other_Alphabetic # Mc DOGRA SIGN VISARGA
-11930..11935 ; Other_Alphabetic # Mc [6] DIVES AKURU VOWEL SIGN AA..DIVES AKURU VOWEL SIGN E
-11937..11938 ; Other_Alphabetic # Mc [2] DIVES AKURU VOWEL SIGN AI..DIVES AKURU VOWEL SIGN O
-1193B..1193C ; Other_Alphabetic # Mn [2] DIVES AKURU SIGN ANUSVARA..DIVES AKURU SIGN CANDRABINDU
-11940 ; Other_Alphabetic # Mc DIVES AKURU MEDIAL YA
-11942 ; Other_Alphabetic # Mc DIVES AKURU MEDIAL RA
-119D1..119D3 ; Other_Alphabetic # Mc [3] NANDINAGARI VOWEL SIGN AA..NANDINAGARI VOWEL SIGN II
-119D4..119D7 ; Other_Alphabetic # Mn [4] NANDINAGARI VOWEL SIGN U..NANDINAGARI VOWEL SIGN VOCALIC RR
-119DA..119DB ; Other_Alphabetic # Mn [2] NANDINAGARI VOWEL SIGN E..NANDINAGARI VOWEL SIGN AI
-119DC..119DF ; Other_Alphabetic # Mc [4] NANDINAGARI VOWEL SIGN O..NANDINAGARI SIGN VISARGA
-119E4 ; Other_Alphabetic # Mc NANDINAGARI VOWEL SIGN PRISHTHAMATRA E
-11A01..11A0A ; Other_Alphabetic # Mn [10] ZANABAZAR SQUARE VOWEL SIGN I..ZANABAZAR SQUARE VOWEL LENGTH MARK
-11A35..11A38 ; Other_Alphabetic # Mn [4] ZANABAZAR SQUARE SIGN CANDRABINDU..ZANABAZAR SQUARE SIGN ANUSVARA
-11A39 ; Other_Alphabetic # Mc ZANABAZAR SQUARE SIGN VISARGA
-11A3B..11A3E ; Other_Alphabetic # Mn [4] ZANABAZAR SQUARE CLUSTER-FINAL LETTER YA..ZANABAZAR SQUARE CLUSTER-FINAL LETTER VA
-11A51..11A56 ; Other_Alphabetic # Mn [6] SOYOMBO VOWEL SIGN I..SOYOMBO VOWEL SIGN OE
-11A57..11A58 ; Other_Alphabetic # Mc [2] SOYOMBO VOWEL SIGN AI..SOYOMBO VOWEL SIGN AU
-11A59..11A5B ; Other_Alphabetic # Mn [3] SOYOMBO VOWEL SIGN VOCALIC R..SOYOMBO VOWEL LENGTH MARK
-11A8A..11A96 ; Other_Alphabetic # Mn [13] SOYOMBO FINAL CONSONANT SIGN G..SOYOMBO SIGN ANUSVARA
-11A97 ; Other_Alphabetic # Mc SOYOMBO SIGN VISARGA
-11C2F ; Other_Alphabetic # Mc BHAIKSUKI VOWEL SIGN AA
-11C30..11C36 ; Other_Alphabetic # Mn [7] BHAIKSUKI VOWEL SIGN I..BHAIKSUKI VOWEL SIGN VOCALIC L
-11C38..11C3D ; Other_Alphabetic # Mn [6] BHAIKSUKI VOWEL SIGN E..BHAIKSUKI SIGN ANUSVARA
-11C3E ; Other_Alphabetic # Mc BHAIKSUKI SIGN VISARGA
-11C92..11CA7 ; Other_Alphabetic # Mn [22] MARCHEN SUBJOINED LETTER KA..MARCHEN SUBJOINED LETTER ZA
-11CA9 ; Other_Alphabetic # Mc MARCHEN SUBJOINED LETTER YA
-11CAA..11CB0 ; Other_Alphabetic # Mn [7] MARCHEN SUBJOINED LETTER RA..MARCHEN VOWEL SIGN AA
-11CB1 ; Other_Alphabetic # Mc MARCHEN VOWEL SIGN I
-11CB2..11CB3 ; Other_Alphabetic # Mn [2] MARCHEN VOWEL SIGN U..MARCHEN VOWEL SIGN E
-11CB4 ; Other_Alphabetic # Mc MARCHEN VOWEL SIGN O
-11CB5..11CB6 ; Other_Alphabetic # Mn [2] MARCHEN SIGN ANUSVARA..MARCHEN SIGN CANDRABINDU
-11D31..11D36 ; Other_Alphabetic # Mn [6] MASARAM GONDI VOWEL SIGN AA..MASARAM GONDI VOWEL SIGN VOCALIC R
-11D3A ; Other_Alphabetic # Mn MASARAM GONDI VOWEL SIGN E
-11D3C..11D3D ; Other_Alphabetic # Mn [2] MASARAM GONDI VOWEL SIGN AI..MASARAM GONDI VOWEL SIGN O
-11D3F..11D41 ; Other_Alphabetic # Mn [3] MASARAM GONDI VOWEL SIGN AU..MASARAM GONDI SIGN VISARGA
-11D43 ; Other_Alphabetic # Mn MASARAM GONDI SIGN CANDRA
-11D47 ; Other_Alphabetic # Mn MASARAM GONDI RA-KARA
-11D8A..11D8E ; Other_Alphabetic # Mc [5] GUNJALA GONDI VOWEL SIGN AA..GUNJALA GONDI VOWEL SIGN UU
-11D90..11D91 ; Other_Alphabetic # Mn [2] GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI VOWEL SIGN AI
-11D93..11D94 ; Other_Alphabetic # Mc [2] GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI VOWEL SIGN AU
-11D95 ; Other_Alphabetic # Mn GUNJALA GONDI SIGN ANUSVARA
-11D96 ; Other_Alphabetic # Mc GUNJALA GONDI SIGN VISARGA
-11EF3..11EF4 ; Other_Alphabetic # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
-11EF5..11EF6 ; Other_Alphabetic # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
-11F00..11F01 ; Other_Alphabetic # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
-11F03 ; Other_Alphabetic # Mc KAWI SIGN VISARGA
-11F34..11F35 ; Other_Alphabetic # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
-11F36..11F3A ; Other_Alphabetic # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
-11F3E..11F3F ; Other_Alphabetic # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
-11F40 ; Other_Alphabetic # Mn KAWI VOWEL SIGN EU
-16F4F ; Other_Alphabetic # Mn MIAO SIGN CONSONANT MODIFIER BAR
-16F51..16F87 ; Other_Alphabetic # Mc [55] MIAO SIGN ASPIRATION..MIAO VOWEL SIGN UI
-16F8F..16F92 ; Other_Alphabetic # Mn [4] MIAO TONE RIGHT..MIAO TONE BELOW
-16FF0..16FF1 ; Other_Alphabetic # Mc [2] VIETNAMESE ALTERNATE READING MARK CA..VIETNAMESE ALTERNATE READING MARK NHAY
-1BC9E ; Other_Alphabetic # Mn DUPLOYAN DOUBLE MARK
-1E000..1E006 ; Other_Alphabetic # Mn [7] COMBINING GLAGOLITIC LETTER AZU..COMBINING GLAGOLITIC LETTER ZHIVETE
-1E008..1E018 ; Other_Alphabetic # Mn [17] COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING GLAGOLITIC LETTER HERU
-1E01B..1E021 ; Other_Alphabetic # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
-1E023..1E024 ; Other_Alphabetic # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
-1E026..1E02A ; Other_Alphabetic # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
-1E08F ; Other_Alphabetic # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
-1E947 ; Other_Alphabetic # Mn ADLAM HAMZA
-1F130..1F149 ; Other_Alphabetic # So [26] SQUARED LATIN CAPITAL LETTER A..SQUARED LATIN CAPITAL LETTER Z
-1F150..1F169 ; Other_Alphabetic # So [26] NEGATIVE CIRCLED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
-1F170..1F189 ; Other_Alphabetic # So [26] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED LATIN CAPITAL LETTER Z
-
-# Total code points: 1425
-
-# ================================================
-
-3006 ; Ideographic # Lo IDEOGRAPHIC CLOSING MARK
-3007 ; Ideographic # Nl IDEOGRAPHIC NUMBER ZERO
-3021..3029 ; Ideographic # Nl [9] HANGZHOU NUMERAL ONE..HANGZHOU NUMERAL NINE
-3038..303A ; Ideographic # Nl [3] HANGZHOU NUMERAL TEN..HANGZHOU NUMERAL THIRTY
-3400..4DBF ; Ideographic # Lo [6592] CJK UNIFIED IDEOGRAPH-3400..CJK UNIFIED IDEOGRAPH-4DBF
-4E00..9FFF ; Ideographic # Lo [20992] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FFF
-F900..FA6D ; Ideographic # Lo [366] CJK COMPATIBILITY IDEOGRAPH-F900..CJK COMPATIBILITY IDEOGRAPH-FA6D
-FA70..FAD9 ; Ideographic # Lo [106] CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COMPATIBILITY IDEOGRAPH-FAD9
-16FE4 ; Ideographic # Mn KHITAN SMALL SCRIPT FILLER
-17000..187F7 ; Ideographic # Lo [6136] TANGUT IDEOGRAPH-17000..TANGUT IDEOGRAPH-187F7
-18800..18CD5 ; Ideographic # Lo [1238] TANGUT COMPONENT-001..KHITAN SMALL SCRIPT CHARACTER-18CD5
-18D00..18D08 ; Ideographic # Lo [9] TANGUT IDEOGRAPH-18D00..TANGUT IDEOGRAPH-18D08
-1B170..1B2FB ; Ideographic # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
-20000..2A6DF ; Ideographic # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
-2A700..2B739 ; Ideographic # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
-2B740..2B81D ; Ideographic # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
-2B820..2CEA1 ; Ideographic # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
-2CEB0..2EBE0 ; Ideographic # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
-2EBF0..2EE5D ; Ideographic # Lo [622] CJK UNIFIED IDEOGRAPH-2EBF0..CJK UNIFIED IDEOGRAPH-2EE5D
-2F800..2FA1D ; Ideographic # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
-30000..3134A ; Ideographic # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
-31350..323AF ; Ideographic # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
-
-# Total code points: 106476
-
-# ================================================
-
-005E ; Diacritic # Sk CIRCUMFLEX ACCENT
-0060 ; Diacritic # Sk GRAVE ACCENT
-00A8 ; Diacritic # Sk DIAERESIS
-00AF ; Diacritic # Sk MACRON
-00B4 ; Diacritic # Sk ACUTE ACCENT
-00B7 ; Diacritic # Po MIDDLE DOT
-00B8 ; Diacritic # Sk CEDILLA
-02B0..02C1 ; Diacritic # Lm [18] MODIFIER LETTER SMALL H..MODIFIER LETTER REVERSED GLOTTAL STOP
-02C2..02C5 ; Diacritic # Sk [4] MODIFIER LETTER LEFT ARROWHEAD..MODIFIER LETTER DOWN ARROWHEAD
-02C6..02D1 ; Diacritic # Lm [12] MODIFIER LETTER CIRCUMFLEX ACCENT..MODIFIER LETTER HALF TRIANGULAR COLON
-02D2..02DF ; Diacritic # Sk [14] MODIFIER LETTER CENTRED RIGHT HALF RING..MODIFIER LETTER CROSS ACCENT
-02E0..02E4 ; Diacritic # Lm [5] MODIFIER LETTER SMALL GAMMA..MODIFIER LETTER SMALL REVERSED GLOTTAL STOP
-02E5..02EB ; Diacritic # Sk [7] MODIFIER LETTER EXTRA-HIGH TONE BAR..MODIFIER LETTER YANG DEPARTING TONE MARK
-02EC ; Diacritic # Lm MODIFIER LETTER VOICING
-02ED ; Diacritic # Sk MODIFIER LETTER UNASPIRATED
-02EE ; Diacritic # Lm MODIFIER LETTER DOUBLE APOSTROPHE
-02EF..02FF ; Diacritic # Sk [17] MODIFIER LETTER LOW DOWN ARROWHEAD..MODIFIER LETTER LOW LEFT ARROW
-0300..034E ; Diacritic # Mn [79] COMBINING GRAVE ACCENT..COMBINING UPWARDS ARROW BELOW
-0350..0357 ; Diacritic # Mn [8] COMBINING RIGHT ARROWHEAD ABOVE..COMBINING RIGHT HALF RING ABOVE
-035D..0362 ; Diacritic # Mn [6] COMBINING DOUBLE BREVE..COMBINING DOUBLE RIGHTWARDS ARROW BELOW
-0374 ; Diacritic # Lm GREEK NUMERAL SIGN
-0375 ; Diacritic # Sk GREEK LOWER NUMERAL SIGN
-037A ; Diacritic # Lm GREEK YPOGEGRAMMENI
-0384..0385 ; Diacritic # Sk [2] GREEK TONOS..GREEK DIALYTIKA TONOS
-0483..0487 ; Diacritic # Mn [5] COMBINING CYRILLIC TITLO..COMBINING CYRILLIC POKRYTIE
-0559 ; Diacritic # Lm ARMENIAN MODIFIER LETTER LEFT HALF RING
-0591..05A1 ; Diacritic # Mn [17] HEBREW ACCENT ETNAHTA..HEBREW ACCENT PAZER
-05A3..05BD ; Diacritic # Mn [27] HEBREW ACCENT MUNAH..HEBREW POINT METEG
-05BF ; Diacritic # Mn HEBREW POINT RAFE
-05C1..05C2 ; Diacritic # Mn [2] HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
-05C4 ; Diacritic # Mn HEBREW MARK UPPER DOT
-064B..0652 ; Diacritic # Mn [8] ARABIC FATHATAN..ARABIC SUKUN
-0657..0658 ; Diacritic # Mn [2] ARABIC INVERTED DAMMA..ARABIC MARK NOON GHUNNA
-06DF..06E0 ; Diacritic # Mn [2] ARABIC SMALL HIGH ROUNDED ZERO..ARABIC SMALL HIGH UPRIGHT RECTANGULAR ZERO
-06E5..06E6 ; Diacritic # Lm [2] ARABIC SMALL WAW..ARABIC SMALL YEH
-06EA..06EC ; Diacritic # Mn [3] ARABIC EMPTY CENTRE LOW STOP..ARABIC ROUNDED HIGH STOP WITH FILLED CENTRE
-0730..074A ; Diacritic # Mn [27] SYRIAC PTHAHA ABOVE..SYRIAC BARREKH
-07A6..07B0 ; Diacritic # Mn [11] THAANA ABAFILI..THAANA SUKUN
-07EB..07F3 ; Diacritic # Mn [9] NKO COMBINING SHORT HIGH TONE..NKO COMBINING DOUBLE DOT ABOVE
-07F4..07F5 ; Diacritic # Lm [2] NKO HIGH TONE APOSTROPHE..NKO LOW TONE APOSTROPHE
-0818..0819 ; Diacritic # Mn [2] SAMARITAN MARK OCCLUSION..SAMARITAN MARK DAGESH
-0898..089F ; Diacritic # Mn [8] ARABIC SMALL HIGH WORD AL-JUZ..ARABIC HALF MADDA OVER MADDA
-08C9 ; Diacritic # Lm ARABIC SMALL FARSI YEH
-08CA..08D2 ; Diacritic # Mn [9] ARABIC SMALL HIGH FARSI YEH..ARABIC LARGE ROUND DOT INSIDE CIRCLE BELOW
-08E3..08FE ; Diacritic # Mn [28] ARABIC TURNED DAMMA BELOW..ARABIC DAMMA WITH DOT
-093C ; Diacritic # Mn DEVANAGARI SIGN NUKTA
-094D ; Diacritic # Mn DEVANAGARI SIGN VIRAMA
-0951..0954 ; Diacritic # Mn [4] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI ACUTE ACCENT
-0971 ; Diacritic # Lm DEVANAGARI SIGN HIGH SPACING DOT
-09BC ; Diacritic # Mn BENGALI SIGN NUKTA
-09CD ; Diacritic # Mn BENGALI SIGN VIRAMA
-0A3C ; Diacritic # Mn GURMUKHI SIGN NUKTA
-0A4D ; Diacritic # Mn GURMUKHI SIGN VIRAMA
-0ABC ; Diacritic # Mn GUJARATI SIGN NUKTA
-0ACD ; Diacritic # Mn GUJARATI SIGN VIRAMA
-0AFD..0AFF ; Diacritic # Mn [3] GUJARATI SIGN THREE-DOT NUKTA ABOVE..GUJARATI SIGN TWO-CIRCLE NUKTA ABOVE
-0B3C ; Diacritic # Mn ORIYA SIGN NUKTA
-0B4D ; Diacritic # Mn ORIYA SIGN VIRAMA
-0B55 ; Diacritic # Mn ORIYA SIGN OVERLINE
-0BCD ; Diacritic # Mn TAMIL SIGN VIRAMA
-0C3C ; Diacritic # Mn TELUGU SIGN NUKTA
-0C4D ; Diacritic # Mn TELUGU SIGN VIRAMA
-0CBC ; Diacritic # Mn KANNADA SIGN NUKTA
-0CCD ; Diacritic # Mn KANNADA SIGN VIRAMA
-0D3B..0D3C ; Diacritic # Mn [2] MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALAM SIGN CIRCULAR VIRAMA
-0D4D ; Diacritic # Mn MALAYALAM SIGN VIRAMA
-0DCA ; Diacritic # Mn SINHALA SIGN AL-LAKUNA
-0E47..0E4C ; Diacritic # Mn [6] THAI CHARACTER MAITAIKHU..THAI CHARACTER THANTHAKHAT
-0E4E ; Diacritic # Mn THAI CHARACTER YAMAKKAN
-0EBA ; Diacritic # Mn LAO SIGN PALI VIRAMA
-0EC8..0ECC ; Diacritic # Mn [5] LAO TONE MAI EK..LAO CANCELLATION MARK
-0F18..0F19 ; Diacritic # Mn [2] TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN ASTROLOGICAL SIGN SDONG TSHUGS
-0F35 ; Diacritic # Mn TIBETAN MARK NGAS BZUNG NYI ZLA
-0F37 ; Diacritic # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS
-0F39 ; Diacritic # Mn TIBETAN MARK TSA -PHRU
-0F3E..0F3F ; Diacritic # Mc [2] TIBETAN SIGN YAR TSHES..TIBETAN SIGN MAR TSHES
-0F82..0F84 ; Diacritic # Mn [3] TIBETAN SIGN NYI ZLA NAA DA..TIBETAN MARK HALANTA
-0F86..0F87 ; Diacritic # Mn [2] TIBETAN SIGN LCI RTAGS..TIBETAN SIGN YANG RTAGS
-0FC6 ; Diacritic # Mn TIBETAN SYMBOL PADMA GDAN
-1037 ; Diacritic # Mn MYANMAR SIGN DOT BELOW
-1039..103A ; Diacritic # Mn [2] MYANMAR SIGN VIRAMA..MYANMAR SIGN ASAT
-1063..1064 ; Diacritic # Mc [2] MYANMAR TONE MARK SGAW KAREN HATHI..MYANMAR TONE MARK SGAW KAREN KE PHO
-1069..106D ; Diacritic # Mc [5] MYANMAR SIGN WESTERN PWO KAREN TONE-1..MYANMAR SIGN WESTERN PWO KAREN TONE-5
-1087..108C ; Diacritic # Mc [6] MYANMAR SIGN SHAN TONE-2..MYANMAR SIGN SHAN COUNCIL TONE-3
-108D ; Diacritic # Mn MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE
-108F ; Diacritic # Mc MYANMAR SIGN RUMAI PALAUNG TONE-5
-109A..109B ; Diacritic # Mc [2] MYANMAR SIGN KHAMTI TONE-1..MYANMAR SIGN KHAMTI TONE-3
-135D..135F ; Diacritic # Mn [3] ETHIOPIC COMBINING GEMINATION AND VOWEL LENGTH MARK..ETHIOPIC COMBINING GEMINATION MARK
-1714 ; Diacritic # Mn TAGALOG SIGN VIRAMA
-1715 ; Diacritic # Mc TAGALOG SIGN PAMUDPOD
-17C9..17D3 ; Diacritic # Mn [11] KHMER SIGN MUUSIKATOAN..KHMER SIGN BATHAMASAT
-17DD ; Diacritic # Mn KHMER SIGN ATTHACAN
-1939..193B ; Diacritic # Mn [3] LIMBU SIGN MUKPHRENG..LIMBU SIGN SA-I
-1A75..1A7C ; Diacritic # Mn [8] TAI THAM SIGN TONE-1..TAI THAM SIGN KHUEN-LUE KARAN
-1A7F ; Diacritic # Mn TAI THAM COMBINING CRYPTOGRAMMIC DOT
-1AB0..1ABD ; Diacritic # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
-1ABE ; Diacritic # Me COMBINING PARENTHESES OVERLAY
-1AC1..1ACB ; Diacritic # Mn [11] COMBINING LEFT PARENTHESIS ABOVE LEFT..COMBINING TRIPLE ACUTE ACCENT
-1B34 ; Diacritic # Mn BALINESE SIGN REREKAN
-1B44 ; Diacritic # Mc BALINESE ADEG ADEG
-1B6B..1B73 ; Diacritic # Mn [9] BALINESE MUSICAL SYMBOL COMBINING TEGEH..BALINESE MUSICAL SYMBOL COMBINING GONG
-1BAA ; Diacritic # Mc SUNDANESE SIGN PAMAAEH
-1BAB ; Diacritic # Mn SUNDANESE SIGN VIRAMA
-1C36..1C37 ; Diacritic # Mn [2] LEPCHA SIGN RAN..LEPCHA SIGN NUKTA
-1C78..1C7D ; Diacritic # Lm [6] OL CHIKI MU TTUDDAG..OL CHIKI AHAD
-1CD0..1CD2 ; Diacritic # Mn [3] VEDIC TONE KARSHANA..VEDIC TONE PRENKHA
-1CD3 ; Diacritic # Po VEDIC SIGN NIHSHVASA
-1CD4..1CE0 ; Diacritic # Mn [13] VEDIC SIGN YAJURVEDIC MIDLINE SVARITA..VEDIC TONE RIGVEDIC KASHMIRI INDEPENDENT SVARITA
-1CE1 ; Diacritic # Mc VEDIC TONE ATHARVAVEDIC INDEPENDENT SVARITA
-1CE2..1CE8 ; Diacritic # Mn [7] VEDIC SIGN VISARGA SVARITA..VEDIC SIGN VISARGA ANUDATTA WITH TAIL
-1CED ; Diacritic # Mn VEDIC SIGN TIRYAK
-1CF4 ; Diacritic # Mn VEDIC TONE CANDRA ABOVE
-1CF7 ; Diacritic # Mc VEDIC SIGN ATIKRAMA
-1CF8..1CF9 ; Diacritic # Mn [2] VEDIC TONE RING ABOVE..VEDIC TONE DOUBLE RING ABOVE
-1D2C..1D6A ; Diacritic # Lm [63] MODIFIER LETTER CAPITAL A..GREEK SUBSCRIPT SMALL LETTER CHI
-1DC4..1DCF ; Diacritic # Mn [12] COMBINING MACRON-ACUTE..COMBINING ZIGZAG BELOW
-1DF5..1DFF ; Diacritic # Mn [11] COMBINING UP TACK ABOVE..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
-1FBD ; Diacritic # Sk GREEK KORONIS
-1FBF..1FC1 ; Diacritic # Sk [3] GREEK PSILI..GREEK DIALYTIKA AND PERISPOMENI
-1FCD..1FCF ; Diacritic # Sk [3] GREEK PSILI AND VARIA..GREEK PSILI AND PERISPOMENI
-1FDD..1FDF ; Diacritic # Sk [3] GREEK DASIA AND VARIA..GREEK DASIA AND PERISPOMENI
-1FED..1FEF ; Diacritic # Sk [3] GREEK DIALYTIKA AND VARIA..GREEK VARIA
-1FFD..1FFE ; Diacritic # Sk [2] GREEK OXIA..GREEK DASIA
-2CEF..2CF1 ; Diacritic # Mn [3] COPTIC COMBINING NI ABOVE..COPTIC COMBINING SPIRITUS LENIS
-2E2F ; Diacritic # Lm VERTICAL TILDE
-302A..302D ; Diacritic # Mn [4] IDEOGRAPHIC LEVEL TONE MARK..IDEOGRAPHIC ENTERING TONE MARK
-302E..302F ; Diacritic # Mc [2] HANGUL SINGLE DOT TONE MARK..HANGUL DOUBLE DOT TONE MARK
-3099..309A ; Diacritic # Mn [2] COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK..COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
-309B..309C ; Diacritic # Sk [2] KATAKANA-HIRAGANA VOICED SOUND MARK..KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
-30FC ; Diacritic # Lm KATAKANA-HIRAGANA PROLONGED SOUND MARK
-A66F ; Diacritic # Mn COMBINING CYRILLIC VZMET
-A67C..A67D ; Diacritic # Mn [2] COMBINING CYRILLIC KAVYKA..COMBINING CYRILLIC PAYEROK
-A67F ; Diacritic # Lm CYRILLIC PAYEROK
-A69C..A69D ; Diacritic # Lm [2] MODIFIER LETTER CYRILLIC HARD SIGN..MODIFIER LETTER CYRILLIC SOFT SIGN
-A6F0..A6F1 ; Diacritic # Mn [2] BAMUM COMBINING MARK KOQNDON..BAMUM COMBINING MARK TUKWENTIS
-A700..A716 ; Diacritic # Sk [23] MODIFIER LETTER CHINESE TONE YIN PING..MODIFIER LETTER EXTRA-LOW LEFT-STEM TONE BAR
-A717..A71F ; Diacritic # Lm [9] MODIFIER LETTER DOT VERTICAL BAR..MODIFIER LETTER LOW INVERTED EXCLAMATION MARK
-A720..A721 ; Diacritic # Sk [2] MODIFIER LETTER STRESS AND HIGH TONE..MODIFIER LETTER STRESS AND LOW TONE
-A788 ; Diacritic # Lm MODIFIER LETTER LOW CIRCUMFLEX ACCENT
-A789..A78A ; Diacritic # Sk [2] MODIFIER LETTER COLON..MODIFIER LETTER SHORT EQUALS SIGN
-A7F8..A7F9 ; Diacritic # Lm [2] MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIER LETTER SMALL LIGATURE OE
-A8C4 ; Diacritic # Mn SAURASHTRA SIGN VIRAMA
-A8E0..A8F1 ; Diacritic # Mn [18] COMBINING DEVANAGARI DIGIT ZERO..COMBINING DEVANAGARI SIGN AVAGRAHA
-A92B..A92D ; Diacritic # Mn [3] KAYAH LI TONE PLOPHU..KAYAH LI TONE CALYA PLOPHU
-A92E ; Diacritic # Po KAYAH LI SIGN CWI
-A953 ; Diacritic # Mc REJANG VIRAMA
-A9B3 ; Diacritic # Mn JAVANESE SIGN CECAK TELU
-A9C0 ; Diacritic # Mc JAVANESE PANGKON
-A9E5 ; Diacritic # Mn MYANMAR SIGN SHAN SAW
-AA7B ; Diacritic # Mc MYANMAR SIGN PAO KAREN TONE
-AA7C ; Diacritic # Mn MYANMAR SIGN TAI LAING TONE-2
-AA7D ; Diacritic # Mc MYANMAR SIGN TAI LAING TONE-5
-AABF ; Diacritic # Mn TAI VIET TONE MAI EK
-AAC0 ; Diacritic # Lo TAI VIET TONE MAI NUENG
-AAC1 ; Diacritic # Mn TAI VIET TONE MAI THO
-AAC2 ; Diacritic # Lo TAI VIET TONE MAI SONG
-AAF6 ; Diacritic # Mn MEETEI MAYEK VIRAMA
-AB5B ; Diacritic # Sk MODIFIER BREVE WITH INVERTED BREVE
-AB5C..AB5F ; Diacritic # Lm [4] MODIFIER LETTER SMALL HENG..MODIFIER LETTER SMALL U WITH LEFT HOOK
-AB69 ; Diacritic # Lm MODIFIER LETTER SMALL TURNED W
-AB6A..AB6B ; Diacritic # Sk [2] MODIFIER LETTER LEFT TACK..MODIFIER LETTER RIGHT TACK
-ABEC ; Diacritic # Mc MEETEI MAYEK LUM IYEK
-ABED ; Diacritic # Mn MEETEI MAYEK APUN IYEK
-FB1E ; Diacritic # Mn HEBREW POINT JUDEO-SPANISH VARIKA
-FE20..FE2F ; Diacritic # Mn [16] COMBINING LIGATURE LEFT HALF..COMBINING CYRILLIC TITLO RIGHT HALF
-FF3E ; Diacritic # Sk FULLWIDTH CIRCUMFLEX ACCENT
-FF40 ; Diacritic # Sk FULLWIDTH GRAVE ACCENT
-FF70 ; Diacritic # Lm HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
-FF9E..FF9F ; Diacritic # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
-FFE3 ; Diacritic # Sk FULLWIDTH MACRON
-102E0 ; Diacritic # Mn COPTIC EPACT THOUSANDS MARK
-10780..10785 ; Diacritic # Lm [6] MODIFIER LETTER SMALL CAPITAL AA..MODIFIER LETTER SMALL B WITH HOOK
-10787..107B0 ; Diacritic # Lm [42] MODIFIER LETTER SMALL DZ DIGRAPH..MODIFIER LETTER SMALL V WITH RIGHT HOOK
-107B2..107BA ; Diacritic # Lm [9] MODIFIER LETTER SMALL CAPITAL Y..MODIFIER LETTER SMALL S WITH CURL
-10AE5..10AE6 ; Diacritic # Mn [2] MANICHAEAN ABBREVIATION MARK ABOVE..MANICHAEAN ABBREVIATION MARK BELOW
-10D22..10D23 ; Diacritic # Lo [2] HANIFI ROHINGYA MARK SAKIN..HANIFI ROHINGYA MARK NA KHONNA
-10D24..10D27 ; Diacritic # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
-10EFD..10EFF ; Diacritic # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
-10F46..10F50 ; Diacritic # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
-10F82..10F85 ; Diacritic # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
-11046 ; Diacritic # Mn BRAHMI VIRAMA
-11070 ; Diacritic # Mn BRAHMI SIGN OLD TAMIL VIRAMA
-110B9..110BA ; Diacritic # Mn [2] KAITHI SIGN VIRAMA..KAITHI SIGN NUKTA
-11133..11134 ; Diacritic # Mn [2] CHAKMA VIRAMA..CHAKMA MAAYYAA
-11173 ; Diacritic # Mn MAHAJANI SIGN NUKTA
-111C0 ; Diacritic # Mc SHARADA SIGN VIRAMA
-111CA..111CC ; Diacritic # Mn [3] SHARADA SIGN NUKTA..SHARADA EXTRA SHORT VOWEL MARK
-11235 ; Diacritic # Mc KHOJKI SIGN VIRAMA
-11236 ; Diacritic # Mn KHOJKI SIGN NUKTA
-112E9..112EA ; Diacritic # Mn [2] KHUDAWADI SIGN NUKTA..KHUDAWADI SIGN VIRAMA
-1133C ; Diacritic # Mn GRANTHA SIGN NUKTA
-1134D ; Diacritic # Mc GRANTHA SIGN VIRAMA
-11366..1136C ; Diacritic # Mn [7] COMBINING GRANTHA DIGIT ZERO..COMBINING GRANTHA DIGIT SIX
-11370..11374 ; Diacritic # Mn [5] COMBINING GRANTHA LETTER A..COMBINING GRANTHA LETTER PA
-11442 ; Diacritic # Mn NEWA SIGN VIRAMA
-11446 ; Diacritic # Mn NEWA SIGN NUKTA
-114C2..114C3 ; Diacritic # Mn [2] TIRHUTA SIGN VIRAMA..TIRHUTA SIGN NUKTA
-115BF..115C0 ; Diacritic # Mn [2] SIDDHAM SIGN VIRAMA..SIDDHAM SIGN NUKTA
-1163F ; Diacritic # Mn MODI SIGN VIRAMA
-116B6 ; Diacritic # Mc TAKRI SIGN VIRAMA
-116B7 ; Diacritic # Mn TAKRI SIGN NUKTA
-1172B ; Diacritic # Mn AHOM SIGN KILLER
-11839..1183A ; Diacritic # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
-1193D ; Diacritic # Mc DIVES AKURU SIGN HALANTA
-1193E ; Diacritic # Mn DIVES AKURU VIRAMA
-11943 ; Diacritic # Mn DIVES AKURU SIGN NUKTA
-119E0 ; Diacritic # Mn NANDINAGARI SIGN VIRAMA
-11A34 ; Diacritic # Mn ZANABAZAR SQUARE SIGN VIRAMA
-11A47 ; Diacritic # Mn ZANABAZAR SQUARE SUBJOINER
-11A99 ; Diacritic # Mn SOYOMBO SUBJOINER
-11C3F ; Diacritic # Mn BHAIKSUKI SIGN VIRAMA
-11D42 ; Diacritic # Mn MASARAM GONDI SIGN NUKTA
-11D44..11D45 ; Diacritic # Mn [2] MASARAM GONDI SIGN HALANTA..MASARAM GONDI VIRAMA
-11D97 ; Diacritic # Mn GUNJALA GONDI VIRAMA
-13447..13455 ; Diacritic # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
-16AF0..16AF4 ; Diacritic # Mn [5] BASSA VAH COMBINING HIGH TONE..BASSA VAH COMBINING HIGH-LOW TONE
-16B30..16B36 ; Diacritic # Mn [7] PAHAWH HMONG MARK CIM TUB..PAHAWH HMONG MARK CIM TAUM
-16F8F..16F92 ; Diacritic # Mn [4] MIAO TONE RIGHT..MIAO TONE BELOW
-16F93..16F9F ; Diacritic # Lm [13] MIAO LETTER TONE-2..MIAO LETTER REFORMED TONE-8
-16FF0..16FF1 ; Diacritic # Mc [2] VIETNAMESE ALTERNATE READING MARK CA..VIETNAMESE ALTERNATE READING MARK NHAY
-1AFF0..1AFF3 ; Diacritic # Lm [4] KATAKANA LETTER MINNAN TONE-2..KATAKANA LETTER MINNAN TONE-5
-1AFF5..1AFFB ; Diacritic # Lm [7] KATAKANA LETTER MINNAN TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-5
-1AFFD..1AFFE ; Diacritic # Lm [2] KATAKANA LETTER MINNAN NASALIZED TONE-7..KATAKANA LETTER MINNAN NASALIZED TONE-8
-1CF00..1CF2D ; Diacritic # Mn [46] ZNAMENNY COMBINING MARK GORAZDO NIZKO S KRYZHEM ON LEFT..ZNAMENNY COMBINING MARK KRYZH ON LEFT
-1CF30..1CF46 ; Diacritic # Mn [23] ZNAMENNY COMBINING TONAL RANGE MARK MRACHNO..ZNAMENNY PRIZNAK MODIFIER ROG
-1D167..1D169 ; Diacritic # Mn [3] MUSICAL SYMBOL COMBINING TREMOLO-1..MUSICAL SYMBOL COMBINING TREMOLO-3
-1D16D..1D172 ; Diacritic # Mc [6] MUSICAL SYMBOL COMBINING AUGMENTATION DOT..MUSICAL SYMBOL COMBINING FLAG-5
-1D17B..1D182 ; Diacritic # Mn [8] MUSICAL SYMBOL COMBINING ACCENT..MUSICAL SYMBOL COMBINING LOURE
-1D185..1D18B ; Diacritic # Mn [7] MUSICAL SYMBOL COMBINING DOIT..MUSICAL SYMBOL COMBINING TRIPLE TONGUE
-1D1AA..1D1AD ; Diacritic # Mn [4] MUSICAL SYMBOL COMBINING DOWN BOW..MUSICAL SYMBOL COMBINING SNAP PIZZICATO
-1E030..1E06D ; Diacritic # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
-1E130..1E136 ; Diacritic # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
-1E2AE ; Diacritic # Mn TOTO SIGN RISING TONE
-1E2EC..1E2EF ; Diacritic # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
-1E8D0..1E8D6 ; Diacritic # Mn [7] MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE KIKAKUI COMBINING NUMBER MILLIONS
-1E944..1E946 ; Diacritic # Mn [3] ADLAM ALIF LENGTHENER..ADLAM GEMINATION MARK
-1E948..1E94A ; Diacritic # Mn [3] ADLAM CONSONANT MODIFIER..ADLAM NUKTA
-
-# Total code points: 1144
-
-# ================================================
-
-00B7 ; Extender # Po MIDDLE DOT
-02D0..02D1 ; Extender # Lm [2] MODIFIER LETTER TRIANGULAR COLON..MODIFIER LETTER HALF TRIANGULAR COLON
-0640 ; Extender # Lm ARABIC TATWEEL
-07FA ; Extender # Lm NKO LAJANYALAN
-0B55 ; Extender # Mn ORIYA SIGN OVERLINE
-0E46 ; Extender # Lm THAI CHARACTER MAIYAMOK
-0EC6 ; Extender # Lm LAO KO LA
-180A ; Extender # Po MONGOLIAN NIRUGU
-1843 ; Extender # Lm MONGOLIAN LETTER TODO LONG VOWEL SIGN
-1AA7 ; Extender # Lm TAI THAM SIGN MAI YAMOK
-1C36 ; Extender # Mn LEPCHA SIGN RAN
-1C7B ; Extender # Lm OL CHIKI RELAA
-3005 ; Extender # Lm IDEOGRAPHIC ITERATION MARK
-3031..3035 ; Extender # Lm [5] VERTICAL KANA REPEAT MARK..VERTICAL KANA REPEAT MARK LOWER HALF
-309D..309E ; Extender # Lm [2] HIRAGANA ITERATION MARK..HIRAGANA VOICED ITERATION MARK
-30FC..30FE ; Extender # Lm [3] KATAKANA-HIRAGANA PROLONGED SOUND MARK..KATAKANA VOICED ITERATION MARK
-A015 ; Extender # Lm YI SYLLABLE WU
-A60C ; Extender # Lm VAI SYLLABLE LENGTHENER
-A9CF ; Extender # Lm JAVANESE PANGRANGKEP
-A9E6 ; Extender # Lm MYANMAR MODIFIER LETTER SHAN REDUPLICATION
-AA70 ; Extender # Lm MYANMAR MODIFIER LETTER KHAMTI REDUPLICATION
-AADD ; Extender # Lm TAI VIET SYMBOL SAM
-AAF3..AAF4 ; Extender # Lm [2] MEETEI MAYEK SYLLABLE REPETITION MARK..MEETEI MAYEK WORD REPETITION MARK
-FF70 ; Extender # Lm HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
-10781..10782 ; Extender # Lm [2] MODIFIER LETTER SUPERSCRIPT TRIANGULAR COLON..MODIFIER LETTER SUPERSCRIPT HALF TRIANGULAR COLON
-1135D ; Extender # Lo GRANTHA SIGN PLUTA
-115C6..115C8 ; Extender # Po [3] SIDDHAM REPETITION MARK-1..SIDDHAM REPETITION MARK-3
-11A98 ; Extender # Mn SOYOMBO GEMINATION MARK
-16B42..16B43 ; Extender # Lm [2] PAHAWH HMONG SIGN VOS NRUA..PAHAWH HMONG SIGN IB YAM
-16FE0..16FE1 ; Extender # Lm [2] TANGUT ITERATION MARK..NUSHU ITERATION MARK
-16FE3 ; Extender # Lm OLD CHINESE ITERATION MARK
-1E13C..1E13D ; Extender # Lm [2] NYIAKENG PUACHUE HMONG SIGN XW XW..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
-1E944..1E946 ; Extender # Mn [3] ADLAM ALIF LENGTHENER..ADLAM GEMINATION MARK
-
-# Total code points: 50
-
-# ================================================
-
-00AA ; Other_Lowercase # Lo FEMININE ORDINAL INDICATOR
-00BA ; Other_Lowercase # Lo MASCULINE ORDINAL INDICATOR
-02B0..02B8 ; Other_Lowercase # Lm [9] MODIFIER LETTER SMALL H..MODIFIER LETTER SMALL Y
-02C0..02C1 ; Other_Lowercase # Lm [2] MODIFIER LETTER GLOTTAL STOP..MODIFIER LETTER REVERSED GLOTTAL STOP
-02E0..02E4 ; Other_Lowercase # Lm [5] MODIFIER LETTER SMALL GAMMA..MODIFIER LETTER SMALL REVERSED GLOTTAL STOP
-0345 ; Other_Lowercase # Mn COMBINING GREEK YPOGEGRAMMENI
-037A ; Other_Lowercase # Lm GREEK YPOGEGRAMMENI
-10FC ; Other_Lowercase # Lm MODIFIER LETTER GEORGIAN NAR
-1D2C..1D6A ; Other_Lowercase # Lm [63] MODIFIER LETTER CAPITAL A..GREEK SUBSCRIPT SMALL LETTER CHI
-1D78 ; Other_Lowercase # Lm MODIFIER LETTER CYRILLIC EN
-1D9B..1DBF ; Other_Lowercase # Lm [37] MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER LETTER SMALL THETA
-2071 ; Other_Lowercase # Lm SUPERSCRIPT LATIN SMALL LETTER I
-207F ; Other_Lowercase # Lm SUPERSCRIPT LATIN SMALL LETTER N
-2090..209C ; Other_Lowercase # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
-2170..217F ; Other_Lowercase # Nl [16] SMALL ROMAN NUMERAL ONE..SMALL ROMAN NUMERAL ONE THOUSAND
-24D0..24E9 ; Other_Lowercase # So [26] CIRCLED LATIN SMALL LETTER A..CIRCLED LATIN SMALL LETTER Z
-2C7C..2C7D ; Other_Lowercase # Lm [2] LATIN SUBSCRIPT SMALL LETTER J..MODIFIER LETTER CAPITAL V
-A69C..A69D ; Other_Lowercase # Lm [2] MODIFIER LETTER CYRILLIC HARD SIGN..MODIFIER LETTER CYRILLIC SOFT SIGN
-A770 ; Other_Lowercase # Lm MODIFIER LETTER US
-A7F2..A7F4 ; Other_Lowercase # Lm [3] MODIFIER LETTER CAPITAL C..MODIFIER LETTER CAPITAL Q
-A7F8..A7F9 ; Other_Lowercase # Lm [2] MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIER LETTER SMALL LIGATURE OE
-AB5C..AB5F ; Other_Lowercase # Lm [4] MODIFIER LETTER SMALL HENG..MODIFIER LETTER SMALL U WITH LEFT HOOK
-AB69 ; Other_Lowercase # Lm MODIFIER LETTER SMALL TURNED W
-10780 ; Other_Lowercase # Lm MODIFIER LETTER SMALL CAPITAL AA
-10783..10785 ; Other_Lowercase # Lm [3] MODIFIER LETTER SMALL AE..MODIFIER LETTER SMALL B WITH HOOK
-10787..107B0 ; Other_Lowercase # Lm [42] MODIFIER LETTER SMALL DZ DIGRAPH..MODIFIER LETTER SMALL V WITH RIGHT HOOK
-107B2..107BA ; Other_Lowercase # Lm [9] MODIFIER LETTER SMALL CAPITAL Y..MODIFIER LETTER SMALL S WITH CURL
-1E030..1E06D ; Other_Lowercase # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
-
-# Total code points: 311
-
-# ================================================
-
-2160..216F ; Other_Uppercase # Nl [16] ROMAN NUMERAL ONE..ROMAN NUMERAL ONE THOUSAND
-24B6..24CF ; Other_Uppercase # So [26] CIRCLED LATIN CAPITAL LETTER A..CIRCLED LATIN CAPITAL LETTER Z
-1F130..1F149 ; Other_Uppercase # So [26] SQUARED LATIN CAPITAL LETTER A..SQUARED LATIN CAPITAL LETTER Z
-1F150..1F169 ; Other_Uppercase # So [26] NEGATIVE CIRCLED LATIN CAPITAL LETTER A..NEGATIVE CIRCLED LATIN CAPITAL LETTER Z
-1F170..1F189 ; Other_Uppercase # So [26] NEGATIVE SQUARED LATIN CAPITAL LETTER A..NEGATIVE SQUARED LATIN CAPITAL LETTER Z
-
-# Total code points: 120
-
-# ================================================
-
-FDD0..FDEF ; Noncharacter_Code_Point # Cn [32] <noncharacter-FDD0>..<noncharacter-FDEF>
-FFFE..FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-FFFE>..<noncharacter-FFFF>
-1FFFE..1FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-1FFFE>..<noncharacter-1FFFF>
-2FFFE..2FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-2FFFE>..<noncharacter-2FFFF>
-3FFFE..3FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-3FFFE>..<noncharacter-3FFFF>
-4FFFE..4FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-4FFFE>..<noncharacter-4FFFF>
-5FFFE..5FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-5FFFE>..<noncharacter-5FFFF>
-6FFFE..6FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-6FFFE>..<noncharacter-6FFFF>
-7FFFE..7FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-7FFFE>..<noncharacter-7FFFF>
-8FFFE..8FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-8FFFE>..<noncharacter-8FFFF>
-9FFFE..9FFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-9FFFE>..<noncharacter-9FFFF>
-AFFFE..AFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-AFFFE>..<noncharacter-AFFFF>
-BFFFE..BFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-BFFFE>..<noncharacter-BFFFF>
-CFFFE..CFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-CFFFE>..<noncharacter-CFFFF>
-DFFFE..DFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-DFFFE>..<noncharacter-DFFFF>
-EFFFE..EFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-EFFFE>..<noncharacter-EFFFF>
-FFFFE..FFFFF ; Noncharacter_Code_Point # Cn [2] <noncharacter-FFFFE>..<noncharacter-FFFFF>
-10FFFE..10FFFF; Noncharacter_Code_Point # Cn [2] <noncharacter-10FFFE>..<noncharacter-10FFFF>
-
-# Total code points: 66
-
-# ================================================
-
-09BE ; Other_Grapheme_Extend # Mc BENGALI VOWEL SIGN AA
-09D7 ; Other_Grapheme_Extend # Mc BENGALI AU LENGTH MARK
-0B3E ; Other_Grapheme_Extend # Mc ORIYA VOWEL SIGN AA
-0B57 ; Other_Grapheme_Extend # Mc ORIYA AU LENGTH MARK
-0BBE ; Other_Grapheme_Extend # Mc TAMIL VOWEL SIGN AA
-0BD7 ; Other_Grapheme_Extend # Mc TAMIL AU LENGTH MARK
-0CC2 ; Other_Grapheme_Extend # Mc KANNADA VOWEL SIGN UU
-0CD5..0CD6 ; Other_Grapheme_Extend # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
-0D3E ; Other_Grapheme_Extend # Mc MALAYALAM VOWEL SIGN AA
-0D57 ; Other_Grapheme_Extend # Mc MALAYALAM AU LENGTH MARK
-0DCF ; Other_Grapheme_Extend # Mc SINHALA VOWEL SIGN AELA-PILLA
-0DDF ; Other_Grapheme_Extend # Mc SINHALA VOWEL SIGN GAYANUKITTA
-1B35 ; Other_Grapheme_Extend # Mc BALINESE VOWEL SIGN TEDUNG
-200C ; Other_Grapheme_Extend # Cf ZERO WIDTH NON-JOINER
-302E..302F ; Other_Grapheme_Extend # Mc [2] HANGUL SINGLE DOT TONE MARK..HANGUL DOUBLE DOT TONE MARK
-FF9E..FF9F ; Other_Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
-1133E ; Other_Grapheme_Extend # Mc GRANTHA VOWEL SIGN AA
-11357 ; Other_Grapheme_Extend # Mc GRANTHA AU LENGTH MARK
-114B0 ; Other_Grapheme_Extend # Mc TIRHUTA VOWEL SIGN AA
-114BD ; Other_Grapheme_Extend # Mc TIRHUTA VOWEL SIGN SHORT O
-115AF ; Other_Grapheme_Extend # Mc SIDDHAM VOWEL SIGN AA
-11930 ; Other_Grapheme_Extend # Mc DIVES AKURU VOWEL SIGN AA
-1D165 ; Other_Grapheme_Extend # Mc MUSICAL SYMBOL COMBINING STEM
-1D16E..1D172 ; Other_Grapheme_Extend # Mc [5] MUSICAL SYMBOL COMBINING FLAG-1..MUSICAL SYMBOL COMBINING FLAG-5
-E0020..E007F ; Other_Grapheme_Extend # Cf [96] TAG SPACE..CANCEL TAG
-
-# Total code points: 127
-
-# ================================================
-
-2FF0..2FF1 ; IDS_Binary_Operator # So [2] IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO RIGHT..IDEOGRAPHIC DESCRIPTION CHARACTER ABOVE TO BELOW
-2FF4..2FFD ; IDS_Binary_Operator # So [10] IDEOGRAPHIC DESCRIPTION CHARACTER FULL SURROUND..IDEOGRAPHIC DESCRIPTION CHARACTER SURROUND FROM LOWER RIGHT
-31EF ; IDS_Binary_Operator # So IDEOGRAPHIC DESCRIPTION CHARACTER SUBTRACTION
-
-# Total code points: 13
-
-# ================================================
-
-2FF2..2FF3 ; IDS_Trinary_Operator # So [2] IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO MIDDLE AND RIGHT..IDEOGRAPHIC DESCRIPTION CHARACTER ABOVE TO MIDDLE AND BELOW
-
-# Total code points: 2
-
-# ================================================
-
-2FFE..2FFF ; IDS_Unary_Operator # So [2] IDEOGRAPHIC DESCRIPTION CHARACTER HORIZONTAL REFLECTION..IDEOGRAPHIC DESCRIPTION CHARACTER ROTATION
-
-# Total code points: 2
-
-# ================================================
-
-2E80..2E99 ; Radical # So [26] CJK RADICAL REPEAT..CJK RADICAL RAP
-2E9B..2EF3 ; Radical # So [89] CJK RADICAL CHOKE..CJK RADICAL C-SIMPLIFIED TURTLE
-2F00..2FD5 ; Radical # So [214] KANGXI RADICAL ONE..KANGXI RADICAL FLUTE
-
-# Total code points: 329
-
-# ================================================
-
-3400..4DBF ; Unified_Ideograph # Lo [6592] CJK UNIFIED IDEOGRAPH-3400..CJK UNIFIED IDEOGRAPH-4DBF
-4E00..9FFF ; Unified_Ideograph # Lo [20992] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FFF
-FA0E..FA0F ; Unified_Ideograph # Lo [2] CJK COMPATIBILITY IDEOGRAPH-FA0E..CJK COMPATIBILITY IDEOGRAPH-FA0F
-FA11 ; Unified_Ideograph # Lo CJK COMPATIBILITY IDEOGRAPH-FA11
-FA13..FA14 ; Unified_Ideograph # Lo [2] CJK COMPATIBILITY IDEOGRAPH-FA13..CJK COMPATIBILITY IDEOGRAPH-FA14
-FA1F ; Unified_Ideograph # Lo CJK COMPATIBILITY IDEOGRAPH-FA1F
-FA21 ; Unified_Ideograph # Lo CJK COMPATIBILITY IDEOGRAPH-FA21
-FA23..FA24 ; Unified_Ideograph # Lo [2] CJK COMPATIBILITY IDEOGRAPH-FA23..CJK COMPATIBILITY IDEOGRAPH-FA24
-FA27..FA29 ; Unified_Ideograph # Lo [3] CJK COMPATIBILITY IDEOGRAPH-FA27..CJK COMPATIBILITY IDEOGRAPH-FA29
-20000..2A6DF ; Unified_Ideograph # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
-2A700..2B739 ; Unified_Ideograph # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
-2B740..2B81D ; Unified_Ideograph # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
-2B820..2CEA1 ; Unified_Ideograph # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
-2CEB0..2EBE0 ; Unified_Ideograph # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
-2EBF0..2EE5D ; Unified_Ideograph # Lo [622] CJK UNIFIED IDEOGRAPH-2EBF0..CJK UNIFIED IDEOGRAPH-2EE5D
-30000..3134A ; Unified_Ideograph # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
-31350..323AF ; Unified_Ideograph # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
-
-# Total code points: 97680
-
-# ================================================
-
-034F ; Other_Default_Ignorable_Code_Point # Mn COMBINING GRAPHEME JOINER
-115F..1160 ; Other_Default_Ignorable_Code_Point # Lo [2] HANGUL CHOSEONG FILLER..HANGUL JUNGSEONG FILLER
-17B4..17B5 ; Other_Default_Ignorable_Code_Point # Mn [2] KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT AA
-2065 ; Other_Default_Ignorable_Code_Point # Cn <reserved-2065>
-3164 ; Other_Default_Ignorable_Code_Point # Lo HANGUL FILLER
-FFA0 ; Other_Default_Ignorable_Code_Point # Lo HALFWIDTH HANGUL FILLER
-FFF0..FFF8 ; Other_Default_Ignorable_Code_Point # Cn [9] <reserved-FFF0>..<reserved-FFF8>
-E0000 ; Other_Default_Ignorable_Code_Point # Cn <reserved-E0000>
-E0002..E001F ; Other_Default_Ignorable_Code_Point # Cn [30] <reserved-E0002>..<reserved-E001F>
-E0080..E00FF ; Other_Default_Ignorable_Code_Point # Cn [128] <reserved-E0080>..<reserved-E00FF>
-E01F0..E0FFF ; Other_Default_Ignorable_Code_Point # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
-
-# Total code points: 3776
-
-# ================================================
-
-0149 ; Deprecated # L& LATIN SMALL LETTER N PRECEDED BY APOSTROPHE
-0673 ; Deprecated # Lo ARABIC LETTER ALEF WITH WAVY HAMZA BELOW
-0F77 ; Deprecated # Mn TIBETAN VOWEL SIGN VOCALIC RR
-0F79 ; Deprecated # Mn TIBETAN VOWEL SIGN VOCALIC LL
-17A3..17A4 ; Deprecated # Lo [2] KHMER INDEPENDENT VOWEL QAQ..KHMER INDEPENDENT VOWEL QAA
-206A..206F ; Deprecated # Cf [6] INHIBIT SYMMETRIC SWAPPING..NOMINAL DIGIT SHAPES
-2329 ; Deprecated # Ps LEFT-POINTING ANGLE BRACKET
-232A ; Deprecated # Pe RIGHT-POINTING ANGLE BRACKET
-E0001 ; Deprecated # Cf LANGUAGE TAG
-
-# Total code points: 15
-
-# ================================================
-
-0069..006A ; Soft_Dotted # L& [2] LATIN SMALL LETTER I..LATIN SMALL LETTER J
-012F ; Soft_Dotted # L& LATIN SMALL LETTER I WITH OGONEK
-0249 ; Soft_Dotted # L& LATIN SMALL LETTER J WITH STROKE
-0268 ; Soft_Dotted # L& LATIN SMALL LETTER I WITH STROKE
-029D ; Soft_Dotted # L& LATIN SMALL LETTER J WITH CROSSED-TAIL
-02B2 ; Soft_Dotted # Lm MODIFIER LETTER SMALL J
-03F3 ; Soft_Dotted # L& GREEK LETTER YOT
-0456 ; Soft_Dotted # L& CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
-0458 ; Soft_Dotted # L& CYRILLIC SMALL LETTER JE
-1D62 ; Soft_Dotted # Lm LATIN SUBSCRIPT SMALL LETTER I
-1D96 ; Soft_Dotted # L& LATIN SMALL LETTER I WITH RETROFLEX HOOK
-1DA4 ; Soft_Dotted # Lm MODIFIER LETTER SMALL I WITH STROKE
-1DA8 ; Soft_Dotted # Lm MODIFIER LETTER SMALL J WITH CROSSED-TAIL
-1E2D ; Soft_Dotted # L& LATIN SMALL LETTER I WITH TILDE BELOW
-1ECB ; Soft_Dotted # L& LATIN SMALL LETTER I WITH DOT BELOW
-2071 ; Soft_Dotted # Lm SUPERSCRIPT LATIN SMALL LETTER I
-2148..2149 ; Soft_Dotted # L& [2] DOUBLE-STRUCK ITALIC SMALL I..DOUBLE-STRUCK ITALIC SMALL J
-2C7C ; Soft_Dotted # Lm LATIN SUBSCRIPT SMALL LETTER J
-1D422..1D423 ; Soft_Dotted # L& [2] MATHEMATICAL BOLD SMALL I..MATHEMATICAL BOLD SMALL J
-1D456..1D457 ; Soft_Dotted # L& [2] MATHEMATICAL ITALIC SMALL I..MATHEMATICAL ITALIC SMALL J
-1D48A..1D48B ; Soft_Dotted # L& [2] MATHEMATICAL BOLD ITALIC SMALL I..MATHEMATICAL BOLD ITALIC SMALL J
-1D4BE..1D4BF ; Soft_Dotted # L& [2] MATHEMATICAL SCRIPT SMALL I..MATHEMATICAL SCRIPT SMALL J
-1D4F2..1D4F3 ; Soft_Dotted # L& [2] MATHEMATICAL BOLD SCRIPT SMALL I..MATHEMATICAL BOLD SCRIPT SMALL J
-1D526..1D527 ; Soft_Dotted # L& [2] MATHEMATICAL FRAKTUR SMALL I..MATHEMATICAL FRAKTUR SMALL J
-1D55A..1D55B ; Soft_Dotted # L& [2] MATHEMATICAL DOUBLE-STRUCK SMALL I..MATHEMATICAL DOUBLE-STRUCK SMALL J
-1D58E..1D58F ; Soft_Dotted # L& [2] MATHEMATICAL BOLD FRAKTUR SMALL I..MATHEMATICAL BOLD FRAKTUR SMALL J
-1D5C2..1D5C3 ; Soft_Dotted # L& [2] MATHEMATICAL SANS-SERIF SMALL I..MATHEMATICAL SANS-SERIF SMALL J
-1D5F6..1D5F7 ; Soft_Dotted # L& [2] MATHEMATICAL SANS-SERIF BOLD SMALL I..MATHEMATICAL SANS-SERIF BOLD SMALL J
-1D62A..1D62B ; Soft_Dotted # L& [2] MATHEMATICAL SANS-SERIF ITALIC SMALL I..MATHEMATICAL SANS-SERIF ITALIC SMALL J
-1D65E..1D65F ; Soft_Dotted # L& [2] MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL I..MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL J
-1D692..1D693 ; Soft_Dotted # L& [2] MATHEMATICAL MONOSPACE SMALL I..MATHEMATICAL MONOSPACE SMALL J
-1DF1A ; Soft_Dotted # L& LATIN SMALL LETTER I WITH STROKE AND RETROFLEX HOOK
-1E04C..1E04D ; Soft_Dotted # Lm [2] MODIFIER LETTER CYRILLIC SMALL BYELORUSSIAN-UKRAINIAN I..MODIFIER LETTER CYRILLIC SMALL JE
-1E068 ; Soft_Dotted # Lm CYRILLIC SUBSCRIPT SMALL LETTER BYELORUSSIAN-UKRAINIAN I
-
-# Total code points: 50
-
-# ================================================
-
-0E40..0E44 ; Logical_Order_Exception # Lo [5] THAI CHARACTER SARA E..THAI CHARACTER SARA AI MAIMALAI
-0EC0..0EC4 ; Logical_Order_Exception # Lo [5] LAO VOWEL SIGN E..LAO VOWEL SIGN AI
-19B5..19B7 ; Logical_Order_Exception # Lo [3] NEW TAI LUE VOWEL SIGN E..NEW TAI LUE VOWEL SIGN O
-19BA ; Logical_Order_Exception # Lo NEW TAI LUE VOWEL SIGN AY
-AAB5..AAB6 ; Logical_Order_Exception # Lo [2] TAI VIET VOWEL E..TAI VIET VOWEL O
-AAB9 ; Logical_Order_Exception # Lo TAI VIET VOWEL UEA
-AABB..AABC ; Logical_Order_Exception # Lo [2] TAI VIET VOWEL AUE..TAI VIET VOWEL AY
-
-# Total code points: 19
-
-# ================================================
-
-1885..1886 ; Other_ID_Start # Mn [2] MONGOLIAN LETTER ALI GALI BALUDA..MONGOLIAN LETTER ALI GALI THREE BALUDA
-2118 ; Other_ID_Start # Sm SCRIPT CAPITAL P
-212E ; Other_ID_Start # So ESTIMATED SYMBOL
-309B..309C ; Other_ID_Start # Sk [2] KATAKANA-HIRAGANA VOICED SOUND MARK..KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
-
-# Total code points: 6
-
-# ================================================
-
-00B7 ; Other_ID_Continue # Po MIDDLE DOT
-0387 ; Other_ID_Continue # Po GREEK ANO TELEIA
-1369..1371 ; Other_ID_Continue # No [9] ETHIOPIC DIGIT ONE..ETHIOPIC DIGIT NINE
-19DA ; Other_ID_Continue # No NEW TAI LUE THAM DIGIT ONE
-200C..200D ; Other_ID_Continue # Cf [2] ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER
-30FB ; Other_ID_Continue # Po KATAKANA MIDDLE DOT
-FF65 ; Other_ID_Continue # Po HALFWIDTH KATAKANA MIDDLE DOT
-
-# Total code points: 16
-
-# ================================================
-
-00B2..00B3 ; ID_Compat_Math_Continue # No [2] SUPERSCRIPT TWO..SUPERSCRIPT THREE
-00B9 ; ID_Compat_Math_Continue # No SUPERSCRIPT ONE
-2070 ; ID_Compat_Math_Continue # No SUPERSCRIPT ZERO
-2074..2079 ; ID_Compat_Math_Continue # No [6] SUPERSCRIPT FOUR..SUPERSCRIPT NINE
-207A..207C ; ID_Compat_Math_Continue # Sm [3] SUPERSCRIPT PLUS SIGN..SUPERSCRIPT EQUALS SIGN
-207D ; ID_Compat_Math_Continue # Ps SUPERSCRIPT LEFT PARENTHESIS
-207E ; ID_Compat_Math_Continue # Pe SUPERSCRIPT RIGHT PARENTHESIS
-2080..2089 ; ID_Compat_Math_Continue # No [10] SUBSCRIPT ZERO..SUBSCRIPT NINE
-208A..208C ; ID_Compat_Math_Continue # Sm [3] SUBSCRIPT PLUS SIGN..SUBSCRIPT EQUALS SIGN
-208D ; ID_Compat_Math_Continue # Ps SUBSCRIPT LEFT PARENTHESIS
-208E ; ID_Compat_Math_Continue # Pe SUBSCRIPT RIGHT PARENTHESIS
-2202 ; ID_Compat_Math_Continue # Sm PARTIAL DIFFERENTIAL
-2207 ; ID_Compat_Math_Continue # Sm NABLA
-221E ; ID_Compat_Math_Continue # Sm INFINITY
-1D6C1 ; ID_Compat_Math_Continue # Sm MATHEMATICAL BOLD NABLA
-1D6DB ; ID_Compat_Math_Continue # Sm MATHEMATICAL BOLD PARTIAL DIFFERENTIAL
-1D6FB ; ID_Compat_Math_Continue # Sm MATHEMATICAL ITALIC NABLA
-1D715 ; ID_Compat_Math_Continue # Sm MATHEMATICAL ITALIC PARTIAL DIFFERENTIAL
-1D735 ; ID_Compat_Math_Continue # Sm MATHEMATICAL BOLD ITALIC NABLA
-1D74F ; ID_Compat_Math_Continue # Sm MATHEMATICAL BOLD ITALIC PARTIAL DIFFERENTIAL
-1D76F ; ID_Compat_Math_Continue # Sm MATHEMATICAL SANS-SERIF BOLD NABLA
-1D789 ; ID_Compat_Math_Continue # Sm MATHEMATICAL SANS-SERIF BOLD PARTIAL DIFFERENTIAL
-1D7A9 ; ID_Compat_Math_Continue # Sm MATHEMATICAL SANS-SERIF BOLD ITALIC NABLA
-1D7C3 ; ID_Compat_Math_Continue # Sm MATHEMATICAL SANS-SERIF BOLD ITALIC PARTIAL DIFFERENTIAL
-
-# Total code points: 43
-
-# ================================================
-
-2202 ; ID_Compat_Math_Start # Sm PARTIAL DIFFERENTIAL
-2207 ; ID_Compat_Math_Start # Sm NABLA
-221E ; ID_Compat_Math_Start # Sm INFINITY
-1D6C1 ; ID_Compat_Math_Start # Sm MATHEMATICAL BOLD NABLA
-1D6DB ; ID_Compat_Math_Start # Sm MATHEMATICAL BOLD PARTIAL DIFFERENTIAL
-1D6FB ; ID_Compat_Math_Start # Sm MATHEMATICAL ITALIC NABLA
-1D715 ; ID_Compat_Math_Start # Sm MATHEMATICAL ITALIC PARTIAL DIFFERENTIAL
-1D735 ; ID_Compat_Math_Start # Sm MATHEMATICAL BOLD ITALIC NABLA
-1D74F ; ID_Compat_Math_Start # Sm MATHEMATICAL BOLD ITALIC PARTIAL DIFFERENTIAL
-1D76F ; ID_Compat_Math_Start # Sm MATHEMATICAL SANS-SERIF BOLD NABLA
-1D789 ; ID_Compat_Math_Start # Sm MATHEMATICAL SANS-SERIF BOLD PARTIAL DIFFERENTIAL
-1D7A9 ; ID_Compat_Math_Start # Sm MATHEMATICAL SANS-SERIF BOLD ITALIC NABLA
-1D7C3 ; ID_Compat_Math_Start # Sm MATHEMATICAL SANS-SERIF BOLD ITALIC PARTIAL DIFFERENTIAL
-
-# Total code points: 13
-
-# ================================================
-
-0021 ; Sentence_Terminal # Po EXCLAMATION MARK
-002E ; Sentence_Terminal # Po FULL STOP
-003F ; Sentence_Terminal # Po QUESTION MARK
-0589 ; Sentence_Terminal # Po ARMENIAN FULL STOP
-061D..061F ; Sentence_Terminal # Po [3] ARABIC END OF TEXT MARK..ARABIC QUESTION MARK
-06D4 ; Sentence_Terminal # Po ARABIC FULL STOP
-0700..0702 ; Sentence_Terminal # Po [3] SYRIAC END OF PARAGRAPH..SYRIAC SUBLINEAR FULL STOP
-07F9 ; Sentence_Terminal # Po NKO EXCLAMATION MARK
-0837 ; Sentence_Terminal # Po SAMARITAN PUNCTUATION MELODIC QITSA
-0839 ; Sentence_Terminal # Po SAMARITAN PUNCTUATION QITSA
-083D..083E ; Sentence_Terminal # Po [2] SAMARITAN PUNCTUATION SOF MASHFAAT..SAMARITAN PUNCTUATION ANNAAU
-0964..0965 ; Sentence_Terminal # Po [2] DEVANAGARI DANDA..DEVANAGARI DOUBLE DANDA
-104A..104B ; Sentence_Terminal # Po [2] MYANMAR SIGN LITTLE SECTION..MYANMAR SIGN SECTION
-1362 ; Sentence_Terminal # Po ETHIOPIC FULL STOP
-1367..1368 ; Sentence_Terminal # Po [2] ETHIOPIC QUESTION MARK..ETHIOPIC PARAGRAPH SEPARATOR
-166E ; Sentence_Terminal # Po CANADIAN SYLLABICS FULL STOP
-1735..1736 ; Sentence_Terminal # Po [2] PHILIPPINE SINGLE PUNCTUATION..PHILIPPINE DOUBLE PUNCTUATION
-17D4..17D5 ; Sentence_Terminal # Po [2] KHMER SIGN KHAN..KHMER SIGN BARIYOOSAN
-1803 ; Sentence_Terminal # Po MONGOLIAN FULL STOP
-1809 ; Sentence_Terminal # Po MONGOLIAN MANCHU FULL STOP
-1944..1945 ; Sentence_Terminal # Po [2] LIMBU EXCLAMATION MARK..LIMBU QUESTION MARK
-1AA8..1AAB ; Sentence_Terminal # Po [4] TAI THAM SIGN KAAN..TAI THAM SIGN SATKAANKUU
-1B5A..1B5B ; Sentence_Terminal # Po [2] BALINESE PANTI..BALINESE PAMADA
-1B5E..1B5F ; Sentence_Terminal # Po [2] BALINESE CARIK SIKI..BALINESE CARIK PAREREN
-1B7D..1B7E ; Sentence_Terminal # Po [2] BALINESE PANTI LANTANG..BALINESE PAMADA LANTANG
-1C3B..1C3C ; Sentence_Terminal # Po [2] LEPCHA PUNCTUATION TA-ROL..LEPCHA PUNCTUATION NYET THYOOM TA-ROL
-1C7E..1C7F ; Sentence_Terminal # Po [2] OL CHIKI PUNCTUATION MUCAAD..OL CHIKI PUNCTUATION DOUBLE MUCAAD
-203C..203D ; Sentence_Terminal # Po [2] DOUBLE EXCLAMATION MARK..INTERROBANG
-2047..2049 ; Sentence_Terminal # Po [3] DOUBLE QUESTION MARK..EXCLAMATION QUESTION MARK
-2E2E ; Sentence_Terminal # Po REVERSED QUESTION MARK
-2E3C ; Sentence_Terminal # Po STENOGRAPHIC FULL STOP
-2E53..2E54 ; Sentence_Terminal # Po [2] MEDIEVAL EXCLAMATION MARK..MEDIEVAL QUESTION MARK
-3002 ; Sentence_Terminal # Po IDEOGRAPHIC FULL STOP
-A4FF ; Sentence_Terminal # Po LISU PUNCTUATION FULL STOP
-A60E..A60F ; Sentence_Terminal # Po [2] VAI FULL STOP..VAI QUESTION MARK
-A6F3 ; Sentence_Terminal # Po BAMUM FULL STOP
-A6F7 ; Sentence_Terminal # Po BAMUM QUESTION MARK
-A876..A877 ; Sentence_Terminal # Po [2] PHAGS-PA MARK SHAD..PHAGS-PA MARK DOUBLE SHAD
-A8CE..A8CF ; Sentence_Terminal # Po [2] SAURASHTRA DANDA..SAURASHTRA DOUBLE DANDA
-A92F ; Sentence_Terminal # Po KAYAH LI SIGN SHYA
-A9C8..A9C9 ; Sentence_Terminal # Po [2] JAVANESE PADA LINGSA..JAVANESE PADA LUNGSI
-AA5D..AA5F ; Sentence_Terminal # Po [3] CHAM PUNCTUATION DANDA..CHAM PUNCTUATION TRIPLE DANDA
-AAF0..AAF1 ; Sentence_Terminal # Po [2] MEETEI MAYEK CHEIKHAN..MEETEI MAYEK AHANG KHUDAM
-ABEB ; Sentence_Terminal # Po MEETEI MAYEK CHEIKHEI
-FE52 ; Sentence_Terminal # Po SMALL FULL STOP
-FE56..FE57 ; Sentence_Terminal # Po [2] SMALL QUESTION MARK..SMALL EXCLAMATION MARK
-FF01 ; Sentence_Terminal # Po FULLWIDTH EXCLAMATION MARK
-FF0E ; Sentence_Terminal # Po FULLWIDTH FULL STOP
-FF1F ; Sentence_Terminal # Po FULLWIDTH QUESTION MARK
-FF61 ; Sentence_Terminal # Po HALFWIDTH IDEOGRAPHIC FULL STOP
-10A56..10A57 ; Sentence_Terminal # Po [2] KHAROSHTHI PUNCTUATION DANDA..KHAROSHTHI PUNCTUATION DOUBLE DANDA
-10F55..10F59 ; Sentence_Terminal # Po [5] SOGDIAN PUNCTUATION TWO VERTICAL BARS..SOGDIAN PUNCTUATION HALF CIRCLE WITH DOT
-10F86..10F89 ; Sentence_Terminal # Po [4] OLD UYGHUR PUNCTUATION BAR..OLD UYGHUR PUNCTUATION FOUR DOTS
-11047..11048 ; Sentence_Terminal # Po [2] BRAHMI DANDA..BRAHMI DOUBLE DANDA
-110BE..110C1 ; Sentence_Terminal # Po [4] KAITHI SECTION MARK..KAITHI DOUBLE DANDA
-11141..11143 ; Sentence_Terminal # Po [3] CHAKMA DANDA..CHAKMA QUESTION MARK
-111C5..111C6 ; Sentence_Terminal # Po [2] SHARADA DANDA..SHARADA DOUBLE DANDA
-111CD ; Sentence_Terminal # Po SHARADA SUTRA MARK
-111DE..111DF ; Sentence_Terminal # Po [2] SHARADA SECTION MARK-1..SHARADA SECTION MARK-2
-11238..11239 ; Sentence_Terminal # Po [2] KHOJKI DANDA..KHOJKI DOUBLE DANDA
-1123B..1123C ; Sentence_Terminal # Po [2] KHOJKI SECTION MARK..KHOJKI DOUBLE SECTION MARK
-112A9 ; Sentence_Terminal # Po MULTANI SECTION MARK
-1144B..1144C ; Sentence_Terminal # Po [2] NEWA DANDA..NEWA DOUBLE DANDA
-115C2..115C3 ; Sentence_Terminal # Po [2] SIDDHAM DANDA..SIDDHAM DOUBLE DANDA
-115C9..115D7 ; Sentence_Terminal # Po [15] SIDDHAM END OF TEXT MARK..SIDDHAM SECTION MARK WITH CIRCLES AND FOUR ENCLOSURES
-11641..11642 ; Sentence_Terminal # Po [2] MODI DANDA..MODI DOUBLE DANDA
-1173C..1173E ; Sentence_Terminal # Po [3] AHOM SIGN SMALL SECTION..AHOM SIGN RULAI
-11944 ; Sentence_Terminal # Po DIVES AKURU DOUBLE DANDA
-11946 ; Sentence_Terminal # Po DIVES AKURU END OF TEXT MARK
-11A42..11A43 ; Sentence_Terminal # Po [2] ZANABAZAR SQUARE MARK SHAD..ZANABAZAR SQUARE MARK DOUBLE SHAD
-11A9B..11A9C ; Sentence_Terminal # Po [2] SOYOMBO MARK SHAD..SOYOMBO MARK DOUBLE SHAD
-11C41..11C42 ; Sentence_Terminal # Po [2] BHAIKSUKI DANDA..BHAIKSUKI DOUBLE DANDA
-11EF7..11EF8 ; Sentence_Terminal # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
-11F43..11F44 ; Sentence_Terminal # Po [2] KAWI DANDA..KAWI DOUBLE DANDA
-16A6E..16A6F ; Sentence_Terminal # Po [2] MRO DANDA..MRO DOUBLE DANDA
-16AF5 ; Sentence_Terminal # Po BASSA VAH FULL STOP
-16B37..16B38 ; Sentence_Terminal # Po [2] PAHAWH HMONG SIGN VOS THOM..PAHAWH HMONG SIGN VOS TSHAB CEEB
-16B44 ; Sentence_Terminal # Po PAHAWH HMONG SIGN XAUS
-16E98 ; Sentence_Terminal # Po MEDEFAIDRIN FULL STOP
-1BC9F ; Sentence_Terminal # Po DUPLOYAN PUNCTUATION CHINOOK FULL STOP
-1DA88 ; Sentence_Terminal # Po SIGNWRITING FULL STOP
-
-# Total code points: 156
-
-# ================================================
-
-180B..180D ; Variation_Selector # Mn [3] MONGOLIAN FREE VARIATION SELECTOR ONE..MONGOLIAN FREE VARIATION SELECTOR THREE
-180F ; Variation_Selector # Mn MONGOLIAN FREE VARIATION SELECTOR FOUR
-FE00..FE0F ; Variation_Selector # Mn [16] VARIATION SELECTOR-1..VARIATION SELECTOR-16
-E0100..E01EF ; Variation_Selector # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
-
-# Total code points: 260
-
-# ================================================
-
-0009..000D ; Pattern_White_Space # Cc [5] <control-0009>..<control-000D>
-0020 ; Pattern_White_Space # Zs SPACE
-0085 ; Pattern_White_Space # Cc <control-0085>
-200E..200F ; Pattern_White_Space # Cf [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
-2028 ; Pattern_White_Space # Zl LINE SEPARATOR
-2029 ; Pattern_White_Space # Zp PARAGRAPH SEPARATOR
-
-# Total code points: 11
-
-# ================================================
-
-0021..0023 ; Pattern_Syntax # Po [3] EXCLAMATION MARK..NUMBER SIGN
-0024 ; Pattern_Syntax # Sc DOLLAR SIGN
-0025..0027 ; Pattern_Syntax # Po [3] PERCENT SIGN..APOSTROPHE
-0028 ; Pattern_Syntax # Ps LEFT PARENTHESIS
-0029 ; Pattern_Syntax # Pe RIGHT PARENTHESIS
-002A ; Pattern_Syntax # Po ASTERISK
-002B ; Pattern_Syntax # Sm PLUS SIGN
-002C ; Pattern_Syntax # Po COMMA
-002D ; Pattern_Syntax # Pd HYPHEN-MINUS
-002E..002F ; Pattern_Syntax # Po [2] FULL STOP..SOLIDUS
-003A..003B ; Pattern_Syntax # Po [2] COLON..SEMICOLON
-003C..003E ; Pattern_Syntax # Sm [3] LESS-THAN SIGN..GREATER-THAN SIGN
-003F..0040 ; Pattern_Syntax # Po [2] QUESTION MARK..COMMERCIAL AT
-005B ; Pattern_Syntax # Ps LEFT SQUARE BRACKET
-005C ; Pattern_Syntax # Po REVERSE SOLIDUS
-005D ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET
-005E ; Pattern_Syntax # Sk CIRCUMFLEX ACCENT
-0060 ; Pattern_Syntax # Sk GRAVE ACCENT
-007B ; Pattern_Syntax # Ps LEFT CURLY BRACKET
-007C ; Pattern_Syntax # Sm VERTICAL LINE
-007D ; Pattern_Syntax # Pe RIGHT CURLY BRACKET
-007E ; Pattern_Syntax # Sm TILDE
-00A1 ; Pattern_Syntax # Po INVERTED EXCLAMATION MARK
-00A2..00A5 ; Pattern_Syntax # Sc [4] CENT SIGN..YEN SIGN
-00A6 ; Pattern_Syntax # So BROKEN BAR
-00A7 ; Pattern_Syntax # Po SECTION SIGN
-00A9 ; Pattern_Syntax # So COPYRIGHT SIGN
-00AB ; Pattern_Syntax # Pi LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
-00AC ; Pattern_Syntax # Sm NOT SIGN
-00AE ; Pattern_Syntax # So REGISTERED SIGN
-00B0 ; Pattern_Syntax # So DEGREE SIGN
-00B1 ; Pattern_Syntax # Sm PLUS-MINUS SIGN
-00B6 ; Pattern_Syntax # Po PILCROW SIGN
-00BB ; Pattern_Syntax # Pf RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
-00BF ; Pattern_Syntax # Po INVERTED QUESTION MARK
-00D7 ; Pattern_Syntax # Sm MULTIPLICATION SIGN
-00F7 ; Pattern_Syntax # Sm DIVISION SIGN
-2010..2015 ; Pattern_Syntax # Pd [6] HYPHEN..HORIZONTAL BAR
-2016..2017 ; Pattern_Syntax # Po [2] DOUBLE VERTICAL LINE..DOUBLE LOW LINE
-2018 ; Pattern_Syntax # Pi LEFT SINGLE QUOTATION MARK
-2019 ; Pattern_Syntax # Pf RIGHT SINGLE QUOTATION MARK
-201A ; Pattern_Syntax # Ps SINGLE LOW-9 QUOTATION MARK
-201B..201C ; Pattern_Syntax # Pi [2] SINGLE HIGH-REVERSED-9 QUOTATION MARK..LEFT DOUBLE QUOTATION MARK
-201D ; Pattern_Syntax # Pf RIGHT DOUBLE QUOTATION MARK
-201E ; Pattern_Syntax # Ps DOUBLE LOW-9 QUOTATION MARK
-201F ; Pattern_Syntax # Pi DOUBLE HIGH-REVERSED-9 QUOTATION MARK
-2020..2027 ; Pattern_Syntax # Po [8] DAGGER..HYPHENATION POINT
-2030..2038 ; Pattern_Syntax # Po [9] PER MILLE SIGN..CARET
-2039 ; Pattern_Syntax # Pi SINGLE LEFT-POINTING ANGLE QUOTATION MARK
-203A ; Pattern_Syntax # Pf SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
-203B..203E ; Pattern_Syntax # Po [4] REFERENCE MARK..OVERLINE
-2041..2043 ; Pattern_Syntax # Po [3] CARET INSERTION POINT..HYPHEN BULLET
-2044 ; Pattern_Syntax # Sm FRACTION SLASH
-2045 ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH QUILL
-2046 ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH QUILL
-2047..2051 ; Pattern_Syntax # Po [11] DOUBLE QUESTION MARK..TWO ASTERISKS ALIGNED VERTICALLY
-2052 ; Pattern_Syntax # Sm COMMERCIAL MINUS SIGN
-2053 ; Pattern_Syntax # Po SWUNG DASH
-2055..205E ; Pattern_Syntax # Po [10] FLOWER PUNCTUATION MARK..VERTICAL FOUR DOTS
-2190..2194 ; Pattern_Syntax # Sm [5] LEFTWARDS ARROW..LEFT RIGHT ARROW
-2195..2199 ; Pattern_Syntax # So [5] UP DOWN ARROW..SOUTH WEST ARROW
-219A..219B ; Pattern_Syntax # Sm [2] LEFTWARDS ARROW WITH STROKE..RIGHTWARDS ARROW WITH STROKE
-219C..219F ; Pattern_Syntax # So [4] LEFTWARDS WAVE ARROW..UPWARDS TWO HEADED ARROW
-21A0 ; Pattern_Syntax # Sm RIGHTWARDS TWO HEADED ARROW
-21A1..21A2 ; Pattern_Syntax # So [2] DOWNWARDS TWO HEADED ARROW..LEFTWARDS ARROW WITH TAIL
-21A3 ; Pattern_Syntax # Sm RIGHTWARDS ARROW WITH TAIL
-21A4..21A5 ; Pattern_Syntax # So [2] LEFTWARDS ARROW FROM BAR..UPWARDS ARROW FROM BAR
-21A6 ; Pattern_Syntax # Sm RIGHTWARDS ARROW FROM BAR
-21A7..21AD ; Pattern_Syntax # So [7] DOWNWARDS ARROW FROM BAR..LEFT RIGHT WAVE ARROW
-21AE ; Pattern_Syntax # Sm LEFT RIGHT ARROW WITH STROKE
-21AF..21CD ; Pattern_Syntax # So [31] DOWNWARDS ZIGZAG ARROW..LEFTWARDS DOUBLE ARROW WITH STROKE
-21CE..21CF ; Pattern_Syntax # Sm [2] LEFT RIGHT DOUBLE ARROW WITH STROKE..RIGHTWARDS DOUBLE ARROW WITH STROKE
-21D0..21D1 ; Pattern_Syntax # So [2] LEFTWARDS DOUBLE ARROW..UPWARDS DOUBLE ARROW
-21D2 ; Pattern_Syntax # Sm RIGHTWARDS DOUBLE ARROW
-21D3 ; Pattern_Syntax # So DOWNWARDS DOUBLE ARROW
-21D4 ; Pattern_Syntax # Sm LEFT RIGHT DOUBLE ARROW
-21D5..21F3 ; Pattern_Syntax # So [31] UP DOWN DOUBLE ARROW..UP DOWN WHITE ARROW
-21F4..22FF ; Pattern_Syntax # Sm [268] RIGHT ARROW WITH SMALL CIRCLE..Z NOTATION BAG MEMBERSHIP
-2300..2307 ; Pattern_Syntax # So [8] DIAMETER SIGN..WAVY LINE
-2308 ; Pattern_Syntax # Ps LEFT CEILING
-2309 ; Pattern_Syntax # Pe RIGHT CEILING
-230A ; Pattern_Syntax # Ps LEFT FLOOR
-230B ; Pattern_Syntax # Pe RIGHT FLOOR
-230C..231F ; Pattern_Syntax # So [20] BOTTOM RIGHT CROP..BOTTOM RIGHT CORNER
-2320..2321 ; Pattern_Syntax # Sm [2] TOP HALF INTEGRAL..BOTTOM HALF INTEGRAL
-2322..2328 ; Pattern_Syntax # So [7] FROWN..KEYBOARD
-2329 ; Pattern_Syntax # Ps LEFT-POINTING ANGLE BRACKET
-232A ; Pattern_Syntax # Pe RIGHT-POINTING ANGLE BRACKET
-232B..237B ; Pattern_Syntax # So [81] ERASE TO THE LEFT..NOT CHECK MARK
-237C ; Pattern_Syntax # Sm RIGHT ANGLE WITH DOWNWARDS ZIGZAG ARROW
-237D..239A ; Pattern_Syntax # So [30] SHOULDERED OPEN BOX..CLEAR SCREEN SYMBOL
-239B..23B3 ; Pattern_Syntax # Sm [25] LEFT PARENTHESIS UPPER HOOK..SUMMATION BOTTOM
-23B4..23DB ; Pattern_Syntax # So [40] TOP SQUARE BRACKET..FUSE
-23DC..23E1 ; Pattern_Syntax # Sm [6] TOP PARENTHESIS..BOTTOM TORTOISE SHELL BRACKET
-23E2..2426 ; Pattern_Syntax # So [69] WHITE TRAPEZIUM..SYMBOL FOR SUBSTITUTE FORM TWO
-2427..243F ; Pattern_Syntax # Cn [25] <reserved-2427>..<reserved-243F>
-2440..244A ; Pattern_Syntax # So [11] OCR HOOK..OCR DOUBLE BACKSLASH
-244B..245F ; Pattern_Syntax # Cn [21] <reserved-244B>..<reserved-245F>
-2500..25B6 ; Pattern_Syntax # So [183] BOX DRAWINGS LIGHT HORIZONTAL..BLACK RIGHT-POINTING TRIANGLE
-25B7 ; Pattern_Syntax # Sm WHITE RIGHT-POINTING TRIANGLE
-25B8..25C0 ; Pattern_Syntax # So [9] BLACK RIGHT-POINTING SMALL TRIANGLE..BLACK LEFT-POINTING TRIANGLE
-25C1 ; Pattern_Syntax # Sm WHITE LEFT-POINTING TRIANGLE
-25C2..25F7 ; Pattern_Syntax # So [54] BLACK LEFT-POINTING SMALL TRIANGLE..WHITE CIRCLE WITH UPPER RIGHT QUADRANT
-25F8..25FF ; Pattern_Syntax # Sm [8] UPPER LEFT TRIANGLE..LOWER RIGHT TRIANGLE
-2600..266E ; Pattern_Syntax # So [111] BLACK SUN WITH RAYS..MUSIC NATURAL SIGN
-266F ; Pattern_Syntax # Sm MUSIC SHARP SIGN
-2670..2767 ; Pattern_Syntax # So [248] WEST SYRIAC CROSS..ROTATED FLORAL HEART BULLET
-2768 ; Pattern_Syntax # Ps MEDIUM LEFT PARENTHESIS ORNAMENT
-2769 ; Pattern_Syntax # Pe MEDIUM RIGHT PARENTHESIS ORNAMENT
-276A ; Pattern_Syntax # Ps MEDIUM FLATTENED LEFT PARENTHESIS ORNAMENT
-276B ; Pattern_Syntax # Pe MEDIUM FLATTENED RIGHT PARENTHESIS ORNAMENT
-276C ; Pattern_Syntax # Ps MEDIUM LEFT-POINTING ANGLE BRACKET ORNAMENT
-276D ; Pattern_Syntax # Pe MEDIUM RIGHT-POINTING ANGLE BRACKET ORNAMENT
-276E ; Pattern_Syntax # Ps HEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENT
-276F ; Pattern_Syntax # Pe HEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENT
-2770 ; Pattern_Syntax # Ps HEAVY LEFT-POINTING ANGLE BRACKET ORNAMENT
-2771 ; Pattern_Syntax # Pe HEAVY RIGHT-POINTING ANGLE BRACKET ORNAMENT
-2772 ; Pattern_Syntax # Ps LIGHT LEFT TORTOISE SHELL BRACKET ORNAMENT
-2773 ; Pattern_Syntax # Pe LIGHT RIGHT TORTOISE SHELL BRACKET ORNAMENT
-2774 ; Pattern_Syntax # Ps MEDIUM LEFT CURLY BRACKET ORNAMENT
-2775 ; Pattern_Syntax # Pe MEDIUM RIGHT CURLY BRACKET ORNAMENT
-2794..27BF ; Pattern_Syntax # So [44] HEAVY WIDE-HEADED RIGHTWARDS ARROW..DOUBLE CURLY LOOP
-27C0..27C4 ; Pattern_Syntax # Sm [5] THREE DIMENSIONAL ANGLE..OPEN SUPERSET
-27C5 ; Pattern_Syntax # Ps LEFT S-SHAPED BAG DELIMITER
-27C6 ; Pattern_Syntax # Pe RIGHT S-SHAPED BAG DELIMITER
-27C7..27E5 ; Pattern_Syntax # Sm [31] OR WITH DOT INSIDE..WHITE SQUARE WITH RIGHTWARDS TICK
-27E6 ; Pattern_Syntax # Ps MATHEMATICAL LEFT WHITE SQUARE BRACKET
-27E7 ; Pattern_Syntax # Pe MATHEMATICAL RIGHT WHITE SQUARE BRACKET
-27E8 ; Pattern_Syntax # Ps MATHEMATICAL LEFT ANGLE BRACKET
-27E9 ; Pattern_Syntax # Pe MATHEMATICAL RIGHT ANGLE BRACKET
-27EA ; Pattern_Syntax # Ps MATHEMATICAL LEFT DOUBLE ANGLE BRACKET
-27EB ; Pattern_Syntax # Pe MATHEMATICAL RIGHT DOUBLE ANGLE BRACKET
-27EC ; Pattern_Syntax # Ps MATHEMATICAL LEFT WHITE TORTOISE SHELL BRACKET
-27ED ; Pattern_Syntax # Pe MATHEMATICAL RIGHT WHITE TORTOISE SHELL BRACKET
-27EE ; Pattern_Syntax # Ps MATHEMATICAL LEFT FLATTENED PARENTHESIS
-27EF ; Pattern_Syntax # Pe MATHEMATICAL RIGHT FLATTENED PARENTHESIS
-27F0..27FF ; Pattern_Syntax # Sm [16] UPWARDS QUADRUPLE ARROW..LONG RIGHTWARDS SQUIGGLE ARROW
-2800..28FF ; Pattern_Syntax # So [256] BRAILLE PATTERN BLANK..BRAILLE PATTERN DOTS-12345678
-2900..2982 ; Pattern_Syntax # Sm [131] RIGHTWARDS TWO-HEADED ARROW WITH VERTICAL STROKE..Z NOTATION TYPE COLON
-2983 ; Pattern_Syntax # Ps LEFT WHITE CURLY BRACKET
-2984 ; Pattern_Syntax # Pe RIGHT WHITE CURLY BRACKET
-2985 ; Pattern_Syntax # Ps LEFT WHITE PARENTHESIS
-2986 ; Pattern_Syntax # Pe RIGHT WHITE PARENTHESIS
-2987 ; Pattern_Syntax # Ps Z NOTATION LEFT IMAGE BRACKET
-2988 ; Pattern_Syntax # Pe Z NOTATION RIGHT IMAGE BRACKET
-2989 ; Pattern_Syntax # Ps Z NOTATION LEFT BINDING BRACKET
-298A ; Pattern_Syntax # Pe Z NOTATION RIGHT BINDING BRACKET
-298B ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH UNDERBAR
-298C ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH UNDERBAR
-298D ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH TICK IN TOP CORNER
-298E ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
-298F ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
-2990 ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH TICK IN TOP CORNER
-2991 ; Pattern_Syntax # Ps LEFT ANGLE BRACKET WITH DOT
-2992 ; Pattern_Syntax # Pe RIGHT ANGLE BRACKET WITH DOT
-2993 ; Pattern_Syntax # Ps LEFT ARC LESS-THAN BRACKET
-2994 ; Pattern_Syntax # Pe RIGHT ARC GREATER-THAN BRACKET
-2995 ; Pattern_Syntax # Ps DOUBLE LEFT ARC GREATER-THAN BRACKET
-2996 ; Pattern_Syntax # Pe DOUBLE RIGHT ARC LESS-THAN BRACKET
-2997 ; Pattern_Syntax # Ps LEFT BLACK TORTOISE SHELL BRACKET
-2998 ; Pattern_Syntax # Pe RIGHT BLACK TORTOISE SHELL BRACKET
-2999..29D7 ; Pattern_Syntax # Sm [63] DOTTED FENCE..BLACK HOURGLASS
-29D8 ; Pattern_Syntax # Ps LEFT WIGGLY FENCE
-29D9 ; Pattern_Syntax # Pe RIGHT WIGGLY FENCE
-29DA ; Pattern_Syntax # Ps LEFT DOUBLE WIGGLY FENCE
-29DB ; Pattern_Syntax # Pe RIGHT DOUBLE WIGGLY FENCE
-29DC..29FB ; Pattern_Syntax # Sm [32] INCOMPLETE INFINITY..TRIPLE PLUS
-29FC ; Pattern_Syntax # Ps LEFT-POINTING CURVED ANGLE BRACKET
-29FD ; Pattern_Syntax # Pe RIGHT-POINTING CURVED ANGLE BRACKET
-29FE..2AFF ; Pattern_Syntax # Sm [258] TINY..N-ARY WHITE VERTICAL BAR
-2B00..2B2F ; Pattern_Syntax # So [48] NORTH EAST WHITE ARROW..WHITE VERTICAL ELLIPSE
-2B30..2B44 ; Pattern_Syntax # Sm [21] LEFT ARROW WITH SMALL CIRCLE..RIGHTWARDS ARROW THROUGH SUPERSET
-2B45..2B46 ; Pattern_Syntax # So [2] LEFTWARDS QUADRUPLE ARROW..RIGHTWARDS QUADRUPLE ARROW
-2B47..2B4C ; Pattern_Syntax # Sm [6] REVERSE TILDE OPERATOR ABOVE RIGHTWARDS ARROW..RIGHTWARDS ARROW ABOVE REVERSE TILDE OPERATOR
-2B4D..2B73 ; Pattern_Syntax # So [39] DOWNWARDS TRIANGLE-HEADED ZIGZAG ARROW..DOWNWARDS TRIANGLE-HEADED ARROW TO BAR
-2B74..2B75 ; Pattern_Syntax # Cn [2] <reserved-2B74>..<reserved-2B75>
-2B76..2B95 ; Pattern_Syntax # So [32] NORTH WEST TRIANGLE-HEADED ARROW TO BAR..RIGHTWARDS BLACK ARROW
-2B96 ; Pattern_Syntax # Cn <reserved-2B96>
-2B97..2BFF ; Pattern_Syntax # So [105] SYMBOL FOR TYPE A ELECTRONICS..HELLSCHREIBER PAUSE SYMBOL
-2E00..2E01 ; Pattern_Syntax # Po [2] RIGHT ANGLE SUBSTITUTION MARKER..RIGHT ANGLE DOTTED SUBSTITUTION MARKER
-2E02 ; Pattern_Syntax # Pi LEFT SUBSTITUTION BRACKET
-2E03 ; Pattern_Syntax # Pf RIGHT SUBSTITUTION BRACKET
-2E04 ; Pattern_Syntax # Pi LEFT DOTTED SUBSTITUTION BRACKET
-2E05 ; Pattern_Syntax # Pf RIGHT DOTTED SUBSTITUTION BRACKET
-2E06..2E08 ; Pattern_Syntax # Po [3] RAISED INTERPOLATION MARKER..DOTTED TRANSPOSITION MARKER
-2E09 ; Pattern_Syntax # Pi LEFT TRANSPOSITION BRACKET
-2E0A ; Pattern_Syntax # Pf RIGHT TRANSPOSITION BRACKET
-2E0B ; Pattern_Syntax # Po RAISED SQUARE
-2E0C ; Pattern_Syntax # Pi LEFT RAISED OMISSION BRACKET
-2E0D ; Pattern_Syntax # Pf RIGHT RAISED OMISSION BRACKET
-2E0E..2E16 ; Pattern_Syntax # Po [9] EDITORIAL CORONIS..DOTTED RIGHT-POINTING ANGLE
-2E17 ; Pattern_Syntax # Pd DOUBLE OBLIQUE HYPHEN
-2E18..2E19 ; Pattern_Syntax # Po [2] INVERTED INTERROBANG..PALM BRANCH
-2E1A ; Pattern_Syntax # Pd HYPHEN WITH DIAERESIS
-2E1B ; Pattern_Syntax # Po TILDE WITH RING ABOVE
-2E1C ; Pattern_Syntax # Pi LEFT LOW PARAPHRASE BRACKET
-2E1D ; Pattern_Syntax # Pf RIGHT LOW PARAPHRASE BRACKET
-2E1E..2E1F ; Pattern_Syntax # Po [2] TILDE WITH DOT ABOVE..TILDE WITH DOT BELOW
-2E20 ; Pattern_Syntax # Pi LEFT VERTICAL BAR WITH QUILL
-2E21 ; Pattern_Syntax # Pf RIGHT VERTICAL BAR WITH QUILL
-2E22 ; Pattern_Syntax # Ps TOP LEFT HALF BRACKET
-2E23 ; Pattern_Syntax # Pe TOP RIGHT HALF BRACKET
-2E24 ; Pattern_Syntax # Ps BOTTOM LEFT HALF BRACKET
-2E25 ; Pattern_Syntax # Pe BOTTOM RIGHT HALF BRACKET
-2E26 ; Pattern_Syntax # Ps LEFT SIDEWAYS U BRACKET
-2E27 ; Pattern_Syntax # Pe RIGHT SIDEWAYS U BRACKET
-2E28 ; Pattern_Syntax # Ps LEFT DOUBLE PARENTHESIS
-2E29 ; Pattern_Syntax # Pe RIGHT DOUBLE PARENTHESIS
-2E2A..2E2E ; Pattern_Syntax # Po [5] TWO DOTS OVER ONE DOT PUNCTUATION..REVERSED QUESTION MARK
-2E2F ; Pattern_Syntax # Lm VERTICAL TILDE
-2E30..2E39 ; Pattern_Syntax # Po [10] RING POINT..TOP HALF SECTION SIGN
-2E3A..2E3B ; Pattern_Syntax # Pd [2] TWO-EM DASH..THREE-EM DASH
-2E3C..2E3F ; Pattern_Syntax # Po [4] STENOGRAPHIC FULL STOP..CAPITULUM
-2E40 ; Pattern_Syntax # Pd DOUBLE HYPHEN
-2E41 ; Pattern_Syntax # Po REVERSED COMMA
-2E42 ; Pattern_Syntax # Ps DOUBLE LOW-REVERSED-9 QUOTATION MARK
-2E43..2E4F ; Pattern_Syntax # Po [13] DASH WITH LEFT UPTURN..CORNISH VERSE DIVIDER
-2E50..2E51 ; Pattern_Syntax # So [2] CROSS PATTY WITH RIGHT CROSSBAR..CROSS PATTY WITH LEFT CROSSBAR
-2E52..2E54 ; Pattern_Syntax # Po [3] TIRONIAN SIGN CAPITAL ET..MEDIEVAL QUESTION MARK
-2E55 ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH STROKE
-2E56 ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH STROKE
-2E57 ; Pattern_Syntax # Ps LEFT SQUARE BRACKET WITH DOUBLE STROKE
-2E58 ; Pattern_Syntax # Pe RIGHT SQUARE BRACKET WITH DOUBLE STROKE
-2E59 ; Pattern_Syntax # Ps TOP HALF LEFT PARENTHESIS
-2E5A ; Pattern_Syntax # Pe TOP HALF RIGHT PARENTHESIS
-2E5B ; Pattern_Syntax # Ps BOTTOM HALF LEFT PARENTHESIS
-2E5C ; Pattern_Syntax # Pe BOTTOM HALF RIGHT PARENTHESIS
-2E5D ; Pattern_Syntax # Pd OBLIQUE HYPHEN
-2E5E..2E7F ; Pattern_Syntax # Cn [34] <reserved-2E5E>..<reserved-2E7F>
-3001..3003 ; Pattern_Syntax # Po [3] IDEOGRAPHIC COMMA..DITTO MARK
-3008 ; Pattern_Syntax # Ps LEFT ANGLE BRACKET
-3009 ; Pattern_Syntax # Pe RIGHT ANGLE BRACKET
-300A ; Pattern_Syntax # Ps LEFT DOUBLE ANGLE BRACKET
-300B ; Pattern_Syntax # Pe RIGHT DOUBLE ANGLE BRACKET
-300C ; Pattern_Syntax # Ps LEFT CORNER BRACKET
-300D ; Pattern_Syntax # Pe RIGHT CORNER BRACKET
-300E ; Pattern_Syntax # Ps LEFT WHITE CORNER BRACKET
-300F ; Pattern_Syntax # Pe RIGHT WHITE CORNER BRACKET
-3010 ; Pattern_Syntax # Ps LEFT BLACK LENTICULAR BRACKET
-3011 ; Pattern_Syntax # Pe RIGHT BLACK LENTICULAR BRACKET
-3012..3013 ; Pattern_Syntax # So [2] POSTAL MARK..GETA MARK
-3014 ; Pattern_Syntax # Ps LEFT TORTOISE SHELL BRACKET
-3015 ; Pattern_Syntax # Pe RIGHT TORTOISE SHELL BRACKET
-3016 ; Pattern_Syntax # Ps LEFT WHITE LENTICULAR BRACKET
-3017 ; Pattern_Syntax # Pe RIGHT WHITE LENTICULAR BRACKET
-3018 ; Pattern_Syntax # Ps LEFT WHITE TORTOISE SHELL BRACKET
-3019 ; Pattern_Syntax # Pe RIGHT WHITE TORTOISE SHELL BRACKET
-301A ; Pattern_Syntax # Ps LEFT WHITE SQUARE BRACKET
-301B ; Pattern_Syntax # Pe RIGHT WHITE SQUARE BRACKET
-301C ; Pattern_Syntax # Pd WAVE DASH
-301D ; Pattern_Syntax # Ps REVERSED DOUBLE PRIME QUOTATION MARK
-301E..301F ; Pattern_Syntax # Pe [2] DOUBLE PRIME QUOTATION MARK..LOW DOUBLE PRIME QUOTATION MARK
-3020 ; Pattern_Syntax # So POSTAL MARK FACE
-3030 ; Pattern_Syntax # Pd WAVY DASH
-FD3E ; Pattern_Syntax # Pe ORNATE LEFT PARENTHESIS
-FD3F ; Pattern_Syntax # Ps ORNATE RIGHT PARENTHESIS
-FE45..FE46 ; Pattern_Syntax # Po [2] SESAME DOT..WHITE SESAME DOT
-
-# Total code points: 2760
-
-# ================================================
-
-0600..0605 ; Prepended_Concatenation_Mark # Cf [6] ARABIC NUMBER SIGN..ARABIC NUMBER MARK ABOVE
-06DD ; Prepended_Concatenation_Mark # Cf ARABIC END OF AYAH
-070F ; Prepended_Concatenation_Mark # Cf SYRIAC ABBREVIATION MARK
-0890..0891 ; Prepended_Concatenation_Mark # Cf [2] ARABIC POUND MARK ABOVE..ARABIC PIASTRE MARK ABOVE
-08E2 ; Prepended_Concatenation_Mark # Cf ARABIC DISPUTED END OF AYAH
-110BD ; Prepended_Concatenation_Mark # Cf KAITHI NUMBER SIGN
-110CD ; Prepended_Concatenation_Mark # Cf KAITHI NUMBER SIGN ABOVE
-
-# Total code points: 13
-
-# ================================================
-
-1F1E6..1F1FF ; Regional_Indicator # So [26] REGIONAL INDICATOR SYMBOL LETTER A..REGIONAL INDICATOR SYMBOL LETTER Z
-
-# Total code points: 26
-
-# EOF
@@ -1,6 +1,7 @@
#!/usr/bin/python3
# -*- coding: utf-8 -*-
# Copyright (C) 2014-2024 Free Software Foundation, Inc.
+# Copyright (C) 2024 The GNU Toolchain Authors.
# This file is part of the GNU C Library.
#
# The GNU C Library is free software; you can redistribute it and/or
@@ -28,7 +29,6 @@ It will output UTF-8 file
'''
import argparse
-import sys
import re
import unicode_utils
@@ -200,30 +200,40 @@ def write_header_charmap(outfile):
def write_header_width(outfile, unicode_version):
'''Writes the header on top of the WIDTH section to the output file'''
- outfile.write('% Character width according to Unicode '
- + '{:s}.\n'.format(unicode_version))
- outfile.write('% - Default width is 1.\n')
+ outfile.write('% Character width according to Unicode {:s}.\n'.format(unicode_version))
+ outfile.write('% Width is determined by the following rules, in order of decreasing precedence:\n')
+ outfile.write('% - U+00AD SOFT HYPHEN has width 1, as a special case for compatibility (https://archive.is/b5Ck).\n')
+ outfile.write('% - U+115F HANGUL CHOSEONG FILLER has width 2.\n')
+ outfile.write('% This character stands in for an intentionally omitted leading consonant\n')
+ outfile.write('% in a Hangul syllable block; as such it must be assigned width 2 despite its lack\n')
+ outfile.write('% of visible display to ensure that the complete block has the correct width.\n')
+ outfile.write('% (See below for more information on Hangul syllables.)\n')
+ outfile.write('% - Combining jungseong and jongseong Hangul jamo have width 0; generated from\n')
+ outfile.write('% "grep \'^[^;]*;[VT]\' HangulSyllableType.txt".\n')
+ outfile.write('% One composed Hangul "syllable block" like 퓛 is made up of\n')
+ outfile.write('% two to three individual component characters called "jamo".\n')
+ outfile.write('% The complete block must have total width 2;\n')
+ outfile.write('% to achieve this, we assign a width of 2 to leading "choseong" jamo,\n')
+ outfile.write('% and of 0 to medial vowel "jungseong" and trailing "jongseong" jamo.\n')
+ outfile.write('% - Non-spacing and enclosing marks have width 0; generated from\n')
+ outfile.write('% "grep -E \'^[^;]*;[^;]*;(Mn|Me);\' UnicodeData.txt".\n')
+ outfile.write('% - "Default_Ignorable_Code_Point"s have width 0; generated from\n')
+ outfile.write('% "grep \'^[^;]*;\\s*Default_Ignorable_Code_Point\' DerivedCoreProperties.txt".\n')
outfile.write('% - Double-width characters have width 2; generated from\n')
- outfile.write('% "grep \'^[^;]*;\\s*[WF]\' EastAsianWidth.txt"\n')
- outfile.write('% - Non-spacing characters have width 0; '
- + 'generated from PropList.txt or\n')
- outfile.write('% "grep \'^[^;]*;[^;]*;[^;]*;[^;]*;NSM;\' '
- + 'UnicodeData.txt"\n')
- outfile.write('% - Format control characters have width 0; '
- + 'generated from\n')
- outfile.write("% \"grep '^[^;]*;[^;]*;Cf;' UnicodeData.txt\"\n")
-# Not needed covered by Cf
-# outfile.write("% - Zero width characters have width 0; generated from\n")
-# outfile.write("% \"grep '^[^;]*;ZERO WIDTH ' UnicodeData.txt\"\n")
+ outfile.write('% "grep \'^[^;]*;[WF]\' EastAsianWidth.txt".\n')
+ outfile.write('% - Default width for all other characters is 1.\n')
outfile.write("WIDTH\n")
-def process_width(outfile, ulines, elines, plines):
- '''ulines are lines from UnicodeData.txt, elines are lines from
- EastAsianWidth.txt containing characters with width “W” or “F”,
- plines are lines from PropList.txt which contain characters
- with the property “Prepended_Concatenation_Mark”.
-
+def process_width(outfile, ulines, dlines, elines, klines):
+ '''ulines are lines from UnicodeData.txt.
+ elines are lines from EastAsianWidth.txt containing characters with width
+ “W” or “F”.
+ dlines are lines from DerivedCoreProperties.txt which contain
+ characters with the property “Default_Ignorable_Code_Point”.
+ klines are lines from HangulSyllableType.txt which contain characters
+ with syllable type “V” or “T”.
'''
+ # Wide and fullwidth characters have width 1
width_dict = {}
for line in elines:
fields = line.split(";")
@@ -235,14 +245,14 @@ def process_width(outfile, ulines, elines, plines):
int(code_points[1], 16)+1):
width_dict[key] = 2
+ # Nonspacing and enclosing marks have width 0
for line in ulines:
fields = line.split(";")
- if fields[4] == "NSM" or fields[2] in ("Cf", "Me", "Mn"):
+ if fields[4] == "NSM" or fields[2] in ("Me", "Mn"):
width_dict[int(fields[0], 16)] = 0
- for line in plines:
- # Characters with the property “Prepended_Concatenation_Mark”
- # should have the width 1:
+ # Conjoining vowel and trailing jamo have width 0
+ for line in klines:
fields = line.split(";")
if not '..' in fields[0]:
code_points = (fields[0], fields[0])
@@ -250,21 +260,26 @@ def process_width(outfile, ulines, elines, plines):
code_points = fields[0].split("..")
for key in range(int(code_points[0], 16),
int(code_points[1], 16)+1):
- del width_dict[key] # default width is 1
-
- # handle special cases for compatibility
- for key in list((0x00AD,)):
- # https://www.cs.tut.fi/~jkorpela/shy.html
- if key in width_dict:
- del width_dict[key] # default width is 1
- for key in list(range(0x1160, 0x1200)):
- # Hangul jungseong and jongseong:
- if key in unicode_utils.UNICODE_ATTRIBUTES:
- width_dict[key] = 0
- for key in list(range(0xD7B0, 0xD800)):
- # Hangul jungseong and jongseong:
- if key in unicode_utils.UNICODE_ATTRIBUTES:
width_dict[key] = 0
+
+ # “Default_Ignorable_Code_Point”s have width 0
+ for line in dlines:
+ fields = line.split(";")
+ if not '..' in fields[0]:
+ code_points = (fields[0], fields[0])
+ else:
+ code_points = fields[0].split("..")
+ for key in range(int(code_points[0], 16),
+ int(code_points[1], 16)+1):
+ width_dict[key] = 0 # default width is 1
+
+
+ # Special case: U+00AD SOFT HYPHEN
+ del width_dict[0x00AD]
+
+ # Special case: U+115F HANGUL CHOSEONG FILLER
+ width_dict[0x115F] = 2
+
for key in list(range(0x3248, 0x3250)):
# These are “A” which means we can decide whether to treat them
# as “W” or “N” based on context:
@@ -302,7 +317,7 @@ def process_width(outfile, ulines, elines, plines):
if __name__ == "__main__":
PARSER = argparse.ArgumentParser(
description='''
- Generate a UTF-8 file from UnicodeData.txt, EastAsianWidth.txt, and PropList.txt.
+ Generate a UTF-8 file from UnicodeData.txt, DerivedCoreProperties.txt, EastAsianWidth.txt, and HangulSyllableType.txt
''')
PARSER.add_argument(
'-u', '--unicode_data_file',
@@ -311,6 +326,13 @@ if __name__ == "__main__":
default='UnicodeData.txt',
help=('The UnicodeData.txt file to read, '
+ 'default: %(default)s'))
+ PARSER.add_argument(
+ '-d', '--derived_core_properties_file',
+ nargs='?',
+ type=str,
+ default='DerivedCoreProperties.txt',
+ help=('The DerivedCoreProperties.txt file to read, '
+ + 'default: %(default)s'))
PARSER.add_argument(
'-e', '--east_asian_with_file',
nargs='?',
@@ -319,11 +341,11 @@ if __name__ == "__main__":
help=('The EastAsianWidth.txt file to read, '
+ 'default: %(default)s'))
PARSER.add_argument(
- '-p', '--prop_list_file',
+ '-k', '--hangul_syllable_type_file',
nargs='?',
type=str,
- default='PropList.txt',
- help=('The PropList.txt file to read, '
+ default='HangulSyllableType.txt',
+ help=('The HangulSyllableType.txt file to read, '
+ 'default: %(default)s'))
PARSER.add_argument(
'--unicode_version',
@@ -336,27 +358,35 @@ if __name__ == "__main__":
unicode_utils.fill_attributes(ARGS.unicode_data_file)
with open(ARGS.unicode_data_file, mode='r') as UNIDATA_FILE:
UNICODE_DATA_LINES = UNIDATA_FILE.readlines()
- with open(ARGS.east_asian_with_file, mode='r') as EAST_ASIAN_WIDTH_FILE:
- EAST_ASIAN_WIDTH_LINES = []
- for LINE in EAST_ASIAN_WIDTH_FILE:
- # If characters from EastAsianWidth.txt which are from
- # reserved ranges (i.e. not yet assigned code points)
+ with open(ARGS.derived_core_properties_file, mode='r') as DERIVED_CORE_PROPERTIES_FILE:
+ DERIVED_CORE_PROPERTIES_LINES = []
+ for LINE in DERIVED_CORE_PROPERTIES_FILE:
+ # If characters which are from reserved ranges
+ # (i.e. not yet assigned code points)
# are added to the WIDTH section of the UTF-8 file, then
# “make check” produces “Unknown Character” errors for
# these code points because such unassigned code points
# are not in the CHARMAP section of the UTF-8 file.
#
- # Therefore, we skip all reserved code points when reading
- # the EastAsianWidth.txt file.
- if re.match(r'.*<reserved-.+>\.\.<reserved-.+>.*', LINE):
+ # Therefore, we skip all reserved code points.
+ if re.match(r'.*<reserved-.+>', LINE):
+ continue
+ if re.match(r'^[^;]*;\s*Default_Ignorable_Code_Point', LINE):
+ DERIVED_CORE_PROPERTIES_LINES.append(LINE.strip())
+ with open(ARGS.east_asian_with_file, mode='r') as EAST_ASIAN_WIDTH_FILE:
+ EAST_ASIAN_WIDTH_LINES = []
+ for LINE in EAST_ASIAN_WIDTH_FILE:
+ if re.match(r'.*<reserved-.+>', LINE):
continue
if re.match(r'^[^;]*;\s*[WF]', LINE):
EAST_ASIAN_WIDTH_LINES.append(LINE.strip())
- with open(ARGS.prop_list_file, mode='r') as PROP_LIST_FILE:
- PROP_LIST_LINES = []
- for LINE in PROP_LIST_FILE:
- if re.match(r'^[^;]*;[\s]*Prepended_Concatenation_Mark', LINE):
- PROP_LIST_LINES.append(LINE.strip())
+ with open(ARGS.hangul_syllable_type_file, mode='r') as HANGUL_SYLLABLE_TYPE_FILE:
+ HANGUL_SYLLABLE_TYPE_LINES = []
+ for LINE in HANGUL_SYLLABLE_TYPE_FILE:
+ if re.match(r'.*<reserved-.+>', LINE):
+ continue
+ if re.match(r'^[^;]*;\s*[VT]', LINE):
+ HANGUL_SYLLABLE_TYPE_LINES.append(LINE.strip())
with open('UTF-8', mode='w') as OUTFILE:
# Processing UnicodeData.txt and write CHARMAP to UTF-8 file
write_header_charmap(OUTFILE)
@@ -366,6 +396,7 @@ if __name__ == "__main__":
write_header_width(OUTFILE, ARGS.unicode_version)
process_width(OUTFILE,
UNICODE_DATA_LINES,
+ DERIVED_CORE_PROPERTIES_LINES,
EAST_ASIAN_WIDTH_LINES,
- PROP_LIST_LINES)
+ HANGUL_SYLLABLE_TYPE_LINES)
OUTFILE.write("END WIDTH\n")