Skip to content

Commit 40626f0

Browse files
authored
upgrade to Unicode 15 (#200)
Reverting several reserved characters that were removed from the previous release, and that are only referencing as "@missing@ in DerivedBidiClass.txt
1 parent c13d54f commit 40626f0

18 files changed

+4107
-3268
lines changed

maint/Unicode.tables/BidiMirroring.txt

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
1-
# BidiMirroring-14.0.0.txt
2-
# Date: 2021-08-08, 22:55:00 GMT [KW, RP]
3-
# © 2021 Unicode®, Inc.
1+
# BidiMirroring-15.0.0.txt
2+
# Date: 2022-05-03, 18:47:00 GMT [KW, RP]
3+
# © 2022 Unicode®, Inc.
44
# For terms of use, see https://www.unicode.org/terms_of_use.html
55
#
66
# Unicode Character Database
@@ -15,7 +15,7 @@
1515
# value, for which there is another Unicode character that typically has a glyph
1616
# that is the mirror image of the original character's glyph.
1717
#
18-
# The repertoire covered by the file is Unicode 14.0.0.
18+
# The repertoire covered by the file is Unicode 15.0.0.
1919
#
2020
# The file contains a list of lines with mappings from one code point
2121
# to another one for character-based mirroring.

maint/Unicode.tables/CaseFolding.txt

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,11 @@
1-
# CaseFolding-14.0.0.txt
2-
# Date: 2021-03-08, 19:35:41 GMT
3-
# © 2021 Unicode®, Inc.
1+
# CaseFolding-15.0.0.txt
2+
# Date: 2022-02-02, 23:35:35 GMT
3+
# © 2022 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
5-
# For terms of use, see http://www.unicode.org/terms_of_use.html
5+
# For terms of use, see https://www.unicode.org/terms_of_use.html
66
#
77
# Unicode Character Database
8-
# For documentation, see http://www.unicode.org/reports/tr44/
8+
# For documentation, see https://www.unicode.org/reports/tr44/
99
#
1010
# Case Folding Properties
1111
#

maint/Unicode.tables/DerivedBidiClass.txt

Lines changed: 153 additions & 45 deletions
Large diffs are not rendered by default.

maint/Unicode.tables/DerivedCoreProperties.txt

Lines changed: 206 additions & 47 deletions
Large diffs are not rendered by default.

maint/Unicode.tables/DerivedGeneralCategory.txt

Lines changed: 98 additions & 59 deletions
Large diffs are not rendered by default.

maint/Unicode.tables/GraphemeBreakProperty.txt

Lines changed: 27 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,11 @@
1-
# GraphemeBreakProperty-14.0.0.txt
2-
# Date: 2021-08-12, 23:13:02 GMT
3-
# © 2021 Unicode®, Inc.
1+
# GraphemeBreakProperty-15.0.0.txt
2+
# Date: 2022-04-27, 17:07:38 GMT
3+
# © 2022 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
5-
# For terms of use, see http://www.unicode.org/terms_of_use.html
5+
# For terms of use, see https://www.unicode.org/terms_of_use.html
66
#
77
# Unicode Character Database
8-
# For documentation, see http://www.unicode.org/reports/tr44/
8+
# For documentation, see https://www.unicode.org/reports/tr44/
99

1010
# ================================================
1111

@@ -32,8 +32,9 @@
3232
11A3A ; Prepend # Lo ZANABAZAR SQUARE CLUSTER-INITIAL LETTER RA
3333
11A84..11A89 ; Prepend # Lo [6] SOYOMBO SIGN JIHVAMULIYA..SOYOMBO CLUSTER-INITIAL LETTER SA
3434
11D46 ; Prepend # Lo MASARAM GONDI REPHA
35+
11F02 ; Prepend # Lo KAWI SIGN REPHA
3536

36-
# Total code points: 26
37+
# Total code points: 27
3738

3839
# ================================================
3940

@@ -67,7 +68,7 @@
6768
FEFF ; Control # Cf ZERO WIDTH NO-BREAK SPACE
6869
FFF0..FFF8 ; Control # Cn [9] <reserved-FFF0>..<reserved-FFF8>
6970
FFF9..FFFB ; Control # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATION TERMINATOR
70-
13430..13438 ; Control # Cf [9] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END SEGMENT
71+
13430..1343F ; Control # Cf [16] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE
7172
1BCA0..1BCA3 ; Control # Cf [4] SHORTHAND FORMAT LETTER OVERLAP..SHORTHAND FORMAT UP STEP
7273
1D173..1D17A ; Control # Cf [8] MUSICAL SYMBOL BEGIN BEAM..MUSICAL SYMBOL END PHRASE
7374
E0000 ; Control # Cn <reserved-E0000>
@@ -76,7 +77,7 @@ E0002..E001F ; Control # Cn [30] <reserved-E0002>..<reserved-E001F>
7677
E0080..E00FF ; Control # Cn [128] <reserved-E0080>..<reserved-E00FF>
7778
E01F0..E0FFF ; Control # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
7879

79-
# Total code points: 3886
80+
# Total code points: 3893
8081

8182
# ================================================
8283

@@ -185,7 +186,7 @@ E01F0..E0FFF ; Control # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
185186
0E47..0E4E ; Extend # Mn [8] THAI CHARACTER MAITAIKHU..THAI CHARACTER YAMAKKAN
186187
0EB1 ; Extend # Mn LAO VOWEL SIGN MAI KAN
187188
0EB4..0EBC ; Extend # Mn [9] LAO VOWEL SIGN I..LAO SEMIVOWEL SIGN LO
188-
0EC8..0ECD ; Extend # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
189+
0EC8..0ECE ; Extend # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
189190
0F18..0F19 ; Extend # Mn [2] TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN ASTROLOGICAL SIGN SDONG TSHUGS
190191
0F35 ; Extend # Mn TIBETAN MARK NGAS BZUNG NYI ZLA
191192
0F37 ; Extend # Mn TIBETAN MARK NGAS BZUNG SGOR RTAGS
@@ -324,6 +325,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
324325
10AE5..10AE6 ; Extend # Mn [2] MANICHAEAN ABBREVIATION MARK ABOVE..MANICHAEAN ABBREVIATION MARK BELOW
325326
10D24..10D27 ; Extend # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
326327
10EAB..10EAC ; Extend # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
328+
10EFD..10EFF ; Extend # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
327329
10F46..10F50 ; Extend # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
328330
10F82..10F85 ; Extend # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
329331
11001 ; Extend # Mn BRAHMI SIGN ANUSVARA
@@ -346,6 +348,7 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
346348
11234 ; Extend # Mn KHOJKI SIGN ANUSVARA
347349
11236..11237 ; Extend # Mn [2] KHOJKI SIGN NUKTA..KHOJKI SIGN SHADDA
348350
1123E ; Extend # Mn KHOJKI SIGN SUKUN
351+
11241 ; Extend # Mn KHOJKI VOWEL SIGN VOCALIC R
349352
112DF ; Extend # Mn KHUDAWADI SIGN ANUSVARA
350353
112E3..112EA ; Extend # Mn [8] KHUDAWADI VOWEL SIGN U..KHUDAWADI SIGN VIRAMA
351354
11300..11301 ; Extend # Mn [2] GRANTHA SIGN COMBINING ANUSVARA ABOVE..GRANTHA SIGN CANDRABINDU
@@ -413,6 +416,12 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
413416
11D95 ; Extend # Mn GUNJALA GONDI SIGN ANUSVARA
414417
11D97 ; Extend # Mn GUNJALA GONDI VIRAMA
415418
11EF3..11EF4 ; Extend # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
419+
11F00..11F01 ; Extend # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
420+
11F36..11F3A ; Extend # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
421+
11F40 ; Extend # Mn KAWI VOWEL SIGN EU
422+
11F42 ; Extend # Mn KAWI CONJOINER
423+
13440 ; Extend # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
424+
13447..13455 ; Extend # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
416425
16AF0..16AF4 ; Extend # Mn [5] BASSA VAH COMBINING HIGH TONE..BASSA VAH COMBINING HIGH-LOW TONE
417426
16B30..16B36 ; Extend # Mn [7] PAHAWH HMONG MARK CIM TUB..PAHAWH HMONG MARK CIM TAUM
418427
16F4F ; Extend # Mn MIAO SIGN CONSONANT MODIFIER BAR
@@ -439,16 +448,18 @@ FF9E..FF9F ; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDT
439448
1E01B..1E021 ; Extend # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
440449
1E023..1E024 ; Extend # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
441450
1E026..1E02A ; Extend # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
451+
1E08F ; Extend # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
442452
1E130..1E136 ; Extend # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
443453
1E2AE ; Extend # Mn TOTO SIGN RISING TONE
444454
1E2EC..1E2EF ; Extend # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
455+
1E4EC..1E4EF ; Extend # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
445456
1E8D0..1E8D6 ; Extend # Mn [7] MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE KIKAKUI COMBINING NUMBER MILLIONS
446457
1E944..1E94A ; Extend # Mn [7] ADLAM ALIF LENGTHENER..ADLAM NUKTA
447458
1F3FB..1F3FF ; Extend # Sk [5] EMOJI MODIFIER FITZPATRICK TYPE-1-2..EMOJI MODIFIER FITZPATRICK TYPE-6
448459
E0020..E007F ; Extend # Cf [96] TAG SPACE..CANCEL TAG
449460
E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
450461

451-
# Total code points: 2095
462+
# Total code points: 2130
452463

453464
# ================================================
454465

@@ -489,6 +500,7 @@ E0100..E01EF ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
489500
0CC3..0CC4 ; SpacingMark # Mc [2] KANNADA VOWEL SIGN VOCALIC R..KANNADA VOWEL SIGN VOCALIC RR
490501
0CC7..0CC8 ; SpacingMark # Mc [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI
491502
0CCA..0CCB ; SpacingMark # Mc [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO
503+
0CF3 ; SpacingMark # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
492504
0D02..0D03 ; SpacingMark # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
493505
0D3F..0D40 ; SpacingMark # Mc [2] MALAYALAM VOWEL SIGN I..MALAYALAM VOWEL SIGN II
494506
0D46..0D48 ; SpacingMark # Mc [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI
@@ -614,12 +626,16 @@ ABEC ; SpacingMark # Mc MEETEI MAYEK LUM IYEK
614626
11D93..11D94 ; SpacingMark # Mc [2] GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI VOWEL SIGN AU
615627
11D96 ; SpacingMark # Mc GUNJALA GONDI SIGN VISARGA
616628
11EF5..11EF6 ; SpacingMark # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
629+
11F03 ; SpacingMark # Mc KAWI SIGN VISARGA
630+
11F34..11F35 ; SpacingMark # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
631+
11F3E..11F3F ; SpacingMark # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
632+
11F41 ; SpacingMark # Mc KAWI SIGN KILLER
617633
16F51..16F87 ; SpacingMark # Mc [55] MIAO SIGN ASPIRATION..MIAO VOWEL SIGN UI
618634
16FF0..16FF1 ; SpacingMark # Mc [2] VIETNAMESE ALTERNATE READING MARK CA..VIETNAMESE ALTERNATE READING MARK NHAY
619635
1D166 ; SpacingMark # Mc MUSICAL SYMBOL COMBINING SPRECHGESANG STEM
620636
1D16D ; SpacingMark # Mc MUSICAL SYMBOL COMBINING AUGMENTATION DOT
621637

622-
# Total code points: 388
638+
# Total code points: 395
623639

624640
# ================================================
625641

0 commit comments

Comments
 (0)