idnits 2.17.1 draft-ietf-idnabis-tables-09.txt: Checking boilerplate required by RFC 5378 and the IETF Trust (see https://trustee.ietf.org/license-info): ---------------------------------------------------------------------------- ** You're using the IETF Trust Provisions' Section 6.b License Notice from 12 Sep 2009 rather than the newer Notice from 28 Dec 2009. (See https://trustee.ietf.org/license-info/) Checking nits according to https://www.ietf.org/id-info/1id-guidelines.txt: ---------------------------------------------------------------------------- No issues found here. Checking nits according to https://www.ietf.org/id-info/checklist : ---------------------------------------------------------------------------- No issues found here. Miscellaneous warnings: ---------------------------------------------------------------------------- == The copyright year in the IETF Trust and authors Copyright Line does not match the current year -- The document seems to lack a disclaimer for pre-RFC5378 work, but may have content which was first submitted before 10 November 2008. If you have contacted all the original authors and they are all willing to grant the BCP78 rights to the IETF Trust, then this is fine, and you can ignore this comment. If not, you may need to add the pre-RFC5378 disclaimer. (See the Legal Provisions document at https://trustee.ietf.org/license-info for more information.) -- The document date (January 9, 2010) is 5221 days in the past. Is this intentional? -- Found something which looks like a code comment -- if you have code sections in the document, please surround them with '' and '' lines. Checking references for intended status: Proposed Standard ---------------------------------------------------------------------------- (See RFCs 3967 and 4897 for information about using normative references to lower-maturity documents in RFCs) -- Possible downref: Non-RFC (?) normative reference: ref. 'TR15' -- Possible downref: Non-RFC (?) normative reference: ref. 'Unicode5' -- Possible downref: Non-RFC (?) normative reference: ref. 'Unicode52' == Outdated reference: A later version (-07) exists of draft-ietf-idnabis-bidi-06 == Outdated reference: A later version (-13) exists of draft-ietf-idnabis-defs-12 == Outdated reference: A later version (-18) exists of draft-ietf-idnabis-protocol-17 -- Obsolete informational reference (is this intentional?): RFC 3454 (Obsoleted by RFC 7564) -- Obsolete informational reference (is this intentional?): RFC 3491 (Obsoleted by RFC 5891) -- Obsolete informational reference (is this intentional?): RFC 5226 (Obsoleted by RFC 8126) Summary: 1 error (**), 0 flaws (~~), 4 warnings (==), 9 comments (--). Run idnits with the --verbose option for more detailed information about the items above. -------------------------------------------------------------------------------- 2 Network Working Group P. Faltstrom, Ed. 3 Internet-Draft Cisco 4 Intended status: Standards Track January 9, 2010 5 Expires: July 13, 2010 7 The Unicode code points and IDNA 8 draft-ietf-idnabis-tables-09.txt 10 Abstract 12 This document specifies rules for deciding whether a code point, 13 considered in isolation or in context, is a candidate for inclusion 14 in an Internationalized Domain Name (IDN). 16 It is part of the specification of Internationalizing Domain Names in 17 Applications 2008 (IDNA2008). 19 Status of this Memo 21 This Internet-Draft is submitted to IETF in full conformance with the 22 provisions of BCP 78 and BCP 79. 24 Internet-Drafts are working documents of the Internet Engineering 25 Task Force (IETF), its areas, and its working groups. Note that 26 other groups may also distribute working documents as Internet- 27 Drafts. 29 Internet-Drafts are draft documents valid for a maximum of six months 30 and may be updated, replaced, or obsoleted by other documents at any 31 time. It is inappropriate to use Internet-Drafts as reference 32 material or to cite them other than as "work in progress." 34 The list of current Internet-Drafts can be accessed at 35 http://www.ietf.org/ietf/1id-abstracts.txt. 37 The list of Internet-Draft Shadow Directories can be accessed at 38 http://www.ietf.org/shadow.html. 40 This Internet-Draft will expire on July 13, 2010. 42 Copyright Notice 44 Copyright (c) 2010 IETF Trust and the persons identified as the 45 document authors. All rights reserved. 47 This document is subject to BCP 78 and the IETF Trust's Legal 48 Provisions Relating to IETF Documents 49 (http://trustee.ietf.org/license-info) in effect on the date of 50 publication of this document. Please review these documents 51 carefully, as they describe your rights and restrictions with respect 52 to this document. Code Components extracted from this document must 53 include Simplified BSD License text as described in Section 4.e of 54 the Trust Legal Provisions and are provided without warranty as 55 described in the BSD License. 57 Table of Contents 59 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 3 60 2. Category definitions Used to Calculate Derived Property 61 Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 62 2.1. LetterDigits (A) . . . . . . . . . . . . . . . . . . . . . 5 63 2.2. Unstable (B) . . . . . . . . . . . . . . . . . . . . . . . 5 64 2.3. IgnorableProperties (C) . . . . . . . . . . . . . . . . . 6 65 2.4. IgnorableBlocks (D) . . . . . . . . . . . . . . . . . . . 6 66 2.5. LDH (E) . . . . . . . . . . . . . . . . . . . . . . . . . 6 67 2.6. Exceptions (F) . . . . . . . . . . . . . . . . . . . . . . 7 68 2.7. BackwardCompatible (G) . . . . . . . . . . . . . . . . . . 8 69 2.8. JoinControl (H) . . . . . . . . . . . . . . . . . . . . . 8 70 2.9. OldHangulJamo (I) . . . . . . . . . . . . . . . . . . . . 8 71 2.10. Unassigned (J) . . . . . . . . . . . . . . . . . . . . . . 9 72 3. Calculation of the Derived Property . . . . . . . . . . . . . 9 73 4. Codepoints . . . . . . . . . . . . . . . . . . . . . . . . . . 10 74 5. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 10 75 5.1. IDNA derived property value registry . . . . . . . . . . . 10 76 5.2. IDNA Context Registry . . . . . . . . . . . . . . . . . . 10 77 5.2.1. Template for context registry . . . . . . . . . . . . 10 78 6. Security Considerations . . . . . . . . . . . . . . . . . . . 11 79 7. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . 11 80 Appendix A. Contextual Rules Registry . . . . . . . . . . . . . 11 81 Appendix A.1. ZERO WIDTH NON-JOINER . . . . . . . . . . . . . . . 14 82 Appendix A.2. ZERO WIDTH JOINER . . . . . . . . . . . . . . . . . 14 83 Appendix A.3. MIDDLE DOT . . . . . . . . . . . . . . . . . . . . . 14 84 Appendix A.4. GREEK LOWER NUMERAL SIGN (KERAIA) . . . . . . . . . 14 85 Appendix A.5. HEBREW PUNCTUATION GERESH . . . . . . . . . . . . . 15 86 Appendix A.6. HEBREW PUNCTUATION GERSHAYIM . . . . . . . . . . . . 15 87 Appendix A.7. KATAKANA MIDDLE DOT . . . . . . . . . . . . . . . . 15 88 Appendix A.8. ARABIC-INDIC DIGITS . . . . . . . . . . . . . . . . 16 89 Appendix A.9. EXTENDED ARABIC-INDIC DIGITS . . . . . . . . . . . . 16 90 Appendix B. Codepoints 0x0000 - 0x10FFFF . . . . . . . . . . . . 16 91 Appendix B.1. Codepoints in Unicode Character Database (UCD) 92 format . . . . . . . . . . . . . . . . . . . . . . . 16 93 8. References . . . . . . . . . . . . . . . . . . . . . . . . . . 65 94 8.1. Normative References . . . . . . . . . . . . . . . . . . . 65 95 8.2. Informative References . . . . . . . . . . . . . . . . . . 65 97 Author's Address . . . . . . . . . . . . . . . . . . . . . . . . . 66 99 1. Introduction 101 RFC 4690 [RFC4690] suggests an inclusion based approach for selecting 102 the code points from The Unicode Standard [Unicode52] that should be 103 included in the list of code points that may be used in 104 Internationalized Domain Names. 106 Specifically, RFC 4690 [RFC4690] says the following: 108 The IAB has concluded that there is a consensus within the broader 109 community that lists of code points should be specified by the use 110 of an inclusion-based mechanism (i.e., identifying the characters 111 that are permitted), rather than by excluding a small number of 112 characters from the total Unicode set as Stringprep [RFC3454] and 113 Nameprep [RFC3491] do today. That conclusion should be reviewed 114 by the IETF community and action taken as appropriate. 116 This document reviews and classifies the collections of code points 117 in the Unicode character set by examining various properties of the 118 code points. It then defines an algorithm for determining a derived 119 property value. It specifies a procedure, and not a table, of code 120 points so that the algorithm can be used to determine code point sets 121 independent of the version of Unicode that is in use. 123 This document is not intended to specify precisely how these property 124 values are to be applied in IDN labels. That information appears in 125 [IDNA2008-protocol], but it is important to understand that the 126 assignment of a value of this property to a particular character is 127 not sufficient to determine whether it can be used in a given label. 128 In particular, some combinations of allowed code points are not 129 advisable for use in IDNs due to rules specific to a script or class 130 of characters. The requirement for such rules is linked to the 131 operations in [IDNA2008-protocol] and especially to the characters 132 designated as requiring contextual rules. 134 The value of the property is to be interpreted as follows. 136 o PROTOCOL VALID: Those that are allowed to be used in IDNs. Code 137 points with this property value are permitted for general use in 138 IDNs. However, that a label consists only of code points that 139 have this property value does not imply that the label can be used 140 in DNS. See [IDNA2008-protocol] for algorithms to make decisions 141 about labels in domain names. The abbreviated term PVALID is used 142 to refer to this value in the rest of this document. 143 o CONTEXTUAL RULE REQUIRED: Some characteristics of the character, 144 such as it being invisible in certain contexts or problematic in 145 others, requires that it not be used in labels unless specific 146 other characters or properties are present. The abbreviated term 147 CONTEXT is used to refer to this value in the rest of this 148 document. There are two subdivisions of CONTEXTUAL RULE REQUIRED, 149 one for Join_controls (called CONTEXTJ) and for other characters 150 (called CONTEXTO). These are discussed in more detail below and 151 in [IDNA2008-protocol]. 152 o DISALLOWED: Those that should clearly not be included in IDNs. 153 Code points with this property value are not permitted in IDNs. 154 o UNASSIGNED: Those code points that are not designated (i.e. are 155 unassigned) in the Unicode Standard. 157 The mechanisms described here allow determination of the value of the 158 property for future versions of Unicode (including characters added 159 after Unicode 5.2). Changes in Unicode properties that do not affect 160 the outcome of this process do not affect IDN. For example, a 161 character can have its Unicode General_Category value (see 162 [Unicode52]) change from So to Sm, or from Lo to Ll, without 163 affecting the algorithm results. Moreover, even if such changes were 164 to result, the BackwardCompatible list (Section 2.7) can be adjusted 165 to ensure the stability of the results. 167 Some code points need to be allowed in exceptional circumstances, but 168 should be excluded in all other cases; these rules are also described 169 in other documents. The most notable of these are the Join Control 170 characters, U+200D ZERO WIDTH JOINER and U+200C ZERO WIDTH NON- 171 JOINER. Both of them have the derived property value CONTEXTJ. A 172 character with the derived property value CONTEXTJ or CONTEXTO 173 (CONTEXTUAL RULE REQUIRED) is not to be used unless an appropriate 174 rule has been established and the context of the character is 175 consistent with that rule. It is invalid to either register a string 176 containing these characters or even to look one up unless such 177 contextual rule is found and satisfied. Please see Appendix A, The 178 Contextual Rules Registry, for more information. 180 This document is part of a series that, together, constitute a 181 proposal for updating the IDNA standards to resolve issues uncovered 182 in recent years, cover a broader range of scripts, and provide for 183 migration to newer versions of Unicode. See [IDNA2008-rationale] for 184 a broader discussion. 186 The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", 187 "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this 188 document are to be interpreted as described in RFC 2119 [RFC2119]. 190 2. Category definitions Used to Calculate Derived Property Value 192 The derived property obtains its value based on a two-step procedure. 193 First, characters are placed in one or more character categories 194 based on either core properties defined by the Unicode Standard or by 195 treating the codepoint as an exception and addressing the codepoint 196 by its codepoint value. These categories are not mutually exclusive. 198 In the second step, set operations are used with these categories to 199 determine the values for an IDN-specific property. Those operations 200 are specified in Section 3. 202 Unicode property names and property value names may have short 203 abbreviations, such as gc for the General_Category property, and Ll 204 for the Lowercase_Letter property value of the gc property. 206 In the following specification of categories, the operation which 207 returns the value of a particular Unicode character property for a 208 code point is designated by using the formal name of that property 209 (from PropertyAliases.txt) followed by '(cp)'. For example, the 210 value of the General_Category property for a code point is indicated 211 by General_Category(cp). 213 2.1. LetterDigits (A) 215 A: General_Category(cp) is in {Ll, Lu, Lo, Nd, Lm, Mn, Mc} 217 These rules identifies characters commonly used in mnemonics and 218 often informally described as "language characters". In general, 219 only code points assigned to this category are suitable for use in 220 IDN. 222 For more information, see section 4.5 of The Unicode Standard 223 [Unicode5]. 225 The categories used in this rule are: 226 o Ll - Lowercase_Letter 227 o Lu - Uppercase_Letter 228 o Lo - Other_Letter 229 o Nd - Decimal_Number 230 o Lm - Modifier_Letter 231 o Mn - Nonspacing_Mark 232 o Mc - Spacing_Mark 234 2.2. Unstable (B) 236 B: toNFKC(toCaseFold(toNFKC(cp))) != cp 238 This category is used to group the characters that are not stable 239 under NFKC normalization and casefolding. In general, these code 240 points are not suitable for use for IDN. 242 The toCaseFold() operation is defined in Section 3.13 of The Unicode 243 Standard [Unicode5]. 245 The toNFKC() operation returns the code point in normalization form 246 KC. For more information, see Section 5 of Unicode Standard Annex 247 #15 [TR15]. 249 It should be noted that NFKC is used, although NFC is used in the 250 "IDNA Protocol" document [IDNA2008-protocol]. 252 2.3. IgnorableProperties (C) 254 C: Default_Ignorable_Code_Point(cp) = True or 255 White_Space(cp) = True or 256 Noncharacter_Code_Point(cp) = True 258 This category is used to group code points that are not recommended 259 for use in identifiers. In general, these code points are not 260 suitable for use for IDN. 262 The definition for Default_Ignorable_Code_Point can be found in 263 DerivedCoreProperties.txt [1] and is at the time of Unicode 5.2: 265 Other_Default_Ignorable_Code_Point + Cf (Format characters) 266 + Variation_Selector - White_Space - FFF9..FFFB (Annotation 267 Characters) - 0600..0603, 06DD, 070F (exceptional Cf characters 268 that should be visible) 270 2.4. IgnorableBlocks (D) 272 D: Block(cp) is in {Combining Diacritical Marks for Symbols, 273 Musical Symbols, Ancient Greek Musical Notation} 275 This category is used to identifying code points that are not useful 276 in mnemonics or that are otherwise impractical for IDN use. In 277 general, these code points are not suitable for use for IDN. 279 The definition of blocks can be found in Blocks.txt [2] 281 2.5. LDH (E) 283 E: cp is in {002D, 0030..0039, 0061..007A} 285 This category is used in the second step to preserve the traditional 286 "hostname" (LDH) characters ('-', 0-9 and a-z). In general, these 287 code points are suitable for use for IDN. Note that there are other 288 rules regarding the code point U+002D HYPHEN-MINUS that are specified 289 in the IDNA Protocol Specification [IDNA2008-protocol]. 291 2.6. Exceptions (F) 293 F: cp is in {00B7, 00DF, 0375, 03C2, 05F3, 05F4, 0640, 0660, 294 0661, 0662, 0663, 0664, 0665, 0666, 0667, 0668, 295 0669, 06F0, 06F1, 06F2, 06F3, 06F4, 06F5, 06F6, 296 06F7, 06F8, 06F9, 06FD, 06FE, 07FA, 0F0B, 3007, 297 302E, 302F, 3031, 3032, 3033, 3034, 3035, 303B, 298 30FB} 300 This category explicitly lists code points for which the category 301 cannot be assigned using only the core property values that exist in 302 the Unicode standard. The values are according to the table below: 304 PVALID -- Would otherwise have been DISALLOWED 306 00DF; PVALID # LATIN SMALL LETTER SHARP S 307 03C2; PVALID # GREEK SMALL LETTER FINAL SIGMA 308 06FD; PVALID # ARABIC SIGN SINDHI AMPERSAND 309 06FE; PVALID # ARABIC SIGN SINDHI POSTPOSITION MEN 310 0F0B; PVALID # TIBETAN MARK INTERSYLLABIC TSHEG 311 3007; PVALID # IDEOGRAPHIC NUMBER ZERO 313 CONTEXTO -- Would otherwise have been DISALLOWED 315 00B7; CONTEXTO # MIDDLE DOT 316 0375; CONTEXTO # GREEK LOWER NUMERAL SIGN (KERAIA) 317 05F3; CONTEXTO # HEBREW PUNCTUATION GERESH 318 05F4; CONTEXTO # HEBREW PUNCTUATION GERSHAYIM 319 30FB; CONTEXTO # KATAKANA MIDDLE DOT 321 CONTEXTO -- Would otherwise have been PVALID 323 0660; CONTEXTO # ARABIC-INDIC DIGIT ZERO 324 0661; CONTEXTO # ARABIC-INDIC DIGIT ONE 325 0662; CONTEXTO # ARABIC-INDIC DIGIT TWO 326 0663; CONTEXTO # ARABIC-INDIC DIGIT THREE 327 0664; CONTEXTO # ARABIC-INDIC DIGIT FOUR 328 0665; CONTEXTO # ARABIC-INDIC DIGIT FIVE 329 0666; CONTEXTO # ARABIC-INDIC DIGIT SIX 330 0667; CONTEXTO # ARABIC-INDIC DIGIT SEVEN 331 0668; CONTEXTO # ARABIC-INDIC DIGIT EIGHT 332 0669; CONTEXTO # ARABIC-INDIC DIGIT NINE 333 06F0; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT ZERO 334 06F1; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT ONE 335 06F2; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT TWO 336 06F3; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT THREE 337 06F4; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT FOUR 338 06F5; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT FIVE 339 06F6; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT SIX 340 06F7; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT SEVEN 341 06F8; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT EIGHT 342 06F9; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT NINE 344 DISALLOWED -- Would otherwise have been PVALID 346 0640; DISALLOWED # ARABIC TATWEEL 347 07FA; DISALLOWED # NKO LAJANYALAN 348 302E; DISALLOWED # HANGUL SINGLE DOT TONE MARK 349 302F; DISALLOWED # HANGUL DOUBLE DOT TONE MARK 350 3031; DISALLOWED # VERTICAL KANA REPEAT MARK 351 3032; DISALLOWED # VERTICAL KANA REPEAT WITH VOICED SOUND MARK 352 3033; DISALLOWED # VERTICAL KANA REPEAT MARK UPPER HALF 353 3034; DISALLOWED # VERTICAL KANA REPEAT WITH VOICED SOUND MARK UPPER HA 354 3035; DISALLOWED # VERTICAL KANA REPEAT MARK LOWER HALF 355 303B; DISALLOWED # VERTICAL IDEOGRAPHIC ITERATION MARK 357 2.7. BackwardCompatible (G) 359 G: cp is in {} 361 This category includes the code points that property values in 362 versions of Unicode after 5.2 have changed in such a way that the 363 derived property value would no longer be PVALID or DISALLOWED. If 364 changes are made to future versions of Unicode so that code points 365 might change property value from PVALID or DISALLOWED, then this 366 table can be updated and keep special exception values so that the 367 property values for code points stay stable. 369 2.8. JoinControl (H) 371 H: Join_Control(cp) = True 373 This category consists of Join Control characters (i.e., they are not 374 in LetterDigits (Section 2.1)) but are still required in IDN labels 375 under some circumstances. 377 2.9. OldHangulJamo (I) 379 I: Hangul_Syllable_Type(cp) is in {L, V, T} 381 This category consists of all conjoining Hangul Jamo (Leading Jamo, 382 Vowel Jamo, and Trailing Jamo). 384 Elimination of conjoining Hangul Jamos from the set of PVALID 385 characters results in restricting the set of Korean PVALID characters 386 just to preformed, modern Hangul syllable characters. Old Hangul 387 syllables, which must be spelled with sequences of conjoining Hangul 388 Jamos, are not PVALID for IDNs. 390 2.10. Unassigned (J) 392 J: General_Category(cp) is in {Cn} and 393 Noncharacter_Code_Point(cp) = False 395 This category consists of code points in the Unicode character set 396 that are not (yet) assigned. It should be noted that Unicode 397 distinguishes between 'unassigned code points' and 'unassigned 398 characters'. The unassigned code points are all but (Cn - 399 Noncharacters), while the unassigned *characters* are all but (Cn + 400 Cs). 402 3. Calculation of the Derived Property 404 As described above (Section 1) and in more detail in the "IDNA 405 Protocol" document [IDNA2008-protocol], possible values of the IDN 406 property are: 408 o PVALID 409 o CONTEXTJ 410 o CONTEXTO 411 o DISALLOWED 412 o UNASSIGNED 414 The algorithm to calculate the value of the derived property is as 415 follows. If the names of a rule (such as Exception) is used, that 416 implies the set of codepoints that the rule define, while the same 417 name as a function call (such as Exception(cp)) imply the value cp 418 has in the Exceptions table. 420 If .cp. .in. Exceptions Then Exceptions(cp); 421 Else If .cp. .in. BackwardCompatible Then BackwardCompatible(cp); 422 Else If .cp. .in. Unassigned Then UNASSIGNED; 423 Else If .cp. .in. LDH Then PVALID; 424 Else If .cp. .in. JoinControl Then CONTEXTJ; 425 Else If .cp. .in. Unstable Then DISALLOWED; 426 Else If .cp. .in. IgnorableProperties Then DISALLOWED; 427 Else If .cp. .in. IgnorableBlocks Then DISALLOWED; 428 Else If .cp. .in. OldHangulJamo Then DISALLOWED; 429 Else If .cp. .in. LetterDigits Then PVALID; 430 Else DISALLOWED; 432 4. Codepoints 434 The Categories and Rules defined in Section 2 and Section 3 apply to 435 all Unicode code points. The table in Appendix B shows, for 436 illustrative purposes, the consequences of the categories and 437 classification rules, and the resulting property values. 439 The list of code points that can be found in Appendix B is non- 440 normative. Section 2 and Section 3 are normative. 442 5. IANA Considerations 444 5.1. IDNA derived property value registry 446 IANA is to create a registry with the derived properties for the 447 versions of Unicode that is released after (and including) version 448 5.2. The derived property value is to be calculated in cooperation 449 with a designated expert [RFC5226] according to the specifications in 450 Section 2 and Section 3 and not by copying the non-normative table 451 found in Appendix B. 453 If during this process (creation of the table of derived property 454 values) followed by a designated expert review, either non-backward 455 compatible changes to the table of derived properties are discovered, 456 or otherwise problems during the creation of the table arises, that 457 is to be flagged to the IESG. Changes to the rules (as specified in 458 Section 2 and Section 3), including BackwardCompatible (Section 2.7) 459 (a set that is at release of this document is empty), require IETF 460 Review, as described in RFC 5226 [RFC5226]. 462 5.2. IDNA Context Registry 464 For characters that are defined in IDNA derived property value 465 registry (Section 5.1) as CONTEXTO or CONTEXTJ and therefore 466 requiring a contextual rule IANA will create and maintain a list of 467 approved contextual rules. Additions or changes to these rules 468 require IETF Review, as described in [RFC5226]. 470 A table from which that registry can be initialized, and some further 471 discussion appears in Appendix A. 473 5.2.1. Template for context registry 475 The following information is to be given when a new rule is created. 476 Name: Unique name of the rule 477 Code point: Rule should be applied when this codepoint exist in 478 label 479 Overview: Description in plain english on what the rule verifies 480 Lookup: Should rule be applied at time of lookup? 481 Rule Set: The set of rules, as described in 483 6. Security Considerations 485 Security Considerations for this version of IDNA, except for the 486 special issues associated with right to left and characters, are 487 described in [IDNA2008-defs]. Specific issues for labels containing 488 characters associated with scripts written right to left appear in 489 [IDNA2008-bidi]. 491 7. Acknowledgements 493 This document would not have been possible to produce without input 494 from many people. The main contributors are (in alphabetical order) 495 Harald Alvestrand, Vint Cerf, Tina Dam, Mark Davis, Gihan Dias, 496 Mouhammet Diop, Michael Everson, Asmus Freytag, Debbie Garside, Paul 497 Hoffman, Kent Karlsson, Cary Karp, Jaeyoun Kim, John Klensin, Olaf 498 Kolkman, Gervase Markham, Ram Mohan, Lisa Moore, Yngve Pettersen, 499 Erik van der Poel, Hualin Qian, Rick Reed, Pete Resnick, Lakmal 500 Silva, Michel Suignard, Andrew Sullivan, Wil Tan, Kenneth Whistler, 501 Chris Wright and Yoshiro Yoneya. 503 Appendix A. Contextual Rules Registry 505 As discussed in Section 5.2 and in the IANA Considerations section of 506 [IDNA2008-rationale], a registry of rules that define the contexts in 507 which particular PROTOCOL-VALID characters, characters associated 508 with a requirement for Contextual Information, are permitted. These 509 rules are expressed as tests on the label in which the characters 510 appear (all, or any part of, the label may be tested). 512 The grammatical rules are expressed in pseudo code. The conventions 513 used for that pseudo code are explained here. 515 Each rule is constructed as a Boolean expression that evaluates to 516 either True or False. A simple "True;" or "False;" rule sets the 517 default result value for the rule set. Subsequent conditional rules 518 that evaluate to True or False may re-set the result value. 520 A special value "Undefined" is used to deal with any error 521 conditions, such as an attempt to test a character before the start 522 of a label or after the end of a label. If any term of a rule 523 evaluates to Undefined, further evaluation of the rule immediately 524 terminates, as the result value of the rule will itself be Undefined. 526 cp represents the codepoint to be tested. 528 FirstChar is a special term which denotes the first codepoint in a 529 label. 531 LastChar is a special term which denotes the last codepoint in a 532 label. 534 .eq. represents the equality relation. 536 A .eq. B evaluates to True if A equals B. 538 .is. represents checking position in a label. 540 A .is. B evaluates to True if A and B have same position in 541 the same label. 543 .ne. represents the non-equality relation. 545 A .ne. B evaluates to True if A is not equal to B. 547 .in. represents the set inclusion relation. 549 A .in. B evaluates to True if A is a member of the set B. 551 A functional notation, Function_Name(cp), is used to express either 552 string positions within a label, Boolean character property tests of 553 a codepoint, or a regular expression match. When such function names 554 refer to Boolean character property tests, the function names use the 555 exact Unicode character property name for the property in question, 556 and "cp" is evaluated as the Unicode value of the codepoint to be 557 tested, rather than as its position in the label. When such function 558 names refer to string positions within a label, "cp" is evaluated as 559 its position in the label. 561 RegExpMatch(X) takes as its parameter X a schematic regular 562 expression consisting of a mix of Unicode character property values 563 and literal Unicode codepoints. 565 Script(cp) returns the value of the Unicode Script property, as 566 defined in Scripts.txt in the Unicode Character Database. 568 Canonical_Combining_Class(cp) returns the value of the Unicode 569 Canonical_Combining_Class property, as defined in UnicodeData.txt in 570 the Unicode Character Database. 572 Before(cp) returns the codepoint of the character immediately 573 preceding cp in logical order in the string representing the label. 574 Before(FirstChar) evaluates to Undefined. 576 After(cp) returns the codepoint of the character immediately 577 following cp in logical order in the string representing the label. 578 After(LastChar) evaluates to Undefined. 580 Note that "Before" and "After" do not refer to the visual display 581 order of the character in a label, which may be reversed or otherwise 582 modified by the bidirectional algorithm for labels including 583 characters from scripts written right-to-left. Instead, 'Before' and 584 'After' refer to the network order of the character in the label. 586 The clauses "Then True" and "Then False" imply exit from the pseudo- 587 code routine with the corresponding result. 589 Repeated evaluation for all characters in a label makes use of the 590 special construct: 592 For All Characters: 593 Expression; 594 End For; 596 This construct requires repeated evaluation of "Expression" for each 597 codepoint in the label, starting from FirstChar and proceeding to 598 LastChar. 600 The different fields in the rules are to be interpreted as follows: 601 Code point: 602 The codepoint, or codepoints, that this rule is to be applied to. 603 Normally, this implies that if any of the codepoints in a label is 604 as defined, then the rules should be applied. If evaluated to 605 True, the codepoint is ok as used; if evaluated to False, it is 606 not o.k. 607 Overview: 608 A description of the goal with the rule, in plain English. 609 Lookup: 610 True if application of this rule is recommended at lookup time; 611 False otherwise. 612 Rule Set: 613 The rule set itself, as described above. 615 Appendix A.1. ZERO WIDTH NON-JOINER 616 Code point: 617 U+200C 618 Overview: 619 This may occur in a formally cursive script (such as Arabic) in a 620 context where it breaks a cursive connection as required for 621 orthographic rules, as in the Persian language, for example. It 622 also may occur in Indic scripts in a consonant conjunct context 623 (immediately following a virama), to control required display of 624 such conjuncts. 625 Lookup: 626 True 627 Rule Set: 628 False; 629 If Canonical_Combining_Class(Before(cp)) .eq. Virama Then True; 630 If RegExpMatch((Joining_Type:{L,D})(Joining_Type:T)*\u200C 631 (Joining_Type:T)*(Joining_Type:{R,D})) Then True; 633 Appendix A.2. ZERO WIDTH JOINER 634 Code point: 635 U+200D 636 Overview: 637 This may occur in Indic scripts in a consonant conjunct context 638 (immediately following a virama), to control required display of 639 such conjuncts. 640 Lookup: 641 True 642 Rule Set: 643 False; 644 If Canonical_Combining_Class(Before(cp)) .eq. Virama Then True; 646 Appendix A.3. MIDDLE DOT 647 Code point: 648 U+00B7 649 Overview: 650 Between 'l' (U+006C) characters only, used to permit the Catalan 651 character ela geminada to be expressed 652 Lookup: 653 False 654 Rule Set: 655 False; 656 If Before(cp) .eq. U+006C And 657 After(cp) .eq. U+006C Then True; 659 Appendix A.4. GREEK LOWER NUMERAL SIGN (KERAIA) 660 Code point: 661 U+0375 662 Overview: 663 The script of the following character MUST be Greek. 664 Lookup: 665 False 666 Rule Set: 667 False; 668 If Script(After(cp)) .eq. Greek Then True; 670 Appendix A.5. HEBREW PUNCTUATION GERESH 671 Code point: 672 U+05F3 673 Overview: 674 The script of the preceding character MUST be Hebrew. 675 Lookup: 676 False 677 Rule Set: 678 False; 679 If Script(Before(cp)) .eq. Hebrew Then True; 681 Appendix A.6. HEBREW PUNCTUATION GERSHAYIM 682 Code point: 683 U+05F4 684 Overview: 685 The script of the preceding character MUST be Hebrew. 686 Lookup: 687 False 688 Rule Set: 689 False; 690 If Script(Before(cp)) .eq. Hebrew Then True; 692 Appendix A.7. KATAKANA MIDDLE DOT 693 Code point: 694 U+30FB 695 Overview: 696 Note that the Script of Katakana Middle Dot is not any of 697 "Hiragana", "Katakana" or "Han". The effect of this rule is to 698 require at least one character in the label to be in one of those 699 scripts. 700 Lookup: 701 False 702 Rule Set: 703 False; 704 For All Characters: 705 If Script(cp) .in. {Hiragana, Katakana, Han} Then True; 706 End For; 708 Appendix A.8. ARABIC-INDIC DIGITS 709 Code point: 710 0660..0669 711 Overview: 712 Can not be mixed with Extended Arabic-Indic Digits. 713 Lookup: 714 False 715 Rule Set: 716 True; 717 For All Characters: 718 If cp .in. 06F0..06F9 Then False; 719 End For; 721 Appendix A.9. EXTENDED ARABIC-INDIC DIGITS 722 Code point: 723 06F0..06F9 724 Overview: 725 Can not be mixed with Arabic-Indic Digits. 726 Lookup: 727 False 728 Rule Set: 729 True; 730 For All Characters: 731 If cp .in. 0660..0669 Then False; 732 End For; 734 Appendix B. Codepoints 0x0000 - 0x10FFFF 736 If one applies the rules (Section 3) to the code points 0x0000 to 737 0x10FFFF to Unicode 5.2, the result is as follows. 739 This list is non-normative, and only included for illustrative 740 purposes. Specifically, what is displayed in the third column is not 741 the formal name of the codepoint (as defined in section 4.8 of The 742 Unicode Standard [Unicode52]). The differences exists for example 743 for the codepoints that have the codepoint value as part of the name 744 (example: CJK UNIFIED IDEOGRAPH-4E00) and the naming of Hangul 745 syllables. For many codepoints, what you see is the official name. 747 Appendix B.1. Codepoints in Unicode Character Database (UCD) format 749 0000..002C ; DISALLOWED # ..COMMA 750 002D ; PVALID # HYPHEN-MINUS 751 002E..002F ; DISALLOWED # FULL STOP..SOLIDUS 752 0030..0039 ; PVALID # DIGIT ZERO..DIGIT NINE 753 003A..0060 ; DISALLOWED # COLON..GRAVE ACCENT 754 0061..007A ; PVALID # LATIN SMALL LETTER A..LATIN SMALL LETTER Z 755 007B..00B6 ; DISALLOWED # LEFT CURLY BRACKET..PILCROW SIGN 756 00B7 ; CONTEXTO # MIDDLE DOT 757 00B8..00DE ; DISALLOWED # CEDILLA..LATIN CAPITAL LETTER THORN 758 00DF..00F6 ; PVALID # LATIN SMALL LETTER SHARP S..LATIN SMALL LETT 759 00F7 ; DISALLOWED # DIVISION SIGN 760 00F8..00FF ; PVALID # LATIN SMALL LETTER O WITH STROKE..LATIN SMAL 761 0100 ; DISALLOWED # LATIN CAPITAL LETTER A WITH MACRON 762 0101 ; PVALID # LATIN SMALL LETTER A WITH MACRON 763 0102 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE 764 0103 ; PVALID # LATIN SMALL LETTER A WITH BREVE 765 0104 ; DISALLOWED # LATIN CAPITAL LETTER A WITH OGONEK 766 0105 ; PVALID # LATIN SMALL LETTER A WITH OGONEK 767 0106 ; DISALLOWED # LATIN CAPITAL LETTER C WITH ACUTE 768 0107 ; PVALID # LATIN SMALL LETTER C WITH ACUTE 769 0108 ; DISALLOWED # LATIN CAPITAL LETTER C WITH CIRCUMFLEX 770 0109 ; PVALID # LATIN SMALL LETTER C WITH CIRCUMFLEX 771 010A ; DISALLOWED # LATIN CAPITAL LETTER C WITH DOT ABOVE 772 010B ; PVALID # LATIN SMALL LETTER C WITH DOT ABOVE 773 010C ; DISALLOWED # LATIN CAPITAL LETTER C WITH CARON 774 010D ; PVALID # LATIN SMALL LETTER C WITH CARON 775 010E ; DISALLOWED # LATIN CAPITAL LETTER D WITH CARON 776 010F ; PVALID # LATIN SMALL LETTER D WITH CARON 777 0110 ; DISALLOWED # LATIN CAPITAL LETTER D WITH STROKE 778 0111 ; PVALID # LATIN SMALL LETTER D WITH STROKE 779 0112 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON 780 0113 ; PVALID # LATIN SMALL LETTER E WITH MACRON 781 0114 ; DISALLOWED # LATIN CAPITAL LETTER E WITH BREVE 782 0115 ; PVALID # LATIN SMALL LETTER E WITH BREVE 783 0116 ; DISALLOWED # LATIN CAPITAL LETTER E WITH DOT ABOVE 784 0117 ; PVALID # LATIN SMALL LETTER E WITH DOT ABOVE 785 0118 ; DISALLOWED # LATIN CAPITAL LETTER E WITH OGONEK 786 0119 ; PVALID # LATIN SMALL LETTER E WITH OGONEK 787 011A ; DISALLOWED # LATIN CAPITAL LETTER E WITH CARON 788 011B ; PVALID # LATIN SMALL LETTER E WITH CARON 789 011C ; DISALLOWED # LATIN CAPITAL LETTER G WITH CIRCUMFLEX 790 011D ; PVALID # LATIN SMALL LETTER G WITH CIRCUMFLEX 791 011E ; DISALLOWED # LATIN CAPITAL LETTER G WITH BREVE 792 011F ; PVALID # LATIN SMALL LETTER G WITH BREVE 793 0120 ; DISALLOWED # LATIN CAPITAL LETTER G WITH DOT ABOVE 794 0121 ; PVALID # LATIN SMALL LETTER G WITH DOT ABOVE 795 0122 ; DISALLOWED # LATIN CAPITAL LETTER G WITH CEDILLA 796 0123 ; PVALID # LATIN SMALL LETTER G WITH CEDILLA 797 0124 ; DISALLOWED # LATIN CAPITAL LETTER H WITH CIRCUMFLEX 798 0125 ; PVALID # LATIN SMALL LETTER H WITH CIRCUMFLEX 799 0126 ; DISALLOWED # LATIN CAPITAL LETTER H WITH STROKE 800 0127 ; PVALID # LATIN SMALL LETTER H WITH STROKE 801 0128 ; DISALLOWED # LATIN CAPITAL LETTER I WITH TILDE 802 0129 ; PVALID # LATIN SMALL LETTER I WITH TILDE 803 012A ; DISALLOWED # LATIN CAPITAL LETTER I WITH MACRON 804 012B ; PVALID # LATIN SMALL LETTER I WITH MACRON 805 012C ; DISALLOWED # LATIN CAPITAL LETTER I WITH BREVE 806 012D ; PVALID # LATIN SMALL LETTER I WITH BREVE 807 012E ; DISALLOWED # LATIN CAPITAL LETTER I WITH OGONEK 808 012F ; PVALID # LATIN SMALL LETTER I WITH OGONEK 809 0130 ; DISALLOWED # LATIN CAPITAL LETTER I WITH DOT ABOVE 810 0131 ; PVALID # LATIN SMALL LETTER DOTLESS I 811 0132..0134 ; DISALLOWED # LATIN CAPITAL LIGATURE IJ..LATIN CAPITAL LET 812 0135 ; PVALID # LATIN SMALL LETTER J WITH CIRCUMFLEX 813 0136 ; DISALLOWED # LATIN CAPITAL LETTER K WITH CEDILLA 814 0137..0138 ; PVALID # LATIN SMALL LETTER K WITH CEDILLA..LATIN SMA 815 0139 ; DISALLOWED # LATIN CAPITAL LETTER L WITH ACUTE 816 013A ; PVALID # LATIN SMALL LETTER L WITH ACUTE 817 013B ; DISALLOWED # LATIN CAPITAL LETTER L WITH CEDILLA 818 013C ; PVALID # LATIN SMALL LETTER L WITH CEDILLA 819 013D ; DISALLOWED # LATIN CAPITAL LETTER L WITH CARON 820 013E ; PVALID # LATIN SMALL LETTER L WITH CARON 821 013F..0141 ; DISALLOWED # LATIN CAPITAL LETTER L WITH MIDDLE DOT..LATI 822 0142 ; PVALID # LATIN SMALL LETTER L WITH STROKE 823 0143 ; DISALLOWED # LATIN CAPITAL LETTER N WITH ACUTE 824 0144 ; PVALID # LATIN SMALL LETTER N WITH ACUTE 825 0145 ; DISALLOWED # LATIN CAPITAL LETTER N WITH CEDILLA 826 0146 ; PVALID # LATIN SMALL LETTER N WITH CEDILLA 827 0147 ; DISALLOWED # LATIN CAPITAL LETTER N WITH CARON 828 0148 ; PVALID # LATIN SMALL LETTER N WITH CARON 829 0149..014A ; DISALLOWED # LATIN SMALL LETTER N PRECEDED BY APOSTROPHE. 830 014B ; PVALID # LATIN SMALL LETTER ENG 831 014C ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON 832 014D ; PVALID # LATIN SMALL LETTER O WITH MACRON 833 014E ; DISALLOWED # LATIN CAPITAL LETTER O WITH BREVE 834 014F ; PVALID # LATIN SMALL LETTER O WITH BREVE 835 0150 ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOUBLE ACUTE 836 0151 ; PVALID # LATIN SMALL LETTER O WITH DOUBLE ACUTE 837 0152 ; DISALLOWED # LATIN CAPITAL LIGATURE OE 838 0153 ; PVALID # LATIN SMALL LIGATURE OE 839 0154 ; DISALLOWED # LATIN CAPITAL LETTER R WITH ACUTE 840 0155 ; PVALID # LATIN SMALL LETTER R WITH ACUTE 841 0156 ; DISALLOWED # LATIN CAPITAL LETTER R WITH CEDILLA 842 0157 ; PVALID # LATIN SMALL LETTER R WITH CEDILLA 843 0158 ; DISALLOWED # LATIN CAPITAL LETTER R WITH CARON 844 0159 ; PVALID # LATIN SMALL LETTER R WITH CARON 845 015A ; DISALLOWED # LATIN CAPITAL LETTER S WITH ACUTE 846 015B ; PVALID # LATIN SMALL LETTER S WITH ACUTE 847 015C ; DISALLOWED # LATIN CAPITAL LETTER S WITH CIRCUMFLEX 848 015D ; PVALID # LATIN SMALL LETTER S WITH CIRCUMFLEX 849 015E ; DISALLOWED # LATIN CAPITAL LETTER S WITH CEDILLA 850 015F ; PVALID # LATIN SMALL LETTER S WITH CEDILLA 851 0160 ; DISALLOWED # LATIN CAPITAL LETTER S WITH CARON 852 0161 ; PVALID # LATIN SMALL LETTER S WITH CARON 853 0162 ; DISALLOWED # LATIN CAPITAL LETTER T WITH CEDILLA 854 0163 ; PVALID # LATIN SMALL LETTER T WITH CEDILLA 855 0164 ; DISALLOWED # LATIN CAPITAL LETTER T WITH CARON 856 0165 ; PVALID # LATIN SMALL LETTER T WITH CARON 857 0166 ; DISALLOWED # LATIN CAPITAL LETTER T WITH STROKE 858 0167 ; PVALID # LATIN SMALL LETTER T WITH STROKE 859 0168 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE 860 0169 ; PVALID # LATIN SMALL LETTER U WITH TILDE 861 016A ; DISALLOWED # LATIN CAPITAL LETTER U WITH MACRON 862 016B ; PVALID # LATIN SMALL LETTER U WITH MACRON 863 016C ; DISALLOWED # LATIN CAPITAL LETTER U WITH BREVE 864 016D ; PVALID # LATIN SMALL LETTER U WITH BREVE 865 016E ; DISALLOWED # LATIN CAPITAL LETTER U WITH RING ABOVE 866 016F ; PVALID # LATIN SMALL LETTER U WITH RING ABOVE 867 0170 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DOUBLE ACUTE 868 0171 ; PVALID # LATIN SMALL LETTER U WITH DOUBLE ACUTE 869 0172 ; DISALLOWED # LATIN CAPITAL LETTER U WITH OGONEK 870 0173 ; PVALID # LATIN SMALL LETTER U WITH OGONEK 871 0174 ; DISALLOWED # LATIN CAPITAL LETTER W WITH CIRCUMFLEX 872 0175 ; PVALID # LATIN SMALL LETTER W WITH CIRCUMFLEX 873 0176 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH CIRCUMFLEX 874 0177 ; PVALID # LATIN SMALL LETTER Y WITH CIRCUMFLEX 875 0178..0179 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DIAERESIS..LATIN 876 017A ; PVALID # LATIN SMALL LETTER Z WITH ACUTE 877 017B ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DOT ABOVE 878 017C ; PVALID # LATIN SMALL LETTER Z WITH DOT ABOVE 879 017D ; DISALLOWED # LATIN CAPITAL LETTER Z WITH CARON 880 017E ; PVALID # LATIN SMALL LETTER Z WITH CARON 881 017F ; DISALLOWED # LATIN SMALL LETTER LONG S 882 0180 ; PVALID # LATIN SMALL LETTER B WITH STROKE 883 0181..0182 ; DISALLOWED # LATIN CAPITAL LETTER B WITH HOOK..LATIN CAPI 884 0183 ; PVALID # LATIN SMALL LETTER B WITH TOPBAR 885 0184 ; DISALLOWED # LATIN CAPITAL LETTER TONE SIX 886 0185 ; PVALID # LATIN SMALL LETTER TONE SIX 887 0186..0187 ; DISALLOWED # LATIN CAPITAL LETTER OPEN O..LATIN CAPITAL L 888 0188 ; PVALID # LATIN SMALL LETTER C WITH HOOK 889 0189..018B ; DISALLOWED # LATIN CAPITAL LETTER AFRICAN D..LATIN CAPITA 890 018C..018D ; PVALID # LATIN SMALL LETTER D WITH TOPBAR..LATIN SMAL 891 018E..0191 ; DISALLOWED # LATIN CAPITAL LETTER REVERSED E..LATIN CAPIT 892 0192 ; PVALID # LATIN SMALL LETTER F WITH HOOK 893 0193..0194 ; DISALLOWED # LATIN CAPITAL LETTER G WITH HOOK..LATIN CAPI 894 0195 ; PVALID # LATIN SMALL LETTER HV 895 0196..0198 ; DISALLOWED # LATIN CAPITAL LETTER IOTA..LATIN CAPITAL LET 896 0199..019B ; PVALID # LATIN SMALL LETTER K WITH HOOK..LATIN SMALL 897 019C..019D ; DISALLOWED # LATIN CAPITAL LETTER TURNED M..LATIN CAPITAL 898 019E ; PVALID # LATIN SMALL LETTER N WITH LONG RIGHT LEG 899 019F..01A0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MIDDLE TILDE..LA 900 01A1 ; PVALID # LATIN SMALL LETTER O WITH HORN 901 01A2 ; DISALLOWED # LATIN CAPITAL LETTER OI 902 01A3 ; PVALID # LATIN SMALL LETTER OI 903 01A4 ; DISALLOWED # LATIN CAPITAL LETTER P WITH HOOK 904 01A5 ; PVALID # LATIN SMALL LETTER P WITH HOOK 905 01A6..01A7 ; DISALLOWED # LATIN LETTER YR..LATIN CAPITAL LETTER TONE T 906 01A8 ; PVALID # LATIN SMALL LETTER TONE TWO 907 01A9 ; DISALLOWED # LATIN CAPITAL LETTER ESH 908 01AA..01AB ; PVALID # LATIN LETTER REVERSED ESH LOOP..LATIN SMALL 909 01AC ; DISALLOWED # LATIN CAPITAL LETTER T WITH HOOK 910 01AD ; PVALID # LATIN SMALL LETTER T WITH HOOK 911 01AE..01AF ; DISALLOWED # LATIN CAPITAL LETTER T WITH RETROFLEX HOOK.. 912 01B0 ; PVALID # LATIN SMALL LETTER U WITH HORN 913 01B1..01B3 ; DISALLOWED # LATIN CAPITAL LETTER UPSILON..LATIN CAPITAL 914 01B4 ; PVALID # LATIN SMALL LETTER Y WITH HOOK 915 01B5 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH STROKE 916 01B6 ; PVALID # LATIN SMALL LETTER Z WITH STROKE 917 01B7..01B8 ; DISALLOWED # LATIN CAPITAL LETTER EZH..LATIN CAPITAL LETT 918 01B9..01BB ; PVALID # LATIN SMALL LETTER EZH REVERSED..LATIN LETTE 919 01BC ; DISALLOWED # LATIN CAPITAL LETTER TONE FIVE 920 01BD..01C3 ; PVALID # LATIN SMALL LETTER TONE FIVE..LATIN LETTER R 921 01C4..01CD ; DISALLOWED # LATIN CAPITAL LETTER DZ WITH CARON..LATIN CA 922 01CE ; PVALID # LATIN SMALL LETTER A WITH CARON 923 01CF ; DISALLOWED # LATIN CAPITAL LETTER I WITH CARON 924 01D0 ; PVALID # LATIN SMALL LETTER I WITH CARON 925 01D1 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CARON 926 01D2 ; PVALID # LATIN SMALL LETTER O WITH CARON 927 01D3 ; DISALLOWED # LATIN CAPITAL LETTER U WITH CARON 928 01D4 ; PVALID # LATIN SMALL LETTER U WITH CARON 929 01D5 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND MA 930 01D6 ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND MACR 931 01D7 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND AC 932 01D8 ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND ACUT 933 01D9 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND CA 934 01DA ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND CARO 935 01DB ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS AND GR 936 01DC..01DD ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS AND GRAV 937 01DE ; DISALLOWED # LATIN CAPITAL LETTER A WITH DIAERESIS AND MA 938 01DF ; PVALID # LATIN SMALL LETTER A WITH DIAERESIS AND MACR 939 01E0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOT ABOVE AND MA 940 01E1 ; PVALID # LATIN SMALL LETTER A WITH DOT ABOVE AND MACR 941 01E2 ; DISALLOWED # LATIN CAPITAL LETTER AE WITH MACRON 942 01E3 ; PVALID # LATIN SMALL LETTER AE WITH MACRON 943 01E4 ; DISALLOWED # LATIN CAPITAL LETTER G WITH STROKE 944 01E5 ; PVALID # LATIN SMALL LETTER G WITH STROKE 945 01E6 ; DISALLOWED # LATIN CAPITAL LETTER G WITH CARON 946 01E7 ; PVALID # LATIN SMALL LETTER G WITH CARON 947 01E8 ; DISALLOWED # LATIN CAPITAL LETTER K WITH CARON 948 01E9 ; PVALID # LATIN SMALL LETTER K WITH CARON 949 01EA ; DISALLOWED # LATIN CAPITAL LETTER O WITH OGONEK 950 01EB ; PVALID # LATIN SMALL LETTER O WITH OGONEK 951 01EC ; DISALLOWED # LATIN CAPITAL LETTER O WITH OGONEK AND MACRO 952 01ED ; PVALID # LATIN SMALL LETTER O WITH OGONEK AND MACRON 953 01EE ; DISALLOWED # LATIN CAPITAL LETTER EZH WITH CARON 954 01EF..01F0 ; PVALID # LATIN SMALL LETTER EZH WITH CARON..LATIN SMA 955 01F1..01F4 ; DISALLOWED # LATIN CAPITAL LETTER DZ..LATIN CAPITAL LETTE 956 01F5 ; PVALID # LATIN SMALL LETTER G WITH ACUTE 957 01F6..01F8 ; DISALLOWED # LATIN CAPITAL LETTER HWAIR..LATIN CAPITAL LE 958 01F9 ; PVALID # LATIN SMALL LETTER N WITH GRAVE 959 01FA ; DISALLOWED # LATIN CAPITAL LETTER A WITH RING ABOVE AND A 960 01FB ; PVALID # LATIN SMALL LETTER A WITH RING ABOVE AND ACU 961 01FC ; DISALLOWED # LATIN CAPITAL LETTER AE WITH ACUTE 962 01FD ; PVALID # LATIN SMALL LETTER AE WITH ACUTE 963 01FE ; DISALLOWED # LATIN CAPITAL LETTER O WITH STROKE AND ACUTE 964 01FF ; PVALID # LATIN SMALL LETTER O WITH STROKE AND ACUTE 965 0200 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOUBLE GRAVE 966 0201 ; PVALID # LATIN SMALL LETTER A WITH DOUBLE GRAVE 967 0202 ; DISALLOWED # LATIN CAPITAL LETTER A WITH INVERTED BREVE 968 0203 ; PVALID # LATIN SMALL LETTER A WITH INVERTED BREVE 969 0204 ; DISALLOWED # LATIN CAPITAL LETTER E WITH DOUBLE GRAVE 970 0205 ; PVALID # LATIN SMALL LETTER E WITH DOUBLE GRAVE 971 0206 ; DISALLOWED # LATIN CAPITAL LETTER E WITH INVERTED BREVE 972 0207 ; PVALID # LATIN SMALL LETTER E WITH INVERTED BREVE 973 0208 ; DISALLOWED # LATIN CAPITAL LETTER I WITH DOUBLE GRAVE 974 0209 ; PVALID # LATIN SMALL LETTER I WITH DOUBLE GRAVE 975 020A ; DISALLOWED # LATIN CAPITAL LETTER I WITH INVERTED BREVE 976 020B ; PVALID # LATIN SMALL LETTER I WITH INVERTED BREVE 977 020C ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOUBLE GRAVE 978 020D ; PVALID # LATIN SMALL LETTER O WITH DOUBLE GRAVE 979 020E ; DISALLOWED # LATIN CAPITAL LETTER O WITH INVERTED BREVE 980 020F ; PVALID # LATIN SMALL LETTER O WITH INVERTED BREVE 981 0210 ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOUBLE GRAVE 982 0211 ; PVALID # LATIN SMALL LETTER R WITH DOUBLE GRAVE 983 0212 ; DISALLOWED # LATIN CAPITAL LETTER R WITH INVERTED BREVE 984 0213 ; PVALID # LATIN SMALL LETTER R WITH INVERTED BREVE 985 0214 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DOUBLE GRAVE 986 0215 ; PVALID # LATIN SMALL LETTER U WITH DOUBLE GRAVE 987 0216 ; DISALLOWED # LATIN CAPITAL LETTER U WITH INVERTED BREVE 988 0217 ; PVALID # LATIN SMALL LETTER U WITH INVERTED BREVE 989 0218 ; DISALLOWED # LATIN CAPITAL LETTER S WITH COMMA BELOW 990 0219 ; PVALID # LATIN SMALL LETTER S WITH COMMA BELOW 991 021A ; DISALLOWED # LATIN CAPITAL LETTER T WITH COMMA BELOW 992 021B ; PVALID # LATIN SMALL LETTER T WITH COMMA BELOW 993 021C ; DISALLOWED # LATIN CAPITAL LETTER YOGH 994 021D ; PVALID # LATIN SMALL LETTER YOGH 995 021E ; DISALLOWED # LATIN CAPITAL LETTER H WITH CARON 996 021F ; PVALID # LATIN SMALL LETTER H WITH CARON 997 0220 ; DISALLOWED # LATIN CAPITAL LETTER N WITH LONG RIGHT LEG 998 0221 ; PVALID # LATIN SMALL LETTER D WITH CURL 999 0222 ; DISALLOWED # LATIN CAPITAL LETTER OU 1000 0223 ; PVALID # LATIN SMALL LETTER OU 1001 0224 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH HOOK 1002 0225 ; PVALID # LATIN SMALL LETTER Z WITH HOOK 1003 0226 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOT ABOVE 1004 0227 ; PVALID # LATIN SMALL LETTER A WITH DOT ABOVE 1005 0228 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CEDILLA 1006 0229 ; PVALID # LATIN SMALL LETTER E WITH CEDILLA 1007 022A ; DISALLOWED # LATIN CAPITAL LETTER O WITH DIAERESIS AND MA 1008 022B ; PVALID # LATIN SMALL LETTER O WITH DIAERESIS AND MACR 1009 022C ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND MACRON 1010 022D ; PVALID # LATIN SMALL LETTER O WITH TILDE AND MACRON 1011 022E ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOT ABOVE 1012 022F ; PVALID # LATIN SMALL LETTER O WITH DOT ABOVE 1013 0230 ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOT ABOVE AND MA 1014 0231 ; PVALID # LATIN SMALL LETTER O WITH DOT ABOVE AND MACR 1015 0232 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH MACRON 1016 0233..0239 ; PVALID # LATIN SMALL LETTER Y WITH MACRON..LATIN SMAL 1017 023A..023B ; DISALLOWED # LATIN CAPITAL LETTER A WITH STROKE..LATIN CA 1018 023C ; PVALID # LATIN SMALL LETTER C WITH STROKE 1019 023D..023E ; DISALLOWED # LATIN CAPITAL LETTER L WITH BAR..LATIN CAPIT 1020 023F..0240 ; PVALID # LATIN SMALL LETTER S WITH SWASH TAIL..LATIN 1021 0241 ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL STOP 1022 0242 ; PVALID # LATIN SMALL LETTER GLOTTAL STOP 1023 0243..0246 ; DISALLOWED # LATIN CAPITAL LETTER B WITH STROKE..LATIN CA 1024 0247 ; PVALID # LATIN SMALL LETTER E WITH STROKE 1025 0248 ; DISALLOWED # LATIN CAPITAL LETTER J WITH STROKE 1026 0249 ; PVALID # LATIN SMALL LETTER J WITH STROKE 1027 024A ; DISALLOWED # LATIN CAPITAL LETTER SMALL Q WITH HOOK TAIL 1028 024B ; PVALID # LATIN SMALL LETTER Q WITH HOOK TAIL 1029 024C ; DISALLOWED # LATIN CAPITAL LETTER R WITH STROKE 1030 024D ; PVALID # LATIN SMALL LETTER R WITH STROKE 1031 024E ; DISALLOWED # LATIN CAPITAL LETTER Y WITH STROKE 1032 024F..02AF ; PVALID # LATIN SMALL LETTER Y WITH STROKE..LATIN SMAL 1033 02B0..02B8 ; DISALLOWED # MODIFIER LETTER SMALL H..MODIFIER LETTER SMA 1034 02B9..02C1 ; PVALID # MODIFIER LETTER PRIME..MODIFIER LETTER REVER 1035 02C2..02C5 ; DISALLOWED # MODIFIER LETTER LEFT ARROWHEAD..MODIFIER LET 1036 02C6..02D1 ; PVALID # MODIFIER LETTER CIRCUMFLEX ACCENT..MODIFIER 1037 02D2..02EB ; DISALLOWED # MODIFIER LETTER CENTRED RIGHT HALF RING..MOD 1038 02EC ; PVALID # MODIFIER LETTER VOICING 1039 02ED ; DISALLOWED # MODIFIER LETTER UNASPIRATED 1040 02EE ; PVALID # MODIFIER LETTER DOUBLE APOSTROPHE 1041 02EF..02FF ; DISALLOWED # MODIFIER LETTER LOW DOWN ARROWHEAD..MODIFIER 1042 0300..033F ; PVALID # COMBINING GRAVE ACCENT..COMBINING DOUBLE OVE 1043 0340..0341 ; DISALLOWED # COMBINING GRAVE TONE MARK..COMBINING ACUTE T 1044 0342 ; PVALID # COMBINING GREEK PERISPOMENI 1045 0343..0345 ; DISALLOWED # COMBINING GREEK KORONIS..COMBINING GREEK YPO 1046 0346..034E ; PVALID # COMBINING BRIDGE ABOVE..COMBINING UPWARDS AR 1047 034F ; DISALLOWED # COMBINING GRAPHEME JOINER 1048 0350..036F ; PVALID # COMBINING RIGHT ARROWHEAD ABOVE..COMBINING L 1049 0370 ; DISALLOWED # GREEK CAPITAL LETTER HETA 1050 0371 ; PVALID # GREEK SMALL LETTER HETA 1051 0372 ; DISALLOWED # GREEK CAPITAL LETTER ARCHAIC SAMPI 1052 0373 ; PVALID # GREEK SMALL LETTER ARCHAIC SAMPI 1053 0374 ; DISALLOWED # GREEK NUMERAL SIGN 1054 0375 ; CONTEXTO # GREEK LOWER NUMERAL SIGN 1055 0376 ; DISALLOWED # GREEK CAPITAL LETTER PAMPHYLIAN DIGAMMA 1056 0377 ; PVALID # GREEK SMALL LETTER PAMPHYLIAN DIGAMMA 1057 0378..0379 ; UNASSIGNED # .. 1058 037A ; DISALLOWED # GREEK YPOGEGRAMMENI 1059 037B..037D ; PVALID # GREEK SMALL REVERSED LUNATE SIGMA SYMBOL..GR 1060 037E ; DISALLOWED # GREEK QUESTION MARK 1061 037F..0383 ; UNASSIGNED # .. 1062 0384..038A ; DISALLOWED # GREEK TONOS..GREEK CAPITAL LETTER IOTA WITH 1063 038B ; UNASSIGNED # 1064 038C ; DISALLOWED # GREEK CAPITAL LETTER OMICRON WITH TONOS 1065 038D ; UNASSIGNED # 1066 038E..038F ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH TONOS..GRE 1067 0390 ; PVALID # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND T 1068 0391..03A1 ; DISALLOWED # GREEK CAPITAL LETTER ALPHA..GREEK CAPITAL LE 1069 03A2 ; UNASSIGNED # 1070 03A3..03AB ; DISALLOWED # GREEK CAPITAL LETTER SIGMA..GREEK CAPITAL LE 1071 03AC..03CE ; PVALID # GREEK SMALL LETTER ALPHA WITH TONOS..GREEK S 1072 03CF..03D6 ; DISALLOWED # GREEK CAPITAL KAI SYMBOL..GREEK PI SYMBOL 1073 03D7 ; PVALID # GREEK KAI SYMBOL 1074 03D8 ; DISALLOWED # GREEK LETTER ARCHAIC KOPPA 1075 03D9 ; PVALID # GREEK SMALL LETTER ARCHAIC KOPPA 1076 03DA ; DISALLOWED # GREEK LETTER STIGMA 1077 03DB ; PVALID # GREEK SMALL LETTER STIGMA 1078 03DC ; DISALLOWED # GREEK LETTER DIGAMMA 1079 03DD ; PVALID # GREEK SMALL LETTER DIGAMMA 1080 03DE ; DISALLOWED # GREEK LETTER KOPPA 1081 03DF ; PVALID # GREEK SMALL LETTER KOPPA 1082 03E0 ; DISALLOWED # GREEK LETTER SAMPI 1083 03E1 ; PVALID # GREEK SMALL LETTER SAMPI 1084 03E2 ; DISALLOWED # COPTIC CAPITAL LETTER SHEI 1085 03E3 ; PVALID # COPTIC SMALL LETTER SHEI 1086 03E4 ; DISALLOWED # COPTIC CAPITAL LETTER FEI 1087 03E5 ; PVALID # COPTIC SMALL LETTER FEI 1088 03E6 ; DISALLOWED # COPTIC CAPITAL LETTER KHEI 1089 03E7 ; PVALID # COPTIC SMALL LETTER KHEI 1090 03E8 ; DISALLOWED # COPTIC CAPITAL LETTER HORI 1091 03E9 ; PVALID # COPTIC SMALL LETTER HORI 1092 03EA ; DISALLOWED # COPTIC CAPITAL LETTER GANGIA 1093 03EB ; PVALID # COPTIC SMALL LETTER GANGIA 1094 03EC ; DISALLOWED # COPTIC CAPITAL LETTER SHIMA 1095 03ED ; PVALID # COPTIC SMALL LETTER SHIMA 1096 03EE ; DISALLOWED # COPTIC CAPITAL LETTER DEI 1097 03EF ; PVALID # COPTIC SMALL LETTER DEI 1098 03F0..03F2 ; DISALLOWED # GREEK KAPPA SYMBOL..GREEK LUNATE SIGMA SYMBO 1099 03F3 ; PVALID # GREEK LETTER YOT 1100 03F4..03F7 ; DISALLOWED # GREEK CAPITAL THETA SYMBOL..GREEK CAPITAL LE 1101 03F8 ; PVALID # GREEK SMALL LETTER SHO 1102 03F9..03FA ; DISALLOWED # GREEK CAPITAL LUNATE SIGMA SYMBOL..GREEK CAP 1103 03FB..03FC ; PVALID # GREEK SMALL LETTER SAN..GREEK RHO WITH STROK 1104 03FD..042F ; DISALLOWED # GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOL.. 1105 0430..045F ; PVALID # CYRILLIC SMALL LETTER A..CYRILLIC SMALL LETT 1106 0460 ; DISALLOWED # CYRILLIC CAPITAL LETTER OMEGA 1107 0461 ; PVALID # CYRILLIC SMALL LETTER OMEGA 1108 0462 ; DISALLOWED # CYRILLIC CAPITAL LETTER YAT 1109 0463 ; PVALID # CYRILLIC SMALL LETTER YAT 1110 0464 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED E 1111 0465 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED E 1112 0466 ; DISALLOWED # CYRILLIC CAPITAL LETTER LITTLE YUS 1113 0467 ; PVALID # CYRILLIC SMALL LETTER LITTLE YUS 1114 0468 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED LITTLE YUS 1115 0469 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED LITTLE YUS 1116 046A ; DISALLOWED # CYRILLIC CAPITAL LETTER BIG YUS 1117 046B ; PVALID # CYRILLIC SMALL LETTER BIG YUS 1118 046C ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED BIG YUS 1119 046D ; PVALID # CYRILLIC SMALL LETTER IOTIFIED BIG YUS 1120 046E ; DISALLOWED # CYRILLIC CAPITAL LETTER KSI 1121 046F ; PVALID # CYRILLIC SMALL LETTER KSI 1122 0470 ; DISALLOWED # CYRILLIC CAPITAL LETTER PSI 1123 0471 ; PVALID # CYRILLIC SMALL LETTER PSI 1124 0472 ; DISALLOWED # CYRILLIC CAPITAL LETTER FITA 1125 0473 ; PVALID # CYRILLIC SMALL LETTER FITA 1126 0474 ; DISALLOWED # CYRILLIC CAPITAL LETTER IZHITSA 1127 0475 ; PVALID # CYRILLIC SMALL LETTER IZHITSA 1128 0476 ; DISALLOWED # CYRILLIC CAPITAL LETTER IZHITSA WITH DOUBLE 1129 0477 ; PVALID # CYRILLIC SMALL LETTER IZHITSA WITH DOUBLE GR 1130 0478 ; DISALLOWED # CYRILLIC CAPITAL LETTER UK 1131 0479 ; PVALID # CYRILLIC SMALL LETTER UK 1132 047A ; DISALLOWED # CYRILLIC CAPITAL LETTER ROUND OMEGA 1133 047B ; PVALID # CYRILLIC SMALL LETTER ROUND OMEGA 1134 047C ; DISALLOWED # CYRILLIC CAPITAL LETTER OMEGA WITH TITLO 1135 047D ; PVALID # CYRILLIC SMALL LETTER OMEGA WITH TITLO 1136 047E ; DISALLOWED # CYRILLIC CAPITAL LETTER OT 1137 047F ; PVALID # CYRILLIC SMALL LETTER OT 1138 0480 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOPPA 1139 0481 ; PVALID # CYRILLIC SMALL LETTER KOPPA 1140 0482 ; DISALLOWED # CYRILLIC THOUSANDS SIGN 1141 0483..0487 ; PVALID # COMBINING CYRILLIC TITLO..COMBINING CYRILLIC 1142 0488..048A ; DISALLOWED # COMBINING CYRILLIC HUNDRED THOUSANDS SIGN..C 1143 048B ; PVALID # CYRILLIC SMALL LETTER SHORT I WITH TAIL 1144 048C ; DISALLOWED # CYRILLIC CAPITAL LETTER SEMISOFT SIGN 1145 048D ; PVALID # CYRILLIC SMALL LETTER SEMISOFT SIGN 1146 048E ; DISALLOWED # CYRILLIC CAPITAL LETTER ER WITH TICK 1147 048F ; PVALID # CYRILLIC SMALL LETTER ER WITH TICK 1148 0490 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH UPTURN 1149 0491 ; PVALID # CYRILLIC SMALL LETTER GHE WITH UPTURN 1150 0492 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH STROKE 1151 0493 ; PVALID # CYRILLIC SMALL LETTER GHE WITH STROKE 1152 0494 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK 1153 0495 ; PVALID # CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK 1154 0496 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER 1155 0497 ; PVALID # CYRILLIC SMALL LETTER ZHE WITH DESCENDER 1156 0498 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZE WITH DESCENDER 1157 0499 ; PVALID # CYRILLIC SMALL LETTER ZE WITH DESCENDER 1158 049A ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH DESCENDER 1159 049B ; PVALID # CYRILLIC SMALL LETTER KA WITH DESCENDER 1160 049C ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH VERTICAL STR 1161 049D ; PVALID # CYRILLIC SMALL LETTER KA WITH VERTICAL STROK 1162 049E ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH STROKE 1163 049F ; PVALID # CYRILLIC SMALL LETTER KA WITH STROKE 1164 04A0 ; DISALLOWED # CYRILLIC CAPITAL LETTER BASHKIR KA 1165 04A1 ; PVALID # CYRILLIC SMALL LETTER BASHKIR KA 1166 04A2 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH DESCENDER 1167 04A3 ; PVALID # CYRILLIC SMALL LETTER EN WITH DESCENDER 1168 04A4 ; DISALLOWED # CYRILLIC CAPITAL LIGATURE EN GHE 1169 04A5 ; PVALID # CYRILLIC SMALL LIGATURE EN GHE 1170 04A6 ; DISALLOWED # CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK 1171 04A7 ; PVALID # CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK 1172 04A8 ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN HA 1173 04A9 ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN HA 1174 04AA ; DISALLOWED # CYRILLIC CAPITAL LETTER ES WITH DESCENDER 1175 04AB ; PVALID # CYRILLIC SMALL LETTER ES WITH DESCENDER 1176 04AC ; DISALLOWED # CYRILLIC CAPITAL LETTER TE WITH DESCENDER 1177 04AD ; PVALID # CYRILLIC SMALL LETTER TE WITH DESCENDER 1178 04AE ; DISALLOWED # CYRILLIC CAPITAL LETTER STRAIGHT U 1179 04AF ; PVALID # CYRILLIC SMALL LETTER STRAIGHT U 1180 04B0 ; DISALLOWED # CYRILLIC CAPITAL LETTER STRAIGHT U WITH STRO 1181 04B1 ; PVALID # CYRILLIC SMALL LETTER STRAIGHT U WITH STROKE 1182 04B2 ; DISALLOWED # CYRILLIC CAPITAL LETTER HA WITH DESCENDER 1183 04B3 ; PVALID # CYRILLIC SMALL LETTER HA WITH DESCENDER 1184 04B4 ; DISALLOWED # CYRILLIC CAPITAL LIGATURE TE TSE 1185 04B5 ; PVALID # CYRILLIC SMALL LIGATURE TE TSE 1186 04B6 ; DISALLOWED # CYRILLIC CAPITAL LETTER CHE WITH DESCENDER 1187 04B7 ; PVALID # CYRILLIC SMALL LETTER CHE WITH DESCENDER 1188 04B8 ; DISALLOWED # CYRILLIC CAPITAL LETTER CHE WITH VERTICAL ST 1189 04B9 ; PVALID # CYRILLIC SMALL LETTER CHE WITH VERTICAL STRO 1190 04BA ; DISALLOWED # CYRILLIC CAPITAL LETTER SHHA 1191 04BB ; PVALID # CYRILLIC SMALL LETTER SHHA 1192 04BC ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN CHE 1193 04BD ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN CHE 1194 04BE ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN CHE WITH D 1195 04BF ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN CHE WITH DES 1196 04C0..04C1 ; DISALLOWED # CYRILLIC LETTER PALOCHKA..CYRILLIC CAPITAL L 1197 04C2 ; PVALID # CYRILLIC SMALL LETTER ZHE WITH BREVE 1198 04C3 ; DISALLOWED # CYRILLIC CAPITAL LETTER KA WITH HOOK 1199 04C4 ; PVALID # CYRILLIC SMALL LETTER KA WITH HOOK 1200 04C5 ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH TAIL 1201 04C6 ; PVALID # CYRILLIC SMALL LETTER EL WITH TAIL 1202 04C7 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH HOOK 1203 04C8 ; PVALID # CYRILLIC SMALL LETTER EN WITH HOOK 1204 04C9 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH TAIL 1205 04CA ; PVALID # CYRILLIC SMALL LETTER EN WITH TAIL 1206 04CB ; DISALLOWED # CYRILLIC CAPITAL LETTER KHAKASSIAN CHE 1207 04CC ; PVALID # CYRILLIC SMALL LETTER KHAKASSIAN CHE 1208 04CD ; DISALLOWED # CYRILLIC CAPITAL LETTER EM WITH TAIL 1209 04CE..04CF ; PVALID # CYRILLIC SMALL LETTER EM WITH TAIL..CYRILLIC 1210 04D0 ; DISALLOWED # CYRILLIC CAPITAL LETTER A WITH BREVE 1211 04D1 ; PVALID # CYRILLIC SMALL LETTER A WITH BREVE 1212 04D2 ; DISALLOWED # CYRILLIC CAPITAL LETTER A WITH DIAERESIS 1213 04D3 ; PVALID # CYRILLIC SMALL LETTER A WITH DIAERESIS 1214 04D4 ; DISALLOWED # CYRILLIC CAPITAL LIGATURE A IE 1215 04D5 ; PVALID # CYRILLIC SMALL LIGATURE A IE 1216 04D6 ; DISALLOWED # CYRILLIC CAPITAL LETTER IE WITH BREVE 1217 04D7 ; PVALID # CYRILLIC SMALL LETTER IE WITH BREVE 1218 04D8 ; DISALLOWED # CYRILLIC CAPITAL LETTER SCHWA 1219 04D9 ; PVALID # CYRILLIC SMALL LETTER SCHWA 1220 04DA ; DISALLOWED # CYRILLIC CAPITAL LETTER SCHWA WITH DIAERESIS 1221 04DB ; PVALID # CYRILLIC SMALL LETTER SCHWA WITH DIAERESIS 1222 04DC ; DISALLOWED # CYRILLIC CAPITAL LETTER ZHE WITH DIAERESIS 1223 04DD ; PVALID # CYRILLIC SMALL LETTER ZHE WITH DIAERESIS 1224 04DE ; DISALLOWED # CYRILLIC CAPITAL LETTER ZE WITH DIAERESIS 1225 04DF ; PVALID # CYRILLIC SMALL LETTER ZE WITH DIAERESIS 1226 04E0 ; DISALLOWED # CYRILLIC CAPITAL LETTER ABKHASIAN DZE 1227 04E1 ; PVALID # CYRILLIC SMALL LETTER ABKHASIAN DZE 1228 04E2 ; DISALLOWED # CYRILLIC CAPITAL LETTER I WITH MACRON 1229 04E3 ; PVALID # CYRILLIC SMALL LETTER I WITH MACRON 1230 04E4 ; DISALLOWED # CYRILLIC CAPITAL LETTER I WITH DIAERESIS 1231 04E5 ; PVALID # CYRILLIC SMALL LETTER I WITH DIAERESIS 1232 04E6 ; DISALLOWED # CYRILLIC CAPITAL LETTER O WITH DIAERESIS 1233 04E7 ; PVALID # CYRILLIC SMALL LETTER O WITH DIAERESIS 1234 04E8 ; DISALLOWED # CYRILLIC CAPITAL LETTER BARRED O 1235 04E9 ; PVALID # CYRILLIC SMALL LETTER BARRED O 1236 04EA ; DISALLOWED # CYRILLIC CAPITAL LETTER BARRED O WITH DIAERE 1237 04EB ; PVALID # CYRILLIC SMALL LETTER BARRED O WITH DIAERESI 1238 04EC ; DISALLOWED # CYRILLIC CAPITAL LETTER E WITH DIAERESIS 1239 04ED ; PVALID # CYRILLIC SMALL LETTER E WITH DIAERESIS 1240 04EE ; DISALLOWED # CYRILLIC CAPITAL LETTER U WITH MACRON 1241 04EF ; PVALID # CYRILLIC SMALL LETTER U WITH MACRON 1242 04F0 ; DISALLOWED # CYRILLIC CAPITAL LETTER U WITH DIAERESIS 1243 04F1 ; PVALID # CYRILLIC SMALL LETTER U WITH DIAERESIS 1244 04F2 ; DISALLOWED # CYRILLIC CAPITAL LETTER U WITH DOUBLE ACUTE 1245 04F3 ; PVALID # CYRILLIC SMALL LETTER U WITH DOUBLE ACUTE 1246 04F4 ; DISALLOWED # CYRILLIC CAPITAL LETTER CHE WITH DIAERESIS 1247 04F5 ; PVALID # CYRILLIC SMALL LETTER CHE WITH DIAERESIS 1248 04F6 ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH DESCENDER 1249 04F7 ; PVALID # CYRILLIC SMALL LETTER GHE WITH DESCENDER 1250 04F8 ; DISALLOWED # CYRILLIC CAPITAL LETTER YERU WITH DIAERESIS 1251 04F9 ; PVALID # CYRILLIC SMALL LETTER YERU WITH DIAERESIS 1252 04FA ; DISALLOWED # CYRILLIC CAPITAL LETTER GHE WITH STROKE AND 1253 04FB ; PVALID # CYRILLIC SMALL LETTER GHE WITH STROKE AND HO 1254 04FC ; DISALLOWED # CYRILLIC CAPITAL LETTER HA WITH HOOK 1255 04FD ; PVALID # CYRILLIC SMALL LETTER HA WITH HOOK 1256 04FE ; DISALLOWED # CYRILLIC CAPITAL LETTER HA WITH STROKE 1257 04FF ; PVALID # CYRILLIC SMALL LETTER HA WITH STROKE 1258 0500 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI DE 1259 0501 ; PVALID # CYRILLIC SMALL LETTER KOMI DE 1260 0502 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI DJE 1261 0503 ; PVALID # CYRILLIC SMALL LETTER KOMI DJE 1262 0504 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI ZJE 1263 0505 ; PVALID # CYRILLIC SMALL LETTER KOMI ZJE 1264 0506 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI DZJE 1265 0507 ; PVALID # CYRILLIC SMALL LETTER KOMI DZJE 1266 0508 ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI LJE 1267 0509 ; PVALID # CYRILLIC SMALL LETTER KOMI LJE 1268 050A ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI NJE 1269 050B ; PVALID # CYRILLIC SMALL LETTER KOMI NJE 1270 050C ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI SJE 1271 050D ; PVALID # CYRILLIC SMALL LETTER KOMI SJE 1272 050E ; DISALLOWED # CYRILLIC CAPITAL LETTER KOMI TJE 1273 050F ; PVALID # CYRILLIC SMALL LETTER KOMI TJE 1274 0510 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED ZE 1275 0511 ; PVALID # CYRILLIC SMALL LETTER REVERSED ZE 1276 0512 ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH HOOK 1277 0513 ; PVALID # CYRILLIC SMALL LETTER EL WITH HOOK 1278 0514 ; DISALLOWED # CYRILLIC CAPITAL LETTER LHA 1279 0515 ; PVALID # CYRILLIC SMALL LETTER LHA 1280 0516 ; DISALLOWED # CYRILLIC CAPITAL LETTER RHA 1281 0517 ; PVALID # CYRILLIC SMALL LETTER RHA 1282 0518 ; DISALLOWED # CYRILLIC CAPITAL LETTER YAE 1283 0519 ; PVALID # CYRILLIC SMALL LETTER YAE 1284 051A ; DISALLOWED # CYRILLIC CAPITAL LETTER QA 1285 051B ; PVALID # CYRILLIC SMALL LETTER QA 1286 051C ; DISALLOWED # CYRILLIC CAPITAL LETTER WE 1287 051D ; PVALID # CYRILLIC SMALL LETTER WE 1288 051E ; DISALLOWED # CYRILLIC CAPITAL LETTER ALEUT KA 1289 051F ; PVALID # CYRILLIC SMALL LETTER ALEUT KA 1290 0520 ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH MIDDLE HOOK 1291 0521 ; PVALID # CYRILLIC SMALL LETTER EL WITH MIDDLE HOOK 1292 0522 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH MIDDLE HOOK 1293 0523 ; PVALID # CYRILLIC SMALL LETTER EN WITH MIDDLE HOOK 1294 0524 ; DISALLOWED # CYRILLIC CAPITAL LETTER PE WITH DESCENDER 1295 0525 ; PVALID # CYRILLIC SMALL LETTER PE WITH DESCENDER 1296 0526..0530 ; UNASSIGNED # .. 1297 0531..0556 ; DISALLOWED # ARMENIAN CAPITAL LETTER AYB..ARMENIAN CAPITA 1298 0557..0558 ; UNASSIGNED # .. 1299 0559 ; PVALID # ARMENIAN MODIFIER LETTER LEFT HALF RING 1300 055A..055F ; DISALLOWED # ARMENIAN APOSTROPHE..ARMENIAN ABBREVIATION M 1301 0560 ; UNASSIGNED # 1302 0561..0586 ; PVALID # ARMENIAN SMALL LETTER AYB..ARMENIAN SMALL LE 1303 0587 ; DISALLOWED # ARMENIAN SMALL LIGATURE ECH YIWN 1304 0588 ; UNASSIGNED # 1305 0589..058A ; DISALLOWED # ARMENIAN FULL STOP..ARMENIAN HYPHEN 1306 058B..0590 ; UNASSIGNED # .. 1307 0591..05BD ; PVALID # HEBREW ACCENT ETNAHTA..HEBREW POINT METEG 1308 05BE ; DISALLOWED # HEBREW PUNCTUATION MAQAF 1309 05BF ; PVALID # HEBREW POINT RAFE 1310 05C0 ; DISALLOWED # HEBREW PUNCTUATION PASEQ 1311 05C1..05C2 ; PVALID # HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT 1312 05C3 ; DISALLOWED # HEBREW PUNCTUATION SOF PASUQ 1313 05C4..05C5 ; PVALID # HEBREW MARK UPPER DOT..HEBREW MARK LOWER DOT 1314 05C6 ; DISALLOWED # HEBREW PUNCTUATION NUN HAFUKHA 1315 05C7 ; PVALID # HEBREW POINT QAMATS QATAN 1316 05C8..05CF ; UNASSIGNED # .. 1317 05D0..05EA ; PVALID # HEBREW LETTER ALEF..HEBREW LETTER TAV 1318 05EB..05EF ; UNASSIGNED # .. 1319 05F0..05F2 ; PVALID # HEBREW LIGATURE YIDDISH DOUBLE VAV..HEBREW L 1320 05F3..05F4 ; CONTEXTO # HEBREW PUNCTUATION GERESH..HEBREW PUNCTUATIO 1321 05F5..05FF ; UNASSIGNED # .. 1322 0600..0603 ; DISALLOWED # ARABIC NUMBER SIGN..ARABIC SIGN SAFHA 1323 0604..0605 ; UNASSIGNED # .. 1324 0606..060F ; DISALLOWED # ARABIC-INDIC CUBE ROOT..ARABIC SIGN MISRA 1325 0610..061A ; PVALID # ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..AR 1326 061B ; DISALLOWED # ARABIC SEMICOLON 1327 061C..061D ; UNASSIGNED # .. 1328 061E..061F ; DISALLOWED # ARABIC TRIPLE DOT PUNCTUATION MARK..ARABIC Q 1329 0620 ; UNASSIGNED # 1330 0621..063F ; PVALID # ARABIC LETTER HAMZA..ARABIC LETTER FARSI YEH 1331 0640 ; DISALLOWED # ARABIC TATWEEL 1332 0641..065E ; PVALID # ARABIC LETTER FEH..ARABIC FATHA WITH TWO DOT 1333 065F ; UNASSIGNED # 1334 0660..0669 ; CONTEXTO # ARABIC-INDIC DIGIT ZERO..ARABIC-INDIC DIGIT 1335 066A..066D ; DISALLOWED # ARABIC PERCENT SIGN..ARABIC FIVE POINTED STA 1336 066E..0674 ; PVALID # ARABIC LETTER DOTLESS BEH..ARABIC LETTER HIG 1337 0675..0678 ; DISALLOWED # ARABIC LETTER HIGH HAMZA ALEF..ARABIC LETTER 1338 0679..06D3 ; PVALID # ARABIC LETTER TTEH..ARABIC LETTER YEH BARREE 1339 06D4 ; DISALLOWED # ARABIC FULL STOP 1340 06D5..06DC ; PVALID # ARABIC LETTER AE..ARABIC SMALL HIGH SEEN 1341 06DD..06DE ; DISALLOWED # ARABIC END OF AYAH..ARABIC START OF RUB EL H 1342 06DF..06E8 ; PVALID # ARABIC SMALL HIGH ROUNDED ZERO..ARABIC SMALL 1343 06E9 ; DISALLOWED # ARABIC PLACE OF SAJDAH 1344 06EA..06EF ; PVALID # ARABIC EMPTY CENTRE LOW STOP..ARABIC LETTER 1345 06F0..06F9 ; CONTEXTO # EXTENDED ARABIC-INDIC DIGIT ZERO..EXTENDED A 1346 06FA..06FF ; PVALID # ARABIC LETTER SHEEN WITH DOT BELOW..ARABIC L 1347 0700..070D ; DISALLOWED # SYRIAC END OF PARAGRAPH..SYRIAC HARKLEAN AST 1348 070E ; UNASSIGNED # 1349 070F ; DISALLOWED # SYRIAC ABBREVIATION MARK 1350 0710..074A ; PVALID # SYRIAC LETTER ALAPH..SYRIAC BARREKH 1351 074B..074C ; UNASSIGNED # .. 1352 074D..07B1 ; PVALID # SYRIAC LETTER SOGDIAN ZHAIN..THAANA LETTER N 1353 07B2..07BF ; UNASSIGNED # .. 1354 07C0..07F5 ; PVALID # NKO DIGIT ZERO..NKO LOW TONE APOSTROPHE 1355 07F6..07FA ; DISALLOWED # NKO SYMBOL OO DENNEN..NKO LAJANYALAN 1356 07FB..07FF ; UNASSIGNED # .. 1357 0800..082D ; PVALID # SAMARITAN LETTER ALAF..SAMARITAN MARK NEQUDA 1358 082E..082F ; UNASSIGNED # .. 1359 0830..083E ; DISALLOWED # SAMARITAN PUNCTUATION NEQUDAA..SAMARITAN PUN 1360 083F..08FF ; UNASSIGNED # .. 1361 0900..0939 ; PVALID # DEVANAGARI SIGN INVERTED CANDRABINDU..DEVANA 1362 093A..093B ; UNASSIGNED # .. 1363 093C..094E ; PVALID # DEVANAGARI SIGN NUKTA..DEVANAGARI VOWEL SIGN 1364 094F ; UNASSIGNED # 1365 0950..0955 ; PVALID # DEVANAGARI OM..DEVANAGARI VOWEL SIGN CANDRA 1366 0956..0957 ; UNASSIGNED # .. 1367 0958..095F ; DISALLOWED # DEVANAGARI LETTER QA..DEVANAGARI LETTER YYA 1368 0960..0963 ; PVALID # DEVANAGARI LETTER VOCALIC RR..DEVANAGARI VOW 1369 0964..0965 ; DISALLOWED # DEVANAGARI DANDA..DEVANAGARI DOUBLE DANDA 1370 0966..096F ; PVALID # DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE 1371 0970 ; DISALLOWED # DEVANAGARI ABBREVIATION SIGN 1372 0971..0972 ; PVALID # DEVANAGARI SIGN HIGH SPACING DOT..DEVANAGARI 1373 0973..0978 ; UNASSIGNED # .. 1374 0979..097F ; PVALID # DEVANAGARI LETTER ZHA..DEVANAGARI LETTER BBA 1375 0980 ; UNASSIGNED # 1376 0981..0983 ; PVALID # BENGALI SIGN CANDRABINDU..BENGALI SIGN VISAR 1377 0984 ; UNASSIGNED # 1378 0985..098C ; PVALID # BENGALI LETTER A..BENGALI LETTER VOCALIC L 1379 098D..098E ; UNASSIGNED # .. 1380 098F..0990 ; PVALID # BENGALI LETTER E..BENGALI LETTER AI 1381 0991..0992 ; UNASSIGNED # .. 1382 0993..09A8 ; PVALID # BENGALI LETTER O..BENGALI LETTER NA 1383 09A9 ; UNASSIGNED # 1384 09AA..09B0 ; PVALID # BENGALI LETTER PA..BENGALI LETTER RA 1385 09B1 ; UNASSIGNED # 1386 09B2 ; PVALID # BENGALI LETTER LA 1387 09B3..09B5 ; UNASSIGNED # .. 1388 09B6..09B9 ; PVALID # BENGALI LETTER SHA..BENGALI LETTER HA 1389 09BA..09BB ; UNASSIGNED # .. 1390 09BC..09C4 ; PVALID # BENGALI SIGN NUKTA..BENGALI VOWEL SIGN VOCAL 1391 09C5..09C6 ; UNASSIGNED # .. 1392 09C7..09C8 ; PVALID # BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI 1393 09C9..09CA ; UNASSIGNED # .. 1394 09CB..09CE ; PVALID # BENGALI VOWEL SIGN O..BENGALI LETTER KHANDA 1395 09CF..09D6 ; UNASSIGNED # .. 1396 09D7 ; PVALID # BENGALI AU LENGTH MARK 1397 09D8..09DB ; UNASSIGNED # .. 1398 09DC..09DD ; DISALLOWED # BENGALI LETTER RRA..BENGALI LETTER RHA 1399 09DE ; UNASSIGNED # 1400 09DF ; DISALLOWED # BENGALI LETTER YYA 1401 09E0..09E3 ; PVALID # BENGALI LETTER VOCALIC RR..BENGALI VOWEL SIG 1402 09E4..09E5 ; UNASSIGNED # .. 1403 09E6..09F1 ; PVALID # BENGALI DIGIT ZERO..BENGALI LETTER RA WITH L 1404 09F2..09FB ; DISALLOWED # BENGALI RUPEE MARK..BENGALI GANDA MARK 1405 09FC..0A00 ; UNASSIGNED # .. 1406 0A01..0A03 ; PVALID # GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN VISA 1407 0A04 ; UNASSIGNED # 1408 0A05..0A0A ; PVALID # GURMUKHI LETTER A..GURMUKHI LETTER UU 1409 0A0B..0A0E ; UNASSIGNED # .. 1410 0A0F..0A10 ; PVALID # GURMUKHI LETTER EE..GURMUKHI LETTER AI 1411 0A11..0A12 ; UNASSIGNED # .. 1412 0A13..0A28 ; PVALID # GURMUKHI LETTER OO..GURMUKHI LETTER NA 1413 0A29 ; UNASSIGNED # 1414 0A2A..0A30 ; PVALID # GURMUKHI LETTER PA..GURMUKHI LETTER RA 1415 0A31 ; UNASSIGNED # 1416 0A32 ; PVALID # GURMUKHI LETTER LA 1417 0A33 ; DISALLOWED # GURMUKHI LETTER LLA 1418 0A34 ; UNASSIGNED # 1419 0A35 ; PVALID # GURMUKHI LETTER VA 1420 0A36 ; DISALLOWED # GURMUKHI LETTER SHA 1421 0A37 ; UNASSIGNED # 1422 0A38..0A39 ; PVALID # GURMUKHI LETTER SA..GURMUKHI LETTER HA 1423 0A3A..0A3B ; UNASSIGNED # .. 1424 0A3C ; PVALID # GURMUKHI SIGN NUKTA 1425 0A3D ; UNASSIGNED # 1426 0A3E..0A42 ; PVALID # GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN 1427 0A43..0A46 ; UNASSIGNED # .. 1428 0A47..0A48 ; PVALID # GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN 1429 0A49..0A4A ; UNASSIGNED # .. 1430 0A4B..0A4D ; PVALID # GURMUKHI VOWEL SIGN OO..GURMUKHI SIGN VIRAMA 1431 0A4E..0A50 ; UNASSIGNED # .. 1432 0A51 ; PVALID # GURMUKHI SIGN UDAAT 1433 0A52..0A58 ; UNASSIGNED # .. 1434 0A59..0A5B ; DISALLOWED # GURMUKHI LETTER KHHA..GURMUKHI LETTER ZA 1435 0A5C ; PVALID # GURMUKHI LETTER RRA 1436 0A5D ; UNASSIGNED # 1437 0A5E ; DISALLOWED # GURMUKHI LETTER FA 1438 0A5F..0A65 ; UNASSIGNED # .. 1439 0A66..0A75 ; PVALID # GURMUKHI DIGIT ZERO..GURMUKHI SIGN YAKASH 1440 0A76..0A80 ; UNASSIGNED # .. 1441 0A81..0A83 ; PVALID # GUJARATI SIGN CANDRABINDU..GUJARATI SIGN VIS 1442 0A84 ; UNASSIGNED # 1443 0A85..0A8D ; PVALID # GUJARATI LETTER A..GUJARATI VOWEL CANDRA E 1444 0A8E ; UNASSIGNED # 1445 0A8F..0A91 ; PVALID # GUJARATI LETTER E..GUJARATI VOWEL CANDRA O 1446 0A92 ; UNASSIGNED # 1447 0A93..0AA8 ; PVALID # GUJARATI LETTER O..GUJARATI LETTER NA 1448 0AA9 ; UNASSIGNED # 1449 0AAA..0AB0 ; PVALID # GUJARATI LETTER PA..GUJARATI LETTER RA 1450 0AB1 ; UNASSIGNED # 1451 0AB2..0AB3 ; PVALID # GUJARATI LETTER LA..GUJARATI LETTER LLA 1452 0AB4 ; UNASSIGNED # 1453 0AB5..0AB9 ; PVALID # GUJARATI LETTER VA..GUJARATI LETTER HA 1454 0ABA..0ABB ; UNASSIGNED # .. 1455 0ABC..0AC5 ; PVALID # GUJARATI SIGN NUKTA..GUJARATI VOWEL SIGN CAN 1456 0AC6 ; UNASSIGNED # 1457 0AC7..0AC9 ; PVALID # GUJARATI VOWEL SIGN E..GUJARATI VOWEL SIGN C 1458 0ACA ; UNASSIGNED # 1459 0ACB..0ACD ; PVALID # GUJARATI VOWEL SIGN O..GUJARATI SIGN VIRAMA 1460 0ACE..0ACF ; UNASSIGNED # .. 1461 0AD0 ; PVALID # GUJARATI OM 1462 0AD1..0ADF ; UNASSIGNED # .. 1463 0AE0..0AE3 ; PVALID # GUJARATI LETTER VOCALIC RR..GUJARATI VOWEL S 1464 0AE4..0AE5 ; UNASSIGNED # .. 1465 0AE6..0AEF ; PVALID # GUJARATI DIGIT ZERO..GUJARATI DIGIT NINE 1466 0AF0 ; UNASSIGNED # 1467 0AF1 ; DISALLOWED # GUJARATI RUPEE SIGN 1468 0AF2..0B00 ; UNASSIGNED # .. 1469 0B01..0B03 ; PVALID # ORIYA SIGN CANDRABINDU..ORIYA SIGN VISARGA 1470 0B04 ; UNASSIGNED # 1471 0B05..0B0C ; PVALID # ORIYA LETTER A..ORIYA LETTER VOCALIC L 1472 0B0D..0B0E ; UNASSIGNED # .. 1473 0B0F..0B10 ; PVALID # ORIYA LETTER E..ORIYA LETTER AI 1474 0B11..0B12 ; UNASSIGNED # .. 1475 0B13..0B28 ; PVALID # ORIYA LETTER O..ORIYA LETTER NA 1476 0B29 ; UNASSIGNED # 1477 0B2A..0B30 ; PVALID # ORIYA LETTER PA..ORIYA LETTER RA 1478 0B31 ; UNASSIGNED # 1479 0B32..0B33 ; PVALID # ORIYA LETTER LA..ORIYA LETTER LLA 1480 0B34 ; UNASSIGNED # 1481 0B35..0B39 ; PVALID # ORIYA LETTER VA..ORIYA LETTER HA 1482 0B3A..0B3B ; UNASSIGNED # .. 1483 0B3C..0B44 ; PVALID # ORIYA SIGN NUKTA..ORIYA VOWEL SIGN VOCALIC R 1484 0B45..0B46 ; UNASSIGNED # .. 1485 0B47..0B48 ; PVALID # ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI 1486 0B49..0B4A ; UNASSIGNED # .. 1487 0B4B..0B4D ; PVALID # ORIYA VOWEL SIGN O..ORIYA SIGN VIRAMA 1488 0B4E..0B55 ; UNASSIGNED # .. 1489 0B56..0B57 ; PVALID # ORIYA AI LENGTH MARK..ORIYA AU LENGTH MARK 1490 0B58..0B5B ; UNASSIGNED # .. 1491 0B5C..0B5D ; DISALLOWED # ORIYA LETTER RRA..ORIYA LETTER RHA 1492 0B5E ; UNASSIGNED # 1493 0B5F..0B63 ; PVALID # ORIYA LETTER YYA..ORIYA VOWEL SIGN VOCALIC L 1494 0B64..0B65 ; UNASSIGNED # .. 1495 0B66..0B6F ; PVALID # ORIYA DIGIT ZERO..ORIYA DIGIT NINE 1496 0B70 ; DISALLOWED # ORIYA ISSHAR 1497 0B71 ; PVALID # ORIYA LETTER WA 1498 0B72..0B81 ; UNASSIGNED # .. 1499 0B82..0B83 ; PVALID # TAMIL SIGN ANUSVARA..TAMIL SIGN VISARGA 1500 0B84 ; UNASSIGNED # 1501 0B85..0B8A ; PVALID # TAMIL LETTER A..TAMIL LETTER UU 1502 0B8B..0B8D ; UNASSIGNED # .. 1503 0B8E..0B90 ; PVALID # TAMIL LETTER E..TAMIL LETTER AI 1504 0B91 ; UNASSIGNED # 1505 0B92..0B95 ; PVALID # TAMIL LETTER O..TAMIL LETTER KA 1506 0B96..0B98 ; UNASSIGNED # .. 1507 0B99..0B9A ; PVALID # TAMIL LETTER NGA..TAMIL LETTER CA 1508 0B9B ; UNASSIGNED # 1509 0B9C ; PVALID # TAMIL LETTER JA 1510 0B9D ; UNASSIGNED # 1511 0B9E..0B9F ; PVALID # TAMIL LETTER NYA..TAMIL LETTER TTA 1512 0BA0..0BA2 ; UNASSIGNED # .. 1513 0BA3..0BA4 ; PVALID # TAMIL LETTER NNA..TAMIL LETTER TA 1514 0BA5..0BA7 ; UNASSIGNED # .. 1515 0BA8..0BAA ; PVALID # TAMIL LETTER NA..TAMIL LETTER PA 1516 0BAB..0BAD ; UNASSIGNED # .. 1517 0BAE..0BB9 ; PVALID # TAMIL LETTER MA..TAMIL LETTER HA 1518 0BBA..0BBD ; UNASSIGNED # .. 1519 0BBE..0BC2 ; PVALID # TAMIL VOWEL SIGN AA..TAMIL VOWEL SIGN UU 1520 0BC3..0BC5 ; UNASSIGNED # .. 1521 0BC6..0BC8 ; PVALID # TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI 1522 0BC9 ; UNASSIGNED # 1523 0BCA..0BCD ; PVALID # TAMIL VOWEL SIGN O..TAMIL SIGN VIRAMA 1524 0BCE..0BCF ; UNASSIGNED # .. 1525 0BD0 ; PVALID # TAMIL OM 1526 0BD1..0BD6 ; UNASSIGNED # .. 1527 0BD7 ; PVALID # TAMIL AU LENGTH MARK 1528 0BD8..0BE5 ; UNASSIGNED # .. 1529 0BE6..0BEF ; PVALID # TAMIL DIGIT ZERO..TAMIL DIGIT NINE 1530 0BF0..0BFA ; DISALLOWED # TAMIL NUMBER TEN..TAMIL NUMBER SIGN 1531 0BFB..0C00 ; UNASSIGNED # .. 1532 0C01..0C03 ; PVALID # TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA 1533 0C04 ; UNASSIGNED # 1534 0C05..0C0C ; PVALID # TELUGU LETTER A..TELUGU LETTER VOCALIC L 1535 0C0D ; UNASSIGNED # 1536 0C0E..0C10 ; PVALID # TELUGU LETTER E..TELUGU LETTER AI 1537 0C11 ; UNASSIGNED # 1538 0C12..0C28 ; PVALID # TELUGU LETTER O..TELUGU LETTER NA 1539 0C29 ; UNASSIGNED # 1540 0C2A..0C33 ; PVALID # TELUGU LETTER PA..TELUGU LETTER LLA 1541 0C34 ; UNASSIGNED # 1542 0C35..0C39 ; PVALID # TELUGU LETTER VA..TELUGU LETTER HA 1543 0C3A..0C3C ; UNASSIGNED # .. 1544 0C3D..0C44 ; PVALID # TELUGU SIGN AVAGRAHA..TELUGU VOWEL SIGN VOCA 1545 0C45 ; UNASSIGNED # 1546 0C46..0C48 ; PVALID # TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI 1547 0C49 ; UNASSIGNED # 1548 0C4A..0C4D ; PVALID # TELUGU VOWEL SIGN O..TELUGU SIGN VIRAMA 1549 0C4E..0C54 ; UNASSIGNED # .. 1550 0C55..0C56 ; PVALID # TELUGU LENGTH MARK..TELUGU AI LENGTH MARK 1551 0C57 ; UNASSIGNED # 1552 0C58..0C59 ; PVALID # TELUGU LETTER TSA..TELUGU LETTER DZA 1553 0C5A..0C5F ; UNASSIGNED # .. 1554 0C60..0C63 ; PVALID # TELUGU LETTER VOCALIC RR..TELUGU VOWEL SIGN 1555 0C64..0C65 ; UNASSIGNED # .. 1556 0C66..0C6F ; PVALID # TELUGU DIGIT ZERO..TELUGU DIGIT NINE 1557 0C70..0C77 ; UNASSIGNED # .. 1558 0C78..0C7F ; DISALLOWED # TELUGU FRACTION DIGIT ZERO FOR ODD POWERS OF 1559 0C80..0C81 ; UNASSIGNED # .. 1560 0C82..0C83 ; PVALID # KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA 1561 0C84 ; UNASSIGNED # 1562 0C85..0C8C ; PVALID # KANNADA LETTER A..KANNADA LETTER VOCALIC L 1563 0C8D ; UNASSIGNED # 1564 0C8E..0C90 ; PVALID # KANNADA LETTER E..KANNADA LETTER AI 1565 0C91 ; UNASSIGNED # 1566 0C92..0CA8 ; PVALID # KANNADA LETTER O..KANNADA LETTER NA 1567 0CA9 ; UNASSIGNED # 1568 0CAA..0CB3 ; PVALID # KANNADA LETTER PA..KANNADA LETTER LLA 1569 0CB4 ; UNASSIGNED # 1570 0CB5..0CB9 ; PVALID # KANNADA LETTER VA..KANNADA LETTER HA 1571 0CBA..0CBB ; UNASSIGNED # .. 1572 0CBC..0CC4 ; PVALID # KANNADA SIGN NUKTA..KANNADA VOWEL SIGN VOCAL 1573 0CC5 ; UNASSIGNED # 1574 0CC6..0CC8 ; PVALID # KANNADA VOWEL SIGN E..KANNADA VOWEL SIGN AI 1575 0CC9 ; UNASSIGNED # 1576 0CCA..0CCD ; PVALID # KANNADA VOWEL SIGN O..KANNADA SIGN VIRAMA 1577 0CCE..0CD4 ; UNASSIGNED # .. 1578 0CD5..0CD6 ; PVALID # KANNADA LENGTH MARK..KANNADA AI LENGTH MARK 1579 0CD7..0CDD ; UNASSIGNED # .. 1580 0CDE ; PVALID # KANNADA LETTER FA 1581 0CDF ; UNASSIGNED # 1582 0CE0..0CE3 ; PVALID # KANNADA LETTER VOCALIC RR..KANNADA VOWEL SIG 1583 0CE4..0CE5 ; UNASSIGNED # .. 1584 0CE6..0CEF ; PVALID # KANNADA DIGIT ZERO..KANNADA DIGIT NINE 1585 0CF0 ; UNASSIGNED # 1586 0CF1..0CF2 ; DISALLOWED # KANNADA SIGN JIHVAMULIYA..KANNADA SIGN UPADH 1587 0CF3..0D01 ; UNASSIGNED # .. 1588 0D02..0D03 ; PVALID # MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISA 1589 0D04 ; UNASSIGNED # 1590 0D05..0D0C ; PVALID # MALAYALAM LETTER A..MALAYALAM LETTER VOCALIC 1591 0D0D ; UNASSIGNED # 1592 0D0E..0D10 ; PVALID # MALAYALAM LETTER E..MALAYALAM LETTER AI 1593 0D11 ; UNASSIGNED # 1594 0D12..0D28 ; PVALID # MALAYALAM LETTER O..MALAYALAM LETTER NA 1595 0D29 ; UNASSIGNED # 1596 0D2A..0D39 ; PVALID # MALAYALAM LETTER PA..MALAYALAM LETTER HA 1597 0D3A..0D3C ; UNASSIGNED # .. 1598 0D3D..0D44 ; PVALID # MALAYALAM SIGN AVAGRAHA..MALAYALAM VOWEL SIG 1599 0D45 ; UNASSIGNED # 1600 0D46..0D48 ; PVALID # MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN 1601 0D49 ; UNASSIGNED # 1602 0D4A..0D4D ; PVALID # MALAYALAM VOWEL SIGN O..MALAYALAM SIGN VIRAM 1603 0D4E..0D56 ; UNASSIGNED # .. 1604 0D57 ; PVALID # MALAYALAM AU LENGTH MARK 1605 0D58..0D5F ; UNASSIGNED # .. 1606 0D60..0D63 ; PVALID # MALAYALAM LETTER VOCALIC RR..MALAYALAM VOWEL 1607 0D64..0D65 ; UNASSIGNED # .. 1608 0D66..0D6F ; PVALID # MALAYALAM DIGIT ZERO..MALAYALAM DIGIT NINE 1609 0D70..0D75 ; DISALLOWED # MALAYALAM NUMBER TEN..MALAYALAM FRACTION THR 1610 0D76..0D78 ; UNASSIGNED # .. 1611 0D79 ; DISALLOWED # MALAYALAM DATE MARK 1612 0D7A..0D7F ; PVALID # MALAYALAM LETTER CHILLU NN..MALAYALAM LETTER 1613 0D80..0D81 ; UNASSIGNED # .. 1614 0D82..0D83 ; PVALID # SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARG 1615 0D84 ; UNASSIGNED # 1616 0D85..0D96 ; PVALID # SINHALA LETTER AYANNA..SINHALA LETTER AUYANN 1617 0D97..0D99 ; UNASSIGNED # .. 1618 0D9A..0DB1 ; PVALID # SINHALA LETTER ALPAPRAANA KAYANNA..SINHALA L 1619 0DB2 ; UNASSIGNED # 1620 0DB3..0DBB ; PVALID # SINHALA LETTER SANYAKA DAYANNA..SINHALA LETT 1621 0DBC ; UNASSIGNED # 1622 0DBD ; PVALID # SINHALA LETTER DANTAJA LAYANNA 1623 0DBE..0DBF ; UNASSIGNED # .. 1624 0DC0..0DC6 ; PVALID # SINHALA LETTER VAYANNA..SINHALA LETTER FAYAN 1625 0DC7..0DC9 ; UNASSIGNED # .. 1626 0DCA ; PVALID # SINHALA SIGN AL-LAKUNA 1627 0DCB..0DCE ; UNASSIGNED # .. 1628 0DCF..0DD4 ; PVALID # SINHALA VOWEL SIGN AELA-PILLA..SINHALA VOWEL 1629 0DD5 ; UNASSIGNED # 1630 0DD6 ; PVALID # SINHALA VOWEL SIGN DIGA PAA-PILLA 1631 0DD7 ; UNASSIGNED # 1632 0DD8..0DDF ; PVALID # SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOW 1633 0DE0..0DF1 ; UNASSIGNED # .. 1634 0DF2..0DF3 ; PVALID # SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHAL 1635 0DF4 ; DISALLOWED # SINHALA PUNCTUATION KUNDDALIYA 1636 0DF5..0E00 ; UNASSIGNED # .. 1637 0E01..0E32 ; PVALID # THAI CHARACTER KO KAI..THAI CHARACTER SARA A 1638 0E33 ; DISALLOWED # THAI CHARACTER SARA AM 1639 0E34..0E3A ; PVALID # THAI CHARACTER SARA I..THAI CHARACTER PHINTH 1640 0E3B..0E3E ; UNASSIGNED # .. 1641 0E3F ; DISALLOWED # THAI CURRENCY SYMBOL BAHT 1642 0E40..0E4E ; PVALID # THAI CHARACTER SARA E..THAI CHARACTER YAMAKK 1643 0E4F ; DISALLOWED # THAI CHARACTER FONGMAN 1644 0E50..0E59 ; PVALID # THAI DIGIT ZERO..THAI DIGIT NINE 1645 0E5A..0E5B ; DISALLOWED # THAI CHARACTER ANGKHANKHU..THAI CHARACTER KH 1646 0E5C..0E80 ; UNASSIGNED # .. 1647 0E81..0E82 ; PVALID # LAO LETTER KO..LAO LETTER KHO SUNG 1648 0E83 ; UNASSIGNED # 1649 0E84 ; PVALID # LAO LETTER KHO TAM 1650 0E85..0E86 ; UNASSIGNED # .. 1651 0E87..0E88 ; PVALID # LAO LETTER NGO..LAO LETTER CO 1652 0E89 ; UNASSIGNED # 1653 0E8A ; PVALID # LAO LETTER SO TAM 1654 0E8B..0E8C ; UNASSIGNED # .. 1655 0E8D ; PVALID # LAO LETTER NYO 1656 0E8E..0E93 ; UNASSIGNED # .. 1657 0E94..0E97 ; PVALID # LAO LETTER DO..LAO LETTER THO TAM 1658 0E98 ; UNASSIGNED # 1659 0E99..0E9F ; PVALID # LAO LETTER NO..LAO LETTER FO SUNG 1660 0EA0 ; UNASSIGNED # 1661 0EA1..0EA3 ; PVALID # LAO LETTER MO..LAO LETTER LO LING 1662 0EA4 ; UNASSIGNED # 1663 0EA5 ; PVALID # LAO LETTER LO LOOT 1664 0EA6 ; UNASSIGNED # 1665 0EA7 ; PVALID # LAO LETTER WO 1666 0EA8..0EA9 ; UNASSIGNED # .. 1667 0EAA..0EAB ; PVALID # LAO LETTER SO SUNG..LAO LETTER HO SUNG 1668 0EAC ; UNASSIGNED # 1669 0EAD..0EB2 ; PVALID # LAO LETTER O..LAO VOWEL SIGN AA 1670 0EB3 ; DISALLOWED # LAO VOWEL SIGN AM 1671 0EB4..0EB9 ; PVALID # LAO VOWEL SIGN I..LAO VOWEL SIGN UU 1672 0EBA ; UNASSIGNED # 1673 0EBB..0EBD ; PVALID # LAO VOWEL SIGN MAI KON..LAO SEMIVOWEL SIGN N 1674 0EBE..0EBF ; UNASSIGNED # .. 1675 0EC0..0EC4 ; PVALID # LAO VOWEL SIGN E..LAO VOWEL SIGN AI 1676 0EC5 ; UNASSIGNED # 1677 0EC6 ; PVALID # LAO KO LA 1678 0EC7 ; UNASSIGNED # 1679 0EC8..0ECD ; PVALID # LAO TONE MAI EK..LAO NIGGAHITA 1680 0ECE..0ECF ; UNASSIGNED # .. 1681 0ED0..0ED9 ; PVALID # LAO DIGIT ZERO..LAO DIGIT NINE 1682 0EDA..0EDB ; UNASSIGNED # .. 1683 0EDC..0EDD ; DISALLOWED # LAO HO NO..LAO HO MO 1684 0EDE..0EFF ; UNASSIGNED # .. 1685 0F00 ; PVALID # TIBETAN SYLLABLE OM 1686 0F01..0F0A ; DISALLOWED # TIBETAN MARK GTER YIG MGO TRUNCATED A..TIBET 1687 0F0B ; PVALID # TIBETAN MARK INTERSYLLABIC TSHEG 1688 0F0C..0F17 ; DISALLOWED # TIBETAN MARK DELIMITER TSHEG BSTAR..TIBETAN 1689 0F18..0F19 ; PVALID # TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN 1690 0F1A..0F1F ; DISALLOWED # TIBETAN SIGN RDEL DKAR GCIG..TIBETAN SIGN RD 1691 0F20..0F29 ; PVALID # TIBETAN DIGIT ZERO..TIBETAN DIGIT NINE 1692 0F2A..0F34 ; DISALLOWED # TIBETAN DIGIT HALF ONE..TIBETAN MARK BSDUS R 1693 0F35 ; PVALID # TIBETAN MARK NGAS BZUNG NYI ZLA 1694 0F36 ; DISALLOWED # TIBETAN MARK CARET -DZUD RTAGS BZHI MIG CAN 1695 0F37 ; PVALID # TIBETAN MARK NGAS BZUNG SGOR RTAGS 1696 0F38 ; DISALLOWED # TIBETAN MARK CHE MGO 1697 0F39 ; PVALID # TIBETAN MARK TSA -PHRU 1698 0F3A..0F3D ; DISALLOWED # TIBETAN MARK GUG RTAGS GYON..TIBETAN MARK AN 1699 0F3E..0F42 ; PVALID # TIBETAN SIGN YAR TSHES..TIBETAN LETTER GA 1700 0F43 ; DISALLOWED # TIBETAN LETTER GHA 1701 0F44..0F47 ; PVALID # TIBETAN LETTER NGA..TIBETAN LETTER JA 1702 0F48 ; UNASSIGNED # 1703 0F49..0F4C ; PVALID # TIBETAN LETTER NYA..TIBETAN LETTER DDA 1704 0F4D ; DISALLOWED # TIBETAN LETTER DDHA 1705 0F4E..0F51 ; PVALID # TIBETAN LETTER NNA..TIBETAN LETTER DA 1706 0F52 ; DISALLOWED # TIBETAN LETTER DHA 1707 0F53..0F56 ; PVALID # TIBETAN LETTER NA..TIBETAN LETTER BA 1708 0F57 ; DISALLOWED # TIBETAN LETTER BHA 1709 0F58..0F5B ; PVALID # TIBETAN LETTER MA..TIBETAN LETTER DZA 1710 0F5C ; DISALLOWED # TIBETAN LETTER DZHA 1711 0F5D..0F68 ; PVALID # TIBETAN LETTER WA..TIBETAN LETTER A 1712 0F69 ; DISALLOWED # TIBETAN LETTER KSSA 1713 0F6A..0F6C ; PVALID # TIBETAN LETTER FIXED-FORM RA..TIBETAN LETTER 1714 0F6D..0F70 ; UNASSIGNED # .. 1715 0F71..0F72 ; PVALID # TIBETAN VOWEL SIGN AA..TIBETAN VOWEL SIGN I 1716 0F73 ; DISALLOWED # TIBETAN VOWEL SIGN II 1717 0F74 ; PVALID # TIBETAN VOWEL SIGN U 1718 0F75..0F79 ; DISALLOWED # TIBETAN VOWEL SIGN UU..TIBETAN VOWEL SIGN VO 1719 0F7A..0F80 ; PVALID # TIBETAN VOWEL SIGN E..TIBETAN VOWEL SIGN REV 1720 0F81 ; DISALLOWED # TIBETAN VOWEL SIGN REVERSED II 1721 0F82..0F84 ; PVALID # TIBETAN SIGN NYI ZLA NAA DA..TIBETAN MARK HA 1722 0F85 ; DISALLOWED # TIBETAN MARK PALUTA 1723 0F86..0F8B ; PVALID # TIBETAN SIGN LCI RTAGS..TIBETAN SIGN GRU MED 1724 0F8C..0F8F ; UNASSIGNED # .. 1725 0F90..0F92 ; PVALID # TIBETAN SUBJOINED LETTER KA..TIBETAN SUBJOIN 1726 0F93 ; DISALLOWED # TIBETAN SUBJOINED LETTER GHA 1727 0F94..0F97 ; PVALID # TIBETAN SUBJOINED LETTER NGA..TIBETAN SUBJOI 1728 0F98 ; UNASSIGNED # 1729 0F99..0F9C ; PVALID # TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOI 1730 0F9D ; DISALLOWED # TIBETAN SUBJOINED LETTER DDHA 1731 0F9E..0FA1 ; PVALID # TIBETAN SUBJOINED LETTER NNA..TIBETAN SUBJOI 1732 0FA2 ; DISALLOWED # TIBETAN SUBJOINED LETTER DHA 1733 0FA3..0FA6 ; PVALID # TIBETAN SUBJOINED LETTER NA..TIBETAN SUBJOIN 1734 0FA7 ; DISALLOWED # TIBETAN SUBJOINED LETTER BHA 1735 0FA8..0FAB ; PVALID # TIBETAN SUBJOINED LETTER MA..TIBETAN SUBJOIN 1736 0FAC ; DISALLOWED # TIBETAN SUBJOINED LETTER DZHA 1737 0FAD..0FB8 ; PVALID # TIBETAN SUBJOINED LETTER WA..TIBETAN SUBJOIN 1738 0FB9 ; DISALLOWED # TIBETAN SUBJOINED LETTER KSSA 1739 0FBA..0FBC ; PVALID # TIBETAN SUBJOINED LETTER FIXED-FORM WA..TIBE 1740 0FBD ; UNASSIGNED # 1741 0FBE..0FC5 ; DISALLOWED # TIBETAN KU RU KHA..TIBETAN SYMBOL RDO RJE 1742 0FC6 ; PVALID # TIBETAN SYMBOL PADMA GDAN 1743 0FC7..0FCC ; DISALLOWED # TIBETAN SYMBOL RDO RJE RGYA GRAM..TIBETAN SY 1744 0FCD ; UNASSIGNED # 1745 0FCE..0FD8 ; DISALLOWED # TIBETAN SIGN RDEL NAG RDEL DKAR..LEFT-FACING 1746 0FD9..0FFF ; UNASSIGNED # .. 1747 1000..1049 ; PVALID # MYANMAR LETTER KA..MYANMAR DIGIT NINE 1748 104A..104F ; DISALLOWED # MYANMAR SIGN LITTLE SECTION..MYANMAR SYMBOL 1749 1050..109D ; PVALID # MYANMAR LETTER SHA..MYANMAR VOWEL SIGN AITON 1750 109E..10C5 ; DISALLOWED # MYANMAR SYMBOL SHAN ONE..GEORGIAN CAPITAL LE 1751 10C6..10CF ; UNASSIGNED # .. 1752 10D0..10FA ; PVALID # GEORGIAN LETTER AN..GEORGIAN LETTER AIN 1753 10FB..10FC ; DISALLOWED # GEORGIAN PARAGRAPH SEPARATOR..MODIFIER LETTE 1754 10FD..10FF ; UNASSIGNED # .. 1755 1100..11FF ; DISALLOWED # HANGUL CHOSEONG KIYEOK..HANGUL JONGSEONG SSA 1756 1200..1248 ; PVALID # ETHIOPIC SYLLABLE HA..ETHIOPIC SYLLABLE QWA 1757 1249 ; UNASSIGNED # 1758 124A..124D ; PVALID # ETHIOPIC SYLLABLE QWI..ETHIOPIC SYLLABLE QWE 1759 124E..124F ; UNASSIGNED # .. 1760 1250..1256 ; PVALID # ETHIOPIC SYLLABLE QHA..ETHIOPIC SYLLABLE QHO 1761 1257 ; UNASSIGNED # 1762 1258 ; PVALID # ETHIOPIC SYLLABLE QHWA 1763 1259 ; UNASSIGNED # 1764 125A..125D ; PVALID # ETHIOPIC SYLLABLE QHWI..ETHIOPIC SYLLABLE QH 1765 125E..125F ; UNASSIGNED # .. 1766 1260..1288 ; PVALID # ETHIOPIC SYLLABLE BA..ETHIOPIC SYLLABLE XWA 1767 1289 ; UNASSIGNED # 1768 128A..128D ; PVALID # ETHIOPIC SYLLABLE XWI..ETHIOPIC SYLLABLE XWE 1769 128E..128F ; UNASSIGNED # .. 1770 1290..12B0 ; PVALID # ETHIOPIC SYLLABLE NA..ETHIOPIC SYLLABLE KWA 1771 12B1 ; UNASSIGNED # 1772 12B2..12B5 ; PVALID # ETHIOPIC SYLLABLE KWI..ETHIOPIC SYLLABLE KWE 1773 12B6..12B7 ; UNASSIGNED # .. 1774 12B8..12BE ; PVALID # ETHIOPIC SYLLABLE KXA..ETHIOPIC SYLLABLE KXO 1775 12BF ; UNASSIGNED # 1776 12C0 ; PVALID # ETHIOPIC SYLLABLE KXWA 1777 12C1 ; UNASSIGNED # 1778 12C2..12C5 ; PVALID # ETHIOPIC SYLLABLE KXWI..ETHIOPIC SYLLABLE KX 1779 12C6..12C7 ; UNASSIGNED # .. 1780 12C8..12D6 ; PVALID # ETHIOPIC SYLLABLE WA..ETHIOPIC SYLLABLE PHAR 1781 12D7 ; UNASSIGNED # 1782 12D8..1310 ; PVALID # ETHIOPIC SYLLABLE ZA..ETHIOPIC SYLLABLE GWA 1783 1311 ; UNASSIGNED # 1784 1312..1315 ; PVALID # ETHIOPIC SYLLABLE GWI..ETHIOPIC SYLLABLE GWE 1785 1316..1317 ; UNASSIGNED # .. 1786 1318..135A ; PVALID # ETHIOPIC SYLLABLE GGA..ETHIOPIC SYLLABLE FYA 1787 135B..135E ; UNASSIGNED # .. 1788 135F ; PVALID # ETHIOPIC COMBINING GEMINATION MARK 1789 1360..137C ; DISALLOWED # ETHIOPIC SECTION MARK..ETHIOPIC NUMBER TEN T 1790 137D..137F ; UNASSIGNED # .. 1791 1380..138F ; PVALID # ETHIOPIC SYLLABLE SEBATBEIT MWA..ETHIOPIC SY 1792 1390..1399 ; DISALLOWED # ETHIOPIC TONAL MARK YIZET..ETHIOPIC TONAL MA 1793 139A..139F ; UNASSIGNED # .. 1794 13A0..13F4 ; PVALID # CHEROKEE LETTER A..CHEROKEE LETTER YV 1795 13F5..13FF ; UNASSIGNED # .. 1796 1400 ; DISALLOWED # CANADIAN SYLLABICS HYPHEN 1797 1401..166C ; PVALID # CANADIAN SYLLABICS E..CANADIAN SYLLABICS CAR 1798 166D..166E ; DISALLOWED # CANADIAN SYLLABICS CHI SIGN..CANADIAN SYLLAB 1799 166F..167F ; PVALID # CANADIAN SYLLABICS QAI..CANADIAN SYLLABICS B 1800 1680 ; DISALLOWED # OGHAM SPACE MARK 1801 1681..169A ; PVALID # OGHAM LETTER BEITH..OGHAM LETTER PEITH 1802 169B..169C ; DISALLOWED # OGHAM FEATHER MARK..OGHAM REVERSED FEATHER M 1803 169D..169F ; UNASSIGNED # .. 1804 16A0..16EA ; PVALID # RUNIC LETTER FEHU FEOH FE F..RUNIC LETTER X 1805 16EB..16F0 ; DISALLOWED # RUNIC SINGLE PUNCTUATION..RUNIC BELGTHOR SYM 1806 16F1..16FF ; UNASSIGNED # .. 1807 1700..170C ; PVALID # TAGALOG LETTER A..TAGALOG LETTER YA 1808 170D ; UNASSIGNED # 1809 170E..1714 ; PVALID # TAGALOG LETTER LA..TAGALOG SIGN VIRAMA 1810 1715..171F ; UNASSIGNED # .. 1811 1720..1734 ; PVALID # HANUNOO LETTER A..HANUNOO SIGN PAMUDPOD 1812 1735..1736 ; DISALLOWED # PHILIPPINE SINGLE PUNCTUATION..PHILIPPINE DO 1813 1737..173F ; UNASSIGNED # .. 1814 1740..1753 ; PVALID # BUHID LETTER A..BUHID VOWEL SIGN U 1815 1754..175F ; UNASSIGNED # .. 1816 1760..176C ; PVALID # TAGBANWA LETTER A..TAGBANWA LETTER YA 1817 176D ; UNASSIGNED # 1818 176E..1770 ; PVALID # TAGBANWA LETTER LA..TAGBANWA LETTER SA 1819 1771 ; UNASSIGNED # 1820 1772..1773 ; PVALID # TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U 1821 1774..177F ; UNASSIGNED # .. 1822 1780..17B3 ; PVALID # KHMER LETTER KA..KHMER INDEPENDENT VOWEL QAU 1823 17B4..17B5 ; DISALLOWED # KHMER VOWEL INHERENT AQ..KHMER VOWEL INHEREN 1824 17B6..17D3 ; PVALID # KHMER VOWEL SIGN AA..KHMER SIGN BATHAMASAT 1825 17D4..17D6 ; DISALLOWED # KHMER SIGN KHAN..KHMER SIGN CAMNUC PII KUUH 1826 17D7 ; PVALID # KHMER SIGN LEK TOO 1827 17D8..17DB ; DISALLOWED # KHMER SIGN BEYYAL..KHMER CURRENCY SYMBOL RIE 1828 17DC..17DD ; PVALID # KHMER SIGN AVAKRAHASANYA..KHMER SIGN ATTHACA 1829 17DE..17DF ; UNASSIGNED # .. 1830 17E0..17E9 ; PVALID # KHMER DIGIT ZERO..KHMER DIGIT NINE 1831 17EA..17EF ; UNASSIGNED # .. 1832 17F0..17F9 ; DISALLOWED # KHMER SYMBOL LEK ATTAK SON..KHMER SYMBOL LEK 1833 17FA..17FF ; UNASSIGNED # .. 1834 1800..180E ; DISALLOWED # MONGOLIAN BIRGA..MONGOLIAN VOWEL SEPARATOR 1835 180F ; UNASSIGNED # 1836 1810..1819 ; PVALID # MONGOLIAN DIGIT ZERO..MONGOLIAN DIGIT NINE 1837 181A..181F ; UNASSIGNED # .. 1838 1820..1877 ; PVALID # MONGOLIAN LETTER A..MONGOLIAN LETTER MANCHU 1839 1878..187F ; UNASSIGNED # .. 1840 1880..18AA ; PVALID # MONGOLIAN LETTER ALI GALI ANUSVARA ONE..MONG 1841 18AB..18AF ; UNASSIGNED # .. 1842 18B0..18F5 ; PVALID # CANADIAN SYLLABICS OY..CANADIAN SYLLABICS CA 1843 18F6..18FF ; UNASSIGNED # .. 1844 1900..191C ; PVALID # LIMBU VOWEL-CARRIER LETTER..LIMBU LETTER HA 1845 191D..191F ; UNASSIGNED # .. 1846 1920..192B ; PVALID # LIMBU VOWEL SIGN A..LIMBU SUBJOINED LETTER W 1847 192C..192F ; UNASSIGNED # .. 1848 1930..193B ; PVALID # LIMBU SMALL LETTER KA..LIMBU SIGN SA-I 1849 193C..193F ; UNASSIGNED # .. 1850 1940 ; DISALLOWED # LIMBU SIGN LOO 1851 1941..1943 ; UNASSIGNED # .. 1852 1944..1945 ; DISALLOWED # LIMBU EXCLAMATION MARK..LIMBU QUESTION MARK 1853 1946..196D ; PVALID # LIMBU DIGIT ZERO..TAI LE LETTER AI 1854 196E..196F ; UNASSIGNED # .. 1855 1970..1974 ; PVALID # TAI LE LETTER TONE-2..TAI LE LETTER TONE-6 1856 1975..197F ; UNASSIGNED # .. 1857 1980..19AB ; PVALID # NEW TAI LUE LETTER HIGH QA..NEW TAI LUE LETT 1858 19AC..19AF ; UNASSIGNED # .. 1859 19B0..19C9 ; PVALID # NEW TAI LUE VOWEL SIGN VOWEL SHORTENER..NEW 1860 19CA..19CF ; UNASSIGNED # .. 1861 19D0..19DA ; PVALID # NEW TAI LUE DIGIT ZERO..NEW TAI LUE THAM DIG 1862 19DB..19DD ; UNASSIGNED # .. 1863 19DE..19FF ; DISALLOWED # NEW TAI LUE SIGN LAE..KHMER SYMBOL DAP-PRAM 1864 1A00..1A1B ; PVALID # BUGINESE LETTER KA..BUGINESE VOWEL SIGN AE 1865 1A1C..1A1D ; UNASSIGNED # .. 1866 1A1E..1A1F ; DISALLOWED # BUGINESE PALLAWA..BUGINESE END OF SECTION 1867 1A20..1A5E ; PVALID # TAI THAM LETTER HIGH KA..TAI THAM CONSONANT 1868 1A5F ; UNASSIGNED # 1869 1A60..1A7C ; PVALID # TAI THAM SIGN SAKOT..TAI THAM SIGN KHUEN-LUE 1870 1A7D..1A7E ; UNASSIGNED # .. 1871 1A7F..1A89 ; PVALID # TAI THAM COMBINING CRYPTOGRAMMIC DOT..TAI TH 1872 1A8A..1A8F ; UNASSIGNED # .. 1873 1A90..1A99 ; PVALID # TAI THAM THAM DIGIT ZERO..TAI THAM THAM DIGI 1874 1A9A..1A9F ; UNASSIGNED # .. 1875 1AA0..1AA6 ; DISALLOWED # TAI THAM SIGN WIANG..TAI THAM SIGN REVERSED 1876 1AA7 ; PVALID # TAI THAM SIGN MAI YAMOK 1877 1AA8..1AAD ; DISALLOWED # TAI THAM SIGN KAAN..TAI THAM SIGN CAANG 1878 1AAE..1AFF ; UNASSIGNED # .. 1879 1B00..1B4B ; PVALID # BALINESE SIGN ULU RICEM..BALINESE LETTER ASY 1880 1B4C..1B4F ; UNASSIGNED # .. 1881 1B50..1B59 ; PVALID # BALINESE DIGIT ZERO..BALINESE DIGIT NINE 1882 1B5A..1B6A ; DISALLOWED # BALINESE PANTI..BALINESE MUSICAL SYMBOL DANG 1883 1B6B..1B73 ; PVALID # BALINESE MUSICAL SYMBOL COMBINING TEGEH..BAL 1884 1B74..1B7C ; DISALLOWED # BALINESE MUSICAL SYMBOL RIGHT-HAND OPEN DUG. 1885 1B7D..1B7F ; UNASSIGNED # .. 1886 1B80..1BAA ; PVALID # SUNDANESE SIGN PANYECEK..SUNDANESE SIGN PAMA 1887 1BAB..1BAD ; UNASSIGNED # .. 1888 1BAE..1BB9 ; PVALID # SUNDANESE LETTER KHA..SUNDANESE DIGIT NINE 1889 1BBA..1BFF ; UNASSIGNED # .. 1890 1C00..1C37 ; PVALID # LEPCHA LETTER KA..LEPCHA SIGN NUKTA 1891 1C38..1C3A ; UNASSIGNED # .. 1892 1C3B..1C3F ; DISALLOWED # LEPCHA PUNCTUATION TA-ROL..LEPCHA PUNCTUATIO 1893 1C40..1C49 ; PVALID # LEPCHA DIGIT ZERO..LEPCHA DIGIT NINE 1894 1C4A..1C4C ; UNASSIGNED # .. 1895 1C4D..1C7D ; PVALID # LEPCHA LETTER TTA..OL CHIKI AHAD 1896 1C7E..1C7F ; DISALLOWED # OL CHIKI PUNCTUATION MUCAAD..OL CHIKI PUNCTU 1897 1C80..1CCF ; UNASSIGNED # .. 1898 1CD0..1CD2 ; PVALID # VEDIC TONE KARSHANA..VEDIC TONE PRENKHA 1899 1CD3 ; DISALLOWED # VEDIC SIGN NIHSHVASA 1900 1CD4..1CF2 ; PVALID # VEDIC SIGN YAJURVEDIC MIDLINE SVARITA..VEDIC 1901 1CF3..1CFF ; UNASSIGNED # .. 1902 1D00..1D2B ; PVALID # LATIN LETTER SMALL CAPITAL A..CYRILLIC LETTE 1903 1D2C..1D2E ; DISALLOWED # MODIFIER LETTER CAPITAL A..MODIFIER LETTER C 1904 1D2F ; PVALID # MODIFIER LETTER CAPITAL BARRED B 1905 1D30..1D3A ; DISALLOWED # MODIFIER LETTER CAPITAL D..MODIFIER LETTER C 1906 1D3B ; PVALID # MODIFIER LETTER CAPITAL REVERSED N 1907 1D3C..1D4D ; DISALLOWED # MODIFIER LETTER CAPITAL O..MODIFIER LETTER S 1908 1D4E ; PVALID # MODIFIER LETTER SMALL TURNED I 1909 1D4F..1D6A ; DISALLOWED # MODIFIER LETTER SMALL K..GREEK SUBSCRIPT SMA 1910 1D6B..1D77 ; PVALID # LATIN SMALL LETTER UE..LATIN SMALL LETTER TU 1911 1D78 ; DISALLOWED # MODIFIER LETTER CYRILLIC EN 1912 1D79..1D9A ; PVALID # LATIN SMALL LETTER INSULAR G..LATIN SMALL LE 1913 1D9B..1DBF ; DISALLOWED # MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER 1914 1DC0..1DE6 ; PVALID # COMBINING DOTTED GRAVE ACCENT..COMBINING LAT 1915 1DE7..1DFC ; UNASSIGNED # .. 1916 1DFD..1DFF ; PVALID # COMBINING ALMOST EQUAL TO BELOW..COMBINING R 1917 1E00 ; DISALLOWED # LATIN CAPITAL LETTER A WITH RING BELOW 1918 1E01 ; PVALID # LATIN SMALL LETTER A WITH RING BELOW 1919 1E02 ; DISALLOWED # LATIN CAPITAL LETTER B WITH DOT ABOVE 1920 1E03 ; PVALID # LATIN SMALL LETTER B WITH DOT ABOVE 1921 1E04 ; DISALLOWED # LATIN CAPITAL LETTER B WITH DOT BELOW 1922 1E05 ; PVALID # LATIN SMALL LETTER B WITH DOT BELOW 1923 1E06 ; DISALLOWED # LATIN CAPITAL LETTER B WITH LINE BELOW 1924 1E07 ; PVALID # LATIN SMALL LETTER B WITH LINE BELOW 1925 1E08 ; DISALLOWED # LATIN CAPITAL LETTER C WITH CEDILLA AND ACUT 1926 1E09 ; PVALID # LATIN SMALL LETTER C WITH CEDILLA AND ACUTE 1927 1E0A ; DISALLOWED # LATIN CAPITAL LETTER D WITH DOT ABOVE 1928 1E0B ; PVALID # LATIN SMALL LETTER D WITH DOT ABOVE 1929 1E0C ; DISALLOWED # LATIN CAPITAL LETTER D WITH DOT BELOW 1930 1E0D ; PVALID # LATIN SMALL LETTER D WITH DOT BELOW 1931 1E0E ; DISALLOWED # LATIN CAPITAL LETTER D WITH LINE BELOW 1932 1E0F ; PVALID # LATIN SMALL LETTER D WITH LINE BELOW 1933 1E10 ; DISALLOWED # LATIN CAPITAL LETTER D WITH CEDILLA 1934 1E11 ; PVALID # LATIN SMALL LETTER D WITH CEDILLA 1935 1E12 ; DISALLOWED # LATIN CAPITAL LETTER D WITH CIRCUMFLEX BELOW 1936 1E13 ; PVALID # LATIN SMALL LETTER D WITH CIRCUMFLEX BELOW 1937 1E14 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON AND GRAVE 1938 1E15 ; PVALID # LATIN SMALL LETTER E WITH MACRON AND GRAVE 1939 1E16 ; DISALLOWED # LATIN CAPITAL LETTER E WITH MACRON AND ACUTE 1940 1E17 ; PVALID # LATIN SMALL LETTER E WITH MACRON AND ACUTE 1941 1E18 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX BELOW 1942 1E19 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX BELOW 1943 1E1A ; DISALLOWED # LATIN CAPITAL LETTER E WITH TILDE BELOW 1944 1E1B ; PVALID # LATIN SMALL LETTER E WITH TILDE BELOW 1945 1E1C ; DISALLOWED # LATIN CAPITAL LETTER E WITH CEDILLA AND BREV 1946 1E1D ; PVALID # LATIN SMALL LETTER E WITH CEDILLA AND BREVE 1947 1E1E ; DISALLOWED # LATIN CAPITAL LETTER F WITH DOT ABOVE 1948 1E1F ; PVALID # LATIN SMALL LETTER F WITH DOT ABOVE 1949 1E20 ; DISALLOWED # LATIN CAPITAL LETTER G WITH MACRON 1950 1E21 ; PVALID # LATIN SMALL LETTER G WITH MACRON 1951 1E22 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DOT ABOVE 1952 1E23 ; PVALID # LATIN SMALL LETTER H WITH DOT ABOVE 1953 1E24 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DOT BELOW 1954 1E25 ; PVALID # LATIN SMALL LETTER H WITH DOT BELOW 1955 1E26 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DIAERESIS 1956 1E27 ; PVALID # LATIN SMALL LETTER H WITH DIAERESIS 1957 1E28 ; DISALLOWED # LATIN CAPITAL LETTER H WITH CEDILLA 1958 1E29 ; PVALID # LATIN SMALL LETTER H WITH CEDILLA 1959 1E2A ; DISALLOWED # LATIN CAPITAL LETTER H WITH BREVE BELOW 1960 1E2B ; PVALID # LATIN SMALL LETTER H WITH BREVE BELOW 1961 1E2C ; DISALLOWED # LATIN CAPITAL LETTER I WITH TILDE BELOW 1962 1E2D ; PVALID # LATIN SMALL LETTER I WITH TILDE BELOW 1963 1E2E ; DISALLOWED # LATIN CAPITAL LETTER I WITH DIAERESIS AND AC 1964 1E2F ; PVALID # LATIN SMALL LETTER I WITH DIAERESIS AND ACUT 1965 1E30 ; DISALLOWED # LATIN CAPITAL LETTER K WITH ACUTE 1966 1E31 ; PVALID # LATIN SMALL LETTER K WITH ACUTE 1967 1E32 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DOT BELOW 1968 1E33 ; PVALID # LATIN SMALL LETTER K WITH DOT BELOW 1969 1E34 ; DISALLOWED # LATIN CAPITAL LETTER K WITH LINE BELOW 1970 1E35 ; PVALID # LATIN SMALL LETTER K WITH LINE BELOW 1971 1E36 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOT BELOW 1972 1E37 ; PVALID # LATIN SMALL LETTER L WITH DOT BELOW 1973 1E38 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOT BELOW AND MA 1974 1E39 ; PVALID # LATIN SMALL LETTER L WITH DOT BELOW AND MACR 1975 1E3A ; DISALLOWED # LATIN CAPITAL LETTER L WITH LINE BELOW 1976 1E3B ; PVALID # LATIN SMALL LETTER L WITH LINE BELOW 1977 1E3C ; DISALLOWED # LATIN CAPITAL LETTER L WITH CIRCUMFLEX BELOW 1978 1E3D ; PVALID # LATIN SMALL LETTER L WITH CIRCUMFLEX BELOW 1979 1E3E ; DISALLOWED # LATIN CAPITAL LETTER M WITH ACUTE 1980 1E3F ; PVALID # LATIN SMALL LETTER M WITH ACUTE 1981 1E40 ; DISALLOWED # LATIN CAPITAL LETTER M WITH DOT ABOVE 1982 1E41 ; PVALID # LATIN SMALL LETTER M WITH DOT ABOVE 1983 1E42 ; DISALLOWED # LATIN CAPITAL LETTER M WITH DOT BELOW 1984 1E43 ; PVALID # LATIN SMALL LETTER M WITH DOT BELOW 1985 1E44 ; DISALLOWED # LATIN CAPITAL LETTER N WITH DOT ABOVE 1986 1E45 ; PVALID # LATIN SMALL LETTER N WITH DOT ABOVE 1987 1E46 ; DISALLOWED # LATIN CAPITAL LETTER N WITH DOT BELOW 1988 1E47 ; PVALID # LATIN SMALL LETTER N WITH DOT BELOW 1989 1E48 ; DISALLOWED # LATIN CAPITAL LETTER N WITH LINE BELOW 1990 1E49 ; PVALID # LATIN SMALL LETTER N WITH LINE BELOW 1991 1E4A ; DISALLOWED # LATIN CAPITAL LETTER N WITH CIRCUMFLEX BELOW 1992 1E4B ; PVALID # LATIN SMALL LETTER N WITH CIRCUMFLEX BELOW 1993 1E4C ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND ACUTE 1994 1E4D ; PVALID # LATIN SMALL LETTER O WITH TILDE AND ACUTE 1995 1E4E ; DISALLOWED # LATIN CAPITAL LETTER O WITH TILDE AND DIAERE 1996 1E4F ; PVALID # LATIN SMALL LETTER O WITH TILDE AND DIAERESI 1997 1E50 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON AND GRAVE 1998 1E51 ; PVALID # LATIN SMALL LETTER O WITH MACRON AND GRAVE 1999 1E52 ; DISALLOWED # LATIN CAPITAL LETTER O WITH MACRON AND ACUTE 2000 1E53 ; PVALID # LATIN SMALL LETTER O WITH MACRON AND ACUTE 2001 1E54 ; DISALLOWED # LATIN CAPITAL LETTER P WITH ACUTE 2002 1E55 ; PVALID # LATIN SMALL LETTER P WITH ACUTE 2003 1E56 ; DISALLOWED # LATIN CAPITAL LETTER P WITH DOT ABOVE 2004 1E57 ; PVALID # LATIN SMALL LETTER P WITH DOT ABOVE 2005 1E58 ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT ABOVE 2006 1E59 ; PVALID # LATIN SMALL LETTER R WITH DOT ABOVE 2007 1E5A ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT BELOW 2008 1E5B ; PVALID # LATIN SMALL LETTER R WITH DOT BELOW 2009 1E5C ; DISALLOWED # LATIN CAPITAL LETTER R WITH DOT BELOW AND MA 2010 1E5D ; PVALID # LATIN SMALL LETTER R WITH DOT BELOW AND MACR 2011 1E5E ; DISALLOWED # LATIN CAPITAL LETTER R WITH LINE BELOW 2012 1E5F ; PVALID # LATIN SMALL LETTER R WITH LINE BELOW 2013 1E60 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT ABOVE 2014 1E61 ; PVALID # LATIN SMALL LETTER S WITH DOT ABOVE 2015 1E62 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT BELOW 2016 1E63 ; PVALID # LATIN SMALL LETTER S WITH DOT BELOW 2017 1E64 ; DISALLOWED # LATIN CAPITAL LETTER S WITH ACUTE AND DOT AB 2018 1E65 ; PVALID # LATIN SMALL LETTER S WITH ACUTE AND DOT ABOV 2019 1E66 ; DISALLOWED # LATIN CAPITAL LETTER S WITH CARON AND DOT AB 2020 1E67 ; PVALID # LATIN SMALL LETTER S WITH CARON AND DOT ABOV 2021 1E68 ; DISALLOWED # LATIN CAPITAL LETTER S WITH DOT BELOW AND DO 2022 1E69 ; PVALID # LATIN SMALL LETTER S WITH DOT BELOW AND DOT 2023 1E6A ; DISALLOWED # LATIN CAPITAL LETTER T WITH DOT ABOVE 2024 1E6B ; PVALID # LATIN SMALL LETTER T WITH DOT ABOVE 2025 1E6C ; DISALLOWED # LATIN CAPITAL LETTER T WITH DOT BELOW 2026 1E6D ; PVALID # LATIN SMALL LETTER T WITH DOT BELOW 2027 1E6E ; DISALLOWED # LATIN CAPITAL LETTER T WITH LINE BELOW 2028 1E6F ; PVALID # LATIN SMALL LETTER T WITH LINE BELOW 2029 1E70 ; DISALLOWED # LATIN CAPITAL LETTER T WITH CIRCUMFLEX BELOW 2030 1E71 ; PVALID # LATIN SMALL LETTER T WITH CIRCUMFLEX BELOW 2031 1E72 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DIAERESIS BELOW 2032 1E73 ; PVALID # LATIN SMALL LETTER U WITH DIAERESIS BELOW 2033 1E74 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE BELOW 2034 1E75 ; PVALID # LATIN SMALL LETTER U WITH TILDE BELOW 2035 1E76 ; DISALLOWED # LATIN CAPITAL LETTER U WITH CIRCUMFLEX BELOW 2036 1E77 ; PVALID # LATIN SMALL LETTER U WITH CIRCUMFLEX BELOW 2037 1E78 ; DISALLOWED # LATIN CAPITAL LETTER U WITH TILDE AND ACUTE 2038 1E79 ; PVALID # LATIN SMALL LETTER U WITH TILDE AND ACUTE 2039 1E7A ; DISALLOWED # LATIN CAPITAL LETTER U WITH MACRON AND DIAER 2040 1E7B ; PVALID # LATIN SMALL LETTER U WITH MACRON AND DIAERES 2041 1E7C ; DISALLOWED # LATIN CAPITAL LETTER V WITH TILDE 2042 1E7D ; PVALID # LATIN SMALL LETTER V WITH TILDE 2043 1E7E ; DISALLOWED # LATIN CAPITAL LETTER V WITH DOT BELOW 2044 1E7F ; PVALID # LATIN SMALL LETTER V WITH DOT BELOW 2045 1E80 ; DISALLOWED # LATIN CAPITAL LETTER W WITH GRAVE 2046 1E81 ; PVALID # LATIN SMALL LETTER W WITH GRAVE 2047 1E82 ; DISALLOWED # LATIN CAPITAL LETTER W WITH ACUTE 2048 1E83 ; PVALID # LATIN SMALL LETTER W WITH ACUTE 2049 1E84 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DIAERESIS 2050 1E85 ; PVALID # LATIN SMALL LETTER W WITH DIAERESIS 2051 1E86 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DOT ABOVE 2052 1E87 ; PVALID # LATIN SMALL LETTER W WITH DOT ABOVE 2053 1E88 ; DISALLOWED # LATIN CAPITAL LETTER W WITH DOT BELOW 2054 1E89 ; PVALID # LATIN SMALL LETTER W WITH DOT BELOW 2055 1E8A ; DISALLOWED # LATIN CAPITAL LETTER X WITH DOT ABOVE 2056 1E8B ; PVALID # LATIN SMALL LETTER X WITH DOT ABOVE 2057 1E8C ; DISALLOWED # LATIN CAPITAL LETTER X WITH DIAERESIS 2058 1E8D ; PVALID # LATIN SMALL LETTER X WITH DIAERESIS 2059 1E8E ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DOT ABOVE 2060 1E8F ; PVALID # LATIN SMALL LETTER Y WITH DOT ABOVE 2061 1E90 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH CIRCUMFLEX 2062 1E91 ; PVALID # LATIN SMALL LETTER Z WITH CIRCUMFLEX 2063 1E92 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DOT BELOW 2064 1E93 ; PVALID # LATIN SMALL LETTER Z WITH DOT BELOW 2065 1E94 ; DISALLOWED # LATIN CAPITAL LETTER Z WITH LINE BELOW 2066 1E95..1E99 ; PVALID # LATIN SMALL LETTER Z WITH LINE BELOW..LATIN 2067 1E9A..1E9B ; DISALLOWED # LATIN SMALL LETTER A WITH RIGHT HALF RING..L 2068 1E9C..1E9D ; PVALID # LATIN SMALL LETTER LONG S WITH DIAGONAL STRO 2069 1E9E ; DISALLOWED # LATIN CAPITAL LETTER SHARP S 2070 1E9F ; PVALID # LATIN SMALL LETTER DELTA 2071 1EA0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH DOT BELOW 2072 1EA1 ; PVALID # LATIN SMALL LETTER A WITH DOT BELOW 2073 1EA2 ; DISALLOWED # LATIN CAPITAL LETTER A WITH HOOK ABOVE 2074 1EA3 ; PVALID # LATIN SMALL LETTER A WITH HOOK ABOVE 2075 1EA4 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND A 2076 1EA5 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND ACU 2077 1EA6 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND G 2078 1EA7 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND GRA 2079 1EA8 ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND H 2080 1EA9 ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND HOO 2081 1EAA ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND T 2082 1EAB ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND TIL 2083 1EAC ; DISALLOWED # LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND D 2084 1EAD ; PVALID # LATIN SMALL LETTER A WITH CIRCUMFLEX AND DOT 2085 1EAE ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND ACUTE 2086 1EAF ; PVALID # LATIN SMALL LETTER A WITH BREVE AND ACUTE 2087 1EB0 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND GRAVE 2088 1EB1 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND GRAVE 2089 1EB2 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND HOOK A 2090 1EB3 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND HOOK ABO 2091 1EB4 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND TILDE 2092 1EB5 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND TILDE 2093 1EB6 ; DISALLOWED # LATIN CAPITAL LETTER A WITH BREVE AND DOT BE 2094 1EB7 ; PVALID # LATIN SMALL LETTER A WITH BREVE AND DOT BELO 2095 1EB8 ; DISALLOWED # LATIN CAPITAL LETTER E WITH DOT BELOW 2096 1EB9 ; PVALID # LATIN SMALL LETTER E WITH DOT BELOW 2097 1EBA ; DISALLOWED # LATIN CAPITAL LETTER E WITH HOOK ABOVE 2098 1EBB ; PVALID # LATIN SMALL LETTER E WITH HOOK ABOVE 2099 1EBC ; DISALLOWED # LATIN CAPITAL LETTER E WITH TILDE 2100 1EBD ; PVALID # LATIN SMALL LETTER E WITH TILDE 2101 1EBE ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND A 2102 1EBF ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND ACU 2103 1EC0 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND G 2104 1EC1 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND GRA 2105 1EC2 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND H 2106 1EC3 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND HOO 2107 1EC4 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND T 2108 1EC5 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND TIL 2109 1EC6 ; DISALLOWED # LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND D 2110 1EC7 ; PVALID # LATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT 2111 1EC8 ; DISALLOWED # LATIN CAPITAL LETTER I WITH HOOK ABOVE 2112 1EC9 ; PVALID # LATIN SMALL LETTER I WITH HOOK ABOVE 2113 1ECA ; DISALLOWED # LATIN CAPITAL LETTER I WITH DOT BELOW 2114 1ECB ; PVALID # LATIN SMALL LETTER I WITH DOT BELOW 2115 1ECC ; DISALLOWED # LATIN CAPITAL LETTER O WITH DOT BELOW 2116 1ECD ; PVALID # LATIN SMALL LETTER O WITH DOT BELOW 2117 1ECE ; DISALLOWED # LATIN CAPITAL LETTER O WITH HOOK ABOVE 2118 1ECF ; PVALID # LATIN SMALL LETTER O WITH HOOK ABOVE 2119 1ED0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND A 2120 1ED1 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND ACU 2121 1ED2 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND G 2122 1ED3 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND GRA 2123 1ED4 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND H 2124 1ED5 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND HOO 2125 1ED6 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND T 2126 1ED7 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND TIL 2127 1ED8 ; DISALLOWED # LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND D 2128 1ED9 ; PVALID # LATIN SMALL LETTER O WITH CIRCUMFLEX AND DOT 2129 1EDA ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND ACUTE 2130 1EDB ; PVALID # LATIN SMALL LETTER O WITH HORN AND ACUTE 2131 1EDC ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND GRAVE 2132 1EDD ; PVALID # LATIN SMALL LETTER O WITH HORN AND GRAVE 2133 1EDE ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND HOOK AB 2134 1EDF ; PVALID # LATIN SMALL LETTER O WITH HORN AND HOOK ABOV 2135 1EE0 ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND TILDE 2136 1EE1 ; PVALID # LATIN SMALL LETTER O WITH HORN AND TILDE 2137 1EE2 ; DISALLOWED # LATIN CAPITAL LETTER O WITH HORN AND DOT BEL 2138 1EE3 ; PVALID # LATIN SMALL LETTER O WITH HORN AND DOT BELOW 2139 1EE4 ; DISALLOWED # LATIN CAPITAL LETTER U WITH DOT BELOW 2140 1EE5 ; PVALID # LATIN SMALL LETTER U WITH DOT BELOW 2141 1EE6 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HOOK ABOVE 2142 1EE7 ; PVALID # LATIN SMALL LETTER U WITH HOOK ABOVE 2143 1EE8 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND ACUTE 2144 1EE9 ; PVALID # LATIN SMALL LETTER U WITH HORN AND ACUTE 2145 1EEA ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND GRAVE 2146 1EEB ; PVALID # LATIN SMALL LETTER U WITH HORN AND GRAVE 2147 1EEC ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND HOOK AB 2148 1EED ; PVALID # LATIN SMALL LETTER U WITH HORN AND HOOK ABOV 2149 1EEE ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND TILDE 2150 1EEF ; PVALID # LATIN SMALL LETTER U WITH HORN AND TILDE 2151 1EF0 ; DISALLOWED # LATIN CAPITAL LETTER U WITH HORN AND DOT BEL 2152 1EF1 ; PVALID # LATIN SMALL LETTER U WITH HORN AND DOT BELOW 2153 1EF2 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH GRAVE 2154 1EF3 ; PVALID # LATIN SMALL LETTER Y WITH GRAVE 2155 1EF4 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH DOT BELOW 2156 1EF5 ; PVALID # LATIN SMALL LETTER Y WITH DOT BELOW 2157 1EF6 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH HOOK ABOVE 2158 1EF7 ; PVALID # LATIN SMALL LETTER Y WITH HOOK ABOVE 2159 1EF8 ; DISALLOWED # LATIN CAPITAL LETTER Y WITH TILDE 2160 1EF9 ; PVALID # LATIN SMALL LETTER Y WITH TILDE 2161 1EFA ; DISALLOWED # LATIN CAPITAL LETTER MIDDLE-WELSH LL 2162 1EFB ; PVALID # LATIN SMALL LETTER MIDDLE-WELSH LL 2163 1EFC ; DISALLOWED # LATIN CAPITAL LETTER MIDDLE-WELSH V 2164 1EFD ; PVALID # LATIN SMALL LETTER MIDDLE-WELSH V 2165 1EFE ; DISALLOWED # LATIN CAPITAL LETTER Y WITH LOOP 2166 1EFF..1F07 ; PVALID # LATIN SMALL LETTER Y WITH LOOP..GREEK SMALL 2167 1F08..1F0F ; DISALLOWED # GREEK CAPITAL LETTER ALPHA WITH PSILI..GREEK 2168 1F10..1F15 ; PVALID # GREEK SMALL LETTER EPSILON WITH PSILI..GREEK 2169 1F16..1F17 ; UNASSIGNED # .. 2170 1F18..1F1D ; DISALLOWED # GREEK CAPITAL LETTER EPSILON WITH PSILI..GRE 2171 1F1E..1F1F ; UNASSIGNED # .. 2172 1F20..1F27 ; PVALID # GREEK SMALL LETTER ETA WITH PSILI..GREEK SMA 2173 1F28..1F2F ; DISALLOWED # GREEK CAPITAL LETTER ETA WITH PSILI..GREEK C 2174 1F30..1F37 ; PVALID # GREEK SMALL LETTER IOTA WITH PSILI..GREEK SM 2175 1F38..1F3F ; DISALLOWED # GREEK CAPITAL LETTER IOTA WITH PSILI..GREEK 2176 1F40..1F45 ; PVALID # GREEK SMALL LETTER OMICRON WITH PSILI..GREEK 2177 1F46..1F47 ; UNASSIGNED # .. 2178 1F48..1F4D ; DISALLOWED # GREEK CAPITAL LETTER OMICRON WITH PSILI..GRE 2179 1F4E..1F4F ; UNASSIGNED # .. 2180 1F50..1F57 ; PVALID # GREEK SMALL LETTER UPSILON WITH PSILI..GREEK 2181 1F58 ; UNASSIGNED # 2182 1F59 ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA 2183 1F5A ; UNASSIGNED # 2184 1F5B ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND 2185 1F5C ; UNASSIGNED # 2186 1F5D ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND 2187 1F5E ; UNASSIGNED # 2188 1F5F ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH DASIA AND 2189 1F60..1F67 ; PVALID # GREEK SMALL LETTER OMEGA WITH PSILI..GREEK S 2190 1F68..1F6F ; DISALLOWED # GREEK CAPITAL LETTER OMEGA WITH PSILI..GREEK 2191 1F70 ; PVALID # GREEK SMALL LETTER ALPHA WITH VARIA 2192 1F71 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH OXIA 2193 1F72 ; PVALID # GREEK SMALL LETTER EPSILON WITH VARIA 2194 1F73 ; DISALLOWED # GREEK SMALL LETTER EPSILON WITH OXIA 2195 1F74 ; PVALID # GREEK SMALL LETTER ETA WITH VARIA 2196 1F75 ; DISALLOWED # GREEK SMALL LETTER ETA WITH OXIA 2197 1F76 ; PVALID # GREEK SMALL LETTER IOTA WITH VARIA 2198 1F77 ; DISALLOWED # GREEK SMALL LETTER IOTA WITH OXIA 2199 1F78 ; PVALID # GREEK SMALL LETTER OMICRON WITH VARIA 2200 1F79 ; DISALLOWED # GREEK SMALL LETTER OMICRON WITH OXIA 2201 1F7A ; PVALID # GREEK SMALL LETTER UPSILON WITH VARIA 2202 1F7B ; DISALLOWED # GREEK SMALL LETTER UPSILON WITH OXIA 2203 1F7C ; PVALID # GREEK SMALL LETTER OMEGA WITH VARIA 2204 1F7D ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH OXIA 2205 1F7E..1F7F ; UNASSIGNED # .. 2206 1F80..1FAF ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH PSILI AND YPOG 2207 1FB0..1FB1 ; PVALID # GREEK SMALL LETTER ALPHA WITH VRACHY..GREEK 2208 1FB2..1FB4 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH VARIA AND YPOG 2209 1FB5 ; UNASSIGNED # 2210 1FB6 ; PVALID # GREEK SMALL LETTER ALPHA WITH PERISPOMENI 2211 1FB7..1FC4 ; DISALLOWED # GREEK SMALL LETTER ALPHA WITH PERISPOMENI AN 2212 1FC5 ; UNASSIGNED # 2213 1FC6 ; PVALID # GREEK SMALL LETTER ETA WITH PERISPOMENI 2214 1FC7..1FCF ; DISALLOWED # GREEK SMALL LETTER ETA WITH PERISPOMENI AND 2215 1FD0..1FD2 ; PVALID # GREEK SMALL LETTER IOTA WITH VRACHY..GREEK S 2216 1FD3 ; DISALLOWED # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND O 2217 1FD4..1FD5 ; UNASSIGNED # .. 2218 1FD6..1FD7 ; PVALID # GREEK SMALL LETTER IOTA WITH PERISPOMENI..GR 2219 1FD8..1FDB ; DISALLOWED # GREEK CAPITAL LETTER IOTA WITH VRACHY..GREEK 2220 1FDC ; UNASSIGNED # 2221 1FDD..1FDF ; DISALLOWED # GREEK DASIA AND VARIA..GREEK DASIA AND PERIS 2222 1FE0..1FE2 ; PVALID # GREEK SMALL LETTER UPSILON WITH VRACHY..GREE 2223 1FE3 ; DISALLOWED # GREEK SMALL LETTER UPSILON WITH DIALYTIKA AN 2224 1FE4..1FE7 ; PVALID # GREEK SMALL LETTER RHO WITH PSILI..GREEK SMA 2225 1FE8..1FEF ; DISALLOWED # GREEK CAPITAL LETTER UPSILON WITH VRACHY..GR 2226 1FF0..1FF1 ; UNASSIGNED # .. 2227 1FF2..1FF4 ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH VARIA AND YPOG 2228 1FF5 ; UNASSIGNED # 2229 1FF6 ; PVALID # GREEK SMALL LETTER OMEGA WITH PERISPOMENI 2230 1FF7..1FFE ; DISALLOWED # GREEK SMALL LETTER OMEGA WITH PERISPOMENI AN 2231 1FFF ; UNASSIGNED # 2232 2000..200B ; DISALLOWED # EN QUAD..ZERO WIDTH SPACE 2233 200C..200D ; CONTEXTJ # ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER 2234 200E..2064 ; DISALLOWED # LEFT-TO-RIGHT MARK..INVISIBLE PLUS 2235 2065..2069 ; UNASSIGNED # .. 2236 206A..2071 ; DISALLOWED # INHIBIT SYMMETRIC SWAPPING..SUPERSCRIPT LATI 2237 2072..2073 ; UNASSIGNED # .. 2238 2074..208E ; DISALLOWED # SUPERSCRIPT FOUR..SUBSCRIPT RIGHT PARENTHESI 2239 208F ; UNASSIGNED # 2240 2090..2094 ; DISALLOWED # LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCR 2241 2095..209F ; UNASSIGNED # .. 2242 20A0..20B8 ; DISALLOWED # EURO-CURRENCY SIGN..TENGE SIGN 2243 20B9..20CF ; UNASSIGNED # .. 2244 20D0..20F0 ; DISALLOWED # COMBINING LEFT HARPOON ABOVE..COMBINING ASTE 2245 20F1..20FF ; UNASSIGNED # .. 2246 2100..214D ; DISALLOWED # ACCOUNT OF..AKTIESELSKAB 2247 214E ; PVALID # TURNED SMALL F 2248 214F..2183 ; DISALLOWED # SYMBOL FOR SAMARITAN SOURCE..ROMAN NUMERAL R 2249 2184 ; PVALID # LATIN SMALL LETTER REVERSED C 2250 2185..2189 ; DISALLOWED # ROMAN NUMERAL SIX LATE FORM..VULGAR FRACTION 2251 218A..218F ; UNASSIGNED # .. 2252 2190..23E8 ; DISALLOWED # LEFTWARDS ARROW..DECIMAL EXPONENT SYMBOL 2253 23E9..23FF ; UNASSIGNED # .. 2254 2400..2426 ; DISALLOWED # SYMBOL FOR NULL..SYMBOL FOR SUBSTITUTE FORM 2255 2427..243F ; UNASSIGNED # .. 2256 2440..244A ; DISALLOWED # OCR HOOK..OCR DOUBLE BACKSLASH 2257 244B..245F ; UNASSIGNED # .. 2258 2460..26CD ; DISALLOWED # CIRCLED DIGIT ONE..DISABLED CAR 2259 26CE ; UNASSIGNED # 2260 26CF..26E1 ; DISALLOWED # PICK..RESTRICTED LEFT ENTRY-2 2261 26E2 ; UNASSIGNED # 2262 26E3 ; DISALLOWED # HEAVY CIRCLE WITH STROKE AND TWO DOTS ABOVE 2263 26E4..26E7 ; UNASSIGNED # .. 2264 26E8..26FF ; DISALLOWED # BLACK CROSS ON SHIELD..WHITE FLAG WITH HORIZ 2265 2700 ; UNASSIGNED # 2266 2701..2704 ; DISALLOWED # UPPER BLADE SCISSORS..WHITE SCISSORS 2267 2705 ; UNASSIGNED # 2268 2706..2709 ; DISALLOWED # TELEPHONE LOCATION SIGN..ENVELOPE 2269 270A..270B ; UNASSIGNED # .. 2270 270C..2727 ; DISALLOWED # VICTORY HAND..WHITE FOUR POINTED STAR 2271 2728 ; UNASSIGNED # 2272 2729..274B ; DISALLOWED # STRESS OUTLINED WHITE STAR..HEAVY EIGHT TEAR 2273 274C ; UNASSIGNED # 2274 274D ; DISALLOWED # SHADOWED WHITE CIRCLE 2275 274E ; UNASSIGNED # 2276 274F..2752 ; DISALLOWED # LOWER RIGHT DROP-SHADOWED WHITE SQUARE..UPPE 2277 2753..2755 ; UNASSIGNED # .. 2278 2756..275E ; DISALLOWED # BLACK DIAMOND MINUS WHITE X..HEAVY DOUBLE CO 2279 275F..2760 ; UNASSIGNED # .. 2280 2761..2794 ; DISALLOWED # CURVED STEM PARAGRAPH SIGN ORNAMENT..HEAVY W 2281 2795..2797 ; UNASSIGNED # .. 2282 2798..27AF ; DISALLOWED # HEAVY SOUTH EAST ARROW..NOTCHED LOWER RIGHT- 2283 27B0 ; UNASSIGNED # 2284 27B1..27BE ; DISALLOWED # NOTCHED UPPER RIGHT-SHADOWED WHITE RIGHTWARD 2285 27BF ; UNASSIGNED # 2286 27C0..27CA ; DISALLOWED # THREE DIMENSIONAL ANGLE..VERTICAL BAR WITH H 2287 27CB ; UNASSIGNED # 2288 27CC ; DISALLOWED # LONG DIVISION 2289 27CD..27CF ; UNASSIGNED # .. 2290 27D0..2B4C ; DISALLOWED # WHITE DIAMOND WITH CENTRED DOT..RIGHTWARDS A 2291 2B4D..2B4F ; UNASSIGNED # .. 2292 2B50..2B59 ; DISALLOWED # WHITE MEDIUM STAR..HEAVY CIRCLED SALTIRE 2293 2B5A..2BFF ; UNASSIGNED # .. 2294 2C00..2C2E ; DISALLOWED # GLAGOLITIC CAPITAL LETTER AZU..GLAGOLITIC CA 2295 2C2F ; UNASSIGNED # 2296 2C30..2C5E ; PVALID # GLAGOLITIC SMALL LETTER AZU..GLAGOLITIC SMAL 2297 2C5F ; UNASSIGNED # 2298 2C60 ; DISALLOWED # LATIN CAPITAL LETTER L WITH DOUBLE BAR 2299 2C61 ; PVALID # LATIN SMALL LETTER L WITH DOUBLE BAR 2300 2C62..2C64 ; DISALLOWED # LATIN CAPITAL LETTER L WITH MIDDLE TILDE..LA 2301 2C65..2C66 ; PVALID # LATIN SMALL LETTER A WITH STROKE..LATIN SMAL 2302 2C67 ; DISALLOWED # LATIN CAPITAL LETTER H WITH DESCENDER 2303 2C68 ; PVALID # LATIN SMALL LETTER H WITH DESCENDER 2304 2C69 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DESCENDER 2305 2C6A ; PVALID # LATIN SMALL LETTER K WITH DESCENDER 2306 2C6B ; DISALLOWED # LATIN CAPITAL LETTER Z WITH DESCENDER 2307 2C6C ; PVALID # LATIN SMALL LETTER Z WITH DESCENDER 2308 2C6D..2C70 ; DISALLOWED # LATIN CAPITAL LETTER ALPHA..LATIN CAPITAL LE 2309 2C71 ; PVALID # LATIN SMALL LETTER V WITH RIGHT HOOK 2310 2C72 ; DISALLOWED # LATIN CAPITAL LETTER W WITH HOOK 2311 2C73..2C74 ; PVALID # LATIN SMALL LETTER W WITH HOOK..LATIN SMALL 2312 2C75 ; DISALLOWED # LATIN CAPITAL LETTER HALF H 2313 2C76..2C7B ; PVALID # LATIN SMALL LETTER HALF H..LATIN LETTER SMAL 2314 2C7C..2C80 ; DISALLOWED # LATIN SUBSCRIPT SMALL LETTER J..COPTIC CAPIT 2315 2C81 ; PVALID # COPTIC SMALL LETTER ALFA 2316 2C82 ; DISALLOWED # COPTIC CAPITAL LETTER VIDA 2317 2C83 ; PVALID # COPTIC SMALL LETTER VIDA 2318 2C84 ; DISALLOWED # COPTIC CAPITAL LETTER GAMMA 2319 2C85 ; PVALID # COPTIC SMALL LETTER GAMMA 2320 2C86 ; DISALLOWED # COPTIC CAPITAL LETTER DALDA 2321 2C87 ; PVALID # COPTIC SMALL LETTER DALDA 2322 2C88 ; DISALLOWED # COPTIC CAPITAL LETTER EIE 2323 2C89 ; PVALID # COPTIC SMALL LETTER EIE 2324 2C8A ; DISALLOWED # COPTIC CAPITAL LETTER SOU 2325 2C8B ; PVALID # COPTIC SMALL LETTER SOU 2326 2C8C ; DISALLOWED # COPTIC CAPITAL LETTER ZATA 2327 2C8D ; PVALID # COPTIC SMALL LETTER ZATA 2328 2C8E ; DISALLOWED # COPTIC CAPITAL LETTER HATE 2329 2C8F ; PVALID # COPTIC SMALL LETTER HATE 2330 2C90 ; DISALLOWED # COPTIC CAPITAL LETTER THETHE 2331 2C91 ; PVALID # COPTIC SMALL LETTER THETHE 2332 2C92 ; DISALLOWED # COPTIC CAPITAL LETTER IAUDA 2333 2C93 ; PVALID # COPTIC SMALL LETTER IAUDA 2334 2C94 ; DISALLOWED # COPTIC CAPITAL LETTER KAPA 2335 2C95 ; PVALID # COPTIC SMALL LETTER KAPA 2336 2C96 ; DISALLOWED # COPTIC CAPITAL LETTER LAULA 2337 2C97 ; PVALID # COPTIC SMALL LETTER LAULA 2338 2C98 ; DISALLOWED # COPTIC CAPITAL LETTER MI 2339 2C99 ; PVALID # COPTIC SMALL LETTER MI 2340 2C9A ; DISALLOWED # COPTIC CAPITAL LETTER NI 2341 2C9B ; PVALID # COPTIC SMALL LETTER NI 2342 2C9C ; DISALLOWED # COPTIC CAPITAL LETTER KSI 2343 2C9D ; PVALID # COPTIC SMALL LETTER KSI 2344 2C9E ; DISALLOWED # COPTIC CAPITAL LETTER O 2345 2C9F ; PVALID # COPTIC SMALL LETTER O 2346 2CA0 ; DISALLOWED # COPTIC CAPITAL LETTER PI 2347 2CA1 ; PVALID # COPTIC SMALL LETTER PI 2348 2CA2 ; DISALLOWED # COPTIC CAPITAL LETTER RO 2349 2CA3 ; PVALID # COPTIC SMALL LETTER RO 2350 2CA4 ; DISALLOWED # COPTIC CAPITAL LETTER SIMA 2351 2CA5 ; PVALID # COPTIC SMALL LETTER SIMA 2352 2CA6 ; DISALLOWED # COPTIC CAPITAL LETTER TAU 2353 2CA7 ; PVALID # COPTIC SMALL LETTER TAU 2354 2CA8 ; DISALLOWED # COPTIC CAPITAL LETTER UA 2355 2CA9 ; PVALID # COPTIC SMALL LETTER UA 2356 2CAA ; DISALLOWED # COPTIC CAPITAL LETTER FI 2357 2CAB ; PVALID # COPTIC SMALL LETTER FI 2358 2CAC ; DISALLOWED # COPTIC CAPITAL LETTER KHI 2359 2CAD ; PVALID # COPTIC SMALL LETTER KHI 2360 2CAE ; DISALLOWED # COPTIC CAPITAL LETTER PSI 2361 2CAF ; PVALID # COPTIC SMALL LETTER PSI 2362 2CB0 ; DISALLOWED # COPTIC CAPITAL LETTER OOU 2363 2CB1 ; PVALID # COPTIC SMALL LETTER OOU 2364 2CB2 ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P ALEF 2365 2CB3 ; PVALID # COPTIC SMALL LETTER DIALECT-P ALEF 2366 2CB4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC AIN 2367 2CB5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC AIN 2368 2CB6 ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC EIE 2369 2CB7 ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC EIE 2370 2CB8 ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P KAPA 2371 2CB9 ; PVALID # COPTIC SMALL LETTER DIALECT-P KAPA 2372 2CBA ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P NI 2373 2CBB ; PVALID # COPTIC SMALL LETTER DIALECT-P NI 2374 2CBC ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC NI 2375 2CBD ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC NI 2376 2CBE ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC OOU 2377 2CBF ; PVALID # COPTIC SMALL LETTER OLD COPTIC OOU 2378 2CC0 ; DISALLOWED # COPTIC CAPITAL LETTER SAMPI 2379 2CC1 ; PVALID # COPTIC SMALL LETTER SAMPI 2380 2CC2 ; DISALLOWED # COPTIC CAPITAL LETTER CROSSED SHEI 2381 2CC3 ; PVALID # COPTIC SMALL LETTER CROSSED SHEI 2382 2CC4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC SHEI 2383 2CC5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC SHEI 2384 2CC6 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC ESH 2385 2CC7 ; PVALID # COPTIC SMALL LETTER OLD COPTIC ESH 2386 2CC8 ; DISALLOWED # COPTIC CAPITAL LETTER AKHMIMIC KHEI 2387 2CC9 ; PVALID # COPTIC SMALL LETTER AKHMIMIC KHEI 2388 2CCA ; DISALLOWED # COPTIC CAPITAL LETTER DIALECT-P HORI 2389 2CCB ; PVALID # COPTIC SMALL LETTER DIALECT-P HORI 2390 2CCC ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HORI 2391 2CCD ; PVALID # COPTIC SMALL LETTER OLD COPTIC HORI 2392 2CCE ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HA 2393 2CCF ; PVALID # COPTIC SMALL LETTER OLD COPTIC HA 2394 2CD0 ; DISALLOWED # COPTIC CAPITAL LETTER L-SHAPED HA 2395 2CD1 ; PVALID # COPTIC SMALL LETTER L-SHAPED HA 2396 2CD2 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HEI 2397 2CD3 ; PVALID # COPTIC SMALL LETTER OLD COPTIC HEI 2398 2CD4 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC HAT 2399 2CD5 ; PVALID # COPTIC SMALL LETTER OLD COPTIC HAT 2400 2CD6 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC GANGIA 2401 2CD7 ; PVALID # COPTIC SMALL LETTER OLD COPTIC GANGIA 2402 2CD8 ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC DJA 2403 2CD9 ; PVALID # COPTIC SMALL LETTER OLD COPTIC DJA 2404 2CDA ; DISALLOWED # COPTIC CAPITAL LETTER OLD COPTIC SHIMA 2405 2CDB ; PVALID # COPTIC SMALL LETTER OLD COPTIC SHIMA 2406 2CDC ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN SHIMA 2407 2CDD ; PVALID # COPTIC SMALL LETTER OLD NUBIAN SHIMA 2408 2CDE ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN NGI 2409 2CDF ; PVALID # COPTIC SMALL LETTER OLD NUBIAN NGI 2410 2CE0 ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN NYI 2411 2CE1 ; PVALID # COPTIC SMALL LETTER OLD NUBIAN NYI 2412 2CE2 ; DISALLOWED # COPTIC CAPITAL LETTER OLD NUBIAN WAU 2413 2CE3..2CE4 ; PVALID # COPTIC SMALL LETTER OLD NUBIAN WAU..COPTIC S 2414 2CE5..2CEB ; DISALLOWED # COPTIC SYMBOL MI RO..COPTIC CAPITAL LETTER C 2415 2CEC ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC SHEI 2416 2CED ; DISALLOWED # COPTIC CAPITAL LETTER CRYPTOGRAMMIC GANGIA 2417 2CEE..2CF1 ; PVALID # COPTIC SMALL LETTER CRYPTOGRAMMIC GANGIA..CO 2418 2CF2..2CF8 ; UNASSIGNED # .. 2419 2CF9..2CFF ; DISALLOWED # COPTIC OLD NUBIAN FULL STOP..COPTIC MORPHOLO 2420 2D00..2D25 ; PVALID # GEORGIAN SMALL LETTER AN..GEORGIAN SMALL LET 2421 2D26..2D2F ; UNASSIGNED # .. 2422 2D30..2D65 ; PVALID # TIFINAGH LETTER YA..TIFINAGH LETTER YAZZ 2423 2D66..2D6E ; UNASSIGNED # .. 2424 2D6F ; DISALLOWED # TIFINAGH MODIFIER LETTER LABIALIZATION MARK 2425 2D70..2D7F ; UNASSIGNED # .. 2426 2D80..2D96 ; PVALID # ETHIOPIC SYLLABLE LOA..ETHIOPIC SYLLABLE GGW 2427 2D97..2D9F ; UNASSIGNED # .. 2428 2DA0..2DA6 ; PVALID # ETHIOPIC SYLLABLE SSA..ETHIOPIC SYLLABLE SSO 2429 2DA7 ; UNASSIGNED # 2430 2DA8..2DAE ; PVALID # ETHIOPIC SYLLABLE CCA..ETHIOPIC SYLLABLE CCO 2431 2DAF ; UNASSIGNED # 2432 2DB0..2DB6 ; PVALID # ETHIOPIC SYLLABLE ZZA..ETHIOPIC SYLLABLE ZZO 2433 2DB7 ; UNASSIGNED # 2434 2DB8..2DBE ; PVALID # ETHIOPIC SYLLABLE CCHA..ETHIOPIC SYLLABLE CC 2435 2DBF ; UNASSIGNED # 2436 2DC0..2DC6 ; PVALID # ETHIOPIC SYLLABLE QYA..ETHIOPIC SYLLABLE QYO 2437 2DC7 ; UNASSIGNED # 2438 2DC8..2DCE ; PVALID # ETHIOPIC SYLLABLE KYA..ETHIOPIC SYLLABLE KYO 2439 2DCF ; UNASSIGNED # 2440 2DD0..2DD6 ; PVALID # ETHIOPIC SYLLABLE XYA..ETHIOPIC SYLLABLE XYO 2441 2DD7 ; UNASSIGNED # 2442 2DD8..2DDE ; PVALID # ETHIOPIC SYLLABLE GYA..ETHIOPIC SYLLABLE GYO 2443 2DDF ; UNASSIGNED # 2444 2DE0..2DFF ; PVALID # COMBINING CYRILLIC LETTER BE..COMBINING CYRI 2445 2E00..2E2E ; DISALLOWED # RIGHT ANGLE SUBSTITUTION MARKER..REVERSED QU 2446 2E2F ; PVALID # VERTICAL TILDE 2447 2E30..2E31 ; DISALLOWED # RING POINT..WORD SEPARATOR MIDDLE DOT 2448 2E32..2E7F ; UNASSIGNED # .. 2449 2E80..2E99 ; DISALLOWED # CJK RADICAL REPEAT..CJK RADICAL RAP 2450 2E9A ; UNASSIGNED # 2451 2E9B..2EF3 ; DISALLOWED # CJK RADICAL CHOKE..CJK RADICAL C-SIMPLIFIED 2452 2EF4..2EFF ; UNASSIGNED # .. 2453 2F00..2FD5 ; DISALLOWED # KANGXI RADICAL ONE..KANGXI RADICAL FLUTE 2454 2FD6..2FEF ; UNASSIGNED # .. 2455 2FF0..2FFB ; DISALLOWED # IDEOGRAPHIC DESCRIPTION CHARACTER LEFT TO RI 2456 2FFC..2FFF ; UNASSIGNED # .. 2457 3000..3004 ; DISALLOWED # IDEOGRAPHIC SPACE..JAPANESE INDUSTRIAL STAND 2458 3005..3007 ; PVALID # IDEOGRAPHIC ITERATION MARK..IDEOGRAPHIC NUMB 2459 3008..3029 ; DISALLOWED # LEFT ANGLE BRACKET..HANGZHOU NUMERAL NINE 2460 302A..302D ; PVALID # IDEOGRAPHIC LEVEL TONE MARK..IDEOGRAPHIC ENT 2461 302E..303B ; DISALLOWED # HANGUL SINGLE DOT TONE MARK..VERTICAL IDEOGR 2462 303C ; PVALID # MASU MARK 2463 303D..303F ; DISALLOWED # PART ALTERNATION MARK..IDEOGRAPHIC HALF FILL 2464 3040 ; UNASSIGNED # 2465 3041..3096 ; PVALID # HIRAGANA LETTER SMALL A..HIRAGANA LETTER SMA 2466 3097..3098 ; UNASSIGNED # .. 2467 3099..309A ; PVALID # COMBINING KATAKANA-HIRAGANA VOICED SOUND MAR 2468 309B..309C ; DISALLOWED # KATAKANA-HIRAGANA VOICED SOUND MARK..KATAKAN 2469 309D..309E ; PVALID # HIRAGANA ITERATION MARK..HIRAGANA VOICED ITE 2470 309F..30A0 ; DISALLOWED # HIRAGANA DIGRAPH YORI..KATAKANA-HIRAGANA DOU 2471 30A1..30FA ; PVALID # KATAKANA LETTER SMALL A..KATAKANA LETTER VO 2472 30FB ; CONTEXTO # KATAKANA MIDDLE DOT 2473 30FC..30FE ; PVALID # KATAKANA-HIRAGANA PROLONGED SOUND MARK..KATA 2474 30FF ; DISALLOWED # KATAKANA DIGRAPH KOTO 2475 3100..3104 ; UNASSIGNED # .. 2476 3105..312D ; PVALID # BOPOMOFO LETTER B..BOPOMOFO LETTER IH 2477 312E..3130 ; UNASSIGNED # .. 2478 3131..318E ; DISALLOWED # HANGUL LETTER KIYEOK..HANGUL LETTER ARAEAE 2479 318F ; UNASSIGNED # 2480 3190..319F ; DISALLOWED # IDEOGRAPHIC ANNOTATION LINKING MARK..IDEOGRA 2481 31A0..31B7 ; PVALID # BOPOMOFO LETTER BU..BOPOMOFO FINAL LETTER H 2482 31B8..31BF ; UNASSIGNED # .. 2483 31C0..31E3 ; DISALLOWED # CJK STROKE T..CJK STROKE Q 2484 31E4..31EF ; UNASSIGNED # .. 2485 31F0..31FF ; PVALID # KATAKANA LETTER SMALL KU..KATAKANA LETTER SM 2486 3200..321E ; DISALLOWED # PARENTHESIZED HANGUL KIYEOK..PARENTHESIZED K 2487 321F ; UNASSIGNED # 2488 3220..32FE ; DISALLOWED # PARENTHESIZED IDEOGRAPH ONE..CIRCLED KATAKAN 2489 32FF ; UNASSIGNED # 2490 3300..33FF ; DISALLOWED # SQUARE APAATO..SQUARE GAL 2491 3400..4DB5 ; PVALID # .... 2493 4DC0..4DFF ; DISALLOWED # HEXAGRAM FOR THE CREATIVE HEAVEN..HEXAGRAM F 2494 4E00..9FCB ; PVALID # .. 2495 9FCC..9FFF ; UNASSIGNED # .. 2496 A000..A48C ; PVALID # YI SYLLABLE IT..YI SYLLABLE YYR 2497 A48D..A48F ; UNASSIGNED # .. 2498 A490..A4C6 ; DISALLOWED # YI RADICAL QOT..YI RADICAL KE 2499 A4C7..A4CF ; UNASSIGNED # .. 2500 A4D0..A4FD ; PVALID # LISU LETTER BA..LISU LETTER TONE MYA JEU 2501 A4FE..A4FF ; DISALLOWED # LISU PUNCTUATION COMMA..LISU PUNCTUATION FUL 2502 A500..A60C ; PVALID # VAI SYLLABLE EE..VAI SYLLABLE LENGTHENER 2503 A60D..A60F ; DISALLOWED # VAI COMMA..VAI QUESTION MARK 2504 A610..A62B ; PVALID # VAI SYLLABLE NDOLE FA..VAI SYLLABLE NDOLE DO 2505 A62C..A63F ; UNASSIGNED # .. 2506 A640 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZEMLYA 2507 A641 ; PVALID # CYRILLIC SMALL LETTER ZEMLYA 2508 A642 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZELO 2509 A643 ; PVALID # CYRILLIC SMALL LETTER DZELO 2510 A644 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED DZE 2511 A645 ; PVALID # CYRILLIC SMALL LETTER REVERSED DZE 2512 A646 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTA 2513 A647 ; PVALID # CYRILLIC SMALL LETTER IOTA 2514 A648 ; DISALLOWED # CYRILLIC CAPITAL LETTER DJERV 2515 A649 ; PVALID # CYRILLIC SMALL LETTER DJERV 2516 A64A ; DISALLOWED # CYRILLIC CAPITAL LETTER MONOGRAPH UK 2517 A64B ; PVALID # CYRILLIC SMALL LETTER MONOGRAPH UK 2518 A64C ; DISALLOWED # CYRILLIC CAPITAL LETTER BROAD OMEGA 2519 A64D ; PVALID # CYRILLIC SMALL LETTER BROAD OMEGA 2520 A64E ; DISALLOWED # CYRILLIC CAPITAL LETTER NEUTRAL YER 2521 A64F ; PVALID # CYRILLIC SMALL LETTER NEUTRAL YER 2522 A650 ; DISALLOWED # CYRILLIC CAPITAL LETTER YERU WITH BACK YER 2523 A651 ; PVALID # CYRILLIC SMALL LETTER YERU WITH BACK YER 2524 A652 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED YAT 2525 A653 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED YAT 2526 A654 ; DISALLOWED # CYRILLIC CAPITAL LETTER REVERSED YU 2527 A655 ; PVALID # CYRILLIC SMALL LETTER REVERSED YU 2528 A656 ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED A 2529 A657 ; PVALID # CYRILLIC SMALL LETTER IOTIFIED A 2530 A658 ; DISALLOWED # CYRILLIC CAPITAL LETTER CLOSED LITTLE YUS 2531 A659 ; PVALID # CYRILLIC SMALL LETTER CLOSED LITTLE YUS 2532 A65A ; DISALLOWED # CYRILLIC CAPITAL LETTER BLENDED YUS 2533 A65B ; PVALID # CYRILLIC SMALL LETTER BLENDED YUS 2534 A65C ; DISALLOWED # CYRILLIC CAPITAL LETTER IOTIFIED CLOSED LITT 2535 A65D ; PVALID # CYRILLIC SMALL LETTER IOTIFIED CLOSED LITTLE 2536 A65E ; DISALLOWED # CYRILLIC CAPITAL LETTER YN 2537 A65F ; PVALID # CYRILLIC SMALL LETTER YN 2538 A660..A661 ; UNASSIGNED # .. 2539 A662 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT DE 2540 A663 ; PVALID # CYRILLIC SMALL LETTER SOFT DE 2541 A664 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT EL 2542 A665 ; PVALID # CYRILLIC SMALL LETTER SOFT EL 2543 A666 ; DISALLOWED # CYRILLIC CAPITAL LETTER SOFT EM 2544 A667 ; PVALID # CYRILLIC SMALL LETTER SOFT EM 2545 A668 ; DISALLOWED # CYRILLIC CAPITAL LETTER MONOCULAR O 2546 A669 ; PVALID # CYRILLIC SMALL LETTER MONOCULAR O 2547 A66A ; DISALLOWED # CYRILLIC CAPITAL LETTER BINOCULAR O 2548 A66B ; PVALID # CYRILLIC SMALL LETTER BINOCULAR O 2549 A66C ; DISALLOWED # CYRILLIC CAPITAL LETTER DOUBLE MONOCULAR O 2550 A66D..A66F ; PVALID # CYRILLIC SMALL LETTER DOUBLE MONOCULAR O..CO 2551 A670..A673 ; DISALLOWED # COMBINING CYRILLIC TEN MILLIONS SIGN..SLAVON 2552 A674..A67B ; UNASSIGNED # .. 2553 A67C..A67D ; PVALID # COMBINING CYRILLIC KAVYKA..COMBINING CYRILLI 2554 A67E ; DISALLOWED # CYRILLIC KAVYKA 2555 A67F ; PVALID # CYRILLIC PAYEROK 2556 A680 ; DISALLOWED # CYRILLIC CAPITAL LETTER DWE 2557 A681 ; PVALID # CYRILLIC SMALL LETTER DWE 2558 A682 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZWE 2559 A683 ; PVALID # CYRILLIC SMALL LETTER DZWE 2560 A684 ; DISALLOWED # CYRILLIC CAPITAL LETTER ZHWE 2561 A685 ; PVALID # CYRILLIC SMALL LETTER ZHWE 2562 A686 ; DISALLOWED # CYRILLIC CAPITAL LETTER CCHE 2563 A687 ; PVALID # CYRILLIC SMALL LETTER CCHE 2564 A688 ; DISALLOWED # CYRILLIC CAPITAL LETTER DZZE 2565 A689 ; PVALID # CYRILLIC SMALL LETTER DZZE 2566 A68A ; DISALLOWED # CYRILLIC CAPITAL LETTER TE WITH MIDDLE HOOK 2567 A68B ; PVALID # CYRILLIC SMALL LETTER TE WITH MIDDLE HOOK 2568 A68C ; DISALLOWED # CYRILLIC CAPITAL LETTER TWE 2569 A68D ; PVALID # CYRILLIC SMALL LETTER TWE 2570 A68E ; DISALLOWED # CYRILLIC CAPITAL LETTER TSWE 2571 A68F ; PVALID # CYRILLIC SMALL LETTER TSWE 2572 A690 ; DISALLOWED # CYRILLIC CAPITAL LETTER TSSE 2573 A691 ; PVALID # CYRILLIC SMALL LETTER TSSE 2574 A692 ; DISALLOWED # CYRILLIC CAPITAL LETTER TCHE 2575 A693 ; PVALID # CYRILLIC SMALL LETTER TCHE 2576 A694 ; DISALLOWED # CYRILLIC CAPITAL LETTER HWE 2577 A695 ; PVALID # CYRILLIC SMALL LETTER HWE 2578 A696 ; DISALLOWED # CYRILLIC CAPITAL LETTER SHWE 2579 A697 ; PVALID # CYRILLIC SMALL LETTER SHWE 2580 A698..A69F ; UNASSIGNED # .. 2581 A6A0..A6E5 ; PVALID # BAMUM LETTER A..BAMUM LETTER KI 2582 A6E6..A6EF ; DISALLOWED # BAMUM LETTER MO..BAMUM LETTER KOGHOM 2583 A6F0..A6F1 ; PVALID # BAMUM COMBINING MARK KOQNDON..BAMUM COMBININ 2584 A6F2..A6F7 ; DISALLOWED # BAMUM NJAEMLI..BAMUM QUESTION MARK 2585 A6F8..A6FF ; UNASSIGNED # .. 2586 A700..A716 ; DISALLOWED # MODIFIER LETTER CHINESE TONE YIN PING..MODIF 2587 A717..A71F ; PVALID # MODIFIER LETTER DOT VERTICAL BAR..MODIFIER L 2588 A720..A722 ; DISALLOWED # MODIFIER LETTER STRESS AND HIGH TONE..LATIN 2589 A723 ; PVALID # LATIN SMALL LETTER EGYPTOLOGICAL ALEF 2590 A724 ; DISALLOWED # LATIN CAPITAL LETTER EGYPTOLOGICAL AIN 2591 A725 ; PVALID # LATIN SMALL LETTER EGYPTOLOGICAL AIN 2592 A726 ; DISALLOWED # LATIN CAPITAL LETTER HENG 2593 A727 ; PVALID # LATIN SMALL LETTER HENG 2594 A728 ; DISALLOWED # LATIN CAPITAL LETTER TZ 2595 A729 ; PVALID # LATIN SMALL LETTER TZ 2596 A72A ; DISALLOWED # LATIN CAPITAL LETTER TRESILLO 2597 A72B ; PVALID # LATIN SMALL LETTER TRESILLO 2598 A72C ; DISALLOWED # LATIN CAPITAL LETTER CUATRILLO 2599 A72D ; PVALID # LATIN SMALL LETTER CUATRILLO 2600 A72E ; DISALLOWED # LATIN CAPITAL LETTER CUATRILLO WITH COMMA 2601 A72F..A731 ; PVALID # LATIN SMALL LETTER CUATRILLO WITH COMMA..LAT 2602 A732 ; DISALLOWED # LATIN CAPITAL LETTER AA 2603 A733 ; PVALID # LATIN SMALL LETTER AA 2604 A734 ; DISALLOWED # LATIN CAPITAL LETTER AO 2605 A735 ; PVALID # LATIN SMALL LETTER AO 2606 A736 ; DISALLOWED # LATIN CAPITAL LETTER AU 2607 A737 ; PVALID # LATIN SMALL LETTER AU 2608 A738 ; DISALLOWED # LATIN CAPITAL LETTER AV 2609 A739 ; PVALID # LATIN SMALL LETTER AV 2610 A73A ; DISALLOWED # LATIN CAPITAL LETTER AV WITH HORIZONTAL BAR 2611 A73B ; PVALID # LATIN SMALL LETTER AV WITH HORIZONTAL BAR 2612 A73C ; DISALLOWED # LATIN CAPITAL LETTER AY 2613 A73D ; PVALID # LATIN SMALL LETTER AY 2614 A73E ; DISALLOWED # LATIN CAPITAL LETTER REVERSED C WITH DOT 2615 A73F ; PVALID # LATIN SMALL LETTER REVERSED C WITH DOT 2616 A740 ; DISALLOWED # LATIN CAPITAL LETTER K WITH STROKE 2617 A741 ; PVALID # LATIN SMALL LETTER K WITH STROKE 2618 A742 ; DISALLOWED # LATIN CAPITAL LETTER K WITH DIAGONAL STROKE 2619 A743 ; PVALID # LATIN SMALL LETTER K WITH DIAGONAL STROKE 2620 A744 ; DISALLOWED # LATIN CAPITAL LETTER K WITH STROKE AND DIAGO 2621 A745 ; PVALID # LATIN SMALL LETTER K WITH STROKE AND DIAGONA 2622 A746 ; DISALLOWED # LATIN CAPITAL LETTER BROKEN L 2623 A747 ; PVALID # LATIN SMALL LETTER BROKEN L 2624 A748 ; DISALLOWED # LATIN CAPITAL LETTER L WITH HIGH STROKE 2625 A749 ; PVALID # LATIN SMALL LETTER L WITH HIGH STROKE 2626 A74A ; DISALLOWED # LATIN CAPITAL LETTER O WITH LONG STROKE OVER 2627 A74B ; PVALID # LATIN SMALL LETTER O WITH LONG STROKE OVERLA 2628 A74C ; DISALLOWED # LATIN CAPITAL LETTER O WITH LOOP 2629 A74D ; PVALID # LATIN SMALL LETTER O WITH LOOP 2630 A74E ; DISALLOWED # LATIN CAPITAL LETTER OO 2631 A74F ; PVALID # LATIN SMALL LETTER OO 2632 A750 ; DISALLOWED # LATIN CAPITAL LETTER P WITH STROKE THROUGH D 2633 A751 ; PVALID # LATIN SMALL LETTER P WITH STROKE THROUGH DES 2634 A752 ; DISALLOWED # LATIN CAPITAL LETTER P WITH FLOURISH 2635 A753 ; PVALID # LATIN SMALL LETTER P WITH FLOURISH 2636 A754 ; DISALLOWED # LATIN CAPITAL LETTER P WITH SQUIRREL TAIL 2637 A755 ; PVALID # LATIN SMALL LETTER P WITH SQUIRREL TAIL 2638 A756 ; DISALLOWED # LATIN CAPITAL LETTER Q WITH STROKE THROUGH D 2639 A757 ; PVALID # LATIN SMALL LETTER Q WITH STROKE THROUGH DES 2640 A758 ; DISALLOWED # LATIN CAPITAL LETTER Q WITH DIAGONAL STROKE 2641 A759 ; PVALID # LATIN SMALL LETTER Q WITH DIAGONAL STROKE 2642 A75A ; DISALLOWED # LATIN CAPITAL LETTER R ROTUNDA 2643 A75B ; PVALID # LATIN SMALL LETTER R ROTUNDA 2644 A75C ; DISALLOWED # LATIN CAPITAL LETTER RUM ROTUNDA 2645 A75D ; PVALID # LATIN SMALL LETTER RUM ROTUNDA 2646 A75E ; DISALLOWED # LATIN CAPITAL LETTER V WITH DIAGONAL STROKE 2647 A75F ; PVALID # LATIN SMALL LETTER V WITH DIAGONAL STROKE 2648 A760 ; DISALLOWED # LATIN CAPITAL LETTER VY 2649 A761 ; PVALID # LATIN SMALL LETTER VY 2650 A762 ; DISALLOWED # LATIN CAPITAL LETTER VISIGOTHIC Z 2651 A763 ; PVALID # LATIN SMALL LETTER VISIGOTHIC Z 2652 A764 ; DISALLOWED # LATIN CAPITAL LETTER THORN WITH STROKE 2653 A765 ; PVALID # LATIN SMALL LETTER THORN WITH STROKE 2654 A766 ; DISALLOWED # LATIN CAPITAL LETTER THORN WITH STROKE THROU 2655 A767 ; PVALID # LATIN SMALL LETTER THORN WITH STROKE THROUGH 2656 A768 ; DISALLOWED # LATIN CAPITAL LETTER VEND 2657 A769 ; PVALID # LATIN SMALL LETTER VEND 2658 A76A ; DISALLOWED # LATIN CAPITAL LETTER ET 2659 A76B ; PVALID # LATIN SMALL LETTER ET 2660 A76C ; DISALLOWED # LATIN CAPITAL LETTER IS 2661 A76D ; PVALID # LATIN SMALL LETTER IS 2662 A76E ; DISALLOWED # LATIN CAPITAL LETTER CON 2663 A76F ; PVALID # LATIN SMALL LETTER CON 2664 A770 ; DISALLOWED # MODIFIER LETTER US 2665 A771..A778 ; PVALID # LATIN SMALL LETTER DUM..LATIN SMALL LETTER U 2666 A779 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR D 2667 A77A ; PVALID # LATIN SMALL LETTER INSULAR D 2668 A77B ; DISALLOWED # LATIN CAPITAL LETTER INSULAR F 2669 A77C ; PVALID # LATIN SMALL LETTER INSULAR F 2670 A77D..A77E ; DISALLOWED # LATIN CAPITAL LETTER INSULAR G..LATIN CAPITA 2671 A77F ; PVALID # LATIN SMALL LETTER TURNED INSULAR G 2672 A780 ; DISALLOWED # LATIN CAPITAL LETTER TURNED L 2673 A781 ; PVALID # LATIN SMALL LETTER TURNED L 2674 A782 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR R 2675 A783 ; PVALID # LATIN SMALL LETTER INSULAR R 2676 A784 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR S 2677 A785 ; PVALID # LATIN SMALL LETTER INSULAR S 2678 A786 ; DISALLOWED # LATIN CAPITAL LETTER INSULAR T 2679 A787..A788 ; PVALID # LATIN SMALL LETTER INSULAR T..MODIFIER LETTE 2680 A789..A78B ; DISALLOWED # MODIFIER LETTER COLON..LATIN CAPITAL LETTER 2681 A78C ; PVALID # LATIN SMALL LETTER SALTILLO 2682 A78D..A7FA ; UNASSIGNED # .. 2683 A7FB..A827 ; PVALID # LATIN EPIGRAPHIC LETTER REVERSED F..SYLOTI N 2684 A828..A82B ; DISALLOWED # SYLOTI NAGRI POETRY MARK-1..SYLOTI NAGRI POE 2685 A82C..A82F ; UNASSIGNED # .. 2686 A830..A839 ; DISALLOWED # NORTH INDIC FRACTION ONE QUARTER..NORTH INDI 2687 A83A..A83F ; UNASSIGNED # .. 2688 A840..A873 ; PVALID # PHAGS-PA LETTER KA..PHAGS-PA LETTER CANDRABI 2689 A874..A877 ; DISALLOWED # PHAGS-PA SINGLE HEAD MARK..PHAGS-PA MARK DOU 2690 A878..A87F ; UNASSIGNED # .. 2691 A880..A8C4 ; PVALID # SAURASHTRA SIGN ANUSVARA..SAURASHTRA SIGN VI 2692 A8C5..A8CD ; UNASSIGNED # .. 2693 A8CE..A8CF ; DISALLOWED # SAURASHTRA DANDA..SAURASHTRA DOUBLE DANDA 2694 A8D0..A8D9 ; PVALID # SAURASHTRA DIGIT ZERO..SAURASHTRA DIGIT NINE 2695 A8DA..A8DF ; UNASSIGNED # .. 2696 A8E0..A8F7 ; PVALID # COMBINING DEVANAGARI DIGIT ZERO..DEVANAGARI 2697 A8F8..A8FA ; DISALLOWED # DEVANAGARI SIGN PUSHPIKA..DEVANAGARI CARET 2698 A8FB ; PVALID # DEVANAGARI HEADSTROKE 2699 A8FC..A8FF ; UNASSIGNED # .. 2700 A900..A92D ; PVALID # KAYAH LI DIGIT ZERO..KAYAH LI TONE CALYA PLO 2701 A92E..A92F ; DISALLOWED # KAYAH LI SIGN CWI..KAYAH LI SIGN SHYA 2702 A930..A953 ; PVALID # REJANG LETTER KA..REJANG VIRAMA 2703 A954..A95E ; UNASSIGNED # .. 2704 A95F..A97C ; DISALLOWED # REJANG SECTION MARK..HANGUL CHOSEONG SSANGYE 2705 A97D..A97F ; UNASSIGNED # .. 2706 A980..A9C0 ; PVALID # JAVANESE SIGN PANYANGGA..JAVANESE PANGKON 2707 A9C1..A9CD ; DISALLOWED # JAVANESE LEFT RERENGGAN..JAVANESE TURNED PAD 2708 A9CE ; UNASSIGNED # 2709 A9CF..A9D9 ; PVALID # JAVANESE PANGRANGKEP..JAVANESE DIGIT NINE 2710 A9DA..A9DD ; UNASSIGNED # .. 2711 A9DE..A9DF ; DISALLOWED # JAVANESE PADA TIRTA TUMETES..JAVANESE PADA I 2712 A9E0..A9FF ; UNASSIGNED # .. 2713 AA00..AA36 ; PVALID # CHAM LETTER A..CHAM CONSONANT SIGN WA 2714 AA37..AA3F ; UNASSIGNED # .. 2715 AA40..AA4D ; PVALID # CHAM LETTER FINAL K..CHAM CONSONANT SIGN FIN 2716 AA4E..AA4F ; UNASSIGNED # .. 2717 AA50..AA59 ; PVALID # CHAM DIGIT ZERO..CHAM DIGIT NINE 2718 AA5A..AA5B ; UNASSIGNED # .. 2719 AA5C..AA5F ; DISALLOWED # CHAM PUNCTUATION SPIRAL..CHAM PUNCTUATION TR 2720 AA60..AA76 ; PVALID # MYANMAR LETTER KHAMTI GA..MYANMAR LOGOGRAM K 2721 AA77..AA79 ; DISALLOWED # MYANMAR SYMBOL AITON EXCLAMATION..MYANMAR SY 2722 AA7A..AA7B ; PVALID # MYANMAR LETTER AITON RA..MYANMAR SIGN PAO KA 2723 AA7C..AA7F ; UNASSIGNED # .. 2724 AA80..AAC2 ; PVALID # TAI VIET LETTER LOW KO..TAI VIET TONE MAI SO 2725 AAC3..AADA ; UNASSIGNED # .. 2726 AADB..AADD ; PVALID # TAI VIET SYMBOL KON..TAI VIET SYMBOL SAM 2727 AADE..AADF ; DISALLOWED # TAI VIET SYMBOL HO HOI..TAI VIET SYMBOL KOI 2728 AAE0..ABBF ; UNASSIGNED # .. 2729 ABC0..ABEA ; PVALID # MEETEI MAYEK LETTER KOK..MEETEI MAYEK VOWEL 2730 ABEB ; DISALLOWED # MEETEI MAYEK CHEIKHEI 2731 ABEC..ABED ; PVALID # MEETEI MAYEK LUM IYEK..MEETEI MAYEK APUN IYE 2732 ABEE..ABEF ; UNASSIGNED # .. 2733 ABF0..ABF9 ; PVALID # MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT 2734 ABFA..ABFF ; UNASSIGNED # .. 2735 AC00..D7A3 ; PVALID # .. 2736 D7A4..D7AF ; UNASSIGNED # .. 2737 D7B0..D7C6 ; DISALLOWED # HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARA 2738 D7C7..D7CA ; UNASSIGNED # .. 2739 D7CB..D7FB ; DISALLOWED # HANGUL JONGSEONG NIEUN-RIEUL..HANGUL JONGSEO 2740 D7FC..D7FF ; UNASSIGNED # .. 2741 D800..FA0D ; DISALLOWED # ..CJK COMPAT 2742 FA0E..FA0F ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA0E..CJK COMPAT 2743 FA10 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA10 2744 FA11 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA11 2745 FA12 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA12 2746 FA13..FA14 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA13..CJK COMPAT 2747 FA15..FA1E ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA15..CJK COMPAT 2748 FA1F ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA1F 2749 FA20 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA20 2750 FA21 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA21 2751 FA22 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA22 2752 FA23..FA24 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA23..CJK COMPAT 2753 FA25..FA26 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA25..CJK COMPAT 2754 FA27..FA29 ; PVALID # CJK COMPATIBILITY IDEOGRAPH-FA27..CJK COMPAT 2755 FA2A..FA2D ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA2A..CJK COMPAT 2756 FA2E..FA2F ; UNASSIGNED # .. 2757 FA30..FA6D ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA30..CJK COMPAT 2758 FA6E..FA6F ; UNASSIGNED # .. 2759 FA70..FAD9 ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COMPAT 2760 FADA..FAFF ; UNASSIGNED # .. 2761 FB00..FB06 ; DISALLOWED # LATIN SMALL LIGATURE FF..LATIN SMALL LIGATUR 2762 FB07..FB12 ; UNASSIGNED # .. 2763 FB13..FB17 ; DISALLOWED # ARMENIAN SMALL LIGATURE MEN NOW..ARMENIAN SM 2764 FB18..FB1C ; UNASSIGNED # .. 2765 FB1D ; DISALLOWED # HEBREW LETTER YOD WITH HIRIQ 2766 FB1E ; PVALID # HEBREW POINT JUDEO-SPANISH VARIKA 2767 FB1F..FB36 ; DISALLOWED # HEBREW LIGATURE YIDDISH YOD YOD PATAH..HEBRE 2768 FB37 ; UNASSIGNED # 2769 FB38..FB3C ; DISALLOWED # HEBREW LETTER TET WITH DAGESH..HEBREW LETTER 2770 FB3D ; UNASSIGNED # 2771 FB3E ; DISALLOWED # HEBREW LETTER MEM WITH DAGESH 2772 FB3F ; UNASSIGNED # 2773 FB40..FB41 ; DISALLOWED # HEBREW LETTER NUN WITH DAGESH..HEBREW LETTER 2774 FB42 ; UNASSIGNED # 2775 FB43..FB44 ; DISALLOWED # HEBREW LETTER FINAL PE WITH DAGESH..HEBREW L 2776 FB45 ; UNASSIGNED # 2777 FB46..FBB1 ; DISALLOWED # HEBREW LETTER TSADI WITH DAGESH..ARABIC LETT 2778 FBB2..FBD2 ; UNASSIGNED # .. 2779 FBD3..FD3F ; DISALLOWED # ARABIC LETTER NG ISOLATED FORM..ORNATE RIGHT 2780 FD40..FD4F ; UNASSIGNED # .. 2781 FD50..FD8F ; DISALLOWED # ARABIC LIGATURE TEH WITH JEEM WITH MEEM INIT 2782 FD90..FD91 ; UNASSIGNED # .. 2783 FD92..FDC7 ; DISALLOWED # ARABIC LIGATURE MEEM WITH JEEM WITH KHAH INI 2784 FDC8..FDCF ; UNASSIGNED # .. 2785 FDD0..FDFD ; DISALLOWED # ..ARABIC LIGATURE BISMILLAH AR 2786 FDFE..FDFF ; UNASSIGNED # .. 2787 FE00..FE19 ; DISALLOWED # VARIATION SELECTOR-1..PRESENTATION FORM FOR 2788 FE1A..FE1F ; UNASSIGNED # .. 2789 FE20..FE26 ; PVALID # COMBINING LIGATURE LEFT HALF..COMBINING CONJ 2790 FE27..FE2F ; UNASSIGNED # .. 2791 FE30..FE52 ; DISALLOWED # PRESENTATION FORM FOR VERTICAL TWO DOT LEADE 2792 FE53 ; UNASSIGNED # 2793 FE54..FE66 ; DISALLOWED # SMALL SEMICOLON..SMALL EQUALS SIGN 2794 FE67 ; UNASSIGNED # 2795 FE68..FE6B ; DISALLOWED # SMALL REVERSE SOLIDUS..SMALL COMMERCIAL AT 2796 FE6C..FE6F ; UNASSIGNED # .. 2797 FE70..FE72 ; DISALLOWED # ARABIC FATHATAN ISOLATED FORM..ARABIC DAMMAT 2798 FE73 ; PVALID # ARABIC TAIL FRAGMENT 2799 FE74 ; DISALLOWED # ARABIC KASRATAN ISOLATED FORM 2800 FE75 ; UNASSIGNED # 2801 FE76..FEFC ; DISALLOWED # ARABIC FATHA ISOLATED FORM..ARABIC LIGATURE 2802 FEFD..FEFE ; UNASSIGNED # .. 2803 FEFF ; DISALLOWED # ZERO WIDTH NO-BREAK SPACE 2804 FF00 ; UNASSIGNED # 2805 FF01..FFBE ; DISALLOWED # FULLWIDTH EXCLAMATION MARK..HALFWIDTH HANGUL 2806 FFBF..FFC1 ; UNASSIGNED # .. 2807 FFC2..FFC7 ; DISALLOWED # HALFWIDTH HANGUL LETTER A..HALFWIDTH HANGUL 2808 FFC8..FFC9 ; UNASSIGNED # .. 2809 FFCA..FFCF ; DISALLOWED # HALFWIDTH HANGUL LETTER YEO..HALFWIDTH HANGU 2810 FFD0..FFD1 ; UNASSIGNED # .. 2811 FFD2..FFD7 ; DISALLOWED # HALFWIDTH HANGUL LETTER YO..HALFWIDTH HANGUL 2812 FFD8..FFD9 ; UNASSIGNED # .. 2813 FFDA..FFDC ; DISALLOWED # HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL 2814 FFDD..FFDF ; UNASSIGNED # .. 2815 FFE0..FFE6 ; DISALLOWED # FULLWIDTH CENT SIGN..FULLWIDTH WON SIGN 2816 FFE7 ; UNASSIGNED # 2817 FFE8..FFEE ; DISALLOWED # HALFWIDTH FORMS LIGHT VERTICAL..HALFWIDTH WH 2818 FFEF..FFF8 ; UNASSIGNED # .. 2819 FFF9..FFFF ; DISALLOWED # INTERLINEAR ANNOTATION ANCHOR.. 2822 1000D..10026; PVALID # LINEAR B SYLLABLE B036 JO..LINEAR B SYLLABLE 2823 10027 ; UNASSIGNED # 2824 10028..1003A; PVALID # LINEAR B SYLLABLE B060 RA..LINEAR B SYLLABLE 2825 1003B ; UNASSIGNED # 2826 1003C..1003D; PVALID # LINEAR B SYLLABLE B017 ZA..LINEAR B SYLLABLE 2827 1003E ; UNASSIGNED # 2828 1003F..1004D; PVALID # LINEAR B SYLLABLE B020 ZO..LINEAR B SYLLABLE 2829 1004E..1004F; UNASSIGNED # .. 2830 10050..1005D; PVALID # LINEAR B SYMBOL B018..LINEAR B SYMBOL B089 2831 1005E..1007F; UNASSIGNED # .. 2832 10080..100FA; PVALID # LINEAR B IDEOGRAM B100 MAN..LINEAR B IDEOGRA 2833 100FB..100FF; UNASSIGNED # .. 2834 10100..10102; DISALLOWED # AEGEAN WORD SEPARATOR LINE..AEGEAN CHECK MAR 2835 10103..10106; UNASSIGNED # .. 2836 10107..10133; DISALLOWED # AEGEAN NUMBER ONE..AEGEAN NUMBER NINETY THOU 2837 10134..10136; UNASSIGNED # .. 2838 10137..1018A; DISALLOWED # AEGEAN WEIGHT BASE UNIT..GREEK ZERO SIGN 2839 1018B..1018F; UNASSIGNED # .. 2840 10190..1019B; DISALLOWED # ROMAN SEXTANS SIGN..ROMAN CENTURIAL SIGN 2841 1019C..101CF; UNASSIGNED # .. 2842 101D0..101FC; DISALLOWED # PHAISTOS DISC SIGN PEDESTRIAN..PHAISTOS DISC 2843 101FD ; PVALID # PHAISTOS DISC SIGN COMBINING OBLIQUE STROKE 2844 101FE..1027F; UNASSIGNED # .. 2845 10280..1029C; PVALID # LYCIAN LETTER A..LYCIAN LETTER X 2846 1029D..1029F; UNASSIGNED # .. 2847 102A0..102D0; PVALID # CARIAN LETTER A..CARIAN LETTER UUU3 2848 102D1..102FF; UNASSIGNED # .. 2849 10300..1031E; PVALID # OLD ITALIC LETTER A..OLD ITALIC LETTER UU 2850 1031F ; UNASSIGNED # 2851 10320..10323; DISALLOWED # OLD ITALIC NUMERAL ONE..OLD ITALIC NUMERAL F 2852 10324..1032F; UNASSIGNED # .. 2853 10330..10340; PVALID # GOTHIC LETTER AHSA..GOTHIC LETTER PAIRTHRA 2854 10341 ; DISALLOWED # GOTHIC LETTER NINETY 2855 10342..10349; PVALID # GOTHIC LETTER RAIDA..GOTHIC LETTER OTHAL 2856 1034A ; DISALLOWED # GOTHIC LETTER NINE HUNDRED 2857 1034B..1037F; UNASSIGNED # .. 2858 10380..1039D; PVALID # UGARITIC LETTER ALPA..UGARITIC LETTER SSU 2859 1039E ; UNASSIGNED # 2860 1039F ; DISALLOWED # UGARITIC WORD DIVIDER 2861 103A0..103C3; PVALID # OLD PERSIAN SIGN A..OLD PERSIAN SIGN HA 2862 103C4..103C7; UNASSIGNED # .. 2863 103C8..103CF; PVALID # OLD PERSIAN SIGN AURAMAZDAA..OLD PERSIAN SIG 2864 103D0..103D5; DISALLOWED # OLD PERSIAN WORD DIVIDER..OLD PERSIAN NUMBER 2865 103D6..103FF; UNASSIGNED # .. 2866 10400..10427; DISALLOWED # DESERET CAPITAL LETTER LONG I..DESERET CAPIT 2867 10428..1049D; PVALID # DESERET SMALL LETTER LONG I..OSMANYA LETTER 2868 1049E..1049F; UNASSIGNED # .. 2869 104A0..104A9; PVALID # OSMANYA DIGIT ZERO..OSMANYA DIGIT NINE 2870 104AA..107FF; UNASSIGNED # .. 2871 10800..10805; PVALID # CYPRIOT SYLLABLE A..CYPRIOT SYLLABLE JA 2872 10806..10807; UNASSIGNED # .. 2873 10808 ; PVALID # CYPRIOT SYLLABLE JO 2874 10809 ; UNASSIGNED # 2875 1080A..10835; PVALID # CYPRIOT SYLLABLE KA..CYPRIOT SYLLABLE WO 2876 10836 ; UNASSIGNED # 2877 10837..10838; PVALID # CYPRIOT SYLLABLE XA..CYPRIOT SYLLABLE XE 2878 10839..1083B; UNASSIGNED # .. 2879 1083C ; PVALID # CYPRIOT SYLLABLE ZA 2880 1083D..1083E; UNASSIGNED # .. 2881 1083F..10855; PVALID # CYPRIOT SYLLABLE ZO..IMPERIAL ARAMAIC LETTER 2882 10856 ; UNASSIGNED # 2883 10857..1085F; DISALLOWED # IMPERIAL ARAMAIC SECTION SIGN..IMPERIAL ARAM 2884 10860..108FF; UNASSIGNED # .. 2885 10900..10915; PVALID # PHOENICIAN LETTER ALF..PHOENICIAN LETTER TAU 2886 10916..1091B; DISALLOWED # PHOENICIAN NUMBER ONE..PHOENICIAN NUMBER THR 2887 1091C..1091E; UNASSIGNED # .. 2888 1091F ; DISALLOWED # PHOENICIAN WORD SEPARATOR 2889 10920..10939; PVALID # LYDIAN LETTER A..LYDIAN LETTER C 2890 1093A..1093E; UNASSIGNED # .. 2891 1093F ; DISALLOWED # LYDIAN TRIANGULAR MARK 2892 10940..109FF; UNASSIGNED # .. 2893 10A00..10A03; PVALID # KHAROSHTHI LETTER A..KHAROSHTHI VOWEL SIGN V 2894 10A04 ; UNASSIGNED # 2895 10A05..10A06; PVALID # KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SI 2896 10A07..10A0B; UNASSIGNED # .. 2897 10A0C..10A13; PVALID # KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI LET 2898 10A14 ; UNASSIGNED # 2899 10A15..10A17; PVALID # KHAROSHTHI LETTER CA..KHAROSHTHI LETTER JA 2900 10A18 ; UNASSIGNED # 2901 10A19..10A33; PVALID # KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER TTT 2902 10A34..10A37; UNASSIGNED # .. 2903 10A38..10A3A; PVALID # KHAROSHTHI SIGN BAR ABOVE..KHAROSHTHI SIGN D 2904 10A3B..10A3E; UNASSIGNED # .. 2905 10A3F ; PVALID # KHAROSHTHI VIRAMA 2906 10A40..10A47; DISALLOWED # KHAROSHTHI DIGIT ONE..KHAROSHTHI NUMBER ONE 2907 10A48..10A4F; UNASSIGNED # .. 2908 10A50..10A58; DISALLOWED # KHAROSHTHI PUNCTUATION DOT..KHAROSHTHI PUNCT 2909 10A59..10A5F; UNASSIGNED # .. 2910 10A60..10A7C; PVALID # OLD SOUTH ARABIAN LETTER HE..OLD SOUTH ARABI 2911 10A7D..10A7F; DISALLOWED # OLD SOUTH ARABIAN NUMBER ONE..OLD SOUTH ARAB 2912 10A80..10AFF; UNASSIGNED # .. 2913 10B00..10B35; PVALID # AVESTAN LETTER A..AVESTAN LETTER HE 2914 10B36..10B38; UNASSIGNED # .. 2915 10B39..10B3F; DISALLOWED # AVESTAN ABBREVIATION MARK..LARGE ONE RING OV 2916 10B40..10B55; PVALID # INSCRIPTIONAL PARTHIAN LETTER ALEPH..INSCRIP 2917 10B56..10B57; UNASSIGNED # .. 2918 10B58..10B5F; DISALLOWED # INSCRIPTIONAL PARTHIAN NUMBER ONE..INSCRIPTI 2919 10B60..10B72; PVALID # INSCRIPTIONAL PAHLAVI LETTER ALEPH..INSCRIPT 2920 10B73..10B77; UNASSIGNED # .. 2921 10B78..10B7F; DISALLOWED # INSCRIPTIONAL PAHLAVI NUMBER ONE..INSCRIPTIO 2922 10B80..10BFF; UNASSIGNED # .. 2923 10C00..10C48; PVALID # OLD TURKIC LETTER ORKHON A..OLD TURKIC LETTE 2924 10C49..10E5F; UNASSIGNED # .. 2925 10E60..10E7E; DISALLOWED # RUMI DIGIT ONE..RUMI FRACTION TWO THIRDS 2926 10E7F..1107F; UNASSIGNED # .. 2927 11080..110BA; PVALID # KAITHI SIGN CANDRABINDU..KAITHI SIGN NUKTA 2928 110BB..110C1; DISALLOWED # KAITHI ABBREVIATION SIGN..KAITHI DOUBLE DAND 2929 110C2..11FFF; UNASSIGNED # .. 2930 12000..1236E; PVALID # CUNEIFORM SIGN A..CUNEIFORM SIGN ZUM 2931 1236F..123FF; UNASSIGNED # .. 2932 12400..12462; DISALLOWED # CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NU 2933 12463..1246F; UNASSIGNED # .. 2934 12470..12473; DISALLOWED # CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD 2935 12474..12FFF; UNASSIGNED # .. 2936 13000..1342E; PVALID # EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYP 2937 1342F..1CFFF; UNASSIGNED # .. 2938 1D000..1D0F5; DISALLOWED # BYZANTINE MUSICAL SYMBOL PSILI..BYZANTINE MU 2939 1D0F6..1D0FF; UNASSIGNED # .. 2940 1D100..1D126; DISALLOWED # MUSICAL SYMBOL SINGLE BARLINE..MUSICAL SYMBO 2941 1D127..1D128; UNASSIGNED # .. 2942 1D129..1D1DD; DISALLOWED # MUSICAL SYMBOL MULTIPLE MEASURE REST..MUSICA 2943 1D1DE..1D1FF; UNASSIGNED # .. 2944 1D200..1D245; DISALLOWED # GREEK VOCAL NOTATION SYMBOL-1..GREEK MUSICAL 2945 1D246..1D2FF; UNASSIGNED # .. 2946 1D300..1D356; DISALLOWED # MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING 2947 1D357..1D35F; UNASSIGNED # .. 2948 1D360..1D371; DISALLOWED # COUNTING ROD UNIT DIGIT ONE..COUNTING ROD TE 2949 1D372..1D3FF; UNASSIGNED # .. 2950 1D400..1D454; DISALLOWED # MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL IT 2951 1D455 ; UNASSIGNED # 2952 1D456..1D49C; DISALLOWED # MATHEMATICAL ITALIC SMALL I..MATHEMATICAL SC 2953 1D49D ; UNASSIGNED # 2954 1D49E..1D49F; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL 2955 1D4A0..1D4A1; UNASSIGNED # .. 2956 1D4A2 ; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL G 2957 1D4A3..1D4A4; UNASSIGNED # .. 2958 1D4A5..1D4A6; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL J..MATHEMATICAL 2959 1D4A7..1D4A8; UNASSIGNED # .. 2960 1D4A9..1D4AC; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL N..MATHEMATICAL 2961 1D4AD ; UNASSIGNED # 2962 1D4AE..1D4B9; DISALLOWED # MATHEMATICAL SCRIPT CAPITAL S..MATHEMATICAL 2963 1D4BA ; UNASSIGNED # 2964 1D4BB ; DISALLOWED # MATHEMATICAL SCRIPT SMALL F 2965 1D4BC ; UNASSIGNED # 2966 1D4BD..1D4C3; DISALLOWED # MATHEMATICAL SCRIPT SMALL H..MATHEMATICAL SC 2967 1D4C4 ; UNASSIGNED # 2968 1D4C5..1D505; DISALLOWED # MATHEMATICAL SCRIPT SMALL P..MATHEMATICAL FR 2969 1D506 ; UNASSIGNED # 2970 1D507..1D50A; DISALLOWED # MATHEMATICAL FRAKTUR CAPITAL D..MATHEMATICAL 2971 1D50B..1D50C; UNASSIGNED # .. 2972 1D50D..1D514; DISALLOWED # MATHEMATICAL FRAKTUR CAPITAL J..MATHEMATICAL 2973 1D515 ; UNASSIGNED # 2974 1D516..1D51C; DISALLOWED # MATHEMATICAL FRAKTUR CAPITAL S..MATHEMATICAL 2975 1D51D ; UNASSIGNED # 2976 1D51E..1D539; DISALLOWED # MATHEMATICAL FRAKTUR SMALL A..MATHEMATICAL D 2977 1D53A ; UNASSIGNED # 2978 1D53B..1D53E; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL D..MATHEM 2979 1D53F ; UNASSIGNED # 2980 1D540..1D544; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL I..MATHEM 2981 1D545 ; UNASSIGNED # 2982 1D546 ; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL O 2983 1D547..1D549; UNASSIGNED # .. 2984 1D54A..1D550; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK CAPITAL S..MATHEM 2985 1D551 ; UNASSIGNED # 2986 1D552..1D6A5; DISALLOWED # MATHEMATICAL DOUBLE-STRUCK SMALL A..MATHEMAT 2987 1D6A6..1D6A7; UNASSIGNED # .. 2988 1D6A8..1D7CB; DISALLOWED # MATHEMATICAL BOLD CAPITAL ALPHA..MATHEMATICA 2989 1D7CC..1D7CD; UNASSIGNED # .. 2990 1D7CE..1D7FF; DISALLOWED # MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL M 2991 1D800..1EFFF; UNASSIGNED # .. 2992 1F000..1F02B; DISALLOWED # MAHJONG TILE EAST WIND..MAHJONG TILE BACK 2993 1F02C..1F02F; UNASSIGNED # .. 2994 1F030..1F093; DISALLOWED # DOMINO TILE HORIZONTAL BACK..DOMINO TILE VER 2995 1F094..1F0FF; UNASSIGNED # .. 2996 1F100..1F10A; DISALLOWED # DIGIT ZERO FULL STOP..DIGIT NINE COMMA 2997 1F10B..1F10F; UNASSIGNED # .. 2998 1F110..1F12E; DISALLOWED # PARENTHESIZED LATIN CAPITAL LETTER A..CIRCLE 2999 1F12F..1F130; UNASSIGNED # .. 3000 1F131 ; DISALLOWED # SQUARED LATIN CAPITAL LETTER B 3001 1F132..1F13C; UNASSIGNED # .. 3002 1F13D ; DISALLOWED # SQUARED LATIN CAPITAL LETTER N 3003 1F13E ; UNASSIGNED # 3004 1F13F ; DISALLOWED # SQUARED LATIN CAPITAL LETTER P 3005 1F140..1F141; UNASSIGNED # .. 3006 1F142 ; DISALLOWED # SQUARED LATIN CAPITAL LETTER S 3007 1F143..1F145; UNASSIGNED # .. 3008 1F146 ; DISALLOWED # SQUARED LATIN CAPITAL LETTER W 3009 1F147..1F149; UNASSIGNED # .. 3010 1F14A..1F14E; DISALLOWED # SQUARED HV..SQUARED PPV 3011 1F14F..1F156; UNASSIGNED # .. 3012 1F157 ; DISALLOWED # NEGATIVE CIRCLED LATIN CAPITAL LETTER H 3013 1F158..1F15E; UNASSIGNED # .. 3014 1F15F ; DISALLOWED # NEGATIVE CIRCLED LATIN CAPITAL LETTER P 3015 1F160..1F178; UNASSIGNED # .. 3016 1F179 ; DISALLOWED # NEGATIVE SQUARED LATIN CAPITAL LETTER J 3017 1F17A ; UNASSIGNED # 3018 1F17B..1F17C; DISALLOWED # NEGATIVE SQUARED LATIN CAPITAL LETTER L..NEG 3019 1F17D..1F17E; UNASSIGNED # .. 3020 1F17F ; DISALLOWED # NEGATIVE SQUARED LATIN CAPITAL LETTER P 3021 1F180..1F189; UNASSIGNED # .. 3022 1F18A..1F18D; DISALLOWED # CROSSED NEGATIVE SQUARED LATIN CAPITAL LETTE 3023 1F18E..1F18F; UNASSIGNED # .. 3024 1F190 ; DISALLOWED # SQUARE DJ 3025 1F191..1F1FF; UNASSIGNED # .. 3026 1F200 ; DISALLOWED # SQUARE HIRAGANA HOKA 3027 1F201..1F20F; UNASSIGNED # .. 3028 1F210..1F231; DISALLOWED # SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED 3029 1F232..1F23F; UNASSIGNED # .. 3030 1F240..1F248; DISALLOWED # TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRA 3031 1F249..1FFFD; UNASSIGNED # .. 3032 1FFFE..1FFFF; DISALLOWED # .. 3033 20000..2A6D6; PVALID # .... 3035 2A700..2B734; PVALID # .... 3037 2F800..2FA1D; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPA 3038 2FA1E..2FFFD; UNASSIGNED # .. 3039 2FFFE..2FFFF; DISALLOWED # .. 3040 30000..3FFFD; UNASSIGNED # .. 3041 3FFFE..3FFFF; DISALLOWED # .. 3042 40000..4FFFD; UNASSIGNED # .. 3043 4FFFE..4FFFF; DISALLOWED # .. 3044 50000..5FFFD; UNASSIGNED # .. 3045 5FFFE..5FFFF; DISALLOWED # .. 3046 60000..6FFFD; UNASSIGNED # .. 3047 6FFFE..6FFFF; DISALLOWED # .. 3048 70000..7FFFD; UNASSIGNED # .. 3049 7FFFE..7FFFF; DISALLOWED # .. 3050 80000..8FFFD; UNASSIGNED # .. 3051 8FFFE..8FFFF; DISALLOWED # .. 3052 90000..9FFFD; UNASSIGNED # .. 3053 9FFFE..9FFFF; DISALLOWED # .. 3054 A0000..AFFFD; UNASSIGNED # .. 3055 AFFFE..AFFFF; DISALLOWED # .. 3056 B0000..BFFFD; UNASSIGNED # .. 3057 BFFFE..BFFFF; DISALLOWED # .. 3058 C0000..CFFFD; UNASSIGNED # .. 3059 CFFFE..CFFFF; DISALLOWED # .. 3060 D0000..DFFFD; UNASSIGNED # .. 3061 DFFFE..DFFFF; DISALLOWED # .. 3062 E0000 ; UNASSIGNED # 3063 E0001 ; DISALLOWED # LANGUAGE TAG 3064 E0002..E001F; UNASSIGNED # .. 3065 E0020..E007F; DISALLOWED # TAG SPACE..CANCEL TAG 3066 E0080..E00FF; UNASSIGNED # .. 3067 E0100..E01EF; DISALLOWED # VARIATION SELECTOR-17..VARIATION SELECTOR-25 3068 E01F0..EFFFD; UNASSIGNED # .. 3069 EFFFE..10FFFF; DISALLOWED # .. 3071 8. References 3073 8.1. Normative References 3075 [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate 3076 Requirement Levels", BCP 14, RFC 2119, March 1997. 3078 [TR15] Davis, M. and M. Duerst, "Unicode Standard Annex #15, 3079 Unicode Normalization Forms, an integral part of the 3080 Unicode Standard", 3081 . 3083 [Unicode5] 3084 The Unicode Consortium, "The Unicode Standard, Version 3085 5.0.0", Boston, MA, Addison-Wesley ISBN 0-321-48091-0, 3086 2007. 3088 [Unicode52] 3089 The Unicode Consortium, "The Unicode Standard, Version 3090 5.1.0", Unicode 5.0.0, Boston, MA, Addison-Wesley ISBN 3091 0-321-48091-0, as amended by Unicode 5.1.0 3092 http://www.unicode.org/versions/Unicode5.1.0/, 2008, 3093 . 3095 8.2. Informative References 3097 [IDNA2008-bidi] 3098 Alvestrand, H. and C. Karp, "Right-to-left scripts for 3099 IDNA", draft-ietf-idnabis-bidi-06 (work in progress), 3100 September 2009. 3102 [IDNA2008-defs] 3103 Klensin, J., "Internationalized Domain Names for 3104 Applications (IDNA): Definitions and Document Framework", 3105 draft-ietf-idnabis-defs-12 (work in progress), 3106 October 2009. 3108 [IDNA2008-protocol] 3109 Klensin, J., "Internationalizing Domain Names in 3110 Applications (IDNA): Protocol", October 2009, . 3114 [IDNA2008-rationale] 3115 Klensin, J., Ed., "Internationalized Domain Names for 3116 Applications (IDNA): Background, Explanation, and 3117 Rationale", June 2009, . 3120 [RFC3454] Hoffman, P. and M. Blanchet, "Preparation of 3121 Internationalized Strings ("stringprep")", RFC 3454, 3122 December 2002. 3124 [RFC3491] Hoffman, P. and M. Blanchet, "Nameprep: A Stringprep 3125 Profile for Internationalized Domain Names (IDN)", 3126 RFC 3491, March 2003. 3128 [RFC4690] Klensin, J., Faltstrom, P., Karp, C., and IAB, "Review and 3129 Recommendations for Internationalized Domain Names 3130 (IDNs)", RFC 4690, September 2006. 3132 [RFC5226] Narten, T. and H. Alvestrand, "Guidelines for Writing an 3133 IANA Considerations Section in RFCs", BCP 26, RFC 5226, 3134 May 2008. 3136 URIs 3138 [1] 3140 [2] 3142 Author's Address 3144 Patrik Faltstrom (editor) 3145 Cisco 3147 Email: paf@cisco.com