| < draft-faltstrom-unicode12-03.txt | draft-faltstrom-unicode12-07.txt > | |||
|---|---|---|---|---|
| Network Working Group P. Faltstrom | Network Working Group P. Faltstrom | |||
| Internet-Draft Netnod | Internet-Draft Netnod | |||
| Intended status: Standards Track 3 October 2021 | Intended status: Standards Track February 13, 2022 | |||
| Expires: 6 April 2022 | Expires: August 17, 2022 | |||
| IDNA2008 and Unicode 12.0.0 | IDNA2008 and Unicode 12.0.0 | |||
| draft-faltstrom-unicode12-03 | draft-faltstrom-unicode12-07 | |||
| Abstract | Abstract | |||
| This document describes the changes between Unicode 6.2.0 and Unicode | This document describes the changes between Unicode 6.0.0 and Unicode | |||
| 12.0.0 in the context of IDNA2008. Some additions and changes have | 12.0.0 in the context of IDNA2008. Some additions and changes have | |||
| been made in the Unicode Standard that affect the values produced by | been made in the Unicode Standard that affect the values produced by | |||
| the algorithm IDNA2008 specifies. IDNA2008 allows adding exceptions | the algorithm IDNA2008 specifies. IDNA2008 allows adding exceptions | |||
| to the algorithm for backward compatibility; however, this document | to the algorithm for backward compatibility; however, this document | |||
| does not add any such exceptions. This document provides the | does not add any such exceptions. This document provides the | |||
| necessary tables to IANA to make its database consisstent with | necessary tables to IANA to make its database consistent with Unicode | |||
| Unicode 12.0.0. | 12.0.0. | |||
| To improve understanding, this document describes systems that are | To improve understanding, this document describes systems that are | |||
| being used as alternatives to those that conform to IDNA2008. | being used as alternatives to those that conform to IDNA2008. | |||
| TO BE REMOVED AT TIME OF PUBLICATION AS AN RFC: | TO BE REMOVED AT TIME OF PUBLICATION AS AN RFC: | |||
| This document is discussed on the i18n-discuss@ietf.org mailing list | This document is discussed on the i18n-discuss@ietf.org mailing list | |||
| of the IETF. | of the IETF. | |||
| Status of This Memo | Status of This Memo | |||
| skipping to change at page 1, line 45 | skipping to change at page 1, line 45 | |||
| Internet-Drafts are working documents of the Internet Engineering | Internet-Drafts are working documents of the Internet Engineering | |||
| Task Force (IETF). Note that other groups may also distribute | Task Force (IETF). Note that other groups may also distribute | |||
| working documents as Internet-Drafts. The list of current Internet- | working documents as Internet-Drafts. The list of current Internet- | |||
| Drafts is at https://datatracker.ietf.org/drafts/current/. | Drafts is at https://datatracker.ietf.org/drafts/current/. | |||
| Internet-Drafts are draft documents valid for a maximum of six months | Internet-Drafts are draft documents valid for a maximum of six months | |||
| and may be updated, replaced, or obsoleted by other documents at any | and may be updated, replaced, or obsoleted by other documents at any | |||
| time. It is inappropriate to use Internet-Drafts as reference | time. It is inappropriate to use Internet-Drafts as reference | |||
| material or to cite them other than as "work in progress." | material or to cite them other than as "work in progress." | |||
| This Internet-Draft will expire on 6 April 2022. | This Internet-Draft will expire on August 17, 2022. | |||
| Copyright Notice | Copyright Notice | |||
| Copyright (c) 2021 IETF Trust and the persons identified as the | Copyright (c) 2022 IETF Trust and the persons identified as the | |||
| document authors. All rights reserved. | document authors. All rights reserved. | |||
| This document is subject to BCP 78 and the IETF Trust's Legal | This document is subject to BCP 78 and the IETF Trust's Legal | |||
| Provisions Relating to IETF Documents (https://trustee.ietf.org/ | Provisions Relating to IETF Documents | |||
| license-info) in effect on the date of publication of this document. | (https://trustee.ietf.org/license-info) in effect on the date of | |||
| Please review these documents carefully, as they describe your rights | publication of this document. Please review these documents | |||
| and restrictions with respect to this document. Code Components | carefully, as they describe your rights and restrictions with respect | |||
| extracted from this document must include Simplified BSD License text | to this document. Code Components extracted from this document must | |||
| as described in Section 4.e of the Trust Legal Provisions and are | include Simplified BSD License text as described in Section 4.e of | |||
| provided without warranty as described in the Simplified BSD License. | the Trust Legal Provisions and are provided without warranty as | |||
| | described in the Simplified BSD License. | |||
| Table of Contents | Table of Contents | |||
| 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 3 | 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2 | |||
| 2. Background . . . . . . . . . . . . . . . . . . . . . . . . . 4 | 2. Background . . . . . . . . . . . . . . . . . . . . . . . . . 4 | |||
| 2.1. IDNA2008 Documents . . . . . . . . . . . . . . . . . . . 5 | 2.1. IDNA2008 Documents . . . . . . . . . . . . . . . . . . . 5 | |||
| 2.2. Additional important IDNA2008-related documents . . . . . 6 | 2.2. Additional important IDNA2008-related documents . . . . . 6 | |||
| 2.3. Deployment . . . . . . . . . . . . . . . . . . . . . . . 6 | 2.3. Deployment . . . . . . . . . . . . . . . . . . . . . . . 6 | |||
| 3. Notable Changes Between Unicode 6.2.0 and 12.0.0 . . . . . . 7 | 3. Notable Changes Between Unicode 6.0.0 and 12.0.0 . . . . . . 7 | |||
| 3.1. Changes between Unicode 6.2.0 and 7.0.0 . . . . . . . . . 7 | 3.1. Changes between Unicode 6.0.0 and 7.0.0 . . . . . . . . . 7 | |||
| 3.2. Changes between Unicode 7.0.0 and 10.0.0 . . . . . . . . 8 | 3.2. Changes between Unicode 7.0.0 and 10.0.0 . . . . . . . . 8 | |||
| 3.3. Changes between Unicode 10.0.0 and 11.0.0 . . . . . . . . 9 | 3.3. Changes between Unicode 10.0.0 and 11.0.0 . . . . . . . . 9 | |||
| 3.4. Changes between Unicode 11.0.0 and 12.0.0 . . . . . . . . 10 | 3.4. Changes between Unicode 11.0.0 and 12.0.0 . . . . . . . . 10 | |||
| 4. U+111C9 SHARADA SANDHI MARK . . . . . . . . . . . . . . . . . 11 | 4. U+111C9 SHARADA SANDHI MARK . . . . . . . . . . . . . . . . . 11 | |||
| 5. Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . 11 | 5. Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . 11 | |||
| 6. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 12 | 6. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 11 | |||
| 7. Security Considerations . . . . . . . . . . . . . . . . . . . 12 | 7. Security Considerations . . . . . . . . . . . . . . . . . . . 12 | |||
| 8. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . 12 | 8. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . 12 | |||
| 9. References . . . . . . . . . . . . . . . . . . . . . . . . . 12 | 9. References . . . . . . . . . . . . . . . . . . . . . . . . . 12 | |||
| 9.1. Normative References . . . . . . . . . . . . . . . . . . 12 | 9.1. Normative References . . . . . . . . . . . . . . . . . . 12 | |||
| 9.2. Non-normative references . . . . . . . . . . . . . . . . 13 | 9.2. Non-normative references . . . . . . . . . . . . . . . . 13 | |||
| Appendix A. Changes from Unicode 6.3.0 to Unicode 7.0.0 . . . . 15 | Appendix A. Changes from Unicode 6.0.0 to Unicode 7.0.0 . . . . 15 | |||
| Appendix B. Changes from Unicode 7.0.0 to Unicode 8.0.0 . . . . 18 | Appendix B. Changes from Unicode 7.0.0 to Unicode 8.0.0 . . . . 21 | |||
| Appendix C. Changes from Unicode 8.0.0 to Unicode 9.0.0 . . . . 20 | Appendix C. Changes from Unicode 8.0.0 to Unicode 9.0.0 . . . . 23 | |||
| Appendix D. Changes from Unicode 9.0.0 to Unicode 10.0.0 . . . . 21 | Appendix D. Changes from Unicode 9.0.0 to Unicode 10.0.0 . . . . 24 | |||
| Appendix E. Changes from Unicode 10.0.0 to Unicode 11.0.0 . . . 22 | Appendix E. Changes from Unicode 10.0.0 to Unicode 11.0.0 . . . 26 | |||
| Appendix F. Changes from Unicode 11.0.0 to Unicode 12.0.0 . . . 24 | Appendix F. Changes from Unicode 11.0.0 to Unicode 12.0.0 . . . 27 | |||
| Author's Address . . . . . . . . . . . . . . . . . . . . . . . . 25 | Author's Address . . . . . . . . . . . . . . . . . . . . . . . . 29 | |||
| 1. Introduction | 1. Introduction | |||
| The current version of Internationalized Domain Names for | The current version of Internationalized Domain Names for | |||
| Applications (IDNA) was initiated in 2008, and despite not being | Applications (IDNA) was initiated in 2008, and despite not being | |||
| completed until 2010, is widely known as "IDNA2008". It is specified | completed until 2010, is widely known as "IDNA2008". It is specified | |||
| in the series of documents listed in Section 2.1. The IDNA2008 | in the series of documents listed in Section 2.1. The IDNA2008 | |||
| standard includes an algorithm by which a derived property value is | standard includes an algorithm by which a derived property value is | |||
| calculated based on the properties defined from the Unicode Standard. | calculated based on the properties defined from the Unicode Standard. | |||
| The derived property values that can be calculated are defined in RFC | The derived property values that can be calculated are defined in RFC | |||
| 5892 [RFC5892]. The summary below is a summary to make the reading | 5892 [RFC5892]. Below is a summary to aid in the reading of this | |||
| of this document easier. For definition of the terms, please see RFC | document. For definition of the terms, please see RFC 5892 | |||
| 5892 [RFC5892]. | [RFC5892]. | |||
| * PROTOCOL VALID: Those that are allowed to be used in IDNs. Code | o PROTOCOL VALID: Those that are allowed to be used in IDNs. Code | |||
| points with this property value are permitted for general use in | points with this property value are permitted for general use in | |||
| IDNs. However, that a label consists only of code points that | IDNs. However, that a label consists only of code points that | |||
| have this property value does not imply that the label can be used | have this property value does not imply that the label can be used | |||
| in DNS. The abbreviated term PVALID is used to refer to this | in DNS. The abbreviated term PVALID is used to refer to this | |||
| value. | value. | |||
| * CONTEXTUAL RULE REQUIRED: Some characteristics of the character, | o CONTEXTUAL RULE REQUIRED: Some characteristics of the character, | |||
| such as it being invisible in certain contexts or problematic in | such as it being invisible in certain contexts or problematic in | |||
| others, require that it not be used in labels unless specific | others, require that it not be used in labels unless specific | |||
| other characters or properties are present. The abbreviated term | other characters or properties are present. The abbreviated term | |||
| CONTEXT is used to refer to this value. | CONTEXT is used to refer to this value. As explained in RFC 5892 | |||
| | [RFC5892] CONTEXT is in turn divided into CONTEXTJ and CONTEXTO. | |||
| * DISALLOWED: Those that should clearly not be included in IDNs. | o DISALLOWED: Those that should clearly not be included in IDNs. | |||
| Code points with this property value are not permitted in IDNs. | Code points with this property value are not permitted in IDNs. | |||
| * UNASSIGNED: Those code points that are not designated (i.e., are | o UNASSIGNED: Those code points that are not designated (i.e., are | |||
| unassigned) in the Unicode Standard. | unassigned) in the Unicode Standard. | |||
| When the Unicode Standard is updated, new code points are assigned | When the Unicode Standard is updated, new code points are assigned | |||
| and already-assigned code points can have their property values | and already-assigned code points can have their property values | |||
| changed. | changed. | |||
| * Assigning code points can create problems if the newly-assigned | o Assigning code points can create problems if the newly-assigned | |||
| code points are compositions of existing code points and because | code points are compositions of existing code points and because | |||
| of that the normalization relationships associated with those code | of that the normalization relationships associated with those code | |||
| points should have been changed. | points should have been changed. | |||
| * Changing properties for already-assigned code points can create | o Changing properties for already-assigned code points can create | |||
| problems if the property change results in changes to the derived | problems if the property change results in changes to the derived | |||
| property value. This might make an earlier allowed code point | property value. This might make an earlier allowed code point | |||
| whose derived property value is PVALID to then not be allowed | whose derived property value is PVALID to then not be allowed | |||
| anymore if its derived property value changes to DISALLOWED. The | anymore if its derived property value changes to DISALLOWED. The | |||
| problem can also happen the other way around: a code point that | problem can also happen the other way around: a code point that | |||
| was not allowed (and thus is prohibited) can suddenly end up being | was not allowed (and thus is prohibited) can suddenly end up being | |||
| allowed. | allowed. | |||
| * Problems can also be created if the properties assigned to those | o Problems can also be created if the properties assigned to those | |||
| code points are inconsistent with IDNA2008 assumptions about how | code points are inconsistent with IDNA2008 assumptions about how | |||
| properties are assigned and/or about how code points with those | properties are assigned and/or about how code points with those | |||
| properties are used or behave. | properties are used or behave. | |||
| There were three incompatible changes in the Unicode standard after | There were three incompatible changes in the Unicode standard between | |||
| Unicode 5.2.0 [Unicode-5.2.0] up to including Unicode 6.0.0 | Unicode 5.2.0 [Unicode-5.2.0] and Unicode 6.0.0 [Unicode-6.0.0]; they | |||
| [Unicode-6.0.0], as described in RFC 6452 [RFC6452]. The code points | are described in RFC 6452 [RFC6452]. The code points U+0CF1 and | |||
| U+0CF1 and U+0CF2 had a derived property value change from DISALLOWED | U+0CF2 had a derived property value change from DISALLOWED to PVALID, | |||
| to PVALID while U+19DA had a change in derived property value from | and the code point U+19DA had a change in derived property value from | |||
| PVALID to DISALLOWED. They were examined in great detail and IETF | PVALID to DISALLOWED. These changes were examined in great detail, | |||
| concluded that the consensus is that no update was needed to RFC 5892 | but the IETF concluded that these changes to the Unicode standard did | |||
| [RFC5892] based on the changes made to the Unicode standard. | not warrant an update to RFC 5892 [RFC5892]. | |||
| As described in Section 3, more changes have been made to code points | As described in Section 3, more incompatible changes have been made | |||
| between Unicode version 6.0.0 and Unicode version 12.0.0 | to code points between Unicode 6.0.0 and Unicode 12.0.0 | |||
| [Unicode-12.0.0] so that the derived property values have been | [Unicode-12.0.0]; however, the changes in the derived property values | |||
| changed in an incompatible way. This document concludes that no | do not result in exceptions (as defined in section 2.6 of RFC 5892 | |||
| exceptions are to be added to RFC 5892 [RFC5892] even though there | [RFC5892]) being added to RFC 5892 [RFC5892]. | |||
| are changes in the derived property value as a result of the changes | ||||
| made in Unicode between version 6.2.0 and 12.0.0. | ||||
| Further, in 2015, the Internet Architecture Board (IAB) issued a | Further, in 2015, the Internet Architecture Board (IAB) issued a | |||
| statement [IAB] which requested the IETF to resolve the issues | statement [IAB2005-1] that advised the community to avoid using any | |||
| related to the code point ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1) | of the potentially problematic code points and asked the IETF to | |||
| that was introduced in Unicode 7.0.0 [Unicode-7.0.0]. This document | resolve the issues related to the code point ARABIC LETTER BEH WITH | |||
| concludes that this code point is not to be added to the exception | HAMZA ABOVE (U+08A1) that was introduced in Unicode 7.0.0 | |||
| list either. It should be noted that the review on U+08A1 indicated | [Unicode-7.0.0]. In February of that year, the statement was revised | |||
| that it is not an isolated case and that a number of PVALID code | [IAB2005-2] to focus on the latter request. More details about the | |||
| points of long standing may have similar issues. The problem | problem of code point sequences not normalizing as one might expect | |||
| resulted in a clarification of the review process of new Unicode | appear in a draft that was part of the discussion [IDNA7]. | |||
| versions RFC 8753 [RFC8753]. This clarification of the review | ||||
| process will impact review of Unicode versions after version 12.0.0. | The result of the work in the IETF was that no exception was added to | |||
| | RFC 5892 [RFC5892]; however, it should be noted that the review of | |||
| | the issues around U+08A1 indicated that this code point is not an | |||
| | isolated case and that a number of long-standing PVALID code points | |||
| | may have similar issues. While the affected code points remain | |||
| | PVALID in this document, identification of the problem resulted in a | |||
| | clarification of the review process for new Unicode versions. That | |||
| | clarification, which reinforces the original review plan to capture | |||
| | issues like these, was published as RFC 8753 [RFC8753]. Any review | |||
| | of Unicode versions after 12.0.0 should be made according to RFC 8753 | |||
| | [RFC8753]; an objective of this document is to ensure that a proper | |||
| | review of such versions after version 12.0.0 can be made. | |||
| 2. Background | 2. Background | |||
| 2.1. IDNA2008 Documents | 2.1. IDNA2008 Documents | |||
| IDNA2008 consists of the following documents. The documents in the | IDNA2008 consists of the following documents. The documents in the | |||
| set have informal names. | set have informal names. | |||
| * Internationalized Domain Names for Applications (IDNA): | o Internationalized Domain Names for Applications (IDNA): | |||
| Definitions and Document Framework [RFC5890], informally called | Definitions and Document Framework [RFC5890], informally called | |||
| "Defs" or "Definitions", contains definitions and other material | "Defs" or "Definitions", contains definitions and other material | |||
| that are needed for understanding other documents in the set. | that are needed for understanding other documents in the set. | |||
| * Internationalized Domain Names in Applications (IDNA): Protocol | o Internationalized Domain Names in Applications (IDNA): Protocol | |||
| [RFC5891], informally called "Protocol", describes the core | [RFC5891], informally called "Protocol", describes the core | |||
| IDNA2008 protocol and its operations. It needs to be interpreted | IDNA2008 protocol and its operations. It needs to be interpreted | |||
| in combination with the Bidi document (described below). | in combination with the Bidi document (described below). | |||
| * The Unicode Code Points and Internationalized Domain Names for | o The Unicode Code Points and Internationalized Domain Names for | |||
| Applications (IDNA) [RFC5892], informally called "Tables", lists | Applications (IDNA) [RFC5892], informally called "Tables", lists | |||
| the categories and rules that identify the code points allowed in | the categories and rules that identify the code points allowed in | |||
| a label written in native character form (called a "U-label"), and | a label written in native character form (called a "U-label"), and | |||
| is based on Unicode 5.2.0 [Unicode-5.2.0] code point assignments | is based on Unicode 5.2.0 [Unicode-5.2.0] code point assignments | |||
| and additional rules unique to IDNA2008. The Unicode-based rules | and additional rules unique to IDNA2008. The Unicode-based rules | |||
| in RFC 4892 are expected to be stable across Unicode updates and | in RFC 5892 are expected to be stable across Unicode updates and | |||
| hence independent of Unicode versions. RFC 5892 [RFC5892] | hence independent of Unicode versions. RFC 5892 [RFC5892] | |||
| obsoletes RFC 3491 [RFC3491], and in particular the use of the | obsoletes RFC 3491 [RFC3491], and in particular the use of the | |||
| tables to which RFC 3491 [RFC3491] refers. | tables to which RFC 3491 [RFC3491] refers. | |||
| * Right-to-Left Scripts for Internationalized Domain Names for | o Right-to-Left Scripts for Internationalized Domain Names for | |||
| Applications (IDNA) [RFC5893], informally called "Bidi", specifies | Applications (IDNA) [RFC5893], informally called "Bidi", specifies | |||
| special rules for labels that contain characters that are written | special rules for labels that contain characters that are written | |||
| from right to left. | from right to left. | |||
| * Internationalized Domain Names for Applications (IDNA): | o Internationalized Domain Names for Applications (IDNA): | |||
| Background, Explanation, and Rationale [RFC5894], informally | Background, Explanation, and Rationale [RFC5894], informally | |||
| called "Rationale", provides an overview of the protocol and | called "Rationale", provides an overview of the protocol and | |||
| associated tables, and gives explanatory material and some | associated tables, and gives explanatory material and some | |||
| rationale for the decisions that led to IDNA2008. It also | rationale for the decisions that led to IDNA2008. It also | |||
| contains advice for DNS registry operators and others who use | contains advice for DNS registry operators and others who use | |||
| Internationalized Domain Names (IDNs). | Internationalized Domain Names (IDNs). | |||
| * Mapping Characters for Internationalized Domain Names in | o Mapping Characters for Internationalized Domain Names in | |||
| Applications (IDNA) 2008 [RFC5895], informally called "Mapping", | Applications (IDNA) 2008 [RFC5895], informally called "Mapping", | |||
| discusses the issue of mapping characters into other characters | discusses the issue of mapping characters into other characters | |||
| and provides guidance for doing so when that is appropriate. RFC | and provides guidance for doing so when that is appropriate. RFC | |||
| 5895 provides advice only and is not a required part of IDNA. | 5895 provides advice only and is not a required part of IDNA. | |||
| 2.2. Additional important IDNA2008-related documents | 2.2. Additional important IDNA2008-related documents | |||
| There are other documents important for the understanding and | There are other documents important for the understanding and | |||
| functioning of IDNA2008, for example the following. | functioning of IDNA2008, for example the following. | |||
| * The Unicode Code Points and Internationalized Domain Names for | o The Unicode Code Points and Internationalized Domain Names for | |||
| Applications (IDNA) - Unicode 6.0 [RFC6452] describes some changes | Applications (IDNA) - Unicode 6.0 [RFC6452] describes some changes | |||
| made to Unicode 6.0.0 [Unicode-6.0.0] that resulted in derived | made to Unicode 6.0.0 [Unicode-6.0.0] that resulted in derived | |||
| property value change for the code points U+0CF1, U+0CF2 and | property value change for the code points U+0CF1, U+0CF2 and | |||
| U+19DA. U+0CF1 and U+0CF2 changed from DISALLOWED to PVALID, | U+19DA. U+0CF1 and U+0CF2 changed from DISALLOWED to PVALID, | |||
| while U+19DA changed from PVALID to DISALLOWED. The IETF | while U+19DA changed from PVALID to DISALLOWED. The IETF | |||
| concluded that no update to RFC 5892 [RFC5892] was needed based on | concluded that no update to RFC 5892 [RFC5892] was needed based on | |||
| the changes made in Unicode 6.0.0 [Unicode-6.0.0]. As a result, | the changes made in Unicode 6.0.0 [Unicode-6.0.0]. As a result, | |||
| the derived property value remained aligned with the Unicode | the derived property value remained aligned with the Unicode | |||
| Standard. Specifically, no exception was added. | Standard. Specifically, no exception was added. | |||
| 2.3. Deployment | 2.3. Deployment | |||
| There are many variations on the general IDNA model in use in the | There are many variations on the general IDNA model in use in the | |||
| various parts of the community. The following lists some of the | various parts of the community. The following lists some of the | |||
| strategies that implementations that claim to be IDNA compliant are | strategies that implementations that claim to be IDNA compliant are | |||
| known to use, but it should be noted the list is not complete: | known to use, but it should be noted the list is not complete: | |||
| * IDNA2003 as specified in RFC 3490 [RFC3490] and RFC 3491 | o IDNA2003 as specified in RFC 3490 [RFC3490] and RFC 3491 | |||
| [RFC3491]. Those specifications are dependent on case folding and | [RFC3491]. Those specifications are dependent on case folding and | |||
| NFKC normalization and on tables that specify for each code point | NFKC normalization and on tables that specify for each code point | |||
| whether it is allowed to be used or not, with a distinction made | whether it is allowed to be used or not, with a distinction made | |||
| between use for "stored strings" and "query strings". The tables | between use for "stored strings" and "query strings". The tables | |||
| themselves are dependent on version 3.2 of The Unicode Standard | themselves are dependent on Unicode 3.2 [Unicode-3.2.0]. | |||
| [Unicode-3.2.0]. | ||||
| * A number of variations on IDNA2003, sometimes presented as | o A number of variations on IDNA2003, sometimes presented as | |||
| "updated IDNA2003" or the like, which follow the principles of | "updated IDNA2003" or the like, which follow the principles of | |||
| IDNA2003 as understood by the implementers but that use tables | IDNA2003 as understood by the implementers but that use tables | |||
| that represent how the implementers believe Stringprep [RFC3454] | that represent how the implementers believe Stringprep [RFC3454] | |||
| and Nameprep [RFC3491] would have evolved had the IETF not moved | and Nameprep [RFC3491] would have evolved had the IETF not moved | |||
| in the direction of IDNA2008 instead. | in the direction of IDNA2008 instead. | |||
| * A mix between IDNA2003 and IDNA2008 where code points assigned to | o A mix between IDNA2003 and IDNA2008 where code points assigned to | |||
| Unicode after Unicode 3.2.0 [Unicode-3.2.0] have derived property | Unicode after Unicode 3.2.0 [Unicode-3.2.0] have derived property | |||
| value calculated according to the algorithm specified in IDNA2008. | value calculated according to the algorithm specified in IDNA2008. | |||
| * A mix between IDNA2003 and IDNA2008 according to the Unicode | o A mix between IDNA2003 and IDNA2008 according to the Unicode | |||
| Technical Standard #46 [UTS-46]. Because that document specifies | Technical Standard #46 [UTS-46]. Because that document specifies | |||
| different profiles, there are several different variations that | different profiles, there are several variations that leave users | |||
| leave users with no guarantee that two applications claiming | with no guarantee that two applications claiming conformance to | |||
| conformance to UTS#46 will interoperate well with each other, much | UTS#46 will interoperate well with each other, much less with | |||
| less with conforming IDNA2008 implementations. UTS#46 is | conforming IDNA2008 implementations. UTS#46 is ultimately based | |||
| ultimately based on a normative table very much like the one used | on a normative table very much like the one used by Stringprep | |||
| by Stringprep [RFC3454] but updated for each new version of | [RFC3454] but updated for each new version of Unicode. | |||
| Unicode. | ||||
| * The (normative) IDNA2008 algorithm applied to whatever version of | o The (normative) IDNA2008 algorithm applied to whatever version of | |||
| Unicode Standard exists in the operating system and/or libraries | Unicode Standard exists in the operating system and/or libraries | |||
| used, independent of whatever version of tables appears in the | used, independent of whatever version of tables appears in the | |||
| (non-normative) IANA database. | (non-normative) IANA database. | |||
| In practice, the Unicode Consortium creates a maximum set of code | In practice, the Unicode Consortium creates a maximum set of code | |||
| points by assigning code points in the Unicode Standard. The | points by assigning code points in the Unicode Standard. The | |||
| IDNA2008 rules use the Unicode Standard to create a further subset of | IDNA2008 rules use the Unicode Standard to create a further subset of | |||
| code points and context that are permitted in DNS labels associated | code points and context that are permitted in DNS labels associated | |||
| with its PVALID, CONTEXTJ, and CONTEXTO derived property values. DNS | with its PVALID and CONTEXT (CONTEXTJ or CONTEXTO) derived property | |||
| registries and other organizations that deal with IDNs are supposed | values. DNS registries and other organizations that deal with IDNs | |||
| to create their own subsets from IDNA2008 for use by those registries | are supposed to create their own subsets from IDNA2008 for use by | |||
| and organizations. | those registries and organizations. | |||
| This progressive subsetting and narrowing of the repertoire of code | This progressive subsetting and narrowing of the repertoire of code | |||
| points that can be used in labels is an implementation of the | points that can be used in labels is an implementation of the | |||
| principles of being conservative when deciding what code points to | principles of being conservative when deciding what code points to | |||
| include in such a subset. SAC-084 [SAC-084] and RFC 6912 [RFC6912] | include in such a subset. SAC-084 [SAC-084] and RFC 6912 [RFC6912] | |||
| recommend to DNS registries and other organizations to be | recommend to DNS registries and other organizations to be | |||
| conservative when creating their subsets, and to use the principle of | conservative when creating their subsets, and to use the principle of | |||
| creating subsets by inclusion. | creating subsets by inclusion. | |||
| 3. Notable Changes Between Unicode 6.2.0 and 12.0.0 | See also the Security Considerations section in this document. | |||
| 3.1. Changes between Unicode 6.2.0 and 7.0.0 | 3. Notable Changes Between Unicode 6.0.0 and 12.0.0 | |||
| Change in number of characters in each category: | Among the changes between the Unicode versions, most code points that | |||
| | change derived property value change from UNASSIGNED to PVALID or | |||
| | from UNASSIGNED to DISALLOWED. The interesting changes in derived | |||
| | property values include other changes. All changes between the major | |||
| | versions of Unicode can be found in Appendix A (6.0.0-7.0.0), | |||
| | Appendix B (7.0.0-8.0.0), Appendix C (8.0.0-9.0.0), Appendix D | |||
| | (9.0.0-10.0.0), Appendix E (10.0.0-11.0.0) and Appendix F | |||
| | (11.0.0-12.0.0). | |||
| Code points that changed derived property value: 0 | 3.1. Changes between Unicode 6.0.0 and 7.0.0 | |||
| PVALID changed from 97946 to 99867 (+1921) | Change in number of characters in each category: | |||
| UNASSIGNED changed from 864348 to 861509 (-2839) | PVALID changed from 97418 to 99867 (+2449) | |||
| CONTEXTJ did not change, at 2 | UNASSIGNED changed from 865081 to 861509 (-3572) | |||
| CONTEXTJ did not change, at 2 | ||||
| CONTEXTO did not change, at 25 | CONTEXTO did not change, at 25 | |||
| DISALLOWED changed from 151791 to 152709 (+918) | DISALLOWED changed from 151586 to 152709 (+1123) | |||
| TOTAL did not change, at 1114112 | TOTAL did not change, at 1114112 | |||
| There are no changes made to Unicode between version 6.2.0 and | ||||
| There are no changes made to Unicode between version 6.0.0 and | ||||
| 7.0.0 that impact IDNA2008 calculation of the derived property | 7.0.0 that impact IDNA2008 calculation of the derived property | |||
| values. | values. | |||
| The code points U+17B4 KHMER VOWEL INHERENT AQ and U+17B5 KHMER VOWEL | ||||
| INHERENT AA both changed the general category from Cf (Format) to Mn | ||||
| (Nonspacing_Mark), but that did not impact the calculation of the | ||||
| derived property value which stayed at DISALLOWED. | ||||
| The character ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1) was | The character ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1) was | |||
| introduced in Unicode 7.0.0. This was discussed extensively in the | introduced in Unicode 7.0.0. This was discussed extensively in the | |||
| IETF, and by the IAB in their statement [IAB] requesting the IETF to | IETF, and by the IAB in their statement [IAB2005-1] requesting the | |||
| investigate the issue. Specifically, the IAB stated: | IETF to investigate the issue. Specifically, the IAB stated: | |||
| On the same precautionary principle, the IAB recommends that the | On the same precautionary principle, the IAB recommends that the | |||
| Internationalized Domain Names for Applications (IDNA) Parameters | Internationalized Domain Names for Applications (IDNA) Parameters | |||
| registry (http://www.iana.org/assignments/idna-tables/) not be | registry <https://www.iana.org/assignments/idna-tables/> not be | |||
| updated to Unicode 7.0.0 until the IETF has consensus on a | updated to Unicode 7.0.0 until the IETF has consensus on a | |||
| solution to this problem. | solution to this problem. | |||
| The discussion in the IETF concluded that although it is possible to | The discussion in the IETF concluded that although it is possible to | |||
| create "the same" character in multiple ways, the issue with U+08A1 | create "the same" character in multiple ways, the issue with U+08A1 | |||
| is not unique. The character U+08A1 (ARABIC LETTER BEH WITH HAMZA | is not unique. The character U+08A1 (ARABIC LETTER BEH WITH HAMZA | |||
| ABOVE) can be represented with the sequence ARABIC LETTER BEH | ABOVE) can be represented with the sequence ARABIC LETTER BEH | |||
| (U+0628) and ARABIC HAMZA ABOVE (U+0654). This is identical to LATIN | (U+0628) and ARABIC HAMZA ABOVE (U+0654). This is identical to LATIN | |||
| SMALL LETTER O WITH STROKE (U+00F8), which can be represented with | SMALL LETTER O WITH STROKE (U+00F8), which can be represented with | |||
| the sequence LATIN SMALL LETTER O (U+006F) followed by COMBINING | the sequence LATIN SMALL LETTER O (U+006F) followed by COMBINING | |||
| SHORT SOLIDUS OVERLAY (U+0337). | SHORT SOLIDUS OVERLAY (U+0337). | |||
| Although the discussion about this specific code point resulted in | Although the discussion about this specific code point resulted in | |||
| acceptance of the derived property value of PVALID, the underlying | acceptance of the derived property value of PVALID, the underlying | |||
| problem with combining sequences is not understood fully. Therefore | problem with combining sequences is not understood fully. Therefore, | |||
| it cannot be claimed that this case can be extrapolated to other | it cannot be claimed that this case can be extrapolated to other | |||
| situations and other code points. | situations and other code points. | |||
| 3.2. Changes between Unicode 7.0.0 and 10.0.0 | 3.2. Changes between Unicode 7.0.0 and 10.0.0 | |||
| Change in number of characters in each category: | Change in number of characters in each category: | |||
| Code points that changed derived property value: 0 | Code points that changed derived property value: 0 | |||
| PVALID changed from 99867 to 122411 (+22544) | PVALID changed from 99867 to 122411 (+22544) | |||
| skipping to change at page 9, line 43 ¶ | skipping to change at page 10, line 5 ¶ | |||
| The properties for ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07) and | The properties for ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07) and | |||
| ZANABAZAR SQUARE VOWEL SIGN AU (U+11A08) were corrected from Mc to | ZANABAZAR SQUARE VOWEL SIGN AU (U+11A08) were corrected from Mc to | |||
| Mn. | Mn. | |||
| SPHERICAL ANGLE OPENING UP (U+29A1) was changed to Bidi_M=N. | SPHERICAL ANGLE OPENING UP (U+29A1) was changed to Bidi_M=N. | |||
| These changes to the Unicode Standard have the following implications | These changes to the Unicode Standard have the following implications | |||
| for these code points: | for these code points: | |||
| * The newly assigned 684 characters are assigned a derived property | o The newly assigned 684 characters are assigned a derived property | |||
| value as a result of applying the IDNA2008 algorithm. | value as a result of applying the IDNA2008 algorithm. | |||
| * The Georgian letters in the ranges U+10D0..U+10FA and | o The Georgian letters in the ranges U+10D0..U+10FA and | |||
| U+10FD..U+10FF existed before IDNA2008 was created. Applying the | U+10FD..U+10FF existed before IDNA2008 was created. Applying the | |||
| IDNA2008 algorithm to the code points assigned the derived | IDNA2008 algorithm to the code points assigned the derived | |||
| property value PVALID, and that value is unchanged even if the | property value PVALID, and that value is unchanged even if the | |||
| underlying Unicode properties have changed. The newly encoded | underlying Unicode properties have changed. The newly encoded | |||
| Mtavruli letters have general category "Lu" and are therefore | Mtavruli letters have general category "Lu" and are therefore | |||
| DISALLOWED. | DISALLOWED. | |||
| * The U+111C9 SHARADA SANDHI MARK was added to Unicode 8.0.0 | o The U+111C9 SHARADA SANDHI MARK was added to Unicode 8.0.0 | |||
| [Unicode-8.0.0]. Applying the IDNA2008 algorithm to the code | [Unicode-8.0.0]. Applying the IDNA2008 algorithm to the code | |||
| point assigned the derived property value DISALLOWED. The changes | point assigned the derived property value DISALLOWED. The changes | |||
| in the underlying properties in the Unicode Standard Version | in the underlying properties in the Unicode Standard Version | |||
| 11.0.0 [Unicode-11.0.0] caused the derived property value to | 11.0.0 [Unicode-11.0.0] caused the derived property value to | |||
| change to PVALID. | change to PVALID. | |||
| * The characters ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07) and | o The characters ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07) and | |||
| ZANABAZAR SQUARE VOWEL SIGN AU (U+11A08) were added to Unicode | ZANABAZAR SQUARE VOWEL SIGN AU (U+11A08) were added to Unicode | |||
| 10.0.0 [Unicode-10.0.0]. Applying the IDNA2008 algorithm to the | 10.0.0 [Unicode-10.0.0]. Applying the IDNA2008 algorithm to the | |||
| code points assigned the derived property value PVALID, and that | code points assigned the derived property value PVALID, and that | |||
| value is unchanged even if the underlying Unicode properties have | value is unchanged even if the underlying Unicode properties have | |||
| changed. | changed. | |||
| * SPHERICAL ANGLE OPENING UP (U+29A1) existed before IDNA2008 was | o SPHERICAL ANGLE OPENING UP (U+29A1) existed before IDNA2008 was | |||
| created. Applying the IDNA2008 algorithm to the code point | created. Applying the IDNA2008 algorithm to the code point | |||
| assigned the derived property value DISALLOWED, and that value is | assigned the derived property value DISALLOWED, and that value is | |||
| unchanged even if the underlying Unicode properties have changed. | unchanged even if the underlying Unicode properties have changed. | |||
| 3.4. Changes between Unicode 11.0.0 and 12.0.0 | 3.4. Changes between Unicode 11.0.0 and 12.0.0 | |||
| Change in number of characters in each category: | Change in number of characters in each category: | |||
| Code points that changed derived property value: 0 | Code points that changed derived property value: 0 | |||
| skipping to change at page 11, line 8 ¶ | skipping to change at page 11, line 8 ¶ | |||
| CONTEXTO did not change, at 25 | CONTEXTO did not change, at 25 | |||
| DISALLOWED changed from 154260 to 154542 (+282) | DISALLOWED changed from 154260 to 154542 (+282) | |||
| TOTAL did not change, at 1114112 | TOTAL did not change, at 1114112 | |||
| 4. U+111C9 SHARADA SANDHI MARK | 4. U+111C9 SHARADA SANDHI MARK | |||
| As one can see in Section 3, an incompatible property change was made | As one can see in Section 3, an incompatible property change was made | |||
| between Unicode 6.2.0 and 12.0.0, affecting the code point U+111C9. | between Unicode 6.0.0 and 12.0.0, affecting the code point U+111C9. | |||
| Its derived property value thus changed from DISALLOWED to PVALID. | Its derived property value thus changed from DISALLOWED to PVALID. | |||
| In situations like these, IDNA2008 allows for the addition of rules to RFC | In situations like these, IDNA2008 allows for the addition of rules to RFC | |||
| 5892 [RFC5892] section 2.7. (BackwardCompatible (G)). If the code | 5892 [RFC5892] section 2.7. If the code point is accepted, it might | |||
| point is accepted, it might still be rejected if validated by | still be rejected if validated by software based on older versions of | |||
| software based on older versions of Unicode than 11.0.0. As the | Unicode than 12.0.0. As the character is rarely used outside the | |||
| character is rarely used outside of the group of Sharada specialists, | group of Sharada specialists, and used in some records for indicating | |||
| and used in some records for indicating sandhi breaks, the conclusion | sandhi breaks, the conclusion is that it could either be added as an | |||
| is that it could either be added as an exception or allowed to change | exception or allowed to change its property value, as the use of the | |||
| its property value, as the use of the code point is limited outside a | code point is limited outside a special community. As including an | |||
| special community. As including an exception would require | exception would require implementation changes in deployed | |||
| implementation changes in deployed implementations of IDNA2008, the | implementations of IDNA2008, the IETF has decided to not add a | |||
| editor proposes that such a BackwardCompatible rule NOT to be added | BackwardCompatible rule to IDNA2008 (i.e. Section 2.7 of RFC 5892 | |||
| to IDNA2008. This also ensures that all sandhi marks are treated in an | [RFC5892]) for this code point. This also ensures that all sandhi marks | |||
| equal way. | are treated in an equal way. | |||
| The IETF has decided to NOT add a BackwardCompatible rule to IDNA2008 | ||||
| (i.e. Section 2.7 of RFC 5892 [RFC5892]) for this code point. | ||||
| 5. Conclusion | 5. Conclusion | |||
| As described in Section 3 and Section 4, changes have been made to | As described in Section 3 and Section 4, changes have been made to | |||
| Unicode between version 6.2.0 and 12.0.0. Some changes to specific | Unicode between version 6.0.0 and 12.0.0. Some changes to specific | |||
| characters changed their derived property value, whereas other | characters changed their derived property value, whereas other | |||
| changes did not. Given the deployment considerations described in | changes did not. Given the deployment considerations described in | |||
| Section 2.3 and changes in the Unicode Standard described in | Section 2.3 and changes in the Unicode Standard described in | |||
| Section 3 and Section 4, including implications to normalization, the | Section 3 and Section 4, including implications to normalization, the | |||
| conclusion of this document is to not add any exception rules to | conclusion is to not add any exception rules to IDNA2008. | |||
| IDNA2008. | ||||
| This document addresses only changes to Unicode between version 6.2.0 | This document addresses only changes to Unicode between version 6.0.0 | |||
| and version 12.0.0. Changes in future Unicode versions might result | and version 12.0.0. Changes in future Unicode versions might result | |||
| in the conclusion that exception rules need to be added to IDNA2008 | in the conclusion that exception rules need to be added to IDNA2008 | |||
| after review process explained in RFC 8753 [RFC8753]. Separately | after the review process explained in RFC 8753 [RFC8753]. Separately | |||
| from any changes in Unicode, the IETF might conclude that updates to | from any changes in Unicode, the IETF might conclude that updates to | |||
| RFC 5892 [RFC5892] or other IDNA2008 documents might become | RFC 5892 [RFC5892] or other IDNA2008 documents might become | |||
| necessary; such updates might include changes to the algorithm | necessary; such updates might include changes to the algorithm | |||
| specified in IDNA2008 as well as additional rules, categories, or | specified in IDNA2008 as well as additional rules, categories, or | |||
| other forms of tuning. Like the clarifications in RFC 8753 | other forms of tuning, like the clarifications in RFC 8753 [RFC8753]. | |||
| [RFC8753]. | ||||
| 6. IANA Considerations | 6. IANA Considerations | |||
| IANA is requested to update the IDNA Parameters registry of derived | IANA is requested to update the IDNA Parameters registry [IANA-IDNA] | |||
| property values, after the expert reviewer validates that the derived | of derived property values, after the expert reviewer validates that | |||
| property values are calculated correctly. | the derived property values are calculated correctly. | |||
| 7. Security Considerations | 7. Security Considerations | |||
| This document makes recommendations regarding the use of the IDNA2008 | This document makes recommendations regarding the use of the IDNA2008 | |||
| algorithm for calculation of derived property values, based on | algorithm for calculation of derived property values, based on | |||
| Unicode version 12.0.0. This recommendation do not say anything | Unicode version 12.0.0. This recommendation does not say anything | |||
| about what recommendations to make for future versions of the Unicode | about what recommendations to make for future versions of the Unicode | |||
| Standard. | Standard. | |||
| Not following these recommendations can lead to various security | Not following these recommendations can lead to various security | |||
| issues. Specifically, allowing confusable characters may lead to | issues. Specifically, allowing confusable characters may lead to | |||
| various phishing attacks, as described in the Security Considerations | various phishing attacks, as described in the Security Considerations | |||
| sections in the documents listed in Section 2.1. | sections in the documents listed in Section 2.1. | |||
| 8. Acknowledgements | 8. Acknowledgements | |||
| skipping to change at page 13, line 22 ¶ | skipping to change at page 13, line 12 ¶ | |||
| (IDNA)", RFC 5893, DOI 10.17487/RFC5893, August 2010, | (IDNA)", RFC 5893, DOI 10.17487/RFC5893, August 2010, | |||
| <https://www.rfc-editor.org/info/rfc5893>. | <https://www.rfc-editor.org/info/rfc5893>. | |||
| [RFC6452] Faltstrom, P., Ed. and P. Hoffman, Ed., "The Unicode Code | [RFC6452] Faltstrom, P., Ed. and P. Hoffman, Ed., "The Unicode Code | |||
| Points and Internationalized Domain Names for Applications | Points and Internationalized Domain Names for Applications | |||
| (IDNA) - Unicode 6.0", RFC 6452, DOI 10.17487/RFC6452, | (IDNA) - Unicode 6.0", RFC 6452, DOI 10.17487/RFC6452, | |||
| November 2011, <https://www.rfc-editor.org/info/rfc6452>. | November 2011, <https://www.rfc-editor.org/info/rfc6452>. | |||
| 9.2. Non-normative references | 9.2. Non-normative references | |||
| [Changes-11.0.0] | [IAB2005-1] | |||
| The Unicode Consortium, "Unicode Standard Annex #44", | Internet Architecture Board, "IAB Statement on Identifiers | |||
| Unicode Standard Annex #44, UNICODE CHARACTER DATABASE, | and Unicode 7.0.0", IAB Statement on Identifiers and | |||
| Change History https://www.unicode.org/reports/tr44/ | Unicode 7.0.0 | |||
| tr44-21d4.html#Change_History, May 2018. | <https://www.iab.org/documents/correspondence-reports- | |||
| documents/2015-2/iab-statement-on-identifiers-and-unicode- | ||||
| 7-0-0/archive/>, January 2015. | ||||
| [IAB] Internet Architecture Board, "IAB Statement on Identifiers | [IAB2005-2] | |||
| Internet Architecture Board, "IAB Statement on Identifiers | ||||
| and Unicode 7.0.0", IAB Statement on Identifiers and | and Unicode 7.0.0", IAB Statement on Identifiers and | |||
| Unicode 7.0.0 | Unicode 7.0.0 | |||
| https://www.iab.org/documents/correspondence-reports- | <https://www.iab.org/documents/correspondence-reports- | |||
| documents/2015-2/iab-statement-on-identifiers-and-unicode- | documents/2015-2/iab-statement-on-identifiers-and-unicode- | |||
| 7-0-0/, January 2015. | 7-0-0/>, February 2015. | |||
| [IANA-IDNA] | ||||
| IANA, "IDNA Rules and Derived Property Values", IDNA Rules | ||||
| and Derived Property Values | ||||
| <https://www.iana.org/assignments/idna-tables-6.0.0/idna- | ||||
| tables-6.0.0.xhtml>, April 2020. | ||||
| [IDNA7] Klensin, J. and P. Faltstrom, "IDNA Update for Unicode 7.0 | ||||
| and Later Versions", draft-klensin-idna-5892upd-unicode70 | ||||
| <https://datatracker.ietf.org/doc/draft-klensin-idna- | ||||
| 5892upd-unicode70/>, October 2017. | ||||
| [RFC3454] Hoffman, P. and M. Blanchet, "Preparation of | [RFC3454] Hoffman, P. and M. Blanchet, "Preparation of | |||
| Internationalized Strings ("stringprep")", RFC 3454, | Internationalized Strings ("stringprep")", RFC 3454, | |||
| DOI 10.17487/RFC3454, December 2002, | DOI 10.17487/RFC3454, December 2002, | |||
| <https://www.rfc-editor.org/info/rfc3454>. | <https://www.rfc-editor.org/info/rfc3454>. | |||
| [RFC3490] Faltstrom, P., Hoffman, P., and A. Costello, | [RFC3490] Faltstrom, P., Hoffman, P., and A. Costello, | |||
| "Internationalizing Domain Names in Applications (IDNA)", | "Internationalizing Domain Names in Applications (IDNA)", | |||
| RFC 3490, DOI 10.17487/RFC3490, March 2003, | RFC 3490, DOI 10.17487/RFC3490, March 2003, | |||
| <https://www.rfc-editor.org/info/rfc3490>. | <https://www.rfc-editor.org/info/rfc3490>. | |||
| skipping to change at page 14, line 15 ¶ | skipping to change at page 14, line 15 ¶ | |||
| [RFC5895] Resnick, P. and P. Hoffman, "Mapping Characters for | [RFC5895] Resnick, P. and P. Hoffman, "Mapping Characters for | |||
| Internationalized Domain Names in Applications (IDNA) | Internationalized Domain Names in Applications (IDNA) | |||
| 2008", RFC 5895, DOI 10.17487/RFC5895, September 2010, | 2008", RFC 5895, DOI 10.17487/RFC5895, September 2010, | |||
| <https://www.rfc-editor.org/info/rfc5895>. | <https://www.rfc-editor.org/info/rfc5895>. | |||
| [RFC6912] Sullivan, A., Thaler, D., Klensin, J., and O. Kolkman, | [RFC6912] Sullivan, A., Thaler, D., Klensin, J., and O. Kolkman, | |||
| "Principles for Unicode Code Point Inclusion in Labels in | "Principles for Unicode Code Point Inclusion in Labels in | |||
| the DNS", RFC 6912, DOI 10.17487/RFC6912, April 2013, | the DNS", RFC 6912, DOI 10.17487/RFC6912, April 2013, | |||
| <https://www.rfc-editor.org/info/rfc6912>. | <https://www.rfc-editor.org/info/rfc6912>. | |||
| [RFC8753] Klensin, J. and P. Fältström, "Internationalized Domain | [RFC8753] Klensin, J. and P. Faeltstroem, "Internationalized Domain | |||
| Names for Applications (IDNA) Review for New Unicode | Names for Applications (IDNA) Review for New Unicode | |||
| Versions", RFC 8753, DOI 10.17487/RFC8753, April 2020, | Versions", RFC 8753, DOI 10.17487/RFC8753, April 2020, | |||
| <https://www.rfc-editor.org/info/rfc8753>. | <https://www.rfc-editor.org/info/rfc8753>. | |||
| [SAC-084] The Security and Stability Advisory Committee, "SAC084", | [SAC-084] The Security and Stability Advisory Committee, "SAC084", | |||
| SSAC Comments on Guidelines for the Extended Process | SSAC Comments on Guidelines for the Extended Process | |||
| Similarity Review Panel for the IDN ccTLD Fast Track | Similarity Review Panel for the IDN ccTLD Fast Track | |||
| Process https://www.icann.org/en/system/files/files/sac- | Process <https://www.icann.org/en/system/files/files/sac- | |||
| 084-en.pdf, August 2016. | 084-en.pdf>, August 2016. | |||
| [Unicode-10.0.0] | ||||
| The Unicode Consortium, "The Unicode Standard, Version | ||||
| 10.0.0", The Unicode Standard, Version 10.0.0 ISBN | ||||
| 978-1-936213-16-0, June 2017. | ||||
| [Unicode-11.0.0] | ||||
| The Unicode Consortium, "The Unicode Standard, Version | ||||
| 11.0.0", The Unicode Standard, Version 11.0.0 ISBN | ||||
| 978-1-936213-19-1, June 2018. | ||||
| [Unicode-12.0.0] | ||||
| The Unicode Consortium, "The Unicode Standard, Version | ||||
| 12.0.0", The Unicode Standard, Version 12.0.0 ISBN | ||||
| 978-1-936213-22-1, March 2019. | ||||
| [Unicode-3.2.0] | [Unicode-3.2.0] | |||
| The Unicode Consortium, "The Unicode Standard, Version | The Unicode Consortium, "The Unicode Standard, Version | |||
| 3.2.0", The Unicode Standard, Version 3.2.0 ISBN | 3.2.0", The Unicode Standard, Version 3.2.0 ISBN | |||
| 0-201-61633-5, March 2002. | 0-201-61633-5, March 2002. | |||
| [Unicode-5.2.0] | [Unicode-5.2.0] | |||
| The Unicode Consortium, "The Unicode Standard, Version | The Unicode Consortium, "The Unicode Standard, Version | |||
| 5.2.0", The Unicode Standard, Version 5.2.0 ISBN | 5.2.0", The Unicode Standard, Version 5.2.0 ISBN | |||
| 978-1-936213-00-9, October 2009. | 978-1-936213-00-9, October 2009. | |||
| [Unicode-6.0.0] | [Unicode-6.0.0] | |||
| The Unicode Consortium, "The Unicode Standard, Version | The Unicode Consortium, "The Unicode Standard, Version | |||
| 6.0.0", The Unicode Standard, Version 6.0.0 ISBN | 6.0.0", The Unicode Standard, Version 6.0.0 ISBN | |||
| 978-1-936213-01-6, October 2011. | 978-1-936213-01-6, October 2011. | |||
| [Unicode-6.3.0] | ||||
| The Unicode Consortium, "The Unicode Standard, Version | ||||
| 6.3.0", The Unicode Standard, Version 6.3.0 ISBN | ||||
| 978-1-936213-08-5, September 2013. | ||||
| [Unicode-7.0.0] | [Unicode-7.0.0] | |||
| The Unicode Consortium, "The Unicode Standard, Version | The Unicode Consortium, "The Unicode Standard, Version | |||
| 7.0.0", The Unicode Standard, Version 7.0.0 ISBN | 7.0.0", The Unicode Standard, Version 7.0.0 ISBN | |||
| 978-1-936213-09-2, June 2014. | 978-1-936213-09-2, June 2014. | |||
| [Unicode-8.0.0] | [Unicode-8.0.0] | |||
| The Unicode Consortium, "The Unicode Standard, Version | The Unicode Consortium, "The Unicode Standard, Version | |||
| 8.0.0", The Unicode Standard, Version 8.0.0 ISBN | 8.0.0", The Unicode Standard, Version 8.0.0 ISBN | |||
| 978-1-936213-10-8, June 2015. | 978-1-936213-10-8, June 2015. | |||
| [Unicode-9.0.0] | [Unicode-10.0.0] | |||
| The Unicode Consortium, "The Unicode Standard, Version | The Unicode Consortium, "The Unicode Standard, Version | |||
| 9.0.0", The Unicode Standard, Version 9.0.0 ISBN | 10.0.0", The Unicode Standard, Version 10.0.0 ISBN | |||
| 978-1-936213-13-9, June 2016. | 978-1-936213-16-0, June 2017. | |||
| [Unicode-11.0.0] | ||||
| The Unicode Consortium, "The Unicode Standard, Version | ||||
| 11.0.0", The Unicode Standard, Version 11.0.0 ISBN | ||||
| 978-1-936213-19-1, June 2018. | ||||
| [Unicode-12.0.0] | ||||
| The Unicode Consortium, "The Unicode Standard, Version | ||||
| 12.0.0", The Unicode Standard, Version 12.0.0 ISBN | ||||
| 978-1-936213-22-1, March 2019. | ||||
| [UTS-46] The Unicode Consortium, "Unicode Technical Standard #46, | [UTS-46] The Unicode Consortium, "Unicode Technical Standard #46, | |||
| Version 12.0.0", UNICODE IDNA COMPATIBILITY | Version 12.0.0", UNICODE IDNA COMPATIBILITY | |||
| PROCESSING http://www.unicode.org/reports/tr46/, March | PROCESSING <https://www.unicode.org/reports/tr46/>, March | |||
| 2019. | 2019. | |||
| Appendix A. Changes from Unicode 6.3.0 to Unicode 7.0.0 | Appendix A. Changes from Unicode 6.0.0 to Unicode 7.0.0 | |||
| Changes from derived property value UNASSIGNED to either PVALID or | Changes from derived property value UNASSIGNED to either PVALID or | |||
| DISALLOWED. | DISALLOWED. | |||
| 037F ; DISALLOWED # GREEK CAPITAL LETTER YOT | 037F ; DISALLOWED # GREEK CAPITAL LETTER YOT | |||
| 0528..052F ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH LEFT HOOK..C | 0528 ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH LEFT HOOK | |||
| 058D..058E ; DISALLOWED # RIGHT-FACING ARMENIAN ETERNITY SIGN..LEFT-FA | 0529 ; PVALID # CYRILLIC SMALL LETTER EN WITH LEFT HOOK | |||
| 0605 ; DISALLOWED # ARABIC NUMBER MARK ABOVE | 052A ; DISALLOWED # CYRILLIC CAPITAL LETTER DZZHE | |||
| 08A1 ; PVALID # ARABIC LETTER BEH WITH HAMZA ABOVE | 052B ; PVALID # CYRILLIC SMALL LETTER DZZHE | |||
| 08AD..08B2 ; PVALID # ARABIC LETTER LOW ALEF..ARABIC LETTER ZAIN W | 052C ; DISALLOWED # CYRILLIC CAPITAL LETTER DCHE | |||
| 08FF ; PVALID # ARABIC MARK SIDEWAYS NOON GHUNNA | 052D ; PVALID # CYRILLIC SMALL LETTER DCHE | |||
| 0978 ; PVALID # DEVANAGARI LETTER MARWARI DDA | 052E ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH DESCENDER | |||
| 0980 ; PVALID # BENGALI ANJI | 052F ; PVALID # CYRILLIC SMALL LETTER EL WITH DESCENDER | |||
| 0C00 ; PVALID # TELUGU SIGN COMBINING CANDRABINDU ABOVE | 058D..058F ; DISALLOWED # RIGHT-FACING ARMENIAN ETERNITY SIGN..ARMENIAN | |||
| 0C34 ; PVALID # TELUGU LETTER LLLA | 0604..0605 ; DISALLOWED # ARABIC SIGN SAMVAT..ARABIC NUMBER MARK ABOVE | |||
| 0C81 ; PVALID # KANNADA SIGN CANDRABINDU | 061C ; DISALLOWED # ARABIC LETTER MARK | |||
| 0D01 ; PVALID # MALAYALAM SIGN CANDRABINDU | 08A0..08B2 ; PVALID # ARABIC LETTER BEH WITH SMALL V BELOW..ARABIC | |||
| 0DE6..0DEF ; PVALID # SINHALA LITH DIGIT ZERO..SINHALA LITH DIGIT | 08E4..08FF ; PVALID # ARABIC CURLY FATHA..ARABIC MARK SIDEWAYS NOON | |||
| 16F1..16F8 ; PVALID # RUNIC LETTER K..RUNIC LETTER FRANKS CASKET A | 0978 ; PVALID # DEVANAGARI LETTER MARWARI DDA | |||
| 191D..191E ; PVALID # LIMBU LETTER GYAN..LIMBU LETTER TRA | 0980 ; PVALID # BENGALI ANJI | |||
| 1AB0..1ABE ; PVALID # COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINI | 0AF0 ; DISALLOWED # GUJARATI ABBREVIATION SIGN | |||
| 1CF8..1CF9 ; PVALID # VEDIC TONE RING ABOVE..VEDIC TONE DOUBLE RIN | 0C00 ; PVALID # TELUGU SIGN COMBINING CANDRABINDU ABOVE | |||
| 1DE7..1DF5 ; PVALID # COMBINING LATIN SMALL LETTER ALPHA..COMBININ | 0C34 ; PVALID # TELUGU LETTER LLLA | |||
| 20BB..20BD ; DISALLOWED # NORDIC MARK SIGN..RUBLE SIGN | 0C81 ; PVALID # KANNADA SIGN CANDRABINDU | |||
| 23F4..23FA ; DISALLOWED # BLACK MEDIUM LEFT-POINTING TRIANGLE..BLACK C | 0D01 ; PVALID # MALAYALAM SIGN CANDRABINDU | |||
| 2700 ; DISALLOWED # BLACK SAFETY SCISSORS | 0DE6..0DEF ; PVALID # SINHALA LITH DIGIT ZERO..SINHALA LITH DIGIT N | |||
| 2B4D..2B4F ; DISALLOWED # DOWNWARDS TRIANGLE-HEADED ZIGZAG ARROW..SHOR | 0EDE..0EDF ; PVALID # LAO LETTER KHMU GO..LAO LETTER KHMU NYO | |||
| 2B5A..2B73 ; DISALLOWED # SLANTED NORTH ARROW WITH HOOKED HEAD..DOWNWA | 10C7 ; DISALLOWED # GEORGIAN CAPITAL LETTER YN | |||
| 2B76..2B95 ; DISALLOWED # NORTH WEST TRIANGLE-HEADED ARROW TO BAR..RIG | 10CD ; DISALLOWED # GEORGIAN CAPITAL LETTER AEN | |||
| 2B98..2BB9 ; DISALLOWED # THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL AR | 10FD..10FF ; PVALID # GEORGIAN LETTER AEN..GEORGIAN LETTER LABIAL S | |||
| 2BBD..2BC8 ; DISALLOWED # BALLOT BOX WITH LIGHT X..BLACK MEDIUM RIGHT- | 16F1..16F8 ; PVALID # RUNIC LETTER K..RUNIC LETTER FRANKS CASKET AE | |||
| 2BCA..2BD1 ; DISALLOWED # TOP HALF BLACK CIRCLE..UNCERTAINTY SIGN | 17B4..17B5 ; DISALLOWED # KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT | |||
| 2E3C..2E42 ; DISALLOWED # STENOGRAPHIC FULL STOP..DOUBLE LOW-REVERSED- | 191D..191E ; PVALID # LIMBU LETTER GYAN..LIMBU LETTER TRA | |||
| A698..A69D ; DISALLOWED # CYRILLIC CAPITAL LETTER DOUBLE O..MODIFIER L | 1AB0..1ABD ; PVALID # COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBININ | |||
| A794..A79F ; PVALID # LATIN SMALL LETTER C WITH PALATAL HOOK..LATI | 1ABE ; DISALLOWED # COMBINING PARENTHESES OVERLAY | |||
| A7AB..A7AD ; DISALLOWED # LATIN CAPITAL LETTER REVERSED OPEN E..LATIN | 1BAB..1BAD ; PVALID # SUNDANESE SIGN VIRAMA..SUNDANESE CONSONANT SI | |||
| A7B0..A7B1 ; DISALLOWED # LATIN CAPITAL LETTER TURNED K..LATIN CAPITAL | 1BBA..1BBF ; PVALID # SUNDANESE AVAGRAHA..SUNDANESE LETTER FINAL M | |||
| A7F7 ; PVALID # LATIN EPIGRAPHIC LETTER SIDEWAYS I | 1CC0..1CC7 ; DISALLOWED # SUNDANESE PUNCTUATION BINDU SURYA..SUNDANESE | |||
| A9E0..A9FE ; PVALID # MYANMAR LETTER SHAN GHA..MYANMAR LETTER TAI | 1CF3..1CF6 ; PVALID # VEDIC SIGN ROTATED ARDHAVISARGA..VEDIC SIGN U | |||
| AA7C..AA7F ; PVALID # MYANMAR SIGN TAI LAING TONE-2..MYANMAR LETTE | 1CF8..1CF9 ; PVALID # VEDIC TONE RING ABOVE..VEDIC TONE DOUBLE RING | |||
| AB30..AB5F ; PVALID # LATIN SMALL LETTER BARRED ALPHA..MODIFIER LE | 1DE7..1DF5 ; PVALID # COMBINING LATIN SMALL LETTER ALPHA..COMBINING | |||
| AB64..AB65 ; PVALID # LATIN SMALL LETTER INVERTED ALPHA..GREEK LET | 2066..2069 ; DISALLOWED # LEFT-TO-RIGHT ISOLATE..POP DIRECTIONAL ISOLAT | |||
| FE27..FE2D ; PVALID # COMBINING LIGATURE LEFT HALF BELOW..COMBININ | 20BA..20BD ; DISALLOWED # TURKISH LIRA SIGN..RUBLE SIGN | |||
| 1018B..1018C; DISALLOWED # GREEK ONE QUARTER SIGN..GREEK SINUSOID SIGN | 23F4..23FA ; DISALLOWED # BLACK MEDIUM LEFT-POINTING TRIANGLE..BLACK CI | |||
| 101A0 ; DISALLOWED # GREEK SYMBOL TAU RHO | 2700 ; DISALLOWED # BLACK SAFETY SCISSORS | |||
| 102E0..102FB; PVALID # COPTIC EPACT THOUSANDS MARK..COPTIC EPACT NU | 27CB ; DISALLOWED # MATHEMATICAL RISING DIAGONAL | |||
| 1031F ; PVALID # OLD ITALIC LETTER ESS | 27CD ; DISALLOWED # MATHEMATICAL FALLING DIAGONAL | |||
| 10350..1037A; PVALID # OLD PERMIC LETTER AN..COMBINING OLD PERMIC L | 2B4D..2B4F ; DISALLOWED # DOWNWARDS TRIANGLE-HEADED ZIGZAG ARROW..SHORT | |||
| 10500..10527; PVALID # ELBASAN LETTER A..ELBASAN LETTER KHE | 2B5A..2B73 ; DISALLOWED # SLANTED NORTH ARROW WITH HOOKED HEAD..DOWNWAR | |||
| 10530..10563; PVALID # CAUCASIAN ALBANIAN LETTER ALT..CAUCASIAN ALB | 2B76..2B95 ; DISALLOWED # NORTH WEST TRIANGLE-HEADED ARROW TO BAR..RIGH | |||
| 1056F ; DISALLOWED # CAUCASIAN ALBANIAN CITATION MARK | 2B98..2BB9 ; DISALLOWED # THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL ARR | |||
| 10600..10736; PVALID # LINEAR A SIGN AB001..LINEAR A SIGN A664 | 2BBD..2BC8 ; DISALLOWED # BALLOT BOX WITH LIGHT X..BLACK MEDIUM RIGHT-P | |||
| 10740..10755; PVALID # LINEAR A SIGN A701 A..LINEAR A SIGN A732 JE | 2BCA..2BD1 ; DISALLOWED # TOP HALF BLACK CIRCLE..UNCERTAINTY SIGN | |||
| 10760..10767; PVALID # LINEAR A SIGN A800..LINEAR A SIGN A807 | 2CF2 ; DISALLOWED # COPTIC CAPITAL LETTER BOHAIRIC KHEI | |||
| 10860..1089E; PVALID # PALMYRENE LETTER ALEPH..NABATAEAN LETTER TAW | 2CF3 ; PVALID # COPTIC SMALL LETTER BOHAIRIC KHEI | |||
| 108A7..108AF; DISALLOWED # NABATAEAN NUMBER ONE..NABATAEAN NUMBER ONE H | 2D27 ; PVALID # GEORGIAN SMALL LETTER YN | |||
| 10A80..10A9F; PVALID # OLD NORTH ARABIAN LETTER HEH..OLD NORTH ARAB | 2D2D ; PVALID # GEORGIAN SMALL LETTER AEN | |||
| 10AC0..10AE6; PVALID # MANICHAEAN LETTER ALEPH..MANICHAEAN ABBREVIA | 2D66..2D67 ; PVALID # TIFINAGH LETTER YE..TIFINAGH LETTER YO | |||
| 10AEB..10AF6; DISALLOWED # MANICHAEAN NUMBER ONE..MANICHAEAN PUNCTUATIO | 2E32..2E42 ; DISALLOWED # TURNED COMMA..DOUBLE LOW-REVERSED-9 QUOTATION | |||
| 10B80..10B91; PVALID # PSALTER PAHLAVI LETTER ALEPH..PSALTER PAHLAV | 9FCC ; PVALID # <CJK Ideograph> | |||
| 10B99..10B9C; DISALLOWED # PSALTER PAHLAVI SECTION MARK..PSALTER PAHLAV | A674..A67B ; PVALID # COMBINING CYRILLIC LETTER UKRAINIAN IE..COMBI | |||
| 10BA9..10BAF; DISALLOWED # PSALTER PAHLAVI NUMBER ONE..PSALTER PAHLAVI | A698 ; DISALLOWED # CYRILLIC CAPITAL LETTER DOUBLE O | |||
| 1107F ; PVALID # BRAHMI NUMBER JOINER | A699 ; PVALID # CYRILLIC SMALL LETTER DOUBLE O | |||
| 11150..11176; PVALID # MAHAJANI LETTER A..MAHAJANI LIGATURE SHRI | A69A ; DISALLOWED # CYRILLIC CAPITAL LETTER CROSSED O | |||
| 111CD ; DISALLOWED # SHARADA SUTRA MARK | A69B ; PVALID # CYRILLIC SMALL LETTER CROSSED O | |||
| 111DA ; PVALID # SHARADA EKAM | A69C..A69D ; DISALLOWED # MODIFIER LETTER CYRILLIC HARD SIGN..MODIFIER | |||
| 111E1..111F4; DISALLOWED # SINHALA ARCHAIC DIGIT ONE..SINHALA ARCHAIC N | A69F ; PVALID # COMBINING CYRILLIC LETTER IOTIFIED E | |||
| 11200..11211; PVALID # KHOJKI LETTER A..KHOJKI LETTER JJA | A792 ; DISALLOWED # LATIN CAPITAL LETTER C WITH BAR | |||
| 11213..1123D; PVALID # KHOJKI LETTER NYA..KHOJKI ABBREVIATION SIGN | A793..A795 ; PVALID # LATIN SMALL LETTER C WITH BAR..LATIN SMALL LE | |||
| 112B0..112EA; PVALID # KHUDAWADI LETTER A..KHUDAWADI SIGN VIRAMA | A796 ; DISALLOWED # LATIN CAPITAL LETTER B WITH FLOURISH | |||
| 112F0..112F9; PVALID # KHUDAWADI DIGIT ZERO..KHUDAWADI DIGIT NINE | A797 ; PVALID # LATIN SMALL LETTER B WITH FLOURISH | |||
| 11301..11303; PVALID # GRANTHA SIGN CANDRABINDU..GRANTHA SIGN VISAR | A798 ; DISALLOWED # LATIN CAPITAL LETTER F WITH STROKE | |||
| 11305..1130C; PVALID # GRANTHA LETTER A..GRANTHA LETTER VOCALIC L | A799 ; PVALID # LATIN SMALL LETTER F WITH STROKE | |||
| 1130F..11310; PVALID # GRANTHA LETTER EE..GRANTHA LETTER AI | A79A ; DISALLOWED # LATIN CAPITAL LETTER VOLAPUK AE | |||
| 11313..11328; PVALID # GRANTHA LETTER OO..GRANTHA LETTER NA | A79B ; PVALID # LATIN SMALL LETTER VOLAPUK AE | |||
| 1132A..11330; PVALID # GRANTHA LETTER PA..GRANTHA LETTER RA | A79C ; DISALLOWED # LATIN CAPITAL LETTER VOLAPUK OE | |||
| 11332..11333; PVALID # GRANTHA LETTER LA..GRANTHA LETTER LLA | A79D ; PVALID # LATIN SMALL LETTER VOLAPUK OE | |||
| 11335..11339; PVALID # GRANTHA LETTER VA..GRANTHA LETTER HA | A79E ; DISALLOWED # LATIN CAPITAL LETTER VOLAPUK UE | |||
| 1133C..11344; PVALID # GRANTHA SIGN NUKTA..GRANTHA VOWEL SIGN VOCAL | A79F ; PVALID # LATIN SMALL LETTER VOLAPUK UE | |||
| 11347..11348; PVALID # GRANTHA VOWEL SIGN EE..GRANTHA VOWEL SIGN AI | A7AA..A7AD ; DISALLOWED # LATIN CAPITAL LETTER H WITH HOOK..LATIN CAPIT | |||
| 1134B..1134D; PVALID # GRANTHA VOWEL SIGN OO..GRANTHA SIGN VIRAMA | A7B0..A7B1 ; DISALLOWED # LATIN CAPITAL LETTER TURNED K..LATIN CAPITAL | |||
| 11357 ; PVALID # GRANTHA AU LENGTH MARK | A7F7 ; PVALID # LATIN EPIGRAPHIC LETTER SIDEWAYS I | |||
| 1135D..11363; PVALID # GRANTHA SIGN PLUTA..GRANTHA VOWEL SIGN VOCAL | A7F8..A7F9 ; DISALLOWED # MODIFIER LETTER CAPITAL H WITH STROKE..MODIFI | |||
| 11366..1136C; PVALID # COMBINING GRANTHA DIGIT ZERO..COMBINING GRAN | A9E0..A9FE ; PVALID # MYANMAR LETTER SHAN GHA..MYANMAR LETTER TAI L | |||
| 11370..11374; PVALID # COMBINING GRANTHA LETTER A..COMBINING GRANTH | AA7C..AA7F ; PVALID # MYANMAR SIGN TAI LAING TONE-2..MYANMAR LETTER | |||
| 11480..114C7; PVALID # TIRHUTA ANJI..TIRHUTA OM | AAE0..AAEF ; PVALID # MEETEI MAYEK LETTER E..MEETEI MAYEK VOWEL SIG | |||
| 114D0..114D9; PVALID # TIRHUTA DIGIT ZERO..TIRHUTA DIGIT NINE | AAF0..AAF1 ; DISALLOWED # MEETEI MAYEK CHEIKHAN..MEETEI MAYEK AHANG KHU | |||
| 11580..115B5; PVALID # SIDDHAM LETTER A..SIDDHAM VOWEL SIGN VOCALIC | AAF2..AAF6 ; PVALID # MEETEI MAYEK ANJI..MEETEI MAYEK VIRAMA | |||
| 115B8..115C9; PVALID # SIDDHAM VOWEL SIGN E..SIDDHAM END OF TEXT MA | AB30..AB5A ; PVALID # LATIN SMALL LETTER BARRED ALPHA..LATIN SMALL | |||
| 11600..11644; PVALID # MODI LETTER A..MODI SIGN HUVA | AB5B..AB5F ; DISALLOWED # MODIFIER BREVE WITH INVERTED BREVE..MODIFIER | |||
| 11650..11659; PVALID # MODI DIGIT ZERO..MODI DIGIT NINE | AB64..AB65 ; PVALID # LATIN SMALL LETTER INVERTED ALPHA..GREEK LETT | |||
| 118A0..118F2; DISALLOWED # WARANG CITI CAPITAL LETTER NGAA..WARANG CITI | FA2E..FA2F ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA2E..CJK COMPATI | |||
| 118FF ; PVALID # WARANG CITI OM | FE27..FE2D ; PVALID # COMBINING LIGATURE LEFT HALF BELOW..COMBINING | |||
| 11AC0..11AF8; PVALID # PAU CIN HAU LETTER PA..PAU CIN HAU GLOTTAL S | 1018B..1018C; DISALLOWED # GREEK ONE QUARTER SIGN..GREEK SINUSOID SIGN | |||
| 1236F..12398; PVALID # CUNEIFORM SIGN KAP ELAMITE..CUNEIFORM SIGN U | 101A0 ; DISALLOWED # GREEK SYMBOL TAU RHO | |||
| 12463..1246E; DISALLOWED # CUNEIFORM NUMERIC SIGN ONE QUARTER GUR..CUNE | 102E0 ; PVALID # COPTIC EPACT THOUSANDS MARK | |||
| 12474 ; DISALLOWED # CUNEIFORM PUNCTUATION SIGN DIAGONAL QUADCOLO | 102E1..102FB; DISALLOWED # COPTIC EPACT DIGIT ONE..COPTIC EPACT NUMBER N | |||
| 16A40..16A5E; PVALID # MRO LETTER TA..MRO LETTER TEK | 1031F ; PVALID # OLD ITALIC LETTER ESS | |||
| 16A60..16A69; PVALID # MRO DIGIT ZERO..MRO DIGIT NINE | 10350..1037A; PVALID # OLD PERMIC LETTER AN..COMBINING OLD PERMIC LE | |||
| 16A6E..16A6F; DISALLOWED # MRO DANDA..MRO DOUBLE DANDA | 10500..10527; PVALID # ELBASAN LETTER A..ELBASAN LETTER KHE | |||
| 16AD0..16AED; PVALID # BASSA VAH LETTER ENNI..BASSA VAH LETTER I | 10530..10563; PVALID # CAUCASIAN ALBANIAN LETTER ALT..CAUCASIAN ALBA | |||
| 16AF0..16AF5; PVALID # BASSA VAH COMBINING HIGH TONE..BASSA VAH FUL | 1056F ; DISALLOWED # CAUCASIAN ALBANIAN CITATION MARK | |||
| 16B00..16B45; PVALID # PAHAWH HMONG VOWEL KEEB..PAHAWH HMONG SIGN C | 10600..10736; PVALID # LINEAR A SIGN AB001..LINEAR A SIGN A664 | |||
| 16B50..16B59; PVALID # PAHAWH HMONG DIGIT ZERO..PAHAWH HMONG DIGIT | 10740..10755; PVALID # LINEAR A SIGN A701 A..LINEAR A SIGN A732 JE | |||
| 16B5B..16B61; DISALLOWED # PAHAWH HMONG NUMBER TENS..PAHAWH HMONG NUMBE | 10760..10767; PVALID # LINEAR A SIGN A800..LINEAR A SIGN A807 | |||
| 16B63..16B77; PVALID # PAHAWH HMONG SIGN VOS LUB..PAHAWH HMONG SIGN | 10860..10876; PVALID # PALMYRENE LETTER ALEPH..PALMYRENE LETTER TAW | |||
| 16B7D..16B8F; PVALID # PAHAWH HMONG CLAN SIGN TSHEEJ..PAHAWH HMONG | 10877..1087F; DISALLOWED # PALMYRENE LEFT-POINTING FLEURON..PALMYRENE NU | |||
| 1BC00..1BC6A; PVALID # DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M | 10880..1089E; PVALID # NABATAEAN LETTER FINAL ALEPH..NABATAEAN LETTE | |||
| 1BC70..1BC7C; PVALID # DUPLOYAN AFFIX LEFT HORIZONTAL SECANT..DUPLO | 108A7..108AF; DISALLOWED # NABATAEAN NUMBER ONE..NABATAEAN NUMBER ONE HU | |||
| 1BC80..1BC88; PVALID # DUPLOYAN AFFIX HIGH ACUTE..DUPLOYAN AFFIX HI | 10980..109B7; PVALID # MEROITIC HIEROGLYPHIC LETTER A..MEROITIC CURS | |||
| 1BC90..1BC99; PVALID # DUPLOYAN AFFIX LOW ACUTE..DUPLOYAN AFFIX LOW | 109BE..109BF; PVALID # MEROITIC CURSIVE LOGOGRAM RMT..MEROITIC CURSI | |||
| 1BC9C..1BCA3; DISALLOWED # DUPLOYAN SIGN O WITH CROSS..SHORTHAND FORMAT | 10A80..10A9C; PVALID # OLD NORTH ARABIAN LETTER HEH..OLD NORTH ARABI | |||
| 1E800..1E8C4; PVALID # MENDE KIKAKUI SYLLABLE M001 KI..MENDE KIKAKU | 10A9D..10A9F; DISALLOWED # OLD NORTH ARABIAN NUMBER ONE..OLD NORTH ARABI | |||
| 1E8C7..1E8D6; DISALLOWED # MENDE KIKAKUI DIGIT ONE..MENDE KIKAKUI COMBI | 10AC0..10AC7; PVALID # MANICHAEAN LETTER ALEPH..MANICHAEAN LETTER WA | |||
| 1F0BF ; DISALLOWED # PLAYING CARD RED JOKER | 10AC8 ; DISALLOWED # MANICHAEAN SIGN UD | |||
| 1F0E0..1F0F5; DISALLOWED # PLAYING CARD FOOL..PLAYING CARD TRUMP-21 | 10AC9..10AE6; PVALID # MANICHAEAN LETTER ZAYIN..MANICHAEAN ABBREVIAT | |||
| 1F10B..1F10C; DISALLOWED # DINGBAT CIRCLED SANS-SERIF DIGIT ZERO..DINGB | 10AEB..10AF6; DISALLOWED # MANICHAEAN NUMBER ONE..MANICHAEAN PUNCTUATION | |||
| 1F321..1F32C; DISALLOWED # THERMOMETER..WIND BLOWING FACE | 10B80..10B91; PVALID # PSALTER PAHLAVI LETTER ALEPH..PSALTER PAHLAVI | |||
| 1F336 ; DISALLOWED # HOT PEPPER | 10B99..10B9C; DISALLOWED # PSALTER PAHLAVI SECTION MARK..PSALTER PAHLAVI | |||
| 1F37D ; DISALLOWED # FORK AND KNIFE WITH PLATE | 10BA9..10BAF; DISALLOWED # PSALTER PAHLAVI NUMBER ONE..PSALTER PAHLAVI N | |||
| 1F394..1F39F; DISALLOWED # HEART WITH TIP ON THE LEFT..ADMISSION TICKET | 1107F ; PVALID # BRAHMI NUMBER JOINER | |||
| 1F3C5 ; DISALLOWED # SPORTS MEDAL | 110D0..110E8; PVALID # SORA SOMPENG LETTER SAH..SORA SOMPENG LETTER | |||
| 1F3CB..1F3CE; DISALLOWED # WEIGHT LIFTER..RACING CAR | 110F0..110F9; PVALID # SORA SOMPENG DIGIT ZERO..SORA SOMPENG DIGIT N | |||
| 1F3D4..1F3DF; DISALLOWED # SNOW CAPPED MOUNTAIN..STADIUM | 11100..11134; PVALID # CHAKMA SIGN CANDRABINDU..CHAKMA MAAYYAA | |||
| 1F3F1..1F3F7; DISALLOWED # WHITE PENNANT..LABEL | 11136..1113F; PVALID # CHAKMA DIGIT ZERO..CHAKMA DIGIT NINE | |||
| 1F43F ; DISALLOWED # CHIPMUNK | 11140..11143; DISALLOWED # CHAKMA SECTION MARK..CHAKMA QUESTION MARK | |||
| 1F441 ; DISALLOWED # EYE | 11150..11173; PVALID # MAHAJANI LETTER A..MAHAJANI SIGN NUKTA | |||
| 1F4F8 ; DISALLOWED # CAMERA WITH FLASH | 11174..11175; DISALLOWED # MAHAJANI ABBREVIATION SIGN..MAHAJANI SECTION | |||
| 1F4FD..1F4FE; DISALLOWED # FILM PROJECTOR..PORTABLE STEREO | 11176 ; PVALID # MAHAJANI LIGATURE SHRI | |||
| 1F53E..1F53F; DISALLOWED # LOWER RIGHT SHADOWED WHITE CIRCLE..UPPER RIG | 11180..111C4; PVALID # SHARADA SIGN CANDRABINDU..SHARADA OM | |||
| 1F544..1F54A; DISALLOWED # NOTCHED RIGHT SEMICIRCLE WITH THREE DOTS..DO | 111C5..111C8; DISALLOWED # SHARADA DANDA..SHARADA SEPARATOR | |||
| 1F568..1F579; DISALLOWED # RIGHT SPEAKER..JOYSTICK | 111CD ; DISALLOWED # SHARADA SUTRA MARK | |||
| 1F57B..1F5A3; DISALLOWED # LEFT HAND TELEPHONE RECEIVER..BLACK DOWN POI | 111D0..111DA; PVALID # SHARADA DIGIT ZERO..SHARADA EKAM | |||
| 1F5A5..1F5FA; DISALLOWED # DESKTOP COMPUTER..WORLD MAP | 111E1..111F4; DISALLOWED # SINHALA ARCHAIC DIGIT ONE..SINHALA ARCHAIC NU | |||
| 1F641..1F642; DISALLOWED # SLIGHTLY FROWNING FACE..SLIGHTLY SMILING FAC | 11200..11211; PVALID # KHOJKI LETTER A..KHOJKI LETTER JJA | |||
| 1F650..1F67F; DISALLOWED # NORTH WEST POINTING LEAF..REVERSE CHECKER BO | 11213..11237; PVALID # KHOJKI LETTER NYA..KHOJKI SIGN SHADDA | |||
| 1F6C6..1F6CF; DISALLOWED # TRIANGLE WITH ROUNDED CORNERS..BED | 11238..1123D; DISALLOWED # KHOJKI DANDA..KHOJKI ABBREVIATION SIGN | |||
| 1F6E0..1F6EC; DISALLOWED # HAMMER AND WRENCH..AIRPLANE ARRIVING | 112B0..112EA; PVALID # KHUDAWADI LETTER A..KHUDAWADI SIGN VIRAMA | |||
| 1F6F0..1F6F3; DISALLOWED # SATELLITE..PASSENGER SHIP | 112F0..112F9; PVALID # KHUDAWADI DIGIT ZERO..KHUDAWADI DIGIT NINE | |||
| 1F780..1F7D4; DISALLOWED # BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE | 11301..11303; PVALID # GRANTHA SIGN CANDRABINDU..GRANTHA SIGN VISARG | |||
| 1F800..1F80B; DISALLOWED # LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEA | 11305..1130C; PVALID # GRANTHA LETTER A..GRANTHA LETTER VOCALIC L | |||
| 1F810..1F847; DISALLOWED # LEFTWARDS ARROW WITH SMALL EQUILATERAL ARROW | 1130F..11310; PVALID # GRANTHA LETTER EE..GRANTHA LETTER AI | |||
| 1F850..1F859; DISALLOWED # LEFTWARDS SANS-SERIF ARROW..UP DOWN SANS-SER | 11313..11328; PVALID # GRANTHA LETTER OO..GRANTHA LETTER NA | |||
| 1F860..1F887; DISALLOWED # WIDE-HEADED LEFTWARDS LIGHT BARB ARROW..WIDE | 1132A..11330; PVALID # GRANTHA LETTER PA..GRANTHA LETTER RA | |||
| | 11332..11333; PVALID # GRANTHA LETTER LA..GRANTHA LETTER LLA | |||
| | 11335..11339; PVALID # GRANTHA LETTER VA..GRANTHA LETTER HA | |||
| | 1133C..11344; PVALID # GRANTHA SIGN NUKTA..GRANTHA VOWEL SIGN VOCALI | |||
| | 11347..11348; PVALID # GRANTHA VOWEL SIGN EE..GRANTHA VOWEL SIGN AI | |||
| | 1134B..1134D; PVALID # GRANTHA VOWEL SIGN OO..GRANTHA SIGN VIRAMA | |||
| | 11357 ; PVALID # GRANTHA AU LENGTH MARK | |||
| | 1135D..11363; PVALID # GRANTHA SIGN PLUTA..GRANTHA VOWEL SIGN VOCALI | |||
| | 11366..1136C; PVALID # COMBINING GRANTHA DIGIT ZERO..COMBINING GRANT | |||
| | 11370..11374; PVALID # COMBINING GRANTHA LETTER A..COMBINING GRANTHA | |||
| | 11480..114C5; PVALID # TIRHUTA ANJI..TIRHUTA GVANG | |||
| | 114C6 ; DISALLOWED # TIRHUTA ABBREVIATION SIGN | |||
| | 114C7 ; PVALID # TIRHUTA OM | |||
| | 114D0..114D9; PVALID # TIRHUTA DIGIT ZERO..TIRHUTA DIGIT NINE | |||
| | 11580..115B5; PVALID # SIDDHAM LETTER A..SIDDHAM VOWEL SIGN VOCALIC | |||
| | 115B8..115C0; PVALID # SIDDHAM VOWEL SIGN E..SIDDHAM SIGN NUKTA | |||
| | 115C1..115C9; DISALLOWED # SIDDHAM SIGN SIDDHAM..SIDDHAM END OF TEXT MAR | |||
| | 11600..11640; PVALID # MODI LETTER A..MODI SIGN ARDHACANDRA | |||
| | 11641..11643; DISALLOWED # MODI DANDA..MODI ABBREVIATION SIGN | |||
| | 11644 ; PVALID # MODI SIGN HUVA | |||
| | 11650..11659; PVALID # MODI DIGIT ZERO..MODI DIGIT NINE | |||
| | 11680..116B7; PVALID # TAKRI LETTER A..TAKRI SIGN NUKTA | |||
| | 116C0..116C9; PVALID # TAKRI DIGIT ZERO..TAKRI DIGIT NINE | |||
| | 118A0..118BF; DISALLOWED # WARANG CITI CAPITAL LETTER NGAA..WARANG CITI | |||
| | 118C0..118E9; PVALID # WARANG CITI SMALL LETTER NGAA..WARANG CITI DI | |||
| | 118EA..118F2; DISALLOWED # WARANG CITI NUMBER TEN..WARANG CITI NUMBER NI | |||
| | 118FF ; PVALID # WARANG CITI OM | |||
| | 11AC0..11AF8; PVALID # PAU CIN HAU LETTER PA..PAU CIN HAU GLOTTAL ST | |||
| | 1236F..12398; PVALID # CUNEIFORM SIGN KAP ELAMITE..CUNEIFORM SIGN UM | |||
| | 12463..1246E; DISALLOWED # CUNEIFORM NUMERIC SIGN ONE QUARTER GUR..CUNEI | |||
| | 12474 ; DISALLOWED # CUNEIFORM PUNCTUATION SIGN DIAGONAL QUADCOLON | |||
| | 16A40..16A5E; PVALID # MRO LETTER TA..MRO LETTER TEK | |||
| | 16A60..16A69; PVALID # MRO DIGIT ZERO..MRO DIGIT NINE | |||
| | 16A6E..16A6F; DISALLOWED # MRO DANDA..MRO DOUBLE DANDA | |||
| | 16AD0..16AED; PVALID # BASSA VAH LETTER ENNI..BASSA VAH LETTER I | |||
| | 16AF0..16AF4; PVALID # BASSA VAH COMBINING HIGH TONE..BASSA VAH COMB | |||
| | 16AF5 ; DISALLOWED # BASSA VAH FULL STOP | |||
| | 16B00..16B36; PVALID # PAHAWH HMONG VOWEL KEEB..PAHAWH HMONG MARK CI | |||
| | 16B37..16B3F; DISALLOWED # PAHAWH HMONG SIGN VOS THOM..PAHAWH HMONG SIGN | |||
| | 16B40..16B43; PVALID # PAHAWH HMONG SIGN VOS SEEV..PAHAWH HMONG SIGN | |||
| | 16B44..16B45; DISALLOWED # PAHAWH HMONG SIGN XAUS..PAHAWH HMONG SIGN CIM | |||
| | 16B50..16B59; PVALID # PAHAWH HMONG DIGIT ZERO..PAHAWH HMONG DIGIT N | |||
| | 16B5B..16B61; DISALLOWED # PAHAWH HMONG NUMBER TENS..PAHAWH HMONG NUMBER | |||
| | 16B63..16B77; PVALID # PAHAWH HMONG SIGN VOS LUB..PAHAWH HMONG SIGN | |||
| | 16B7D..16B8F; PVALID # PAHAWH HMONG CLAN SIGN TSHEEJ..PAHAWH HMONG C | |||
| | 16F00..16F44; PVALID # MIAO LETTER PA..MIAO LETTER HHA | |||
| | 16F50..16F7E; PVALID # MIAO LETTER NASALIZATION..MIAO VOWEL SIGN NG | |||
| | 16F8F..16F9F; PVALID # MIAO TONE RIGHT..MIAO LETTER REFORMED TONE-8 | |||
| | 1BC00..1BC6A; PVALID # DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M | |||
| | 1BC70..1BC7C; PVALID # DUPLOYAN AFFIX LEFT HORIZONTAL SECANT..DUPLOY | |||
| | 1BC80..1BC88; PVALID # DUPLOYAN AFFIX HIGH ACUTE..DUPLOYAN AFFIX HIG | |||
| | 1BC90..1BC99; PVALID # DUPLOYAN AFFIX LOW ACUTE..DUPLOYAN AFFIX LOW | |||
| | 1BC9C ; DISALLOWED # DUPLOYAN SIGN O WITH CROSS | |||
| | 1BC9D..1BC9E; PVALID # DUPLOYAN THICK LETTER SELECTOR..DUPLOYAN DOUB | |||
| | 1BC9F..1BCA3; DISALLOWED # DUPLOYAN PUNCTUATION CHINOOK FULL STOP..SHORT | |||
| | 1E800..1E8C4; PVALID # MENDE KIKAKUI SYLLABLE M001 KI..MENDE KIKAKUI | |||
| | 1E8C7..1E8CF; DISALLOWED # MENDE KIKAKUI DIGIT ONE..MENDE KIKAKUI DIGIT | |||
| | 1E8D0..1E8D6; PVALID # MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE K | |||
| | 1EE00..1EE03; DISALLOWED # ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL | |||
| | 1EE05..1EE1F; DISALLOWED # ARABIC MATHEMATICAL WAW..ARABIC MATHEMATICAL | |||
| | 1EE21..1EE22; DISALLOWED # ARABIC MATHEMATICAL INITIAL BEH..ARABIC MATHE | |||
| | 1EE24 ; DISALLOWED # ARABIC MATHEMATICAL INITIAL HEH | |||
| | 1EE27 ; DISALLOWED # ARABIC MATHEMATICAL INITIAL HAH | |||
| | 1EE29..1EE32; DISALLOWED # ARABIC MATHEMATICAL INITIAL YEH..ARABIC MATHE | |||
| | 1EE34..1EE37; DISALLOWED # ARABIC MATHEMATICAL INITIAL SHEEN..ARABIC MAT | |||
| | 1EE39 ; DISALLOWED # ARABIC MATHEMATICAL INITIAL DAD | |||
| | 1EE3B ; DISALLOWED # ARABIC MATHEMATICAL INITIAL GHAIN | |||
| | 1EE42 ; DISALLOWED # ARABIC MATHEMATICAL TAILED JEEM | |||
| | 1EE47 ; DISALLOWED # ARABIC MATHEMATICAL TAILED HAH | |||
| | 1EE49 ; DISALLOWED # ARABIC MATHEMATICAL TAILED YEH | |||
| | 1EE4B ; DISALLOWED # ARABIC MATHEMATICAL TAILED LAM | |||
| | 1EE4D..1EE4F; DISALLOWED # ARABIC MATHEMATICAL TAILED NOON..ARABIC MATHE | |||
| | 1EE51..1EE52; DISALLOWED # ARABIC MATHEMATICAL TAILED SAD..ARABIC MATHEM | |||
| | 1EE54 ; DISALLOWED # ARABIC MATHEMATICAL TAILED SHEEN | |||
| | 1EE57 ; DISALLOWED # ARABIC MATHEMATICAL TAILED KHAH | |||
| | 1EE59 ; DISALLOWED # ARABIC MATHEMATICAL TAILED DAD | |||
| | 1EE5B ; DISALLOWED # ARABIC MATHEMATICAL TAILED GHAIN | |||
| | 1EE5D ; DISALLOWED # ARABIC MATHEMATICAL TAILED DOTLESS NOON | |||
| | 1EE5F ; DISALLOWED # ARABIC MATHEMATICAL TAILED DOTLESS QAF | |||
| | 1EE61..1EE62; DISALLOWED # ARABIC MATHEMATICAL STRETCHED BEH..ARABIC MAT | |||
| | 1EE64 ; DISALLOWED # ARABIC MATHEMATICAL STRETCHED HEH | |||
| | 1EE67..1EE6A; DISALLOWED # ARABIC MATHEMATICAL STRETCHED HAH..ARABIC MAT | |||
| | 1EE6C..1EE72; DISALLOWED # ARABIC MATHEMATICAL STRETCHED MEEM..ARABIC MA | |||
| | 1EE74..1EE77; DISALLOWED # ARABIC MATHEMATICAL STRETCHED SHEEN..ARABIC M | |||
| | 1EE79..1EE7C; DISALLOWED # ARABIC MATHEMATICAL STRETCHED DAD..ARABIC MAT | |||
| | 1EE7E ; DISALLOWED # ARABIC MATHEMATICAL STRETCHED DOTLESS FEH | |||
| | 1EE80..1EE89; DISALLOWED # ARABIC MATHEMATICAL LOOPED ALEF..ARABIC MATHE | |||
| | 1EE8B..1EE9B; DISALLOWED # ARABIC MATHEMATICAL LOOPED LAM..ARABIC MATHEM | |||
| | 1EEA1..1EEA3; DISALLOWED # ARABIC MATHEMATICAL DOUBLE-STRUCK BEH..ARABIC | |||
| | 1EEA5..1EEA9; DISALLOWED # ARABIC MATHEMATICAL DOUBLE-STRUCK WAW..ARABIC | |||
| | 1EEAB..1EEBB; DISALLOWED # ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC | |||
| | 1EEF0..1EEF1; DISALLOWED # ARABIC MATHEMATICAL OPERATOR MEEM WITH HAH WI | |||
| | 1F0BF ; DISALLOWED # PLAYING CARD RED JOKER | |||
| | 1F0E0..1F0F5; DISALLOWED # PLAYING CARD FOOL..PLAYING CARD TRUMP-21 | |||
| | 1F10B..1F10C; DISALLOWED # DINGBAT CIRCLED SANS-SERIF DIGIT ZERO..DINGBA | |||
| | 1F16A..1F16B; DISALLOWED # RAISED MC SIGN..RAISED MD SIGN | |||
| | 1F321..1F32C; DISALLOWED # THERMOMETER..WIND BLOWING FACE | |||
| | 1F336 ; DISALLOWED # HOT PEPPER | |||
| | 1F37D ; DISALLOWED # FORK AND KNIFE WITH PLATE | |||
| | 1F394..1F39F; DISALLOWED # HEART WITH TIP ON THE LEFT..ADMISSION TICKETS | |||
| | 1F3C5 ; DISALLOWED # SPORTS MEDAL | |||
| | 1F3CB..1F3CE; DISALLOWED # WEIGHT LIFTER..RACING CAR | |||
| | 1F3D4..1F3DF; DISALLOWED # SNOW CAPPED MOUNTAIN..STADIUM | |||
| | 1F3F1..1F3F7; DISALLOWED # WHITE PENNANT..LABEL | |||
| | 1F43F ; DISALLOWED # CHIPMUNK | |||
| | 1F441 ; DISALLOWED # EYE | |||
| | 1F4F8 ; DISALLOWED # CAMERA WITH FLASH | |||
| | 1F4FD..1F4FE; DISALLOWED # FILM PROJECTOR..PORTABLE STEREO | |||
| | 1F53E..1F54A; DISALLOWED # LOWER RIGHT SHADOWED WHITE CIRCLE..DOVE OF PE | |||
| | 1F568..1F579; DISALLOWED # RIGHT SPEAKER..JOYSTICK | |||
| | 1F57B..1F5A3; DISALLOWED # LEFT HAND TELEPHONE RECEIVER..BLACK DOWN POIN | |||
| | 1F5A5..1F5FA; DISALLOWED # DESKTOP COMPUTER..WORLD MAP | |||
| | 1F600 ; DISALLOWED # GRINNING FACE | |||
| | 1F611 ; DISALLOWED # EXPRESSIONLESS FACE | |||
| | 1F615 ; DISALLOWED # CONFUSED FACE | |||
| | 1F617 ; DISALLOWED # KISSING FACE | |||
| | 1F619 ; DISALLOWED # KISSING FACE WITH SMILING EYES | |||
| | 1F61B ; DISALLOWED # FACE WITH STUCK-OUT TONGUE | |||
| | 1F61F ; DISALLOWED # WORRIED FACE | |||
| | 1F626..1F627; DISALLOWED # FROWNING FACE WITH OPEN MOUTH..ANGUISHED FACE | |||
| | 1F62C ; DISALLOWED # GRIMACING FACE | |||
| | 1F62E..1F62F; DISALLOWED # FACE WITH OPEN MOUTH..HUSHED FACE | |||
| | 1F634 ; DISALLOWED # SLEEPING FACE | |||
| | 1F641..1F642; DISALLOWED # SLIGHTLY FROWNING FACE..SLIGHTLY SMILING FACE | |||
| | 1F650..1F67F; DISALLOWED # NORTH WEST POINTING LEAF..REVERSE CHECKER BOA | |||
| | 1F6C6..1F6CF; DISALLOWED # TRIANGLE WITH ROUNDED CORNERS..BED | |||
| | 1F6E0..1F6EC; DISALLOWED # HAMMER AND WRENCH..AIRPLANE ARRIVING | |||
| | 1F6F0..1F6F3; DISALLOWED # SATELLITE..PASSENGER SHIP | |||
| | 1F780..1F7D4; DISALLOWED # BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE. | |||
| | 1F800..1F80B; DISALLOWED # LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD | |||
| | 1F810..1F847; DISALLOWED # LEFTWARDS ARROW WITH SMALL EQUILATERAL ARROWH | |||
| | 1F850..1F859; DISALLOWED # LEFTWARDS SANS-SERIF ARROW..UP DOWN SANS-SERI | |||
| | 1F860..1F887; DISALLOWED # WIDE-HEADED LEFTWARDS LIGHT BARB ARROW..WIDE- | |||
| | 1F890..1F8AD; DISALLOWED # LEFTWARDS TRIANGLE ARROWHEAD..WHITE ARROW SHA | |||
| Appendix B. Changes from Unicode 7.0.0 to Unicode 8.0.0 | Appendix B. Changes from Unicode 7.0.0 to Unicode 8.0.0 | |||
| Changes from derived property value UNASSIGNED to either PVALID or | Changes from derived property value UNASSIGNED to either PVALID or | |||
| DISALLOWED. | DISALLOWED. | |||
| 08B3..08B4 ; PVALID # ARABIC LETTER AIN WITH THREE DOTS BELOW..ARA | 08B3..08B4 ; PVALID # ARABIC LETTER AIN WITH THREE DOTS BELOW..ARAB | |||
| 08E3 ; PVALID # ARABIC TURNED DAMMA BELOW | 08E3 ; PVALID # ARABIC TURNED DAMMA BELOW | |||
| 0AF9 ; PVALID # GUJARATI LETTER ZHA | 0AF9 ; PVALID # GUJARATI LETTER ZHA | |||
| 0C5A ; PVALID # TELUGU LETTER RRRA | 0C5A ; PVALID # TELUGU LETTER RRRA | |||
| 0D5F ; PVALID # MALAYALAM LETTER ARCHAIC II | 0D5F ; PVALID # MALAYALAM LETTER ARCHAIC II | |||
| 13F5 ; PVALID # CHEROKEE LETTER MV | 13F5 ; PVALID # CHEROKEE LETTER MV | |||
| 13F8..13FD ; DISALLOWED # CHEROKEE SMALL LETTER YE..CHEROKEE SMALL LET | 13F8..13FD ; DISALLOWED # CHEROKEE SMALL LETTER YE..CHEROKEE SMALL LETT | |||
| 20BE ; DISALLOWED # LARI SIGN | 20BE ; DISALLOWED # LARI SIGN | |||
| 218A..218B ; DISALLOWED # TURNED DIGIT TWO..TURNED DIGIT THREE | 218A..218B ; DISALLOWED # TURNED DIGIT TWO..TURNED DIGIT THREE | |||
| 2BEC..2BEF ; DISALLOWED # LEFTWARDS TWO-HEADED ARROW WITH TRIANGLE ARR | 2BEC..2BEF ; DISALLOWED # LEFTWARDS TWO-HEADED ARROW WITH TRIANGLE ARRO | |||
| 9FCD..9FD5 ; PVALID # <CJK Ideograph>..<CJK Ideograph> | 9FCD..9FD5 ; PVALID # <CJK Ideograph>..<CJK Ideograph> | |||
| A69E ; PVALID # COMBINING CYRILLIC LETTER EF | A69E ; PVALID # COMBINING CYRILLIC LETTER EF | |||
| A78F ; PVALID # LATIN LETTER SINOLOGICAL DOT | A78F ; PVALID # LATIN LETTER SINOLOGICAL DOT | |||
| A7B2..A7B7 ; DISALLOWED # LATIN CAPITAL LETTER J WITH CROSSED-TAIL..LA | A7B2..A7B4 ; DISALLOWED # LATIN CAPITAL LETTER J WITH CROSSED-TAIL..LAT | |||
| A8FC..A8FD ; DISALLOWED # DEVANAGARI SIGN SIDDHAM..DEVANAGARI JAIN OM | A7B5 ; PVALID # LATIN SMALL LETTER BETA | |||
| AB60..AB63 ; PVALID # LATIN SMALL LETTER SAKHA YAT..LATIN SMALL LE | A7B6 ; DISALLOWED # LATIN CAPITAL LETTER OMEGA | |||
| AB70..ABBF ; DISALLOWED # CHEROKEE SMALL LETTER A..CHEROKEE SMALL LETT | A7B7 ; PVALID # LATIN SMALL LETTER OMEGA | |||
| FE2E..FE2F ; PVALID # COMBINING CYRILLIC TITLO LEFT HALF..COMBININ | A8FC ; DISALLOWED # DEVANAGARI SIGN SIDDHAM | |||
| 108E0..108F2; PVALID # HATRAN LETTER ALEPH..HATRAN LETTER QOPH | A8FD ; PVALID # DEVANAGARI JAIN OM | |||
| 108F4..108F5; PVALID # HATRAN LETTER SHIN..HATRAN LETTER TAW | AB60..AB63 ; PVALID # LATIN SMALL LETTER SAKHA YAT..LATIN SMALL LET | |||
| 108FB..108FF; DISALLOWED # HATRAN NUMBER ONE..HATRAN NUMBER ONE HUNDRED | AB70..ABBF ; DISALLOWED # CHEROKEE SMALL LETTER A..CHEROKEE SMALL LETTE | |||
| 109BC..109BD; DISALLOWED # MEROITIC CURSIVE FRACTION ELEVEN TWELFTHS..M | FE2E..FE2F ; PVALID # COMBINING CYRILLIC TITLO LEFT HALF..COMBINING | |||
| 109C0..109CF; DISALLOWED # MEROITIC CURSIVE NUMBER ONE..MEROITIC CURSIV | 108E0..108F2; PVALID # HATRAN LETTER ALEPH..HATRAN LETTER QOPH | |||
| 109D2..109FF; DISALLOWED # MEROITIC CURSIVE NUMBER ONE HUNDRED..MEROITI | 108F4..108F5; PVALID # HATRAN LETTER SHIN..HATRAN LETTER TAW | |||
| 10C80..10CB2; DISALLOWED # OLD HUNGARIAN CAPITAL LETTER A..OLD HUNGARIA | 108FB..108FF; DISALLOWED # HATRAN NUMBER ONE..HATRAN NUMBER ONE HUNDRED | |||
| 10CC0..10CF2; PVALID # OLD HUNGARIAN SMALL LETTER A..OLD HUNGARIAN | 109BC..109BD; DISALLOWED # MEROITIC CURSIVE FRACTION ELEVEN TWELFTHS..ME | |||
| 10CFA..10CFF; DISALLOWED # OLD HUNGARIAN NUMBER ONE..OLD HUNGARIAN NUMB | 109C0..109CF; DISALLOWED # MEROITIC CURSIVE NUMBER ONE..MEROITIC CURSIVE | |||
| 111C9..111CC; DISALLOWED # SHARADA SANDHI MARK..SHARADA EXTRA SHORT VOW | 109D2..109FF; DISALLOWED # MEROITIC CURSIVE NUMBER ONE HUNDRED..MEROITIC | |||
| 111DB..111DF; DISALLOWED # SHARADA SIGN SIDDHAM..SHARADA SECTION MARK-2 | 10C80..10CB2; DISALLOWED # OLD HUNGARIAN CAPITAL LETTER A..OLD HUNGARIAN | |||
| 11280..11286; PVALID # MULTANI LETTER A..MULTANI LETTER GA | 10CC0..10CF2; PVALID # OLD HUNGARIAN SMALL LETTER A..OLD HUNGARIAN S | |||
| 11288 ; PVALID # MULTANI LETTER GHA | 10CFA..10CFF; DISALLOWED # OLD HUNGARIAN NUMBER ONE..OLD HUNGARIAN NUMBE | |||
| 1128A..1128D; PVALID # MULTANI LETTER CA..MULTANI LETTER JJA | 111C9 ; DISALLOWED # SHARADA SANDHI MARK | |||
| 1128F..1129D; PVALID # MULTANI LETTER NYA..MULTANI LETTER BA | 111CA..111CC; PVALID # SHARADA SIGN NUKTA..SHARADA EXTRA SHORT VOWEL | |||
| 1129F..112A9; PVALID # MULTANI LETTER BHA..MULTANI SECTION MARK | 111DB ; DISALLOWED # SHARADA SIGN SIDDHAM | |||
| 11300 ; PVALID # GRANTHA SIGN COMBINING ANUSVARA ABOVE | 111DC ; PVALID # SHARADA HEADSTROKE | |||
| 11350 ; PVALID # GRANTHA OM | 111DD..111DF; DISALLOWED # SHARADA CONTINUATION SIGN..SHARADA SECTION MA | |||
| 115CA..115DD; DISALLOWED # SIDDHAM SECTION MARK WITH TRIDENT AND U-SHAP | 11280..11286; PVALID # MULTANI LETTER A..MULTANI LETTER GA | |||
| 11700..11719; PVALID # AHOM LETTER KA..AHOM LETTER JHA | 11288 ; PVALID # MULTANI LETTER GHA | |||
| 1171D..1172B; PVALID # AHOM CONSONANT SIGN MEDIAL LA..AHOM SIGN KIL | 1128A..1128D; PVALID # MULTANI LETTER CA..MULTANI LETTER JJA | |||
| 11730..1173F; PVALID # AHOM DIGIT ZERO..AHOM SYMBOL VI | 1128F..1129D; PVALID # MULTANI LETTER NYA..MULTANI LETTER BA | |||
| 12399 ; PVALID # CUNEIFORM SIGN U U | 1129F..112A8; PVALID # MULTANI LETTER BHA..MULTANI LETTER RHA | |||
| 12480..12543; PVALID # CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM | 112A9 ; DISALLOWED # MULTANI SECTION MARK | |||
| 14400..14646; PVALID # ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGL | 11300 ; PVALID # GRANTHA SIGN COMBINING ANUSVARA ABOVE | |||
| 1D1DE..1D1E8; DISALLOWED # MUSICAL SYMBOL KIEVAN C CLEF..MUSICAL SYMBOL | 11350 ; PVALID # GRANTHA OM | |||
| 1D800..1DA8B; DISALLOWED # SIGNWRITING HAND-FIST INDEX..SIGNWRITING PAR | 115CA..115D7; DISALLOWED # SIDDHAM SECTION MARK WITH TRIDENT AND U-SHAPE | |||
| 1DA9B..1DA9F; PVALID # SIGNWRITING FILL MODIFIER-2..SIGNWRITING FIL | 115D8..115DD; PVALID # SIDDHAM LETTER THREE-CIRCLE ALTERNATE I..SIDD | |||
| 1DAA1..1DAAF; PVALID # SIGNWRITING ROTATION MODIFIER-2..SIGNWRITING | 11700..11719; PVALID # AHOM LETTER KA..AHOM LETTER JHA | |||
| 1F32D..1F32F; DISALLOWED # HOT DOG..BURRITO | 1171D..1172B; PVALID # AHOM CONSONANT SIGN MEDIAL LA..AHOM SIGN KILL | |||
| 1F37E..1F37F; DISALLOWED # BOTTLE WITH POPPING CORK..POPCORN | 11730..11739; PVALID # AHOM DIGIT ZERO..AHOM DIGIT NINE | |||
| 1F3CF..1F3D3; DISALLOWED # CRICKET BAT AND BALL..TABLE TENNIS PADDLE AN | 1173A..1173F; DISALLOWED # AHOM NUMBER TEN..AHOM SYMBOL VI | |||
| 1F3F8..1F3FF; DISALLOWED # BADMINTON RACQUET AND SHUTTLECOCK..EMOJI MOD | 12399 ; PVALID # CUNEIFORM SIGN U U | |||
| 1F4FF ; DISALLOWED # PRAYER BEADS | 12480..12543; PVALID # CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM S | |||
| 1F54B..1F54F; DISALLOWED # KAABA..BOWL OF HYGIEIA | 14400..14646; PVALID # ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLY | |||
| 1F643..1F644; DISALLOWED # UPSIDE-DOWN FACE..FACE WITH ROLLING EYES | 1D1DE..1D1E8; DISALLOWED # MUSICAL SYMBOL KIEVAN C CLEF..MUSICAL SYMBOL | |||
| 1F6D0 ; DISALLOWED # PLACE OF WORSHIP | 1D800..1D9FF; DISALLOWED # SIGNWRITING HAND-FIST INDEX..SIGNWRITING HEAD | |||
| 1F910..1F918; DISALLOWED # ZIPPER-MOUTH FACE..SIGN OF THE HORNS | 1DA00..1DA36; PVALID # SIGNWRITING HEAD RIM..SIGNWRITING AIR SUCKING | |||
| 1F980..1F984; DISALLOWED # CRAB..UNICORN FACE | 1DA37..1DA3A; DISALLOWED # SIGNWRITING AIR BLOW SMALL ROTATIONS..SIGNWRI | |||
| 1F9C0 ; DISALLOWED # CHEESE WEDGE | 1DA3B..1DA6C; PVALID # SIGNWRITING MOUTH CLOSED NEUTRAL..SIGNWRITING | |||
| | 1DA6D..1DA74; DISALLOWED # SIGNWRITING SHOULDER HIP SPINE..SIGNWRITING T | |||
| | 1DA75 ; PVALID # SIGNWRITING UPPER BODY TILTING FROM HIP JOINT | |||
| | 1DA76..1DA83; DISALLOWED # SIGNWRITING LIMB COMBINATION..SIGNWRITING LOC | |||
| | 1DA84 ; PVALID # SIGNWRITING LOCATION HEAD NECK | |||
| | 1DA85..1DA8B; DISALLOWED # SIGNWRITING LOCATION TORSO..SIGNWRITING PAREN | |||
| | 1DA9B..1DA9F; PVALID # SIGNWRITING FILL MODIFIER-2..SIGNWRITING FILL | |||
| | 1DAA1..1DAAF; PVALID # SIGNWRITING ROTATION MODIFIER-2..SIGNWRITING | |||
| | 1F32D..1F32F; DISALLOWED # HOT DOG..BURRITO | |||
| | 1F37E..1F37F; DISALLOWED # BOTTLE WITH POPPING CORK..POPCORN | |||
| | 1F3CF..1F3D3; DISALLOWED # CRICKET BAT AND BALL..TABLE TENNIS PADDLE AND | |||
| | 1F3F8..1F3FF; DISALLOWED # BADMINTON RACQUET AND SHUTTLECOCK..EMOJI MODI | |||
| | 1F4FF ; DISALLOWED # PRAYER BEADS | |||
| | 1F54B..1F54F; DISALLOWED # KAABA..BOWL OF HYGIEIA | |||
| | 1F643..1F644; DISALLOWED # UPSIDE-DOWN FACE..FACE WITH ROLLING EYES | |||
| | 1F6D0 ; DISALLOWED # PLACE OF WORSHIP | |||
| | 1F910..1F918; DISALLOWED # ZIPPER-MOUTH FACE..SIGN OF THE HORNS | |||
| | 1F980..1F984; DISALLOWED # CRAB..UNICORN FACE | |||
| | 1F9C0 ; DISALLOWED # CHEESE WEDGE | |||
| | 2B820..2CEA1; PVALID # <CJK Ideograph Extension E>..<CJK Ideograph E | |||
| Appendix C. Changes from Unicode 8.0.0 to Unicode 9.0.0 | Appendix C. Changes from Unicode 8.0.0 to Unicode 9.0.0 | |||
| Changes from derived property value UNASSIGNED to either PVALID or | Changes from derived property value UNASSIGNED to either PVALID or | |||
| DISALLOWED. | DISALLOWED. | |||
| 08B6..08BD ; PVALID # ARABIC LETTER BEH WITH SMALL MEEM ABOVE..ARA | 08B6..08BD ; PVALID # ARABIC LETTER BEH WITH SMALL MEEM ABOVE..ARAB | |||
| 08D4..08E2 ; PVALID # ARABIC SMALL HIGH WORD AR-RUB..ARABIC DISPUT | 08D4..08E1 ; PVALID # ARABIC SMALL HIGH WORD AR-RUB..ARABIC SMALL H | |||
| 0C80 ; PVALID # KANNADA SIGN SPACING CANDRABINDU | 08E2 ; DISALLOWED # ARABIC DISPUTED END OF AYAH | |||
| 0D4F ; DISALLOWED # MALAYALAM SIGN PARA | 0C80 ; PVALID # KANNADA SIGN SPACING CANDRABINDU | |||
| 0D54..0D56 ; PVALID # MALAYALAM LETTER CHILLU M..MALAYALAM LETTER | 0D4F ; DISALLOWED # MALAYALAM SIGN PARA | |||
| 0D58..0D5E ; DISALLOWED # MALAYALAM FRACTION ONE ONE-HUNDRED-AND-SIXTI | 0D54..0D56 ; PVALID # MALAYALAM LETTER CHILLU M..MALAYALAM LETTER C | |||
| 0D76..0D78 ; DISALLOWED # MALAYALAM FRACTION ONE SIXTEENTH..MALAYALAM | 0D58..0D5E ; DISALLOWED # MALAYALAM FRACTION ONE ONE-HUNDRED-AND-SIXTIE | |||
| 1C80..1C88 ; DISALLOWED # CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC S | 0D76..0D78 ; DISALLOWED # MALAYALAM FRACTION ONE SIXTEENTH..MALAYALAM F | |||
| 1DFB ; PVALID # COMBINING DELETION MARK | 1C80..1C88 ; DISALLOWED # CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC SM | |||
| 23FB..23FE ; DISALLOWED # POWER SYMBOL..POWER SLEEP SYMBOL | 1DFB ; PVALID # COMBINING DELETION MARK | |||
| 2E43..2E44 ; DISALLOWED # DASH WITH LEFT UPTURN..DOUBLE SUSPENSION MAR | 23FB..23FE ; DISALLOWED # POWER SYMBOL..POWER SLEEP SYMBOL | |||
| A7AE ; DISALLOWED # LATIN CAPITAL LETTER SMALL CAPITAL I | 2E43..2E44 ; DISALLOWED # DASH WITH LEFT UPTURN..DOUBLE SUSPENSION MARK | |||
| A8C5 ; PVALID # SAURASHTRA SIGN CANDRABINDU | A7AE ; DISALLOWED # LATIN CAPITAL LETTER SMALL CAPITAL I | |||
| 1018D..1018E; DISALLOWED # GREEK INDICTION SIGN..NOMISMA SIGN | A8C5 ; PVALID # SAURASHTRA SIGN CANDRABINDU | |||
| 104B0..104D3; DISALLOWED # OSAGE CAPITAL LETTER A..OSAGE CAPITAL LETTER | 1018D..1018E; DISALLOWED # GREEK INDICTION SIGN..NOMISMA SIGN | |||
| 104D8..104FB; PVALID # OSAGE SMALL LETTER A..OSAGE SMALL LETTER ZHA | 104B0..104D3; DISALLOWED # OSAGE CAPITAL LETTER A..OSAGE CAPITAL LETTER | |||
| 1123E ; PVALID # KHOJKI SIGN SUKUN | 104D8..104FB; PVALID # OSAGE SMALL LETTER A..OSAGE SMALL LETTER ZHA | |||
| 11400..11459; PVALID # NEWA LETTER A..NEWA DIGIT NINE | 1123E ; PVALID # KHOJKI SIGN SUKUN | |||
| 1145B ; DISALLOWED # NEWA PLACEHOLDER MARK | 11400..1144A; PVALID # NEWA LETTER A..NEWA SIDDHI | |||
| 1145D ; DISALLOWED # NEWA INSERTION SIGN | 1144B..1144F; DISALLOWED # NEWA DANDA..NEWA ABBREVIATION SIGN | |||
| 11660..1166C; DISALLOWED # MONGOLIAN BIRGA WITH ORNAMENT..MONGOLIAN TUR | 11450..11459; PVALID # NEWA DIGIT ZERO..NEWA DIGIT NINE | |||
| 11C00..11C08; PVALID # BHAIKSUKI LETTER A..BHAIKSUKI LETTER VOCALIC | 1145B ; DISALLOWED # NEWA PLACEHOLDER MARK | |||
| 11C0A..11C36; PVALID # BHAIKSUKI LETTER E..BHAIKSUKI VOWEL SIGN VOC | 1145D ; DISALLOWED # NEWA INSERTION SIGN | |||
| 11C38..11C45; PVALID # BHAIKSUKI VOWEL SIGN E..BHAIKSUKI GAP FILLER | 11660..1166C; DISALLOWED # MONGOLIAN BIRGA WITH ORNAMENT..MONGOLIAN TURN | |||
| 11C50..11C6C; PVALID # BHAIKSUKI DIGIT ZERO..BHAIKSUKI HUNDREDS UNI | 11C00..11C08; PVALID # BHAIKSUKI LETTER A..BHAIKSUKI LETTER VOCALIC | |||
| 11C70..11C8F; DISALLOWED # MARCHEN HEAD MARK..MARCHEN LETTER A | 11C0A..11C36; PVALID # BHAIKSUKI LETTER E..BHAIKSUKI VOWEL SIGN VOCA | |||
| 11C92..11CA7; PVALID # MARCHEN SUBJOINED LETTER KA..MARCHEN SUBJOIN | 11C38..11C40; PVALID # BHAIKSUKI VOWEL SIGN E..BHAIKSUKI SIGN AVAGRA | |||
| 11CA9..11CB6; PVALID # MARCHEN SUBJOINED LETTER YA..MARCHEN SIGN CA | 11C41..11C45; DISALLOWED # BHAIKSUKI DANDA..BHAIKSUKI GAP FILLER-2 | |||
| 16FE0 ; PVALID # TANGUT ITERATION MARK | 11C50..11C59; PVALID # BHAIKSUKI DIGIT ZERO..BHAIKSUKI DIGIT NINE | |||
| 17000..187EC; PVALID # <Tangut Ideograph>..<Tangut Ideograph> | 11C5A..11C6C; DISALLOWED # BHAIKSUKI NUMBER ONE..BHAIKSUKI HUNDREDS UNIT | |||
| 18800..18AF2; PVALID # TANGUT COMPONENT-001..TANGUT COMPONENT-755 | 11C70..11C71; DISALLOWED # MARCHEN HEAD MARK..MARCHEN MARK SHAD | |||
| 1E000..1E006; PVALID # COMBINING GLAGOLITIC LETTER AZU..COMBINING G | 11C72..11C8F; PVALID # MARCHEN LETTER KA..MARCHEN LETTER A | |||
| 1E008..1E018; PVALID # COMBINING GLAGOLITIC LETTER ZEMLJA..COMBININ | 11C92..11CA7; PVALID # MARCHEN SUBJOINED LETTER KA..MARCHEN SUBJOINE | |||
| 1E01B..1E021; PVALID # COMBINING GLAGOLITIC LETTER SHTA..COMBINING | 11CA9..11CB6; PVALID # MARCHEN SUBJOINED LETTER YA..MARCHEN SIGN CAN | |||
| 1E023..1E024; PVALID # COMBINING GLAGOLITIC LETTER YU..COMBINING GL | 16FE0 ; PVALID # TANGUT ITERATION MARK | |||
| 1E026..1E02A; PVALID # COMBINING GLAGOLITIC LETTER YO..COMBINING GL | 17000..187EC; PVALID # <Tangut Ideograph>..<Tangut Ideograph> | |||
| 1E900..1E94A; DISALLOWED # ADLAM CAPITAL LETTER ALIF..ADLAM NUKTA | 18800..18AF2; PVALID # TANGUT COMPONENT-001..TANGUT COMPONENT-755 | |||
| 1E950..1E959; PVALID # ADLAM DIGIT ZERO..ADLAM DIGIT NINE | 1E000..1E006; PVALID # COMBINING GLAGOLITIC LETTER AZU..COMBINING GL | |||
| 1E95E..1E95F; DISALLOWED # ADLAM INITIAL EXCLAMATION MARK..ADLAM INITIA | 1E008..1E018; PVALID # COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING | |||
| 1F19B..1F1AC; DISALLOWED # SQUARED THREE D..SQUARED VOD | 1E01B..1E021; PVALID # COMBINING GLAGOLITIC LETTER SHTA..COMBINING G | |||
| 1F23B ; DISALLOWED # SQUARED CJK UNIFIED IDEOGRAPH-914D | 1E023..1E024; PVALID # COMBINING GLAGOLITIC LETTER YU..COMBINING GLA | |||
| 1F57A ; DISALLOWED # MAN DANCING | 1E026..1E02A; PVALID # COMBINING GLAGOLITIC LETTER YO..COMBINING GLA | |||
| 1F5A4 ; DISALLOWED # BLACK HEART | 1E900..1E921; DISALLOWED # ADLAM CAPITAL LETTER ALIF..ADLAM CAPITAL LETT | |||
| 1F6D1..1F6D2; DISALLOWED # OCTAGONAL SIGN..SHOPPING TROLLEY | 1E922..1E94A; PVALID # ADLAM SMALL LETTER ALIF..ADLAM NUKTA | |||
| 1F6F4..1F6F6; DISALLOWED # SCOOTER..CANOE | 1E950..1E959; PVALID # ADLAM DIGIT ZERO..ADLAM DIGIT NINE | |||
| 1F919..1F91E; DISALLOWED # CALL ME HAND..HAND WITH INDEX AND MIDDLE FIN | 1E95E..1E95F; DISALLOWED # ADLAM INITIAL EXCLAMATION MARK..ADLAM INITIAL | |||
| 1F920..1F927; DISALLOWED # FACE WITH COWBOY HAT..SNEEZING FACE | 1F19B..1F1AC; DISALLOWED # SQUARED THREE D..SQUARED VOD | |||
| 1F930 ; DISALLOWED # PREGNANT WOMAN | 1F23B ; DISALLOWED # SQUARED CJK UNIFIED IDEOGRAPH-914D | |||
| 1F933..1F93E; DISALLOWED # SELFIE..HANDBALL | 1F57A ; DISALLOWED # MAN DANCING | |||
| 1F940..1F94B; DISALLOWED # WILTED FLOWER..MARTIAL ARTS UNIFORM | 1F5A4 ; DISALLOWED # BLACK HEART | |||
| 1F950..1F95E; DISALLOWED # CROISSANT..PANCAKES | 1F6D1..1F6D2; DISALLOWED # OCTAGONAL SIGN..SHOPPING TROLLEY | |||
| 1F6F4..1F6F6; DISALLOWED # SCOOTER..CANOE | ||||
| 1F919..1F91E; DISALLOWED # CALL ME HAND..HAND WITH INDEX AND MIDDLE FING | ||||
| 1F920..1F927; DISALLOWED # FACE WITH COWBOY HAT..SNEEZING FACE | ||||
| 1F930 ; DISALLOWED # PREGNANT WOMAN | ||||
| 1F933..1F93E; DISALLOWED # SELFIE..HANDBALL | ||||
| 1F940..1F94B; DISALLOWED # WILTED FLOWER..MARTIAL ARTS UNIFORM | ||||
| 1F950..1F95E; DISALLOWED # CROISSANT..PANCAKES | ||||
| 1F985..1F991; DISALLOWED # EAGLE..SQUID | ||||
| Appendix D. Changes from Unicode 9.0.0 to Unicode 10.0.0 | Appendix D. Changes from Unicode 9.0.0 to Unicode 10.0.0 | |||
| Changes from derived property value UNASSIGNED to either PVALID or | Changes from derived property value UNASSIGNED to either PVALID or | |||
| DISALLOWED. | DISALLOWED. | |||
| 0860..086A ; PVALID # SYRIAC LETTER MALAYALAM NGA..SYRIAC LETTER M | 0860..086A ; PVALID # SYRIAC LETTER MALAYALAM NGA..SYRIAC LETTER MA | |||
| 09FC..09FD ; PVALID # BENGALI LETTER VEDIC ANUSVARA..BENGALI ABBRE | 09FC ; PVALID # BENGALI LETTER VEDIC ANUSVARA | |||
| 0AFA..0AFF ; PVALID # GUJARATI SIGN SUKUN..GUJARATI SIGN TWO-CIRCL | 09FD ; DISALLOWED # BENGALI ABBREVIATION SIGN | |||
| 0D00 ; PVALID # MALAYALAM SIGN COMBINING ANUSVARA ABOVE | 0AFA..0AFF ; PVALID # GUJARATI SIGN SUKUN..GUJARATI SIGN TWO-CIRCLE | |||
| 0D3B..0D3C ; PVALID # MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALA | 0D00 ; PVALID # MALAYALAM SIGN COMBINING ANUSVARA ABOVE | |||
| 1CF7 ; PVALID # VEDIC SIGN ATIKRAMA | 0D3B..0D3C ; PVALID # MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALAM | |||
| 1DF6..1DF9 ; PVALID # COMBINING KAVYKA ABOVE RIGHT..COMBINING WIDE | 1CF7 ; PVALID # VEDIC SIGN ATIKRAMA | |||
| 20BF ; DISALLOWED # BITCOIN SIGN | 1DF6..1DF9 ; PVALID # COMBINING KAVYKA ABOVE RIGHT..COMBINING WIDE | |||
| 23FF ; DISALLOWED # OBSERVER EYE SYMBOL | 20BF ; DISALLOWED # BITCOIN SIGN | |||
| 2BD2 ; DISALLOWED # GROUP MARK | 23FF ; DISALLOWED # OBSERVER EYE SYMBOL | |||
| 2E45..2E49 ; DISALLOWED # INVERTED LOW KAVYKA..DOUBLE STACKED COMMA | 2BD2 ; DISALLOWED # GROUP MARK | |||
| 312E ; PVALID # BOPOMOFO LETTER O WITH DOT ABOVE | 2E45..2E49 ; DISALLOWED # INVERTED LOW KAVYKA..DOUBLE STACKED COMMA | |||
| 9FD6..9FEA ; PVALID # <CJK Ideograph>..<CJK Ideograph> | 312E ; PVALID # BOPOMOFO LETTER O WITH DOT ABOVE | |||
| 1032D..1032F; PVALID # OLD ITALIC LETTER YE..OLD ITALIC LETTER SOUT | 9FD6..9FEA ; PVALID # <CJK Ideograph>..<CJK Ideograph> | |||
| 11A00..11A47; PVALID # ZANABAZAR SQUARE LETTER A..ZANABAZAR SQUARE | 1032D..1032F; PVALID # OLD ITALIC LETTER YE..OLD ITALIC LETTER SOUTH | |||
| 11A50..11A83; PVALID # SOYOMBO LETTER A..SOYOMBO LETTER KSSA | 11A00..11A3E; PVALID # ZANABAZAR SQUARE LETTER A..ZANABAZAR SQUARE C | |||
| 11A86..11A9C; PVALID # SOYOMBO CLUSTER-INITIAL LETTER RA..SOYOMBO M | 11A3F..11A46; DISALLOWED # ZANABAZAR SQUARE INITIAL HEAD MARK..ZANABAZAR | |||
| 11A9E..11AA2; DISALLOWED # SOYOMBO HEAD MARK WITH MOON AND SUN AND TRIP | 11A47 ; PVALID # ZANABAZAR SQUARE SUBJOINER | |||
| 11D00..11D06; PVALID # MASARAM GONDI LETTER A..MASARAM GONDI LETTER | 11A50..11A83; PVALID # SOYOMBO LETTER A..SOYOMBO LETTER KSSA | |||
| 11D08..11D09; PVALID # MASARAM GONDI LETTER AI..MASARAM GONDI LETTE | 11A86..11A99; PVALID # SOYOMBO CLUSTER-INITIAL LETTER RA..SOYOMBO SU | |||
| 11D0B..11D36; PVALID # MASARAM GONDI LETTER AU..MASARAM GONDI VOWEL | 11A9A..11A9C; DISALLOWED # SOYOMBO MARK TSHEG..SOYOMBO MARK DOUBLE SHAD | |||
| 11D3A ; PVALID # MASARAM GONDI VOWEL SIGN E | 11A9E..11AA2; DISALLOWED # SOYOMBO HEAD MARK WITH MOON AND SUN AND TRIPL | |||
| 11D3C..11D3D; PVALID # MASARAM GONDI VOWEL SIGN AI..MASARAM GONDI V | 11D00..11D06; PVALID # MASARAM GONDI LETTER A..MASARAM GONDI LETTER | |||
| 11D3F..11D47; PVALID # MASARAM GONDI VOWEL SIGN AU..MASARAM GONDI R | 11D08..11D09; PVALID # MASARAM GONDI LETTER AI..MASARAM GONDI LETTER | |||
| 11D50..11D59; PVALID # MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGI | 11D0B..11D36; PVALID # MASARAM GONDI LETTER AU..MASARAM GONDI VOWEL | |||
| 16FE1 ; PVALID # NUSHU ITERATION MARK | 11D3A ; PVALID # MASARAM GONDI VOWEL SIGN E | |||
| 1B002..1B11E; PVALID # HENTAIGANA LETTER A-1..HENTAIGANA LETTER N-M | 11D3C..11D3D; PVALID # MASARAM GONDI VOWEL SIGN AI..MASARAM GONDI VO | |||
| 1B170..1B2FB; PVALID # NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB | 11D3F..11D47; PVALID # MASARAM GONDI VOWEL SIGN AU..MASARAM GONDI RA | |||
| 1F260..1F265; DISALLOWED # ROUNDED SYMBOL FOR FU..ROUNDED SYMBOL FOR CA | 11D50..11D59; PVALID # MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGIT | |||
| 1F6D3..1F6D4; DISALLOWED # STUPA..PAGODA | 16FE1 ; PVALID # NUSHU ITERATION MARK | |||
| 1F6F7..1F6F8; DISALLOWED # SLED..FLYING SAUCER | 1B002..1B11E; PVALID # HENTAIGANA LETTER A-1..HENTAIGANA LETTER N-MU | |||
| 1F900..1F90B; DISALLOWED # CIRCLED CROSS FORMEE WITH FOUR DOTS..DOWNWAR | 1B170..1B2FB; PVALID # NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB | |||
| 1F91F ; DISALLOWED # I LOVE YOU HAND SIGN | 1F260..1F265; DISALLOWED # ROUNDED SYMBOL FOR FU..ROUNDED SYMBOL FOR CAI | |||
| 1F928..1F92F; DISALLOWED # FACE WITH ONE EYEBROW RAISED..SHOCKED FACE W | 1F6D3..1F6D4; DISALLOWED # STUPA..PAGODA | |||
| 1F931..1F932; DISALLOWED # BREAST-FEEDING..PALMS UP TOGETHER | 1F6F7..1F6F8; DISALLOWED # SLED..FLYING SAUCER | |||
| 1F94C ; DISALLOWED # CURLING STONE | 1F900..1F90B; DISALLOWED # CIRCLED CROSS FORMEE WITH FOUR DOTS..DOWNWARD | |||
| 1F95F..1F96B; DISALLOWED # DUMPLING..CANNED FOOD | 1F91F ; DISALLOWED # I LOVE YOU HAND SIGN | |||
| 1F992..1F997; DISALLOWED # GIRAFFE FACE..CRICKET | 1F928..1F92F; DISALLOWED # FACE WITH ONE EYEBROW RAISED..SHOCKED FACE WI | |||
| 1F9D0..1F9E6; DISALLOWED # FACE WITH MONOCLE..SOCKS | 1F931..1F932; DISALLOWED # BREAST-FEEDING..PALMS UP TOGETHER | |||
| 1F94C ; DISALLOWED # CURLING STONE | ||||
| 1F95F..1F96B; DISALLOWED # DUMPLING..CANNED FOOD | ||||
| 1F992..1F997; DISALLOWED # GIRAFFE FACE..CRICKET | ||||
| 1F9D0..1F9E6; DISALLOWED # FACE WITH MONOCLE..SOCKS | ||||
| 2CEB0..2EBE0; PVALID # <CJK Ideograph Extension F>..<CJK Ideograph E | ||||
| Appendix E. Changes from Unicode 10.0.0 to Unicode 11.0.0 | Appendix E. Changes from Unicode 10.0.0 to Unicode 11.0.0 | |||
| Changes from derived property value DISALLOWED to PVALID. | Changes from derived property value DISALLOWED to PVALID. | |||
| 111C9 ; PVALID # SHARADA SANDHI MARK | 111C9 ; PVALID # SHARADA SANDHI MARK | |||
| Changes from derived property value UNASSIGNED to either PVALID or | Changes from derived property value UNASSIGNED to either PVALID or | |||
| DISALLOWED. | DISALLOWED. | |||
| 0560 ; PVALID # ARMENIAN SMALL LETTER TURNED AYB | 0560 ; PVALID # ARMENIAN SMALL LETTER TURNED AYB | |||
| 0588 ; PVALID # ARMENIAN SMALL LETTER YI WITH STROKE | 0588 ; PVALID # ARMENIAN SMALL LETTER YI WITH STROKE | |||
| 05EF ; PVALID # HEBREW YOD TRIANGLE | 05EF ; PVALID # HEBREW YOD TRIANGLE | |||
| 07FD..07FF ; PVALID # NKO DANTAYALAN..NKO TAMAN SIGN | 07FD ; PVALID # NKO DANTAYALAN | |||
| 08D3 ; PVALID # ARABIC SMALL LOW WAW | 07FE..07FF ; DISALLOWED # NKO DOROME SIGN..NKO TAMAN SIGN | |||
| 09FE ; PVALID # BENGALI SANDHI MARK | 08D3 ; PVALID # ARABIC SMALL LOW WAW | |||
| 0A76 ; DISALLOWED # GURMUKHI ABBREVIATION SIGN | 09FE ; PVALID # BENGALI SANDHI MARK | |||
| 0C04 ; PVALID # TELUGU SIGN COMBINING ANUSVARA ABOVE | 0A76 ; DISALLOWED # GURMUKHI ABBREVIATION SIGN | |||
| 0C84 ; DISALLOWED # KANNADA SIGN SIDDHAM | 0C04 ; PVALID # TELUGU SIGN COMBINING ANUSVARA ABOVE | |||
| 1878 ; PVALID # MONGOLIAN LETTER CHA WITH TWO DOTS | 0C84 ; DISALLOWED # KANNADA SIGN SIDDHAM | |||
| 1C90..1CBA ; DISALLOWED # GEORGIAN MTAVRULI CAPITAL LETTER AN..GEORGIA | 1878 ; PVALID # MONGOLIAN LETTER CHA WITH TWO DOTS | |||
| 1CBD..1CBF ; DISALLOWED # GEORGIAN MTAVRULI CAPITAL LETTER AEN..GEORGI | 1C90..1CBA ; DISALLOWED # GEORGIAN MTAVRULI CAPITAL LETTER AN..GEORGIAN | |||
| 2BBA..2BBC ; DISALLOWED # OVERLAPPING WHITE SQUARES..OVERLAPPING BLACK | 1CBD..1CBF ; DISALLOWED # GEORGIAN MTAVRULI CAPITAL LETTER AEN..GEORGIA | |||
| 2BD3..2BEB ; DISALLOWED # PLUTO FORM TWO..STAR WITH RIGHT HALF BLACK | 2BBA..2BBC ; DISALLOWED # OVERLAPPING WHITE SQUARES..OVERLAPPING BLACK | |||
| 2BF0..2BFE ; DISALLOWED # ERIS FORM ONE..REVERSED RIGHT ANGLE | 2BD3..2BEB ; DISALLOWED # PLUTO FORM TWO..STAR WITH RIGHT HALF BLACK | |||
| 2E4A..2E4E ; DISALLOWED # DOTTED SOLIDUS..PUNCTUS ELEVATUS MARK | 2BF0..2BFE ; DISALLOWED # ERIS FORM ONE..REVERSED RIGHT ANGLE | |||
| 312F ; PVALID # BOPOMOFO LETTER NN | 2E4A..2E4E ; DISALLOWED # DOTTED SOLIDUS..PUNCTUS ELEVATUS MARK | |||
| 9FEB..9FEF ; PVALID # <CJK Ideograph>..<CJK Ideograph> | 312F ; PVALID # BOPOMOFO LETTER NN | |||
| A7AF ; PVALID # LATIN LETTER SMALL CAPITAL Q | 9FEB..9FEF ; PVALID # <CJK Ideograph>..<CJK Ideograph> | |||
| A7B8..A7B9 ; DISALLOWED # LATIN CAPITAL LETTER U WITH STROKE..LATIN SM | A7AF ; PVALID # LATIN LETTER SMALL CAPITAL Q | |||
| A8FE..A8FF ; PVALID # DEVANAGARI LETTER AY..DEVANAGARI VOWEL SIGN | A7B8 ; DISALLOWED # LATIN CAPITAL LETTER U WITH STROKE | |||
| 10A34..10A35; PVALID # KHAROSHTHI LETTER TTTA..KHAROSHTHI LETTER VH | A7B9 ; PVALID # LATIN SMALL LETTER U WITH STROKE | |||
| 10A48 ; DISALLOWED # KHAROSHTHI FRACTION ONE HALF | A8FE..A8FF ; PVALID # DEVANAGARI LETTER AY..DEVANAGARI VOWEL SIGN A | |||
| 10D00..10D27; PVALID # HANIFI ROHINGYA LETTER A..HANIFI ROHINGYA SI | 10A34..10A35; PVALID # KHAROSHTHI LETTER TTTA..KHAROSHTHI LETTER VHA | |||
| 10D30..10D39; PVALID # HANIFI ROHINGYA DIGIT ZERO..HANIFI ROHINGYA | 10A48 ; DISALLOWED # KHAROSHTHI FRACTION ONE HALF | |||
| 10F00..10F27; PVALID # OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LIGATU | 10D00..10D27; PVALID # HANIFI ROHINGYA LETTER A..HANIFI ROHINGYA SIG | |||
| 10F30..10F59; PVALID # SOGDIAN LETTER ALEPH..SOGDIAN PUNCTUATION HA | 10D30..10D39; PVALID # HANIFI ROHINGYA DIGIT ZERO..HANIFI ROHINGYA D | |||
| 110CD ; DISALLOWED # KAITHI NUMBER SIGN ABOVE | 10F00..10F1C; PVALID # OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER | |||
| 11144..11146; PVALID # CHAKMA LETTER LHAA..CHAKMA VOWEL SIGN EI | 10F1D..10F26; DISALLOWED # OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION | |||
| 1133B ; PVALID # COMBINING BINDU BELOW | 10F27 ; PVALID # OLD SOGDIAN LIGATURE AYIN-DALETH | |||
| 1145E ; PVALID # NEWA SANDHI MARK | 10F30..10F50; PVALID # SOGDIAN LETTER ALEPH..SOGDIAN COMBINING STROK | |||
| 1171A ; PVALID # AHOM LETTER ALTERNATE BA | 10F51..10F59; DISALLOWED # SOGDIAN NUMBER ONE..SOGDIAN PUNCTUATION HALF | |||
| 11800..1183B; PVALID # DOGRA LETTER A..DOGRA ABBREVIATION SIGN | 110CD ; DISALLOWED # KAITHI NUMBER SIGN ABOVE | |||
| 11A9D ; PVALID # SOYOMBO MARK PLUTA | 11144..11146; PVALID # CHAKMA LETTER LHAA..CHAKMA VOWEL SIGN EI | |||
| 11D60..11D65; PVALID # GUNJALA GONDI LETTER A..GUNJALA GONDI LETTER | 1133B ; PVALID # COMBINING BINDU BELOW | |||
| 11D67..11D68; PVALID # GUNJALA GONDI LETTER EE..GUNJALA GONDI LETTE | 1145E ; PVALID # NEWA SANDHI MARK | |||
| 11D6A..11D8E; PVALID # GUNJALA GONDI LETTER OO..GUNJALA GONDI VOWEL | 1171A ; PVALID # AHOM LETTER ALTERNATE BA | |||
| 11D90..11D91; PVALID # GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI V | 11800..1183A; PVALID # DOGRA LETTER A..DOGRA SIGN NUKTA | |||
| 11D93..11D98; PVALID # GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI O | 1183B ; DISALLOWED # DOGRA ABBREVIATION SIGN | |||
| 11DA0..11DA9; PVALID # GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGI | 11A9D ; PVALID # SOYOMBO MARK PLUTA | |||
| 11EE0..11EF8; PVALID # MAKASAR LETTER KA..MAKASAR END OF SECTION | 11D60..11D65; PVALID # GUNJALA GONDI LETTER A..GUNJALA GONDI LETTER | |||
| 16E40..16E9A; DISALLOWED # MEDEFAIDRIN CAPITAL LETTER M..MEDEFAIDRIN EX | 11D67..11D68; PVALID # GUNJALA GONDI LETTER EE..GUNJALA GONDI LETTER | |||
| 187ED..187F1; PVALID # <Tangut Ideograph>..<Tangut Ideograph> | 11D6A..11D8E; PVALID # GUNJALA GONDI LETTER OO..GUNJALA GONDI VOWEL | |||
| 1D2E0..1D2F3; DISALLOWED # MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN | 11D90..11D91; PVALID # GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI VO | |||
| 1D372..1D378; DISALLOWED # IDEOGRAPHIC TALLY MARK ONE..TALLY MARK FIVE | 11D93..11D98; PVALID # GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI OM | |||
| 1EC71..1ECB4; DISALLOWED # INDIC SIYAQ NUMBER ONE..INDIC SIYAQ ALTERNAT | 11DA0..11DA9; PVALID # GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT | |||
| 1F12F ; DISALLOWED # COPYLEFT SYMBOL | 11EE0..11EF6; PVALID # MAKASAR LETTER KA..MAKASAR VOWEL SIGN O | |||
| 1F6F9 ; DISALLOWED # SKATEBOARD | 11EF7..11EF8; DISALLOWED # MAKASAR PASSIMBANG..MAKASAR END OF SECTION | |||
| 1F7D5..1F7D8; DISALLOWED # CIRCLED TRIANGLE..NEGATIVE CIRCLED SQUARE | 16E40..16E5F; DISALLOWED # MEDEFAIDRIN CAPITAL LETTER M..MEDEFAIDRIN CAP | |||
| 1F94D..1F94F; DISALLOWED # LACROSSE STICK AND BALL..FLYING DISC | 16E60..16E7F; PVALID # MEDEFAIDRIN SMALL LETTER M..MEDEFAIDRIN SMALL | |||
| 1F96C..1F970; DISALLOWED # LEAFY GREEN..SMILING FACE WITH SMILING EYES | 16E80..16E9A; DISALLOWED # MEDEFAIDRIN DIGIT ZERO..MEDEFAIDRIN EXCLAMATI | |||
| 1F973..1F976; DISALLOWED # FACE WITH PARTY HORN AND PARTY HAT..FREEZING | 187ED..187F1; PVALID # <Tangut Ideograph>..<Tangut Ideograph> | |||
| 1F97A ; DISALLOWED # FACE WITH PLEADING EYES | 1D2E0..1D2F3; DISALLOWED # MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN | |||
| 1F97C..1F97F; DISALLOWED # LAB COAT..FLAT SHOE | 1D372..1D378; DISALLOWED # IDEOGRAPHIC TALLY MARK ONE..TALLY MARK FIVE | |||
| 1F998..1F9A2; DISALLOWED # KANGAROO..SWAN | 1EC71..1ECB4; DISALLOWED # INDIC SIYAQ NUMBER ONE..INDIC SIYAQ ALTERNATE | |||
| 1F9B0..1F9B9; DISALLOWED # EMOJI COMPONENT RED HAIR..SUPERVILLAIN | 1F12F ; DISALLOWED # COPYLEFT SYMBOL | |||
| 1F9C1..1F9C2; DISALLOWED # CUPCAKE..SALT SHAKER | 1F6F9 ; DISALLOWED # SKATEBOARD | |||
| 1F9E7..1F9FF; DISALLOWED # RED GIFT ENVELOPE..NAZAR AMULET | 1F7D5..1F7D8; DISALLOWED # CIRCLED TRIANGLE..NEGATIVE CIRCLED SQUARE | |||
| 1F94D..1F94F; DISALLOWED # LACROSSE STICK AND BALL..FLYING DISC | ||||
| 1F96C..1F970; DISALLOWED # LEAFY GREEN..SMILING FACE WITH SMILING EYES A | ||||
| 1F973..1F976; DISALLOWED # FACE WITH PARTY HORN AND PARTY HAT..FREEZING | ||||
| 1F97A ; DISALLOWED # FACE WITH PLEADING EYES | ||||
| 1F97C..1F97F; DISALLOWED # LAB COAT..FLAT SHOE | ||||
| 1F998..1F9A2; DISALLOWED # KANGAROO..SWAN | ||||
| 1F9B0..1F9B9; DISALLOWED # EMOJI COMPONENT RED HAIR..SUPERVILLAIN | ||||
| 1F9C1..1F9C2; DISALLOWED # CUPCAKE..SALT SHAKER | ||||
| 1F9E7..1F9FF; DISALLOWED # RED GIFT ENVELOPE..NAZAR AMULET | ||||
| 1FA60..1FA6D; DISALLOWED # XIANGQI RED GENERAL..XIANGQI BLACK SOLDIER | ||||
| Appendix F. Changes from Unicode 11.0.0 to Unicode 12.0.0 | Appendix F. Changes from Unicode 11.0.0 to Unicode 12.0.0 | |||
| Changes from derived property value UNASSIGNED to either PVALID or | Changes from derived property value UNASSIGNED to either PVALID or | |||
| DISALLOWED. | DISALLOWED. | |||
| 0C77..0C7F ; DISALLOWED # TELUGU SIGN SIDDHAM..TELUGU SIGN TUUMU | 0C77 ; DISALLOWED # TELUGU SIGN SIDDHAM | |||
| 0E86..0E8A ; PVALID # LAO LETTER PALI GHA..LAO LETTER SO TAM | 0E86 ; PVALID # LAO LETTER PALI GHA | |||
| 0E8C..0EA3 ; PVALID # LAO LETTER PALI JHA..LAO LETTER LO LING | 0E89 ; PVALID # LAO LETTER PALI CHA | |||
| 0EA7..0EB2 ; PVALID # LAO LETTER WO..LAO VOWEL SIGN AA | 0E8C ; PVALID # LAO LETTER PALI JHA | |||
| 0EB4..0EBD ; PVALID # LAO VOWEL SIGN I..LAO SEMIVOWEL SIGN NYO | 0E8E..0E93 ; PVALID # LAO LETTER PALI NYA..LAO LETTER PALI NNA | |||
| 1CD4..1CFA ; PVALID # VEDIC SIGN YAJURVEDIC MIDLINE SVARITA..VEDIC | 0E98 ; PVALID # LAO LETTER PALI DHA | |||
| 2B98..2C2E ; DISALLOWED # THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL AR | 0EA0 ; PVALID # LAO LETTER PALI BHA | |||
| 2E30..2E4F ; DISALLOWED # RING POINT..CORNISH VERSE DIVIDER | 0EA8..0EA9 ; PVALID # LAO LETTER SANSKRIT SHA..LAO LETTER SANSKRIT | |||
| A7BA ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL A | 0EAC ; PVALID # LAO LETTER PALI LLA | |||
| A7BB ; PVALID # LATIN SMALL LETTER GLOTTAL A | 0EBA ; PVALID # LAO SIGN PALI VIRAMA | |||
| A7BC ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL I | 1CFA ; PVALID # VEDIC SIGN DOUBLE ANUSVARA ANTARGOMUKHA | |||
| A7BD ; PVALID # LATIN SMALL LETTER GLOTTAL I | 2BC9 ; DISALLOWED # NEPTUNE FORM TWO | |||
| A7BE ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL U | 2BFF ; DISALLOWED # HELLSCHREIBER PAUSE SYMBOL | |||
| A7BF ; PVALID # LATIN SMALL LETTER GLOTTAL U | 2E4F ; DISALLOWED # CORNISH VERSE DIVIDER | |||
| A7C2 ; DISALLOWED # LATIN CAPITAL LETTER ANGLICANA W | A7BA ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL A | |||
| A7C3 ; PVALID # LATIN SMALL LETTER ANGLICANA W | A7BB ; PVALID # LATIN SMALL LETTER GLOTTAL A | |||
| A7C4..A7C6 ; DISALLOWED # LATIN CAPITAL LETTER C WITH PALATAL HOOK..LA | A7BC ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL I | |||
| AB60..AB67 ; PVALID # LATIN SMALL LETTER SAKHA YAT..LATIN SMALL LE | A7BD ; PVALID # LATIN SMALL LETTER GLOTTAL I | |||
| 10FE0..10FF6; PVALID # ELYMAIC LETTER ALEPH..ELYMAIC LIGATURE ZAYIN | A7BE ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL U | |||
| 1145E..1145F; PVALID # NEWA SANDHI MARK..NEWA LETTER VEDIC ANUSVARA | A7BF ; PVALID # LATIN SMALL LETTER GLOTTAL U | |||
| 11680..116B8; PVALID # TAKRI LETTER A..TAKRI LETTER ARCHAIC KHA | A7C2 ; DISALLOWED # LATIN CAPITAL LETTER ANGLICANA W | |||
| 119A0..119A7; PVALID # NANDINAGARI LETTER A..NANDINAGARI LETTER VOC | A7C3 ; PVALID # LATIN SMALL LETTER ANGLICANA W | |||
| 119AA..119D7; PVALID # NANDINAGARI LETTER E..NANDINAGARI VOWEL SIGN | A7C4..A7C6 ; DISALLOWED # LATIN CAPITAL LETTER C WITH PALATAL HOOK..LAT | |||
| 119DA..119E1; PVALID # NANDINAGARI VOWEL SIGN E..NANDINAGARI SIGN A | AB66..AB67 ; PVALID # LATIN SMALL LETTER DZ DIGRAPH WITH RETROFLEX | |||
| 119E2 ; DISALLOWED # NANDINAGARI SIGN SIDDHAM | 10FE0..10FF6; PVALID # ELYMAIC LETTER ALEPH..ELYMAIC LIGATURE ZAYIN- | |||
| 119E3..119E4; PVALID # NANDINAGARI HEADSTROKE..NANDINAGARI VOWEL SI | 1145F ; PVALID # NEWA LETTER VEDIC ANUSVARA | |||
| 11A50..11A99; PVALID # SOYOMBO LETTER A..SOYOMBO SUBJOINER | 116B8 ; PVALID # TAKRI LETTER ARCHAIC KHA | |||
| 11FC0..11FF1; DISALLOWED # TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIE | 119A0..119A7; PVALID # NANDINAGARI LETTER A..NANDINAGARI LETTER VOCA | |||
| 11FFF ; DISALLOWED # TAMIL PUNCTUATION END OF TEXT | 119AA..119D7; PVALID # NANDINAGARI LETTER E..NANDINAGARI VOWEL SIGN | |||
| 13430..13438; DISALLOWED # EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIA | 119DA..119E1; PVALID # NANDINAGARI VOWEL SIGN E..NANDINAGARI SIGN AV | |||
| 16F00..16F4A; PVALID # MIAO LETTER PA..MIAO LETTER RTE | 119E2 ; DISALLOWED # NANDINAGARI SIGN SIDDHAM | |||
| 16F4F..16F87; PVALID # MIAO SIGN CONSONANT MODIFIER BAR..MIAO VOWEL | 119E3..119E4; PVALID # NANDINAGARI HEADSTROKE..NANDINAGARI VOWEL SIG | |||
| 16FE2 ; DISALLOWED # OLD CHINESE HOOK MARK | 11A84..11A85; PVALID # SOYOMBO SIGN JIHVAMULIYA..SOYOMBO SIGN UPADHM | |||
| 16FE3 ; PVALID # OLD CHINESE ITERATION MARK | 11FC0..11FF1; DISALLOWED # TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIET | |||
| 17000..187F7; PVALID # <Tangut Ideograph>..<Tangut Ideograph> | 11FFF ; DISALLOWED # TAMIL PUNCTUATION END OF TEXT | |||
| 1B150..1B152; PVALID # HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SM | 13430..13438; DISALLOWED # EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN | |||
| 1B164..1B167; PVALID # KATAKANA LETTER SMALL WI..KATAKANA LETTER SM | 16F45..16F4A; PVALID # MIAO LETTER BRI..MIAO LETTER RTE | |||
| 1E100..1E12C; PVALID # NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG P | 16F4F ; PVALID # MIAO SIGN CONSONANT MODIFIER BAR | |||
| 1E130..1E13D; PVALID # NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUAC | 16F7F..16F87; PVALID # MIAO VOWEL SIGN UOG..MIAO VOWEL SIGN UI | |||
| 1E140..1E149; PVALID # NYIAKENG PUACHUE HMONG DIGIT ZERO..NYIAKENG | 16FE2 ; DISALLOWED # OLD CHINESE HOOK MARK | |||
| 1E14E ; PVALID # NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ | 16FE3 ; PVALID # OLD CHINESE ITERATION MARK | |||
| 1E14F ; DISALLOWED # NYIAKENG PUACHUE HMONG CIRCLED CA | 187F2..187F7; PVALID # <Tangut Ideograph>..<Tangut Ideograph> | |||
| 1E2C0..1E2F9; PVALID # WANCHO LETTER AA..WANCHO DIGIT NINE | 1B150..1B152; PVALID # HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMA | |||
| 1E2FF ; DISALLOWED # WANCHO NGUN SIGN | 1B164..1B167; PVALID # KATAKANA LETTER SMALL WI..KATAKANA LETTER SMA | |||
| 1E922..1E94B; PVALID # ADLAM SMALL LETTER ALIF..ADLAM NASALIZATION | 1E100..1E12C; PVALID # NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PU | |||
| 1ED01..1ED3D; DISALLOWED # OTTOMAN SIYAQ NUMBER ONE..OTTOMAN SIYAQ FRAC | 1E130..1E13D; PVALID # NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACH | |||
| 1F110..1F16C; DISALLOWED # PARENTHESIZED LATIN CAPITAL LETTER A..RAISED | 1E140..1E149; PVALID # NYIAKENG PUACHUE HMONG DIGIT ZERO..NYIAKENG P | |||
| 1F300..1F6D5; DISALLOWED # CYCLONE..HINDU TEMPLE | 1E14E ; PVALID # NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ | |||
| 1F6F0..1F6FA; DISALLOWED # SATELLITE..AUTO RICKSHAW | 1E14F ; DISALLOWED # NYIAKENG PUACHUE HMONG CIRCLED CA | |||
| 1F7E0..1F7EB; DISALLOWED # LARGE ORANGE CIRCLE..LARGE BROWN SQUARE | 1E2C0..1E2F9; PVALID # WANCHO LETTER AA..WANCHO DIGIT NINE | |||
| 1F90D..1F971; DISALLOWED # WHITE HEART..YAWNING FACE | 1E2FF ; DISALLOWED # WANCHO NGUN SIGN | |||
| 1F97A..1F9A2; DISALLOWED # FACE WITH PLEADING EYES..SWAN | 1E94B ; PVALID # ADLAM NASALIZATION MARK | |||
| 1F9A5..1F9AA; DISALLOWED # SLOTH..OYSTER | 1ED01..1ED3D; DISALLOWED # OTTOMAN SIYAQ NUMBER ONE..OTTOMAN SIYAQ FRACT | |||
| 1F9AE..1F9CA; DISALLOWED # GUIDE DOG..ICE CUBE | 1F16C ; DISALLOWED # RAISED MR SIGN | |||
| 1F9CD..1FA53; DISALLOWED # STANDING PERSON..BLACK CHESS KNIGHT-BISHOP | 1F6D5 ; DISALLOWED # HINDU TEMPLE | |||
| 1FA70..1FA73; DISALLOWED # BALLET SHOES..SHORTS | 1F6FA ; DISALLOWED # AUTO RICKSHAW | |||
| 1FA78..1FA7A; DISALLOWED # DROP OF BLOOD..STETHOSCOPE | 1F7E0..1F7EB; DISALLOWED # LARGE ORANGE CIRCLE..LARGE BROWN SQUARE | |||
| 1FA80..1FA82; DISALLOWED # YO-YO..PARACHUTE | 1F90D..1F90F; DISALLOWED # WHITE HEART..PINCHING HAND | |||
| 1FA90..1FA95; DISALLOWED # RINGED PLANET..BANJO | 1F93F ; DISALLOWED # DIVING MASK | |||
| 1F971 ; DISALLOWED # YAWNING FACE | ||||
| 1F97B ; DISALLOWED # SARI | ||||
| 1F9A5..1F9AA; DISALLOWED # SLOTH..OYSTER | ||||
| 1F9AE..1F9AF; DISALLOWED # GUIDE DOG..PROBING CANE | ||||
| 1F9BA..1F9BF; DISALLOWED # SAFETY VEST..MECHANICAL LEG | ||||
| 1F9C3..1F9CA; DISALLOWED # BEVERAGE BOX..ICE CUBE | ||||
| 1F9CD..1F9CF; DISALLOWED # STANDING PERSON..DEAF PERSON | ||||
| 1FA00..1FA53; DISALLOWED # NEUTRAL CHESS KING..BLACK CHESS KNIGHT-BISHOP | ||||
| 1FA70..1FA73; DISALLOWED # BALLET SHOES..SHORTS | ||||
| 1FA78..1FA7A; DISALLOWED # DROP OF BLOOD..STETHOSCOPE | ||||
| 1FA80..1FA82; DISALLOWED # YO-YO..PARACHUTE | ||||
| 1FA90..1FA95; DISALLOWED # RINGED PLANET..BANJO | ||||
| Author's Address | Author's Address | |||
| Patrik Faltstrom | Patrik Faltstrom | |||
| Netnod | Netnod | |||
| Email: paf@netnod.se | Email: paf@netnod.se | |||
| End of changes. 84 change blocks. | ||||
| 575 lines changed or deleted | 783 lines changed or added | |||
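The entries in the appendices above share one line shape: a code point or a `START..END` range in hexadecimal, a semicolon, the derived property value, and a `#` comment with the character name or names. As a minimal sketch, assuming that shape holds for every entry, the following Python reads such lines and looks up the value for one code point. The names `Entry`, `parse_entries`, and `lookup` are hypothetical and are not part of the draft or of any IANA tooling; the sample lines are copied from the right-hand (-07) column of Appendix F.

```python
import re
from typing import List, NamedTuple, Optional


class Entry(NamedTuple):
    """One table row: an inclusive code point range and its derived value."""
    start: int
    end: int
    value: str


# Assumed line shape: "XXXX[..YYYY] ; VALUE # comment" with 4-6 hex digits.
LINE_RE = re.compile(
    r"^\s*([0-9A-Fa-f]{4,6})(?:\.\.([0-9A-Fa-f]{4,6}))?\s*;\s*([A-Z]+)"
)


def parse_entries(text: str) -> List[Entry]:
    """Parse every line that matches the assumed shape; skip the rest."""
    entries = []
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match is None:
            continue  # headings, blank lines, prose
        start = int(match.group(1), 16)
        # A single code point is treated as a one-element range.
        end = int(match.group(2), 16) if match.group(2) else start
        entries.append(Entry(start, end, match.group(3)))
    return entries


def lookup(entries: List[Entry], code_point: int) -> Optional[str]:
    """Return the value of the first range containing code_point, else None."""
    for entry in entries:
        if entry.start <= code_point <= entry.end:
            return entry.value
    return None


# Sample rows taken from Appendix F (draft -07 column).
SAMPLE = """\
0C77        ; DISALLOWED  # TELUGU SIGN SIDDHAM
0E8E..0E93  ; PVALID      # LAO LETTER PALI NYA..LAO LETTER PALI NNA
A7BB        ; PVALID      # LATIN SMALL LETTER GLOTTAL A
1FA90..1FA95; DISALLOWED  # RINGED PLANET..BANJO
"""

if __name__ == "__main__":
    table = parse_entries(SAMPLE)
    for cp in (0x0C77, 0x0E90, 0x1FA95, 0x0041):
        print(f"U+{cp:04X}: {lookup(table, cp)}")
```

A result of `None` only means the code point is absent from the parsed rows. The appendices list changes between Unicode versions, not the full derived property table, so a miss says nothing about the code point's actual IDNA2008 status.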