| < draft-klensin-idna-5892upd-unicode70-01.txt | draft-klensin-idna-5892upd-unicode70-02.txt > | |||
|---|---|---|---|---|
| Network Working Group J. Klensin | Network Working Group J. Klensin | |||
| Internet-Draft | Internet-Draft | |||
| Updates: 5892, 5894 (if approved) P. Faltstrom | Updates: 5892, 5894 (if approved) P. Faltstrom | |||
| Intended status: Standards Track Netnod | Intended status: Standards Track Netnod | |||
| Expires: June 10, 2015 December 7, 2014 | Expires: June 10, 2015 December 7, 2014 | |||
| IDNA Update for Unicode 7.0.0 | IDNA Update for Unicode 7.0.0 | |||
| draft-klensin-idna-5892upd-unicode70-01.txt | draft-klensin-idna-5892upd-unicode70-02.txt | |||
| Abstract | Abstract | |||
| The current version of the IDNA specifications anticipated that each | The current version of the IDNA specifications anticipated that each | |||
| new version of Unicode would be reviewed to verify that no changes | new version of Unicode would be reviewed to verify that no changes | |||
| had been introduced that required adjustments to the set of rules | had been introduced that required adjustments to the set of rules | |||
| and, in particular, whether new exceptions or backward compatibility | and, in particular, whether new exceptions or backward compatibility | |||
| adjustments were needed. That review was conducted for Unicode 7.0.0 | adjustments were needed. That review was conducted for Unicode 7.0.0 | |||
| and identified a potentially problematic new code point. This | and identified a potentially problematic new code point. This | |||
| specification discusses that code point and associated issues and | specification discusses that code point and associated issues and | |||
| skipping to change at page 2, line 32 ¶ | skipping to change at page 2, line 32 ¶ | |||
| 3.1. Disallow This New Code Point . . . . . . . . . . . . . . 9 | 3.1. Disallow This New Code Point . . . . . . . . . . . . . . 9 | |||
| 3.2. Disallow the combining sequences for these characters . . 10 | 3.2. Disallow the combining sequences for these characters . . 10 | |||
| 3.3. Do Nothing Other Than Warn . . . . . . . . . . . . . . . 11 | 3.3. Do Nothing Other Than Warn . . . . . . . . . . . . . . . 11 | |||
| 3.4. Normalization Form IETF (or DNS) . . . . . . . . . . . . 11 | 3.4. Normalization Form IETF (or DNS) . . . . . . . . . . . . 11 | |||
| 4. Editorial clarification to RFC 5892 . . . . . . . . . . . . . 11 | 4. Editorial clarification to RFC 5892 . . . . . . . . . . . . . 11 | |||
| 5. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . 12 | 5. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . 12 | |||
| 6. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 12 | 6. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 12 | |||
| 7. Security Considerations . . . . . . . . . . . . . . . . . . . 12 | 7. Security Considerations . . . . . . . . . . . . . . . . . . . 12 | |||
| 8. References . . . . . . . . . . . . . . . . . . . . . . . . . 13 | 8. References . . . . . . . . . . . . . . . . . . . . . . . . . 13 | |||
| 8.1. Normative References . . . . . . . . . . . . . . . . . . 13 | 8.1. Normative References . . . . . . . . . . . . . . . . . . 13 | |||
| 8.2. Informative References . . . . . . . . . . . . . . . . . 14 | 8.2. Informative References . . . . . . . . . . . . . . . . . 15 | |||
| Appendix A. Change Log . . . . . . . . . . . . . . . . . . . . . 15 | Appendix A. Change Log . . . . . . . . . . . . . . . . . . . . . 15 | |||
| A.1. Changes from version -00 to -01 . . . . . . . . . . . . . 15 | A.1. Changes from version -00 to -01 . . . . . . . . . . . . . 15 | |||
| A.2. Changes from version -01 to -02 . . . . . . . . . . . . . 15 | ||||
| Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 15 | Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 15 | |||
| 1. Introduction | 1. Introduction | |||
| The current version of the IDNA specifications, known as "IDNA2008" | The current version of the IDNA specifications, known as "IDNA2008" | |||
| [RFC5890], anticipated that each new version of Unicode would be | [RFC5890], anticipated that each new version of Unicode would be | |||
| reviewed to verify that no changes had been introduced that required | reviewed to verify that no changes had been introduced that required | |||
| adjustments to IDNA's rules and, in particular, whether new | adjustments to IDNA's rules and, in particular, whether new | |||
| exceptions or backward compatibility adjustments were needed. When | exceptions or backward compatibility adjustments were needed. When | |||
| that review was carefully conducted for Unicode 7.0.0 [Unicode7], | that review was carefully conducted for Unicode 7.0.0 [Unicode7], | |||
| skipping to change at page 7, line 38 ¶ | skipping to change at page 7, line 38 ¶ | |||
| point is that there are several other Arabic-script code points that | point is that there are several other Arabic-script code points that | |||
| behave in the same way for similar language-specific reasons. | behave in the same way for similar language-specific reasons. | |||
| In particular, at least three other grapheme clusters that have been | In particular, at least three other grapheme clusters that have been | |||
| present for many version of Unicode can be seen as involving issues | present for many version of Unicode can be seen as involving issues | |||
| similar to those for the newly-added ARABIC LETTER BEH WITH HAMZA | similar to those for the newly-added ARABIC LETTER BEH WITH HAMZA | |||
| ABOVE. ARABIC LETTER HAH WITH HAMZA ABOVE (U+0681) and ARABIC LETTER | ABOVE. ARABIC LETTER HAH WITH HAMZA ABOVE (U+0681) and ARABIC LETTER | |||
| REH WITH HAMZA ABOVE (U+076C) do not have decomposition forms and are | REH WITH HAMZA ABOVE (U+076C) do not have decomposition forms and are | |||
| preferred over combining sequences using HAMZA ABOVE (U+0654) | preferred over combining sequences using HAMZA ABOVE (U+0654) | |||
| [Unicode62-Hamza]. By contrast, ARABIC LETTER ALEF WITH HAMZA ABOVE | [Unicode62-Hamza]. By contrast, ARABIC LETTER ALEF WITH HAMZA ABOVE | |||
| (U+0623) decomposes into \u'0627'\u'0653' and ARABIC LETTER YEH WITH | (U+0623) decomposes into \u'0627'\u'0654' and ARABIC LETTER YEH WITH | |||
| HAMZA ABOVE (U+0626) decomposes into \u'064A'\u'0654' so the | HAMZA ABOVE (U+0626) decomposes into \u'064A'\u'0654' so the | |||
| precomposed character and combining sequences compare equal when both | precomposed character and combining sequences compare equal when both | |||
| are normalized, as this specification prefers. | are normalized, as this specification prefers. | |||
| There are other variations in which a precomposed character involving | There are other variations in which a precomposed character involving | |||
| HAMZA ABOVE has a decomposition to a combining sequence that can form | HAMZA ABOVE has a decomposition to a combining sequence that can form | |||
| it. For example, ARABIC LETTER U WITH HAMZA ABOVE (U+0677) has a | it. For example, ARABIC LETTER U WITH HAMZA ABOVE (U+0677) has a | |||
| compatibility (???) decomposition into the combining sequence | compatibility (???) decomposition into the combining sequence | |||
| \u'06C7'\u'0674'. | \u'06C7'\u'0674'. | |||
| skipping to change at page 12, line 28 ¶ | skipping to change at page 12, line 28 ¶ | |||
| 5. Acknowledgements | 5. Acknowledgements | |||
| The Unicode 7.0.0 changes were extensively discussed within the IAB's | The Unicode 7.0.0 changes were extensively discussed within the IAB's | |||
| Internationalization Program. The authors are grateful for the | Internationalization Program. The authors are grateful for the | |||
| discussions and feedback there, especially from Andrew Sullivan and | discussions and feedback there, especially from Andrew Sullivan and | |||
| David Thaler. Additional information was requested and received from | David Thaler. Additional information was requested and received from | |||
| Mark Davis and Ken Whistler and while they probably do not agree with | Mark Davis and Ken Whistler and while they probably do not agree with | |||
| the necessity of excluding this code point or taking even more | the necessity of excluding this code point or taking even more | |||
| drastic action as their responsibility is to look at the Unicode | drastic action as their responsibility is to look at the Unicode | |||
| Consortium requirements for stability, the decision would not have | Consortium requirements for stability, the decision would not have | |||
| been possible without their input. Several experts and reviewers who | been possible without their input. Thanks to Bill McQuillan for | |||
| reading the document carefully enough to identify and report a | ||||
| confusing typographical error. Several experts and reviewers who | ||||
| prefer to remain anonymous also provided helpful input and comments | prefer to remain anonymous also provided helpful input and comments | |||
| on preliminary versions of this document. | on preliminary versions of this document. | |||
| 6. IANA Considerations | 6. IANA Considerations | |||
| When the IANA registry and tables are updated to reflect Unicode | When the IANA registry and tables are updated to reflect Unicode | |||
| 7.0.0, changes should be made according to the decisions the IETF | 7.0.0, changes should be made according to the decisions the IETF | |||
| makes about Section 3. | makes about Section 3. | |||
| 7. Security Considerations | 7. Security Considerations | |||
| skipping to change at page 15, line 32 ¶ | skipping to change at page 15, line 38 ¶ | |||
| RFC Editor: Please remove this appendix before publication. | RFC Editor: Please remove this appendix before publication. | |||
| A.1. Changes from version -00 to -01 | A.1. Changes from version -00 to -01 | |||
| o Version 01 of this document is an extensive rewrite and | o Version 01 of this document is an extensive rewrite and | |||
| reorganization, reflecting discussions with UTC members and adding | reorganization, reflecting discussions with UTC members and adding | |||
| three more options for discussion to the original proposal to | three more options for discussion to the original proposal to | |||
| simply disallow the new code point. | simply disallow the new code point. | |||
| A.2. Changes from version -01 to -02 | ||||
| Corrected a typographical error in which Hamza Above was incorrectly | ||||
| listed with the wrong code point. | ||||
| Authors' Addresses | Authors' Addresses | |||
| John C Klensin | John C Klensin | |||
| 1770 Massachusetts Ave, Ste 322 | 1770 Massachusetts Ave, Ste 322 | |||
| Cambridge, MA 02140 | Cambridge, MA 02140 | |||
| USA | USA | |||
| Phone: +1 617 245 1457 | Phone: +1 617 245 1457 | |||
| Email: john-ietf@jck.com | Email: john-ietf@jck.com | |||
| Patrik Faltstrom | Patrik Faltstrom | |||
| Netnod | Netnod | |||
| Franzengatan 5 | Franzengatan 5 | |||
| Stockholm 112 51 | Stockholm 112 51 | |||
| Sweden | Sweden | |||
| Phone: +46 70 6059051 | Phone: +46 70 6059051 | |||
| Email: paf@netnod.se | Email: paf@netnod.se | |||
| End of changes. 7 change blocks. | ||||
| 5 lines changed or deleted | 12 lines changed or added | |||
This html diff was produced by rfcdiff 1.48. The latest version is available from http://tools.ietf.org/tools/rfcdiff/ | ||||