Internet Draft Andrzej Bartosiewicz draft-bartosiewicz-idn-pltld-06.txt NASK (PL-TLD) February 23, 2004 Expires in six months Intended status: Informational Registering Internationalized Domain Names under .PL Status of this Memo This document is an Internet-Draft and is subject to all provisions of Section 10 of RFC2026 except that the right to produce derivative works is not granted. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet- Drafts as reference material or to cite them other than as "work in progress." The list of current Internet-Drafts can be accessed at http://www.ietf.org/1id-abstracts.html The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html Abstract This document describes rules of Internationalized Domain Name registration under PL-TLD. All the rules are based on the idea that the Registry registers the ACE label version of the domain name instead of IDN in UNICODE. This document also includes the list of accepted Unicode codepoints for IDN registration uder ".PL". Introduction NASK is the PL-TLD registry with the liberal policy of the internationalized domain name registration. 1. Main concepts The Policy of the IDN-registration under .PL is liberal and is based on the following concepts: -NASK accepts only the proper ACE labels (according to [3], [8] and [9]) which begin with the "xn--" prefix and contain only ASCII characters -the received string after the ToUnicode operation (according to Section 4.2 of [4] and Section 6.2 of [3]) applied to the ACE label MUST only include codepoints [5] from one of the following sets: a) Latin set: U+002D HYPEN-MINUS U+0030..U+0039 DIGIT ZERO .. DIGIT 9 U+0061..U+007A LATIN SMALL LETTER A .. LATIN SMALL LETTER Z U+00B7 MIDDLE DOT U+00E0 LATIN SMALL LETTER A WITH GRAVE U+00E1 LATIN SMALL LETTER A WITH ACUTE U+00E2 LATIN SMALL LETTER A WITH CIRCUMFLEX U+00E3 LATIN SMALL LETTER A WITH TILDE U+00E4 LATIN SMALL LETTER A WITH DIAERESIS U+00E5 LATIN SMALL LETTER A WITH RING ABOVE U+00E6 LATIN SMALL LETTER AE U+00E7 LATIN SMALL LETTER C WITH CEDILLA U+00E8 LATIN SMALL LETTER E WITH GRAVE U+00E9 LATIN SMALL LETTER E WITH ACUTE U+00EA LATIN SMALL LETTER E WITH CIRCUMFLEX U+00EB LATIN SMALL LETTER E WITH DIAERESIS U+00EC LATIN SMALL LETTER I WITH GRAVE U+00ED LATIN SMALL LETTER I WITH ACUTE U+00EE LATIN SMALL LETTER I WITH CIRCUMFLEX U+00EF LATIN SMALL LETTER I WITH DIAERESIS U+00F0 LATIN SMALL LETTER ETH U+00F1 LATIN SMALL LETTER N WITH TILDE U+00F2 LATIN SMALL LETTER O WITH GRAVE U+00F3 LATIN SMALL LETTER O WITH ACUTE U+00F4 LATIN SMALL LETTER O WITH CIRCUMFLEX U+00F5 LATIN SMALL LETTER O WITH TILDE U+00F6 LATIN SMALL LETTER O WITH DIAERESIS U+00F8 LATIN SMALL LETTER O WITH STROKE U+00F9 LATIN SMALL LETTER U WITH GRAVE U+00FA LATIN SMALL LETTER U WITH ACUTE U+00FB LATIN SMALL LETTER U WITH CIRCUMFLEX U+00FC LATIN SMALL LETTER U WITH DIAERESIS U+00FD LATIN SMALL LETTER Y WITH ACUTE U+00FE LATIN SMALL LETTER THORN U+00FF LATIN SMALL LETTER Y WITH DIAERESIS U+0101 LATIN SMALL LETTER A WITH MACRON U+0103 LATIN SMALL LETTER A WITH BREVE U+0105 LATIN SMALL LETTER A WITH OGONEK U+0107 LATIN SMALL LETTER C WITH ACUTE U+0109 LATIN SMALL LETTER C WITH CIRCUMFLEX U+010B LATIN SMALL LETTER C WITH DOT ABOVE U+010D LATIN SMALL LETTER C WITH CARON U+010F LATIN SMALL LETTER D WITH CARON U+0111 LATIN SMALL LETTER D WITH STROKE U+0113 LATIN SMALL LETTER E WITH MACRON U+0115 LATIN SMALL LETTER E WITH BREVE U+0117 LATIN SMALL LETTER E WITH DOT ABOVE U+0119 LATIN SMALL LETTER E WITH OGONEK U+011B LATIN SMALL LETTER E WITH CARON U+011D LATIN SMALL LETTER G WITH CIRCUMFLEX U+011F LATIN SMALL LETTER G WITH BREVE U+0121 LATIN SMALL LETTER G WITH DOT ABOVE U+0123 LATIN SMALL LETTER G WITH CEDILLA U+0125 LATIN SMALL LETTER H WITH CIRCUMFLEX U+0127 LATIN SMALL LETTER H WITH STROKE U+0129 LATIN SMALL LETTER I WITH TILDE U+012B LATIN SMALL LETTER I WITH MACRON U+012D LATIN SMALL LETTER I WITH BREVE U+012F LATIN SMALL LETTER I WITH OGONEK U+0131 LATIN SMALL LETTER DOTLESS I U+0135 LATIN SMALL LETTER J WITH CIRCUMFLEX U+0137 LATIN SMALL LETTER K WITH CEDILLA U+0138 LATIN SMALL LETTER KRA U+013A LATIN SMALL LETTER L WITH ACUTE U+013C LATIN SMALL LETTER L WITH CEDILLA U+013E LATIN SMALL LETTER L WITH CARON U+0142 LATIN SMALL LETTER L WITH STROKE U+0144 LATIN SMALL LETTER N WITH ACUTE U+0146 LATIN SMALL LETTER N WITH CEDILLA U+0148 LATIN SMALL LETTER N WITH CARON U+014B LATIN SMALL LETTER ENG U+014D LATIN SMALL LETTER O WITH MACRON U+014F LATIN SMALL LETTER O WITH BREVE U+0151 LATIN SMALL LETTER O WITH DOUBLE ACUTE U+0153 LATIN SMALL LIGATURE OE U+0155 LATIN SMALL LETTER R WITRH ACUTE U+0157 LATIN SMALL LETTER R WITH CEDILLA U+0159 LATIN SMALL LETTER R WITH CARON U+015B LATIN SMALL LETTER S WITH ACUTE U+015D LATIN SMALL LETTER S WITH CIRCUMFLEX U+015F LATIN SMALL LETTER S WITH CEDILLA U+0161 LATIN SMALL LETTER S WITH CARON U+0163 LATIN SMALL LETTER T WITH CEDILLA U+0165 LATIN SMALL LETTER T WITH CARON U+0167 LATIN SMALL LETTER T WITH STROKE U+0169 LATIN SMALL LETTER U WITH TILDE U+016B LATIN SMALL LETTER U WITH MACRON U+016D LATIN SMALL LETTER U WITH BREVE U+016F LATIN SMALL LETTER U WITH RING ABOVE U+0171 LATIN SMALL LETTER U WITH DOUBLE ACUTE U+0173 LATIN SMALL LETTER U WITH OGONEK U+0175 LATIN SMALL LETTER W WITH CIRCUMFLEX U+0177 LATIN SMALL LETTER Y WITH CIRCUMFLEX U+017A LATIN SMALL LETTER Z WITH ACUTE U+017C LATIN SMALL LETTER Z WITH DOT ABOVE U+017E LATIN SMALL LETTER Z WITH CARON b) Greek set: U+002D HYPEN-MINUS U+0030..U+0039 DIGIT ZERO .. DIGIT 9 U+0390 GREEK SMALL LETTER IOTA WITH DIALITYKA AND TONOS U+03AC GREEK SMALL LETTER ALPHA WITH TONOS U+03AD GREEK SMALL LETTER EPSILON WITH TONOS U+03AE GREEK SMALL LETTER ETA WITH TONOS U+03AF GREEK SMALL LETTER IOTA WITH TONOS U+03B0 GREEK SMALL LETTER UPSILON WITH DIALITYKA AND TONOS U+03B1 GREEK SMALL LETTER ALPHA U+03B2 GREEK SMALL LETTER BETA U+03B3 GREEK SMALL LETTER GAMMA U+03B4 GREEK SMALL LETTER DELTA U+03B5 GREEK SMALL LETTER EPSILON U+03B6 GREEK SMALL LETTER ZETA U+03B7 GREEK SMALL LETTER ETA U+03B8 GREEK SMALL LETTER THETA U+03B9 GREEK SMALL LETTER IOTA U+03BA GREEK SMALL LETTER KAPPA U+03BB GREEK SMALL LETTER LAMDA U+03BC GREEK SMALL LETTER MU U+03BD GREEK SMALL LETTER NU U+03BE GREEK SMALL LETTER XI U+03BF GREEK SMALL LETTER OMRICON U+03C0 GREEK SMALL LETTER PI U+03C1 GREEK SMALL LETTER RHO U+03C2 GREEK SMALL LETTER FINAL SIGMA U+03C3 GREEK SMALL LETTER SIGMA U+03C4 GREEK SMALL LETTER TAU U+03C5 GREEK SMALL LETTER UPSILON U+03C6 GREEK SMALL LETTER PHI U+03C7 GREEK SMALL LETTER CHI U+03C8 GREEK SMALL LETTER PSI U+03C9 GREEK SMALL LETTER OMEGA U+03CA GREEK SMALL LETTER IOTA WITH DIALYKTIKA U+03CB GREEK SMALL LETTER UPSILON WITH DIALYKTIKA U+03CC GREEK SMALL LETTER OMICRON WITH TONOS U+03CD GREEK SMALL LETTER UPSILON WITH TONOS U+03CE GREEK SMALL LETTER OMEGA WITH TONOS c) Cyrillic set: U+002D HYPEN-MINUS U+0030..U+0039 DIGIT ZERO .. DIGIT 9 U+0430 CYRILLIC SMALL LETER A U+0431 CYRILLIC SMALL LETER BE U+0432 CYRILLIC SMALL LETER VE U+0433 CYRILLIC SMALL LETER GHE U+0434 CYRILLIC SMALL LETER DE U+0435 CYRILLIC SMALL LETER IE U+0436 CYRILLIC SMALL LETER ZHE U+0437 CYRILLIC SMALL LETER ZE U+0438 CYRILLIC SMALL LETER I U+0439 CYRILLIC SMALL LETER SHORT I U+043A CYRILLIC SMALL LETER KA U+043B CYRILLIC SMALL LETER EL U+043C CYRILLIC SMALL LETER EM U+043D CYRILLIC SMALL LETER EN U+043E CYRILLIC SMALL LETER O U+043F CYRILLIC SMALL LETER PE U+0440 CYRILLIC SMALL LETER ER U+0441 CYRILLIC SMALL LETER ES U+0442 CYRILLIC SMALL LETER TE U+0443 CYRILLIC SMALL LETER U U+0444 CYRILLIC SMALL LETER EF U+0445 CYRILLIC SMALL LETER HA U+0446 CYRILLIC SMALL LETER TSE U+0447 CYRILLIC SMALL LETER CHE U+0448 CYRILLIC SMALL LETER SHA U+0449 CYRILLIC SMALL LETER SHCHA U+044A CYRILLIC SMALL LETER HARD SIGN U+044B CYRILLIC SMALL LETER YERU U+044C CYRILLIC SMALL LETER SOFT SIGN U+044D CYRILLIC SMALL LETER E U+044E CYRILLIC SMALL LETER YU U+044F CYRILLIC SMALL LETER YA U+0450 CYRILLIC SMALL LETER IE WITH GRAVE U+0451 CYRILLIC SMALL LETER IO U+0452 CYRILLIC SMALL LETER DJE U+0453 CYRILLIC SMALL LETER GJE U+0454 CYRILLIC SMALL LETER UKRAINIAN IE U+0455 CYRILLIC SMALL LETER DZE U+0456 CYRILLIC SMALL LETER BYELORUSSIAN-UKRAINIAN I U+0457 CYRILLIC SMALL LETER YI U+0458 CYRILLIC SMALL LETER JE U+0459 CYRILLIC SMALL LETER LJE U+045A CYRILLIC SMALL LETER NJE U+045B CYRILLIC SMALL LETER TSHE U+045C CYRILLIC SMALL LETER KJE U+045D CYRILLIC SMALL LETER I WITH GRAVE U+045E CYRILLIC SMALL LETER SHORT U U+045F CYRILLIC SMALL LETER DZHE U+0491 CYRILLIC SMALL LETER GHE WITH UPTURN U+04C2 CYRILLIC SMALL LETER ZHE WITH BREVE d) Hebrew set: U+002D HYPEN-MINUS U+0030..U+0039 DIGIT ZERO .. DIGIT 9 U+05D0 HEBREW LETTER ALEF U+05D1 HEBREW LETTER BET U+05D2 HEBREW LETTER GIME U+05D3 HEBREW LETTER DALET U+05D4 HEBREW LETTER HE U+05D5 HEBREW LETTER VAV U+05D6 HEBREW LETTER ZAYIN U+05D7 HEBREW LETTER HET U+05D8 HEBREW LETTER TET U+05D9 HEBREW LETTER YOD U+05DA HEBREW LETTER FINAL KAF U+05DB HEBREW LETTER KAF U+05DC HEBREW LETTER LAMED U+05DD HEBREW LETTER FINAL MEM U+05DE HEBREW LETTER MEM U+05DF HEBREW LETTER FINAL NUN U+05E0 HEBREW LETTER NUN U+05E1 HEBREW LETTER SAMEKH U+05E2 HEBREW LETTER AYIN U+05E3 HEBREW LETTER FINAL PE U+05E4 HEBREW LETTER PE U+05E5 HEBREW LETTER FINAL TSADI U+05E6 HEBREW LETTER TSADI U+05E7 HEBREW LETTER QOF U+05E8 HEBREW LETTER RESH U+05E9 HEBREW LETTER SHIN U+05EA HEBREW LETTER TAV e) Arabic set: U+002D HYPEN-MINUS U+0621 ARABIC LETTER HAMZA U+0622 ARABIC LETTER ALEF WITH MADDA U+0623 ARABIC LETTER ALEF WITH HAMZA U+0624 ARABIC LETTER WAW WITH HAMZA U+0625 ARABIC LETTER ALEF WITH HAMZA BELOW U+0626 ARABIC LETTER YEH WITH HAMZA ABOVE U+0627 ARABIC LETTER ALEF U+0628 ARABIC LETTER BEH U+0629 ARABIC LETTER MARBUTA U+062A ARABIC LETTER TEH U+062B ARABIC LETTER THEH U+062C ARABIC LETTER JEMM U+062D ARABIC LETTER HAH U+062E ARABIC LETTER KHAH U+062F ARABIC LETTER DAL U+0630 ARABIC LETTER THAL U+0631 ARABIC LETTER REH U+0632 ARABIC LETTER ZAIN U+0633 ARABIC LETTER SEEN U+0634 ARABIC LETTER SHEEN U+0635 ARABIC LETTER SAD U+0636 ARABIC LETTER DAD U+0637 ARABIC LETTER TAH U+0638 ARABIC LETTER ZAH U+0639 ARABIC LETTER AIN U+063A ARABIC LETTER GHAIN U+0641 ARABIC LETTER FEH U+0642 ARABIC LETTER QAF U+0643 ARABIC LETTER KAF U+0644 ARABIC LETTER LAM U+0645 ARABIC LETTER MEEM U+0646 ARABIC LETTER NOON U+0647 ARABIC LETTER HEH U+0648 ARABIC LETTER WAW U+0649 ARABIC LETTER ALEF MAKSURA U+064A ARABIC LETTER YEH U+0660 ARABIC-INDIC DIGIT ZERO U+0661 ARABIC-INDIC DIGIT ONE U+0662 ARABIC-INDIC DIGIT TWO U+0663 ARABIC-INDIC DIGIT THREE U+0664 ARABIC-INDIC DIGIT FOUR U+0665 ARABIC-INDIC DIGIT FIVE U+0666 ARABIC-INDIC DIGIT SIX U+0667 ARABIC-INDIC DIGIT SEVEN U+0668 ARABIC-INDIC DIGIT EIGHT U+0669 ARABIC-INDIC DIGIT NINE -A combination of characters from different sets is not allowed. <------------- ToUnicode ---------------------- --------------- | internationalized | | ACE label + | | domain name | --NAMEPREP--> ---PUNYCODE--> | FULL STOP + | | IDN | +"xn--" | zone | ---------------------- --------------- -------------> ToASCII 2. The IDN influence on Registry Registry's IDN registration policy described in Section 1 has no influence on Registry activities especially on invocing, whois and registration process. The Registry does not provide sunrise period for IDN registrations. 2.1. The subject of the contract The subject of the contract between Registry and Registrant is the ACE version of the IDN domain name. Registry stores ACE version in the zone files, whois and other internal databases. Registy MUST NOT store the IDN version of the domain name. 2.2. Variant tables Registry does not implement variant tables and bundles (concept of variant tables and bundles is described in [6]) 2.3. Influence on EPP implementation. There is no influence on EPP implementation [7]. 3. The release date a. The release date for Polish characters registration under .PL is September 11th, 2003. b. The release date for German characters is October 6th, 2003. c. The release date for remaining characters from Latin-1 Supplement and Latin Extended-A scripts is October 20th, 2003 d. The release date for Greek, Hebrew and Arabic script is November 3rd, 2003 e. The release date for Cyrillic script is February 26th, 2004 4. Terminology Characters in the document are given as their Unicode codepoints on U+XXXX format (X is a hexadecimal digit). "Registry" in this document is equivalent to PL-TLD Registry "NASK". "ACE label" and "ACE prefix" is defined in Section 2 of [4]. "NAMEPREP" is defined in [1], "PUNYCODE", "ToUnicode" and "ToASCII" is defined on [3]. 5. Normative References [1] Hoffman, P. and M. Blanchet, "Nameprep: A Stringprep Profile for Internationalized Domain Names (IDN)", RFC 3491, March 2003. [2] P. Hoffman, M. Blanchet, "Preparation of Internationalized Strings", RFC 3454, December 2002 [3] A. Costello, "Punycode: A Bootstring encoding of Unicode for Internationalized Domain Names in Applications (IDNA)", RFC 3492, March 2003. [4] Faltstrom, P., Hoffman, P. and A Costello, "Internationalizing Domain Names in Applications (IDNA)", RFC 3490, March 2003. [5] The Unicode Consortium, "The Unicode Standard", http://www.unicode.org/unicode/standard/standard.html [6] Paul Hoffman, "A Method for Registering Internationalized Domain Names", Internet Draft, June 2003. [7] Zygmuntowicz, Wegrzynowicz, Laszkiewicz, Brzostek, Zarowski, "EPP parameters for .pl ccTLD", Internet-Draft, draft-zygmuntowicz-epp-pltld-00.txt, May 2003 [8] P. Mockapetris, "Domain Names - Concepts and Facilities", RFC 1034, November 1987 [9] P. Mockapetris, "Domain Names - Implementation and Specification," RFC-1035, November 1987. 6. IANA considerations There are no IANA considerations for this document. 7. Authors' addresses Andrzej Bartosiewicz NASK 18 Wawozowa Street PL-02-796 Warsaw Poland andrzejb@nask.pl Krzysztof Olesik NASK 18 Wawozowa Street PL-02-796 Warsaw Poland kolesik@nask.pl