[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Simple] Re: [Geopriv] Domain identifier in common policy
On Nov 14, 2005, at 3:17 PM, Henning Schulzrinne wrote:
Even if the domain field is limited to ireg-name, I do not
believe that solves the problem. I'm not an IRI expert, but it
appears that not all ireg-name values are legal domain names.
Sure, but this is not a new problem. example.andy, ccs.columbia.edu
or ex$ample.com are not valid domain names, either, even without
the I18N problems. Thus, there will always be valid domain-looking
strings that don't match any protocol-possible string. This isn't
an issue - the rule component just will never match and somebody
will have to fix the typo.
I guess what I'm trying to say is that not all IRIs produce a valid
domain name. If you are saying, "it doesn't matter, these are never
handed to gethostbyname()" then I agree with you.
And I still wonder about the IDN case, especially given that this
appears to be an exact match. Which form is the exact match
performed on, the UTF8, a NamePrepped domain, or an ACE encoded
domain. Given that there can be variants of an IDN floating
around on the wire, it always seems to me that the comparison
should be done on the ACE version.
I may be looking at the wrong thing, but Section 5 of RFC 3987
seems relevant:
5. Normalization and Comparison . . . . . . . . . . . . . . . . . 21
5.1.
Equivalence . . . . . . . . . . . . . . . . . . . . . . 22
5.2. Preparation for
Comparison . . . . . . . . . . . . . . . 22
5.3. Comparison
Ladder . . . . . . . . . . . . . . . . . . . 23
5.3.1. Simple String
Comparison . . . . . . . . . . . . 23
5.3.2. Syntax-Based
Normalization . . . . . . . . . . . 24
5.3.3. Scheme-Based
Normalization . . . . . . . . . . . 27
5.3.4. Protocol-Based
Normalization . . . . . . . . . . 28
At the very least, common-policy ought to point out which comparison
is being done. I would assume that since these are being used as
identifiers, that section 5.3.1 is the section that is relevant here.
And it might be that I'm just not smart enough to understand
something as trivial as internationalized character comparison, but
what happens with punycode encoded domain names? As the example from
that RFC: if I receive it as domain="xn--99zt52a", do I convert it to
domain="納豆" for the comparison? As the RFC states:
Implementations with scheme-specific knowledge MAY convert
punycode-encoded domain name labels to the corresponding characters
by using the ToUnicode procedure.
Again, I'm not an expert, but shouldn't something specify what is to
happen here for doing this identity comparison?
-andy
_______________________________________________
Simple mailing list
Simple at ietf.org
https://www1.ietf.org/mailman/listinfo/simple