Randy Presuhn <randy underscore presuhn at mindspring dot com> wrote:
Our intention in the ABNF was *NOT* to describe the registry as a byte-stream, but rather to describe it as a character stream. I view the encoding of the registry as UTF-8 (rather than UTF-16 or UTF-32 or EBCDIC :-) as a separate matter from the ABNF, and think that trying to accomplish both in the ABNF would only serve to confuse rather than enlighten.
I agree completely. The main aspect of this revision as it pertains to representation of the Registry is that characters beyond U+007F will no longer be represented by hex references like &#xC1; but as themselves. The UTF-8 encoding of these characters is a separate matter from the grammar.
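To make the distinction concrete, here is a minimal Python sketch, offered only as an illustration and not as anything the draft specifies. The function name expand_hex_refs and the sample line are hypothetical; the sample is not quoted from the actual Registry. It expands hex references into the characters they name (the character-stream view the grammar describes) and then, as a separate step, serializes the result as UTF-8.

```python
import re

# Matches hexadecimal character references such as "&#xC1;".
HEX_REF = re.compile(r"&#x([0-9A-Fa-f]+);")


def expand_hex_refs(text: str) -> str:
    """Replace each hex reference with the character it names (hypothetical helper)."""
    return HEX_REF.sub(lambda m: chr(int(m.group(1), 16)), text)


# Illustrative record line in the old, escaped style (an assumption, not real Registry data).
old_style = "Description: Bokm&#xE5;l"

# Character level: what the grammar sees. No encoding is involved yet.
new_style = expand_hex_refs(old_style)
print(new_style)

# Encoding level: a separate step that turns the characters into bytes.
utf8_bytes = new_style.encode("utf-8")
print(len(new_style), "characters,", len(utf8_bytes), "bytes in UTF-8")
```

The character count and the byte count differ whenever a character beyond U+007F is present, which is why describing both layers in one grammar would tend to blur them.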
--
Doug Ewell * Thornton, Colorado, USA * RFC 4645 * UTN #14
http://www.ewellic.org
http://www1.ietf.org/html.charters/ltru-charter.html
http://www.alvestrand.no/mailman/listinfo/ietf-languages
Note Well: Messages sent to this mailing list are the opinions of the senders and do not imply endorsement by the IETF.