On Sat, Sep 16, 2006 at 04:28:01PM -0700, Mark Davis <mark.davis at icu-project.org> wrote a message of 98 lines which said: > BTW, I had updated my regex to the final spec for 4646. Here is a > single Perl or Java regex that does most of the parse: Isn't it too lax? When testing it in a Perl script, I find it accepts all my well-formed tags (OK) but also accepts wrongly: fr-Latn-F is well-formed en-a-bbb-a-ccc is well-formed tlh-a-b-foo is well-formed abcdefghi-012345678 is well-formed ab-abc-abc-abc-abc is well-formed ab-abcd-abc is well-formed ab-ab-abc is well-formed ab-123-abc is well-formed ab-abcde-abc is well-formed ab-1abc-abc is well-formed ab-ab-abcd is well-formed ab-123-abcd is well-formed ab-abcde-abcd is well-formed ab-1abc-abcd is well-formed ab-a-b is well-formed ab-a-x is well-formed ab--ab is well-formed ab-abc- is well-formed ab-c-abc-r-toto-c-abc is well-formed abcd-efg is well-formed aabbccddE is well-formed _______________________________________________ Ltru mailing list Ltru at ietf.org https://www1.ietf.org/mailman/listinfo/ltru
Note Well: Messages sent to this mailing list are the opinions of the senders and do not imply endorsement by the IETF.