[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [Ltru] Click letters



If anyone would like to review the entries in the draft code table for 639-3 and suggest character changes to get beyond ASCII / Latin-1 limitations, please feel free; I'm sure the RA would welcome the feedback. But don't necessarily expect guarantees that all or any such changes would necessarily be made prior to publication: that is potentially a large amount of feedback to vet and take action on. 

Note that a prior issue for the RA and the JAC to consider is what character set should be allowed in language names used in the 639 standards. If you would like to submit feedback in that regard, it should probably go to the JAC via Michael Everson -- IIRC he is IETF liaison to the JAC.


Peter


-----Original Message-----
From: Kent Karlsson [mailto:kent.karlsson14 at comhem.se] 
Sent: Wednesday, October 11, 2006 11:52 AM
To: 'John Cowan'; 'Doug Ewell'
Cc: 'LTRU Working Group'
Subject: RE: [Ltru] Click letters


> > I suppose it's not too early to point out that I would be 
> opposed to any 
> > nascent plans to "improve" on the characters used in ISO 639-3, by 
> > changing the ASCII slashes and bangs to "real" Unicode 
> click letters. 
> 
> +1

The spelling should of course be fixed in the source standard, in this
case preferably before 639-3 is published so that we don't get these
ASCII fallbacks in the language subtag registry (not even initially).

I have been intending to write to the relevant JAC about this (see below).

> > Applications are welcome to change these to the "correct" characters for 
> > display.
> 
> Not much point: they'd display the same way in any reasonable font.

No, they do not look like the ASCII fallbacks used at all, except for 
EXCLAMATION MARK vs. LATIN LETTER RETROFLEX CLICK.

> The point of these letter clones of punctuation marks is so that
> one can use natural-language identifiers (in programming languages,
> XML documents, etc.) without straying into the domain of punctuation
> marks, which are usually forbidden in identifiers.

No, it most certainly is NOT the point of those characters. While they (for
many programming languages) are *allowed* in identifiers, I hardly expect
them to be used in that way. The point of them, instead, is to use them
in ordinary text, mostly in the languages that use click sounds. Some
of the language names are, really, in those languages, and should
therefore be spelled using the click letters.

=====================================

Below you fined my draft comment (intended for the JAC) on this issue
that I have. (Note that this e-mail was set to be sent in UTF-8. Hopefully
it is still in UTF-8 when you receive it.)

Another comment I have is that country names should not be used for
disambiguation of same-named but different languages. Country names
are not quite as stable as one would want for the langauge names. Rather
one should use language group names (at a suitable level). Nor is it always
clear from the country names if one refers to closely related languages, or
to languages that are not at all related (though one can sometimes guess).

-----------------------------------------------

The following language code descriptions (language names) should be
spelled with the proper letters, not using ASCII fallbacks. The fallback
to proper letter mapping appears to be as follows:

        /  -> ǀ    01C0;LATIN LETTER DENTAL CLICK;Lo;0;L;;;;;N;LATIN LETTER PIPE;;;;
        // -> ǁ    01C1;LATIN LETTER LATERAL CLICK;Lo;0;L;;;;;N;LATIN LETTER DOUBLE PIPE;;;;
        =/ -> ǂ    01C2;LATIN LETTER ALVEOLAR CLICK;Lo;0;L;;;;;N;LATIN LETTER PIPE DOUBLE BAR;;;;
        !  -> ǃ    01C3;LATIN LETTER RETROFLEX CLICK;Lo;0;L;;;;;N;LATIN LETTER EXCLAMATION MARK;;;;
        '  -> ʼ    02BC;MODIFIER LETTER APOSTROPHE;Lm;0;L;;;;;N;;;;;

In the cases where the ' stands for glottal stop use MODIFIER LETTER APOSTROPHE.
Elision and genitive apostrophes are properly spelled with ’ 2019;RIGHT SINGLE
QUOTATION MARK as in the last nine entries listed below. I'm not sure if some
of the other apostrophes really are elision apostrophes too.

oun                        I        L        !O!ung
nmn                        I        L        !Xóõ
hnh                        I        L        //Ani
gnk                        I        L        //Gana
xeg                        I        E        //Xegwi
gwj                        I        L        /Gwi
xam                        I        E        /Xam
huc                        I        L        =/Hua
aue                        I        L        =/Kx'au//'ein
hgm                        I        L        Hai//om
ktz                        I        L        Ju/'hoan
ngh                        I        L        N/u
                                        
acq                        I        L        Arabic, Ta'izzi-Adeni Spoken
alu                        I        L        'Are'are
mtv                        I        L        Asaro'o
aot                        I        L        A'tong
kud                        I        L        'Auhelawa
bei                        I        L        Bekati'
byd                        I        L        Benyadu'
crt                        I        L        Chorote, Iyojwa'ja
crq                        I        L        Chorote, Iyo'wujwa
dri                        I        L        C'lela
ing                        I        L        Degexit'an
dor                        I        L        Dori'o
dup                        I        L        Duano'
fab                        I        L        Fa D'ambu
fmp                        I        L        Fe'fe'
gqa                        I        L        Ga'anda
gdg                        I        L        Ga'dang
gaq                        I        L        Gata'
bbj                        I        L        Ghomálá'
gmb                        I        L        Gula'alaa
hwc                        I        L        Hawai'i Creole English
hps                        I        L        Hawai'i Pidgin Sign Language
haw        haw                I        L        Hawaiian             [should this not use a letter apostrophe?]
dbj                        I        L        Ida'an
ilu                        I        L        Ili'uun
xkk                        I        L        Kaco'
kzf                        I        L        Kaili, Da'a
blk                        I        L        Karen, Pa'o
ksw                        I        L        Karen, S'gaw
xxk                        I        L        Ke'o
kuk                        I        L        Kepo'
xkc                        I        L        Kho'ini
krr                        I        L        Kru'ng 2        [what is the 2 here? 2? LATIN CAPITAL/SMALL TONE TWO? LATIN LETTER TWO WITH STROKE? ...]
xmp                        I        L        Kuku-Mu'inh
kuy                        I        L        Kuuku-Ya'u
bko                        I        L        Kwa'
wka                        I        E        Kw'adza
kwf                        I        L        Kwara'ae
lbi                        I        L        La'bi
lns                        I        L        Lamnso'
lra                        I        L        Lara'
ljl                        I        L        Li'o
prb                        I        L        Lua'
mhy                        I        L        Ma'anyan
mhi                        I        L        Ma'di
snm                        I        L        Ma'di, Southern
lva                        I        L        Maku'a
slz                        I        L        Ma'ya
yus                        I        L        Maya, Chan Santa Cruz           [should this not use a letter apostrophe?]
yua                        I        L        Maya, Yucatán           [should this not use a letter apostrophe?]
xmy                        I        L        Mayaguduna
emy                        I        E        Mayan, Epigraphic           [should this not use a letter apostrophe?]
mtk                        I        L        Mbe'
muc                        I        L        Mbu'
mym                        I        L        Me'en
mgo                        I        L        Meta'
gbn                        I        L        Mo'da
nae                        I        L        Naka'ela
nnz                        I        L        Nda'nda'
nxg                        I        L        Ngad'a
nea                        I        L        Ngad'a, Eastern
nbm                        I        L        Ngbaka Ma'bo
ncg                        I        L        Nisga'a
num                        I        L        Niuafo'ou
nyp                        I        E        Nyang'i
tyh                        I        L        O'du
lul                        I        L        Olu'bo
oyy                        I        L        Oya'oya
pqa                        I        L        Pa'a
ple                        I        L        Palu'e
phq                        I        L        Phana'
qxq                        I        L        Qashqa'i
apb                        I        L        Sa'a
snv                        I        L        Sa'ban
scq                        I        L        Sa'och
xst                        I        L        Silt'e
ssq                        I        L        So'a
rob                        I        L        Tae'
tln                        I        L        Talondo'
tto                        I        L        Ta'oih, Lower
tth                        I        L        Ta'oih, Upper
tct                        I        L        T'en
tve                        I        L        Te'un
mlu                        I        L        To'abaita
tlz                        I        L        Toala'
ood                        I        L        Tohono O'odham
sda                        I        L        Toraja-Sa'dan
tsl                        I        L        Ts'ün-Lao
ulm                        I        L        Ulumanda'
urk                        I        L        Urak Lawoi'
wag                        I        L        Wa'ema
wmh                        I        L        Waima'a
wih                        I        L        Wik-Me'anha
ame                        I        L        Yanesha'
pto                        I        L        Zo'é
                                        
eot                        I        L        Beti (Côte d'Ivoire)     [elision apostrophe]
crd                        I        L        Coeur d'Alene     [elision apostrophe]
fla                        I        L        Kalispel-Pend d'Oreille     [elision apostrophe]
kfo                        I        L        Koro (Côte d'Ivoire)     [elision apostrophe]
loi                        I        L        Loma (Côte d'Ivoire)     [elision apostrophe]
mre                        I        E        Martha's Vineyard Sign Language     [elision apostrophe]
ngm                        I        L        Ngatik Men's Creole     [elision apostrophe]
plw                        I        L        Palawano, Brooke's Point     [elision apostrophe]
neb                        I        L        Toura (Côte d'Ivoire)     [elision apostrophe]

======================================================


_______________________________________________
Ltru mailing list
Ltru at ietf.org
https://www1.ietf.org/mailman/listinfo/ltru
_______________________________________________
Ltru mailing list
Ltru at ietf.org
https://www1.ietf.org/mailman/listinfo/ltru

Note Well: Messages sent to this mailing list are the opinions of the senders and do not imply endorsement by the IETF.