
Re: [Ltru] extlang for users



Martin Hosken scripsit:

> So, if we go with extlang, how should I tag some Mandarin text? Let's
> say I am creating content. I have no history of previous tagging and
> I have no idea whether the text is also in any other language?

I don't understand that: if you know it's Mandarin, you surely know it
isn't anything else.  Text is not both Mandarin and Cantonese at the
same time (except for the highly artificial situation in Hong Kong of
reading Mandarin text word by word using Cantonese equivalents).

But let's say it's Mandarin.

> zh-cmn or cmn?

"cmn" is not an option if we use extlang, so your choice is between
"zh-cmn" and "zh".  Both have been used in the past, though "zh" much
more so.  "zh-cmn" is theoretically more specific (it excludes all
non-Mandarin varieties); in practice, though, almost all texts tagged
simply "zh" are in Mandarin, partly because almost all Chinese texts
are in Mandarin anyhow.
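
To illustrate the practical difference between the two tags, here is a
minimal sketch that assumes the consumer matches tags by prefix, in the
style of RFC 4647 basic filtering.  The function name basic_filter and
the sample tag list are hypothetical, chosen only for this example.

```python
def basic_filter(language_range: str, tag: str) -> bool:
    """Prefix matching in the style of RFC 4647 basic filtering.

    A range matches a tag if the two are equal (ignoring case) or if
    the range is a prefix of the tag ending at a '-' boundary.
    """
    r = language_range.lower()
    t = tag.lower()
    if r == "*":
        # The wildcard range matches every tag.
        return True
    return t == r or t.startswith(r + "-")


# Sample tags (illustrative only): plain Chinese, the extlang forms
# for Mandarin and Cantonese, and the bare ISO 639-3 code for Mandarin.
tags = ["zh", "zh-cmn", "zh-yue", "cmn"]

# A request for "zh" picks up both the generic and the extlang forms,
# but not the bare "cmn" tag.
matched = [t for t in tags if basic_filter("zh", t)]
print(matched)
```

Under this assumption a request for "zh" still finds content tagged
"zh-cmn", whereas a request for "zh-cmn" does not find content tagged
only "zh", which is where most existing Mandarin text sits.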

-- 
We are lost, lost.  No name, no business, no Precious, nothing.  Only empty.
Only hungry: yes, we are hungry.  A few little fishes, nassty bony little
fishes, for a poor creature, and they say death.  So wise they are; so just,
so very just.  --Gollum        cowan at ccil.org  http://ccil.org/~cowan
_______________________________________________
Ltru mailing list
Ltru at ietf.org
https://www.ietf.org/mailman/listinfo/ltru


