Martin Hosken scripsit: > So, if we go with extlang, how should I tag some Mandarin text? Let's > say I am creating content. I have no history of previous tagging and > I have no idea whether the text is also in any other language? I don't understand that: if you know it's Mandarin, you surely know it isn't anything else. Text is not both Mandarin and Cantonese at the same time (except for the highly artificial situation in Hong Kong of reading Mandarin text word by word using Cantonese equivalents). But let's say it's Mandarin. > zh-cmn or cmn? "cmn" is not an option if we use extlang, so your choice is between "zh-cmn" and "zh"? Both have been used in the past, though "zh" much more so. "zh-cmn" is theoretically more specific (it excludes all non-Mandarin varieties); in practice, though, almost all texts tagged simply "zh" are in Mandarin, partly because almost all Chinese texts are in Mandarin anyhow. -- We are lost, lost. No name, no business, no Precious, nothing. Only empty. Only hungry: yes, we are hungry. A few little fishes, nassty bony little fishes, for a poor creature, and they say death. So wise they are; so just, so very just. --Gollum cowan at ccil.org http://ccil.org/~cowan _______________________________________________ Ltru mailing list Ltru at ietf.org https://www.ietf.org/mailman/listinfo/ltru
Note Well: Messages sent to this mailing list are the opinions of the senders and do not imply endorsement by the IETF.