| | LINGUIST List 7.950: Company names, Wide-character, Dutch dialects |
 | | There may be special escape characters or sequences to signal the beginning and end of converted strings, but these by preference should be required only in the converted form, not the wide-character standard form. |
 | | Obviously the converted strings cannot be understood as specific (strings of) displayed characters, such as Cyrillic capital shcha or Mandarin shi4 (~= 'be'), without knowing the language and code set of each string; but assume that each file contains only English and one other (variable) language, which is known for each file. |
 | | Also assume that character representation must remain constant, so that capital shcha is represented by the same ASCII substring wherever it occurs in its string. |
| www.ling.ed.ac.uk /linguist/issues/7/7-950.html (947 words) |