Factbites
 Where results make sense
About us   |   Why use us?   |   Reviews   |   PR   |   Contact us  

Topic: JIS X 0201


In the News (Thu 24 Dec 09)

  
 Compaq Tru64 UNIX Technical Reference for Using Japanese Features
The representation of ASCII/JIS Roman and JIS X 0208 characters is similar to that of DEC Kanji.
To allow the characters defined in these standards to be encoded in a single codeset, the first byte of each JIS X 0208 character is encoded in the ranges 81-9F and EO-FC, while the second byte is between 40 and FC, as shown in Table 2-4.
If there are JIS X 0208 characters on a line, there must be a switch to ASCII or to the left-hand part of (Roman letters) before the end of the line (in other words, before the CRLF, or carriage return and line feed).
www.zdv.uni-mainz.de /cms-extern/DU_5.0/HTML/SUPPDOCS/JAPANDOC/JAPANCH2.HTM   (2669 words)

  
 A complete introduction to Japanese character encodings   (Site not responding. Last check: 2007-10-15)
JIS is a blanket term used to describe all non-Unicode Japanese character sets and encodings, all of which are based on standards by the JSA (Japanese Standards Association).
JIS X 0212 was introduced in 1990 to accomodate the demand for rare kanji.
JIS X 0213 is the successor to JIS X 0208.
www.cs.mcgill.ca /~aelias4/encodings.html   (6765 words)

  
 [No title]   (Site not responding. Last check: 2007-10-15)
The bit combinations permitted in JIS X 0208 are explicitly described in the syntax so that the invalid character positions will be excluded in ISO-2022-JP text.
JIS X 0201 Latin set is identical to ASCII except for REVERSE SOLIDUS (BACKSLASH) and TILDE.
One possible deletion is to convert the characters to corresponding characters in JIS X 0208 1983.
mirror.aarnet.edu.au /pub/kame/i-d/draft-yamamoto-charset-iso-2022-jp-01.txt   (2023 words)

  
 DIGITAL UNIX Technical Reference for Using Japanese Features   (Site not responding. Last check: 2007-10-15)
The Roman letters defined in JIS X 0201 are the same as those in ASCII, except that the tilde sign (~) is replaced by a horizontal bar (located at the upper part of a character cell), and the backslash (\) is replaced by the Japan currency sign (Yen).
JIS X 0208 has defined a total of 6,879 characters which include those in Table 1-1.
JIS X 0212 was published by the Japanese Industrial Standard Committee in 1990, and JIS X 0212-1990 is the current revision.
www.phys.uu.nl /DU/unix/HTML/SUPPDOCS/JAPANDOC/JAPANCH1.HTM   (541 words)

  
 RFC 2237 (rfc2237) - Japanese Character Encoding for Internet Messages
It defines the use of JIS X 0208 as the double-byte character set in ISO-2022-JP text.
Fortunately JIS (Japanese Industry Standard) defines JIS X 0212 as "code of the supplementary Japanese graphic character set for information interchange".
JIS X 0201-Roman is not identical to the ASCII with two different characters.
www.faqs.org /rfcs/rfc2237.html   (1057 words)

  
 Problems on Interoperativity between Unicode and CJK Local Encodings   (Site not responding. Last check: 2007-10-15)
JIS X 0212 is a 94x94 ISO 2022-compliant character set and includes additional Kanji which are intended to be used with JIS X 0208.
JIS X 0213 is a new Japanese standard which was released in 2000.
It is a superset of JIS X 0208.
www.debian.or.jp /~kubota/unicode-symbols.html.en   (418 words)

  
 Shift-JIS - Encyclopedia Glossary Meaning Explanation Shift-JIS   (Site not responding. Last check: 2007-10-15)
It is based on character sets defined within JIS standards JIS X 0201:1997 (for the single-byte characters) and JIS X 0208:1997 (for the double byte characters).
There are two areas for expansion: Firstly, JIS X 0208 does not fill the whole 94x94 space encoded for it in Shift_JIS, therefore there is room for more characters here -- these are really extensions to JIS X 0208 rather than to Shift_JIS itself.
Secondly, Shift_JIS has more encoding space than is needed for JIS X 0201 and JIS X 0208 and these can and are used for yet more characters.
www.encyclopedia-glossary.com /en/Shift-JIS.html   (485 words)

  
 docs.sun.com: JFP リファレンスマニュアル 5 : ...   (Site not responding. Last check: 2007-10-15)
PCK (SJIS) と UTF–8 間の変換は、JIS X 0201、JIS X 0208 とユーザー定義文字およびベンダー定義文字が変換可能です。変換先の文字集合に対応する文字のない入力データが与えられた場合は、置換文字 (PCK: 「?」(0x3f)、 UTF-8: U+FFFD (0xefbfbd)) に変換されます。変換元の文字集合に属さない入力データが与えられた場合は、iconv(3C) は
PCK (SJIS) と UTF–8–Java 間の変換は、JIS X 0201、JIS X 0208 の文字が変換可能です。入力データとしてユーザー定義文字またはベンダー定義文字が与えられた場合は、置換文字 (PCK: 「?」(0x3f)、UTF–8: U+FFFD (0xefbfbd)) に変換されます。変換元の文字集合に属さない入力データが与えられた場合は、iconv(3C) は
JIS X 0201、JIS X 0208、JIS X 0212 で定義された文字を変換できます。変換先に対応する文字がない場合は置換文字に変換します。
docs.sun.com /app/docs/doc/817-1786/6mhgav5ki?a=view   (686 words)

  
 Traditional CJK charsets   (Site not responding. Last check: 2007-10-15)
The JIS X 0208 characters are organized in 94 rows ("ku") of 94 cells ("ten") so that they can be mapped over the 94 printable G0 characters of ASCII as specified by ISO 2022.
JIS X 0208 is thus structurally limited to 94×94 = 8'836 characters and with that low number no feasible replacement for Unicode with its 40'000+ characters.
GB 2312 is the Chinese equivalent to JIS X 0208 holding 6'763 hanzi, KS C 5601 the Korean DBCS holding 4'888 hanja and 2'350 Hangul syllables, and CNS 11643 Plane 1 the Taiwanese (Traditional Chinese) standard (albeit the not formally specified industry-standard "Big5" is used more often in Taiwan).
czyborra.com /charsets/cjk.html   (470 words)

  
 Compaq Tru64 UNIX Technical Reference for Using Japanese Features
In addition, two more character sets, JIS Katakana and JIS X 0212, are encoded in Japanese EUC by making use of the Single-Shift 2 (SS2) and Single-Shift 3 (SS3) control characters.
Shift JIS codes use a combination of single-byte data and two-byte data to represent characters defined in JIS X 0201 and JIS X 0208.
It is extended to support narrow JIS X 0201 Katakana characters, JIS X 0212 characters, and user-defined characters (UDC).
www.helsinki.fi /atk/unix/dec_manuals/DOC_51/HTML/SUPPDOCS/JAPANDOC/JAPANCH2.HTM   (2678 words)

  
 XML Japanese Profile (Second Edition)
The use of [JIS X 0208]:1978 (the first version of JIS X 0208) is an error; results are undefined.
Likewise, Halfwidth Katakana of [JIS X 0201] are deprecated.
X-sjis-jisx0221-1995 is a conversion table derived from the conversion table in [JIS X 0221-1] Appendix 2 (including Appendix 2.2) for the shift encoding which is specified in [JIS X 0208] Appendix 1.
www.w3.org /TR/japanese-xml   (4166 words)

  
 Encode::JIS2K - JIS X 0212 Encodings
Simply put, JIS X 0213 is a rework and reorganization of JIS X 0208 and JIS X 0212.
JIS X 0213 was designed to extend JIS X 0208 and JIS X 0212 without being imcompatible to (classic) EUC-JP and Shift_JIS.
JIS X plane 1 is (almost) a superset of JIS X 0208.
cpan.uwinnipeg.ca /htdocs/Encode-JIS2K/Encode/JIS2K.html   (453 words)

  
 WDIFF
JIS X 0208, which was originally called JIS C 6226, was published in 1978.
To harmonize JIS X 0208 1978 with Joyo Kanji, pairs of characters were swapped in the code positions and several characters had their shapes amended.
Therefore, among versions of JIS X 0208, the version of 1978 is of a different graphic character set while the versions of 1983, 1990 and 1997 are regarded as a family.
bgp.potaroo.net /cgi-bin/htmlwdiff?f1=../all-ids/draft-yamamoto-charset-iso-2022-jp-01.txt&f2=../all-ids/draft-yamamoto-charset-iso-2022-jp-00.txt   (4236 words)

  
 [Gs-devel] Need a help with CJK encodings
JIS X 0208:1978 defines Katakana, Hiragana, Kanji and full-width alphabets, Cyrillic, Greeks.
"JIS" and "SHIFT_JIS" are often used as names of encoding method: JIS is for ISO-2022-JP (7bit encoing of JIS X 0208:1978), SHIFT_JIS is for Microsoft encoding (8bit encoding of JIS X 0201:1969 and JIS X 0208:1978).
JIS X 0208:1978 is not upper compatible, because it re-use the 8bit code area which had ever used in JIS X 0201:1969.
www.ghostscript.com /pipermail/gs-devel/2001-December/000947.html   (769 words)

  
 Introduction to i18n - Characters in Each Country
Though JIS X 0201 is included in SHIFT-JIS encoding (explained later) and widely used for Windows/Macintosh, usage of this is not encouraged in UNIX.
JIS X 0212 is not widely used, probably because it cannot be included in SHIFT-JIS, the standard encoding for Japanese version of Windows and Macintosh.
JIS X 0208 (aka JIS C 6226) is the main standard for Japanese characters.
www.debian.org /doc/manuals/intro-i18n/ch-languages.en.html   (5031 words)

  
 Problems and Solutions for Unicode and User/Vendor Defined Characters   (Site not responding. Last check: 2007-10-15)
However, G3: JIS X 0212 rows 78 to 82 are the Reserved Area (the area reserved for future revision) of JIS X 0212, and this WG recommends that with view to future revisions of JIS X 0212, that user/vendor defined characters are not allocated to this area as far as possible.
The same area is the Reserved Area for JIS X 0212, and as there is a possibility that JIS X 0212 may be extended in the future, its use for user/vendor defined characters should be avoided as far as possible.
JIS X 0221 is the translation of ISO 10646, and is technically identical to ISO 10646 except for the additions of Annexes 1 to 3.
www.opengroup.or.jp /jvc/cde/ucs-conv-e.html   (5136 words)

  
 [No title]
Introduction JCS (JIS Coded Character Set) Committee has defined a new CCS (Coded Character Set), JIS X 0213 [JISX0213], which mostly is an extension of JIS X 0208 [JISX0208], to encode usual Japanese text.
JIS X 0213 has two planes, plane1 and plane2 and can be subsetted to use plane1 only.
Shift_JISX0213 Shift_JISX0213 consists of CCSes of JIS X 0201 [JISX0201], JIS X 0211 [JISX0211] and JIS X 0213.
quimby.gnus.org /internet-drafts/tmp/draft-ohta-jcs-jis-x-0213-00.txt   (874 words)

  
 [linux-users:40727] ASCII or JIS X 0201 (Re: yen-mark)
[linux-users:40727] ASCII or JIS X 0201 (Re: yen-mark)
[linux-users:40729] Re: ASCII or JIS X 0201 (Re: yen-mark)INOUE Koichi
[linux-users:40731] Re: ASCII or JIS X 0201 (Re: yen-mark)Kosuke Yagi
search.luky.org /linux-users.4/msg00728.html   (131 words)

  
 Проект OpenNet: MAN iconv_ja (5) Форматы файлов (FreeBSD и Linux)
eucJP to jis and jis to eucJP Conversion between eucJP and jis is provided for the compa- tibility with ujtojis7() and jis7touj() libraries,and euctojis and jistoeuc utilities.
PCK (SJIS) to jis and jis to PCK (SJIS) Conversion between PCK (SJIS) and jis is provided for the compatibility with sjtojis7() and jis7tosj() libraries, and sjtojis jistosj utilities.
Since ibm931 does not support Kana characters in its single byte code set (SBCS), JIS X 0201 Kana characters are replaced with substi- tute characters in conversion to ibm931.
www.linux.opennet.ru /man.shtml?topic=iconv_ja&category=5&russian=4   (1670 words)

  
 Zvon - RFC 1468 [Japanese Character Encoding for Internet Messages] - Description
Likewise, JIS C 6220 was renamed JIS X 0201.
The "Roman" character set of JIS X 0201 [JISX0201] is identical to ASCII except for backslash () and tilde (~).
The JIS X 0208 [JISX0208] character sets consist of Kanji, Hiragana, Katakana and some other symbols and characters.
www.zvon.org /tmRFC/RFC1468/Output/chapter2.html   (348 words)

  
 [linux-users:40731] Re: ASCII or JIS X 0201 (Re: yen-mark)
[linux-users:40731] Re: ASCII or JIS X 0201 (Re: yen-mark)
[linux-users:40720] ASCII or JIS X 0201 (Re: yen-mark)SaSaki Kuninobu
Previous by thread: [linux-users:40732] Re: ASCII or JIS X 0201 (Re: yen-mark)
search.luky.org /linux-users.4/msg00732.html   (113 words)

  
 [No title]
This is one of the reasons to recommend ASCII, instead of JIS X 0201 1976, in ISO-2022-JP.
Typical cases are citing and Yamamoto [Page 4] Internet-Draft ISO-2022-JP CHARSET November 1997 forwarding a message including JIS X 0201 Kana set and/or JIS X 0212 1990 (but labeled as ISO-2022-JP).
4.1 Rules for decoding syntax ISO-2022-JP decoding syntax that embeds ASCII, JIS X 0201 1976, JIS X 0208 1978, JIS X 0208 1983 (including 1990) is defined as follows (see also Figure 1): iso-2022-jp-text-d = *(line-d CRLF) line-d ; When text is terminated by CRLF, the last 'line-d' is empty.
quimby.gnus.org /internet-drafts/draft-yamamoto-charset-iso-2022-jp-00.txt   (1847 words)

  
 NetPoint.Ru - Из Чайника в Гуру / RFC 1468 (RFC1468) - Japanese Character Encoding for Internet Messages   (Site not responding. Last check: 2007-10-15)
If there are JIS X 0208 characters on a line, there must be a switch to ASCII or to the "Roman" set of JIS X 0201 before the end of the line (i.e., before the CRLF).
Each JIS X 0208 character takes up two columns, and the escape sequences do not take up any columns.
The implementor is reminded that JIS X 0208 characters take up two bytes and should not be split in the middle to break lines for displaying, etc. The JIS X 0208 standard was revised in 1990, to add two characters at the end of the table.
www.netpoint.ru /rfcs/rfc1468.html   (1175 words)

  
 LinuxDig.com Request for Comments RFC#: 2237 : Japanese Character Encoding for Internet Messages.
It defines the use of JIS X 0208 as the
JIS (Japanese Industry Standard) defines JIS X 0212 as 'code of the
JIS X 0201-Roman is not identical to the ASCII with two different
www.linuxdig.com /rfc/individual/2237.php   (983 words)

  
 Re: ASCII and JIS X 0201 Roman - the backslash problem
ASCII and JIS X 0201 Roman - the backslash problem
Re: ASCII and JIS X 0201 Roman - the backslash problem
Prev by thread: Re: ASCII and JIS X 0201 Roman - the backslash problem
mail.nl.linux.org /linux-utf8/2002-05/msg00095.html   (273 words)

  
 RFC1468
Note that JIS X 0208 was called JIS C 6226 until the name was changed on March 1st, 1987.
The implementor is reminded that JIS X 0208 characters take up two bytes and should not be split in the middle to break lines for displaying, etc.
The JIS X 0208 standard was revised in 1990, to add two characters at the end of the table.
www.scit.wlv.ac.uk /appdocs/rfcs/rfc14xx/RFC1468.html   (1273 words)

  
 w3m-dev: [w3m-dev 00192] JIS X 0201 (Re: i18n-5.patch)
Next message: aito@ei5sun.yz.yamagata-u.ac.jp: "[w3m-dev 00193] Re: JIS X 0201 (Re: i18n-5.patch)"
Next in thread: aito@ei5sun.yz.yamagata-u.ac.jp: "[w3m-dev 00193] Re: JIS X 0201 (Re: i18n-5.patch)"
Reply: aito@ei5sun.yz.yamagata-u.ac.jp: "[w3m-dev 00193] Re: JIS X 0201 (Re: i18n-5.patch)"
www.w3m.org /mailarchive/w3m-dev/0191.html   (171 words)

  
 LinuxDig.com Request for Comments RFC#: 1468 : Japanese Character Encoding for Internet Messages.
Note that JIS X 0208 was called JIS C 6226 until the name was changed
If there are JIS X 0208 characters on a line, there must be a switch
The JIS X 0208 standard was revised in 1990, to add two characters at
www.linuxdig.com /rfc/individual/1468.php   (953 words)

  
 [No title]   (Site not responding. Last check: 2007-10-15)
This is not the letter "a" but a different character from JIS X 0201 with the same name.
I am sure that the JIS Roman overbar and yen sign will not get merged to tilde and backslash, and as a matter of implementation the resulting buffer should contain JIS X 0208-1983 and ASCII, except for those two characters (and not the Old JIS equivalents).
I haven't read JIS standards yet, but a friend of mine said unifiying JIS X 0208 1978 to JIS X 0208 1983 is implied by the standards.
list-archive.xemacs.org /xemacs-mule/xemacs-mule.200001   (10173 words)

Try your search on: Qwika (all wikis)

Factbites
  About us   |   Why use us?   |   Reviews   |   Press   |   Contact us  
Copyright © 2005-2007 www.factbites.com Usage implies agreement with terms.