Factbites
 Where results make sense
About us   |   Why use us?   |   Reviews   |   PR   |   Contact us  

Topic: GB 18030


Related Topics

  
  GB 18030: A mega-codepage
The mapping table between GB 18030 and Unicode is backwards compatible with the one between GB 2312-1980 and Unicode, and with some exceptions (with the one between GBK and Unicode), most of the changes compared to the GBK mapping table are due to updates for Unicode 3.0.
GB 18030 is defined as much with charts of assigned characters as with a mapping table to and from Unicode.
An implementation of GB 18030 needs to be able to determine the length of a byte sequence by examining not only the lead byte, but at least the second byte of a multi-byte sequence as well.
icu.sourceforge.net /docs/papers/gb18030.html   (2283 words)

  
 (URW)++ Chinesische Fonts
The GB 2312-80 and GB/T 12345-90 encodings may be regarded as a pair of styles.
With the release of Unicode 2.1 in 1993 a standard named GB 13000.1 was published which contained all the glyphs of Unicode 2.1.
GB 18030-2000 was created as an update of GBK for Unicode 3.0 which defines 6,582 new characters in plane 0 (the Basic Multilingual Plane = BMP).
www.urwpp.de /english/fonttechnologie/chinfonts_cont.html?t6pb-id=wd1e812044ffaa891f9987dbeae12bccd5a861fbdec   (466 words)

  
 The Global Advisor Newsletter - Unicode Standard for Chinese Characters
In 1993, the PRC published GB 13000.1-93, a Chinese national standard that is code- and character-compatible with ISO 10646-1/Unicode 2.1, thus signaling their intention to support the efforts of the International Organization for Standardization (ISO)/ International Electrotechnical Commission (IEC) and the Unicode Consortium.
GB 13000 was updated as required to maintain compatibility with the standards of the ISO/IEC and the Unicode Consortium.
Historically, GB 18030-2000 standard is significant because it is the first widely used character set that includes characters with Universal numerical codes (code positions) that exceed 65,535 - the largest number that can be represented with two bytes of computer memory.
www.intersolinc.com /newsletters/newsletter_48.htm   (1831 words)

  
 GB18030-2000 - The New Chinese National Standard - GB 18030
The GB 2312-1980 character set standard was established in 1981 to represent simplified Chinese characters.
GBK is an extension of GB 2312-1980 and the key significant property of GBK is that it leaves the characters and codes as defined in GB 2312-1980 untouched and positions all additional characters around it.
The four-byte portion uses the code points 0x30 through 0x39, which are vacant in GB 11383, as an additional means to extend the two-byte encodings, thus effectively increasing the number of four-byte codes to now include code points ranging from 0x81308130 through 0xfe39fe39.
developers.sun.com /dev/gadc/technicalpublications/articles/gb18030.html   (1242 words)

  
 Wikipedia: Guobiao
GĂșobiāo, 国标 (GB), is the abbreviation of Guójiā Biāozhǔn Mǎ (国家标准码), meaning the "National Standard Encoding" is the standard Chinese character encoding system in the People's Republic of China, issued by the Standardization Administration of China (SAC), a member of ISO and IEC.
In computer contexts, "GB" alone is often used to mention GB 2312-1980 issued in 1980, or GB 18030-2000 issued in 2000.
There are about 7200 characters in the GB codeset, versus approx.
www.factbook.org /wikipedia/en/g/gu/guobiao.html   (177 words)

  
 ISV: GB 18030
GB 18030-2000 is a new Chinese character-encoding standard that was introduced in to the Unicode Standard in 2000.
GB 18030 contains room for 1.6 million characters, can include one, two or four byte characters, and includes support for Mongolian, Tibetan, Yi, and Uyghur, as well as all previously supported Chinese scripts.
Two Monotype Imaging WorldType fonts that include GB 18030 support were recently approved for distribution within the People’s Republic of China by the PRC government.
www.18030.com   (680 words)

  
 Unicode FAQ on GB 18030   (Site not responding. Last check: 2007-11-02)
GB 18030 is a new Chinese codepage standard that extends GB 2312-1980 and GBK (which itself is an extension of GB 2312-1980).
This is possible because of the definition of GB 18030 with a mapping table to ISO 10646/Unicode.
A small portion of these mappings have changed between GBK and GB 18030, and GB 18030 maps them instead to Unicode characters that were introduced in Unicode 3.0.
icu.sourceforge.net /docs/papers/unicode-gb18030-faq.html   (613 words)

  
 Chinese Mac Character Sets and Encodings   (Site not responding. Last check: 2007-11-02)
GB 2312 (1980) includes 6,763 Chinese characters (on two levels: the first is arranged by reading, the second by radical then number of strokes), along with symbols and punctuation, Japanese kana, the Greek and Cyrillic alphabets, Zhuyin, and two sets of Pinyin letters with tone marks (full-width and half-width).
GB 2312 and all of its extensions were replaced by GB 18030 in 2000.
GB 18030 (2000) is the most recent update to the Chinese national standard coded character set.
www.yale.edu /chinesemac/pages/charset_encoding.html   (2039 words)

  
 GB18030 - Wikipedia, the free encyclopedia
GB18030 is the registered Internet name for the official character set of the People's Republic of China (PRC).
The SimSun 18030 font enables the display of the GB 18030 characters, which includes all the characters in Unicode 2.1 plus new characters found in the Unicode CJK Unified Ideographs Extension A section.
Authoritative mapping table between GB18030 and Unicode (warning: firefox seems to hang on loading this).
en.wikipedia.org /wiki/GB18030   (432 words)

  
 Eighteenth International Unicode Conference - Abstract
The Chinese national standard GB 18030-2000 will be described as a character standard that attempts a complete re-mapping of Unicode 3.0's character repertoire into a "legacy" code space as defined through the specification GBK, now being extended to form the standard GB 18030.
When it comes to GB 18030-2000, it is of crucial importance that mapping mechanisms between the original encoding and Unicode are being adopted efficiently.
GB 18030-2000, on the other hand, presents a challenge in that it applies a four-byte encoding scheme to map the complete character repertoire of Unicode 3.0 into its own encoding space in order to remain compatible with pre-existing national Chinese standards, namely the specification GBK.
www.unicode.org /iuc/iuc18/a325.html   (602 words)

  
 man: GB18030
The GB18030 encoding implements GB 18030-2000, a PRC national standard for the encoding of Chinese characters.
It is a superset of the older GB 2312-1980 and GBK encodings, and incorporates Unicode's Unihan Exten- sion A completely.
GB 11383-1981 (ASCII) characters are represented by single bytes in the range 0x00 to 0x7F.
www.hmug.org /man/5/GB18030.php   (243 words)

  
 Business Wire: Agfa Monotype Announces CITS and SLC Approval f... @ HighBeam Research   (Site not responding. Last check: 2007-11-02)
In 2000, the GB 18030 Chinese character-encoding standard was introduced into the Unicode(R) worldwide character-encoding standard.
GB stands for Guojia Biaozhun, Chinese for "national standard." GB 18030 can contain up to 1.6 million characters supporting one, two or four-byte characters.
In 1995, Agfa Monotype was the first font company to have its Chinese typefaces endorsed by the PRC government - years before GB 18030 was established.
highbeam.com /library/doc0.asp?docid=1G1:86173175&...   (592 words)

  
 Chinese, Japanese, Korean, CJK, Asian fonts
There is also the extension of GB 2312-80 called GBK which is encoded with 20,902 unified ideographs that are assigned in Unicode 2.1.
GB 18030 was created as an update to GB 2312-80 for Unicode 3.0.
Although neither GB nor Big-5 encodings have an official connection with a specific character set, they have become synonymous with simplified and traditional sets, respectively.
www.ascendercorp.com /cjk.html   (719 words)

  
 qgb18030codec(3): Conversion to/from Chinese ... - Linux man page
GBK, formally the Chinese Internal Code Specification, is a commonly used extension of GB 2312-80.
GBK has been superceded by the new Chinese national standard GB 18030-2000, which added a 4-byte encoding while remaining compatible with GB2312 and GBK.
The new GB 18030-2000 may be described as a special encoding of Unicode 3.x and ISO-10646-1.
www.die.net /doc/linux/man/man3/qgb18030codec.3.html   (465 words)

  
 [No title]
Since the legacy GB 2312-1980 standard and GBK (1995) specification is still widely used, it is important to provide a smooth migration path towards GB 13000.
The new GB 18030-2000 standard "replaces" GBK: it retains legacy encoding compatibility, and also provides for a complete and final mechanism to include future extensions of Unicode.
Intended usage: COMMON Acknowledgement: Appreciations and kudos to the Internet community for documenting and explaining the GB 18030-2000 standard to the whole world; for implementing this new standard in software so quickly; and for their comments to this registration.
www.iana.org /assignments/charset-reg/GB18030   (436 words)

  
 Character Set List
GB 18030 was therefore created to represent all the Unicode 3.0 hanzi.
GB 2312 may be represented in either 7-bit form or 8-bit form, depending on whether compatibility with 7-bit systems is more important than distinguishing Chinese characters from ASCII characters.
GB 12345 is the traditional equivalent of the simplified character set GB 2312.
www.jbrowse.com /text/charsets.html   (9294 words)

  
 Digital 21: IT Infrastructure: Infrastructure and Standards: Coding Schemes Comparison   (Site not responding. Last check: 2007-11-02)
GB (Guo Biao) is the national coding standard released by the Chinese government.
The latest version of the GB standard is GB 18030-2000 released in 2000 comprising around 27,000 Chinese characters.
ISO 10646 is a coding standard developed by the International Organization for Standardization (ISO) to embrace characters of all major languages in the world.
sc.info.gov.hk /TuniS/www.gov.hk/digital21/eng/structure/iso_comparison.html   (165 words)

  
 GB18030 -- Facts, Info, and Encyclopedia article   (Site not responding. Last check: 2007-11-02)
GB18030 is the registered internet name for the official (An ordered list of characters that are used together in writing or printing) character set of the (additional info and facts about People's Republic of China) People's Republic of China (PRC).
GB abbreviates (additional info and facts about Guojia Biaozhun) Guojia Biaozhun (国家标准), which means national standard in Chinese.
Microsoft made the change from UCS-2 to UTF-16 with Windows 2000.
www.absoluteastronomy.com /encyclopedia/g/gb/gb180301.htm   (443 words)

  
 GB 18030 - TheBestLinks.com - Character encoding, Japanese language, Korean language, Unicode, ...
GB 18030 - TheBestLinks.com - Character encoding, Japanese language, Korean language, Unicode,...
GB 18030, Character encoding, Japanese language, Korean language, Unicode...
You can add this article to your own "watchlist" and receive e-mail notification about all changes in this page.
www.thebestlinks.com /GB_18030.html   (119 words)

  
 Introduce new alias for GB 18030
Maybe we would be better off with having something like define_alias(qr/^GB(?:\s-)?(.+)/i => '"gb$1"'); this will make things like GB 18030 GB 12345 work.
Re: Introduce new alias for GB 18030, Dan Kogai
Re: Introduce new alias for GB 18030, Autrijus Tang
www.mhonarc.org /archive/html/perl-unicode/2002-03/msg00154.html   (83 words)

  
 man: gbk
GBK is a backwards-compatible extension of the GB 2312-1980 encoding method for Chinese text, which adds the characters defined in the Unified Han portion of the Unicode 2.1 standard.
Chinese National Standard GB 18030-2000: Information Technology -- Chinese ideograms coded character set for information interchange -- Extension for the basic set, March 2000.
GBK is not a standard, but has been superseded by GB 18030-2000.
www.hmug.org /man/5/gbk.php   (163 words)

  
 [No title]   (Site not responding. Last check: 2007-11-02)
Usage and implementation requirements The GB 18030-2000 standard wasIt has been created by the Chinese government with inputs from major vendors and institutions to ensure full ISO 10646 / Unicode 3.x compatibility while providing a smooth migration path from GB2312 and GBK encodings.
Newer revisions will be published to follow updates in Unicode and ISO 10646 standards.
While unofficial, it is indeed the authorative document on GB18030.
lists.w3.org /Archives/Public/ietf-charsets/2001OctDec/att-0003/GB18030-IANA-Registration.txt   (483 words)

  
 Chinese in Mac OS X 10.2   (Site not responding. Last check: 2007-11-02)
A GB 18030 bitmap font is installed in the /Library/Fonts folder.
Unicode Blocks: This gives you direct access, for example, to a table of the characters in the "CJK Unified Ideographs Extension B" block of Unicode, which is not part of the GB 18030 character set, and so is not included in the "by Radical" tab of View: Simplified Chinese.
There are two sort orders for Simplified Chinese: [1] sort by GB code and [2] sort by number of strokes.
www.yale.edu /chinesemac/pages/os_x2.html   (1245 words)

  
 GB - OneLook Dictionary Search
GB : Butterfly Glossary (networking terminology) [home, info]
GB, Gb : Dictionary of Computing and Digital Media [home, info]
Phrases that include GB: csp gb, erl gb, g or gb, gb 18030, gb fb, more...
www.onelook.com /?w=GB&ls=all   (347 words)

  
 RE: creating a test font w/ CJKV Extension B characters.
And the invalid characters could be then represented for example > with a colored/highlighted glyph such as .
As both the input and > output are not a Unicode scheme, I don't think this invalidates the Unicode > conformance: the behavior would just be conforming to GB18030 or other > legacy GB PUAs mappings.
> I'm pretty sure that there are no such legacy GB mapping, and I doubt that China will ever want to map characters to extra-Unicode codepoints in GB-18030...
www.mail-archive.com /unicode@unicode.org/msg19885.html   (356 words)

  
 Gb   (Site not responding. Last check: 2007-11-02)
Grants and licences Grants and licences in GB.
Sony, obviously not wanting SanDisk to get lonely with the only 4 GB Memory Stick has today announced its own.
The CF card will be available in 2.5 GB and 5 GB capacities and will probably offer the best $/MB of any large CF card.
www.ad999.com /?q=gb   (748 words)

  
 Are GB 18030 and CNS 11643-1992 the best spellings?
Are GB 18030 and CNS 11643-1992 the best spellings?
Writing a bit of an article, putting in there all I have learnt about CJK encodings on the Internet and at perl-unicode(_at_)perl(_dot_)org(_dot_) Has already taken me a week :-) Could you help me with this: Is GB 18030 the best spelling for this encoding?
Re: Are GB 18030 and CNS 11643-1992 the best spellings?
www.mhonarc.org /archive/html/perl-unicode/2002-03/msg00290.html   (182 words)

Try your search on: Qwika (all wikis)

Factbites
  About us   |   Why use us?   |   Reviews   |   Press   |   Contact us  
Copyright © 2005-2007 www.factbites.com Usage implies agreement with terms.