Factbites
 Where results make sense
About us   |   Why use us?   |   Reviews   |   PR   |   Contact us  

Topic: Basic Multilingual Plane


Related Topics

In the News (Fri 25 Jul 08)

  
  Unicode
Plane 1, the Supplementary Multilingual Plane (SMP) is mostly used for historic scripts, e.g.
Plane 2, the Supplementary Ideographic Plane (SIP) is used for about 40000 rare Chinese characters that are mostly historic, although there are some modern ones.
Representation of code points from all planes, including the BMP, can be achieved using the UTF-16 encoding, which requires one 16-bit word for characters in the BMP, and a pair of 16-bit words for characters in higher planes.
www.ebroadcast.com.au /lookup/encyclopedia/un/Unicode.html   (1154 words)

  
 Astral Planes
The supplementary planes are an innovation in how characters are internally represented—programmers have to assume a character can have a million possible values, not just 64K, which means they often have to change their existing code.
The informal name for the supplementary planes of Unicode is "astral planes", since (especially in the late '90s) their use seemed to be as remote as the theosophical "great beyond".
Though the basics of the notation are shared, the semantics of the notation has changed, and the decipherment of the Middle Byzantine notation was only done in 1916, by the musicologist and composer Egon Wellesz.
www.tlg.uci.edu /~opoudjis/unicode/unicode_astral.html   (2708 words)

  
 "Universal Multiple 8-Bit Encoded Character Set( UCS )" -- the international standard ISO/IEC10646.1-1993.   (Site not responding. Last check: 2007-10-12)
In the UCS encoding space, 00 plane of 00 group is called basic multilingual plane.
The group encoding of basic multilingual plane is 00H.
UCS stipulates that it can be omitted when the group of canonical form and the plane encoding is 00H, therefore, the characters arranged on the basic multilingual plane can be represented by binary number with two bytes, forming double 8-bit encoded character set, written as UCS-2.
www.i18nwithvb.com /surrogate_ime/codepages/ucs.htm   (395 words)

  
 Mapping of Unicode characters - Wikipedia, the free encyclopedia
Most of the allocated code points in the BMP are used to encode Chinese, Japanese, and Korean (CJK) characters.
Plane 1, the Supplementary Multilingual Plane (SMP), is mostly used for historic scripts such as Linear B, but is also used for musical and mathematical symbols.
The Basic Multilingual Plane includes a PUA in the range from U+E000 to U+F8FF (57344–63743).
en.wikipedia.org /wiki/Basic_Multilingual_Plane   (659 words)

  
 ISO 10646-1:1993/Am.2 – UTF-16 (N1334)
Planes 11 to DF in Group 00 and planes 00 to FF in Groups 01 to 5F are reserved for future standardisation, and thus those code positions shall not be used for any other purpose.
The code positions of 32 planes from Plane E0 to Plane FF of Group 00 shall be for Private Use.
The code positions of Plane 0F and Plane 10, and of the 32 planes from Plane E0 to Plane FF, of Group 00 shall be for Private Use.
www.cl.cam.ac.uk /~mgk25/ucs/ISO-10646-UTF-16.html   (2175 words)

  
 Using Plane 1 Characters
Plane 1 (also known as the Supplementary Multilingual Plane, or SMP) will mainly be used for historical scripts as well as sets of Western and Byzantine musical symbols.
Characters in the supplementary planes are different than characters in the BMP because they are stored in a Unicode font under one hexadecimal number, but in many applications are accessed through the use of surrogate pairs.
Plane 1 characters will not display properly in Windows 98/Me (although if you open a file that contains Plane 1 characters under Win98/Me, it will not be damaged; the Plane 1 characters won’t be visible, but they will still be there if you open the file later under Win2000 or XP).
scholarsfonts.net /Plane1.htm   (2837 words)

  
 Universal character set : Basic multilingual plane
It has over a million code points, but only the first 65536 (the Basic Multilingual Plane, or BMP) are commonly used, the remainder being reserved for such purposes as representing ancient Egyptian hieroglyphics or rare Chinese characters.
Many code points, even in the BMP, are deliberately not assigned to characters, to allow for future expansion or to minimize conflicts with other encoding forms.
Code points outside the BMP cannot be represented with UCS-2.
www.termsdefined.net /ba/basic-multilingual-plane.html   (735 words)

  
 Glossary   (Site not responding. Last check: 2007-10-12)
Planes are numbered from 0 to 16, with the number being the first code point of the plane divided by 65,536.
(2) A basic unit of articulation that corresponds to a pulmonary pulse.
The basic accent in modern Greek, having the form of an acute accent.
www.unicode.org /glossary   (7498 words)

  
 BMP - Basic Multilingual Plane, Batch Message Processing, Bean-Managed Persistence, BitMaP
Basic Multilingual Plane is not the only word formed from BMP.
Files of this type usually have the suffix ".bmp" as part of their name.
BMP is a standard format for graphic files in Windows.
www.auditmypc.com /acronym/BMP.asp   (526 words)

  
 ISO-10646 Concept Dictionary
The code space in 10646 is divided into cells, rows, planes, and groups.
plane - a set of (no more than 256x256) characters of 10646 arranged in rows and columns.
The 10646 nomenclature refers to coded characters as multiples of octets and assumes taht octets are serialized, while the Unicode nomenclature rfefores to coded characters as indivisible 16 bit entitles.
www.cit.gu.edu.au /~davidt/cit3611/C_UNIX/ISO-10646.htm   (2011 words)

  
 Unicode encodings
Now that there are characters in Unicode beyond the Basic Multilingual Plane (BMP), it is critical that operating systems and applications support the full range of 1,112,064 valid Unicode code points, and interoperate between the UTFs.
The 1,048,576 characters of the 16 Supplementary Planes are encoded with four bytes.
The single 16-bit portion of this encoding is used to encode the entire BMP, except for 2,048 code points known as "surrogates" that are used in pairs to encode the 1,048,576 characters of the 16 Supplementary Planes.
www-128.ibm.com /developerworks/java/library/j-u-encode.html   (2153 words)

  
 ongoing · Characters vs. Bytes
Plane 0 is called the “Basic Multilingual Plane” or BMP and contains pretty well everything useful.
Past the BMP, planes 1 through 16 are sometimes humorously called the “astral planes” and are used for exotic, rare, and historically important characters.
All the characters in the BMP appear as themselves, but clearly some trickery is going to be involved if you want to store astral-plane characters, because they just don't fit in sixteen bits.
www.tbray.org /ongoing/When/200x/2003/04/26/UTF   (2675 words)

  
 Plane 14 - Plane14
The Unicode code space for characters is divided into 17 "planes" and each plane has 65,536 (= 2) code points.
Furthermore, ranges of characters have been tentatively blocked out for every known unencoded script (see [1]), and while Unicode may need another plane for ideographic characters, there are ten planes that could only be needed if previously unknown scripts with tens of thousands of characters are discovered.
The Basic Multilingual Plane includes a Private Use Area in the range U+E000-U+F8FF (57344-63743), and Plane Fifteen (U+F0000-U+FFFFF) and Plane Sixteen (U+100000-10FFFF) are completely reserved for private use as well.
www.kopete.org /Plane-14.html   (561 words)

  
 Characters, Entities and Fonts
This plane of characters is also known as the Secondary Multilingual Plane (SMP), in contrast to the Basic Multilingual Plane (BMP) which was originally the entire extent of Unicode.
Support for Plane 1 characters in currently deployed software is not always reliable, and in particular support for these Mathematical Alphanumeric Symbol characters is not likely to be widespread until after public fonts covering the characters adopted for mathematics are available.
Most of these characters come from the additions to Plane 1, however a few characters (such as the double-struck letters N, P, Z, Q, R, C, H representing common number sets) were already present in Unicode 3.0 and retain their original positions.
www.w3.org /TR/MathML2/chapter6.html   (3326 words)

  
 Linux Unicode programming
The first plane (plane 00 of the group 00) is the Basic Multilingual Plane (BMP).
The BMP defines characters in general use in alphabetic, syllabic and ideographic scripts as well as various symbols and digits.
The BMP is used as a two-octet coded character set identified as the UCS-2 form of ISO 10646.
www-106.ibm.com /developerworks/linux/library/l-linuni.html   (2787 words)

  
 Medieval Unicode Font Initiative
This is often referred to as the BMP (Basic Multilingual Plane), and fonts with a smaller or larger selection of these characters can be found on a large number of computers, especially within the Windows community (although the users may not know that they have a Unicode font on their computer).
Now, Unicode has added 16 supplementary planes of equal size to the BMP, so that the total number of possible characters exceeds one million.
This means that there is ample room for additions in each range, and it is furthermore possible to define another 13 subranges, each containing 256 code points, all within the bounds of the Private Use Area of the Basic Multilingual Plane.
gandalf.aksis.uib.no /mufi/proposal/proposal-v1.html   (1162 words)

  
 Basic Multilingual Plane: Computer Book: Basic Multilingual Plane: Definitions Basic Multilingual Plane Glossary & ...
Basic Multilingual Plane: Read about your Basic Multilingual Plane computer abbreviation, computer definition or term.
The BMP currently includes the Latin,Greek, Cyrillic, Devangari, hiragana, katakana, and Cherokeescripts, among others, and a large body of mathematical,APL-related, and other miscellaneous characters.
Most ofthe Han ideographs in current use are present in the BMP,but due to the large number of ideographs, many were placed inthe Supplementary Ideographic Plane.
www.the-computer-book.com /Basic_Multilingual_Plane-computer-pc-terms-tips.html   (126 words)

  
 Production First Software Encyclopedia of Typography and Electronic Communication : U
The first version (1.x) was identical to only the first (basic multilingual) plane (65,536 characters) of the more extensive 4-byte ISO/IEC/10646 character encoding standard.
However, there is a basic difference (presently) between the two standards: Unicode extensively defines character properties and character behavior, whereas ISO/IEC/10646 does not at this time.
A Unicode font is also compliant to the Basic Multilingual Plane of ISO10646, so that it, perhaps, could also be considered an « ISO10646 font, » possibly.
ourworld.compuserve.com /homepages/profirst/u.htm   (2351 words)

  
 Plane 14 Characters for Language Tags
The Plane 14 technical report represents the consensus of a meeting of the UTC Working Group on Tagging and Annotation and of IETF representatives which took place on June 24,1997.
The basic rules for Unicode conformance for the tag characters are exactly the same as for any other Unicode characters.
This technical report for Plane 14 tags is, instead, aimed at removing a significant barrier to the universal adoption of Unicode in such arenas as Internet protocol development.
www.oasis-open.org /cover/unicodetr7-lang.html   (3291 words)

  
 Your Multilingual Mac
Basic Apple documentation can be found in the Help menu of the Finder if you put "languages" in the Question box.
The multilingual capabilities of various browsers under OS X can be demonstrated by installing these and going to UTF-8 Sampler or Alan Wood's Unicode Sample Pages.
The basic principle of Unicode is to assign a unique number (usually expressed in hexadecimal form) to every character.
homepage.mac.com /thgewecke/mlingos9.html#typing   (8742 words)

  
 Unicode Revisited
However, ISO 10646-1 that was prescribed recently is called the Basic Multilingual Plane, and it is only the portion that corresponds to the 0 group 0 plane of the previously mentioned huge code.
However, this Basic Multilingual Plane is something that in fact adopts Unicode almost as is, and the method by which it switches into UCS-4 has not been decided.
It is based on two basic concepts: (1) language data are divided into four layers: Language, Group, Script, and Font; and (2) language specifier codes, which specify each of those four layers, are used to switch between 16-bit character planes.
tronweb.super-nova.co.jp /unicoderevisited.html   (5039 words)

  
 Sorting It All Out : The basics of supplementary
Refers to designated code points in the Unicode Standard or other character encoding standards whose interpretations are not specified in those standards and whose use may be determined by private agreement among cooperating users.
We start with the Basic Multilingual Plane -- it is the code units from U+0000 to U+FFFF.
There are seventeen planes (0x0 through 0x10) and only 0xf and 0x10 are specifically private-use.
blogs.msdn.com /michkap/archive/2005/09/24/472543.aspx   (2005 words)

  
 Character Types
{AI95-00285-01} {Latin-1} {BMP} {ISO/IEC 10646:2003} {ISO 10646} {Character} The predefined type Character is a character type whose values correspond to the 256 code positions of Row 00 (also known as Latin-1) of the ISO/IEC 10646:2003 ISO 10646 Basic Multilingual Plane (BMP).
Each of the graphic characters of Row 00 of the BMP has a corresponding character_literal in Character.
{AI95-00285-01} {Wide_Character} {BMP} {ISO/IEC 10646:2003} {ISO 10646} The predefined type Wide_Character is a character type whose values correspond to the 65536 code positions of the ISO/IEC 10646:2003 ISO 10646 Basic Multilingual Plane (BMP).
www.adaic.org /standards/05aarm/html/AA-3-5-2.html   (1134 words)

  
 Basic Multilingual Plane   (Site not responding. Last check: 2007-10-12)
It was developed by merging the ISO draft for 16-bit character codes with the code used by Unicode.
It inherited features from both, compatibility with other standards (such as ISO Latin 1) from the ISO draft, character combining (such as writing the letter "á" as a combination of "a" and "'") from Unicode.
Presently the BMP is half empty, although it covers all major languages, including Roman, Greek, Cyrillic, Chinese, hiragana, katakana, Devanagari, Easter Island "rongo-rongo", and even Elvish (but leaves out Klingon).
burks.bton.ac.uk /burks/foldoc/61/10.htm   (120 words)

  
 Unicode Polytonic Greek for the World Wide Web (version 0.9.7)
The leading two bytes indicate that the character uses a three-byte code and indicates which range of characters is relevant, while the third byte indicates the specific character in the set.
UTF-16 is an encoding which represents all Basic Multilingual Plane characters (including basic Greek, extended Greek, and combining diacriticals) with two bytes and all other characters (including Aegean scripts, Etruscan, and any future scripts of relevance to classicists) with four bytes.
The four-byte combinations use surrogates, two-byte combinations appearing before a two-byte code which indicate that the intended character is outside the Basic Multilingual Plane, and exactly in which plane the intended character is located.
www.stoa.org /unicode/encodings.html   (725 words)

  
 IUC27: Are We Counting Bytes Yet?
For most SBCS encodings, the mapping of bytes to Unicode characters is also 1:1 (and nearly all of these encodings map entirely to the Basic Multilingual Plane of Unicode, which means that each character consumes just one UTF-16 code unit).
This method has a basic design problem: in a stateful encoding, the replacement byte sequence is in one or another state.
BMP code points have the advantage of being represented by a single UTF-16 code unit.
www.inter-locale.com /whitepaper/IUC27-a303.html   (8649 words)

  
 An Introduction to Writing Systems: A review of script characteristics affecting computer-based script support and ...
This tutorial is particularly useful for people who are new to Unicode, in that it provides an overview of the basics in the context of real examples.
Ideographic characters in the Basic Multilingual Plane are three bytes in UTF-8.
If, however, the next input character is a number, the rendering algorithm views the period as part of the left-to-right flow of the number, and moves it automatically to the right of the 10.
people.w3.org /rishida/scripts/tutorial/all.html   (9126 words)

  
 Internationalization Encyclopedia: : basic multilingual plane
Please do not hesitate to email me if you disagree with this article or found an any error in this page.
(aka BMP) This is the Unicode Plane 0, it contains all the characters defined in Unicode up to the version 3.1 which introduces the first supplementary characters.
All characters in that plane can be encoded directly using UTF-16.
www.i18ngurus.com /encyclopedia/basic_multilingual_plane.html   (69 words)

Try your search on: Qwika (all wikis)

Factbites
  About us   |   Why use us?   |   Reviews   |   Press   |   Contact us  
Copyright © 2005-2007 www.factbites.com Usage implies agreement with terms.