Factbites
 Where results make sense
About us   |   Why use us?   |   Reviews   |   PR   |   Contact us  

Topic: Macintosh Central European encoding


Related Topics

In the News (Fri 17 Feb 12)

  
  Encoding Class (System.Text)
Encoding is the process of transforming a set of Unicode characters into a sequence of bytes.
Although numerous non-Unicode encodings are supported for compatibility with legacy applications, the Unicode encodings (UTF8Encoding, UnicodeEncoding, and UTF32Encoding) are recommended when there is a choice.
Optionally, the Encoding provides a preamble which is an array of bytes that can be prefixed to the sequence of bytes resulting from the encoding process.
msdn2.microsoft.com /en-us/library/system.text.encoding.aspx   (1471 words)

  
 [No title]
Mapping a Mac encoding character to a Unicode sequence or vice versa ----------------------------------------------------------------------- In some cases, a character in a Mac OS legacy encoding maps to a sequence of Unicode characters.
In a few cases, a sequence of characters in a Mac OS legacy encoding must be grouped for mapping to a single Unicode character or a sequence of Unicode characters.
The encodings described here (and in the accompanying tables) are the legacy encodings used in Mac OS versions 7.1 and later.
www.unicode.org /Public/MAPPINGS/VENDORS/APPLE/Readme.txt   (3725 words)

  
 A tutorial on character code issues
In one possible encoding for ISO 10646, the string a!ä‰ is presented as the following sequence of octets (using two octets for each character): 0, 97, 0, 33, 0, 228, 32, 48.
Notice that encoding ISO 8859-1 data this way means that the character code is the one specified by the ISO 8859-1 standard, whereas the character encoding is different from the one specified (or at least suggested) in that standard.
Thus, if the message was ISO 8859-1 encoded and contained the Ä (upper case A with dieresis) character, encoded as octet 196, the E-mail program used on the Mac should use a conversion table to map this to octet 128, which is the encoding for Ä on Mac.
www.indwes.edu /Faculty/bcupp/things/Characters/chars.html   (7816 words)

  
 Romanian Computing Information (Penn State)
Although Central European languages like Czech, Slovak, Romanian, Polish and Hungarian are written in the Roman alphabet, these languages include characters not commonly found in Western European languages like Spanish and French.
These are encoded as Unicode, Latin-2 (ISO-8859-2) or "Central European" and require special font and keyboard support separate from Western European languages.
Note on encoding: There are a number of alternative encoding schemes in use for Romanian, but since the Latin-2 and Unicode method conforms to printed Romanian, I will assume that as a standard.
tlt.its.psu.edu /suggestions/international/bylanguage/romanian.html   (1335 words)

  
 MT-NW Manual: International Language Support
For example, in the normal Macintosh character set, an é is represented by the byte value 142, but in the chatacter set ISO-8859-1 (often used on Usenet), an é is represented by a byte value 233.
This encoding differs from just a simple character set mapping, in that an algorithmic transformation must be applied to the text on sending and receiving ends.
Fonts for various European character sets are widely available on the Internet; with a little searching, you should be able to find Cyrillic, Central European, Greek and any other fonts that you need.
www.smfr.org /mtnw/docs/TextEncoding.html   (3279 words)

  
 Encoding Menu
When reencoding to an encoding defined by code points, the glyph with the matching unicode value is placed in the encoding slot.
When reencoding to an encoding defined by glyph name, we first search for a glyph with the matching name and use it, if not found we search for the glyph name's corresponding unicode code point (if any) and if found we change its name to that specified by the encoding.
Currently the file must either be in the format used by the unicode consortium for mapping ISO 8859 encodings to unicode, or it must be a postscript encoding array.
fontforge.sourceforge.net /encodingmenu.html   (1652 words)

  
 Character/Glyph Index Mapping
The platform ID and platform-specific encoding ID in the header entry (and, in the case of the Macintosh platform, the language field in the subtable itself) are used to specify a particular 'cmap' encoding.
The cmap table header is followed by an array of encoding records that specify the particular encoding and the offset to the subtable for that encoding.
These code standards use a mixed 8/16-bit encoding, in which certain byte values signal the first byte of a 2-byte character (but these values are also legal as the second byte of a 2-byte character).
www.microsoft.com /typography/OTSPEC/cmap.htm   (2527 words)

  
 comp.sys.mac.comm FAQ (v 2.4.1) Feb 15 2003
Note that modems for the Macintosh Performa apparently can only plug into the Macintosh Performa because of an extra pin which they posesses (there is a corresponding extra hole on the Performa serial port connector into which this pin fits).
Unlike ASCII encoding there is virturally no increase in file size but since these formats are 8-bit they cannot be used on their own in the remaining areas of the Internet that are not fully 8-bit (like E-mail and Usenet).
Usually, one won't find Macintosh files in uuencode format; however, most non-Macintosh specific binary data posted to Usenet is uuencoded, so if you wish to use any of this data (such as the images posted in alt.binaries.* and elsewhere), you will need to deal with uuencode.
www.cs.uu.nl /wais/html/na-dir/macintosh/comm-faq/part1.html   (6705 words)

  
 Pygments — Python syntax highlighter   (Site not responding. Last check: 2007-10-26)
The current implementation works as follows: * *

  1. If the font has a type 4 cmap for the Unicode platform * (encoding 0, 1, 2, 3 or 4), or a type 4 cmap for the Microsoft * platform (encodings 1 or 10), that table is used to map Unicode * codepoints to glyphs.
Most recent fonts, both for Macintosh and * Windows, should provide such a table. * *
  • Otherwise, if the font has any type 0 cmap for the Macintosh * platform, a Unicode-to-glyph mapping is synthesized from certain * type 0 cmaps.
  • For the * Macintosh platform, this is 0 to indicate language-neutral * encoding, or the MacOS language code plus one. The * Apple documentation does not mention that one needs to be * added, but the Adobe OpenType specification does.
    pygments.org /demo/9   (2514 words)

      
     Technical Note TN1140: The Euro Currency Symbol
    According to the European Union, the Euro is scheduled to exist as banking currency beginning on January 1, 1999, and it will move into more general use with the introduction of coins and notes on January 1, 2002.
    The standard European glyph, as designed and specified by the European Union in Brussels, has been added to all of Apple's Mac OS fonts to be distributed with Mac OS 8.5, due out later in 1998.
    The "natural encoding" of a font is the encoding vector with which it was originally built.
    developer.apple.com /technotes/tn/tn1140.html   (4326 words)

      
     Category:Character sets
    The category of character sets includes articles on specific character encodings (see the article for a precise definition).
    This includes coded character sets, character encoding forms, character encoding schemes, and character maps (historically called character sets or code pages), and even includes those that use non-numeric, pre-digital codes, such as electrical impulses.
    Articles pertaining to character encoding in general, or encoding in general, may be found in the parent categories, Category:Character encoding and Category:Encodings.
    www.danceage.com /biography/sdmc_Category:Character_sets   (138 words)

      
     pronunciationguide.org: Displaying Central European Fonts
    Modern technology being what it is -- that is, Anglocentric -- it's quite possible that your browser is not displaying all the fonts correctly for certain Central European languages, in particular Czech, Polish and Hungarian.
    The problem can also be tackled on Macintosh and Unix systems, but again I have no personal knowledge of how to do it.
    As a starting point I can suggest doing a web search for "Central European fonts" -- there are quite many websites out there offering free downloads of all sorts of things that might possibly be helpful.
    www.math.nyu.edu /~wendlc/pronunciation/fonts.html   (392 words)

      
     Hungarian Computing Information (Penn State)
    Although Hungarian is classified by location as "Central European" it is not a Slavic language.
    If no encoding is declared, then the browser uses the default setting, which in the U.S. is typically Latin-1.
    Central European AlphabetsPronuciation key for Central European alphabets.
    tlt.its.psu.edu /suggestions/international/bylanguage/hungarian.html   (1347 words)

      
     Font Info
    The default assumes that the font currently has a reasonable encoding, but you want it to have a different one, here each character will be moved to the appropriate position in the new font.
    It will go through and rename every character to match what it should be in the new encoding (this is for cases where people have given a font an inappropriate encoding but where the mapping from character code to glyph is correct).
    Sadly the encoding is not always sufficient for understanding the font's behavior.
    pfaedit.sourceforge.net /fontinfo.html   (4068 words)

      
     Character Encoding and the Web
    The simplest solution in foreign countries is to adopt one standard character encoding and store all files on the same machine in the same encoding format.
    If it is possible that the characters will be encoded differently from the original data, then the middleware must parse the file to find the or markup and change the charset or encoding value to match the properties of the new output stream.
    Data from different sources may be encoded in one of the 8859 eight bit character sets, one of the Windows vendor-specific character sets, or one of the legacy Far East multi-byte codes.
    www.yale.edu /pclt/encoding   (10018 words)

      
     General Purpose PostScript Generating Utility - Some Encodings
    See section What is an Encoding, is an instructive presentation of the encodings.
    This encoding is meant to be used for PC files with drawing lines.
    The Latin 2 character set supports the Slavic languages of Central Europe which use the Latin alphabet.
    ofb.net /gnu/a2ps/a2ps_66.html   (467 words)

      
     comp.sys.mac.comm FAQ (v 2.4.1) Feb 15 2003
    Note that modems for the Macintosh Performa apparently can only plug into the Macintosh Performa because of an extra pin which they posesses (there is a corresponding extra hole on the Performa serial port connector into which this pin fits).
    Unlike ASCII encoding there is virturally no increase in file size but since these formats are 8-bit they cannot be used on their own in the remaining areas of the Internet that are not fully 8-bit (like E-mail and Usenet).
    Usually, one won't find Macintosh files in uuencode format; however, most non-Macintosh specific binary data posted to Usenet is uuencoded, so if you wish to use any of this data (such as the images posted in alt.binaries.* and elsewhere), you will need to deal with uuencode.
    www.faqs.org /faqs/macintosh/comm-faq/part1   (6594 words)

      
     OpenType Fonts - A New Font Format for Macintosh and Windows
    Macintosh Classic fonts are fundamentally different from Windows fonts (click here for more info).
    The “Pro” designation is for OpenType fonts with central European (CE) language support and the “Std” designation is for OpenType fonts that do not have central European support.
    Unlike older one byte fonts, Unicode encoding uses two bytes per character so it is possible to have a font with up to 65,536 characters.
    www.sketchpad.net /opentype-fonts.htm   (1284 words)

      
     The Euro Symbol | Font Magazine 002 | Features | FontShop
    After its decision to launch the new currency, the European Council at first did not address the question whether, like the dollar, the yen and the pound sterling, it should be accompanied by a currency symbol of its own.
    On July 15, 1997, the European Monetary Institute confirms that “there is a need for a distinctive codified symbol of the common currency” and pledges support for the symbol presented by the European Commission.
    European Macintosh users type “Alt Shift D” or, under OS 9 or OS X, “Alt E”; other Macintosh users type “Option Shift 2”.
    www.fontshop.com /features/fontmag/002/02_euro   (3255 words)

      
     HermesSOFT | Cyrillic :: Glagolitic :: Multilingual :: Typefaces
    Most character sets and character encoding schemes developed in the past are limited in their coverage, usually supporting just one language or a small set of languages.
    That is, two encodings can use the same number for two different characters, or use different numbers for the same character.
    Thus, Western encoding supports languages of Western Europe, Central European encoding supports languages of East and Central Europe, Turkish encoding supports Turkish language, Cyrillic Standard encoding supports languages based on Cyrillic alphabet and so on.
    www.hermessoft.com /newproject/fontinfo.php   (806 words)

      
     BioMed Central | Full text | The two authentic methionine aminopeptidase genes are differentially expressed in Bacillus ...
    This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
    All experiments were performed in accordance with the European regulation requirements concerning the contained use of Genetically Modified Organisms of Group-I (French agreement N° 2735).
    The yflG transcriptional fusion with the lacZ gene (yflG::lacZ disruptant, strain BFS4611) was constructed within the framework of the European Union and Japanese project for the functional analysis of the genome of B.
    www.biomedcentral.com /1471-2180/5/57   (6664 words)

      
     Output Streams, Writers, and Encodings
    The encoding declaration will be set to indicate the character set in use.
    The default encoding of XML documents; each Unicode character is represented in between 1 and 4 bytes.
    The standard encodings in Table 3.1 should be sufficient for any document you need to create, and are a lot more cross-platform compatible than platform-specific code pages such as Cp1252 and MacRoman.
    www.ibiblio.org /xml/books/xmljava/chapters/ch03s03.html   (1184 words)

      
     Code page - Wikipedia, the free encyclopedia
    IBM and Microsoft often allocate a code page number to a character set even if that charset is better known by another name.
    In modern applications, operating systems and programming languages, the IBM code pages have been rendered obsolete by newer and better international standards, such as ISO 8859-1 and Unicode.
    This standard was in use in Iran in DOS-based programs and after introduction of Microsoft code page 1256 this standard became obsolete.
    en.wikipedia.org /wiki/Code_page   (989 words)

      
     [No title]
    However, the # Mac OS Central European character set uses the standard control # characters at 0x00-0x1F and 0x7F.
    The Mac OS Central # European character set also includes a number of characters # needed for the Mac OS user interface and localization (e.g.
    All of # the characters in Mac OS Central European that are also in the # Mac OS Roman character set are at the same code point in both # character sets; this improves application compatibility.
    www.unicode.org /Public/MAPPINGS/VENDORS/APPLE/CENTEURO.TXT   (598 words)

      
     BioMed Central | Full text | Apolipocrustacein, formerly vitellogenin, is the major egg yolk precursor protein in ...
    A DUF1081 domain (Pfam accession number PF06448) of unknown function was identified in the central region, and a von Willebrand-factor type-D domain (VWD) (Pfam accession number PF00094) at the C-terminal end.
    While the VWD domain, initially identified in the human von Willebrand factor (vWF), is distributed over a wide range of proteins, the LLT module/LPD-N domain, part of the lipovitellin 1 subunit of vertebrate Vtg [6-12], contains twenty-two N-terminal conserved amino acid sequence motifs (N1 to N22) as a common denominator of LLTP [15].
    Tsutsui N, Kawazoe I, Ohira T, Jasmani S, Yang W, Wilder M, Aida K: Molecular characterization of a cDNA encoding vitellogenin and its expression in the hepatopancreas and ovary during vitellogenesis in the kuruma prawn, Penaeus japonicus.
    www.biomedcentral.com /1471-2148/7/3   (5042 words)

      
     The Ohio State University Libraries: East European and Slavic Studies (EES)
    Emily Tock, Bibliographer for Slavic and East European Studies, is the librarian in charge of the collections from and about the Eurasian landmass - from Prague to Vladivostok.
    Unbound copies of these works are available for purchase by scholars and libraries from The Ohio State University's Center for Slavic and East European Studies for the cost of photocopying.
    The University of Illinois Slavic Library's searchable database of Slavic and East European newspapers is a must-see resource for researchers focusing on the twentieth century.
    library.osu.edu /sites/ees/public_html/Index.html   (1456 words)

      
     East European font solutions
    When done with the Cyrillic encodings, set the Default Encoding to the one of your preferred (usually native) tongue, in the case of a West European tongue, I recommend Western (Latin 1), which is identical to ISO-8859-1.
    I have not tested them yet, but I presume they are identical to the general Central European Macintosh fonts, and they miss glyphs for the specific Serbo-Croatian dj sound written as a d with a horizontal stroke.
    Their encoding corresponds much to what is used on many pages on the World Wide Web.
    www.thau-knudsen.dk /sw/easteurope.html   (2840 words)

      
     ISO 8859-2 (Latin 2) Resources
    The series was designed by ECMA (European Computer Manufacturer's Association) and approved as standard by the International Organization for Standardization (ISO).
    The most complete non-commercial collection of ISO 8859-2 encoded fonts has been prepared by Petr Kolar from Technical University of Liberec, Czech Republic, and made avaiable on ftp.vslib.cz.
    Central European fonts for Macintosh can be obtained from the internationalization/localization area on the Apple FTP server (check either Czech or Polish subdirectory).
    nl.ijs.si /gnusl/cee/iso8859-2.html   (1608 words)

    Try your search on: Qwika (all wikis)

    Factbites
      About us   |   Why use us?   |   Reviews   |   Press   |   Contact us  
    Copyright © 2005-2007 www.factbites.com Usage implies agreement with terms.