



5.2.2 Coding Hungarian accents: Fancy 8-bit character sets (extended ASCII)





This article is from the Hungarian FAQ, by Zoli Fekete fekete@bc.edu with numerous contributions by others.


The following roll call lists the most important character sets
supported by the majority of hardware and software, including the
accent conversion programs. The available Hungarian accented characters
are detailed for each set.

Note:

Henceforth when referring to an accented character, the numerical
(Pro1sze1ki) notation will be used to maintain clarity.
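
For readers who want to see the actual letters, the numerical notation can be expanded mechanically. The following is a minimal sketch, not part of the FAQ's tools: the function name `from_numeric` is hypothetical, and the meaning of the digit 2 (umlaut) is assumed from the usual convention, since this section only shows 1 (acute) and 3 (double acute).

```python
import re
import unicodedata

# Digit -> combining mark. "2" is an assumption; it is not used in this section.
MARKS = {
    "1": "\u0301",  # combining acute accent
    "2": "\u0308",  # combining diaeresis (umlaut)
    "3": "\u030b",  # combining double acute accent
}


def from_numeric(text):
    """Expand vowel+digit pairs such as 'o3' into accented letters."""
    def repl(match):
        vowel, digit = match.group(1), match.group(2)
        # NFC composes the vowel and the combining mark into one letter
        # whenever Unicode has a precomposed form for the pair.
        return unicodedata.normalize("NFC", vowel + MARKS[digit])

    return re.sub(r"([aeiouAEIOU])([123])", repl, text)


print(from_numeric("Pro1sze1ki"))
print(from_numeric("o3 u3 A1 U3"))
```

The sketch rewrites every vowel followed by 1, 2 or 3, so ordinary text containing such pairs would need extra care.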

5.2.2.1 PC-codepages

(*) PC-437: Hardware

The basic hardware character set of PC-compatible systems. Since it
was supposed to contain many symbols (line drawing characters, some
Greek letters, etc.), and be general, it's pretty poor in terms of
accented characters. Missing Hungarian vowels: o3, u3 [substitute them
with o^ & u^], A1 [substitute it with A-circle], I1, O1, O3, U1, U3.

(*) CWI recommendation for Hungarian accents:

A standard initiative to replace the many house rules of character code
assignment for accents unavailable in PC-437. Codes are assigned as
follows:

o3->147 [o^], u3->150 [u^], A1->143, I1->141 [i`] or 140 [i^],
O1->149 [o`], O3->167, U1->151 [u`], U3->152 [y~]
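
Text stored under this recommendation can be read by decoding it as PC-437 and then swapping the displaced glyphs for the Hungarian letters. The sketch below assumes no ready-made CWI codec is at hand. The table is keyed by the PC-437 glyphs named in the brackets above; `y~` is read as y-diaeresis, A-circle and the masculine ordinal sign are taken as the PC-437 glyphs at 143 and 167, and only the 141 variant of I1 is handled. The names `CWI_OVER_CP437` and `decode_cwi` are hypothetical.

```python
# PC-437 glyph -> Hungarian letter that the CWI assignment puts in its place.
CWI_OVER_CP437 = str.maketrans({
    "\u00f4": "\u0151",  # o^ -> o3
    "\u00fb": "\u0171",  # u^ -> u3
    "\u00c5": "\u00c1",  # A-circle (code 143) -> A1
    "\u00ec": "\u00cd",  # i` (code 141) -> I1
    "\u00f2": "\u00d3",  # o` -> O1
    "\u00ba": "\u0150",  # masculine ordinal sign (code 167) -> O3
    "\u00f9": "\u00da",  # u` -> U1
    "\u00ff": "\u0170",  # y-diaeresis, assumed reading of "y~" -> U3
})


def decode_cwi(data):
    """Decode CWI-coded bytes via PC-437 plus the substitution table."""
    return data.decode("cp437").translate(CWI_OVER_CP437)


# Codes 147 and 150 come out as o3 and u3 instead of o^ and u^.
print(decode_cwi(bytes([147, 150])))
```

Letters that PC-437 already provides pass through unchanged, which is why only eight entries are needed.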

(*) PC-850: Multilingual

Contains all the accented vowels but ?3. Substitute them with ?^.
Note: ? means o, u, O or U.
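
The ?3 -> ?^ substitution can be applied automatically when writing text in a set that lacks the double acute letters. This is a minimal sketch using Python's built-in codecs; the names `FALLBACK` and `encode_with_fallback` are hypothetical.

```python
# Double acute letters -> circumflex stand-ins, as recommended above.
FALLBACK = str.maketrans({
    "\u0151": "\u00f4",  # o3 -> o^
    "\u0171": "\u00fb",  # u3 -> u^
    "\u0150": "\u00d4",  # O3 -> O^
    "\u0170": "\u00db",  # U3 -> U^
})


def encode_with_fallback(text, codec="cp850"):
    """Encode text, substituting only the letters the codec cannot hold."""
    out = []
    for ch in text:
        try:
            ch.encode(codec)
        except UnicodeEncodeError:
            ch = ch.translate(FALLBACK)
        out.append(ch)
    # Still raises UnicodeEncodeError if a stand-in is missing as well.
    return "".join(out).encode(codec)


print(encode_with_fallback("t\u0171r\u0151").decode("cp850"))
```

Because the check is made per character, the same call leaves the text untouched for a set such as `cp852` that already contains the letters.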

(*) PC-852: Latin 2

Contains all the accented vowels. Try to use this if available.

(*) PC-860: Portuguese
(*) PC-863: Canadian-French
(*) PC-865: Nordic

These sets lack various Hungarian accented letters, especially in upper
case. Using them for Hungarian text makes absolutely no sense.
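
The coverage of the codepages listed in this subsection can be checked against Python's built-in codecs. This is only a sketch: it assumes that Python's `cp437`, `cp850`, `cp852`, `cp860`, `cp863` and `cp865` codecs match the codepages described here, and the names `HUNGARIAN` and `missing` are hypothetical.

```python
# All accented Hungarian vowels, lower case first.
HUNGARIAN = (
    "\u00e1\u00e9\u00ed\u00f3\u00f6\u0151\u00fa\u00fc\u0171"
    "\u00c1\u00c9\u00cd\u00d3\u00d6\u0150\u00da\u00dc\u0170"
)


def missing(codec):
    """Return the accented Hungarian vowels that codec cannot encode."""
    absent = []
    for ch in HUNGARIAN:
        try:
            ch.encode(codec)
        except UnicodeEncodeError:
            absent.append(ch)
    return "".join(absent)


for codec in ("cp437", "cp850", "cp852", "cp860", "cp863", "cp865"):
    print(codec, missing(codec) or "(none)")
```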

5.2.2.2 ISO character sets

These character sets are specified by ISO standards. As far as ALL
(not only Hungarian) accented vowels are concerned, ISO 8859/1, 2 & 9
are equivalent to Windows Latin 1, 2 & 5 respectively.
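
For the Hungarian vowels, this equivalence can be spot-checked by comparing the byte each letter gets in the paired sets. The sketch assumes that Python's `cp1252`, `cp1250` and `cp1254` codecs correspond to Windows Latin 1, 2 and 5; it does not test the non-Hungarian vowels, and the names `PAIRS` and `differing` are hypothetical.

```python
HUNGARIAN = (
    "\u00e1\u00e9\u00ed\u00f3\u00f6\u0151\u00fa\u00fc\u0171"
    "\u00c1\u00c9\u00cd\u00d3\u00d6\u0150\u00da\u00dc\u0170"
)

# (ISO codec, assumed Windows counterpart)
PAIRS = [
    ("iso8859_1", "cp1252"),
    ("iso8859_2", "cp1250"),
    ("iso8859_9", "cp1254"),
]


def differing(iso, win, letters=HUNGARIAN):
    """Return the letters present in both sets but at different codes."""
    diffs = []
    for ch in letters:
        try:
            a, b = ch.encode(iso), ch.encode(win)
        except UnicodeEncodeError:
            continue  # letter absent from at least one of the two sets
        if a != b:
            diffs.append(ch)
    return diffs


for iso, win in PAIRS:
    print(iso, win, differing(iso, win) or "same codes")
```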

(*) ISO 8859/1:
(*) ISO 8859/3:

Contain all the accented vowels but ?3. Substitute them with ?^.

(*) ISO 8859/2:

Contains all the accented vowels. Try to use this if available.
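
Since both PC-852 and ISO 8859/2 hold every Hungarian accented vowel, such text can be moved between the two without substitutions. The following is a minimal sketch of that conversion with Python's built-in codecs, not a description of the accent conversion programs mentioned earlier; the function name `transcode` is hypothetical.

```python
def transcode(data, src, dst):
    """Re-encode bytes from the src character set into the dst one."""
    # Raises UnicodeEncodeError if dst lacks a character found in data.
    return data.decode(src).encode(dst)


sample = "\u00e1rv\u00edzt\u0171r\u0151 t\u00fck\u00f6rf\u00far\u00f3g\u00e9p"
pc_bytes = sample.encode("cp852")
iso_bytes = transcode(pc_bytes, "cp852", "iso8859_2")
print(iso_bytes.decode("iso8859_2") == sample)
```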

Fonts for iso-8859-2 (and some other) character sets can be found at
<ftp://ftp.tarki.hu/pub/font/> for various operating systems, and at
<ftp://almos.vein.hu/ssa/kbd_es_font/> (mirrored at
<ftp://ftp.vma.bme.hu/pub/ssa/kbd_es_font/> and
<ftp://ftp.tarki.hu/pub/ssa/kbd_es_font/>) mostly for Unix. There is
material for Hungarianizing Linux (and possibly other Unix variants)
at <ftp://ftp.tarki.hu/pub/magyar/linux/>.

5.2.2.3 Others

The following character sets are supported by various laser printers.
Roman-8 is especially important because it is the default character set
of many printers.

(*) Ventura International & Roman-8:
(*) MC Text:

Contain all the accented vowels but ?3. Substitute them with ?^.

 
