wefcdr

ISO 8859 encoding family
Standard	ISO/IEC 8859
Extends	US-ASCII
Preceded by	ISO 646
Succeeded by	ISO 10646 (Unicode)
Other related encoding(s)	Windows-125x

ISO/IEC 8859 is a joint ISO and IEC series of standards for 8-bit character encodings. The series of standards consists of numbered parts, such as ISO/IEC 8859-1, ISO/IEC 8859-2, etc. There are 15 parts, excluding the abandoned ISO/IEC 8859-12. The ISO working group maintaining this series of standards has been disbanded.

ISO/IEC 8859 parts 1, 2, 3, and 4 were originally Ecma International standard ECMA-94.

Introduction

While the bit patterns of the 95 printable ASCII characters are sufficient to exchange information in modern English, most other languages that use Latin alphabets need additional symbols not covered by ASCII. ISO/IEC 8859 sought to remedy this problem by utilizing the eighth bit in an 8-bit byte to allow positions for another 96 printable characters. Early encodings were limited to 7 bits because of restrictions of some data transmission protocols, and partially for historical reasons. However, more characters were needed than could fit in a single 8-bit character encoding, so several mappings were developed, including at least ten suitable for various Latin alphabets.

The ISO/IEC 8859-n encodings only contain printable characters, and were designed to be used in conjunction with control characters mapped to the unassigned bytes. To this end a series of encodings registered with the IANA add the C0 control set (control characters mapped to bytes 0 to 31) from ISO 646 and the C1 control set (control characters mapped to bytes 128 to 159) from ISO 6429, resulting in full 8-bit character maps with most, if not all, bytes assigned. These sets have ISO-8859-n as their preferred MIME name or, in cases where a preferred MIME name is not specified, their canonical name. Many people use the terms ISO/IEC 8859-n and ISO-8859-n interchangeably. ISO/IEC 8859-11 did not get such a charset assigned, presumably because it was almost identical to TIS 620.
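The byte layout described above can be sketched in a few lines of Python. This is only an illustration: the function name `classify` and the region labels are made up for this example, and the final check assumes that Python's built-in `iso-8859-1` codec behaves like the IANA charset, with the C1 bytes decoding to control characters.

```python
import unicodedata

def classify(byte: int) -> str:
    """Return the region of the 8-bit code space a byte value falls in.

    Hypothetical helper; the region labels are illustrative only.
    """
    if not 0 <= byte <= 0xFF:
        raise ValueError("not an 8-bit value")
    if byte <= 0x1F:
        return "C0 control"              # bytes 0 to 31
    if byte <= 0x7E:
        return "ASCII printable"         # 95 characters, including space
    if byte == 0x7F:
        return "DEL"
    if byte <= 0x9F:
        return "C1 control"              # bytes 128 to 159
    return "ISO/IEC 8859 printable"      # bytes 160 to 255

# Tally how many byte values fall in each region.
counts = {}
for b in range(256):
    region = classify(b)
    counts[region] = counts.get(region, 0) + 1

# Decode the C1 range with the full charset and check the Unicode category.
c1_text = bytes(range(0x80, 0xA0)).decode("iso-8859-1")
c1_all_controls = all(unicodedata.category(ch) == "Cc" for ch in c1_text)

print(counts)
print(c1_all_controls)
```

The tally gives 95 printable ASCII positions and 96 positions in the upper range, matching the figures quoted in the introduction.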

Characters

The ISO/IEC 8859 standard is designed for reliable information exchange, not typography; the standard omits symbols needed for high-quality typography, such as optional ligatures, curly quotation marks, dashes, etc. As a result, high-quality typesetting systems often use proprietary or idiosyncratic extensions on top of the ASCII and ISO/IEC 8859 standards, or use Unicode instead.

As a rule of thumb, if a character or symbol was not already part of a widely used data-processing character set and was also not usually provided on typewriter keyboards for a national language, it did not get in. Hence the directional double quotation marks « and » used for some European languages were included, but not the directional double quotation marks “ and ” used for English and some other languages. French did not get its œ and Œ ligatures because they could be typed as 'oe'. Likewise, Ÿ, needed for all-caps text, was dropped as well.^[1]^[2]^[3] Albeit under different codepoints, these three characters were later reintroduced with ISO/IEC 8859-15 in 1999, which also introduced the new euro sign character €. Likewise Dutch did not get the ĳ and Ĳ letters, because Dutch speakers had become used to typing these as two letters instead. Romanian did not initially get its Ș/ș and Ț/ț (with comma) letters, because these letters were initially unified with Ş/ş and Ţ/ţ (with cedilla) by the Unicode Consortium, considering the shapes with comma beneath to be glyph variants of the shapes with cedilla. However, the letters with explicit comma below were later added to the Unicode standard and are also in ISO/IEC 8859-16.

Most of the ISO/IEC 8859 encodings provide diacritic marks required for various European languages using the Latin script. Others provide non-Latin alphabets: Greek, Cyrillic, Hebrew, Arabic and Thai. Most of the encodings contain only spacing characters, although the Thai, Hebrew, and Arabic ones also contain combining characters. However, the standard makes no provision for the scripts of East Asian languages (CJK), as their ideographic writing systems require many thousands of code points. Although it uses Latin-based characters, Vietnamese does not fit into 96 positions either (unless combining diacritics are used). Each Japanese syllabic alphabet (hiragana or katakana, see Kana) would fit, but like several other alphabets of the world they are not encoded in the ISO/IEC 8859 system.
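The omission of œ, Œ and Ÿ from ISO/IEC 8859-1, and their later addition (together with €) in ISO/IEC 8859-15, can be observed with Python's standard codecs `iso8859_1` and `iso8859_15`. The sketch below is illustrative; `can_encode` is a hypothetical helper name, not part of any library.

```python
def can_encode(text: str, codec: str) -> bool:
    """Report whether every character of text exists in the given codec."""
    try:
        text.encode(codec)
        return True
    except UnicodeEncodeError:
        return False

for ch in "œŒŸ€":
    print(ch, can_encode(ch, "iso8859_1"), can_encode(ch, "iso8859_15"))

# Compare the upper halves of the two parts position by position.
latin1 = bytes(range(0xA0, 0x100)).decode("iso8859_1")
latin9 = bytes(range(0xA0, 0x100)).decode("iso8859_15")
changed = {
    0xA0 + i: (old, new)
    for i, (old, new) in enumerate(zip(latin1, latin9))
    if old != new
}
for pos, (old, new) in changed.items():
    print(f"0x{pos:02X}: {old} -> {new}")
```

Only a handful of positions differ, which is why text encoded in one of the two parts is often readable, though not fully correct, when interpreted as the other.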

The parts of ISO/IEC 8859

ISO/IEC 8859 is divided into the following parts:

Part	Name	Revisions	Description
Part 1	Latin-1 Western European	1987, 1998	Perhaps the most widely used part of ISO/IEC 8859, covering most Western European languages: Danish (partial),^{[nb 1]} Dutch (partial),^{[nb 2]} English, Faeroese, Finnish (partial),^{[nb 3]} French (partial),^{[nb 3]} German, Icelandic, Irish, Italian, Norwegian, Portuguese, Rhaeto-Romanic, Scottish Gaelic, Spanish, Catalan, and Swedish. Languages from other parts of the world are also covered, including: Eastern European Albanian, Southeast Asian Indonesian, as well as the African languages Afrikaans and Swahili. The missing euro sign and capital Ÿ are in the revised version ISO/IEC 8859-15 (see below). The corresponding IANA character set is ISO-8859-1.
Part 2	Latin-2 Central European	1987, 1999	Supports those Central and Eastern European languages that use the Latin alphabet, including Bosnian, Polish, Croatian, Czech, Slovak, Slovene, Serbian, and Hungarian. The missing euro sign can be found in version ISO/IEC 8859-16.
Part 3	Latin-3 South European	1988, 1999	Turkish, Maltese, and Esperanto. Largely superseded by ISO/IEC 8859-9 for Turkish.
Part 4	Latin-4 North European	1988, 1998	Estonian, Latvian, Lithuanian, Greenlandic, and Sami.
Part 5	Latin/Cyrillic	1988, 1999	Covers mostly Slavic languages that use a Cyrillic alphabet, including Belarusian, Bulgarian, Macedonian, Russian, Serbian, and Ukrainian (partial).^{[nb 4]}
Part 6	Latin/Arabic	1987, 1999	Covers the most common Arabic language characters. Does not support other languages using the Arabic script. Text needs bidirectional (BiDi) and cursive-joining processing for display.
Part 7	Latin/Greek	1987, 2003	Covers the modern Greek language (monotonic orthography). Can also be used for Ancient Greek written without accents or in monotonic orthography, but lacks the diacritics for polytonic orthography. These were introduced with Unicode.
Part 8	Latin/Hebrew	1988, 1999	Covers the modern Hebrew alphabet as used in Israel. In practice two different encodings exist, logical order (needs to be BiDi processed for display) and visual (left-to-right) order (in effect, after bidi processing and line breaking).
Part 9	Latin-5 Turkish	1989, 1999	Largely the same as ISO/IEC 8859-1, replacing the rarely used Icelandic letters with Turkish ones.
Part 10	Latin-6 Nordic	1992, 1998	A rearrangement of Latin-4. Considered more useful for Nordic languages. Baltic languages use Latin-4 more.
Part 11	Latin/Thai	2001	Contains characters needed for the Thai language. Virtually identical to TIS 620.
~~Part 12~~	Latin/Devanagari	N/A	The work in making a part of 8859 for Devanagari was officially abandoned in 1997. ISCII and Unicode/ISO/IEC 10646 cover Devanagari.
Part 13	Latin-7 Baltic Rim	1998	Added some characters for Baltic languages which were missing from Latin-4 and Latin-6.
Part 14	Latin-8 Celtic	1998	Covers Celtic languages such as Gaelic and the Breton language.
Part 15	Latin-9	1999	A revision of 8859-1 that removes some little-used symbols, replacing them with the euro sign € and the letters Š, š, Ž, ž, Œ, œ, and Ÿ, which completes the coverage of French, Finnish and Estonian.
Part 16	Latin-10 South-Eastern European	2001	Intended for Albanian, Croatian, Hungarian, Italian, Polish, Romanian and Slovene, but also Finnish, French, German and Irish Gaelic (new orthography). The focus lies more on letters than symbols. The currency sign is replaced with the euro sign.

Each part of ISO/IEC 8859 is designed to support languages that often borrow from each other, so the characters needed by each language are usually accommodated by a single part. However, there are some characters and language combinations that are not accommodated without transcriptions. Efforts were made to make conversions as smooth as possible. For example, German has all of its seven special characters at the same positions in all Latin variants (1–4, 9, 10, 13–16), and in many positions the characters only differ in the diacritics between the sets. In particular, variants 1–4 were designed jointly, and have the property that every encoded character appears either at a given position or not at all.
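The claim about the seven German special characters can be checked mechanically. The following sketch assumes Python's standard `iso8859_N` codecs are a faithful rendering of the corresponding parts; the names `LATIN_PARTS`, `GERMAN` and `positions` are hypothetical and exist only for this example.

```python
# The Latin variants listed above: parts 1-4, 9, 10 and 13-16.
LATIN_PARTS = [1, 2, 3, 4, 9, 10, 13, 14, 15, 16]
GERMAN = "ÄÖÜäöüß"

def positions(part: int) -> bytes:
    """Encode the German special characters in the given part."""
    return GERMAN.encode(f"iso8859_{part}")

reference = positions(1)
same_everywhere = all(positions(p) == reference for p in LATIN_PARTS)

print(reference.hex())
print(same_everywhere)
```

If a character were missing from one of the parts, `encode` would raise `UnicodeEncodeError`, so a clean run also confirms that all seven characters are present in every Latin variant.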

Table

Comparison of the various parts (1–16) of ISO/IEC 8859
Binary	Oct	Dec	Hex	1	2	3	4	5	6	7	8	9	10	11	13	14	15	16
1010 0000	240	160	A0	Non-breaking space (NBSP)
1010 0001	241	161	A1	¡	Ą	Ħ	Ą	Ё		‘		¡	Ą	ก	”	Ḃ	¡	Ą
1010 0010	242	162	A2	¢	˘		ĸ	Ђ		’	¢	¢	Ē	ข	¢	ḃ	¢	ą
1010 0011	243	163	A3	£	Ł	£	Ŗ	Ѓ		£			Ģ	ฃ	£			Ł
1010 0100	244	164	A4	¤				Є	¤	€	¤		Ī	ค	¤	Ċ	€
1010 0101	245	165	A5	¥	Ľ		Ĩ	Ѕ		₯	¥		Ĩ	ฅ	„	ċ	¥	„
1010 0110	246	166	A6	¦	Ś	Ĥ	Ļ	І		¦			Ķ	ฆ	¦	Ḋ	Š
1010 0111	247	167	A7	§				Ї		§				ง	§
1010 1000	250	168	A8	¨				Ј		¨			Ļ	จ	Ø	Ẁ	š
1010 1001	251	169	A9	©	Š	İ	Š	Љ		©			Đ	ฉ	©
1010 1010	252	170	AA	ª	Ş		Ē	Њ		ͺ	×	ª	Š	ช	Ŗ	Ẃ	ª	Ș
1010 1011	253	171	AB	«	Ť	Ğ	Ģ	Ћ		«			Ŧ	ซ	«	ḋ	«
1010 1100	254	172	AC	¬	Ź	Ĵ	Ŧ	Ќ	،	¬			Ž	ฌ	¬	Ỳ	¬	Ź
1010 1101	255	173	AD	soft hyphen (SHY)										ญ	SHY
1010 1110	256	174	AE	®	Ž		Ž	Ў			®		Ū	ฎ	®			ź
1010 1111	257	175	AF	¯	Ż		¯	Џ		―	¯		Ŋ	ฏ	Æ	Ÿ	¯	Ż
1011 0000	260	176	B0	°				А		°				ฐ	°	Ḟ	°
1011 0001	261	177	B1	±	ą	ħ	ą	Б		±			ą	ฑ	±	ḟ	±
1011 0010	262	178	B2	²	˛	²	˛	В		²			ē	ฒ	²	Ġ	²	Č
1011 0011	263	179	B3	³	ł	³	ŗ	Г		³			ģ	ณ	³	ġ	³	ł
1011 0100	264	180	B4	´				Д		΄	´		ī	ด	“	Ṁ	Ž
1011 0101	265	181	B5	µ	ľ	µ	ĩ	Е		΅	µ		ĩ	ต	µ	ṁ	µ	”
1011 0110	266	182	B6	¶	ś	ĥ	ļ	Ж		Ά	¶		ķ	ถ	¶
1011 0111	267	183	B7	·	ˇ	·	ˇ	З		·				ท	·	Ṗ	·
1011 1000	270	184	B8	¸				И		Έ	¸		ļ	ธ	ø	ẁ	ž
1011 1001	271	185	B9	¹	š	ı	š	Й		Ή	¹		đ	น	¹	ṗ	¹	č
1011 1010	272	186	BA	º	ş		ē	К		Ί	÷	º	š	บ	ŗ	ẃ	º	ș
1011 1011	273	187	BB	»	ť	ğ	ģ	Л	؛	»			ŧ	ป	»	Ṡ	»
1011 1100	274	188	BC	¼	ź	ĵ	ŧ	М		Ό	¼		ž	ผ	¼	ỳ	Œ
1011 1101	275	189	BD	½	˝	½	Ŋ	Н		½			―	ฝ	½	Ẅ	œ
1011 1110	276	190	BE	¾	ž		ž	О		Ύ	¾		ū	พ	¾	ẅ	Ÿ
1011 1111	277	191	BF	¿	ż		ŋ	П	؟	Ώ		¿	ŋ	ฟ	æ	ṡ	¿	ż
1100 0000	300	192	C0	À	Ŕ	À	Ā	Р		ΐ		À	Ā	ภ	Ą	À
1100 0001	301	193	C1	Á				С	ء	Α		Á		ม	Į	Á
1100 0010	302	194	C2	Â				Т	آ	Β		Â		ย	Ā	Â
1100 0011	303	195	C3	Ã	Ă		Ã	У	أ	Γ		Ã		ร	Ć	Ã		Ă
1100 0100	304	196	C4	Ä				Ф	ؤ	Δ		Ä		ฤ	Ä
1100 0101	305	197	C5	Å	Ĺ	Ċ	Å	Х	إ	Ε		Å		ล	Å			Ć
1100 0110	306	198	C6	Æ	Ć	Ĉ	Æ	Ц	ئ	Ζ		Æ		ฦ	Ę	Æ
1100 0111	307	199	C7	Ç			Į	Ч	ا	Η		Ç	Į	ว	Ē	Ç
1100 1000	310	200	C8	È	Č	È	Č	Ш	ب	Θ		È	Č	ศ	Č	È
1100 1001	311	201	C9	É				Щ	ة	Ι		É		ษ	É
1100 1010	312	202	CA	Ê	Ę	Ê	Ę	Ъ	ت	Κ		Ê	Ę	ส	Ź	Ê
1100 1011	313	203	CB	Ë				Ы	ث	Λ		Ë		ห	Ė	Ë
1100 1100	314	204	CC	Ì	Ě	Ì	Ė	Ь	ج	Μ		Ì	Ė	ฬ	Ģ	Ì
1100 1101	315	205	CD	Í				Э	ح	Ν		Í		อ	Ķ	Í
1100 1110	316	206	CE	Î				Ю	خ	Ξ		Î		ฮ	Ī	Î
1100 1111	317	207	CF	Ï	Ď	Ï	Ī	Я	د	Ο		Ï		ฯ	Ļ	Ï
Binary	Oct	Dec	Hex	1	2	3	4	5	6	7	8	9	10	11	13	14	15	16
1101 0000	320	208	D0	Ð	Đ		Đ	а	ذ	Π		Ğ	Ð	ะ	Š	Ŵ	Ð
1101 0001	321	209	D1	Ñ	Ń	Ñ	Ņ	б	ر	Ρ		Ñ	Ņ	ั	Ń	Ñ		Ń
1101 0010	322	210	D2	Ò	Ň	Ò	Ō	в	ز			Ò	Ō	า	Ņ	Ò
1101 0011	323	211	D3	Ó			Ķ	г	س	Σ		Ó		ำ	Ó
1101 0100	324	212	D4	Ô				д	ش	Τ		Ô		ิ	Ō	Ô
1101 0101	325	213	D5	Õ	Ő	Ġ	Õ	е	ص	Υ		Õ		ี	Ő
1101 0110	326	214	D6	Ö				ж	ض	Φ		Ö		ึ	Ö
1101 0111	327	215	D7	×				з	ط	Χ		×	Ũ	ื	×	Ṫ	×	Ś
1101 1000	330	216	D8	Ø	Ř	Ĝ	Ø	и	ظ	Ψ		Ø		ุ	Ų	Ø		Ű
1101 1001	331	217	D9	Ù	Ů	Ù	Ų	й	ع	Ω		Ù	Ų	ู	Ł	Ù
1101 1010	332	218	DA	Ú				к	غ	Ϊ		Ú		ฺ	Ś	Ú
1101 1011	333	219	DB	Û	Ű	Û		л		Ϋ		Û			Ū	Û
1101 1100	334	220	DC	Ü				м		ά		Ü			Ü
1101 1101	335	221	DD	Ý		Ŭ	Ũ	н		έ		İ	Ý		Ż	Ý		Ę
1101 1110	336	222	DE	Þ	Ţ	Ŝ	Ū	о		ή		Ş	Þ		Ž	Ŷ	Þ	Ț
1101 1111	337	223	DF	ß				п		ί	‗	ß		฿	ß
1110 0000	340	224	E0	à	ŕ	à	ā	р	ـ	ΰ	א	à	ā	เ	ą	à
1110 0001	341	225	E1	á				с	ف	α	ב	á		แ	į	á
1110 0010	342	226	E2	â				т	ق	β	ג	â		โ	ā	â
1110 0011	343	227	E3	ã	ă		ã	у	ك	γ	ד	ã		ใ	ć	ã		ă
1110 0100	344	228	E4	ä				ф	ل	δ	ה	ä		ไ	ä
1110 0101	345	229	E5	å	ĺ	ċ	å	х	م	ε	ו	å		ๅ	å			ć
1110 0110	346	230	E6	æ	ć	ĉ	æ	ц	ن	ζ	ז	æ		ๆ	ę	æ
1110 0111	347	231	E7	ç			į	ч	ه	η	ח	ç	į	็	ē	ç
1110 1000	350	232	E8	è	č	è	č	ш	و	θ	ט	è	č	่	č	è
1110 1001	351	233	E9	é				щ	ى	ι	י	é		้	é
1110 1010	352	234	EA	ê	ę	ê	ę	ъ	ي	κ	ך	ê	ę	๊	ź	ê
1110 1011	353	235	EB	ë				ы	ً	λ	כ	ë		๋	ė	ë
1110 1100	354	236	EC	ì	ě	ì	ė	ь	ٌ	μ	ל	ì	ė	์	ģ	ì
1110 1101	355	237	ED	í				э	ٍ	ν	ם	í		ํ	ķ	í
1110 1110	356	238	EE	î				ю	َ	ξ	מ	î		๎	ī	î
1110 1111	357	239	EF	ï	ď	ï	ī	я	ُ	ο	ן	ï		๏	ļ	ï
1111 0000	360	240	F0	ð	đ		đ	№	ِ	π	נ	ğ	ð	๐	š	ŵ	ð	đ
1111 0001	361	241	F1	ñ	ń	ñ	ņ	ё	ّ	ρ	ס	ñ	ņ	๑	ń	ñ		ń
1111 0010	362	242	F2	ò	ň	ò	ō	ђ	ْ	ς	ע	ò	ō	๒	ņ	ò
1111 0011	363	243	F3	ó			ķ	ѓ		σ	ף	ó		๓	ó
1111 0100	364	244	F4	ô				є		τ	פ	ô		๔	ō	ô
1111 0101	365	245	F5	õ	ő	ġ	õ	ѕ		υ	ץ	õ		๕	ő
1111 0110	366	246	F6	ö				і		φ	צ	ö		๖	ö
1111 0111	367	247	F7	÷				ї		χ	ק	÷	ũ	๗	÷	ṫ	÷	ś
1111 1000	370	248	F8	ø	ř	ĝ	ø	ј		ψ	ר	ø		๘	ų	ø		ű
1111 1001	371	249	F9	ù	ů	ù	ų	љ		ω	ש	ù	ų	๙	ł	ù
1111 1010	372	250	FA	ú				њ		ϊ	ת	ú		๚	ś	ú
1111 1011	373	251	FB	û	ű	û		ћ		ϋ		û		๛	ū	û
1111 1100	374	252	FC	ü				ќ		ό		ü			ü
1111 1101	375	253	FD	ý		ŭ	ũ	§		ύ	LRM	ı	ý		ż	ý		ę
1111 1110	376	254	FE	þ	ţ	ŝ	ū	ў		ώ	RLM	ş	þ		ž	ŷ	þ	ț
1111 1111	377	255	FF	ÿ	˙			џ				ÿ	ĸ		’	ÿ
Binary	Oct	Dec	Hex	1	2	3	4	5	6	7	8	9	10	11	13	14	15	16

Position 0xA0 is always the non-breaking space, and 0xAD is the soft hyphen in most parts; the soft hyphen is displayed only at line breaks. Other empty fields are either unassigned or could not be displayed by the system used.
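A short script can confirm the two fixed positions against Python's standard `iso8859_N` codecs. This is a sketch under the assumption that those codecs match the published parts; `ALL_PARTS`, `char_at` and the result names are hypothetical.

```python
# All published parts: 1 to 16, without the abandoned part 12.
ALL_PARTS = [n for n in range(1, 17) if n != 12]

def char_at(part: int, byte: int) -> str:
    """Decode a single byte in the given part (unassigned bytes become U+FFFD)."""
    return bytes([byte]).decode(f"iso8859_{part}", errors="replace")

nbsp_everywhere = all(char_at(p, 0xA0) == "\u00a0" for p in ALL_PARTS)
no_soft_hyphen = [p for p in ALL_PARTS if char_at(p, 0xAD) != "\u00ad"]

print(nbsp_everywhere)
print(no_soft_hyphen)
```

The second list names the parts where 0xAD is used for something other than the soft hyphen; in the table this is the Thai column.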

Some entries reflect additions made in the ISO/IEC 8859-7:2003 and ISO/IEC 8859-8:1999 versions. LRM stands for left-to-right mark (U+200E) and RLM stands for right-to-left mark (U+200F).
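The two directional marks sit at 0xFD and 0xFE in the Latin/Hebrew column of the table. A minimal check, assuming Python's `iso8859_8` codec follows the 1999 edition:

```python
import unicodedata

# Decode the two bytes shown as LRM and RLM in the Latin/Hebrew column.
marks = bytes([0xFD, 0xFE]).decode("iso8859_8")
names = [unicodedata.name(ch) for ch in marks]

for ch, name in zip(marks, names):
    print(f"U+{ord(ch):04X} {name}")
```

Both characters are invisible formatting marks, so printing their code points and names is more informative than printing the characters themselves.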

Relationship to Unicode and the UCS

Since 1991, the Unicode Consortium^{[nb 4]} has been working with ISO and IEC to develop the Unicode Standard and ISO/IEC 10646: the Universal Character Set (UCS) in tandem. Newer editions of ISO/IEC 8859 express characters in terms of their Unicode/UCS names and the U+nnnn notation, effectively causing each part of ISO/IEC 8859 to be a Unicode/UCS character encoding scheme that maps a very small subset of the UCS to single 8-bit bytes. The first 256 characters in Unicode and the UCS are identical to those in ISO/IEC 8859-1 (Latin-1).
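The identity between Latin-1 byte values and the first 256 Unicode code points can be demonstrated directly. The sketch below uses Python's `iso-8859-1` codec, which covers the full charset including the C0 and C1 control ranges; the variable names are illustrative.

```python
# Decode every possible byte value as Latin-1.
latin1_text = bytes(range(256)).decode("iso-8859-1")

# Each byte value should equal the code point of the character it decodes to.
matches = all(ord(ch) == i for i, ch in enumerate(latin1_text))
print(matches)

# The reverse direction: a Latin-1 character encodes to its own code point.
print("é".encode("iso-8859-1") == bytes([ord("é")]))
```

This property is what makes conversion from Latin-1 to Unicode code points a simple widening of each byte, with no lookup table needed.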

Single-byte character sets including the parts of ISO/IEC 8859 and derivatives of them were favoured throughout the 1990s, having the advantages of being well-established and more easily implemented in software: the equation of one byte to one character is simple and adequate for most single-language applications, and there are no combining characters or variant forms. As Unicode-enabled operating systems became more widespread, ISO/IEC 8859 and other legacy encodings became less popular. While remnants of ISO/IEC 8859 and single-byte character models remain entrenched in many operating systems, programming languages, data storage systems, networking applications, display hardware, and end-user application software, most modern computing applications use Unicode internally and rely on conversion tables to map to and from other encodings when necessary.
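As a rough illustration of both points, the one-byte-per-character model and the conversion through Unicode, the sketch below round-trips a Polish word between ISO/IEC 8859-2 and UTF-8 using Python's built-in conversion tables. The helper `transcode` and the sample word are chosen for this example only.

```python
def transcode(data: bytes, source: str, target: str = "utf-8") -> bytes:
    """Convert bytes between encodings by going through Unicode text."""
    return data.decode(source).encode(target)

word = "Łódź"
legacy = word.encode("iso8859_2")        # one byte per character
utf8 = transcode(legacy, "iso8859_2")    # variable-length encoding

print(len(word), len(legacy), len(utf8))

# Decoding with the wrong part yields different characters for the same bytes.
misread = legacy.decode("iso8859_1")
print(misread)
```

The last two lines show why the part number has to travel with the data: the bytes themselves carry no indication of which part was used to produce them.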

Development status

The ISO/IEC 8859 standard was maintained by ISO/IEC Joint Technical Committee 1, Subcommittee 2, Working Group 3 (ISO/IEC JTC 1/SC 2/WG 3). In June 2004, WG 3 disbanded, and maintenance duties were transferred to SC 2. The standard is not currently being updated, as the Subcommittee's only remaining working group, WG 2, is concentrating on development of Unicode's Universal Coded Character Set.

Notes

[nb 1] Missing several accented vowels including Ǿ and ǿ. These can be replaced with non-accented vowels at the cost of increased ambiguity.

[nb 2] Only the Ĳ/ĳ (letter IJ) is missing, which is usually represented as IJ.

[nb 3] Missing characters are in ISO/IEC 8859-15.

[nb 4] 8859-5 misses the Ґ/ґ letter, which was reintroduced into the Ukrainian alphabet in 1990.

References

[1] Haralambous, Yannis (September 2007). Fonts & Encodings. Translated by Horne, P. Scott (1st ed.). Sebastopol, California, USA: O'Reilly Media, Inc. pp. 37–38. ISBN 978-0-596-10242-5. ISBN 0-596-10242-9. […] According to a urban legend, the French delegate was out sick the day when the standard came up for a vote and had to have his Belgian counterpart act as his proxy. In fact, the French delegate was an engineer, who was convinced that this ligature was useless, and the Swiss and German representatives pressed hard to have the mathematical symbols × and ÷ included at the positions where Œ and œ would logically appear. […]

[2] André, Jacques (2003-10-15) [2003-10-02]. André, Bernard; Baron, Georges-Louis; Bruillard, Éric, eds. "Histoire d'Œ, histoire d'@ des rumeurs typographiques et de leurs enseignements". Traitement de texte et production de documents INRP/GEDIAPS (in French): 19–34. Archived from the original on 2016-12-08. Retrieved 2016-12-09.

[3] André, Jacques (November 1996). "ISO Latin-1, norme de codage des caractères européens? trois caractères français en sont absents!" (PDF). Cahiers GUTenberg (in French) (25): 65–77. Archived from the original (PDF) on 2008-11-30.

Published versions of each part of ISO/IEC 8859 are available, for a fee, from the ISO catalogue site and from the IEC Webstore.

PDF versions of the final drafts of some parts of ISO/IEC 8859 as submitted to the ISO/IEC JTC 1/SC 2/WG 3 for review & publication are available at the WG 3 web site:
- ISO/IEC 8859-1:1998 - 8-bit single-byte coded graphic character sets, Part 1: Latin alphabet No. 1 (draft dated February 12, 1998, published April 15, 1998)
- ISO/IEC 8859-4:1998 - 8-bit single-byte coded graphic character sets, Part 4: Latin alphabet No. 4 (draft dated February 12, 1998, published July 1, 1998)
- ISO/IEC 8859-7:1999 - 8-bit single-byte coded graphic character sets, Part 7: Latin/Greek alphabet (draft dated June 10, 1999; superseded by ISO/IEC 8859-7:2003, published October 10, 2003)
- ISO/IEC 8859-10:1998 - 8-bit single-byte coded graphic character sets, Part 10: Latin alphabet No. 6 (draft dated February 12, 1998, published July 15, 1998)
- ISO/IEC 8859-11:1999 - 8-bit single-byte coded graphic character sets, Part 11: Latin/Thai character set (draft dated June 22, 1999; superseded by ISO/IEC 8859-11:2001, published 15 December 2001)
- ISO/IEC 8859-13:1998 - 8-bit single-byte coded graphic character sets, Part 13: Latin alphabet No. 7 (draft dated April 15, 1998, published October 15, 1998)
- ISO/IEC 8859-15:1998 - 8-bit single-byte coded graphic character sets, Part 15: Latin alphabet No. 9 (draft dated August 1, 1997; superseded by ISO/IEC 8859-15:1999, published March 15, 1999)
- ISO/IEC 8859-16:2000 - 8-bit single-byte coded graphic character sets, Part 16: Latin alphabet No. 10 (draft dated November 15, 1999; superseded by ISO/IEC 8859-16:2001, published July 15, 2001)

ECMA standards, which in intent correspond exactly to the ISO/IEC 8859 character set standards, can be found at:
- Standard ECMA-94: 8-Bit Single Byte Coded Graphic Character Sets - Latin Alphabets No. 1 to No. 4 2nd edition (June 1986)
- Standard ECMA-113: 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Cyrillic Alphabet 3rd edition (December 1999)
- Standard ECMA-114: 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Arabic Alphabet 2nd edition (December 2000)
- Standard ECMA-118: 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Greek Alphabet (December 1986)
- Standard ECMA-121: 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Hebrew Alphabet 2nd edition (December 2000)
- Standard ECMA-128: 8-Bit Single-Byte Coded Graphic Character Sets - Latin Alphabet No. 5 2nd edition (December 1999)
- Standard ECMA-144: 8-Bit Single-Byte Coded Character Sets - Latin Alphabet No. 6 3rd edition (December 2000)

ISO/IEC 8859-1 to Unicode mapping tables as plain text files are at the Unicode FTP site.

Informal descriptions and code charts for most ISO/IEC 8859 standards are available in ISO/IEC 8859 Alphabet Soup (Mirror)
