# README file for charset mapping files in this directory.
# Copyright (C) 2001, 2002
# National Institute of Advanced Industrial Science and Technology (AIST)
# Registration Number H13PRO009
# Copyright (C) 2002 Free Software Foundation, Inc.

# This file is part of GNU Emacs.

# GNU Emacs is free software; you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation; either version 2, or (at your option)
# any later version.

# GNU Emacs is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
# GNU General Public License for more details.

# You should have received a copy of the GNU General Public License
# along with GNU Emacs; see the file COPYING.  If not, write to
# the Free Software Foundation, Inc., 59 Temple Place - Suite 330,
# Boston, MA 02111-1307, USA.


(1) Format of mapping files

Each line contains a code point and the corresponding Unicode
character code separated by a space.  Both code points and Unicode
character codes are in hexadecimal preceded by "0x".  Comments may be
used, starting with "#".  Code ranges may also be used, with
(inclusive) start and end code points separated by "-", followed by
the Unicode character code of the start of the range.

Examples:
0xA0 0x00A0  # no-break space

0x8141-0x8143 0x4E04  # map onto a Unicode range


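The format above can be read with a few lines of code.  The following
is a minimal sketch in Python, not the reader Emacs itself uses; the
names parse_map_line and parse_map are hypothetical.  It assumes that a
range maps onto consecutive Unicode character codes and that the code
points inside a range can be stepped by simple integer increments,
which may not hold for a range that crosses a row boundary in a
multi-byte charset.

```python
import re

# Hypothetical helpers, not part of Emacs.
# One mapping entry: CODE or CODE-CODE, whitespace, then the Unicode code.
LINE_RE = re.compile(
    r"^(0x[0-9A-Fa-f]+)(?:-(0x[0-9A-Fa-f]+))?\s+(0x[0-9A-Fa-f]+)$"
)


def parse_map_line(line):
    """Return a list of (code_point, unicode_code) pairs for one line."""
    text = line.split("#", 1)[0].strip()  # drop the comment, if any
    if not text:
        return []  # blank or comment-only line
    match = LINE_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized mapping line: {line!r}")
    start = int(match.group(1), 16)
    end = int(match.group(2), 16) if match.group(2) else start
    if end < start:
        raise ValueError(f"range end precedes start: {line!r}")
    unicode_start = int(match.group(3), 16)
    # Assumption: the range is inclusive and maps onto consecutive
    # Unicode character codes beginning at unicode_start.
    return [(start + i, unicode_start + i) for i in range(end - start + 1)]


def parse_map(text):
    """Build a dict from code point to Unicode code for a whole file."""
    table = {}
    for line in text.splitlines():
        for code, uni in parse_map_line(line):
            table[code] = uni
    return table
```

Under these assumptions, the range example 0x8141-0x8143 0x4E04
expands to three pairs: 0x8141 to 0x4E04, 0x8142 to 0x4E05, and 0x8143
to 0x4E06.
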
(2) Source of mapping files

Most mapping files are generated from the data files distributed with
glibc (under the sub-directory "localedata/charmaps").  The list below
shows, for each data file, the mapping file generated from it and the
charset that uses it.

DATA-FILE                     MAP-FILE            CHARSET
=========                     ========            =======
ISO-8859-2                    8859-2.map          iso-8859-2
ISO-8859-3                    8859-3.map          iso-8859-3
ISO-8859-4                    8859-4.map          iso-8859-4
ISO-8859-5                    8859-5.map          iso-8859-5
ISO-8859-6                    8859-6.map          iso-8859-6
ISO-8859-7                    8859-7.map          iso-8859-7
ISO-8859-8                    8859-8.map          iso-8859-8
ISO-8859-9                    8859-9.map          iso-8859-9
ISO-8859-10                   8859-10.map         iso-8859-10
ISO-8859-13                   8859-13.map         iso-8859-13
ISO-8859-14                   8859-14.map         iso-8859-14
ISO-8859-15                   8859-15.map         iso-8859-15
ISO-8859-16                   8859-16.map         iso-8859-16
GB2312                        gb2312-1980.map     chinese-gb2312
EUC-KR                        ksc5601-1987.map    korean-ksc5601
JIS_C6220-1969-RO and EUC-JP  jisx0201.map        jisx0201
EUC-JP                        jisx0208-1990.map   japanese-jisx0208
EUC-JP                        jisx0212-1990.map   japanese-jisx0212
EUC-TW                        cns11643-1.map      chinese-cns11643-1
EUC-TW                        cns11643-2.map      chinese-cns11643-2
BIG5                          big5.map            big5
BIG5                          big5-1.map          chinese-big5-1
BIG5                          big5-2.map          chinese-big5-2
MACINTOSH                     mac-roman.map       mac-roman
VISCII                        viscii.map          viscii
VISCII                        viscii-lower.map    vietnamese-viscii-lower
VISCII                        viscii-upper.map    vietnamese-viscii-upper
VISCII                        vscii.map           vscii
KOI8-R                        koi8-r.map          koi8-r
IBM866                        ibm866.map          alternativnyj
GEORGIAN-PS                   georgian-ps.map     georgian-ps
KOI8-U                        koi8-u.map          koi8-u
KOI8-T                        koi8-t.map          koi8-t
EBCDIC-US                     ebcdic.us.map       ebcdic-us
EBCDIC-UK                     ebcdic.uk.map       ebcdic-uk
CP1250                        windows-1250.map    windows-1250
CP1251                        windows-1251.map    windows-1251
CP1252                        windows-1252.map    windows-1252

From ICU:
                              cp1125.map          cp1125
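
For the glibc-derived files, the conversion might look roughly like
the sketch below.  This is not the script used to build the files in
this directory, which this README does not describe; the function name
charmap_to_map_lines is hypothetical.  It assumes that charmap entries
have the shape "<UXXXX> /xHH NAME" between the "CHARMAP" and
"END CHARMAP" lines, and it handles only the simple case where the
encoded bytes are themselves the charset code point (as in the
single-byte ISO-8859 files).  Multi-byte charsets such as those taken
from EUC-JP would need an extra step to turn encoded bytes into charset
code points, and no ranges are compacted.

```python
import re

# Assumed shape of a glibc charmap entry: "<UXXXX>  /xHH[/xHH...]  NAME".
ENTRY_RE = re.compile(
    r"^<U([0-9A-Fa-f]{4,8})>\s+((?:/x[0-9A-Fa-f]{2})+)(?:\s|$)"
)


def charmap_to_map_lines(charmap_text):
    """Convert glibc charmap text into "0xCODE 0xUNICODE" lines.

    Hypothetical helper: entries that do not match ENTRY_RE (comments,
    "<Uxxxx>..<Uyyyy>" ranges, header lines) are skipped.
    """
    pairs = []
    in_body = False
    for raw in charmap_text.splitlines():
        line = raw.strip()
        if line == "CHARMAP":
            in_body = True
            continue
        if line == "END CHARMAP":
            break
        if not in_body:
            continue
        match = ENTRY_RE.match(line)
        if match is None:
            continue
        # "/xa1/xa1" -> "a1a1": concatenate the encoded bytes.
        code = int("".join(match.group(2).split("/x")), 16)
        unicode_code = int(match.group(1), 16)
        pairs.append((code, unicode_code))
    pairs.sort()  # order by code point
    return [f"0x{code:X} 0x{uni:04X}" for code, uni in pairs]


SAMPLE = """\
<code_set_name> ISO-8859-2
<comment_char> %
<escape_char> /
CHARMAP
<U0104>     /xa1         LATIN CAPITAL LETTER A WITH OGONEK
<U0041>     /x41         LATIN CAPITAL LETTER A
<U00A0>     /xa0         NO-BREAK SPACE
END CHARMAP
"""

if __name__ == "__main__":
    for map_line in charmap_to_map_lines(SAMPLE):
        print(map_line)
```

The sample input is a hand-written fragment in the assumed charmap
shape, not a copy of a real glibc data file.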