Mercurial > emacs
view etc/charsets/README @ 89420:c3e67ce6ee0f
(Qsignature, Qendian): Delete these variables.
(syms_of_coding): Don't initialize them.
(CATEGORY_MASK_UTF_16_AUTO): New macro.
(detect_coding_utf_16): Add CATEGORY_MASK_UTF_16_AUTO in
detect_info->found.
(decode_coding_utf_16): Don't detect BOM here.
(encode_coding_utf_16): Produce BOM if CODING_UTF_16_BOM (coding)
is NOT utf_16_without_bom.
(setup_coding_system): For a coding system of type utf-16, check
if the attribute :endian is Qbig or not (not nil or not), and set
CODING_REQUIRE_DETECTION_MASK if BOM detection is required.
(detect_coding): If coding type is utf-16 and BOM detection is
required, detect it.
(Fdefine_coding_system_internal): For a coding system of type
utf-16, check if the attribute :endian is Qbig or not (not nil or
not).
author | Kenichi Handa <handa@m17n.org> |
---|---|
date | Tue, 06 May 2003 12:28:11 +0000 |
parents | e324faed6fcc |
children | 957a41d75fd5 |
line wrap: on
line source
# README file for charset mapping files in this directory. # Copyright (C) 2001, 2002 # National Institute of Advanced Industrial Science and Technology (AIST) # Registration Number H13PRO009 # Copyright (C) 2002 Free Software Foundation, Inc. # This file is part of GNU Emacs. # GNU Emacs is free software; you can redistribute it and/or modify # it under the terms of the GNU General Public License as published by # the Free Software Foundation; either version 2, or (at your option) # any later version. # GNU Emacs is distributed in the hope that it will be useful, # but WITHOUT ANY WARRANTY; without even the implied warranty of # MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the # GNU General Public License for more details. # You should have received a copy of the GNU General Public License # along with GNU Emacs; see the file COPYING. If not, write to # the Free Software Foundation, Inc., 59 Temple Place - Suite 330, # Boston, MA 02111-1307, USA. (1) Format of mapping files Each line contains a code point and the corresponding Unicode character code separated by a space. Both code points and Unicode character codes are in hexadecimal preceded by "0x". Comments may be used, starting with "#". Code ranges may also be used, with (inclusive) start and end code points separated by "-" followed by the unicode of the start of the range Examples: 0xA0 0x00A0 # no-break space 0x8141-0x8143 0x4E04 # map onto a Unicode range (2) Source of mapping files Most mapping files are generated from the data files distributed with glibc (under the sub-directory "localedata/charmaps"). This list shows the correspondence of the data file, the mapping file, and which charset uses it. DATA-FILE MAP-FILE CHARSET ========= ======== ======= ISO-8859-2 8859-2.map iso-8859-2 ISO-8859-3 8859-3.map iso-8859-3 ISO-8859-4 8859-4.map iso-8859-4 ISO-8859-5 8859-5.map iso-8859-5 ISO-8859-6 8859-6.map iso-8859-6 ISO-8859-7 8859-7.map iso-8859-7 ISO-8859-8 8859-8.map iso-8859-8 ISO-8859-9 8859-9.map iso-8859-9 ISO-8859-10 8859-10.map iso-8859-10 ISO-8859-13 8859-13.map iso-8859-13 ISO-8859-14 8859-14.map iso-8859-14 ISO-8859-15 8859-15.map iso-8859-15 ISO-8859-16 8859-16.map iso-8859-16 GB2312 gb2312-1980.map chinese-gb2312 EUC-KR ksc5601-1987.map korean-ksc5601 JIS_C6220-1969-RO and EUC-JP jisx0201.map jisx0201 EUC-JP jisx0208-1990.map japanese-jisx0208 EUC-JP jisx0212-1990.map japanese-jisx0212 EUC-TW cns11643-1.map chinese-cns11643-1 EUC-TW cns11643-2.map chinese-cns11643-2 BIG5 big5.map big5 BIG5 big5-1.map chinese-big5-1 BIG5 big5-2.map chinese-big5-2 MACINTOSH mac-roman.map mac-roman VISCII viscii.map viscii VISCII viscii-lower.map vietnamese-viscii-lower VISCII viscii-upper.map vietnamese-viscii-upper VISCII vscii.map vscii KOI8-R koi8-r.map koi8-r IBM866 ibm866.map alternativnyj CP1251 windows-1251.map windows-1251 CP1250 windows-1250.map windows-1250 GEORGIAN-PS georgian-ps.map georgian-ps KOI8-U koi8-u.map koi8-u KOI8-T koi8-t.map koi8-t EBCDIC-US ebcdic.us.map ebcdic-us EBCDIC-UK ebcdic.uk.map ebcdic-uk CP1250 windows-1250.map windows-1250 CP1251 windows-1251.map windows-1251 CP1252 windows-1252.map windows-1252 From ICU: cp1125.map cp1125