emacs: man/mule.texi annotate

annotate man/mule.texi @ 28285:c54d62415e91

Changed the type of parameter passed to the function defined by `quickurl-format-function'. Before only the text of the URL was passed. Now the whole URL structure is passed and the function is responsible for extracting the parts it requires. Changed the default of `quickurl-format-function' accordingly. (quickurl-insert): Changed the `funcall' of `quickurl-format-function' to match the above change. (quickurl-list-insert): Changed the `url' case so that it makes use of `quickurl-format-function', previous to this the format was hard wired.

author	Gerd Moellmann <gerd@gnu.org>
date	Thu, 23 Mar 2000 13:53:14 +0000
parents	0699f691fac1
children	ccadb68eaefd

rev	line source
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1 @c This is part of the Emacs manual.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	2 @c Copyright (C) 1997, 1999 Free Software Foundation, Inc.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	3 @c See file emacs.texi for copying conditions.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	4 @node International, Major Modes, Frames, Top
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	5 @chapter International Character Set Support
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	6 @cindex MULE
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	7 @cindex international scripts
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	8 @cindex multibyte characters
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	9 @cindex encoding of characters
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	10
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	11 @cindex Chinese
26140 068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	12 @cindex Cyrillic
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	13 @cindex Devanagari
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	14 @cindex Hindi
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	15 @cindex Marathi
26140 068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	16 @cindex Ethiopic
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	17 @cindex Greek
26140 068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	18 @cindex Hebrew
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	19 @cindex IPA
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	20 @cindex Japanese
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	21 @cindex Korean
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	22 @cindex Lao
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	23 @cindex Thai
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	24 @cindex Tibetan
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	25 @cindex Vietnamese
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	26 Emacs supports a wide variety of international character sets,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	27 including European variants of the Latin alphabet, as well as Chinese,
26140 068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	28 Cyrillic, Devanagari (Hindi and Marathi), Ethiopic, Greek, Hebrew, IPA,
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	29 Japanese, Korean, Lao, Thai, Tibetan, and Vietnamese scripts. These features
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	30 have been merged from the modified version of Emacs known as MULE (for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	31 ``MULti-lingual Enhancement to GNU Emacs'').
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	32
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	33 @menu
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	34 * International Intro:: Basic concepts of multibyte characters.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	35 * Enabling Multibyte:: Controlling whether to use multibyte characters.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	36 * Language Environments:: Setting things up for the language you use.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	37 * Input Methods:: Entering text characters not on your keyboard.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	38 * Select Input Method:: Specifying your choice of input methods.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	39 * Multibyte Conversion:: How single-byte characters convert to multibyte.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	40 * Coding Systems:: Character set conversion when you read and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	41 write files, and so on.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	42 * Recognize Coding:: How Emacs figures out which conversion to use.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	43 * Specify Coding:: Various ways to choose which conversion to use.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	44 * Fontsets:: Fontsets are collections of fonts
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	45 that cover the whole spectrum of characters.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	46 * Defining Fontsets:: Defining a new fontset.
27211 0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	47 * Single-Byte Character Support::
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	48 You can pick one European character set
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	49 to use without multibyte characters.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	50 @end menu
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	51
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	52 @node International Intro
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	53 @section Introduction to International Character Sets
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	54
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	55 The users of these scripts have established many more-or-less standard
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	56 coding systems for storing files. Emacs internally uses a single
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	57 multibyte character encoding, so that it can intermix characters from
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	58 all these scripts in a single buffer or string. This encoding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	59 represents each non-ASCII character as a sequence of bytes in the range
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	60 0200 through 0377. Emacs translates between the multibyte character
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	61 encoding and various other coding systems when reading and writing
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	62 files, when exchanging data with subprocesses, and (in some cases) in
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	63 the @kbd{C-q} command (@pxref{Multibyte Conversion}).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	64
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	65 @kindex C-h h
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	66 @findex view-hello-file
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	67 The command @kbd{C-h h} (@code{view-hello-file}) displays the file
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	68 @file{etc/HELLO}, which shows how to say ``hello'' in many languages.
27156 488f307b4f59 (International Intro): Add a link to to Fontsets. Gerd Moellmann <gerd@gnu.org> parents: 26513 diff changeset	69 This illustrates various scripts. If the font you're using doesn't have
488f307b4f59 (International Intro): Add a link to to Fontsets. Gerd Moellmann <gerd@gnu.org> parents: 26513 diff changeset	70 characters for all those different languages, you will see some hollow
488f307b4f59 (International Intro): Add a link to to Fontsets. Gerd Moellmann <gerd@gnu.org> parents: 26513 diff changeset	71 boxes instead of characters; see @ref{Fontsets}.
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	72
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	73 Keyboards, even in the countries where these character sets are used,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	74 generally don't have keys for all the characters in them. So Emacs
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	75 supports various @dfn{input methods}, typically one for each script or
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	76 language, to make it convenient to type them.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	77
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	78 @kindex C-x RET
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	79 The prefix key @kbd{C-x @key{RET}} is used for commands that pertain
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	80 to multibyte characters, coding systems, and input methods.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	81
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	82 @node Enabling Multibyte
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	83 @section Enabling Multibyte Characters
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	84
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	85 You can enable or disable multibyte character support, either for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	86 Emacs as a whole, or for a single buffer. When multibyte characters are
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	87 disabled in a buffer, then each byte in that buffer represents a
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	88 character, even codes 0200 through 0377. The old features for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	89 supporting the European character sets, ISO Latin-1 and ISO Latin-2,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	90 work as they did in Emacs 19 and also work for the other ISO 8859
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	91 character sets.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	92
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	93 However, there is no need to turn off multibyte character support to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	94 use ISO Latin; the Emacs multibyte character set includes all the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	95 characters in these character sets, and Emacs can translate
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	96 automatically to and from the ISO codes.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	97
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	98 To edit a particular file in unibyte representation, visit it using
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	99 @code{find-file-literally}. @xref{Visiting}. To convert a buffer in
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	100 multibyte representation into a single-byte representation of the same
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	101 characters, the easiest way is to save the contents in a file, kill the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	102 buffer, and find the file again with @code{find-file-literally}. You
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	103 can also use @kbd{C-x @key{RET} c}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	104 (@code{universal-coding-system-argument}) and specify @samp{raw-text} as
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	105 the coding system with which to find or save a file. @xref{Specify
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	106 Coding}. Finding a file as @samp{raw-text} doesn't disable format
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	107 conversion, uncompression and auto mode selection as
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	108 @code{find-file-literally} does.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	109
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	110 @vindex enable-multibyte-characters
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	111 @vindex default-enable-multibyte-characters
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	112 To turn off multibyte character support by default, start Emacs with
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	113 the @samp{--unibyte} option (@pxref{Initial Options}), or set the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	114 environment variable @samp{EMACS_UNIBYTE}. You can also customize
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	115 @code{enable-multibyte-characters} or, equivalently, directly set the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	116 variable @code{default-enable-multibyte-characters} in your init file to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	117 have basically the same effect as @samp{--unibyte}.
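
As a minimal sketch, assuming you prefer the init-file route over the
@samp{--unibyte} option, the setting described above could look like
this:

@example
;; Make newly created buffers use unibyte representation by default.
(setq default-enable-multibyte-characters nil)
@end example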
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	118
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	119 Multibyte strings are not created during initialization from the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	120 values of environment variables, @file{/etc/passwd} entries etc.@: that
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	121 contain non-ASCII 8-bit characters. However, the initialization file is
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	122 normally read as multibyte---like Lisp files in general---even with
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	123 @samp{--unibyte}. To avoid multibyte strings being generated by
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	124 non-ASCII characters in it, put @samp{-*-unibyte: t;-*-} in a comment on
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	125 the first line. Do the same for initialization files for packages like
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	126 Gnus.
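
For illustration, an init file that should be read as unibyte might
begin as follows. The cookie is written with the usual @samp{-*-}
file-variable delimiters; the second comment line is only a placeholder.

@example
;; -*-unibyte: t;-*-
;; The rest of the init file follows as usual.
@end example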
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	127
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	128 The mode line indicates whether multibyte character support is enabled
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	129 in the current buffer. If it is, there are two or more characters (most
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	130 often two dashes) before the colon near the beginning of the mode line.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	131 When multibyte characters are not enabled, just one dash precedes the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	132 colon.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	133
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	134 @node Language Environments
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	135 @section Language Environments
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	136 @cindex language environments
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	137
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	138 All supported character sets can be used in Emacs buffers whenever
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	139 multibyte characters are enabled; there is no need to select a
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	140 particular language in order to display its characters in an Emacs
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	141 buffer. However, it is important to select a @dfn{language environment}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	142 in order to set various defaults. The language environment really
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	143 represents a choice of preferred script (more or less) rather than a
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	144 choice of language.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	145
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	146 The language environment controls which coding systems to recognize
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	147 when reading text (@pxref{Recognize Coding}). This applies to files,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	148 incoming mail, netnews, and any other text you read into Emacs. It may
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	149 also specify the default coding system to use when you create a file.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	150 Each language environment also specifies a default input method.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	151
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	152 @findex set-language-environment
26140 068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	153 @vindex current-language-environment
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	154 To select a language environment, customize the option
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	155 @code{current-language-environment} or use the command @kbd{M-x
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	156 set-language-environment}. It makes no difference which buffer is
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	157 current when you use this command, because the effects apply globally to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	158 the Emacs session. The supported language environments include:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	159
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	160 @quotation
26140 068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	161 Chinese-BIG5, Chinese-CNS, Chinese-GB, Cyrillic-ALT, Cyrillic-ISO,
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	162 Cyrillic-KOI8, Czech, Devanagari, English, Ethiopic, German, Greek,
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	163 Hebrew, IPA, Japanese, Korean, Lao, Latin-1, Latin-2, Latin-3,
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	164 Latin-4, Latin-5, Latin-8, Latin-9, Romanian, Slovak, Slovenian, Thai,
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	165 Tibetan, Turkish, and Vietnamese.
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	166 @end quotation
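
As a sketch, an init file could select one of the environments listed
above by calling the command as a function. The choice of
@samp{Latin-1} here is only an example.

@example
;; Select the Latin-1 language environment for the whole session.
(set-language-environment "Latin-1")
@end example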
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	167
26140 068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	168 @findex set-locale-environment
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	169 @vindex locale-language-names
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	170 @vindex locale-charset-language-names
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	171 Some operating systems let you specify the language you are using by
26140 068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	172 setting the locale environment variables @env{LC_ALL}, @env{LC_CTYPE},
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	173 and @env{LANG}; the first of these which is nonempty specifies your
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	174 locale. Emacs handles this during startup by invoking the
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	175 @code{set-locale-environment} function, which matches your locale
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	176 against entries in the value of the variable
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	177 @code{locale-language-names} and selects the corresponding language
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	178 environment if a match is found. But if your locale also matches an
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	179 entry in the variable @code{locale-charset-language-names}, this entry
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	180 is preferred if its character set disagrees. For example, suppose the
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	181 locale @samp{en_GB.ISO8859-15} matches @code{"Latin-1"} in
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	182 @code{locale-language-names} and @code{"Latin-9"} in
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	183 @code{locale-charset-language-names}; since these two language
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	184 environments' character sets disagree, Emacs uses @code{"Latin-9"}.
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	185
26513 949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	186 If all goes well, the @code{set-locale-environment} function selects
949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	187 the language environment, since language is part of locale. It also
949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	188 adjusts the display table and terminal coding system, the locale coding
949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	189 system, and the preferred coding system as needed for the locale.
949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	190
949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	191 Since the @code{set-locale-environment} function is automatically
949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	192 invoked during startup, you normally do not need to invoke it yourself.
949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	193 However, if you modify the @env{LC_ALL}, @env{LC_CTYPE}, or @env{LANG}
949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	194 environment variables, you may want to invoke the
949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	195 @code{set-locale-environment} function afterwards.
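
A minimal sketch of that sequence follows. It assumes that
@code{set-locale-environment} can be called with no arguments, in which
case it consults the environment variables itself. The locale name is a
hypothetical value, and the change has no effect if @env{LC_ALL} or
@env{LC_CTYPE} is already set to something nonempty, since those take
precedence over @env{LANG}.

@example
;; Hypothetical locale value; substitute one your system provides.
(setenv "LANG" "ja_JP.eucJP")
;; Re-run the locale-based setup so Emacs picks up the new value.
(set-locale-environment)
@end example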
949ca235ee9e Describe the relationship between set-locale-environment and Paul Eggert <eggert@twinsun.com> parents: 26140 diff changeset	196
26140 068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	197 @findex set-locale-environment
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	198 @vindex locale-preferred-coding-systems
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	199 The @code{set-locale-environment} function normally uses the preferred
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	200 coding system established by the language environment to decode system
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	201 messages. But if your locale matches an entry in the variable
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	202 @code{locale-preferred-coding-systems}, Emacs uses the corresponding
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	203 coding system instead. For example, if the locale @samp{ja_JP.PCK}
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	204 matches @code{japanese-shift-jis} in
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	205 @code{locale-preferred-coding-systems}, Emacs uses that encoding even
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	206 though it might normally use @code{japanese-iso-8bit}.
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	207
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	208 The environment chosen from the locale when Emacs starts is
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	209 overridden by any explicit use of the command
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	210 @code{set-language-environment} or customization of
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	211 @code{current-language-environment} in your init file.
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	212
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	213 @kindex C-h L
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	214 @findex describe-language-environment
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	215 To display information about the effects of a certain language
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	216 environment @var{lang-env}, use the command @kbd{C-h L @var{lang-env}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	217 @key{RET}} (@code{describe-language-environment}). This tells you which
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	218 languages this language environment is useful for, and lists the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	219 character sets, coding systems, and input methods that go with it. It
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	220 also shows some sample text to illustrate scripts used in this language
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	221 environment. By default, this command describes the chosen language
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	222 environment.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	223
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	224 @vindex set-language-environment-hook
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	225 You can customize any language environment with the normal hook
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	226 @code{set-language-environment-hook}. The command
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	227 @code{set-language-environment} runs that hook after setting up the new
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	228 language environment. The hook functions can test for a specific
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	229 language environment by checking the variable
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	230 @code{current-language-environment}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	231
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	232 @vindex exit-language-environment-hook
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	233 Before it starts to set up the new language environment,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	234 @code{set-language-environment} first runs the hook
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	235 @code{exit-language-environment-hook}. This hook is useful for undoing
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	236 customizations that were made with @code{set-language-environment-hook}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	237 For instance, if you set up a special key binding in a specific language
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	238 environment using @code{set-language-environment-hook}, you should set
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	239 up @code{exit-language-environment-hook} to restore the normal binding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	240 for that key.
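
The pairing of the two hooks can be sketched as follows. The function
and variable names beginning with @samp{my-} are hypothetical, as are
the choice of the Greek environment and of the @key{F9} key; the sketch
also assumes that @key{F9} has no binding of its own, so that removing
the binding restores the normal state.

@example
;; Remember whether the binding was installed, so it can be undone.
(defvar my-lang-env-binding-active nil)

(defun my-lang-env-setup ()
  ;; Runs after the new language environment has been set up.
  (when (equal current-language-environment "Greek")
    (global-set-key [f9] 'toggle-input-method)
    (setq my-lang-env-binding-active t)))

(defun my-lang-env-teardown ()
  ;; Runs before the next language environment is set up.
  (when my-lang-env-binding-active
    (global-unset-key [f9])
    (setq my-lang-env-binding-active nil)))

(add-hook 'set-language-environment-hook 'my-lang-env-setup)
(add-hook 'exit-language-environment-hook 'my-lang-env-teardown)
@end example

Recording the state in a flag avoids relying on the value of
@code{current-language-environment} at the moment the exit hook runs.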
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	241
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	242 @node Input Methods
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	243 @section Input Methods
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	244
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	245 @cindex input methods
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	246 An @dfn{input method} is a kind of character conversion designed
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	247 specifically for interactive input. In Emacs, typically each language
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	248 has its own input method; sometimes several languages which use the same
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	249 characters can share one input method. A few languages support several
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	250 input methods.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	251
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	252 The simplest kind of input method works by mapping ASCII letters into
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	253 another alphabet. This is how the Greek and Russian input methods work.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	254
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	255 A more powerful technique is composition: converting sequences of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	256 characters into one letter. Many European input methods use composition
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	257 to produce a single non-ASCII letter from a sequence that consists of a
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	258 letter followed by accent characters (or vice versa). For example, some
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	259 methods convert the sequence @kbd{a'} into a single accented letter.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	260 These input methods have no special commands of their own; all they do
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	261 is compose sequences of printing characters.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	262
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	263 The input methods for syllabic scripts typically use mapping followed
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	264 by composition. The input methods for Thai and Korean work this way.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	265 First, letters are mapped into symbols for particular sounds or tone
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	266 marks; then, sequences of these which make up a whole syllable are
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	267 mapped into one syllable sign.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	268
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	269 Chinese and Japanese require more complex methods. In Chinese input
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	270 methods, first you enter the phonetic spelling of a Chinese word (in
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	271 input method @code{chinese-py}, among others), or a sequence of portions
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	272 of the character (input methods @code{chinese-4corner} and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	273 @code{chinese-sw}, and others). Since one phonetic spelling typically
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	274 corresponds to many different Chinese characters, you must select one of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	275 the alternatives using special Emacs commands. Keys such as @kbd{C-f},
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	276 @kbd{C-b}, @kbd{C-n}, @kbd{C-p}, and digits have special definitions in
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	277 this situation, used for selecting among the alternatives. @key{TAB}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	278 displays a buffer showing all the possibilities.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	279
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	280 In Japanese input methods, first you input a whole word using
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	281 phonetic spelling; then, after the word is in the buffer, Emacs converts
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	282 it into one or more characters using a large dictionary. One phonetic
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	283 spelling corresponds to many differently written Japanese words, so you
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	284 must select one of them; use @kbd{C-n} and @kbd{C-p} to cycle through
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	285 the alternatives.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	286
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	287 Sometimes it is useful to cut off input method processing so that the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	288 characters you have just entered will not combine with subsequent
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	289 characters. For example, in input method @code{latin-1-postfix}, the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	290 sequence @kbd{e '} combines to form an @samp{e} with an accent. What if
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	291 you want to enter them as separate characters?
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	292
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	293 One way is to type the accent twice; that is a special feature for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	294 entering the separate letter and accent. For example, @kbd{e ' '} gives
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	295 you the two characters @samp{e'}. Another way is to type another letter
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	296 after the @kbd{e}---something that won't combine with that---and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	297 immediately delete it. For example, you could type @kbd{e e @key{DEL}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	298 '} to get separate @samp{e} and @samp{'}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	299
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	300 Another method, more general but not quite as easy to type, is to use
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	301 @kbd{C-\ C-\} between two characters to stop them from combining. This
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	302 is the command @kbd{C-\} (@code{toggle-input-method}) used twice.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	303 @ifinfo
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	304 @xref{Select Input Method}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	305 @end ifinfo
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	306
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	307 @kbd{C-\ C-\} is especially useful inside an incremental search,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	308 because it stops waiting for more characters to combine, and starts
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	309 searching for what you have already entered.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	310
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	311 @vindex input-method-verbose-flag
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	312 @vindex input-method-highlight-flag
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	313 The variables @code{input-method-highlight-flag} and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	314 @code{input-method-verbose-flag} control how input methods explain what
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	315 is happening. If @code{input-method-highlight-flag} is non-@code{nil},
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	316 the partial sequence is highlighted in the buffer. If
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	317 @code{input-method-verbose-flag} is non-@code{nil}, the list of possible
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	318 characters to type next is displayed in the echo area (but not when you
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	319 are in the minibuffer).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	320
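  For instance, an init file might set the two variables as follows.
This is only an illustration of the settings described above, not a
recommended configuration.

@example
;; Highlight the partial sequence in the buffer.
(setq input-method-highlight-flag t)
;; Do not list the possible next characters in the echo area.
(setq input-method-verbose-flag nil)
@end example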
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	321 @node Select Input Method
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	322 @section Selecting an Input Method
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	323
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	324 @table @kbd
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	325 @item C-\
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	326 Enable or disable use of the selected input method.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	327
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	328 @item C-x @key{RET} C-\ @var{method} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	329 Select a new input method for the current buffer.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	330
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	331 @item C-h I @var{method} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	332 @itemx C-h C-\ @var{method} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	333 @findex describe-input-method
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	334 @kindex C-h I
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	335 @kindex C-h C-\
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	336 Describe the input method @var{method} (@code{describe-input-method}).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	337 By default, it describes the current input method (if any).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	338 This description should give you the full details of how to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	339 use any particular input method.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	340
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	341 @item M-x list-input-methods
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	342 Display a list of all the supported input methods.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	343 @end table
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	344
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	345 @findex set-input-method
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	346 @vindex current-input-method
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	347 @kindex C-x RET C-\
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	348 To choose an input method for the current buffer, use @kbd{C-x
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	349 @key{RET} C-\} (@code{set-input-method}). This command reads the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	350 input method name with the minibuffer; the name normally starts with the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	351 language environment that it is meant to be used with. The variable
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	352 @code{current-input-method} records which input method is selected.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	353
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	354 @findex toggle-input-method
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	355 @kindex C-\
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	356 Input methods use various sequences of ASCII characters to stand for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	357 non-ASCII characters. Sometimes it is useful to turn off the input
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	358 method temporarily. To do this, type @kbd{C-\}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	359 (@code{toggle-input-method}). To reenable the input method, type
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	360 @kbd{C-\} again.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	361
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	362 If you type @kbd{C-\} and you have not yet selected an input method,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	363 Emacs prompts you to specify one. This has the same effect as using
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	364 @kbd{C-x @key{RET} C-\} to specify an input method.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	365
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	366 @vindex default-input-method
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	367 Selecting a language environment specifies a default input method for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	368 use in various buffers. When you have a default input method, you can
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	369 select it in the current buffer by typing @kbd{C-\}. The variable
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	370 @code{default-input-method} specifies the default input method
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	371 (@code{nil} means there is none).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	372
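  If the default supplied by the language environment is not the one
you want, you can set the variable yourself. The following sketch
assumes you want @code{latin-1-postfix}; substitute any name shown by
@kbd{M-x list-input-methods}.

@example
;; Input method that C-\ turns on when none has been selected yet.
(setq default-input-method "latin-1-postfix")

;; Or select an input method for the current buffer right away.
(set-input-method "latin-1-postfix")
@end example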
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	373 @findex quail-set-keyboard-layout
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	374 Some input methods for alphabetic scripts work by (in effect)
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	375 remapping the keyboard to emulate various keyboard layouts commonly used
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	376 for those scripts. How to do this remapping properly depends on your
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	377 actual keyboard layout. To specify which layout your keyboard has, use
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	378 the command @kbd{M-x quail-set-keyboard-layout}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	379
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	380 @findex list-input-methods
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	381 To display a list of all the supported input methods, type @kbd{M-x
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	382 list-input-methods}. The list gives information about each input
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	383 method, including the string that stands for it in the mode line.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	384
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	385 @node Multibyte Conversion
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	386 @section Unibyte and Multibyte Non-ASCII Characters
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	387
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	388 When multibyte characters are enabled, character codes 0240 (octal)
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	389 through 0377 (octal) are not really legitimate in the buffer. The valid
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	390 non-ASCII printing characters have codes that start from 0400.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	391
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	392 If you type a self-inserting character in the invalid range 0240
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	393 through 0377, Emacs assumes you intended to use one of the ISO
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	394 Latin-@var{n} character sets, and converts it to the Emacs code
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	395 representing that Latin-@var{n} character. You select @emph{which} ISO
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	396 Latin character set to use through your choice of language environment
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	397 @iftex
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	398 (see above).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	399 @end iftex
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	400 @ifinfo
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	401 (@pxref{Language Environments}).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	402 @end ifinfo
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	403 If you do not specify a choice, the default is Latin-1.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	404
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	405 The same thing happens when you use @kbd{C-q} to enter an octal code
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	406 in this range.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	407
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	408 @node Coding Systems
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	409 @section Coding Systems
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	410 @cindex coding systems
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	411
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	412 Users of various languages have established many more-or-less standard
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	413 coding systems for representing them. Emacs does not use these coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	414 systems internally; instead, it converts from various coding systems to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	415 its own system when reading data, and converts the internal coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	416 system to other coding systems when writing data. Conversion is
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	417 possible in reading or writing files, in sending data to or receiving it
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	418 from the terminal, and in exchanging data with subprocesses.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	419
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	420 Emacs assigns a name to each coding system. Most coding systems are
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	421 used for one language, and the name of the coding system starts with the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	422 language name. Some coding systems are used for several languages;
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	423 their names usually start with @samp{iso}. There are also special
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	424 coding systems @code{no-conversion}, @code{raw-text} and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	425 @code{emacs-mule} which do not convert printing characters at all.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	426
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	427 @cindex end-of-line conversion
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	428 In addition to converting various representations of non-ASCII
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	429 characters, a coding system can perform end-of-line conversion. Emacs
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	430 handles three different conventions for how to separate lines in a file:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	431 newline, carriage-return linefeed, and just carriage-return.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	432
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	433 @table @kbd
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	434 @item C-h C @var{coding} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	435 Describe coding system @var{coding}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	436
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	437 @item C-h C @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	438 Describe the coding systems currently in use.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	439
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	440 @item M-x list-coding-systems
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	441 Display a list of all the supported coding systems.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	442 @end table
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	443
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	444 @kindex C-h C
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	445 @findex describe-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	446 The command @kbd{C-h C} (@code{describe-coding-system}) displays
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	447 information about particular coding systems. You can specify a coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	448 system name as argument; alternatively, with an empty argument, it
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	449 describes the coding systems currently selected for various purposes,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	450 both in the current buffer and as the defaults, and the priority list
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	451 for recognizing coding systems (@pxref{Recognize Coding}).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	452
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	453 @findex list-coding-systems
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	454 To display a list of all the supported coding systems, type @kbd{M-x
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	455 list-coding-systems}. The list gives information about each coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	456 system, including the letter that stands for it in the mode line
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	457 (@pxref{Mode Line}).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	458
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	459 @cindex end-of-line conversion
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	460 @cindex MS-DOS end-of-line conversion
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	461 @cindex Macintosh end-of-line conversion
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	462 Each of the coding systems that appear in this list---except for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	463 @code{no-conversion}, which means no conversion of any kind---specifies
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	464 how and whether to convert printing characters, but leaves the choice of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	465 end-of-line conversion to be decided based on the contents of each file.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	466 For example, if the file appears to use the sequence carriage-return
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	467 linefeed to separate lines, DOS end-of-line conversion will be used.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	468
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	469 Each of the listed coding systems has three variants which specify
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	470 exactly what to do for end-of-line conversion:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	471
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	472 @table @code
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	473 @item @dots{}-unix
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	474 Don't do any end-of-line conversion; assume the file uses
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	475 newline to separate lines. (This is the convention normally used
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	476 on Unix and GNU systems.)
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	477
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	478 @item @dots{}-dos
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	479 Assume the file uses carriage-return linefeed to separate lines, and do
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	480 the appropriate conversion. (This is the convention normally used on
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	481 Microsoft systems.@footnote{It is also specified for MIME @samp{text/*}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	482 bodies and in other network transport contexts. It is different
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	483 from the SGML reference syntax record-start/record-end format which
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	484 Emacs doesn't support directly.})
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	485
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	486 @item @dots{}-mac
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	487 Assume the file uses carriage-return to separate lines, and do the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	488 appropriate conversion. (This is the convention normally used on the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	489 Macintosh system.)
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	490 @end table
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	491
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	492 These variant coding systems are omitted from the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	493 @code{list-coding-systems} display for brevity, since they are entirely
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	494 predictable. For example, the coding system @code{iso-latin-1} has
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	495 variants @code{iso-latin-1-unix}, @code{iso-latin-1-dos} and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	496 @code{iso-latin-1-mac}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	497
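  A variant name can be used wherever a coding system name is
accepted. As a sketch, the following Lisp form visits a file while
forcing both the character conversion and the DOS end-of-line
convention. It relies on the variable @code{coding-system-for-read},
which is not described in this section, and the file name
@file{notes.txt} is hypothetical.

@example
;; Decode as Latin-1 and treat carriage-return linefeed as the
;; line separator, instead of letting Emacs decide from the data.
(let ((coding-system-for-read 'iso-latin-1-dos))
  (find-file "notes.txt"))
@end example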
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	498 The coding system @code{raw-text} is good for a file which is mainly
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	499 ASCII text, but may contain byte values above 127 which are not meant to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	500 encode non-ASCII characters. With @code{raw-text}, Emacs copies those
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	501 byte values unchanged, and sets @code{enable-multibyte-characters} to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	502 @code{nil} in the current buffer so that they will be interpreted
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	503 properly. @code{raw-text} handles end-of-line conversion in the usual
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	504 way, based on the data encountered, and has the usual three variants to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	505 specify the kind of end-of-line conversion to use.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	506
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	507 In contrast, the coding system @code{no-conversion} specifies no
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	508 character code conversion at all---none for non-ASCII byte values and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	509 none for end of line. This is useful for reading or writing binary
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	510 files, tar files, and other files that must be examined verbatim. It,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	511 too, sets @code{enable-multibyte-characters} to @code{nil}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	512
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	513 The easiest way to edit a file with no conversion of any kind is with
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	514 the @kbd{M-x find-file-literally} command. This uses
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	515 @code{no-conversion}, and also suppresses other Emacs features that
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	516 might convert the file contents before you see them. @xref{Visiting}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	517
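  The same command can be called from Lisp with the file name as its
argument. In this sketch the path @file{/tmp/archive.tar} is a
hypothetical example.

@example
;; Visit the file verbatim, with no conversion of any kind.
(find-file-literally "/tmp/archive.tar")
@end example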
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	518 The coding system @code{emacs-mule} means that the file contains
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	519 non-ASCII characters stored with the internal Emacs encoding. It
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	520 handles end-of-line conversion based on the data encountered, and has
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	521 the usual three variants to specify the kind of end-of-line conversion.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	522
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	523 @node Recognize Coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	524 @section Recognizing Coding Systems
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	525
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	526 Most of the time, Emacs can recognize which coding system to use for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	527 any given file---once you have specified your preferences.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	528
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	529 Some coding systems can be recognized or distinguished by which byte
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	530 sequences appear in the data. However, there are coding systems that
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	531 cannot be distinguished, not even potentially. For example, there is no
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	532 way to distinguish between Latin-1 and Latin-2; they use the same byte
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	533 values with different meanings.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	534
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	535 Emacs handles this situation by means of a priority list of coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	536 systems. Whenever Emacs reads a file, if you do not specify the coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	537 system to use, Emacs checks the data against each coding system,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	538 starting with the first in priority and working down the list, until it
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	539 finds a coding system that fits the data. Then it converts the file
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	540 contents assuming that they are represented in this coding system.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	541
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	542 The priority list of coding systems depends on the selected language
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	543 environment (@pxref{Language Environments}). For example, if you use
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	544 French, you probably want Emacs to prefer Latin-1 to Latin-2; if you use
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	545 Czech, you probably want Latin-2 to be preferred. This is one of the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	546 reasons to specify a language environment.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	547
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	548 @findex prefer-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	549 However, you can alter the priority list in detail with the command
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	550 @kbd{M-x prefer-coding-system}. This command reads the name of a coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	551 system from the minibuffer, and adds it to the front of the priority
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	552 list, so that it is preferred to all others. If you use this command
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	553 several times, each use adds one element to the front of the priority
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	554 list.
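
As a rough illustration, the same effect can be had from an init file by
calling the function directly. The coding systems chosen here are only an
assumption, for a user who mostly reads Latin-2 text:

@smallexample
;; Each call puts its argument at the front of the priority list,
;; so the coding system named last ends up with the highest priority.
(prefer-coding-system 'iso-8859-1)
(prefer-coding-system 'iso-8859-2)
@end smallexample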
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	555
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	556 If you use a coding system that specifies the end-of-line conversion
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	557 type, such as @code{iso-8859-1-dos}, what that means is that Emacs
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	558 should attempt to recognize @code{iso-8859-1} with priority, and should
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	559 use DOS end-of-line conversion in case it recognizes @code{iso-8859-1}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	560
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	561 @vindex file-coding-system-alist
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	562 Sometimes a file name indicates which coding system to use for the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	563 file. The variable @code{file-coding-system-alist} specifies this
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	564 correspondence. There is a special function
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	565 @code{modify-coding-system-alist} for adding elements to this list. For
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	566 example, to read and write all @samp{.txt} files using the coding system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	567 @code{chinese-iso-8bit}, you can execute this Lisp expression:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	568
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	569 @smallexample
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	570 (modify-coding-system-alist 'file "\\.txt\\'" 'chinese-iso-8bit)
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	571 @end smallexample
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	572
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	573 @noindent
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	574 The first argument should be @code{file}, the second argument should be
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	575 a regular expression that determines which files this applies to, and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	576 the third argument says which coding system to use for these files.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	577
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	578 @vindex inhibit-eol-conversion
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	579 Emacs recognizes which kind of end-of-line conversion to use based on
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	580 the contents of the file: if it sees only carriage-returns, or only
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	581 carriage-return linefeed sequences, then it chooses the end-of-line
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	582 conversion accordingly. You can inhibit the automatic use of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	583 end-of-line conversion by setting the variable @code{inhibit-eol-conversion}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	584 to non-@code{nil}.
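
A minimal sketch of that setting, as it might appear in an init file:

@smallexample
;; Turn off automatic detection of the end-of-line format.
(setq inhibit-eol-conversion t)
@end smallexample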
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	585
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	586 @vindex coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	587 You can specify the coding system for a particular file using the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	588 @samp{-*-@dots{}-*-} construct at the beginning of a file, or a local
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	589 variables list at the end (@pxref{File Variables}). You do this by
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	590 defining a value for the ``variable'' named @code{coding}. Emacs does
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	591 not really have a variable @code{coding}; instead of setting a variable,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	592 it uses the specified coding system for the file. For example,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	593 @samp{-*-mode: C; coding: latin-1;-*-} specifies use of the Latin-1
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	594 coding system, as well as C mode. If you specify the coding explicitly
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	595 in the file, that overrides @code{file-coding-system-alist}.
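
For instance, a Lisp source file could carry the tag in a comment on its
first line. This is only a sketch of the layout, not taken from any
particular file:

@smallexample
;; -*- coding: latin-1 -*-
@end smallexample

@noindent
or in a local variables list near its end:

@smallexample
;; Local Variables:
;; coding: latin-1
;; End:
@end smallexample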
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	596
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	597 @vindex auto-coding-alist
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	598 The variable @code{auto-coding-alist} is the strongest way to specify
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	599 the coding system for certain patterns of file names; this variable even
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	600 overrides @samp{-*-coding:-*-} tags in the file itself. Emacs uses this
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	601 feature for tar and archive files, to prevent Emacs from being confused
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	602 by a @samp{-*-coding:-*-} tag in a member of the archive and thinking it
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	603 applies to the archive file as a whole.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	604
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	605 @vindex buffer-file-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	606 Once Emacs has chosen a coding system for a buffer, it stores that
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	607 coding system in @code{buffer-file-coding-system} and uses that coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	608 system, by default, for operations that write from this buffer into a
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	609 file. This includes the commands @code{save-buffer} and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	610 @code{write-region}. If you want to write files from this buffer using
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	611 a different coding system, you can specify a different coding system for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	612 the buffer using @code{set-buffer-file-coding-system} (@pxref{Specify
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	613 Coding}).
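
In Lisp, the equivalent might look like the following sketch; the choice
of @code{iso-8859-1-unix} is only an example:

@smallexample
;; Use Latin-1 with Unix line endings the next time the
;; current buffer is written to its file.
(set-buffer-file-coding-system 'iso-8859-1-unix)
@end smallexample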
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	614
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	615 @vindex sendmail-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	616 When you send a message with Mail mode (@pxref{Sending Mail}), Emacs has
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	617 four different ways to determine the coding system to use for encoding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	618 the message text. It tries the buffer's own value of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	619 @code{buffer-file-coding-system}, if that is non-@code{nil}. Otherwise,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	620 it uses the value of @code{sendmail-coding-system}, if that is
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	621 non-@code{nil}. The third way is to use the default coding system for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	622 new files, which is controlled by your choice of language environment,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	623 if that is non-@code{nil}. If all of these three values are @code{nil},
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	624 Emacs encodes outgoing mail using the Latin-1 coding system.
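
To steer the second of these steps, one could set the variable in an init
file. The value shown is an assumption chosen for illustration:

@smallexample
;; Consulted only when the mail buffer has no coding system of its own.
(setq sendmail-coding-system 'iso-8859-1)
@end smallexample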
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	625
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	626 @vindex rmail-decode-mime-charset
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	627 When you get new mail in Rmail, each message is translated
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	628 automatically from the coding system it is written in---as if it were a
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	629 separate file. This uses the priority list of coding systems that you
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	630 have specified. If a MIME message specifies a character set, Rmail
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	631 obeys that specification, unless @code{rmail-decode-mime-charset} is
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	632 @code{nil}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	633
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	634 @vindex rmail-file-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	635 For reading and saving Rmail files themselves, Emacs uses the coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	636 system specified by the variable @code{rmail-file-coding-system}. The
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	637 default value is @code{nil}, which means that Rmail files are not
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	638 translated (they are read and written in the Emacs internal character
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	639 code).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	640
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	641 @node Specify Coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	642 @section Specifying a Coding System
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	643
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	644 In cases where Emacs does not automatically choose the right coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	645 system, you can use these commands to specify one:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	646
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	647 @table @kbd
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	648 @item C-x @key{RET} f @var{coding} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	649 Use coding system @var{coding} for the visited file
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	650 in the current buffer.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	651
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	652 @item C-x @key{RET} c @var{coding} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	653 Specify coding system @var{coding} for the immediately following
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	654 command.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	655
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	656 @item C-x @key{RET} k @var{coding} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	657 Use coding system @var{coding} for keyboard input.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	658
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	659 @item C-x @key{RET} t @var{coding} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	660 Use coding system @var{coding} for terminal output.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	661
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	662 @item C-x @key{RET} p @var{input-coding} @key{RET} @var{output-coding} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	663 Use coding systems @var{input-coding} and @var{output-coding} for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	664 subprocess input and output in the current buffer.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	665
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	666 @item C-x @key{RET} x @var{coding} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	667 Use coding system @var{coding} for transferring selections to and from
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	668 other programs through the window system.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	669
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	670 @item C-x @key{RET} X @var{coding} @key{RET}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	671 Use coding system @var{coding} for transferring @emph{one}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	672 selection---the next one---to or from the window system.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	673 @end table
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	674
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	675 @kindex C-x RET f
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	676 @findex set-buffer-file-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	677 The command @kbd{C-x @key{RET} f} (@code{set-buffer-file-coding-system})
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	678 specifies the file coding system for the current buffer---in other
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	679 words, which coding system to use when saving or rereading the visited
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	680 file. You specify which coding system using the minibuffer. Since this
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	681 command applies to a file you have already visited, it affects only the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	682 way the file is saved.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	683
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	684 @kindex C-x RET c
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	685 @findex universal-coding-system-argument
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	686 Another way to specify the coding system for a file is when you visit
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	687 the file. First use the command @kbd{C-x @key{RET} c}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	688 (@code{universal-coding-system-argument}); this command uses the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	689 minibuffer to read a coding system name. After you exit the minibuffer,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	690 the specified coding system is used for @emph{the immediately following
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	691 command}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	692
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	693 So if the immediately following command is @kbd{C-x C-f}, for example,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	694 it reads the file using that coding system (and records the coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	695 system for when the file is saved). Or if the immediately following
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	696 command is @kbd{C-x C-w}, it writes the file using that coding system.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	697 Other file commands affected by a specified coding system include
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	698 @kbd{C-x C-i} and @kbd{C-x C-v}, as well as the other-window variants of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	699 @kbd{C-x C-f}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	700
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	701 @kbd{C-x @key{RET} c} also affects commands that start subprocesses,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	702 including @kbd{M-x shell} (@pxref{Shell}).
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	703
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	704 However, if the immediately following command does not use the coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	705 system, then @kbd{C-x @key{RET} c} ultimately has no effect.
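
From Lisp, a comparable one-shot override can be sketched by binding
@code{coding-system-for-read} around the call. The file name below is
hypothetical:

@smallexample
;; Visit one file as Latin-2, whatever Emacs would otherwise guess.
(let ((coding-system-for-read 'iso-8859-2))
  (find-file "~/notes.txt"))
@end smallexample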
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	706
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	707 An easy way to visit a file with no conversion is with the @kbd{M-x
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	708 find-file-literally} command. @xref{Visiting}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	709
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	710 @vindex default-buffer-file-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	711 The variable @code{default-buffer-file-coding-system} specifies the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	712 choice of coding system to use when you create a new file. It applies
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	713 when you find a new file, and when you create a buffer and then save it
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	714 in a file. Selecting a language environment typically sets this
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	715 variable to a good choice of default coding system for that language
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	716 environment.
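
If the language environment's choice does not suit you, a setting along
these lines could go in an init file. It is a sketch against the variable
as this manual names it, and the value is only an example:

@smallexample
;; New files are written as Latin-1 unless something else is specified.
(setq default-buffer-file-coding-system 'iso-8859-1)
@end smallexample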
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	717
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	718 @kindex C-x RET t
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	719 @findex set-terminal-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	720 The command @kbd{C-x @key{RET} t} (@code{set-terminal-coding-system})
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	721 specifies the coding system for terminal output. If you specify a
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	722 character code for terminal output, all characters output to the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	723 terminal are translated into that coding system.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	724
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	725 This feature is useful for certain character-only terminals built to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	726 support specific languages or character sets---for example, European
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	727 terminals that support one of the ISO Latin character sets. You need to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	728 specify the terminal coding system when using multibyte text, so that
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	729 Emacs knows which characters the terminal can actually handle.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	730
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	731 By default, output to the terminal is not translated at all, unless
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	732 Emacs can deduce the proper coding system from your terminal type.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	733
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	734 @kindex C-x RET k
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	735 @findex set-keyboard-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	736 The command @kbd{C-x @key{RET} k} (@code{set-keyboard-coding-system})
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	737 specifies the coding system for keyboard input. Character-code
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	738 translation of keyboard input is useful for terminals with keys that
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	739 send non-ASCII graphic characters---for example, some terminals designed
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	740 for ISO Latin-1 or subsets of it.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	741
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	742 By default, keyboard input is not translated at all.
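
Both settings can also be made from an init file. The sketch below
assumes a terminal that sends and displays ISO Latin-1:

@smallexample
;; Translate terminal output and keyboard input as Latin-1.
(set-terminal-coding-system 'iso-8859-1)
(set-keyboard-coding-system 'iso-8859-1)
@end smallexample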
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	743
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	744 There is a similarity between using a coding system translation for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	745 keyboard input, and using an input method: both define sequences of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	746 keyboard input that translate into single characters. However, input
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	747 methods are designed to be convenient for interactive use by humans, and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	748 the sequences that are translated are typically sequences of ASCII
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	749 printing characters. Coding systems typically translate sequences of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	750 non-graphic characters.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	751
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	752 @kindex C-x RET x
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	753 @kindex C-x RET X
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	754 @findex set-selection-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	755 @findex set-next-selection-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	756 The command @kbd{C-x @key{RET} x} (@code{set-selection-coding-system})
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	757 specifies the coding system for sending selected text to the window
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	758 system, and for receiving the text of selections made in other
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	759 applications. This command applies to all subsequent selections, until
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	760 you override it by using the command again. The command @kbd{C-x
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	761 @key{RET} X} (@code{set-next-selection-coding-system}) specifies the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	762 coding system for the next selection made in Emacs or read by Emacs.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	763
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	764 @kindex C-x RET p
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	765 @findex set-buffer-process-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	766 The command @kbd{C-x @key{RET} p} (@code{set-buffer-process-coding-system})
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	767 specifies the coding system for input and output to a subprocess. This
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	768 command applies to the current buffer; normally, each subprocess has its
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	769 own buffer, and thus you can use this command to specify translation to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	770 and from a particular subprocess by giving the command in the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	771 corresponding buffer.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	772
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	773 By default, process input and output are not translated at all.
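
Called from Lisp, the function takes the decoding and encoding systems as
its two arguments. This sketch assumes it is evaluated in a buffer that
already has a subprocess, such as a shell buffer:

@smallexample
;; Decode what the subprocess prints, and encode what is sent
;; to it, as Latin-1.
(set-buffer-process-coding-system 'iso-8859-1 'iso-8859-1)
@end smallexample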
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	774
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	775 @vindex file-name-coding-system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	776 The variable @code{file-name-coding-system} specifies a coding system
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	777 to use for encoding file names. If you set the variable to a coding
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	778 system name (as a Lisp symbol or a string), Emacs encodes file names
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	779 using that coding system for all file operations. This makes it
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	780 possible to use non-ASCII characters in file names---or, at least, those
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	781 non-ASCII characters which the specified coding system can encode.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	782
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	783 If @code{file-name-coding-system} is @code{nil}, Emacs uses a default
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	784 coding system determined by the selected language environment. In the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	785 default language environment, any non-ASCII characters in file names are
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	786 not encoded specially; they appear in the file system using the internal
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	787 Emacs representation.
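
A minimal sketch of setting it once, early in an init file, so that it
does not change after files have been visited. The value is only an
example:

@smallexample
;; Encode file names as Latin-1 for all file operations.
(setq file-name-coding-system 'iso-8859-1)
@end smallexample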
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	788
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	789 @strong{Warning:} if you change @code{file-name-coding-system} (or the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	790 language environment) in the middle of an Emacs session, problems can
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	791 result if you have already visited files whose names were encoded using
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	792 the earlier coding system and cannot be encoded (or are encoded
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	793 differently) under the new coding system. If you try to save one of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	794 these buffers under the visited file name, saving may use the wrong file
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	795 name, or it may get an error. If such a problem happens, use @kbd{C-x
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	796 C-w} to specify a new file name for that buffer.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	797
26140 068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	798 @vindex locale-coding-system
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	799 The variable @code{locale-coding-system} specifies a coding system to
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	800 use when encoding and decoding system strings, such as system error
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	801 messages and the formats and time stamps of @code{format-time-string}. This
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	802 coding system should be compatible with the underlying system's coding
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	803 system, which is normally specified by the first environment variable in
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	804 the list @env{LC_ALL}, @env{LC_CTYPE}, @env{LANG} whose value is
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	805 nonempty.
068f7ad41d40 Describe new functions and variables for locales. Paul Eggert <eggert@twinsun.com> parents: 25829 diff changeset	806
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	807 @node Fontsets
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	808 @section Fontsets
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	809 @cindex fontsets
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	810
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	811 A font for X typically defines shapes for one alphabet or
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	812 script. Therefore, displaying the entire range of scripts that Emacs
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	813 supports requires a collection of many fonts. In Emacs, such a
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	814 collection is called a @dfn{fontset}. A fontset is defined by a list of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	815 fonts, each assigned to handle a range of character codes.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	816
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	817 Each fontset has a name, like a font. The available X fonts are
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	818 defined by the X server; fontsets, however, are defined within Emacs
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	819 itself. Once you have defined a fontset, you can use it within Emacs by
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	820 specifying its name, anywhere that you could use a single font. Of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	821 course, Emacs fontsets can use only the fonts that the X server
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	822 supports; if certain characters appear on the screen as hollow boxes,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	823 this means that the fontset in use for them has no font for those
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	824 characters.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	825
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	826 Emacs creates two fontsets automatically: the @dfn{standard fontset}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	827 and the @dfn{startup fontset}. The standard fontset is most likely to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	828 have fonts for a wide variety of non-ASCII characters; however, this is
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	829 not the default for Emacs to use. (By default, Emacs tries to find a
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	830 font which has bold and italic variants.) You can specify use of the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	831 standard fontset with the @samp{-fn} option, or with the @samp{Font} X
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	832 resource (@pxref{Font X}). For example,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	833
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	834 @example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	835 emacs -fn fontset-standard
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	836 @end example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	837
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	838 A fontset does not necessarily specify a font for every character
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	839 code. If a fontset specifies no font for a certain character, or if it
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	840 specifies a font that does not exist on your system, then it cannot
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	841 display that character properly. It will display that character as an
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	842 empty box instead.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	843
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	844 @vindex highlight-wrong-size-font
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	845 The fontset height and width are determined by the ASCII characters
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	846 (that is, by the font used for ASCII characters in that fontset). If
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	847 another font in the fontset has a different height, or a different
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	848 width, then characters assigned to that font are clipped to the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	849 fontset's size. If @code{highlight-wrong-size-font} is non-@code{nil},
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	850 a box is displayed around these wrong-size characters as well.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	851
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	852 @node Defining Fontsets
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	853 @section Defining Fontsets
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	854
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	855 @vindex standard-fontset-spec
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	856 @cindex standard fontset
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	857 Emacs creates a standard fontset automatically according to the value
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	858 of @code{standard-fontset-spec}. This fontset's name is
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	859
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	860 @example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	861 -*-fixed-medium-r-normal-*-16-*-*-*-*-*-fontset-standard
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	862 @end example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	863
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	864 @noindent
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	865 or just @samp{fontset-standard} for short.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	866
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	867 Bold, italic, and bold-italic variants of the standard fontset are
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	868 created automatically. Their names have @samp{bold} instead of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	869 @samp{medium}, or @samp{i} instead of @samp{r}, or both.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	870
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	871 @cindex startup fontset
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	872 If you specify a default ASCII font with the @samp{Font} resource or
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	873 the @samp{-fn} argument, Emacs generates a fontset from it
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	874 automatically. This is the @dfn{startup fontset} and its name is
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	875 @code{fontset-startup}. Emacs does this by replacing the @var{foundry},
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	876 @var{family}, @var{add_style}, and @var{average_width} fields of the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	877 font name with @samp{*}, replacing the @var{charset_registry} field with
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	878 @samp{fontset}, and replacing the @var{charset_encoding} field with
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	879 @samp{startup}, then using the resulting string to specify a fontset.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	880
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	881 For instance, if you start Emacs this way,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	882
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	883 @example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	884 emacs -fn "*courier-medium-r-normal--14-140-*-iso8859-1"
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	885 @end example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	886
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	887 @noindent
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	888 Emacs generates the following fontset and uses it for the initial X
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	889 window frame:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	890
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	891 @example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	892 -*-*-medium-r-normal-*-14-140-*-*-*-*-fontset-startup
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	893 @end example
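
The field replacement that produces the startup fontset name can be sketched as follows. This is a minimal illustration of the rule as stated, not the function Emacs uses internally; @code{my-startup-fontset-name} is a hypothetical helper, and the font name passed to it is an assumed, fully specified 14-field X font name. The sketch relies on @code{split-string} keeping empty fields when given an explicit separator, as recent Emacs versions do.

```elisp
;; Hypothetical helper illustrating the rule above; not part of Emacs.
(defun my-startup-fontset-name (xlfd)
  "Return the startup fontset name derived from the X font name XLFD."
  (let ((fields (split-string xlfd "-")))
    ;; Element 0 is the empty string before the leading hyphen, so
    ;; field N of the font name is element N of the list.
    (dolist (i '(1 2 6 12))   ; foundry, family, add_style, average_width
      (setcar (nthcdr i fields) "*"))
    (setcar (nthcdr 13 fields) "fontset")   ; charset_registry
    (setcar (nthcdr 14 fields) "startup")   ; charset_encoding
    (mapconcat 'identity fields "-")))

(my-startup-fontset-name
 "-adobe-courier-medium-r-normal--14-140-75-75-m-90-iso8859-1")
;; => "-*-*-medium-r-normal-*-14-140-75-75-m-*-fontset-startup"
```

Fields that are already wildcards in the font name you give, such as the resolution fields in the @samp{-fn} example above, simply stay wildcards in the result.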
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	894
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	895 With the X resource @samp{Emacs.Font}, you can specify a fontset name
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	896 just like an actual font name. But be careful not to specify a fontset
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	897 name in a wildcard resource like @samp{Emacs*Font}---that wildcard
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	898 specification applies to various other purposes, such as menus, and
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	899 menus cannot handle fontsets.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	900
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	901 You can specify additional fontsets using X resources named
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	902 @samp{Fontset-@var{n}}, where @var{n} is an integer starting from 0.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	903 The resource value should have this form:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	904
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	905 @smallexample
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	906 @var{fontpattern}, @r{[}@var{charsetname}:@var{fontname}@r{]@dots{}}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	907 @end smallexample
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	908
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	909 @noindent
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	910 @var{fontpattern} should have the form of a standard X font name, except
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	911 for the last two fields. They should have the form
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	912 @samp{fontset-@var{alias}}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	913
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	914 The fontset has two names, one long and one short. The long name is
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	915 @var{fontpattern}. The short name is @samp{fontset-@var{alias}}. You
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	916 can refer to the fontset by either name.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	917
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	918 The construct @samp{@var{charset}:@var{font}} specifies which font to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	919 use (in this fontset) for one particular character set. Here,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	920 @var{charset} is the name of a character set, and @var{font} is the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	921 font to use for that character set. You can use this construct any
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	922 number of times in defining one fontset.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	923
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	924 For the other character sets, Emacs chooses a font based on
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	925 @var{fontpattern}. It replaces @samp{fontset-@var{alias}} with values
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	926 that describe the character set. For the ASCII character font,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	927 @samp{fontset-@var{alias}} is replaced with @samp{ISO8859-1}.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	928
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	929 In addition, when several consecutive fields are wildcards, Emacs
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	930 collapses them into a single wildcard. This is to prevent use of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	931 auto-scaled fonts. Fonts made by scaling larger fonts are not usable
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	932 for editing, and scaling a smaller font is not useful because it is
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	933 better to use the smaller font in its own size, which Emacs does.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	934
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	935 Thus if @var{fontpattern} is this,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	936
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	937 @example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	938 -*-fixed-medium-r-normal-*-24-*-*-*-*-*-fontset-24
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	939 @end example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	940
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	941 @noindent
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	942 the font specification for ASCII characters would be this:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	943
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	944 @example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	945 -*-fixed-medium-r-normal-*-24-*-ISO8859-1
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	946 @end example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	947
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	948 @noindent
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	949 and the font specification for Chinese GB2312 characters would be this:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	950
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	951 @example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	952 -*-fixed-medium-r-normal-*-24-*-gb2312*-*
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	953 @end example
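
For the ASCII case, the two steps described above (substituting the character set for @samp{fontset-@var{alias}}, then collapsing consecutive wildcard fields) can be sketched like this. The helper @code{my-ascii-font-pattern} is hypothetical and only mirrors the description; the real computation inside Emacs may differ in detail.

```elisp
;; Hypothetical helper illustrating the two steps; not part of Emacs.
(defun my-ascii-font-pattern (fontpattern)
  "Return the ASCII font specification derived from FONTPATTERN."
  (let ((pattern (replace-regexp-in-string
                  ;; Step 1: replace the trailing fontset-ALIAS fields.
                  "fontset-[^-]+\\'" "ISO8859-1" fontpattern t t)))
    ;; Step 2: collapse each run of wildcard fields into one wildcard.
    (replace-regexp-in-string "\\(-\\*\\)+" "-*" pattern t t)))

(my-ascii-font-pattern
 "-*-fixed-medium-r-normal-*-24-*-*-*-*-*-fontset-24")
;; => "-*-fixed-medium-r-normal-*-24-*-ISO8859-1"
```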
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	954
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	955 You may not have any Chinese font matching the above font
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	956 specification. Most X distributions include only Chinese fonts that
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	957 have @samp{song ti} or @samp{fangsong ti} in the @var{family} field. In
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	958 such a case, @samp{Fontset-@var{n}} can be specified as below:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	959
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	960 @smallexample
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	961 Emacs.Fontset-0: -*-fixed-medium-r-normal-*-24-*-*-*-*-*-fontset-24,\
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	962 chinese-gb2312:-*-*-medium-r-normal-*-24-*-gb2312*-*
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	963 @end smallexample
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	964
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	965 @noindent
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	966 Then, the font specifications for all but Chinese GB2312 characters have
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	967 @samp{fixed} in the @var{family} field, and the font specification for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	968 Chinese GB2312 characters has a wild card @samp{*} in the @var{family}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	969 field.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	970
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	971 @findex create-fontset-from-fontset-spec
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	972 The function that processes the fontset resource value to create the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	973 fontset is called @code{create-fontset-from-fontset-spec}. You can also
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	974 call this function explicitly to create a fontset.
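
As a sketch of such an explicit call, the following form passes the same value that the @samp{Fontset-0} resource above would carry, written as a single string. It assumes Emacs is running under X and that the X server has fonts matching both patterns; if it does not, the call may fail or the resulting fontset may be incomplete.

```elisp
;; Only meaningful under a window system, hence the guard.
(if window-system
    (create-fontset-from-fontset-spec
     ;; concat is used only to keep the lines short; the function
     ;; receives one comma-separated string.
     (concat "-*-fixed-medium-r-normal-*-24-*-*-*-*-*-fontset-24,"
             "chinese-gb2312:-*-*-medium-r-normal-*-24-*-gb2312*-*")))
```

Afterwards the fontset should be usable under the short name @samp{fontset-24}.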
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	975
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	976 @xref{Font X}, for more information about font naming in X.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	977
27211 0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	978 @node Single-Byte Character Support
0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	979 @section Single-byte Character Set Support
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	980
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	981 @cindex European character sets
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	982 @cindex accented characters
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	983 @cindex ISO Latin character sets
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	984 @cindex Unibyte operation
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	985 @vindex enable-multibyte-characters
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	986 The ISO 8859 Latin-@var{n} character sets define character codes in
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	987 the range 160 to 255 to handle the accented letters and punctuation
27211 0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	988 needed by various European languages (and some non-European ones).
0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	989 If you disable multibyte
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	990 characters, Emacs can still handle @emph{one} of these character sets
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	991 at a time. To specify @emph{which} of these sets to use, invoke
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	992 @kbd{M-x set-language-environment} and specify a suitable language
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	993 environment such as @samp{Latin-@var{n}}.
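
The same choice can be made from an initialization file. A minimal sketch, assuming Latin-1 is the set you want:

```elisp
;; Non-interactive equivalent of M-x set-language-environment.
(set-language-environment "Latin-1")
```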
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	994
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	995 For more information about unibyte operation, see @ref{Enabling
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	996 Multibyte}. Note particularly that you probably want to ensure that
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	997 your initialization files are read as unibyte if they contain non-ASCII
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	998 characters.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	999
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1000 @vindex unibyte-display-via-language-environment
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1001 Emacs can also display those characters, provided the terminal or font
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1002 in use supports them. This works automatically. Alternatively, if you
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1003 are using a window system, Emacs can also display single-byte characters
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1004 through fontsets, in effect by displaying the equivalent multibyte
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1005 characters according to the current language environment. To request
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1006 this, set the variable @code{unibyte-display-via-language-environment}
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1007 to a non-@code{nil} value.
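
For example, an initialization file might contain the following line. This is a sketch for the Emacs version this manual describes; the variable has an effect only for unibyte text displayed under a window system.

```elisp
;; Display single-byte non-ASCII characters through fontsets.
(setq unibyte-display-via-language-environment t)
```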
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1008
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1009 @cindex @code{iso-ascii} library
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1010 If your terminal does not support display of the Latin-1 character
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1011 set, Emacs can display these characters as ASCII sequences which at
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1012 least give you a clear idea of what the characters are. To do this,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1013 load the library @code{iso-ascii}. Similar libraries for other
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1014 Latin-@var{n} character sets could be implemented, but we don't have
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1015 them yet.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1016
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1017 @findex standard-display-8bit
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1018 @cindex 8-bit display
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1019 Normally non-ISO-8859 characters (those with codes 128 through 159
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1020 inclusive) are displayed as octal escapes. You can change this for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1021 non-standard ``extended'' versions of ISO-8859 character sets by using the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1022 function @code{standard-display-8bit} in the @code{disp-table} library.
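
A minimal sketch of such a call, assuming your font or terminal really does define printable glyphs for these codes:

```elisp
;; Load the library that defines the function, then ask Emacs to
;; display codes 128 through 159 literally instead of as octal escapes.
(require 'disp-table)
(standard-display-8bit 128 159)
```

The two arguments are the low and high ends of the range; a narrower range can be given if only some of the codes are printable in your character set.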
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1023
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1024 There are three different ways you can input single-byte non-ASCII
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1025 characters:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1026
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1027 @itemize @bullet
27211 0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	1028 @cindex 8-bit input
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1029 @item
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1030 If your keyboard can generate character codes 128 and up, representing
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1031 non-ASCII characters, execute the following expression to enable Emacs to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1032 understand them:
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1033
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1034 @example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1035 (set-input-mode (car (current-input-mode))
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1036 (nth 1 (current-input-mode))
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1037 0)
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1038 @end example
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1039
27211 0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	1040 It is not necessary to do this under a window system which can
0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	1041 distinguish 8-bit characters and Meta keys. If you do this on a normal
0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	1042 terminal, you will probably need to use @kbd{ESC} to type Meta
0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	1043 characters.@footnote{In some cases, such as the Linux console and
0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	1044 @code{xterm}, you can arrange for Meta to be converted to @kbd{ESC} and
0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	1045 still be able to type 8-bit characters present directly on the keyboard or
0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	1046 using @kbd{Compose} or @kbd{AltGr} keys.} @xref{User Input}.
0699f691fac1 Don't conflate single-byte with European. Dave Love <fx@gnu.org> parents: 27156 diff changeset	1047
25829 ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1048 @item
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1049 You can use an input method for the selected language environment.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1050 @xref{Input Methods}. When you use an input method in a unibyte buffer,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1051 the non-ASCII character you specify with it is converted to unibyte.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1052
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1053 @kindex C-x 8
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1054 @cindex @code{iso-transl} library
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1055 @item
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1056 For Latin-1 only, you can use the
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1057 key @kbd{C-x 8} as a ``compose character'' prefix for entry of
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1058 non-ASCII Latin-1 printing characters. @kbd{C-x 8} is good for
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1059 insertion (in the minibuffer as well as other buffers), for searching,
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1060 and in any other context where a key sequence is allowed.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1061
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1062 @kbd{C-x 8} works by loading the @code{iso-transl} library. Once that
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1063 library is loaded, the @key{ALT} modifier key, if you have one, serves
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1064 the same purpose as @kbd{C-x 8}; use @key{ALT} together with an accent
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1065 character to modify the following letter. In addition, if you have keys
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1066 for the Latin-1 ``dead accent characters'', they too are defined to
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1067 compose with the following character, once @code{iso-transl} is loaded.
ac7e9e5e2ccb # Dave Love <fx@gnu.org> parents: diff changeset	1068 @end itemize
