Mercurial > emacs
changeset 105817:1177db74f5b5
* mule.texi (Charsets): Numerous copyedits. Don't discuss the
`charset' property, which is irrelevant to the user manual (Bug#3526).
author | Chong Yidong <cyd@stupidchicken.com> |
---|---|
date | Sat, 31 Oct 2009 19:30:23 +0000 |
parents | fecb02e16f64 |
children | 6251a7f3c0a4 |
files | doc/emacs/ChangeLog doc/emacs/mule.texi |
diffstat | 2 files changed, 35 insertions(+), 31 deletions(-) [+] |
line wrap: on
line diff
--- a/doc/emacs/ChangeLog Sat Oct 31 18:43:37 2009 +0000 +++ b/doc/emacs/ChangeLog Sat Oct 31 19:30:23 2009 +0000 @@ -1,3 +1,8 @@ +2009-10-31 Chong Yidong <cyd@stupidchicken.com> + + * mule.texi (Charsets): Numerous copyedits. Don't discuss the + `charset' property, which is irrelevant to the user manual (Bug#3526). + 2009-10-14 Juanma Barranquero <lekktu@gmail.com> * trouble.texi (DEL Does Not Delete): Fix typo.
--- a/doc/emacs/mule.texi Sat Oct 31 18:43:37 2009 +0000 +++ b/doc/emacs/mule.texi Sat Oct 31 19:30:23 2009 +0000 @@ -1608,51 +1608,50 @@ @section Charsets @cindex charsets - Emacs defines most of popular character sets (e.g. ascii, -iso-8859-1, cp1250, big5, unicode) as @dfn{charsets} and a few of its -own charsets (e.g. emacs, unicode-bmp, eight-bit). All supported -characters belong to one or more charsets. Usually you don't have to -take care of ``charset'', but knowing about it may help understanding -the behavior of Emacs in some cases. + In Emacs, @dfn{charset} is short for ``character set''. Emacs +supports most popular charsets (such as @code{ascii}, +@code{iso-8859-1}, @code{cp1250}, @code{big5}, and @code{unicode}), in +addition to some charsets of its own (such as @code{emacs}, +@code{unicode-bmp}, and @code{eight-bit}). All supported characters +belong to one or more charsets. - One example is a font selection. In each language environment, -charsets have different priorities. Emacs, at first, tries to use a -font that matches with charsets of higher priority. For instance, in -Japanese language environment, the charset @code{japanese-jisx0208} -has the highest priority (@pxref{Describe Language Environment}). So, -Emacs tries to use a font whose @code{registry} property is -``JISX0208.1983-0'' for characters belonging to that charset. + Emacs normally ``does the right thing'' with respect to charsets, so +that you don't have to worry about them. However, it is sometimes +helpful to know some of the underlying details about charsets. - Another example is a use of @code{charset} text property. When -Emacs reads a file encoded in a coding systems that uses escape -sequences to switch charsets (e.g. iso-2022-int-1), the buffer text -keep the information of the original charset by @code{charset} text -property. By using this information, Emacs can write the file with -the same byte sequence as the original. + One example is font selection (@pxref{Font X}). Each language +environment (@pxref{Language Environments}) defines a ``priority +list'' for the various charsets. When searching for a font, Emacs +initially attempts to find one that can display the highest-priority +charsets. For instance, in the Japanese language environment, the +charset @code{japanese-jisx0208} has the highest priority, so Emacs +tries to use a font whose @code{registry} property is +@samp{JISX0208.1983-0}. @findex list-charset-chars @cindex characters in a certain charset @findex describe-character-set - There are two commands for obtaining information about Emacs + There are two commands that can be used to obtain information about charsets. The command @kbd{M-x list-charset-chars} prompts for a charset name, and displays all the characters in that character set. The command @kbd{M-x describe-character-set} prompts for a charset -name and displays information about that charset, including its +name, and displays information about that charset, including its internal representation within Emacs. @findex list-character-sets - To display a list of all the supported charsets, type @kbd{M-x + To display a list of all supported charsets, type @kbd{M-x list-character-sets}. The list gives the names of charsets and -additional information to identity each charset (see ISO/IEC's this -page <http://www.itscj.ipsj.or.jp/ISO-IR/> for the detail). In the -list, charsets are categorized into two; the normal charsets are -listed first, and the supplementary charsets are listed last. A -charset in the latter category is used for defining another charset -(as a parent or a subset), or was used only in Emacs of the older -versions. +additional information to identity each charset (see +@url{http://www.itscj.ipsj.or.jp/ISO-IR/} for details). In this list, +charsets are divided into two categories: @dfn{normal charsets} are +listed first, followed by @dfn{supplementary charsets}. A +supplementary charset is one that is used to define another charset +(as a parent or a subset), or to provide backward-compatibility for +older Emacs versions. - To find out which charset a character in the buffer belongs to, -put point before it and type @kbd{C-u C-x =}. + To find out which charset a character in the buffer belongs to, put +point before it and type @kbd{C-u C-x =} (@pxref{International +Chars}). @ignore arch-tag: 310ba60d-31ef-4ce7-91f1-f282dd57b6b3