emacs: lispref/nonascii.texi comparison

comparison lispref/nonascii.texi @ 29265:69f20c18d6eb

*** empty log message ***

author	Kenichi Handa <handa@m17n.org>
date	Sun, 28 May 2000 23:54:22 +0000
parents	ac620ff5fd5d
children	d831c2ad9313


-:1e4a5ffdacf5
+:69f20c18d6eb
 @defvar nonascii-insert-offset
 This variable specifies the amount to add to a non-@sc{ascii} character
 when converting unibyte text to multibyte.  It also applies when
 @code{self-insert-command} inserts a character in the unibyte
 non-@sc{ascii} range, 128 through 255.  However, the function
-@code{insert-char} does not perform this conversion.
+@code{insert} and @code{insert-char} do not perform this conversion.
 The right value to use to select character set @var{cs} is @code{(-
 (make-char @var{cs}) 128)}.  If the value of
 @code{nonascii-insert-offset} is zero, then conversion actually uses the
 value for the Latin 1 character set, rather than zero.
 @defvar nonascii-translation-table
 This variable provides a more general alternative to
 @code{nonascii-insert-offset}.  You can use it to specify independently
 how to translate each code in the range of 128 through 255 into a
-multibyte character.  The value should be a vector, or @code{nil}.
+multibyte character.  The value should be a char-table, or @code{nil}.
 If this is non-@code{nil}, it overrides @code{nonascii-insert-offset}.
 @end defvar
 @defun string-make-unibyte string
 This function converts the text of @var{string} to unibyte
 This function leaves the buffer contents unchanged when viewed as a
 sequence of bytes.  As a consequence, it can change the contents viewed
 as characters; a sequence of two bytes which is treated as one character
 in multibyte representation will count as two characters in unibyte
-representation.
+representation.  Character codes 128 through 159 are an exception.  They
+are represented by one byte in a unibyte buffer, but when the buffer is
+set to multibyte, they are converted to two-byte sequences, and vice
+versa.
 This function sets @code{enable-multibyte-characters} to record which
 representation is in use.  It also adjusts various data in the buffer
 (including overlays, text properties and markers) so that they cover the
 same text as they did before.
 really proper in multibyte text, but they can occur if you do explicit
 encoding and decoding (@pxref{Explicit Encoding}).  Some other character
 codes cannot occur at all in multibyte text.  Only the @sc{ascii} codes
 0 through 127 are truly legitimate in both representations.
-@defun char-valid-p charcode
+@defun char-valid-p charcode &optional genericp
 This returns @code{t} if @var{charcode} is valid for either one of the two
 text representations.
 @example
 (char-valid-p 65)
 (char-valid-p 256)
 @result{} nil
 (char-valid-p 2248)
 @result{} t
 @end example
+If the optional argument @var{genericp} is non-nil, this function
+returns @code{t} if @var{charcode} is a generic character
+(@pxref{Generic Character}).
 @end defun
 @node Character Sets
 @section Character Sets
 @cindex character sets
 @defun charset-plist charset
 @tindex charset-plist
 This function returns the charset property list of the character set
 @var{charset}.  Although @var{charset} is a symbol, this is not the same
 as the property list of that symbol.  Charset properties are used for
-special purposes within Emacs; for example, @code{x-charset-registry}
+special purposes within Emacs; for example,
-helps determine which fonts to use (@pxref{Font Selection}).
+@code{preferred-coding-system} helps determine which coding system to
+use to encode characters in a charset.
 @end defun
 @node Chars and Bytes
 @section Characters and Bytes
 @cindex bytes and characters
 @cindex introduction sequence
 @cindex dimension (of character set)
 In multibyte representation, each character occupies one or more
 bytes.  Each character set has an @dfn{introduction sequence}, which is
 normally one or two bytes long.  (Exception: the @sc{ascii} character
-set has a zero-length introduction sequence.)  The introduction sequence
+set and the @sc{eight-bit-graphic} character set have a zero-length
-is the beginning of the byte sequence for any character in the character
+introduction sequence.)  The introduction sequence is the beginning of
-set.  The rest of the character's bytes distinguish it from the other
+the byte sequence for any character in the character set.  The rest of
-characters in the same character set.  Depending on the character set,
+the character's bytes distinguish it from the other characters in the
-there are either one or two distinguishing bytes; the number of such
+same character set.  Depending on the character set, there are either
-bytes is called the @dfn{dimension} of the character set.
+one or two distinguishing bytes; the number of such bytes is called the
+@dfn{dimension} of the character set.
 @defun charset-dimension charset
 This function returns the dimension of @var{charset}; at present, the
 dimension is always 1 or 2.
 @end defun
 @example
 (split-char 2248)
 @result{} (latin-iso8859-1 72)
 (split-char 65)
 @result{} (ascii 65)
-@end example
+(split-char 128)
+@result{} (eight-bit-control 128)
-Unibyte non-@sc{ascii} characters are considered as part of
-the @code{ascii} character set:
-@example
-(split-char 192)
-@result{} (ascii 192)
 @end example
 @end defun
 @defun make-char charset &rest byte-values
 This function returns the character in character set @var{charset}
 @example
 (make-char 'latin-iso8859-1)
 @result{} 2176
 (char-valid-p 2176)
 @result{} nil
+(char-valid-p 2176 t)
+@result{} t
 (split-char 2176)
 @result{} (latin-iso8859-1 0)
 @end example
+The character sets @sc{ascii}, @sc{eight-bit-control}, and
+@sc{eight-bit-graphic} don't have corresponding generic characters.
 @node Scanning Charsets
 @section Scanning for Character Sets
 Sometimes it is useful to find out which character sets appear in a
 However, @code{buffer-file-coding-system} does not affect sending text
 to a subprocess.
 @end defvar
 @defvar save-buffer-coding-system
-This variable specifies the coding system for saving the buffer---but it
+This variable specifies the coding system for saving the buffer (by
-is not used for @code{write-region}.
+overriding @code{buffer-file-coding-system}).  Note that it is not used
+for @code{write-region}.
 When a command to save the buffer starts out to use
-@code{save-buffer-coding-system}, and that coding system cannot handle
+@code{buffer-file-coding-system} (or @code{save-buffer-coding-system}),
+and that coding system cannot handle
 the actual text in the buffer, the command asks the user to choose
 another coding system.  After that happens, the command also updates
-@code{save-buffer-coding-system} to represent the coding system that the
+@code{buffer-file-coding-system} to represent the coding system that the
 user specified.
 @end defvar
 @defvar last-coding-system-used
 I/O operations for files and subprocesses set this variable to the
 Here are the Lisp facilities for working with coding systems:
 @defun coding-system-list &optional base-only
 This function returns a list of all coding system names (symbols).  If
 @var{base-only} is non-@code{nil}, the value includes only the
-base coding systems.  Otherwise, it includes variant coding systems as well.
+base coding systems.  Otherwise, it includes alias and variant coding
+systems as well.
 @end defun
 @defun coding-system-p object
 This function returns @code{t} if @var{object} is a coding system
 name.
