emacs: lispref/nonascii.texi comparison

comparison lispref/nonascii.texi @ 32523:4881cd839f12

*** empty log message ***

author	Gerd Moellmann <gerd@gnu.org>
date	Mon, 16 Oct 2000 11:43:01 +0000
parents	d831c2ad9313
children	67b6bdbd95c6

comparison

equal deleted inserted replaced

-:fedf4de246a1
+:4881cd839f12
 @dfn{leading codes}.  The second and subsequent bytes of a multibyte
 character are always in the range 160 through 255 (octal 0240 through
 0377); these values are @dfn{trailing codes}.
 Some sequences of bytes are not valid in multibyte text: for example,
-a single isolated byte in the range 128 through 159 is not allowed.
+a single isolated byte in the range 128 through 159 is not allowed.  But
-But character codes 128 through 159 can appear in multibyte text,
+character codes 128 through 159 can appear in multibyte text,
-represented as two-byte sequences.  None of the character codes 128
+represented as two-byte sequences.  All the character codes 128 through
-through 255 normally appear in ordinary multibyte text, but they do
+255 are possible (though slightly abnormal) in multibyte text; they
 appear in multibyte buffers and strings when you do explicit encoding
 and decoding (@pxref{Explicit Encoding}).
 In a buffer, the buffer-local value of the variable
 @code{enable-multibyte-characters} specifies the representation used.
 alternative, to convert the buffer contents to multibyte, is not
 acceptable because the buffer's representation is a choice made by the
 user that cannot be overridden automatically.
 Converting unibyte text to multibyte text leaves @sc{ascii} characters
-unchanged, and likewise 128 through 159.  It converts the non-@sc{ascii}
+unchanged, and likewise character codes 128 through 159.  It converts
-codes 160 through 255 by adding the value @code{nonascii-insert-offset}
+the non-@sc{ascii} codes 160 through 255 by adding the value
-to each character code.  By setting this variable, you specify which
+@code{nonascii-insert-offset} to each character code.  By setting this
-character set the unibyte characters correspond to (@pxref{Character
+variable, you specify which character set the unibyte characters
-Sets}).  For example, if @code{nonascii-insert-offset} is 2048, which is
+correspond to (@pxref{Character Sets}).  For example, if
-@code{(- (make-char 'latin-iso8859-1) 128)}, then the unibyte
+@code{nonascii-insert-offset} is 2048, which is @code{(- (make-char
-non-@sc{ascii} characters correspond to Latin 1.  If it is 2688, which
+'latin-iso8859-1) 128)}, then the unibyte non-@sc{ascii} characters
-is @code{(- (make-char 'greek-iso8859-7) 128)}, then they correspond to
+correspond to Latin 1.  If it is 2688, which is @code{(- (make-char
-Greek letters.
+'greek-iso8859-7) 128)}, then they correspond to Greek letters.
 Converting multibyte text to unibyte is simpler: it discards all but
 the low 8 bits of each character code.  If @code{nonascii-insert-offset}
 has a reasonable value, corresponding to the beginning of some character
 set, this conversion is the inverse of the other: converting unibyte
 The unibyte and multibyte text representations use different character
 codes.  The valid character codes for unibyte representation range from
 0 to 255---the values that can fit in one byte.  The valid character
 codes for multibyte representation range from 0 to 524287, but not all
 values in that range are valid.  The values 128 through 255 are not
-really proper in multibyte text, but they can occur if you do explicit
+entirely proper in multibyte text, but they can occur if you do explicit
 encoding and decoding (@pxref{Explicit Encoding}).  Some other character
 codes cannot occur at all in multibyte text.  Only the @sc{ascii} codes
-0 through 127 are truly legitimate in both representations.
+0 through 127 are completely legitimate in both representations.
 @defun char-valid-p charcode &optional genericp
 This returns @code{t} if @var{charcode} is valid for either one of the two
 text representations.

Mercurial > emacs

comparison lispref/nonascii.texi @ 32523:4881cd839f12