Mercurial > emacs
annotate lisp/international/characters.el @ 89222:5db2e48ffbcb
(byte-compile-lapcode): Be sure to return a unibyte string.
author | Kenichi Handa <handa@m17n.org> |
---|---|
date | Wed, 16 Oct 2002 02:16:02 +0000 |
parents | f0d0b7118166 |
children | b73d29dcc8c0 |
rev | line source |
---|---|
17052 | 1 ;;; characters.el --- set syntax and category for multibyte characters |
2 | |
19958
f7ab34ae92bb
Set case-table for Cyrillic characters.
Richard M. Stallman <rms@gnu.org>
parents:
18561
diff
changeset
|
3 ;; Copyright (C) 1995, 1997 Electrotechnical Laboratory, JAPAN. |
18377
8b4a66c66dd6
Change copyright notice.
Richard M. Stallman <rms@gnu.org>
parents:
18304
diff
changeset
|
4 ;; Licensed to the Free Software Foundation. |
88612 | 5 ;; Copyright (C) 2001, 2002 Free Software Foundation, Inc. |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
6 ;; Copyright (C) 2001, 2002 |
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
7 ;; National Institute of Advanced Industrial Science and Technology (AIST) |
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
8 ;; Registration Number H13PRO009 |
17052 | 9 |
10 ;; Keywords: multibyte character, character set, syntax, category | |
11 | |
12 ;; This file is part of GNU Emacs. | |
13 | |
14 ;; GNU Emacs is free software; you can redistribute it and/or modify | |
15 ;; it under the terms of the GNU General Public License as published by | |
16 ;; the Free Software Foundation; either version 2, or (at your option) | |
17 ;; any later version. | |
18 | |
19 ;; GNU Emacs is distributed in the hope that it will be useful, | |
20 ;; but WITHOUT ANY WARRANTY; without even the implied warranty of | |
21 ;; MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the | |
22 ;; GNU General Public License for more details. | |
23 | |
24 ;; You should have received a copy of the GNU General Public License | |
17071 | 25 ;; along with GNU Emacs; see the file COPYING. If not, write to the |
26 ;; Free Software Foundation, Inc., 59 Temple Place - Suite 330, | |
27 ;; Boston, MA 02111-1307, USA. | |
17052 | 28 |
29 ;;; Commentary: | |
30 | |
31 ;; This file contains multibyte characters. Save this file always in | |
18197
90235db802c7
Comment changed (iso-2022-7 -> iso-2022-7bit).
Kenichi Handa <handa@m17n.org>
parents:
17753
diff
changeset
|
32 ;; the coding system `iso-2022-7bit'. |
17052 | 33 |
18205
40dc3b2e9e27
Delete syntax stuff for Latin-1 and Latin-2.
Richard M. Stallman <rms@gnu.org>
parents:
18197
diff
changeset
|
34 ;; This file does not define the syntax for Latin-N character sets; |
40dc3b2e9e27
Delete syntax stuff for Latin-1 and Latin-2.
Richard M. Stallman <rms@gnu.org>
parents:
18197
diff
changeset
|
35 ;; those are defined by the files latin-N.el. |
40dc3b2e9e27
Delete syntax stuff for Latin-1 and Latin-2.
Richard M. Stallman <rms@gnu.org>
parents:
18197
diff
changeset
|
36 |
38414
67b464da13ec
Some fixes to follow coding conventions.
Pavel Janík <Pavel@Janik.cz>
parents:
36688
diff
changeset
|
37 ;;; Code: |
67b464da13ec
Some fixes to follow coding conventions.
Pavel Janík <Pavel@Janik.cz>
parents:
36688
diff
changeset
|
38 |
17052 | 39 ;;; Predefined categories. |
40 | |
41 ;; For each character set. | |
42 | |
43 (define-category ?a "ASCII") | |
44 (define-category ?l "Latin") | |
45 (define-category ?t "Thai") | |
46 (define-category ?g "Greek") | |
47 (define-category ?b "Arabic") | |
48 (define-category ?w "Hebrew") | |
49 (define-category ?y "Cyrillic") | |
50 (define-category ?k "Japanese katakana") | |
51 (define-category ?r "Japanese roman") | |
52 (define-category ?c "Chinese") | |
53 (define-category ?j "Japanese") | |
54 (define-category ?h "Korean") | |
55 (define-category ?e "Ethiopic (Ge'ez)") | |
56 (define-category ?v "Vietnamese") | |
57 (define-category ?i "Indian") | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
58 (define-category ?o "Lao") |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
59 (define-category ?q "Tibetan") |
17052 | 60 |
61 ;; For each group (row) of 2-byte character sets. | |
62 | |
22529 | 63 (define-category ?A "Alpha-numeric characters of 2-byte character sets") |
17052 | 64 (define-category ?C "Chinese (Han) characters of 2-byte character sets") |
22529 | 65 (define-category ?G "Greek characters of 2-byte character sets") |
17052 | 66 (define-category ?H "Japanese Hiragana characters of 2-byte character sets") |
67 (define-category ?K "Japanese Katakana characters of 2-byte character sets") | |
68 (define-category ?N "Korean Hangul characters of 2-byte character sets") | |
22529 | 69 (define-category ?Y "Cyrillic characters of 2-byte character sets") |
17052 | 70 (define-category ?I "Indian Glyphs") |
71 | |
72 ;; For phonetic classifications. | |
73 | |
74 (define-category ?0 "consonant") | |
18561
f3c28fd5118e
Modify docstring of char category
Kenichi Handa <handa@m17n.org>
parents:
18377
diff
changeset
|
75 (define-category ?1 "base (independent) vowel") |
17052 | 76 (define-category ?2 "upper diacritical mark (including upper vowel)") |
77 (define-category ?3 "lower diacritical mark (including lower vowel)") | |
78 (define-category ?4 "tone mark") | |
18561
f3c28fd5118e
Modify docstring of char category
Kenichi Handa <handa@m17n.org>
parents:
18377
diff
changeset
|
79 (define-category ?5 "symbol") |
17052 | 80 (define-category ?6 "digit") |
81 (define-category ?7 "vowel-modifying diacritical mark") | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
82 (define-category ?8 "vowel-signs") |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
83 (define-category ?9 "semivowel lower") |
17052 | 84 |
85 ;; For filling. | |
86 (define-category ?| "While filling, we can break a line at this character.") | |
87 | |
20247 | 88 ;; For indentation calculation. |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
89 (define-category ? |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
90 "This character counts as a space for indentation purposes.") |
20247 | 91 |
22529 | 92 ;; Keep the following for `kinsoku' processing. See comments in |
17052 | 93 ;; kinsoku.el. |
94 (define-category ?> "A character which can't be placed at beginning of line.") | |
95 (define-category ?< "A character which can't be placed at end of line.") | |
96 | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
97 ;; Combining |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
98 (define-category ?^ "Combining diacritic or mark") |
17052 | 99 |
100 ;;; Setting syntax and category. | |
101 | |
102 ;; ASCII | |
103 | |
88425
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
104 ;; All ASCII characters have the category `a' (ASCII) and `l' (Latin). |
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
105 (modify-category-entry '(32 . 127) ?a) |
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
106 (modify-category-entry '(32 . 127) ?l) |
17052 | 107 |
108 ;; Arabic character set | |
109 | |
110 (let ((charsets '(arabic-iso8859-6 | |
111 arabic-digit | |
112 arabic-1-column | |
113 arabic-2-column))) | |
114 (while charsets | |
88612 | 115 (map-charset-chars #'modify-category-entry (car charsets) ?b) |
17052 | 116 (setq charsets (cdr charsets)))) |
88425
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
117 (modify-category-entry '(#x600 . #x6ff) ?b) |
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
118 (modify-category-entry '(#xfb50 . #xfdff) ?b) |
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
119 (modify-category-entry '(#xfe70 . #xfefe) ?b) |
17052 | 120 |
88752 | 121 ;; Chinese characters (Unicode) |
122 (modify-category-entry '(#x3400 . #x9FAF) ?C) | |
123 (modify-category-entry '(#x3400 . #x9FAF) ?c) | |
124 (modify-category-entry '(#x3400 . #x9FAF) ?|) | |
125 (modify-category-entry '(#xF900 . #xFAFF) ?C) | |
126 (modify-category-entry '(#xF900 . #xFAFF) ?c) | |
127 (modify-category-entry '(#xF900 . #xFAFF) ?|) | |
128 | |
17052 | 129 ;; Chinese character set (GB2312) |
130 | |
88752 | 131 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2121 #x217E) |
132 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2221 #x227E) | |
133 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2921 #x297E) | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
134 (modify-syntax-entry ?\〔 "(〕") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
135 (modify-syntax-entry ?\〈 "(〉") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
136 (modify-syntax-entry ?\《 "(》") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
137 (modify-syntax-entry ?\「 "(ã€") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
138 (modify-syntax-entry ?\『 "(ã€") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
139 (modify-syntax-entry ?\〖 "(〗") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
140 (modify-syntax-entry ?\〠"(】") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
141 (modify-syntax-entry ?\〕 ")〔") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
142 (modify-syntax-entry ?\〉 ")〈") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
143 (modify-syntax-entry ?\》 ")《") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
144 (modify-syntax-entry ?\〠")「") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
145 (modify-syntax-entry ?\〠")『") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
146 (modify-syntax-entry ?\〗 ")〖") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
147 (modify-syntax-entry ?\】 ")ã€") |
89121 | 148 (modify-syntax-entry ?\〚 "(〛") |
149 (modify-syntax-entry ?\〛 ")〚") | |
17052 | 150 |
88723 | 151 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?c) |
152 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?|) | |
89014 | 153 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2330 #x2339) |
154 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2341 #x235A) | |
155 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2361 #x237A) | |
88752 | 156 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?H #x2421 #x247E) |
157 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?K #x2521 #x257E) | |
158 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?G #x2621 #x267E) | |
159 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?Y #x2721 #x277E) | |
160 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?C #x3021 #x7E7E) | |
17052 | 161 |
162 ;; Chinese character set (BIG5) | |
163 | |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
164 (map-charset-chars #'modify-category-entry 'big5 ?c) |
89014 | 165 (map-charset-chars #'modify-category-entry 'big5 ?C #xA259 #xA25F) |
166 (map-charset-chars #'modify-category-entry 'big5 ?C #xA440 #xC67E) | |
167 (map-charset-chars #'modify-category-entry 'big5 ?C #xC940 #xF9DF) | |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
168 (map-charset-chars #'modify-category-entry 'big5 ?|) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
169 |
17052 | 170 |
171 ;; Chinese character set (CNS11643) | |
172 | |
88723 | 173 (dolist (c '(chinese-cns11643-1 chinese-cns11643-2 chinese-cns11643-3 |
174 chinese-cns11643-4 chinese-cns11643-5 chinese-cns11643-6 | |
175 chinese-cns11643-7)) | |
176 (map-charset-chars #'modify-category-entry c ?c) | |
89014 | 177 (if (eq c 'chinese-cns11643-1) |
178 (map-charset-chars #'modify-category-entry c ?C #x4421 #x7E7E) | |
179 (map-charset-chars #'modify-category-entry c ?C)) | |
88723 | 180 (map-charset-chars #'modify-category-entry c ?|)) |
17052 | 181 |
182 ;; Cyrillic character set (ISO-8859-5) | |
183 | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
184 (modify-syntax-entry ?â„– ".") |
17052 | 185 |
186 ;; Ethiopic character set | |
187 | |
88425
d7bd9df1799d
Optimize calls of modify-category-entry.
Kenichi Handa <handa@m17n.org>
parents:
88403
diff
changeset
|
188 (modify-category-entry '(#x1200 . #x137b) ?e) |
88711 | 189 (let ((chars '(?á¡ ?ᢠ?ᣠ?ᤠ?ᥠ?ᦠ?᧠?ᨠ?ö ‡€ ?ö ‡‹ ?ö ‡Œ ?ö ‡ ?ö ‡Ž ?ö ‡))) |
18304
58b39b4108d5
Set syntaxes of Ethiopic characters.
Kenichi Handa <handa@m17n.org>
parents:
18205
diff
changeset
|
190 (while chars |
58b39b4108d5
Set syntaxes of Ethiopic characters.
Kenichi Handa <handa@m17n.org>
parents:
18205
diff
changeset
|
191 (modify-syntax-entry (car chars) ".") |
58b39b4108d5
Set syntaxes of Ethiopic characters.
Kenichi Handa <handa@m17n.org>
parents:
18205
diff
changeset
|
192 (setq chars (cdr chars)))) |
88612 | 193 (map-charset-chars #'modify-category-entry 'ethiopic ?e) |
17052 | 194 |
195 ;; Hebrew character set (ISO-8859-8) | |
196 | |
88612 | 197 (modify-syntax-entry #x5be ".") ; MAQAF |
198 (modify-syntax-entry #x5c0 ".") ; PASEQ | |
199 (modify-syntax-entry #x5c3 ".") ; SOF PASUQ | |
200 (modify-syntax-entry #x5f3 ".") ; GERESH | |
201 (modify-syntax-entry #x5f4 ".") ; GERSHAYIM | |
17052 | 202 |
203 ;; Indian character set (IS 13194 and other Emacs original Indian charsets) | |
204 | |
88612 | 205 (modify-category-entry '(#x901 . #x970) ?i) |
206 (map-charset-chars #'modify-category-entry 'indian-is13194 ?i) | |
207 (map-charset-chars #'modify-category-entry 'indian-2-column ?i) | |
17052 | 208 |
20843 | 209 |
17052 | 210 ;; Japanese character set (JISX0201-kana, JISX0201-roman, JISX0208, JISX0212) |
211 | |
88752 | 212 (map-charset-chars #'modify-category-entry 'katakana-jisx0201 ?k) |
213 | |
214 (map-charset-chars #'modify-category-entry 'latin-jisx0201 ?r) | |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
215 |
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
216 (dolist (l '(katakana-jisx0201 japanese-jisx0208 japanese-jisx0212)) |
88752 | 217 (map-charset-chars #'modify-category-entry l ?j) |
218 (map-charset-chars #'modify-category-entry l ?\|)) | |
17052 | 219 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
220 ;; Unicode equivalents of JISX0201-kana |
88752 | 221 (let ((range '(#xff61 . #xff9f))) |
222 (modify-category-entry range ?k) | |
223 (modify-category-entry range ?j) | |
224 (modify-category-entry range ?\|)) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
225 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
226 ;; Katakana block |
88752 | 227 (let ((range '(#x30a0 . #x30ff))) |
228 ;; ?K is double width, ?k isn't specified | |
229 (modify-category-entry range ?K) | |
230 (modify-category-entry range ?\|)) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
231 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
232 ;; Hiragana block |
88752 | 233 (let ((range '(#x3040 . #x309f))) |
234 ;; ?H is actually defined to be double width | |
235 ;;(modify-category-entry range ?H) | |
236 ;;(modify-category-entry range ?\|) | |
237 ) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
238 |
17052 | 239 ;; JISX0208 |
88752 | 240 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0208 "_" #x2121 #x227E) |
241 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0208 "_" #x2821 #x287E) | |
242 (let ((chars '(?ー ?ã‚› ?ã‚œ ?ヽ ?ヾ ?ã‚ ?ã‚ž ?〃 ?ä» ?々 ?〆 ?〇))) | |
88519
98ec5d812ccc
Set syntax/category for japanese-jisx0208.
Kenichi Handa <handa@m17n.org>
parents:
88492
diff
changeset
|
243 (dolist (elt chars) |
88612 | 244 (modify-syntax-entry (car chars) "w"))) |
88752 | 245 (modify-syntax-entry ?\( "()") |
246 (modify-syntax-entry ?\ï¼» "(ï¼½") | |
247 (modify-syntax-entry ?\ï½› "(ï½") | |
248 (modify-syntax-entry ?\「 "(ã€") | |
249 (modify-syntax-entry ?\『 "(ã€") | |
250 (modify-syntax-entry ?\) ")(") | |
251 (modify-syntax-entry ?\ï¼½ ")ï¼»") | |
252 (modify-syntax-entry ?\ï½ ")ï½›") | |
253 (modify-syntax-entry ?\〠")「") | |
254 (modify-syntax-entry ?\〠")『") | |
17052 | 255 |
88752 | 256 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?A #x2321 #x237E) |
257 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?H #x2421 #x247E) | |
258 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?K #x2521 #x257E) | |
259 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?G #x2621 #x267E) | |
260 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?Y #x2721 #x277E) | |
261 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?C #x3021 #x7E7E) | |
262 (modify-category-entry ?ー ?K) | |
263 (let ((chars '(?゛ ?゜))) | |
17052 | 264 (while chars |
265 (modify-category-entry (car chars) ?K) | |
266 (modify-category-entry (car chars) ?H) | |
267 (setq chars (cdr chars)))) | |
88752 | 268 (let ((chars '(?ヽ ?ヾ ?ã‚ ?ã‚ž ?〃 ?ä» ?々 ?〆 ?〇))) |
17052 | 269 (while chars |
270 (modify-category-entry (car chars) ?C) | |
271 (setq chars (cdr chars)))) | |
272 | |
273 ;; JISX0212 | |
88723 | 274 |
88752 | 275 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0212 "_" #x2121 #x237E) |
17052 | 276 |
277 ;; JISX0201-Kana | |
88723 | 278 |
88612 | 279 (let ((chars '(?。 ?、 ?・))) |
17052 | 280 (while chars |
281 (modify-syntax-entry (car chars) ".") | |
282 (setq chars (cdr chars)))) | |
283 | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
284 (modify-syntax-entry ?\ï½¢ "(ï½£") |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
285 (modify-syntax-entry ?\ï½£ "(ï½¢") |
36688
45b172fa5c17
Add category `j' for katakana-jisx0201. Fix syntaxes of
Kenichi Handa <handa@m17n.org>
parents:
34507
diff
changeset
|
286 |
17052 | 287 ;; Korean character set (KSC5601) |
288 | |
88723 | 289 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?h) |
88752 | 290 |
291 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2121 #x227E) | |
89117 | 292 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2621 #x277E) |
293 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2830 #x287E) | |
294 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2930 #x297E) | |
89014 | 295 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2330 #x2339) |
296 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2341 #x235A) | |
297 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2361 #x237A) | |
88752 | 298 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?G #x2521 #x257E) |
299 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?H #x2A21 #x2A7E) | |
300 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?K #x2B21 #x2B7E) | |
301 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?Y #x2C21 #x2C7E) | |
17052 | 302 |
88612 | 303 ;; Latin |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
304 |
88612 | 305 (modify-category-entry '(#x80 . #x024F) ?l) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
306 |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
307 ;; Lao character set |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
308 |
88612 | 309 (modify-category-entry '(#xe80 . #xeff) ?o) |
310 (map-charset-chars #'modify-category-entry 'lao ?o) | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
311 |
88612 | 312 (let ((deflist '(("àº-ຮ" "w" ?0) ; consonant |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
313 ("ະາຳຽເ-ໄ" "w" ?1) ; vowel base |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
314 ("ັິ-ືົà»" "w" ?2) ; vowel upper |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
315 ("ຸູ" "w" ?3) ; vowel lower |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
316 ("່-໋" "w" ?4) ; tone mark |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
317 ("ຼຽ" "w" ?9) ; semivowel lower |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
318 ("à»-à»™" "w" ?6) ; digit |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
319 ("ຯໆ" "_" ?5) ; symbol |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
320 )) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
321 elm chars len syntax category to ch i) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
322 (while deflist |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
323 (setq elm (car deflist)) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
324 (setq chars (car elm) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
325 len (length chars) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
326 syntax (nth 1 elm) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
327 category (nth 2 elm) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
328 i 0) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
329 (while (< i len) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
330 (if (= (aref chars i) ?-) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
331 (setq i (1+ i) |
20843 | 332 to (aref chars i)) |
333 (setq ch (aref chars i) | |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
334 to ch)) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
335 (while (<= ch to) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
336 (unless (string-equal syntax "w") |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
337 (modify-syntax-entry ch syntax)) |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
338 (modify-category-entry ch category) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
339 (setq ch (1+ ch))) |
20843 | 340 (setq i (1+ i))) |
17078
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
341 (setq deflist (cdr deflist)))) |
874b471d6635
Remove prefix "coding-system-" from coding system symbol names.
Kenichi Handa <handa@m17n.org>
parents:
17071
diff
changeset
|
342 |
17052 | 343 ;; Thai character set (TIS620) |
344 | |
88612 | 345 (modify-category-entry '(#xe00 . #xe7f) ?t) |
346 (map-charset-chars #'modify-category-entry 'thai-tis620 ?t) | |
17052 | 347 |
348 (let ((deflist '(;; chars syntax category | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
349 ("à¸-รลว-ฮ" "w" ?0) ; consonant |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
350 ("ฤฦะาำเ-ๅ" "w" ?1) ; vowel base |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
351 ("ัิ-ื็๎" "w" ?2) ; vowel upper |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
352 ("ุ-ฺ" "w" ?3) ; vowel lower |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
353 ("่-à¹" "w" ?4) ; tone mark |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
354 ("à¹-๙" "w" ?6) ; digit |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
355 ("ฯๆ฿à¹à¹šà¹›" "_" ?5) ; symbol |
17052 | 356 )) |
357 elm chars len syntax category to ch i) | |
358 (while deflist | |
359 (setq elm (car deflist)) | |
360 (setq chars (car elm) | |
361 len (length chars) | |
362 syntax (nth 1 elm) | |
363 category (nth 2 elm) | |
364 i 0) | |
365 (while (< i len) | |
366 (if (= (aref chars i) ?-) | |
367 (setq i (1+ i) | |
20843 | 368 to (aref chars i)) |
369 (setq ch (aref chars i) | |
17052 | 370 to ch)) |
371 (while (<= ch to) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
372 (unless (string-equal syntax "w") |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
373 (modify-syntax-entry ch syntax)) |
17052 | 374 (modify-category-entry ch category) |
375 (setq ch (1+ ch))) | |
20843 | 376 (setq i (1+ i))) |
17052 | 377 (setq deflist (cdr deflist)))) |
378 | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
379 ;; Tibetan character set |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
380 |
88612 | 381 (modify-category-entry '(#xf00 . #xfff) ?q) |
382 (map-charset-chars #'modify-category-entry 'tibetan ?q) | |
383 (map-charset-chars #'modify-category-entry 'tibetan-1-column ?q) | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
384 |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
385 (let ((deflist '(;; chars syntax category |
88711 | 386 ("ཀ-ཀྵཪ" "w" ?0) ; consonant |
387 ("à¾-ྐྵྺྻྼö€ö€‚" "w" ?0) ; | |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
388 ("ö„š-ö…ž" "w" ?0) ; |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
389 ("ö…¸-ö‡Œ" "w" ?0) ; |
88711 | 390 ("ིེཻོཽྀ" "w" ?2) ; upper vowel |
391 ("ཾྂྃ྆྇ྈྉྊྋ" "w" ?2) ; upper modifier | |
392 ("༙ö‚Žà½±à½´à¾„༵༷" "w" ?3) ; lowel vowel/modifier | |
393 ("༠-༩༪-༳" "w" ?6) ; digit | |
394 ("་à¼-༒༔ཿ" "." ?|) ; line-break char | |
395 ("་à¼à¼à¼à¼‘༔ཿ" "." ?|) ; | |
396 ("༈་à¼-༒༔ཿ༽༴" "." ?>) ; prohibition | |
397 ("་à¼à¼à¼à¼‘༔ཿ" "." ?>) ; | |
398 ("ༀ-༊༼à¿à¿‚྅" "." ?<) ; prohibition | |
399 ("༓༕-༘༚-༟༶༸-༻༾༿྾྿-à¿" "." ?q) ; others | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
400 )) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
401 elm chars len syntax category to ch i) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
402 (while deflist |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
403 (setq elm (car deflist)) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
404 (setq chars (car elm) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
405 len (length chars) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
406 syntax (nth 1 elm) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
407 category (nth 2 elm) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
408 i 0) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
409 (while (< i len) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
410 (if (= (aref chars i) ?-) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
411 (setq i (1+ i) |
20843 | 412 to (aref chars i)) |
413 (setq ch (aref chars i) | |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
414 to ch)) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
415 (while (<= ch to) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
416 (unless (string-equal syntax "w") |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
417 (modify-syntax-entry ch syntax)) |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
418 (modify-category-entry ch category) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
419 (setq ch (1+ ch))) |
20843 | 420 (setq i (1+ i))) |
17753
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
421 (setq deflist (cdr deflist)))) |
de6be7f3b904
Set syntax and category for Devanagari, Thai, and
Kenichi Handa <handa@m17n.org>
parents:
17296
diff
changeset
|
422 |
17052 | 423 ;; Vietnamese character set |
424 | |
88612 | 425 ;; To make a word with Latin characters |
426 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-lower ?l) | |
427 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-lower ?v) | |
428 | |
429 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-upper ?l) | |
430 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-upper ?v) | |
17052 | 431 |
27691
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
432 (let ((tbl (standard-case-table)) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
433 (i 32)) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
434 (while (< i 128) |
88711 | 435 (let* ((char (decode-char 'vietnamese-viscii-upper i)) |
436 (charl (decode-char 'vietnamese-viscii-lower i)) | |
437 (uc (encode-char char 'ucs)) | |
438 (lc (encode-char charl 'ucs))) | |
439 (set-case-syntax-pair char (decode-char 'vietnamese-viscii-lower i) | |
440 tbl) | |
441 (if uc (modify-category-entry uc ?v)) | |
442 (if lc (modify-category-entry lc ?v))) | |
27691
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
443 (setq i (1+ i)))) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
444 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
445 (let ((tbl (standard-case-table)) c) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
446 |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
447 ;; In some languages, U+0049 LATIN CAPITAL LETTER I and U+0131 LATIN |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
448 ;; SMALL LETTER DOTLESS I make a case pair, and so do U+0130 LATIN |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
449 ;; CAPITAL LETTER I WITH DOT ABOVE and U+0069 LATIN SMALL LETTER I. |
89188 | 450 ;; See the Turkish language environment. |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
451 |
89146 | 452 ;; Latin-1 |
453 | |
454 ;; Fixme: Some of the non-word syntaxes here perhaps should be | |
455 ;; reviewed. (Note that the following all implicitly have word | |
456 ;; syntax: ¢£¤¥¨ª¯²³´¶¸¹º.) There should be a well-defined way of | |
457 ;; relating Unicode categories to Emacs syntax codes. | |
458 (set-case-syntax ? " " tbl) ; dubious | |
459 (set-case-syntax ?¡ "." tbl) | |
460 (set-case-syntax ?¦ "_" tbl) | |
461 (set-case-syntax ?§ "." tbl) | |
462 (set-case-syntax ?© "_" tbl) | |
463 (set-case-syntax-delims 171 187 tbl) ; « » | |
464 (set-case-syntax ?¬ "_" tbl) | |
465 (set-case-syntax ? "_" tbl) | |
466 (set-case-syntax ?® "_" tbl) | |
467 (set-case-syntax ?° "_" tbl) | |
468 (set-case-syntax ?± "_" tbl) | |
469 (set-case-syntax ?µ "_" tbl) | |
470 (set-case-syntax ?· "_" tbl) | |
471 (set-case-syntax ?¼ "_" tbl) | |
472 (set-case-syntax ?½ "_" tbl) | |
473 (set-case-syntax ?¾ "_" tbl) | |
474 (set-case-syntax ?¿ "." tbl) | |
475 (let ((c 192)) | |
476 (while (<= c 222) | |
477 (set-case-syntax-pair c (+ c 32) tbl) | |
478 (setq c (1+ c)))) | |
479 (set-case-syntax ?× "_" tbl) | |
480 (set-case-syntax ?ß "w" tbl) | |
481 (set-case-syntax ?÷ "_" tbl) | |
482 ;; See below for ÿ. | |
483 | |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
484 ;; Latin Extended-A, Latin Extended-B |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
485 (setq c #x0100) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
486 (while (<= c #x0233) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
487 (and (or (<= c #x012e) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
488 (and (>= c #x014a) (<= c #x0177))) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
489 (zerop (% c 2)) |
88612 | 490 (set-case-syntax-pair c (1+ c) tbl)) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
491 (and (>= c #x013a) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
492 (<= c #x0148) |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
493 (zerop (% c 2)) |
88612 | 494 (set-case-syntax-pair (1- c) c tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
495 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
496 (set-case-syntax-pair ?IJ ?ij tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
497 (set-case-syntax-pair ?Ĵ ?ĵ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
498 (set-case-syntax-pair ?Ķ ?ķ tbl) |
89084 | 499 (set-case-syntax-pair ?Ÿ ?ÿ tbl) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
500 (set-case-syntax-pair ?Ź ?ź tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
501 (set-case-syntax-pair ?Ż ?ż tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
502 (set-case-syntax-pair ?Ž ?ž tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
503 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
504 ;; Latin Extended-B |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
505 (set-case-syntax-pair ?Æ ?É“ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
506 (set-case-syntax-pair ?Ƃ ?ƃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
507 (set-case-syntax-pair ?Æ„ ?Æ… tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
508 (set-case-syntax-pair ?Ɔ ?ɔ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
509 (set-case-syntax-pair ?Ƈ ?ƈ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
510 (set-case-syntax-pair ?Ɖ ?ɖ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
511 (set-case-syntax-pair ?ÆŠ ?É— tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
512 (set-case-syntax-pair ?Ƌ ?ƌ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
513 (set-case-syntax-pair ?ÆŽ ?Ç tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
514 (set-case-syntax-pair ?Æ ?É™ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
515 (set-case-syntax-pair ?Æ ?É› tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
516 (set-case-syntax-pair ?Æ‘ ?Æ’ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
517 (set-case-syntax-pair ?Æ“ ?É tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
518 (set-case-syntax-pair ?Æ” ?É£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
519 (set-case-syntax-pair ?Æ– ?É© tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
520 (set-case-syntax-pair ?Ɨ ?ɨ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
521 (set-case-syntax-pair ?Ƙ ?ƙ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
522 (set-case-syntax-pair ?Ɯ ?ɯ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
523 (set-case-syntax-pair ?Æ ?ɲ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
524 (set-case-syntax-pair ?Ɵ ?ɵ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
525 (set-case-syntax-pair ?Æ ?Æ¡ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
526 (set-case-syntax-pair ?Æ¢ ?Æ£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
527 (set-case-syntax-pair ?Ƥ ?ƥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
528 (set-case-syntax-pair ?Ʀ ?ʀ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
529 (set-case-syntax-pair ?Ƨ ?ƨ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
530 (set-case-syntax-pair ?Ʃ ?ʃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
531 (set-case-syntax-pair ?Ƭ ?Æ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
532 (set-case-syntax-pair ?Ʈ ?ʈ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
533 (set-case-syntax-pair ?Ư ?ư tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
534 (set-case-syntax-pair ?Ʊ ?ʊ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
535 (set-case-syntax-pair ?Ʋ ?ʋ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
536 (set-case-syntax-pair ?Ƴ ?ƴ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
537 (set-case-syntax-pair ?Ƶ ?ƶ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
538 (set-case-syntax-pair ?Æ· ?Ê’ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
539 (set-case-syntax-pair ?Ƹ ?ƹ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
540 (set-case-syntax-pair ?Ƽ ?ƽ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
541 (set-case-syntax-pair ?DŽ ?dž tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
542 (set-case-syntax-pair ?Dž ?dž tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
543 (set-case-syntax-pair ?LJ ?lj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
544 (set-case-syntax-pair ?Lj ?lj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
545 (set-case-syntax-pair ?NJ ?nj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
546 (set-case-syntax-pair ?Nj ?nj tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
547 (set-case-syntax-pair ?Ç ?ÇŽ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
548 (set-case-syntax-pair ?Ç ?Ç tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
549 (set-case-syntax-pair ?Ç‘ ?Ç’ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
550 (set-case-syntax-pair ?Ç“ ?Ç” tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
551 (set-case-syntax-pair ?Ç• ?Ç– tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
552 (set-case-syntax-pair ?Ǘ ?ǘ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
553 (set-case-syntax-pair ?Ç™ ?Çš tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
554 (set-case-syntax-pair ?Ǜ ?ǜ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
555 (set-case-syntax-pair ?Çž ?ÇŸ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
556 (set-case-syntax-pair ?Ç ?Ç¡ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
557 (set-case-syntax-pair ?Ç¢ ?Ç£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
558 (set-case-syntax-pair ?Ǥ ?ǥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
559 (set-case-syntax-pair ?Ǧ ?ǧ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
560 (set-case-syntax-pair ?Ǩ ?ǩ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
561 (set-case-syntax-pair ?Ǫ ?ǫ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
562 (set-case-syntax-pair ?Ǭ ?Ç tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
563 (set-case-syntax-pair ?Ǯ ?ǯ tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
564 ;; 01F0; F; 006A 030C; # LATIN SMALL LETTER J WITH CARON |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
565 (set-case-syntax-pair ?DZ ?dz tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
566 (set-case-syntax-pair ?Dz ?dz tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
567 (set-case-syntax-pair ?Ǵ ?ǵ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
568 (set-case-syntax-pair ?Ƕ ?ƕ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
569 (set-case-syntax-pair ?Ç· ?Æ¿ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
570 (set-case-syntax-pair ?Ǹ ?ǹ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
571 (set-case-syntax-pair ?Ǻ ?ǻ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
572 (set-case-syntax-pair ?Ǽ ?ǽ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
573 (set-case-syntax-pair ?Ǿ ?ǿ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
574 (set-case-syntax-pair ?È€ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
575 (set-case-syntax-pair ?Ȃ ?ȃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
576 (set-case-syntax-pair ?È„ ?È… tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
577 (set-case-syntax-pair ?Ȇ ?ȇ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
578 (set-case-syntax-pair ?Ȉ ?ȉ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
579 (set-case-syntax-pair ?ÈŠ ?È‹ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
580 (set-case-syntax-pair ?ÈŒ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
581 (set-case-syntax-pair ?ÈŽ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
582 (set-case-syntax-pair ?È ?È‘ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
583 (set-case-syntax-pair ?È’ ?È“ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
584 (set-case-syntax-pair ?È” ?È• tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
585 (set-case-syntax-pair ?È– ?È— tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
586 (set-case-syntax-pair ?Ș ?ș tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
587 (set-case-syntax-pair ?Èš ?È› tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
588 (set-case-syntax-pair ?Èœ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
589 (set-case-syntax-pair ?Èž ?ÈŸ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
590 (set-case-syntax-pair ?È¢ ?È£ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
591 (set-case-syntax-pair ?Ȥ ?ȥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
592 (set-case-syntax-pair ?Ȧ ?ȧ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
593 (set-case-syntax-pair ?Ȩ ?ȩ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
594 (set-case-syntax-pair ?Ȫ ?ȫ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
595 (set-case-syntax-pair ?Ȭ ?È tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
596 (set-case-syntax-pair ?Ȯ ?ȯ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
597 (set-case-syntax-pair ?Ȱ ?ȱ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
598 (set-case-syntax-pair ?Ȳ ?ȳ tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
599 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
600 ;; Latin Extended Additional |
88612 | 601 (modify-category-entry '(#x1e00 . #x1ef9) ?l) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
602 (setq c #x1e00) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
603 (while (<= c #x1ef9) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
604 (and (zerop (% c 2)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
605 (or (<= c #x1e94) (>= c #x1ea0)) |
88612 | 606 (set-case-syntax-pair c (1+ c) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
607 (setq c (1+ c))) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
608 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
609 ;; Greek |
88612 | 610 (modify-category-entry '(#x0370 . #x03ff) ?g) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
611 (setq c #x0370) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
612 (while (<= c #x03ff) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
613 (if (or (and (>= c #x0391) (<= c #x03a1)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
614 (and (>= c #x03a3) (<= c #x03ab))) |
88612 | 615 (set-case-syntax-pair c (+ c 32) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
616 (and (>= c #x03da) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
617 (<= c #x03ee) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
618 (zerop (% c 2)) |
88612 | 619 (set-case-syntax-pair c (1+ c) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
620 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
621 (set-case-syntax-pair ?Ά ?ά tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
622 (set-case-syntax-pair ?Έ ?Πtbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
623 (set-case-syntax-pair ?Ή ?ή tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
624 (set-case-syntax-pair ?Ί ?ί tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
625 (set-case-syntax-pair ?Ό ?ό tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
626 (set-case-syntax-pair ?ÎŽ ?Ï tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
627 (set-case-syntax-pair ?Î ?ÏŽ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
628 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
629 ;; Armenian |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
630 (setq c #x531) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
631 (while (<= c #x556) |
88612 | 632 (set-case-syntax-pair c (+ c #x30) tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
633 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
634 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
635 ;; Greek Extended |
88612 | 636 (modify-category-entry '(#x1f00 . #x1fff) ?g) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
637 (setq c #x1f00) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
638 (while (<= c #x1fff) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
639 (and (<= (logand c #x000f) 7) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
640 (<= c #x1fa7) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
641 (not (memq c '(#x1f50 #x1f52 #x1f54 #x1f56))) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
642 (/= (logand c #x00f0) 7) |
88612 | 643 (set-case-syntax-pair (+ c 8) c tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
644 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
645 (set-case-syntax-pair ?Ᾰ ?ᾰ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
646 (set-case-syntax-pair ?á¾¹ ?á¾± tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
647 (set-case-syntax-pair ?Ὰ ?ὰ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
648 (set-case-syntax-pair ?á¾» ?á½± tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
649 (set-case-syntax-pair ?á¾¼ ?á¾³ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
650 (set-case-syntax-pair ?Ὲ ?ὲ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
651 (set-case-syntax-pair ?Έ ?έ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
652 (set-case-syntax-pair ?á¿Š ?á½´ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
653 (set-case-syntax-pair ?á¿‹ ?á½µ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
654 (set-case-syntax-pair ?ῌ ?ῃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
655 (set-case-syntax-pair ?Ῐ ?ῠtbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
656 (set-case-syntax-pair ?á¿™ ?á¿‘ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
657 (set-case-syntax-pair ?Ὶ ?ὶ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
658 (set-case-syntax-pair ?á¿› ?á½· tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
659 (set-case-syntax-pair ?Ῠ ?ῠtbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
660 (set-case-syntax-pair ?á¿© ?á¿¡ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
661 (set-case-syntax-pair ?Ὺ ?ὺ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
662 (set-case-syntax-pair ?á¿« ?á½» tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
663 (set-case-syntax-pair ?Ῥ ?ῥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
664 (set-case-syntax-pair ?Ὸ ?ὸ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
665 (set-case-syntax-pair ?Ό ?ό tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
666 (set-case-syntax-pair ?Ὼ ?ὼ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
667 (set-case-syntax-pair ?á¿» ?á½½ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
668 (set-case-syntax-pair ?ῼ ?ῳ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
669 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
670 ;; cyrillic |
88612 | 671 (modify-category-entry '(#x0400 . #x04FF) ?y) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
672 (setq c #x0400) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
673 (while (<= c #x04ff) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
674 (and (>= c #x0400) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
675 (<= c #x040f) |
88612 | 676 (set-case-syntax-pair c (+ c 80) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
677 (and (>= c #x0410) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
678 (<= c #x042f) |
88612 | 679 (set-case-syntax-pair c (+ c 32) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
680 (and (zerop (% c 2)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
681 (or (and (>= c #x0460) (<= c #x0480)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
682 (and (>= c #x048c) (<= c #x04be)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
683 (and (>= c #x04d0) (<= c #x04f4))) |
88612 | 684 (set-case-syntax-pair c (1+ c) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
685 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
686 (set-case-syntax-pair ?Ó ?Ó‚ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
687 (set-case-syntax-pair ?Óƒ ?Ó„ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
688 (set-case-syntax-pair ?Ó‡ ?Óˆ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
689 (set-case-syntax-pair ?Ӌ ?ӌ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
690 (set-case-syntax-pair ?Ó¸ ?Ó¹ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
691 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
692 ;; general punctuation |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
693 (setq c #x2000) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
694 (while (<= c #x200b) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
695 (set-case-syntax c " " tbl) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
696 (setq c (1+ c))) |
89119 | 697 (while (<= c #x200F) |
698 (set-case-syntax c "." tbl) | |
699 (setq c (1+ c))) | |
700 ;; Fixme: These aren't all right: | |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
701 (while (<= c #x2027) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
702 (set-case-syntax c "_" tbl) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
703 (setq c (1+ c))) |
89119 | 704 (while (<= c #x206F) |
705 (set-case-syntax c "." tbl) | |
706 (setq c (1+ c))) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
707 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
708 ;; Roman numerals |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
709 (setq c #x2160) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
710 (while (<= c #x216f) |
88612 | 711 (set-case-syntax-pair c (+ c #x10) tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
712 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
713 |
89146 | 714 ;; Fixme: The following blocks might be better as symbol rather than |
715 ;; punctuation. | |
89119 | 716 ;; Arrows |
717 (setq c #x2190) | |
89121 | 718 (while (<= c #x21FF) |
719 (set-case-syntax c "." tbl) | |
89119 | 720 (setq c (1+ c))) |
721 ;; Mathematical Operators | |
722 (while (<= c #x22FF) | |
89121 | 723 (set-case-syntax c "." tbl) |
89119 | 724 (setq c (1+ c))) |
725 ;; Miscellaneous Technical | |
726 (while (<= c #x23FF) | |
89121 | 727 (set-case-syntax c "." tbl) |
89119 | 728 (setq c (1+ c))) |
729 ;; Control Pictures | |
730 (while (<= c #x243F) | |
89121 | 731 (set-case-syntax c "_" tbl) |
732 (setq c (1+ c))) | |
89119 | 733 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
734 ;; Circled Latin |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
735 (setq c #x24b6) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
736 (while (<= c #x24cf) |
88612 | 737 (set-case-syntax-pair c (+ c 26) tbl) |
738 (modify-category-entry c ?l) | |
739 (modify-category-entry (+ c 26) ?l) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
740 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
741 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
742 ;; Fullwidth Latin |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
743 (setq c #xff21) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
744 (while (<= c #xff3a) |
88612 | 745 (set-case-syntax-pair c (+ c #x20) tbl) |
746 (modify-category-entry c ?l) | |
747 (modify-category-entry (+ c #x20) ?l) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
748 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
749 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
750 ;; Combining diacritics |
88612 | 751 (modify-category-entry '(#x300 . #x362) ?^) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
752 ;; Combining marks |
88612 | 753 (modify-category-entry '(#x20d0 . #x20e3) ?^) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
754 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
755 ;; Fixme: syntax for symbols &c |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
756 ) |
17052 | 757 |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
758 ;; For each character set, put the information of the most proper |
24782
8f65cd6729b5
Typo fixed; prefered->preferred.
Kenichi Handa <handa@m17n.org>
parents:
24256
diff
changeset
|
759 ;; coding system to encode it by `preferred-coding-system' property. |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
760 |
88612 | 761 ;; Fixme: should this be junked? |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
762 (let ((l '((latin-iso8859-1 . iso-latin-1) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
763 (latin-iso8859-2 . iso-latin-2) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
764 (latin-iso8859-3 . iso-latin-3) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
765 (latin-iso8859-4 . iso-latin-4) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
766 (thai-tis620 . thai-tis620) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
767 (greek-iso8859-7 . greek-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
768 (arabic-iso8859-6 . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
769 (hebrew-iso8859-8 . hebrew-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
770 (katakana-jisx0201 . japanese-shift-jis) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
771 (latin-jisx0201 . japanese-shift-jis) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
772 (cyrillic-iso8859-5 . cyrillic-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
773 (latin-iso8859-9 . iso-latin-5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
774 (japanese-jisx0208-1978 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
775 (chinese-gb2312 . cn-gb-2312) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
776 (japanese-jisx0208 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
777 (korean-ksc5601 . iso-2022-kr) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
778 (japanese-jisx0212 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
779 (chinese-cns11643-1 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
780 (chinese-cns11643-2 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
781 (chinese-big5-1 . chinese-big5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
782 (chinese-big5-2 . chinese-big5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
783 (chinese-sisheng . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
784 (ipa . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
785 (vietnamese-viscii-lower . vietnamese-viscii) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
786 (vietnamese-viscii-upper . vietnamese-viscii) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
787 (arabic-digit . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
788 (arabic-1-column . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
789 (lao . lao) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
790 (arabic-2-column . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
791 (indian-is13194 . devanagari) |
41466
b364f798b789
Adjusted for the change of indian-1-column -> indian-glyph.
Richard M. Stallman <rms@gnu.org>
parents:
38414
diff
changeset
|
792 (indian-glyph . devanagari) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
793 (tibetan-1-column . tibetan) |
24256
47e6aa168a66
Let ethiopic use iso-2022-7bit.
Kenichi Handa <handa@m17n.org>
parents:
24208
diff
changeset
|
794 (ethiopic . iso-2022-7bit) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
795 (chinese-cns11643-3 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
796 (chinese-cns11643-4 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
797 (chinese-cns11643-5 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
798 (chinese-cns11643-6 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
799 (chinese-cns11643-7 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
800 (indian-2-column . devanagari) |
26013 | 801 (tibetan . tibetan) |
802 (latin-iso8859-14 . iso-latin-8) | |
803 (latin-iso8859-15 . iso-latin-9)))) | |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
804 (while l |
24782
8f65cd6729b5
Typo fixed; prefered->preferred.
Kenichi Handa <handa@m17n.org>
parents:
24256
diff
changeset
|
805 (put-charset-property (car (car l)) 'preferred-coding-system (cdr (car l))) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
806 (setq l (cdr l)))) |
23493 | 807 |
808 | |
23527
06e8c3b9deec
Setup charset property nospace-between-words.
Kenichi Handa <handa@m17n.org>
parents:
23493
diff
changeset
|
809 ;; Setup auto-fill-chars for charsets that should invoke auto-filling. |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
810 ;; SPACE and NEWLINE are already set. Also put `nospace-between-words' |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
811 ;; property on the charsets. |
23493 | 812 (let ((l '(katakana-jisx0201 |
813 japanese-jisx0208 japanese-jisx0212 | |
814 chinese-gb2312 chinese-big5-1 chinese-big5-2))) | |
815 (while l | |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
816 ;;(aset auto-fill-chars (make-char (car l)) t) |
23527
06e8c3b9deec
Setup charset property nospace-between-words.
Kenichi Handa <handa@m17n.org>
parents:
23493
diff
changeset
|
817 (put-charset-property (car l) 'nospace-between-words t) |
23493 | 818 (setq l (cdr l)))) |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
819 |
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
820 |
88492
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
821 ;; CJK double width characters. |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
822 (let ((l '((#x1100 . #x11FF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
823 (#x2E80 . #x9FAF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
824 (#xAC00 . #xD7AF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
825 (#xF900 . #xFAFF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
826 (#xFE30 . #xFE4F) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
827 (#xFF00 . #xFF5F) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
828 (#xFFE0 . #xFFEF)))) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
829 (dolist (elt l) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
830 (set-char-table-range char-width-table |
88612 | 831 (cons (car elt) (cdr elt)) |
88492
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
832 2))) |
89117 | 833 ;; Fixme: Doing this affects non-CJK characters through unification, |
834 ;; but presumably CJK users expect those characters to be | |
835 ;; double-width when using these charsets. | |
836 ;; (map-charset-chars | |
837 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
838 ;; 'japanese-jisx0208) | |
839 ;; (map-charset-chars | |
840 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
841 ;; 'japanese-jisx0212) | |
842 ;; (map-charset-chars | |
843 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
844 ;; 'japanese-jisx0213-1) | |
845 ;; (map-charset-chars | |
846 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
847 ;; 'japanese-jisx0213-2) | |
848 ;; (map-charset-chars | |
849 ;; (lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
850 ;; 'korean-ksc5601) | |
88642
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
851 |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
852 ;; Other double width |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
853 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
854 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
855 'ethiopic) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
856 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
857 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
858 'tibetan) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
859 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
860 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
861 'indian-2-column) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
862 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
863 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
864 'arabic-2-column) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
865 |
88723 | 866 (optimize-char-table (standard-case-table)) |
867 (optimize-char-table char-width-table) | |
868 (optimize-char-table (standard-category-table)) | |
869 (optimize-char-table (standard-syntax-table)) | |
870 | |
89119 | 871 ;; The Unicode blocks actually extend past some of these ranges with |
872 ;; undefined codepoints. | |
88892 | 873 (let ((script-list nil)) |
874 (dolist | |
875 (elt | |
89008
3e08629c1469
(script-list): Treat ascii characters as `latin' script.
Kenichi Handa <handa@m17n.org>
parents:
88976
diff
changeset
|
876 '((#x0000 #x007F latin) |
88892 | 877 (#x00A0 #x036F latin) |
878 (#x0370 #x03E1 greek) | |
879 (#x03E2 #x03EF coptic) | |
880 (#x03F0 #x03F3 greek) | |
881 (#x0400 #x04FF cyrillic) | |
882 (#x0530 #x058F armenian) | |
883 (#x0590 #x05FF hebrew) | |
884 (#x0600 #x06FF arabic) | |
885 (#x0700 #x074F syriac) | |
886 (#x0780 #x07BF thaana) | |
887 (#x0900 #x097F devanagari) | |
888 (#x0980 #x09FF bengali) | |
889 (#x0A00 #x0A7F gurmukhi) | |
890 (#x0A80 #x0AFF gujarati) | |
891 (#x0B00 #x0B7F oriya) | |
892 (#x0B80 #x0BFF tamil) | |
893 (#x0C00 #x0C7F telugu) | |
894 (#x0C80 #x0CFF kannada) | |
895 (#x0D00 #x0D7F malayalam) | |
896 (#x0D80 #x0DFF sinhala) | |
897 (#x0E00 #x0E5F thai) | |
898 (#x0E80 #x0EDF lao) | |
899 (#x0F00 #x0FFF tibetan) | |
900 (#x1000 #x105F myanmar) | |
901 (#x10A0 #x10FF georgian) | |
902 (#x1100 #x11FF hangul) | |
903 (#x1200 #x137F ethiopic) | |
904 (#x13A0 #x13FF cherokee) | |
905 (#x1400 #x167F canadian-aboriginal) | |
906 (#x1680 #x169F ogham) | |
907 (#x16A0 #x16FF runic) | |
908 (#x1780 #x17FF khmer) | |
909 (#x1800 #x18AF mongolian) | |
910 (#x1E00 #x1EFF latin) | |
911 (#x1F00 #x1FFF greek) | |
912 (#x20A0 #x20AF currency) | |
913 (#x2800 #x28FF braille) | |
914 (#x2E80 #x2FDF han) | |
915 (#x2FF0 #x2FFF ideographic-description) | |
916 (#x3000 #x303F cjk-misc) | |
917 (#x3040 #x30FF kana) | |
918 (#x3100 #x312F bopomofo) | |
919 (#x3130 #x318F hangul) | |
920 (#x3190 #x319F kanbun) | |
921 (#x31A0 #x31BF bopomofo) | |
922 (#x3400 #x9FAF han) | |
923 (#xA000 #xA4CF yi) | |
924 (#xAC00 #xD7AF hangul) | |
925 (#xF900 #xFA5F han) | |
926 (#xFB1D #xFB4F hebrew) | |
927 (#xFB50 #xFDFF arabic) | |
928 (#xFE70 #xFEFC arabic) | |
929 (#xFF00 #xFF5F cjk-misc) | |
930 (#xFF61 #xFF9F kana) | |
931 (#xFFE0 #xFFE6 cjk-misc) | |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
932 (#x20000 #x2AFFF han) |
88892 | 933 (#x2F800 #x2FFFF han))) |
934 (set-char-table-range char-script-table | |
935 (cons (car elt) (nth 1 elt)) (nth 2 elt)) | |
936 (or (memq (nth 2 elt) script-list) | |
937 (setq script-list (cons (nth 2 elt) script-list)))) | |
938 (set-char-table-extra-slot char-script-table 0 (nreverse script-list))) | |
939 | |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
940 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
941 ;;; Setting word boundary. |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
942 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
943 (defun next-word-boundary-han (pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
944 (if (<= pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
945 (save-excursion |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
946 (goto-char pos) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
947 (looking-at "\\cC+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
948 (goto-char (match-end 0)) |
88976
fc96b539a43c
(next-word-boundary-han): Don't
Kenichi Handa <handa@m17n.org>
parents:
88971
diff
changeset
|
949 (if (looking-at "\\cH+") |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
950 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
951 (point)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
952 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
953 (eq (aref char-script-table (char-after (1- pos))) 'han)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
954 (setq pos (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
955 pos)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
956 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
957 (defun next-word-boundary-kana (pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
958 (if (<= pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
959 (save-excursion |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
960 (goto-char pos) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
961 (if (looking-at "\\cK+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
962 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
963 (if (looking-at "\\cH+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
964 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
965 (point)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
966 (let ((category-set (char-category-set (char-after pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
967 category) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
968 (if (aref category-set ?K) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
969 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
970 (aref (char-category-set (char-after (1- pos))) ?K)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
971 (setq pos (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
972 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
973 (aref (setq category-set |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
974 (char-category-set (char-after (1- pos)))) ?H)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
975 (setq pos (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
976 (setq category (cond ((aref category-set ?C) ?C) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
977 ((aref category-set ?K) ?K) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
978 ((aref category-set ?A) ?A))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
979 (when category |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
980 (setq pos (1- pos)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
981 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
982 (aref (char-category-set (char-after (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
983 category)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
984 (setq pos (1- pos))))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
985 pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
986 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
987 (map-char-table |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
988 #'(lambda (char script) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
989 (cond ((eq script 'han) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
990 (set-char-table-range next-word-boundary-function-table |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
991 char #'next-word-boundary-han)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
992 ((eq script 'kana) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
993 (set-char-table-range next-word-boundary-function-table |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
994 char #'next-word-boundary-kana)))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
995 char-script-table) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
996 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
997 (setq word-combining-categories |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
998 '((?l . ?l))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
999 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1000 (setq word-separating-categories ; (2-byte character sets) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1001 '((?A . ?K) ; Alpha numeric - Katakana |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1002 (?A . ?C) ; Alpha numeric - Chinese |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1003 (?H . ?A) ; Hiragana - Alpha numeric |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1004 (?H . ?K) ; Hiragana - Katakana |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1005 (?H . ?C) ; Hiragana - Chinese |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1006 (?K . ?A) ; Katakana - Alpha numeric |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1007 (?K . ?C) ; Katakana - Chinese |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1008 (?C . ?A) ; Chinese - Alpha numeric |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1009 (?C . ?K) ; Chinese - Katakana |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1010 )) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
1011 |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
1012 ;;; Local Variables: |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
1013 ;;; coding: utf-8-emacs |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
1014 ;;; End: |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
1015 |
38414
67b464da13ec
Some fixes to follow coding conventions.
Pavel Janík <Pavel@Janik.cz>
parents:
36688
diff
changeset
|
1016 ;;; characters.el ends here |