annotate lisp/international/characters.el @ 89314:f040012c16bb
Move parts around so that syntax
definitions aren't overwritten wrongly.
author | Dave Love <fx@gnu.org> |
---|---|
date | Thu, 05 Dec 2002 14:28:21 +0000 |
parents | b73d29dcc8c0 |
children | 2f877ed80fa6 |
rev | line source |
---|---|
17052 | 1 ;;; characters.el --- set syntax and category for multibyte characters |
2 | |
19958 | 3 ;; Copyright (C) 1995, 1997 Electrotechnical Laboratory, JAPAN. |
18377 | 4 ;; Licensed to the Free Software Foundation. |
88612 | 5 ;; Copyright (C) 2001, 2002 Free Software Foundation, Inc. |
88403 | 6 ;; Copyright (C) 2001, 2002 |
7 ;; National Institute of Advanced Industrial Science and Technology (AIST) | |
8 ;; Registration Number H13PRO009 | |
17052 | 9 |
10 ;; Keywords: multibyte character, character set, syntax, category | |
11 | |
12 ;; This file is part of GNU Emacs. | |
13 | |
14 ;; GNU Emacs is free software; you can redistribute it and/or modify | |
15 ;; it under the terms of the GNU General Public License as published by | |
16 ;; the Free Software Foundation; either version 2, or (at your option) | |
17 ;; any later version. | |
18 | |
19 ;; GNU Emacs is distributed in the hope that it will be useful, | |
20 ;; but WITHOUT ANY WARRANTY; without even the implied warranty of | |
21 ;; MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the | |
22 ;; GNU General Public License for more details. | |
23 | |
24 ;; You should have received a copy of the GNU General Public License | |
17071 | 25 ;; along with GNU Emacs; see the file COPYING. If not, write to the |
26 ;; Free Software Foundation, Inc., 59 Temple Place - Suite 330, | |
27 ;; Boston, MA 02111-1307, USA. | |
17052 | 28 |
29 ;;; Commentary: | |
30 | |
31 ;; This file contains multibyte characters. Save this file always in | |
18197 | 32 ;; the coding system `iso-2022-7bit'. |
17052 | 33 |
18205 | 34 ;; This file does not define the syntax for Latin-N character sets; |
35 ;; those are defined by the files latin-N.el. | |
36 | |
38414 | 37 ;;; Code: |
38 | |
17052 | 39 ;;; Predefined categories. |
40 | |
41 ;; For each character set. | |
42 | |
43 (define-category ?a "ASCII") | |
44 (define-category ?l "Latin") | |
45 (define-category ?t "Thai") | |
46 (define-category ?g "Greek") | |
47 (define-category ?b "Arabic") | |
48 (define-category ?w "Hebrew") | |
49 (define-category ?y "Cyrillic") | |
50 (define-category ?k "Japanese katakana") | |
51 (define-category ?r "Japanese roman") | |
52 (define-category ?c "Chinese") | |
53 (define-category ?j "Japanese") | |
54 (define-category ?h "Korean") | |
55 (define-category ?e "Ethiopic (Ge'ez)") | |
56 (define-category ?v "Vietnamese") | |
57 (define-category ?i "Indian") | |
17078 | 58 (define-category ?o "Lao") |
17753 | 59 (define-category ?q "Tibetan") |
17052 | 60 |
61 ;; For each group (row) of 2-byte character sets. | |
62 | |
22529 | 63 (define-category ?A "Alpha-numeric characters of 2-byte character sets") |
17052 | 64 (define-category ?C "Chinese (Han) characters of 2-byte character sets") |
22529 | 65 (define-category ?G "Greek characters of 2-byte character sets") |
17052 | 66 (define-category ?H "Japanese Hiragana characters of 2-byte character sets") |
67 (define-category ?K "Japanese Katakana characters of 2-byte character sets") | |
68 (define-category ?N "Korean Hangul characters of 2-byte character sets") | |
22529 | 69 (define-category ?Y "Cyrillic characters of 2-byte character sets") |
17052 | 70 (define-category ?I "Indian Glyphs") |
71 | |
72 ;; For phonetic classifications. | |
73 | |
74 (define-category ?0 "consonant") | |
18561 | 75 (define-category ?1 "base (independent) vowel") |
17052 | 76 (define-category ?2 "upper diacritical mark (including upper vowel)") |
77 (define-category ?3 "lower diacritical mark (including lower vowel)") | |
78 (define-category ?4 "tone mark") | |
18561 | 79 (define-category ?5 "symbol") |
17052 | 80 (define-category ?6 "digit") |
81 (define-category ?7 "vowel-modifying diacritical mark") | |
17078 | 82 (define-category ?8 "vowel-signs") |
83 (define-category ?9 "semivowel lower") | |
17052 | 84 |
85 ;; For filling. | |
86 (define-category ?| "While filling, we can break a line at this character.") | |
87 | |
20247 | 88 ;; For indentation calculation. |
20734 | 89 (define-category ? |
90 "This character counts as a space for indentation purposes.") | |
20247 | 91 |
22529 | 92 ;; Keep the following for `kinsoku' processing. See comments in |
17052 | 93 ;; kinsoku.el. |
94 (define-category ?> "A character which can't be placed at beginning of line.") | |
95 (define-category ?< "A character which can't be placed at end of line.") | |
96 | |
41874 | 97 ;; Combining |
98 (define-category ?^ "Combining diacritic or mark") | |
17052 | 99 |
100 ;;; Setting syntax and category. | |
101 | |
102 ;; ASCII | |
103 | |
88425 | 104 ;; All ASCII characters have the category `a' (ASCII) and `l' (Latin). |
105 (modify-category-entry '(32 . 127) ?a) | |
106 (modify-category-entry '(32 . 127) ?l) | |
17052 | 107 |
89314 | 108 ;; Deal with the CJK charsets first. Since the syntax of blocks is |
109 ;; defined per charset, and the charsets may contain e.g. Latin | |
110 ;; characters, we end up with the wrong syntax definitions if we're | |
111 ;; not careful. | |
17052 | 112 |
88752 | 113 ;; Chinese characters (Unicode) |
114 (modify-category-entry '(#x3400 . #x9FAF) ?C) | |
115 (modify-category-entry '(#x3400 . #x9FAF) ?c) | |
116 (modify-category-entry '(#x3400 . #x9FAF) ?|) | |
117 (modify-category-entry '(#xF900 . #xFAFF) ?C) | |
118 (modify-category-entry '(#xF900 . #xFAFF) ?c) | |
119 (modify-category-entry '(#xF900 . #xFAFF) ?|) | |
120 | |
17052 | 121 ;; Chinese character set (GB2312) |
122 | |
88752 | 123 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2121 #x217E) |
124 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2221 #x227E) | |
125 (map-charset-chars #'modify-syntax-entry 'chinese-gb2312 "_" #x2921 #x297E) | |
17052 | 126 |
88723 | 127 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?c) |
128 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?|) | |
89014 | 129 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2330 #x2339) |
130 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2341 #x235A) | |
131 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?A #x2361 #x237A) | |
88752 | 132 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?H #x2421 #x247E) |
133 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?K #x2521 #x257E) | |
134 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?G #x2621 #x267E) | |
135 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?Y #x2721 #x277E) | |
136 (map-charset-chars #'modify-category-entry 'chinese-gb2312 ?C #x3021 #x7E7E) | |
17052 | 137 |
138 ;; Chinese character set (BIG5) | |
139 | |
88971 | 140 (map-charset-chars #'modify-category-entry 'big5 ?c) |
89014 | 141 (map-charset-chars #'modify-category-entry 'big5 ?C #xA259 #xA25F) |
142 (map-charset-chars #'modify-category-entry 'big5 ?C #xA440 #xC67E) | |
143 (map-charset-chars #'modify-category-entry 'big5 ?C #xC940 #xF9DF) | |
88971 | 144 (map-charset-chars #'modify-category-entry 'big5 ?|) |
145 | |
17052 | 146 |
147 ;; Chinese character set (CNS11643) | |
148 | |
88723 | 149 (dolist (c '(chinese-cns11643-1 chinese-cns11643-2 chinese-cns11643-3 |
150 chinese-cns11643-4 chinese-cns11643-5 chinese-cns11643-6 | |
151 chinese-cns11643-7)) | |
152 (map-charset-chars #'modify-category-entry c ?c) | |
89014 | 153 (if (eq c 'chinese-cns11643-1) |
154 (map-charset-chars #'modify-category-entry c ?C #x4421 #x7E7E) | |
155 (map-charset-chars #'modify-category-entry c ?C)) | |
88723 | 156 (map-charset-chars #'modify-category-entry c ?|)) |
17052 | 157 |
158 ;; Japanese character set (JISX0201-kana, JISX0201-roman, JISX0208, JISX0212) | |
159 | |
88752 | 160 (map-charset-chars #'modify-category-entry 'katakana-jisx0201 ?k) |
161 | |
162 (map-charset-chars #'modify-category-entry 'latin-jisx0201 ?r) | |
88403 | 163 |
164 (dolist (l '(katakana-jisx0201 japanese-jisx0208 japanese-jisx0212)) | |
88752 | 165 (map-charset-chars #'modify-category-entry l ?j) |
166 (map-charset-chars #'modify-category-entry l ?\|)) | |
17052 | 167 |
41874 | 168 ;; Unicode equivalents of JISX0201-kana |
88752 | 169 (let ((range '(#xff61 . #xff9f))) |
170 (modify-category-entry range ?k) | |
171 (modify-category-entry range ?j) | |
172 (modify-category-entry range ?\|)) | |
41874 | 173 |
174 ;; Katakana block | |
88752 | 175 (let ((range '(#x30a0 . #x30ff))) |
176 ;; ?K is double width, ?k isn't specified | |
177 (modify-category-entry range ?K) | |
178 (modify-category-entry range ?\|)) | |
41874 | 179 |
180 ;; Hiragana block | |
88752 | 181 (let ((range '(#x3040 . #x309f))) |
182 ;; ?H is actually defined to be double width | |
183 ;;(modify-category-entry range ?H) | |
184 ;;(modify-category-entry range ?\|) | |
185 ) | |
41874 | 186 |
17052 | 187 ;; JISX0208 |
88752 | 188 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0208 "_" #x2121 #x227E) |
189 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0208 "_" #x2821 #x287E) | |
190 (let ((chars '(?ー ?゛ ?゜ ?ヽ ?ヾ ?ゝ ?ゞ ?〃 ?仝 ?々 ?〆 ?〇))) | |
88519 | 191 (dolist (elt chars) |
88612 | 192 (modify-syntax-entry elt "w"))) |
17052 | 193 |
88752 | 194 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?A #x2321 #x237E) |
195 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?H #x2421 #x247E) | |
196 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?K #x2521 #x257E) | |
197 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?G #x2621 #x267E) | |
198 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?Y #x2721 #x277E) | |
199 (map-charset-chars #'modify-category-entry 'japanese-jisx0208 ?C #x3021 #x7E7E) | |
200 (modify-category-entry ?ー ?K) | |
201 (let ((chars '(?゛ ?゜))) | |
17052 | 202 (while chars |
203 (modify-category-entry (car chars) ?K) | |
204 (modify-category-entry (car chars) ?H) | |
205 (setq chars (cdr chars)))) | |
88752 | 206 (let ((chars '(?ヽ ?ヾ ?ゝ ?ゞ ?〃 ?仝 ?々 ?〆 ?〇))) |
17052 | 207 (while chars |
208 (modify-category-entry (car chars) ?C) | |
209 (setq chars (cdr chars)))) | |
210 | |
211 ;; JISX0212 | |
88723 | 212 |
88752 | 213 (map-charset-chars #'modify-syntax-entry 'japanese-jisx0212 "_" #x2121 #x237E) |
17052 | 214 |
215 ;; JISX0201-Kana | |
88723 | 216 |
88612 | 217 (let ((chars '(?。 ?、 ?・))) |
17052 | 218 (while chars |
219 (modify-syntax-entry (car chars) ".") | |
220 (setq chars (cdr chars)))) | |
221 | |
88469 | 222 (modify-syntax-entry ?\「 "(」") |
223 (modify-syntax-entry ?\」 ")「") | |
36688 | 224 |
17052 | 225 ;; Korean character set (KSC5601) |
226 | |
88723 | 227 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?h) |
88752 | 228 |
229 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2121 #x227E) | |
89117 | 230 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2621 #x277E) |
231 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2830 #x287E) | |
232 (map-charset-chars #'modify-syntax-entry 'korean-ksc5601 "_" #x2930 #x297E) | |
89014 | 233 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2330 #x2339) |
234 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2341 #x235A) | |
235 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?A #x2361 #x237A) | |
88752 | 236 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?G #x2521 #x257E) |
237 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?H #x2A21 #x2A7E) | |
238 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?K #x2B21 #x2B7E) | |
239 (map-charset-chars #'modify-category-entry 'korean-ksc5601 ?Y #x2C21 #x2C7E) | |
17052 | 240 |
89314 | 241 ;; These are in more than one charset. |
242 (modify-syntax-entry ?\( "()") | |
243 (modify-syntax-entry ?\[ "(]") | |
244 (modify-syntax-entry ?\{ "(}") | |
245 (modify-syntax-entry ?\「 "(」") | |
246 (modify-syntax-entry ?\『 "(』") | |
247 (modify-syntax-entry ?\) ")(") | |
248 (modify-syntax-entry ?\] ")[") | |
249 (modify-syntax-entry ?\} "){") | |
250 (modify-syntax-entry ?\」 ")「") | |
251 (modify-syntax-entry ?\』 ")『") | |
252 | |
253 (modify-syntax-entry ?\〔 "(〕") | |
254 (modify-syntax-entry ?\〈 "(〉") | |
255 (modify-syntax-entry ?\《 "(》") | |
256 (modify-syntax-entry ?\〖 "(〗") | |
257 (modify-syntax-entry ?\【 "(】") | |
258 (modify-syntax-entry ?\〕 ")〔") | |
259 (modify-syntax-entry ?\〉 ")〈") | |
260 (modify-syntax-entry ?\》 ")《") | |
261 (modify-syntax-entry ?\〗 ")〖") | |
262 (modify-syntax-entry ?\】 ")【") | |
263 (modify-syntax-entry ?\〚 "(〛") | |
264 (modify-syntax-entry ?\〛 ")〚") | |
265 | |
266 ;; Arabic character set | |
33902 | 267 |
89314 | 268 (let ((charsets '(arabic-iso8859-6 |
269 arabic-digit | |
270 arabic-1-column | |
271 arabic-2-column))) | |
272 (while charsets | |
273 (map-charset-chars #'modify-category-entry (car charsets) ?b) | |
274 (setq charsets (cdr charsets)))) | |
275 (modify-category-entry '(#x600 . #x6ff) ?b) | |
276 (modify-category-entry '(#xfb50 . #xfdff) ?b) | |
277 (modify-category-entry '(#xfe70 . #xfefe) ?b) | |
278 | |
279 ;; Cyrillic character set (ISO-8859-5) | |
280 | |
281 (modify-syntax-entry ?№ ".") | |
282 | |
283 ;; Ethiopic character set | |
284 | |
285 (modify-category-entry '(#x1200 . #x137b) ?e) | |
286 (let ((chars '(?á¡ ?ᢠ?ᣠ?ᤠ?ᥠ?ᦠ?᧠?ᨠ?ö ‡€ ?ö ‡‹ ?ö ‡Œ ?ö ‡ ?ö ‡Ž ?ö ‡))) | |
287 (while chars | |
288 (modify-syntax-entry (car chars) ".") | |
289 (setq chars (cdr chars)))) | |
290 (map-charset-chars #'modify-category-entry 'ethiopic ?e) | |
291 | |
292 ;; Hebrew character set (ISO-8859-8) | |
293 | |
294 (modify-syntax-entry #x5be ".") ; MAQAF | |
295 (modify-syntax-entry #x5c0 ".") ; PASEQ | |
296 (modify-syntax-entry #x5c3 ".") ; SOF PASUQ | |
297 (modify-syntax-entry #x5f3 ".") ; GERESH | |
298 (modify-syntax-entry #x5f4 ".") ; GERSHAYIM | |
299 | |
300 ;; Indian character set (IS 13194 and other Emacs original Indian charsets) | |
301 | |
302 (modify-category-entry '(#x901 . #x970) ?i) | |
303 (map-charset-chars #'modify-category-entry 'indian-is13194 ?i) | |
304 (map-charset-chars #'modify-category-entry 'indian-2-column ?i) | |
33902 | 305 |
17078 | 306 ;; Lao character set |
307 | |
88612 | 308 (modify-category-entry '(#xe80 . #xeff) ?o) |
309 (map-charset-chars #'modify-category-entry 'lao ?o) | |
17078 | 310 |
88612 | 311 (let ((deflist '(("ກ-ຮ" "w" ?0) ; consonant |
88469 | 312 ("ະາຳຽເ-ໄ" "w" ?1) ; vowel base |
313 ("ັິ-ືົໍ" "w" ?2) ; vowel upper | |
314 ("ຸູ" "w" ?3) ; vowel lower | |
315 ("່-໋" "w" ?4) ; tone mark | |
316 ("ຼຽ" "w" ?9) ; semivowel lower | |
317 ("໐-໙" "w" ?6) ; digit | |
318 ("ຯໆ" "_" ?5) ; symbol | |
17078 | 319 )) |
320 elm chars len syntax category to ch i) | |
321 (while deflist | |
322 (setq elm (car deflist)) | |
323 (setq chars (car elm) | |
324 len (length chars) | |
325 syntax (nth 1 elm) | |
326 category (nth 2 elm) | |
327 i 0) | |
328 (while (< i len) | |
329 (if (= (aref chars i) ?-) | |
330 (setq i (1+ i) | |
20843 | 331 to (aref chars i)) |
332 (setq ch (aref chars i) | |
17078 | 333 to ch)) |
334 (while (<= ch to) | |
41874 | 335 (unless (string-equal syntax "w") |
336 (modify-syntax-entry ch syntax)) | |
17078 | 337 (modify-category-entry ch category) |
338 (setq ch (1+ ch))) | |
20843 | 339 (setq i (1+ i))) |
17078 | 340 (setq deflist (cdr deflist)))) |
341 | |
17052 | 342 ;; Thai character set (TIS620) |
343 | |
88612 | 344 (modify-category-entry '(#xe00 . #xe7f) ?t) |
345 (map-charset-chars #'modify-category-entry 'thai-tis620 ?t) | |
17052 | 346 |
347 (let ((deflist '(;; chars syntax category | |
88469 | 348 ("ก-รลว-ฮ" "w" ?0) ; consonant |
349 ("ฤฦะาำเ-ๅ" "w" ?1) ; vowel base | |
350 ("ัิ-ื็๎" "w" ?2) ; vowel upper | |
351 ("ุ-ฺ" "w" ?3) ; vowel lower | |
352 ("่-ํ" "w" ?4) ; tone mark | |
353 ("๐-๙" "w" ?6) ; digit | |
354 ("ฯๆ฿๏๚๛" "_" ?5) ; symbol | |
17052 | 355 )) |
356 elm chars len syntax category to ch i) | |
357 (while deflist | |
358 (setq elm (car deflist)) | |
359 (setq chars (car elm) | |
360 len (length chars) | |
361 syntax (nth 1 elm) | |
362 category (nth 2 elm) | |
363 i 0) | |
364 (while (< i len) | |
365 (if (= (aref chars i) ?-) | |
366 (setq i (1+ i) | |
20843 | 367 to (aref chars i)) |
368 (setq ch (aref chars i) | |
17052 | 369 to ch)) |
370 (while (<= ch to) | |
41874 | 371 (unless (string-equal syntax "w") |
372 (modify-syntax-entry ch syntax)) | |
17052 | 373 (modify-category-entry ch category) |
374 (setq ch (1+ ch))) | |
20843 | 375 (setq i (1+ i))) |
17052 | 376 (setq deflist (cdr deflist)))) |
377 | |
17753 | 378 ;; Tibetan character set |
379 | |
88612 | 380 (modify-category-entry '(#xf00 . #xfff) ?q) |
381 (map-charset-chars #'modify-category-entry 'tibetan ?q) | |
382 (map-charset-chars #'modify-category-entry 'tibetan-1-column ?q) | |
17753 | 383 |
384 (let ((deflist '(;; chars syntax category | |
88711 | 385 ("ཀ-ཀྵཪ" "w" ?0) ; consonant |
386 ("à¾-ྐྵྺྻྼö€ö€‚" "w" ?0) ; | |
88469 | 387 ("ö„š-ö…ž" "w" ?0) ; |
388 ("ö…¸-ö‡Œ" "w" ?0) ; | |
88711 | 389 ("ིེཻོཽྀ" "w" ?2) ; upper vowel |
390 ("ཾྂྃ྆྇ྈྉྊྋ" "w" ?2) ; upper modifier | |
391 ("༙ö‚Žà½±à½´à¾„༵༷" "w" ?3) ; lower vowel/modifier | |
392 ("༠-༩༪-༳" "w" ?6) ; digit | |
393 ("་à¼-༒༔ཿ" "." ?|) ; line-break char | |
394 ("་à¼à¼à¼à¼‘༔ཿ" "." ?|) ; | |
395 ("༈་à¼-༒༔ཿ༽༴" "." ?>) ; prohibition | |
396 ("་à¼à¼à¼à¼‘༔ཿ" "." ?>) ; | |
397 ("ༀ-༊༼à¿à¿‚྅" "." ?<) ; prohibition | |
398 ("༓༕-༘༚-༟༶༸-༻༾༿྾྿-à¿" "." ?q) ; others | |
17753 | 399 )) |
400 elm chars len syntax category to ch i) | |
401 (while deflist | |
402 (setq elm (car deflist)) | |
403 (setq chars (car elm) | |
404 len (length chars) | |
405 syntax (nth 1 elm) | |
406 category (nth 2 elm) | |
407 i 0) | |
408 (while (< i len) | |
409 (if (= (aref chars i) ?-) | |
410 (setq i (1+ i) | |
20843 | 411 to (aref chars i)) |
412 (setq ch (aref chars i) | |
17753 | 413 to ch)) |
414 (while (<= ch to) | |
41874 | 415 (unless (string-equal syntax "w") |
416 (modify-syntax-entry ch syntax)) | |
17753 | 417 (modify-category-entry ch category) |
418 (setq ch (1+ ch))) | |
20843 | 419 (setq i (1+ i))) |
17753 | 420 (setq deflist (cdr deflist)))) |
421 | |
17052 | 422 ;; Vietnamese character set |
423 | |
88612 | 424 ;; To make a word with Latin characters |
425 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-lower ?l) | |
426 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-lower ?v) | |
427 | |
428 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-upper ?l) | |
429 (map-charset-chars #'modify-category-entry 'vietnamese-viscii-upper ?v) | |
17052 | 430 |
27691
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
431 (let ((tbl (standard-case-table)) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
432 (i 32)) |
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
433 (while (< i 128) |
88711 | 434 (let* ((char (decode-char 'vietnamese-viscii-upper i)) |
435 (charl (decode-char 'vietnamese-viscii-lower i)) | |
436 (uc (encode-char char 'ucs)) | |
437 (lc (encode-char charl 'ucs))) | |
438 (set-case-syntax-pair char (decode-char 'vietnamese-viscii-lower i) | |
439 tbl) | |
440 (if uc (modify-category-entry uc ?v)) | |
441 (if lc (modify-category-entry lc ?v))) | |
27691
5ed8dd27c116
Setup case table for Vietnamese.
Kenichi Handa <handa@m17n.org>
parents:
26885
diff
changeset
|
442 (setq i (1+ i)))) |
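The Vietnamese block tags each character with two categories, `?l` (Latin) and `?v` (Vietnamese), so that word-boundary code treats a mixed run of Latin and Vietnamese letters as one word. A small sketch of how a character can carry several categories at once follows. It works on a scratch category table so the global one is untouched; `my-category-demo` and the category docstrings are made up for the illustration.

```elisp
(defun my-category-demo ()
  "Return the category mnemonics of U+1EAF under a scratch category table.
Hypothetical helper for illustration only."
  (let ((table (make-category-table)))
    ;; A category must be defined in a table before it can be assigned.
    (define-category ?l "Latin (demo)" table)
    (define-category ?v "Vietnamese (demo)" table)
    ;; U+1EAF is LATIN SMALL LETTER A WITH BREVE AND ACUTE.
    (modify-category-entry #x1EAF ?l table)
    (modify-category-entry #x1EAF ?v table)
    (with-category-table table
      ;; The result is a string with one mnemonic per category.
      (category-set-mnemonics (char-category-set #x1EAF)))))
```

Calling `(my-category-demo)` returns a string containing both mnemonics, which shows that the second `modify-category-entry` call adds to the set rather than replacing it.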


;; Latin

(modify-category-entry '(#x80 . #x024F) ?l)

(let ((tbl (standard-case-table)) c)

  ;; In some languages, U+0049 LATIN CAPITAL LETTER I and U+0131 LATIN
  ;; SMALL LETTER DOTLESS I make a case pair, and so do U+0130 LATIN
  ;; CAPITAL LETTER I WITH DOT ABOVE and U+0069 LATIN SMALL LETTER I.
  ;; See the Turkish language environment.
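The comment above explains why the standard case table leaves U+0130 and U+0131 unpaired: the correct pairing depends on the language. As a hedged sketch of what a Turkish-style override could look like, the function below builds a private copy of the standard case table and registers the two pairs named in the comment. `my-turkish-case-demo` is a hypothetical name, and this is not claimed to be how the Turkish language environment is implemented.

```elisp
(require 'case-table)

(defun my-turkish-case-demo ()
  "Return (UPCASED-i DOWNCASED-I) under a Turkish-style private case table.
Hypothetical helper for illustration only."
  (let ((tbl (copy-case-table (standard-case-table))))
    ;; U+0130 (capital I with dot above) pairs with plain i;
    ;; plain I pairs with U+0131 (dotless i).
    ;; Note: `set-case-syntax-pair' also gives both characters word
    ;; syntax in the standard syntax table, which they already have.
    (set-case-syntax-pair #x0130 ?i tbl)
    (set-case-syntax-pair ?I #x0131 tbl)
    ;; The override is active only inside this form.
    (with-case-table tbl
      (list (upcase "i") (downcase "I")))))
```

Because the pairs live in a copy, `upcase` and `downcase` behave as usual outside the `with-case-table` form.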

  ;; Latin-1

  ;; Fixme: Some of the non-word syntaxes here perhaps should be
  ;; reviewed.  (Note that the following all implicitly have word
  ;; syntax: ¢£¤¥¨ª¯²³´¶¸¹º.)  There should be a well-defined way of
  ;; relating Unicode categories to Emacs syntax codes.
  (set-case-syntax ?\u00A0 " " tbl)     ; NO-BREAK SPACE; dubious
  (set-case-syntax ?¡ "." tbl)
  (set-case-syntax ?¦ "_" tbl)
  (set-case-syntax ?§ "." tbl)
  (set-case-syntax ?© "_" tbl)
  (set-case-syntax-delims 171 187 tbl)  ; « »
  (set-case-syntax ?¬ "_" tbl)
  (set-case-syntax ?\u00AD "_" tbl)     ; SOFT HYPHEN
  (set-case-syntax ?® "_" tbl)
  (set-case-syntax ?° "_" tbl)
  (set-case-syntax ?± "_" tbl)
  (set-case-syntax ?µ "_" tbl)
  (set-case-syntax ?· "_" tbl)
  (set-case-syntax ?¼ "_" tbl)
  (set-case-syntax ?½ "_" tbl)
  (set-case-syntax ?¾ "_" tbl)
  (set-case-syntax ?¿ "." tbl)
  (let ((c 192))
    (while (<= c 222)
      (set-case-syntax-pair c (+ c 32) tbl)
      (setq c (1+ c))))
  (set-case-syntax ?× "_" tbl)
  (set-case-syntax ?ß "w" tbl)
  (set-case-syntax ?÷ "_" tbl)
  ;; See below for ÿ.
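The one-character strings passed to `set-case-syntax` above are syntax descriptors: `" "` is whitespace, `"."` punctuation, `"_"` symbol constituent and `"w"` word constituent. The sketch below shows the same descriptors with `modify-syntax-entry` on a scratch syntax table, which avoids touching the standard tables. `my-syntax-class-demo` is a hypothetical name; note that `set-case-syntax` itself does more than this, since it also records the character as caseless in the case table.

```elisp
(defun my-syntax-class-demo ()
  "Return the syntax classes of ¿, © and ß as a three-character string.
Hypothetical helper for illustration only."
  (let ((st (make-syntax-table)))      ; inherits from the standard table
    (modify-syntax-entry ?¿ "." st)    ; punctuation
    (modify-syntax-entry ?© "_" st)    ; symbol constituent
    (modify-syntax-entry ?ß "w" st)    ; word constituent
    (with-syntax-table st
      ;; `char-syntax' returns the class designator character.
      (string (char-syntax ?¿) (char-syntax ?©) (char-syntax ?ß)))))
```

The returned string contains one class designator per character, in the order queried.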

  ;; Latin Extended-A, Latin Extended-B
  (setq c #x0100)
  (while (<= c #x0233)
    (and (or (<= c #x012e)
             (and (>= c #x014a) (<= c #x0177)))
         (zerop (% c 2))
         (set-case-syntax-pair c (1+ c) tbl))
    (and (>= c #x013a)
         (<= c #x0148)
         (zerop (% c 2))
         (set-case-syntax-pair (1- c) c tbl))
    (setq c (1+ c)))
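The loop above relies on a parity rule: in most of Latin Extended-A the capital letter sits on an even code point and its small letter on the next odd one, while in U+0139..U+0148 the parity is reversed. As a rough sketch, the function below applies the same tests but returns the resulting pairs instead of writing them to a case table, which makes the rule easy to inspect. `my-latin-ext-a-pairs` is a hypothetical name.

```elisp
(defun my-latin-ext-a-pairs ()
  "Return the (UPPER . LOWER) pairs the parity rule yields up to U+0177.
Hypothetical helper for illustration only."
  (let ((c #x0100)
        pairs)
    (while (<= c #x0177)
      (cond
       ;; Even code point is the capital, the next odd one the small letter.
       ((and (or (<= c #x012e)
                 (and (>= c #x014a) (<= c #x0177)))
             (zerop (% c 2)))
        (push (cons c (1+ c)) pairs))
       ;; Reversed parity: the odd code point before C is the capital.
       ((and (>= c #x013a) (<= c #x0148) (zerop (% c 2)))
        (push (cons (1- c) c) pairs)))
      (setq c (1+ c)))
    (nreverse pairs)))
```

Neither test matches U+0130 or U+0131, so the dotted and dotless I stay unpaired, and the letters in U+0132..U+0137 are also skipped and need explicit pairs.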
  (set-case-syntax-pair ?Ĳ ?ĳ tbl)
  (set-case-syntax-pair ?Ĵ ?ĵ tbl)
  (set-case-syntax-pair ?Ķ ?ķ tbl)
  (set-case-syntax-pair ?Ÿ ?ÿ tbl)
  (set-case-syntax-pair ?Ź ?ź tbl)
  (set-case-syntax-pair ?Ż ?ż tbl)
  (set-case-syntax-pair ?Ž ?ž tbl)

  ;; Latin Extended-B
  (set-case-syntax-pair ?Ɓ ?ɓ tbl)
  (set-case-syntax-pair ?Ƃ ?ƃ tbl)
  (set-case-syntax-pair ?Ƅ ?ƅ tbl)
  (set-case-syntax-pair ?Ɔ ?ɔ tbl)
  (set-case-syntax-pair ?Ƈ ?ƈ tbl)
  (set-case-syntax-pair ?Ɖ ?ɖ tbl)
  (set-case-syntax-pair ?Ɗ ?ɗ tbl)
  (set-case-syntax-pair ?Ƌ ?ƌ tbl)
  (set-case-syntax-pair ?Ǝ ?ǝ tbl)
  (set-case-syntax-pair ?Ə ?ə tbl)
  (set-case-syntax-pair ?Ɛ ?ɛ tbl)
  (set-case-syntax-pair ?Ƒ ?ƒ tbl)
  (set-case-syntax-pair ?Ɠ ?ɠ tbl)
  (set-case-syntax-pair ?Ɣ ?ɣ tbl)
  (set-case-syntax-pair ?Ɩ ?ɩ tbl)
  (set-case-syntax-pair ?Ɨ ?ɨ tbl)
  (set-case-syntax-pair ?Ƙ ?ƙ tbl)
  (set-case-syntax-pair ?Ɯ ?ɯ tbl)
  (set-case-syntax-pair ?Ɲ ?ɲ tbl)
  (set-case-syntax-pair ?Ɵ ?ɵ tbl)
  (set-case-syntax-pair ?Ơ ?ơ tbl)
  (set-case-syntax-pair ?Ƣ ?ƣ tbl)
  (set-case-syntax-pair ?Ƥ ?ƥ tbl)
  (set-case-syntax-pair ?Ʀ ?ʀ tbl)
  (set-case-syntax-pair ?Ƨ ?ƨ tbl)
  (set-case-syntax-pair ?Ʃ ?ʃ tbl)
  (set-case-syntax-pair ?Ƭ ?ƭ tbl)
  (set-case-syntax-pair ?Ʈ ?ʈ tbl)
  (set-case-syntax-pair ?Ư ?ư tbl)
  (set-case-syntax-pair ?Ʊ ?ʊ tbl)
  (set-case-syntax-pair ?Ʋ ?ʋ tbl)
  (set-case-syntax-pair ?Ƴ ?ƴ tbl)
  (set-case-syntax-pair ?Ƶ ?ƶ tbl)
  (set-case-syntax-pair ?Ʒ ?ʒ tbl)
  (set-case-syntax-pair ?Ƹ ?ƹ tbl)
  (set-case-syntax-pair ?Ƽ ?ƽ tbl)
  (set-case-syntax-pair ?Ǆ ?ǆ tbl)
  (set-case-syntax-pair ?ǅ ?ǆ tbl)
  (set-case-syntax-pair ?Ǉ ?ǉ tbl)
  (set-case-syntax-pair ?ǈ ?ǉ tbl)
  (set-case-syntax-pair ?Ǌ ?ǌ tbl)
  (set-case-syntax-pair ?ǋ ?ǌ tbl)
  (set-case-syntax-pair ?Ǎ ?ǎ tbl)
  (set-case-syntax-pair ?Ǐ ?ǐ tbl)
  (set-case-syntax-pair ?Ǒ ?ǒ tbl)
  (set-case-syntax-pair ?Ǔ ?ǔ tbl)
  (set-case-syntax-pair ?Ǖ ?ǖ tbl)
  (set-case-syntax-pair ?Ǘ ?ǘ tbl)
  (set-case-syntax-pair ?Ǚ ?ǚ tbl)
  (set-case-syntax-pair ?Ǜ ?ǜ tbl)
  (set-case-syntax-pair ?Ǟ ?ǟ tbl)
  (set-case-syntax-pair ?Ǡ ?ǡ tbl)
  (set-case-syntax-pair ?Ǣ ?ǣ tbl)
  (set-case-syntax-pair ?Ǥ ?ǥ tbl)
  (set-case-syntax-pair ?Ǧ ?ǧ tbl)
  (set-case-syntax-pair ?Ǩ ?ǩ tbl)
  (set-case-syntax-pair ?Ǫ ?ǫ tbl)
  (set-case-syntax-pair ?Ǭ ?ǭ tbl)
  (set-case-syntax-pair ?Ǯ ?ǯ tbl)
  ;; 01F0; F; 006A 030C; # LATIN SMALL LETTER J WITH CARON
  (set-case-syntax-pair ?Ǳ ?ǳ tbl)
  (set-case-syntax-pair ?ǲ ?ǳ tbl)
  (set-case-syntax-pair ?Ǵ ?ǵ tbl)
  (set-case-syntax-pair ?Ƕ ?ƕ tbl)
  (set-case-syntax-pair ?Ƿ ?ƿ tbl)
  (set-case-syntax-pair ?Ǹ ?ǹ tbl)
  (set-case-syntax-pair ?Ǻ ?ǻ tbl)
  (set-case-syntax-pair ?Ǽ ?ǽ tbl)
  (set-case-syntax-pair ?Ǿ ?ǿ tbl)
  (set-case-syntax-pair ?Ȁ ?ȁ tbl)
  (set-case-syntax-pair ?Ȃ ?ȃ tbl)
  (set-case-syntax-pair ?Ȅ ?ȅ tbl)
  (set-case-syntax-pair ?Ȇ ?ȇ tbl)
  (set-case-syntax-pair ?Ȉ ?ȉ tbl)
  (set-case-syntax-pair ?Ȋ ?ȋ tbl)
  (set-case-syntax-pair ?Ȍ ?ȍ tbl)
  (set-case-syntax-pair ?Ȏ ?ȏ tbl)
  (set-case-syntax-pair ?Ȑ ?ȑ tbl)
  (set-case-syntax-pair ?Ȓ ?ȓ tbl)
  (set-case-syntax-pair ?Ȕ ?ȕ tbl)
  (set-case-syntax-pair ?Ȗ ?ȗ tbl)
  (set-case-syntax-pair ?Ș ?ș tbl)
  (set-case-syntax-pair ?Ț ?ț tbl)
  (set-case-syntax-pair ?Ȝ ?ȝ tbl)
  (set-case-syntax-pair ?Ȟ ?ȟ tbl)
  (set-case-syntax-pair ?Ȣ ?ȣ tbl)
  (set-case-syntax-pair ?Ȥ ?ȥ tbl)
  (set-case-syntax-pair ?Ȧ ?ȧ tbl)
  (set-case-syntax-pair ?Ȩ ?ȩ tbl)
  (set-case-syntax-pair ?Ȫ ?ȫ tbl)
  (set-case-syntax-pair ?Ȭ ?ȭ tbl)
  (set-case-syntax-pair ?Ȯ ?ȯ tbl)
  (set-case-syntax-pair ?Ȱ ?ȱ tbl)
  (set-case-syntax-pair ?Ȳ ?ȳ tbl)

  ;; Latin Extended Additional
  (modify-category-entry '(#x1e00 . #x1ef9) ?l)
  (setq c #x1e00)
  (while (<= c #x1ef9)
    (and (zerop (% c 2))
         (or (<= c #x1e94) (>= c #x1ea0))
         (set-case-syntax-pair c (1+ c) tbl))
    (setq c (1+ c)))

  ;; Greek
  (modify-category-entry '(#x0370 . #x03ff) ?g)
  (setq c #x0370)
  (while (<= c #x03ff)
    (if (or (and (>= c #x0391) (<= c #x03a1))
            (and (>= c #x03a3) (<= c #x03ab)))
        (set-case-syntax-pair c (+ c 32) tbl))
    (and (>= c #x03da)
         (<= c #x03ee)
         (zerop (% c 2))
         (set-case-syntax-pair c (1+ c) tbl))
    (setq c (1+ c)))
  (set-case-syntax-pair ?Ά ?ά tbl)
  (set-case-syntax-pair ?Έ ?έ tbl)
  (set-case-syntax-pair ?Ή ?ή tbl)
  (set-case-syntax-pair ?Ί ?ί tbl)
  (set-case-syntax-pair ?Ό ?ό tbl)
  (set-case-syntax-pair ?Ύ ?ύ tbl)
  (set-case-syntax-pair ?Ώ ?ώ tbl)

  ;; Armenian
  (setq c #x531)
  (while (<= c #x556)
    (set-case-syntax-pair c (+ c #x30) tbl)
    (setq c (1+ c)))

  ;; Greek Extended
  (modify-category-entry '(#x1f00 . #x1fff) ?g)
  (setq c #x1f00)
  (while (<= c #x1fff)
    (and (<= (logand c #x000f) 7)
         (<= c #x1fa7)
         (not (memq c '(#x1f50 #x1f52 #x1f54 #x1f56)))
         ;; Skip the U+1F70 row, whose letters have no capital at +8.
         (/= (logand c #x00f0) #x70)
         (set-case-syntax-pair (+ c 8) c tbl))
    (setq c (1+ c)))
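In the regular part of the Greek Extended block each 16-character row holds eight small letters followed by their eight capitals, so the capital is the small letter plus 8. The sketch below applies that rule and returns the pairs instead of writing them to a case table. It assumes the final mask test is meant to skip the U+1F70 row, that is, a comparison against `#x70`; `my-greek-ext-pairs` is a hypothetical name, and like the loop it does not check whether every code point is actually assigned.

```elisp
(defun my-greek-ext-pairs ()
  "Return (UPPER . LOWER) pairs for the regular rows of Greek Extended.
Hypothetical helper for illustration only."
  (let ((c #x1f00)
        pairs)
    (while (<= c #x1fa7)
      (and (<= (logand c #x000f) 7)                      ; small-letter half of a row
           (not (memq c '(#x1f50 #x1f52 #x1f54 #x1f56))) ; no capital at +8
           (/= (logand c #x00f0) #x70)                   ; assumed: skip the U+1F70 row
           (push (cons (+ c 8) c) pairs))
      (setq c (1+ c)))
    (nreverse pairs)))
```

The letters in the skipped row, and the irregular ones from U+1FB0 upward, are the ones that need explicit `set-case-syntax-pair` calls.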
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
649 (set-case-syntax-pair ?Ᾰ ?ᾰ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
650 (set-case-syntax-pair ?á¾¹ ?á¾± tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
651 (set-case-syntax-pair ?Ὰ ?ὰ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
652 (set-case-syntax-pair ?á¾» ?á½± tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
653 (set-case-syntax-pair ?á¾¼ ?á¾³ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
654 (set-case-syntax-pair ?Ὲ ?ὲ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
655 (set-case-syntax-pair ?Έ ?έ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
656 (set-case-syntax-pair ?á¿Š ?á½´ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
657 (set-case-syntax-pair ?á¿‹ ?á½µ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
658 (set-case-syntax-pair ?ῌ ?ῃ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
659 (set-case-syntax-pair ?Ῐ ?ῐ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
660 (set-case-syntax-pair ?Ῑ ?ῑ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
661 (set-case-syntax-pair ?Ὶ ?ὶ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
662 (set-case-syntax-pair ?Ί ?ί tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
663 (set-case-syntax-pair ?Ῠ ?ῠ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
664 (set-case-syntax-pair ?Ῡ ?ῡ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
665 (set-case-syntax-pair ?Ὺ ?ὺ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
666 (set-case-syntax-pair ?Ύ ?ύ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
667 (set-case-syntax-pair ?Ῥ ?ῥ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
668 (set-case-syntax-pair ?Ὸ ?ὸ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
669 (set-case-syntax-pair ?Ό ?ό tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
670 (set-case-syntax-pair ?Ὼ ?ὼ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
671 (set-case-syntax-pair ?Ώ ?ώ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
672 (set-case-syntax-pair ?ῼ ?ῳ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
673 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
674 ;; cyrillic |
88612 | 675 (modify-category-entry '(#x0400 . #x04FF) ?y) |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
676 (setq c #x0400) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
677 (while (<= c #x04ff) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
678 (and (>= c #x0400) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
679 (<= c #x040f) |
88612 | 680 (set-case-syntax-pair c (+ c 80) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
681 (and (>= c #x0410) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
682 (<= c #x042f) |
88612 | 683 (set-case-syntax-pair c (+ c 32) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
684 (and (zerop (% c 2)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
685 (or (and (>= c #x0460) (<= c #x0480)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
686 (and (>= c #x048c) (<= c #x04be)) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
687 (and (>= c #x04d0) (<= c #x04f4))) |
88612 | 688 (set-case-syntax-pair c (1+ c) tbl)) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
689 (setq c (1+ c))) |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
690 (set-case-syntax-pair ?Ӂ ?ӂ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
691 (set-case-syntax-pair ?Ӄ ?ӄ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
692 (set-case-syntax-pair ?Ӈ ?ӈ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
693 (set-case-syntax-pair ?Ӌ ?ӌ tbl) |
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
694 (set-case-syntax-pair ?Ӹ ?ӹ tbl) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
695 |
34507
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
696 ;; general punctuation |
ed8ea5cf198b
Fix cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
33902
diff
changeset
|
697 (setq c #x2000) |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
698 (while (<= c #x200b) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
699 (set-case-syntax c " " tbl) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
700 (setq c (1+ c))) |
89119 | 701 (while (<= c #x200F) |
702 (set-case-syntax c "." tbl) | |
703 (setq c (1+ c))) | |
704 ;; Fixme: These aren't all right: | |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
705 (while (<= c #x2027) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
706 (set-case-syntax c "_" tbl) |
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
707 (setq c (1+ c))) |
89119 | 708 (while (<= c #x206F) |
709 (set-case-syntax c "." tbl) | |
710 (setq c (1+ c))) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
711 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
712 ;; Roman numerals |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
713 (setq c #x2160) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
714 (while (<= c #x216f) |
88612 | 715 (set-case-syntax-pair c (+ c #x10) tbl) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
716 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
717 |
89146 | 718 ;; Fixme: The following blocks might be better as symbol rather than |
719 ;; punctuation. | |
89119 | 720 ;; Arrows |
721 (setq c #x2190) | |
89121 | 722 (while (<= c #x21FF) |
723 (set-case-syntax c "." tbl) | |
89119 | 724 (setq c (1+ c))) |
725 ;; Mathematical Operators | |
726 (while (<= c #x22FF) | |
89121 | 727 (set-case-syntax c "." tbl) |
89119 | 728 (setq c (1+ c))) |
729 ;; Miscellaneous Technical | |
730 (while (<= c #x23FF) | |
89121 | 731 (set-case-syntax c "." tbl) |
89119 | 732 (setq c (1+ c))) |
733 ;; Control Pictures | |
734 (while (<= c #x243F) | |
89121 | 735 (set-case-syntax c "_" tbl) |
736 (setq c (1+ c))) | |
89119 | 737 |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
738 ;; Circled Latin |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
739 (setq c #x24b6) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
740 (while (<= c #x24cf) |
88612 | 741 (set-case-syntax-pair c (+ c 26) tbl) |
742 (modify-category-entry c ?l) | |
743 (modify-category-entry (+ c 26) ?l) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
744 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
745 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
746 ;; Fullwidth Latin |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
747 (setq c #xff21) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
748 (while (<= c #xff3a) |
88612 | 749 (set-case-syntax-pair c (+ c #x20) tbl) |
750 (modify-category-entry c ?l) | |
751 (modify-category-entry (+ c #x20) ?l) | |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
752 (setq c (1+ c))) |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
753 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
754 ;; Combining diacritics |
88612 | 755 (modify-category-entry '(#x300 . #x362) ?^) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
756 ;; Combining marks |
88612 | 757 (modify-category-entry '(#x20d0 . #x20e3) ?^) |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
758 |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
759 ;; Fixme: syntax for symbols &c |
33902
aacccd8ad977
Specify cases and syntaxes for mule-unicode-0100-24ff.
Kenichi Handa <handa@m17n.org>
parents:
31664
diff
changeset
|
760 ) |
17052 | 761 |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
762 ;; For each character set, record the most suitable coding system |
24782
8f65cd6729b5
Typo fixed; prefered->preferred.
Kenichi Handa <handa@m17n.org>
parents:
24256
diff
changeset
|
763 ;; for encoding it in the `preferred-coding-system' property. |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
764 |
88612 | 765 ;; Fixme: should this be junked? |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
766 (let ((l '((latin-iso8859-1 . iso-latin-1) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
767 (latin-iso8859-2 . iso-latin-2) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
768 (latin-iso8859-3 . iso-latin-3) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
769 (latin-iso8859-4 . iso-latin-4) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
770 (thai-tis620 . thai-tis620) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
771 (greek-iso8859-7 . greek-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
772 (arabic-iso8859-6 . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
773 (hebrew-iso8859-8 . hebrew-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
774 (katakana-jisx0201 . japanese-shift-jis) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
775 (latin-jisx0201 . japanese-shift-jis) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
776 (cyrillic-iso8859-5 . cyrillic-iso-8bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
777 (latin-iso8859-9 . iso-latin-5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
778 (japanese-jisx0208-1978 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
779 (chinese-gb2312 . cn-gb-2312) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
780 (japanese-jisx0208 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
781 (korean-ksc5601 . iso-2022-kr) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
782 (japanese-jisx0212 . iso-2022-jp) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
783 (chinese-cns11643-1 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
784 (chinese-cns11643-2 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
785 (chinese-big5-1 . chinese-big5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
786 (chinese-big5-2 . chinese-big5) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
787 (chinese-sisheng . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
788 (ipa . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
789 (vietnamese-viscii-lower . vietnamese-viscii) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
790 (vietnamese-viscii-upper . vietnamese-viscii) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
791 (arabic-digit . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
792 (arabic-1-column . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
793 (lao . lao) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
794 (arabic-2-column . iso-2022-7bit) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
795 (indian-is13194 . devanagari) |
41466
b364f798b789
Adjusted for the change of indian-1-column -> indian-glyph.
Richard M. Stallman <rms@gnu.org>
parents:
38414
diff
changeset
|
796 (indian-glyph . devanagari) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
797 (tibetan-1-column . tibetan) |
24256
47e6aa168a66
Let ethiopic use iso-2022-7bit.
Kenichi Handa <handa@m17n.org>
parents:
24208
diff
changeset
|
798 (ethiopic . iso-2022-7bit) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
799 (chinese-cns11643-3 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
800 (chinese-cns11643-4 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
801 (chinese-cns11643-5 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
802 (chinese-cns11643-6 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
803 (chinese-cns11643-7 . iso-2022-cn) |
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
804 (indian-2-column . devanagari) |
26013 | 805 (tibetan . tibetan) |
806 (latin-iso8859-14 . iso-latin-8) | |
807 (latin-iso8859-15 . iso-latin-9)))) | |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
808 (while l |
24782
8f65cd6729b5
Typo fixed; prefered->preferred.
Kenichi Handa <handa@m17n.org>
parents:
24256
diff
changeset
|
809 (put-charset-property (car (car l)) 'preferred-coding-system (cdr (car l))) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
810 (setq l (cdr l)))) |
23493 | 811 |
812 | |
23527
06e8c3b9deec
Setup charset property nospace-between-words.
Kenichi Handa <handa@m17n.org>
parents:
23493
diff
changeset
|
813 ;; Set up auto-fill-chars for charsets that should invoke auto-filling. |
41874
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
814 ;; SPACE and NEWLINE are already set. Also put `nospace-between-words' |
1b93abfcbd87
Don't set word syntax (the default)
Dave Love <fx@gnu.org>
parents:
41466
diff
changeset
|
815 ;; property on the charsets. |
23493 | 816 (let ((l '(katakana-jisx0201 |
817 japanese-jisx0208 japanese-jisx0212 | |
818 chinese-gb2312 chinese-big5-1 chinese-big5-2))) | |
819 (while l | |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
820 ;;(aset auto-fill-chars (make-char (car l)) t) |
23527
06e8c3b9deec
Setup charset property nospace-between-words.
Kenichi Handa <handa@m17n.org>
parents:
23493
diff
changeset
|
821 (put-charset-property (car l) 'nospace-between-words t) |
23493 | 822 (setq l (cdr l)))) |
88403
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
823 |
3c6459e2914f
Adjusted for the deletion of generic characters.
Kenichi Handa <handa@m17n.org>
parents:
41992
diff
changeset
|
824 |
88492
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
825 ;; CJK double width characters. |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
826 (let ((l '((#x1100 . #x11FF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
827 (#x2E80 . #x9FAF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
828 (#xAC00 . #xD7AF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
829 (#xF900 . #xFAFF) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
830 (#xFE30 . #xFE4F) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
831 (#xFF00 . #xFF5F) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
832 (#xFFE0 . #xFFEF)))) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
833 (dolist (elt l) |
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
834 (set-char-table-range char-width-table |
88612 | 835 (cons (car elt) (cdr elt)) |
88492
2ce720ffa865
Setup char-width-table for CJK characters.
Kenichi Handa <handa@m17n.org>
parents:
88469
diff
changeset
|
836 2))) |
89117 | 837 ;; Fixme: Doing this affects non-CJK characters through unification, |
838 ;; but presumably CJK users expect those characters to be | |
839 ;; double-width when using these charsets. | |
840 ;; (map-charset-chars | |
841 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
842 ;; 'japanese-jisx0208) | |
843 ;; (map-charset-chars | |
844 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
845 ;; 'japanese-jisx0212) | |
846 ;; (map-charset-chars | |
847 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
848 ;; 'japanese-jisx0213-1) | |
849 ;; (map-charset-chars | |
850 ;; #'(lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
851 ;; 'japanese-jisx0213-2) | |
852 ;; (map-charset-chars | |
853 ;; (lambda (range ignore) (set-char-table-range char-width-table range 2)) | |
854 ;; 'korean-ksc5601) | |
88642
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
855 |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
856 ;; Other double width |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
857 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
858 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
859 'ethiopic) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
860 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
861 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
862 'tibetan) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
863 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
864 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
865 'indian-2-column) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
866 (map-charset-chars |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
867 (lambda (range ignore) (set-char-table-range char-width-table range 2)) |
3a7ca837e9b9
Additional double width specifications.
Dave Love <fx@gnu.org>
parents:
88612
diff
changeset
|
868 'arabic-2-column) |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
869 |
88723 | 870 (optimize-char-table (standard-case-table)) |
871 (optimize-char-table char-width-table) | |
872 (optimize-char-table (standard-category-table)) | |
873 (optimize-char-table (standard-syntax-table)) | |
874 | |
89119 | 875 ;; The Unicode blocks actually extend past some of these ranges with |
876 ;; undefined codepoints. | |
88892 | 877 (let ((script-list nil)) |
878 (dolist | |
879 (elt | |
89008
3e08629c1469
(script-list): Treat ascii characters as `latin' script.
Kenichi Handa <handa@m17n.org>
parents:
88976
diff
changeset
|
880 '((#x0000 #x007F latin) |
88892 | 881 (#x00A0 #x036F latin) |
882 (#x0370 #x03E1 greek) | |
883 (#x03E2 #x03EF coptic) | |
884 (#x03F0 #x03F3 greek) | |
885 (#x0400 #x04FF cyrillic) | |
886 (#x0530 #x058F armenian) | |
887 (#x0590 #x05FF hebrew) | |
888 (#x0600 #x06FF arabic) | |
889 (#x0700 #x074F syriac) | |
890 (#x0780 #x07BF thaana) | |
891 (#x0900 #x097F devanagari) | |
892 (#x0980 #x09FF bengali) | |
893 (#x0A00 #x0A7F gurmukhi) | |
894 (#x0A80 #x0AFF gujarati) | |
895 (#x0B00 #x0B7F oriya) | |
896 (#x0B80 #x0BFF tamil) | |
897 (#x0C00 #x0C7F telugu) | |
898 (#x0C80 #x0CFF kannada) | |
899 (#x0D00 #x0D7F malayalam) | |
900 (#x0D80 #x0DFF sinhala) | |
901 (#x0E00 #x0E5F thai) | |
902 (#x0E80 #x0EDF lao) | |
903 (#x0F00 #x0FFF tibetan) | |
904 (#x1000 #x105F myanmar) | |
905 (#x10A0 #x10FF georgian) | |
906 (#x1100 #x11FF hangul) | |
907 (#x1200 #x137F ethiopic) | |
908 (#x13A0 #x13FF cherokee) | |
909 (#x1400 #x167F canadian-aboriginal) | |
910 (#x1680 #x169F ogham) | |
911 (#x16A0 #x16FF runic) | |
912 (#x1780 #x17FF khmer) | |
913 (#x1800 #x18AF mongolian) | |
914 (#x1E00 #x1EFF latin) | |
915 (#x1F00 #x1FFF greek) | |
916 (#x20A0 #x20AF currency) | |
917 (#x2800 #x28FF braille) | |
918 (#x2E80 #x2FDF han) | |
919 (#x2FF0 #x2FFF ideographic-description) | |
920 (#x3000 #x303F cjk-misc) | |
921 (#x3040 #x30FF kana) | |
922 (#x3100 #x312F bopomofo) | |
923 (#x3130 #x318F hangul) | |
924 (#x3190 #x319F kanbun) | |
925 (#x31A0 #x31BF bopomofo) | |
926 (#x3400 #x9FAF han) | |
927 (#xA000 #xA4CF yi) | |
928 (#xAC00 #xD7AF hangul) | |
929 (#xF900 #xFA5F han) | |
930 (#xFB1D #xFB4F hebrew) | |
931 (#xFB50 #xFDFF arabic) | |
932 (#xFE70 #xFEFC arabic) | |
933 (#xFF00 #xFF5F cjk-misc) | |
934 (#xFF61 #xFF9F kana) | |
935 (#xFFE0 #xFFE6 cjk-misc) | |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
936 (#x20000 #x2AFFF han) |
88892 | 937 (#x2F800 #x2FFFF han))) |
938 (set-char-table-range char-script-table | |
939 (cons (car elt) (nth 1 elt)) (nth 2 elt)) | |
940 (or (memq (nth 2 elt) script-list) | |
941 (setq script-list (cons (nth 2 elt) script-list)))) | |
942 (set-char-table-extra-slot char-script-table 0 (nreverse script-list))) | |
943 | |
89291
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
944 (map-charset-chars |
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
945 #'(lambda (range ignore) |
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
946 (set-char-table-range char-script-table range 'tibetan)) |
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
947 'tibetan) |
b73d29dcc8c0
Make all chararacters in the charset tibetan to tibetan script.
Kenichi Handa <handa@m17n.org>
parents:
89188
diff
changeset
|
948 |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
949 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
950 ;;; Setting word boundary. |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
951 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
952 (defun next-word-boundary-han (pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
953 (if (<= pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
954 (save-excursion |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
955 (goto-char pos) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
956 (looking-at "\\cC+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
957 (goto-char (match-end 0)) |
88976
fc96b539a43c
(next-word-boundary-han): Don't
Kenichi Handa <handa@m17n.org>
parents:
88971
diff
changeset
|
958 (if (looking-at "\\cH+") |
88971
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
959 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
960 (point)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
961 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
962 (eq (aref char-script-table (char-after (1- pos))) 'han)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
963 (setq pos (1- pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
964 pos)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
965 |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
966 (defun next-word-boundary-kana (pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
967 (if (<= pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
968 (save-excursion |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
969 (goto-char pos) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
970 (if (looking-at "\\cK+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
971 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
972 (if (looking-at "\\cH+") |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
973 (goto-char (match-end 0))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
974 (point)) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
975 (let ((category-set (char-category-set (char-after pos))) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
976 category) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
977 (if (aref category-set ?K) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
978 (while (and (> pos limit) |
651b64e6dce3
Call map-charset-chars on big5
Kenichi Handa <handa@m17n.org>
parents:
88892
diff
changeset
|
979 (aref (char-category-set (char-after (1- pos))) ?K)) |
980 (setq pos (1- pos))) |
981 (while (and (> pos limit) |
982 (aref (setq category-set |
983 (char-category-set (char-after (1- pos)))) ?H)) |
984 (setq pos (1- pos))) |
985 (setq category (cond ((aref category-set ?C) ?C) |
986 ((aref category-set ?K) ?K) |
987 ((aref category-set ?A) ?A))) |
988 (when category |
989 (setq pos (1- pos)) |
990 (while (and (> pos limit) |
991 (aref (char-category-set (char-after (1- pos))) |
992 category)) |
993 (setq pos (1- pos))))) |
994 pos))) |
995 |
996 (map-char-table |
997 #'(lambda (char script) |
998 (cond ((eq script 'han) |
999 (set-char-table-range next-word-boundary-function-table |
1000 char #'next-word-boundary-han)) |
1001 ((eq script 'kana) |
1002 (set-char-table-range next-word-boundary-function-table |
1003 char #'next-word-boundary-kana)))) |
1004 char-script-table) |
1005 |
1006 (setq word-combining-categories |
1007 '((?l . ?l))) |
1008 |
1009 (setq word-separating-categories ; (2-byte character sets) |
1010 '((?A . ?K) ; Alpha numeric - Katakana |
1011 (?A . ?C) ; Alpha numeric - Chinese |
1012 (?H . ?A) ; Hiragana - Alpha numeric |
1013 (?H . ?K) ; Hiragana - Katakana |
1014 (?H . ?C) ; Hiragana - Chinese |
1015 (?K . ?A) ; Katakana - Alpha numeric |
1016 (?K . ?C) ; Katakana - Chinese |
1017 (?C . ?A) ; Chinese - Alpha numeric |
1018 (?C . ?K) ; Chinese - Katakana |
1019 )) |
1020 |
20734
179dbdbc3882
Put `prefered-coding-system'
Kenichi Handa <handa@m17n.org>
parents:
20247
diff
changeset
|
1021 ;;; Local Variables: |
88469
dad99fefc4b3
Change encoding to utf-8-emacs. Remove apparent duplicate codes.
Kenichi Handa <handa@m17n.org>
parents:
88425
diff
changeset
|
1022 ;;; coding: utf-8-emacs |
20734
|
1023 ;;; End: |
1024 |
38414
67b464da13ec
Some fixes to follow coding conventions.
Pavel Janík <Pavel@Janik.cz>
parents:
36688
diff
changeset
|
1025 ;;; characters.el ends here |